16S rRNA gene sequences of shipworm bacterial symbiont species
Data files
May 27, 2026 version files 177.30 KB
-
NOAA_and_Moore_Isolates_with_16S_sequences.xlsx
175.17 KB
-
README.md
2.13 KB
May 29, 2026 version files 177.27 KB
-
16S_sequences.xlsx
175.17 KB
-
README.md
2.10 KB
Abstract
Shipworms perform a critical role in marine environments by recycling the energy in wood. The bacterial symbionts inside shipworms produce the enzymes needed to digest the lignocellulose in wood. A few symbionts have been grown in pure culture, but most remain poorly characterized. Little is known about the evolutionary relationships between symbiont species. Here, 16S rRNA gene sequences are listed for comparison between bacterial symbionts or free-living bacteria taken from either publicly available genomes or assembled MAGs. Comparisons to these sequences could help reveal the identity or phylogeny of shipworm symbiont bacteria in future sequencing experiments.
Dataset DOI: 10.5061/dryad.tx95x6bcs
Principal Investigator Contact Information
Name: Daniel L. Distel
Institution: Northeastern University
Email: d.distel@northeastern.edu
Description of the data and file structure
The data consists of 16S rRNA gene sequences taken from genomes or metagenome-assembled genomes (MAGs). Many of the sequences are from bacterial symbionts that live inside wood-boring shipworm mollusks and secrete enzymes that digest lignocellulose. These sequences can be compared to determine the evolutionary relatedness of the members of shipworm symbiont communities.
Files and variables
File: 16S_sequences.xlsx
Description: Excel spreadsheet containing a list of 16S rRNA gene sequences of shipworm symbiont bacteria and metadata. The sequences could be used to compare the relatedness of various shipworm symbionts, to see how a newly discovered symbiont species fits within the established phylogeny, or to determine the relatedness of a wild isolate to previously sequenced strains. Blank cells indicate missing information.
Abbreviations and Codes:
Sequence Name – The given name of the 16S rRNA gene sequence.
16S sequence – The nucleotides that compose the 16S rRNA gene sequence.
Sequence length – The nucleotide length of the 16S rRNA gene sequence in base pairs.
Method of obtaining sequence – If the 16S rRNA gene sequence came from an assembled metagenome, the method used to create the assembly is listed. Here, hybrid refers to a pipeline using both Illumina and Oxford Nanopore. If the 16S sequence was instead taken from a genome provided by a publicly available genome database, the database name is listed.
Clade – A putative classification of the organisms’ phylogeny based on the 16S rRNA gene sequence.
Source – The species of shipworm from which the bacteria were isolated. Source is listed as free-living if the bacteria do not live inside a shipworm host.
