Data from: Testing the waters: A comparison of DNA extraction methods for the metabarcoding of microbial communities
Data files
Mar 23, 2026 version files 19.17 MB
-
dna-sequences_16S.fasta
7.61 MB
-
dna-sequences_18S.fasta
518.66 KB
-
feature-table-16S.tsv
5.74 MB
-
feature-table-18S.tsv
1.04 MB
-
Metadata_16S.tsv
3.46 KB
-
Metadata_18S.tsv
3.18 KB
-
pr2-taxonomy-18S.tsv
339.24 KB
-
README.md
2.32 KB
-
silva-taxonomy-16S.tsv
3.90 MB
Abstract
DNA metabarcoding is an emerging technique for enhancing microbial community monitoring programs that assess water quality and environmental health. However, comparability between studies is limited by a lack of standardized laboratory protocols. To contribute to protocol optimization and decision making, we compare seven commercially available DNA extraction kits and evaluate their impact on DNA yield, PCR inhibitor removal, and taxonomic resolution of microplankton, microalgae, and particle-associated bacterial communities using 18S V9 and 16S V3-V4 amplicon sequencing of samples from two contrasting estuarine environments varying in salinity and productivity. We observed greater DNA yields and detected PCR inhibition at the higher productivity, lower salinity site. The Qiagen DNeasy PowerSoil Pro and Omega E.Z.N.A. Tissue DNA kits both yielded greater amounts of DNA, but only the soil- and plant-specific kits effectively removed PCR inhibitors. Notably, differences in taxonomic richness between kits were statistically significant but small (often <10 taxa). Additionally, beta diversity was more strongly influenced by abundance than by the presence/absence of taxa. All extraction methods shared 84-90% of resolved taxa and detected all major eukaryotic divisions/phyla identified by microscopy, as well as cyanobacteria. Furthermore, all kits detected several potentially harmful taxa, including Akashiwo sanguinea and Vibrio sp. These results suggest that DNA extraction methods should be chosen based on environmental conditions, and we recommend the Qiagen PowerSoil Kit for obtaining high yields of uninhibited DNA from productive estuarine water samples. While alpha- and beta-diversity are largely similar between extraction methods, pilot studies should be performed to confirm the resolution of taxa of interest.
Dataset DOI: 10.5061/dryad.98sf7m0w3
Description of the data and file structure
Files and variables
File: feature-table-16S.tsv
Description: ASV counts for the 16S dataset
Variables
- OTU ID = ASV ID
- Sample Number
File: silva-taxonomy-16S.tsv
Description: Taxonomic assignments for each ASV from the 16S dataset
Variables
- Feature ID: ASV ID
- Taxon: Taxonomic lineage
- Consensus: VSearch consensus
File: pr2-taxonomy-18S.tsv
Description: Taxonomic assignments for each ASV from the 18S dataset
Variables
- Feature ID: ASV ID
- Taxon: Taxonomic lineage
- Consensus: VSearch consensus
File: feature-table-18S.tsv
Description: ASV counts for the 18S dataset
Variables
- OTU ID = ASV ID
- Sample Number
File: dna-sequences_18S.fasta
Description: Sequences for each 18S ASV in FASTA format
File: Metadata_18S.tsv
Description: Metadata for 18S Sample Numbers
Variables
- sampleid: Sample Number
- method: DNA extraction method. OBB = Omega E.Z.N.A. Tissue DNA Kit+ Bead-Beating; OES = Omega E.Z.N.A. Soil DNA Kit; OMT = Omega E.Z.N.A. Tissue DNA Kit; QBB = Qiagen DNeasy Blood & Tissue Kit+ Bead-Beating; QBT = Qiagen DNeasy Blood & Tissue Kit; QPP = Qiagen DNeasy Plant Pro Kit; QPS = Qiagen DNeasy PowerSoil Pro Kit
- site: Sampling Site. FM = Fort Matanzas, LM = Lake Middle.
- i7id: i7 primer id
- i7index: i7 primer sequence
- i5id: i5 primer id
- i5index: i5 primer index
File: Metadata_16S.tsv
Description: Metadata for 16S Sample Numbers
Variables
- sampleid: Sample Number
- method: DNA extraction method. OBB = Omega E.Z.N.A. Tissue DNA Kit+ Bead-Beating; OES = Omega E.Z.N.A. Soil DNA Kit; OMT = Omega E.Z.N.A. Tissue DNA Kit; QBB = Qiagen DNeasy Blood & Tissue Kit+ Bead-Beating; QBT = Qiagen DNeasy Blood & Tissue Kit; QPP = Qiagen DNeasy Plant Pro Kit; QPS = Qiagen DNeasy PowerSoil Pro Kit
- site: Sampling Site
- i7id: i7 primer id
- i7index: i7 primer sequence
- i5id: i5 primer id
- i5index: i5 primer index
File: dna-sequences_16S.fasta
Description: Sequences for each 16S ASV in FASTA format
DNA was extracted from filtered surface water samples at two sites using seven commercially available DNA extraction methods: Omega E.Z.N.A. Tissue DNA Kit, Omega E.Z.N.A. Tissue DNA Kit+ Bead-Beating, Qiagen DNeasy Blood & Tissue Kit, Omega E.Z.N.A. Soil DNA Kit, Qiagen DNeasy Blood & Tissue Kit+ Bead-Beating, Qiagen DNeasy Plant Pro Kit, and Qiagen DNeasy PowerSoil Pro Kit.
Extracted DNA was amplified using a two-step PCR method and sequenced using an Illumina MiSeq. Reads were demultiplexed using the Illumina MiSeq Software and quality control was performed using fastp. Primer trimming, denoising, ASV generation, and taxonomic assignments were performed using QIIME2.
