Skip to main content
Dryad

Data from: Intraspecific DNA contamination distorts subtle population structure in a marine fish: decontamination of herring samples before restriction-site associated (RAD) sequencing and its effects on population genetic statistics

Cite this dataset

Petrou, Eleni L. et al. (2018). Data from: Intraspecific DNA contamination distorts subtle population structure in a marine fish: decontamination of herring samples before restriction-site associated (RAD) sequencing and its effects on population genetic statistics [Dataset]. Dryad. https://doi.org/10.5061/dryad.g28rh86

Abstract

Wild specimens are often collected in challenging field conditions, where samples may be contaminated with the DNA of conspecific individuals. This contamination can result in false genotype calls, which are difficult to detect, but may also cause inaccurate estimates of heterozygosity, allele frequencies, and genetic differentiation. Marine broadcast spawners are especially problematic, because population genetic differentiation is low and samples are often collected in bulk and sometimes from active spawning aggregations. Here, we used contaminated and clean Pacific herring (Clupea pallasi) samples to test (i) the efficacy of bleach decontamination, (ii) the effect of decontamination on RAD genotypes, and (iii) the consequences of contaminated samples on population genetic analyses. We collected fin tissue samples from actively spawning (and thus contaminated) wild herring and non-spawning (uncontaminated) herring. Samples were soaked for 10 minutes in bleach or left untreated, and extracted DNA was used to prepare DNA libraries using a restriction-site associated DNA (RAD) approach. Our results demonstrate that intraspecific DNA contamination affects patterns of individual and population variability, causes an excess of heterozygotes, and biases estimates of population structure. Bleach decontamination was effective at removing intraspecific DNA contamination and compatible with RAD sequencing, producing high-quality sequences, reproducible genotypes, and low levels of missing data. Although sperm contamination may be specific to broadcast spawners, intraspecific contamination of samples may be common and difficult to detect from high-throughput sequencing data, and can impact downstream analyses.

Usage notes

Funding

National Science Foundation, Award: 1203868

Location

North America