Population genomics of American and Eurasian wigeons (Mareca spp.) using ddRAD-seq
Data files
May 08, 2026 version files 6.48 GB
-
README.md
2.25 KB
-
SpecimenData.xlsx
22.58 KB
-
Wigeon.ddRADseq.fastq.zip
6.48 GB
Abstract
Eurasian wigeon (Mareca penelope) and American wigeon (Mareca americana) are sister species with diagnosable differences mostly in male plumage. They breed in the Palearctic and Nearctic, respectively, but due to transoceanic movements during migrations come in contact in North America, Western Europe, and north-eastern Asia, where they occasionally hybridize. To estimate genomic divergence and study their population structure, we sequenced mitochondrial (mt) DNA control region and obtained 3,092 autosomal and 189 Z-sex chromosome loci from double-digest restriction-associated DNA sequencing (ddRAD-seq). Consistent with previous work with a few nuclear loci, we observed discordant patterns between mtDNA and nuclear DNA divergence. Deeply divergent species-specific mtDNA haplogroups contrasted with low autosomal differentiation and moderate Z chromosome divergence. Meanwhile, Z-linked differentiation (Фst = 0.192) between taxa was five times higher than differentiation of autosomal loci (Фst = 0.0386), with four fixed and eight nearly fixed differences in SNPs discovered in three and six Z-linked outlier loci, respectively. No species-specific SNP variants were found among 83 autosomal outlier loci. This elevated Z-chromosome differentiation is most likely the result of selection that has been important in speciation. The lack of population genetic structure within Eurasian wigeon and American wigeon supports the common notion that migratory waterfowl have high dispersal abilities that contribute to strong genetic connectivity between geographic populations.
Dataset DOI: 10.5061/dryad.8931zcs4x
Description of the data and file structure
We obtained tissue samples from 70 Eurasian wigeons and 40 American wigeons collected and sequenced across Asia from Western Siberia to Western Beringia and across North America (see SpecimenData.xlsx). Double-digest restriction associated DNA sequencing (ddRAD-seq) libraries were prepared following the protocol of DaCosta and Sorenson (2014). In brief, genomic DNA was digested using SbfI and EcoRI restriction enzymes, and Illumina TruSeq compatible barcodes were ligated. Ligated DNA fragments 300 to 450 bp in length were extracted from 2% low-melt agarose gels and amplified using standard PCR. Libraries were sequenced (150 bp reads) on an Illumina HiSeq 2500.
Using the DaCosta and Sorenson (2014) bioinformatics pipeline, we recovered 3,281 ddRAD-seq loci, with a mean depth of 199.1 reads per locus per individual.
DaCosta JM, Sorenson MD (2014) Amplification biases and consistent recovery of loci in a double-digest RAD-seq protocol. PLoS One 9: e106713. https://doi.org/10.1371/journal.pone.0106713
Files and variables
File: Wigeon.ddRADseq.fastq.zip
Description: This zipped folder includes an individual fastq file containing the raw ddRAD-seq data for each individual included in this study.
File: SpecimenData.xlsx
Description: This file includes data for each specimen included in this study. This information includes species, museum voucher numbers or archived tissue numbers, sampling location including country, latitude and longitude coordinates, date of collection, and sex of each individual.
Missing values are indicated by "N/A" = not applicable or "no data".
Code/software
Scripts for the DaCosta and Sorenson pipeline used to filter and assemble the ddRAD-seq data are available at https://github.com/BU-RAD-seq/ddRAD-seq-Pipeline.
Access information
Other publicly accessible locations of the data:
- N/A
Data was derived from the following sources:
- N/A
