Data from: Double-digest RAD sequencing outperforms microsatellite loci at assigning paternity and estimating relatedness: a proof of concept in a highly promiscuous bird
Data files
Feb 06, 2018 version files 283.23 KB
-
2012-2016 MSAT Genotypes.txt
16.49 KB
-
2012-2016 SNP Genotypes.txt
266.74 KB
Feb 06, 2018 version files 1.75 GB
-
2012-2016 MSAT Genotypes.txt
16.49 KB
-
2012-2016 SNP Genotypes.txt
266.74 KB
-
popmap_1.txt
198 B
-
popmap_10.txt
218 B
-
popmap_11v2.txt
207 B
-
popmap_12v2.txt
207 B
-
popmap_9.txt
198 B
-
popmap_all.txt
1.60 KB
-
post_rxstacks_assembly_1.tar.gz
135.23 MB
-
post_rxstacks_assembly_10.tar.gz
171.33 MB
-
post_rxstacks_assembly_11.tar.gz
134.84 MB
-
post_rxstacks_assembly_12.tar.gz
102.95 MB
-
post_rxstacks_assembly_9.tar.gz
107.57 MB
-
post_rxstacks_assembly_all.tar.gz
1.10 GB
Abstract
Information on genetic relationships among individuals is essential to many studies of the behavior and ecology of wild organisms. Parentage and relatedness assays based on large numbers of SNP loci hold substantial advantages over the microsatellite markers traditionally used for these purposes. We present a double-digest restriction site-associated DNA sequencing (ddRAD-seq) analysis pipeline that, as such, simultaneously achieves the SNP discovery and genotyping steps and which is optimized to return a statistically powerful set of SNP markers (typically 150-600 after stringent filtering) from large numbers of individuals (up to 240 per run). We explore the tradeoffs inherent in this approach through a set of experiments in a species with a complex social system, the variegated fairy-wren (Malurus lamberti), and further validate it in a phylogenetically broad set of other bird species. Through direct comparisons with a parallel dataset from a robust panel of highly variable microsatellite markers, we show that this ddRAD-seq approach results in substantially improved power to discriminate among potential relatives and considerably more precise estimates of relatedness coefficients. The pipeline is designed to be universally applicable to all bird species (and with minor modifications to many other taxa), to be cost- and time-efficient, and to be replicable across independent runs such that genotype data from different study periods can be combined and analyzed as field samples are accumulated.