Data from: Next-generation sampling: pairing genomics with herbarium specimens provides species-level signal in Solidago (Asteraceae)
Data files
May 08, 2016 version files 757.27 KB
-
FastqToTagCountPlugin.xml
374 B
-
FastqToTBTPlugin.xml
423 B
-
GBSHapMapFiltersPlugin.chr1.snpmerged.xml
536 B
-
Junceae_30percent_hapmap.hmp.txt
139.92 KB
-
Junceae_70percent_hapmap.hmp.txt
47.29 KB
-
MergeDuplicateSNP_vcf_Plugin.chr1.xml
319 B
-
MergeDuplicateSNPsPlugin.chr1.xml
458 B
-
MergeMultipleTagCountPlugin.xml
327 B
-
MergeTagsByTaxaFilesPlugin.xml
326 B
-
Solidago_keyfile.txt
12.64 KB
-
Solidago_TASSEL_options.xlsx
50.13 KB
-
Squarrosae_30percent_hapmap.hmp.txt
170.78 KB
-
Squarrosae_70percent_hapmap.hmp.txt
22.73 KB
-
TagCountToFastqPlugin.xml
348 B
-
TagsToSNPByAlignmentPlugin.chr1.xml
629 B
-
tbt2vcfPlugin.chr1.xml
460 B
-
Triplinerviae_30percent_hapmap.hmp.txt
264.14 KB
-
Triplinerviae_70percent_hapmap.hmp.txt
44.70 KB
-
UExportTagPairPlugin.xml
395 B
-
UTagCountToTagPairPlugin.xml
349 B
Abstract
Premise of the study: The ability to conduct species delimitation and phylogeny reconstruction with genomic data sets obtained exclusively from herbarium specimens would rapidly enhance our knowledge of large, taxonomically contentious plant genera. In this study, the utility of genotyping by sequencing is assessed in the notoriously difficult genus Solidago (Asteraceae) by attempting to obtain an informative single-nucleotide polymorphism data set from a set of specimens collected between 1970 and 2010. Methods: Reduced representation libraries were prepared and Illumina-sequenced from 95 Solidago herbarium specimen DNAs, and resulting reads were processed with the nonreference Universal Network-Enabled Analysis Kit (UNEAK) pipeline. Multidimensional clustering was used to assess the correspondence between genetic groups and morphologically defined species. Results: Library construction and sequencing were successful in 93 of 95 samples. The UNEAK pipeline identified 8470 single-nucleotide polymorphisms, and a filtered data set was analyzed for each of three Solidago subsections. Although results varied, clustering identified genomic groups that often corresponded to currently recognized species or groups of closely related species. Discussion: These results suggest that genotyping by sequencing is broadly applicable to DNAs obtained from herbarium specimens. The data obtained and their biological signal suggest that pairing genomics with large-scale herbarium sampling is a promising strategy in species-rich plant groups.