Data from: Targeted multiplex next-generation sequencing: Advances in techniques of mitochondrial and nuclear DNA sequencing for population genomics
Data files
Dec 12, 2012 version, 27.15 MB total
- Beaked_75nucloci_fasta.txt (60.49 KB)
- BeakedWhale_array.zip (5.68 MB)
- Cmyd_array.zip (5.54 MB)
- Cmyd_nucFrags_fasta.txt (62.22 KB)
- FinWhale 43loci_Array1_fasta.txt (19.19 KB)
- FinWhale_array.zip (4.70 MB)
- merged vcf to genotype.r (3.38 KB)
- ngs-mtdna-pipeline-v3.2.r (10.11 KB)
- ngs-snp-pipeline-v3.2.r (10.13 KB)
- ngs.funcs.rdata (124.69 KB)
- README_for_Beaked_75nucloci_fasta.txt (1.29 KB)
- README_for_BeakedWhale_array.txt (1.29 KB)
- README_for_Cmyd_array.txt (1.29 KB)
- README_for_Cmyd_nucFrags_fasta.txt (1.29 KB)
- README_for_FinWhale 43loci_Array1_fasta.txt (1.29 KB)
- README_for_FinWhale_array.txt (1.29 KB)
- README_for_merged vcf to genotype.txt (2.07 KB)
- README_for_ngs-mtdna-pipeline-v3.2.txt (2.07 KB)
- README_for_ngs-snp-pipeline-v3.2.txt (2.07 KB)
- README_for_ngs.funcs.txt (2.07 KB)
- README_for_SpermWhale_50nucSeqs_SNPdisco_Fasta.txt (1.29 KB)
- README_for_Spinner-Spotted_85nuc_loci_Fasta.txt (1.29 KB)
- README_for_Spinner-SpottedDolphin_Array.txt (1.29 KB)
- SpermWhale_50nucSeqs_SNPdisco_Fasta.txt (30.65 KB)
- SpermWhale_Array.zip (4.78 MB)
- Spinner-Spotted_85nuc_loci_Fasta.txt (88.06 KB)
- Spinner-SpottedDolphin_Array.zip (6.03 MB)
Abstract
Next-generation sequencing (NGS) is emerging as an efficient and cost-effective tool in population genomic analyses of nonmodel organisms, allowing simultaneous resequencing of many regions of multi-genomic DNA from multiplexed samples. Here, we detail our synthesis of protocols for targeted resequencing of mitochondrial and nuclear loci: we generate indexed genomic libraries for multiplexing up to 100 individuals in a single sequencing pool, and then enrich the pooled library using custom DNA capture arrays. Our use of DNA sequence from one species to capture and enrich the sequencing libraries of another species (i.e., cross-species DNA capture) indicates that efficient enrichment occurs when sequences are up to about 12% divergent, allowing us to take advantage of genomic information in one species to sequence orthologous regions in related species. In addition to a complete mitochondrial genome, each array includes between 43 and 118 nuclear loci. This allows low-coverage sequencing of between 18 kb and 87 kb of DNA sequence per individual, for single nucleotide polymorphism (SNP) discovery from 50 to 100 individuals in a single sequencing lane. Using this method, we have generated a total of over 500 whole mitochondrial genomes from seven cetacean species and green sea turtles. The greater variation detected in mitogenomes relative to short mtDNA sequences is helping to resolve genetic structure ranging from geographic to species-level differences. These NGS and analysis techniques have allowed for simultaneous population genomic studies of mtDNA and nDNA with greater genomic coverage and phylogeographic resolution than has previously been possible in marine mammals and turtles.
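To make the "about 12% divergent" figure concrete, the following is a minimal sketch of how uncorrected pairwise divergence (the proportion of differing sites) between a capture sequence and an orthologous target could be computed. It is not taken from the deposited R pipelines: the function name `p_distance`, the constant `MAX_DIVERGENCE`, and the two toy sequences are hypothetical, and the sketch assumes the two sequences are already aligned to equal length.

```python
# Illustrative sketch only; names and sequences are hypothetical,
# not part of the deposited pipeline scripts.

MAX_DIVERGENCE = 0.12  # assumption: the "about 12%" figure from the abstract


def p_distance(seq_a: str, seq_b: str) -> float:
    """Return the proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or an ambiguous base
    (anything other than A, C, G, T) are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in valid or b not in valid:
            continue  # skip gaps and ambiguity codes
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


# Toy 25-bp example with two substitutions (made-up sequences).
bait = "ACGTACGTACGTACGTACGTACGTA"
target = "ACGTACGAACGTACGTACCTACGTA"

divergence = p_distance(bait, target)
within_range = divergence <= MAX_DIVERGENCE
print(f"divergence = {divergence:.2%}, within capture range: {within_range}")
```

In practice, divergence would be estimated from full-length alignments of the array sequence against a reference from the target species; the threshold here only mirrors the approximate value reported above.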