Dryad Home > Main > Dryad Data Packages > View Item

Data from: Ultraconserved elements anchor thousands of genetic markers for target enrichment spanning multiple evolutionary timescales

When using this data, please cite the original article:

Faircloth BC, McCormack JE, Crawford NG, Harvey MG, Brumfield RT, Glenn TC (2011) Ultraconserved elements anchor thousands of genetic markers for target enrichment spanning multiple evolutionary timescales. Systematic Biology, online in advance of print. doi:10.1093/sysbio/sys004

Additionally, please cite the Dryad data package:

Faircloth BC, McCormack JE, Crawford NG, Harvey MG, Brumfield RT, Glenn TC (2011) Data from: Ultraconserved elements anchor thousands of genetic markers for target enrichment spanning multiple evolutionary timescales. Dryad Digital Repository. doi:10.5061/dryad.64dv0tg1
Cite | Share
Download the data package citation in the following formats:
   RIS (compatible with EndNote, Reference Manager, ProCite, RefWorks)
   BibTex (compatible with BibDesk, LaTeX)
Dryad Package Identifier doi:10.5061/dryad.64dv0tg1    321 views  
Abstract Although massively parallel sequencing has facilitated large-scale DNA sequencing, comparisons among distantly related species rely upon small portions of the genome that are easily aligned. Methods are needed to efficiently obtain comparable DNA fragments prior to massively parallel sequencing, particularly for biologists working with non-model organisms. We introduce a new class of molecular marker, anchored by ultraconserved genomic elements (UCEs), that universally enable target enrichment and sequencing of thousands of orthologous loci across species separated by hundreds of millions of years of evolution. Our analyses here focus on use of UCE markers in Amniota, because UCEs and phylogenetic relationships are well known in some amniotes. We perform an in silico experiment to demonstrate that sequence flanking 2,030 UCEs contains information sufficient to enable unambiguous recovery of the established primate phylogeny. We extend this experiment by performing an in vitro enrichment of 2,386 UCE-anchored loci from nine, non-model avian species. We then use alignments of 854 of these loci to unambiguously recover the established evolutionary relationships within and among three ancient bird lineages. Because many organismal lineages have UCEs, this type of genetic marker and the analytical framework we outline can be applied across the tree of life, potentially reshaping our understanding of phylogeny at many taxonomic levels.
Keywords ultra-conserved elements, genetic markers, sequence capture, target enrichment, phylogenomics, flanking sequence,
Date Deposited 2011-12-16T22:20:46Z
Show Full Metadata

primate-probes-matches.sqlite    15 views   45 downloads View File Details
SQLITE database of all-probes matches to various primate genome sequences. 'matches' tables shows match status (1 = TRUE) of probes to genomes and 'match-map' shows the primate contig matched and the orientation of the match.
Download: primate-probes-matches.sqlite ( 1.040Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



all-probes    17 views   50 downloads View File Details
FASTA-formatted text file of UCE-anchored probe sequences, designed from UCEs identified across Reptiles (birds and lizard). We (1) aligned these sequences to extant primate genomic sequences and (2) used synthetic oligos identical to a subset of these sequences to enrich target DNA in birds. Fasta header gives probes location relative to chromosomes of galGal3 (UCSC).
Download: all-probes.fasta ( 865.4Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



probe-matches-to-taxa.sqlite    15 views   48 downloads View File Details
SQLITE database of all-probes.fasta matches to genome-enabled vertebrate taxa (Supplementary Table 1). Column value of 1 = TRUE (meaning there was a match).
Download: probe-matches-to-taxa.sqlite ( 295.9Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



primate-uce-anchored-loci.fasta.tar.bz2    13 views   44 downloads View File Details
BZIP2 file of FASTA sequences sliced from primate genomes that include the match-site of probes within the all-probes.fasta file ± flanking sequence. The archive contains fasta files for each primate in Supplementary Table 2.
Download: primate-uce-anchored-loci.fasta.tar.bz2 ( 4.043Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



primate-uce-anchored-alignments.nexus.tar.bz2    18 views   43 downloads View File Details
NEXUS-formatted files providing alignments of UCE-anchored genomic regions identified in primate genomes where probes from all-probes.fasta matches respective primate genomes. We used these alignments to reconstruct the primate phylogeny.
Download: primate-uce-anchored-alignments.nexus.tar.bz2 ( 1.113Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-contigs-assembled-from-captures.tar.bz2    19 views   44 downloads View File Details
BZIP2 archive of FASTA files providing bird contigs assembled from reads following target enrichment with a subset of probes in all-probes.fasta. Archive contains one file per species.
Download: birds-contigs-assembled-from-captures.tar.bz2 ( 2.158Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-probe-matches.sqlite    17 views   45 downloads View File Details
SQLITE database of contigs (in birds-contigs-assembled-from-captures.tar.bz2), that match, without duplication, probes within all-probes.fasta.
Download: birds-probe-matches.sqlite ( 347.1Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-uce-anchored-loci.fasta.bz2    13 views   41 downloads View File Details
BZIP2 archive of FASTA files, corresponding to those contigs matching target enrichment probes (in birds-probe-matches.sqlite) that are not duplicated. We assemble these reads, on a locus-by-locus basis to generate the alignments in birds-uce-anchored-alignments.nexus.
Download: birds-uce-anchored-loci.fasta.bz2 ( 1.281Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-uce-anchored-alignments.nexus.tar.bz2    12 views   43 downloads View File Details
BZIP2 archive of NEXUS-formatted file alignments generated, on a locus-by-locus basis, from FASTA sequences in birds-uce-anchored-loci.
Download: birds-uce-anchored-alignments.nexus.tar.bz2 ( 479.4Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



dbSNP132-to-hg19-uce-200    11 views   44 downloads View File Details
CSV-formatted file giving the UCE overlapping SNP locations present in dbSNP132. The 'snp-name' column gives the rs-accession for the SNP record.
Download: dbSNP132-to-hg19-uce-200.csv ( 430.2Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



probe-matches-to-taxa.lastz.tar.bz2    13 views   44 downloads View File Details
TAB-delimited LASTZ output from matches of probes in all-probes.fasta to the vertebrate genomes in Supplementary Table 1. We parsed this file to remove duplicates. We archived results for non-duplicated matches in probe-matches-to-taxa.sqlite.
Download: probe-matches-to-taxa.lastz.tar.bz2 ( 754.5Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



primate-probes-matches.lastz.tar.bz2    12 views   40 downloads View File Details
TAB-delimited LASTZ output from matches of probes in all-probes.fasta to the primate genomes in Supplementary Table 2. We parsed this file to remove duplicates. We archived results for non-duplicated matches in primate-probes-matches.sqlite
Download: primate-probes-matches.lastz.tar.bz2 ( 858.6Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-probes-matches.lastz.tar.bz2    15 views   42 downloads View File Details
TAB-delimited LASTZ output from matches of probes in all-probes.fasta to the vertebrate genomes in Supplementary Table 3. We parsed this file to remove duplicates. We archived results for non-duplicated matches in birds-probe-matches.sqlite.
Download: birds-probes-matches.lastz.tar.bz2 ( 247.4Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



probe-subset-2560-synthesized    15 views   27 downloads View File Details
FASTA file of the subset of 2,560 probes from all-probes that we synthesized for in vitro targeted capture of 2,386 loci in birds. FASTA header gives the probe id, the probe position in the chicken (galGal3) genome, and the count of probes targeting that locus.
Download: probe-subset-2560-synthesized.fasta ( 419.2Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



all-sample-sim1000.txt    12 views   20 downloads View File Details
Simulations results used in Supplementary table indicating the effects of adding additional taxa on the reduction of loci in a complete data matrix.
Download: all-sample-sim1000.txt.bz2 ( 6.513Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



all-sample-sim1000.txt    13 views   25 downloads View File Details
Data used to create Supplementary Figure 7.
Download: all-sample-sim1000.txt.bz2 ( 5.646Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-contig-lengths-by-probe-filtered-birds.csv    13 views   19 downloads View File Details
Data used in Supplementary Figure 1.
Download: birds-contig-lengths-by-probe-filtered-birds.csv.bz2 ( 5.646Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



birds-gc-length-species-matches.csv    12 views   20 downloads View File Details
Data used in Supplementary Figures 2-5.
Download: birds-gc-length-species-matches.csv.bz2 ( 5.646Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



supplementary-tables-and-figures    14 views   26 downloads View File Details
Supplementary Tables 1-4 and Supplementary Figures 1-8.
Download: supplementary-tables-and-figures.doc ( 12.09Mb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  



computer-code-README    11 views   18 downloads View File Details
Location, commit (i.e., snapshot), and URL information for computer code used as part of this manuscript. Placed into Dryad in lieu of committing code under CC0.
Download: computer-code-README.txt ( 1.324Kb )
To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data.  


My Account

Browse

Information