Rees, Jonathan A.; Cranston, Karen Ann; Rees, Jonathan; Cranston, Karen (2017), Data from: Automated assembly of a reference taxonomy for phylogenetic data synthesis, Dryad, Dataset, https://doi.org/10.5061/dryad.vk775
Taxonomy and nomenclature data are critical for any project that synthesizes biodiversity data, as most biodiversity data sets use taxonomic names to identify taxa. Open Tree of Life is one such project, synthesizing sets of published phylogenetic trees into comprehensive summary trees. No single published taxonomy met the taxonomic and nomenclatural needs of the project. Here we describe a system for reproducibly combining several source taxonomies into a synthetic taxonomy, and we discuss the challenges of taxonomic and nomenclatural synthesis for downstream biodiversity projects.
README file for package
Describes all files in package and provides external links to additional documentation.
Contains a mapping of Genbank accession numbers to NCBI taxon ids and names. The accessions are those that are the reference sequences for SILVA clusters.
The taxonomy amendments used in preparation of OTT. Includes all amendments in https://github.com/OpenTreeOfLife/amendments-1 at the time that we built the version of OTT described in the article. Each amendment is a json file that includes the change being proposed as well as sources that support the change.
mapping of source ids to OTT ids
Cumulative mapping of identifiers in sources to OTT identifiers. Format of each line is source:sourceid,OTTid.
This is the previous version of the Open Tree Taxonomy. It is included because we use the previous version during identifier assignment (ott2.10 required to build ott3.0). See the enclosed README for details on files and formats.
OTT version 3.0
Version of the Open Tree Taxonomy described in the manuscript. See the enclosed README for details on files and formats.
The 'separation taxonomy' described in the manuscript. Contains a list of taxa and a list of synonyms.