Data from: Challenges with using names to link digital biodiversity information

Patterson DJ, Mozzherin D, Shorthouse DP, Thessen A

Date Published: May 27, 2016

DOI: http://dx.doi.org/10.5061/dryad.3160r

 

Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data (CC0).

Title: Tab-delimited file of the data used and described in the Patterson et al. 'Challenges' paper in Biodiversity Data Journal (2016)
Description: Data produced by mapping GenBank and DRYAD content against the Catalogue of Life, and by the subsequent reclassification described in the 'Challenges' paper. For full details see the publication ('The Challenges Paper'). Tab-delimited format. Fields:
   Serial Number (as provided by FileMaker software)
   Input Name (name-string as obtained from source)
   GN UUID (UUID as created by Global Names, as described in the Challenges paper)
   Canonical version (canonical form of the name-string as formed by Global Names software)
   GenBank (indicates if the name-string was in the GenBank download)
   Dryad Original (indicates if the name-string was in the original downloads from DRYAD)
   DRYAD Preprocessed (indicates if the name-string was in the DRYAD compilation after pre-processing)
   Scientific Name (indicates if the name-string is a scientific name)
   Terminal Taxon (indicates if the name-string refers to a species or subordinate taxon)
   GNXM (classification of matches as formed by Global Names Cross Mapper; details in the full publication)
   GN Matched Name (name-string that Global Names Cross Mapper matched to)
   GN Matched Canonical Form (canonical form of the name-string that Global Names Cross Mapper matched to)
   Input Rank (rank assigned to the name-string by the data source)
   GNXM Matched Rank (rank inferred by Global Names Cross Mapper)
   GNXM Matched Edit Distance (edit distance, as described in the associated paper, for fuzzy matches found by Global Names Cross Mapper)
   GNXM Matched Score (confidence measure, as described in the associated paper, for matches found by Global Names Cross Mapper)
   Final Annotation (final classification made as a result of the analyses of the paper, and as described in the paper)
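For readers who want to work with the file, the field list above can be turned into a small streaming reader. This is a minimal sketch, not the authors' tooling. It assumes the columns appear in the order listed in the description, that the file is plain tab-delimited text without quoting, and that there may or may not be a header row (the has_header flag is hypothetical and should be set after inspecting the first line). The function names are illustrative only.

```python
import csv
from collections import Counter
from typing import Dict, Iterable, Iterator

# Column names taken from the package description. Their order in the file,
# and whether the file has a header row, are assumptions to verify.
FIELDS = [
    "Serial Number", "Input Name", "GN UUID", "Canonical version",
    "GenBank", "Dryad Original", "DRYAD Preprocessed", "Scientific Name",
    "Terminal Taxon", "GNXM", "GN Matched Name", "GN Matched Canonical Form",
    "Input Rank", "GNXM Matched Rank", "GNXM Matched Edit Distance",
    "GNXM Matched Score", "Final Annotation",
]


def iter_records(lines: Iterable[str], has_header: bool = False) -> Iterator[Dict[str, str]]:
    """Yield one dict per row, reading lazily so the whole file is never in memory."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    if has_header:
        next(reader, None)  # skip the header row if one is present
    for row in reader:
        if not row:
            continue  # ignore blank lines
        # Pad short rows so every field name is present in the record.
        row = row + [""] * (len(FIELDS) - len(row))
        yield dict(zip(FIELDS, row))


def count_final_annotations(lines: Iterable[str], has_header: bool = False) -> Counter:
    """Tally how often each value occurs in the Final Annotation column."""
    return Counter(rec["Final Annotation"] for rec in iter_records(lines, has_header))
```

To use it, open the downloaded .tab file with open(path, encoding="utf-8", newline="") and pass the file handle as lines; the UTF-8 encoding is also an assumption. Because the file is about 391.5 Mb, iterating row by row is likely to be more practical than loading it whole.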
Download Patterson_et_al_Challenges_with_Names_dat...30.tab (391.5 Mb)

When using this data, please cite the original publication:

Patterson DJ, Mozzherin D, Shorthouse DP, Thessen A (2016) Challenges with using names to link digital biodiversity information. Biodiversity Data Journal 4: e8080. http://dx.doi.org/10.3897/BDJ.4.e8080

Additionally, please cite the Dryad data package:

Patterson DJ, Mozzherin D, Shorthouse DP, Thessen A (2016) Data from: Challenges with using names to link digital biodiversity information. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.3160r
