Data from: Rapid and accurate taxonomic classification of insect (Class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier

Gibson JF, Shokralla S, Golding GB, Hajibabaei M, Porter TM, Baird DJ

Date Published: February 18, 2014

DOI: http://dx.doi.org/10.5061/dryad.bc8pc

 

Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data (CC0).

Title Canadian benthos Fasta files
Downloaded 84 times
Description Fasta files for the benthos insect datasets from Fredericton, New Brunswick, Canada: 1) ~ 100 bp fragments (454), 2) ~ 200 bp fragments (454), 3) ~ 600 bp reference (Sanger) sequences.
Download CanadianBenthosFastas.zip (257.4 Kb)
Download README.docx (40.08 Kb)
Title Training the naive Bayesian classifier
Downloaded 319 times
Description Contains the Fasta and taxonomy files needed to train the Ribosomal Database Project naive Bayesian classifier three different ways: 1) GenBank-genus, 2) GenBank-barcode, 3) GenBank-family.
Download TrainingTheClassifier.zip (21.64 Mb)
Download README.docx (120.2 Kb)
Title Customizing the Insecta taxonomy used with the classifier
Downloaded 106 times
Description Contains three tab-delimited taxonomy files that can be used to help customize the taxonomic scheme used by the naive Bayesian classifier: 1) GenBank-genus, 2) GenBank-barcode, 3) GenBank-family. One of these files should be used with one of the two provided Perl scripts to generate a taxonomy file in the format required by the Ribosomal Database Project naive Bayesian classifier during training: 1) make_NBC_taxonomy.plx, 2) make_NBC_taconomy_family.plx.
Download CustomizingInsectaTaxonomy.zip (97.05 Kb)
Download README.docx (58.77 Kb)
Title Malaise insect dataset
Downloaded 15 times
Description Fasta file of the Malaise insect dataset; see also Gibson et al. (in prep).
Download malaise.fasta (355.9 Kb)
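
As a quick sanity check after downloading, the Fasta files listed above can be summarized by sequence length, for example to confirm the approximate 100 bp, 200 bp, and 600 bp fragment sizes described for the benthos datasets. The following is a minimal Python sketch and is not part of the data package: the function names `read_fasta` and `length_summary` and the toy records are illustrative assumptions, and it assumes plain-text Fasta with `>` header lines.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (header, sequence) pairs from a Fasta-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequences may wrap over several lines
    if header is not None:
        yield header, "".join(chunks)


def length_summary(handle):
    """Return count, min, max, and mean sequence length for a Fasta stream."""
    lengths = [len(seq) for _, seq in read_fasta(handle)]
    if not lengths:
        return {"count": 0, "min": None, "max": None, "mean": None}
    return {
        "count": len(lengths),
        "min": min(lengths),
        "max": max(lengths),
        "mean": sum(lengths) / len(lengths),
    }


# Toy records (hypothetical, not taken from the data package).
example = StringIO(">seq1 hypothetical\nACGTAC\nGT\n>seq2\nACG\n")
summary = length_summary(example)
print(summary)
```

To apply the same check to a downloaded file, open it as text and pass the handle, for instance `with open("malaise.fasta") as fh: print(length_summary(fh))`, assuming the file has been saved in the working directory.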

When using this data, please cite the original publication:

Porter TM, Gibson JF, Shokralla S, Golding GB, Hajibabaei M, Baird DJ (2014) Rapid and accurate taxonomic classification of insect (Class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier. Molecular Ecology Resources 14(5): 929-942. http://dx.doi.org/10.1111/1755-0998.12240

Additionally, please cite the Dryad data package:

Gibson JF, Shokralla S, Golding GB, Hajibabaei M, Porter TM, Baird DJ (2014) Data from: Rapid and accurate taxonomic classification of insect (Class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.bc8pc