Data from: IM-TORNADO: a tool for comparison of 16S reads from paired-end libraries

Jeraldo, Patricio1 2; Kalari, Krishna1; Chen, Xianfeng1; Bhavsar, Jaysheel1; Mangalam, Ashutosh1; White, Bryan2; Nelson, Heidi1; Kocher, Jean-Pierre1; Chia, Nicholas1 2

Published Nov 12, 2015 on Dryad. https://doi.org/10.5061/dryad.fm67n

Data files

Nov 12, 2015 version files 14.59 GB

comparison_aligners.tar.bz2

222.08 MB
comparison_taxonomy.tar.bz2

203.92 MB
validation_realistic.tar.bz2

440.30 MB
validation_taxonomy_errors.tar.bz2

6.45 GB
validation_V3V5.tar.bz2

3.86 GB
validation_V6V9.tar.bz2

3.40 GB

Abstract

Motivation: 16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. Results: We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. Availability and Implementation: IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.

Data from: IM-TORNADO: a tool for comparison of 16S reads from paired-end libraries

Data files

Abstract

Files for taxonomy comparison

Files for comparison of sequence aligners

Files for validation with realistic synthetic reads

Files for taxonomy comparison using reads with errors

Files for validation of beta diversity using 16S rDNA regions V3 to V5

Files for validation of beta diversity using 16S rDNA regions V6 to V9

Data from: IM-TORNADO: a tool for comparison of 16S reads from paired-end libraries

Data files

Abstract

Usage notes

Files for taxonomy comparison

Files for comparison of sequence aligners

Files for validation with realistic synthetic reads

Files for taxonomy comparison using reads with errors

Files for validation of beta diversity using 16S rDNA regions V3 to V5

Files for validation of beta diversity using 16S rDNA regions V6 to V9

Works referencing this dataset