Data from: Parasite infection of public databases: a data mining approach to identify apicomplexan contaminations in animal genome and transcriptome assemblies

Borner J, Burmester T

Date Published: January 20, 2017

DOI: http://dx.doi.org/10.5061/dryad.mn338

 

Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data. CC0 (opens a new window) Open Data (opens a new window)

Title extracted contigs
Downloaded 13 times
Description Fasta files containing the extracted, parasite-derived contigs. Contigs from each Assembly are stored in a separate file.
Download extracted_contigs.zip (35.27 Mb)
Details View File Details
Title predicted amino acids
Downloaded 9 times
Description Fasta files containing the predicted amino acid sequences based on the extracted contigs. Sequences from each Assembly are stored in a separate file.
Download predicted_aa.zip (2.261 Mb)
Details View File Details
Title dataset 1 single genes
Downloaded 9 times
Description Fasta files containing the single gene amino acid alignments of dataset 1 prior to processing by Gblocks.
Download dataset_1_single_genes.zip (11.00 Mb)
Details View File Details
Title dataset 1 single genes after gblocks
Downloaded 11 times
Description Fasta files containing the single gene amino acid alignments of dataset 1 after processing by Gblocks.
Download dataset_1_single_genes_gblocks.zip (2.817 Mb)
Details View File Details
Title dataset 2 single genes
Downloaded 10 times
Description Fasta files containing the single gene amino acid alignments of dataset 2 prior to processing by Gblocks
Download dataset_2_single_genes.zip (2.847 Mb)
Details View File Details
Title dataset 2 single genes after gblocks
Downloaded 10 times
Description Fasta files containing the single gene amino acid alignments of dataset 2 after processing by Gblocks.
Download dataset_2_single_genes_gblocks.zip (811.3 Kb)
Details View File Details
Title dataset 1 superalignment in FASTA format
Downloaded 7 times
Description Concatenated superalignment of all 1420 single gene amino acid alignments of dataset 1 after processing by Gblocks.
Download dataset_1_superalignment.fa (14.51 Mb)
Details View File Details
Title dataset 2 superalignment in FASTA format
Downloaded 9 times
Description Concatenated superalignment of all 301 single gene amino acid alignments of dataset 2 after processing by Gblocks.
Download dataset_2_superalignment.fa (3.258 Mb)
Details View File Details
Title mitochondrial sequences from gorilla Plasmodium in FASTA format
Downloaded 9 times
Description Nucleotide alignment of mitochondrial Plasmodium sequences including two sequences that were extraceted from the gorilla genome. The alignment is based on data from Liu et al. (2010) and only contains sequences from Clades G1 and C1.
Download mito_gorilla.fa (193.1 Kb)
Details View File Details
Title 18S rRNA Piroplasmida in FASTA format
Downloaded 6 times
Description Nucleotide alignment of 18s rRNA sequences from Piroplasmida including a sequences that was extraceted from the platypus genome. The alignmnet is based on data from Paparini et al. (2015) and was processed by Gblocks.
Download 18s_piroplasmida.fa (19.06 Kb)
Details View File Details

When using this data, please cite the original publication:

Borner J, Burmester T (2017) Parasite infection of public databases: a data mining approach to identify apicomplexan contaminations in animal genome and transcriptome assemblies. BMC Genomics 18(1): 100. http://dx.doi.org/10.1186/s12864-017-3504-1

Additionally, please cite the Dryad data package:

Borner J, Burmester T (2017) Data from: Parasite infection of public databases: a data mining approach to identify apicomplexan contaminations in animal genome and transcriptome assemblies. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.mn338
Cite | Share
Download the data package citation in the following formats:
   RIS (compatible with EndNote, Reference Manager, ProCite, RefWorks)
   BibTex (compatible with BibDesk, LaTeX)

Search for data

Be part of Dryad

We encourage organizations to: