Data from: Immunomics of the koala (Phascolarctos cinereus )

Abts KC, Ivy JA, DeWoody JA

Date Published: March 20, 2015



Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data. CC0 (opens a new window) Open Data (opens a new window)

Title DeconSeq vertebrate rRNA database
Downloaded 14 times
Description This is a fasta file containing the custom vertebrate ribosomal RNA database to be used with DeconSeq to remove rRNA contaminants from sequencing data. It was created by extracting vertebrate rRNA sequences from NCBI using the query "(ribosomal RNA) AND "vertebrates"[porgn:_txid7742]", checking the sequence descriptions by eye (removing undesired sequences), and then following the procedure to create a custom database from the DeconSeq manual ( Briefly, long stretches of ambiguous bases (Ns) were removed, and then the sequences were filtered (using PRINSEQ) to remove short sequences (< 200 bp), those with >10 ambiguous bases (Ns), and duplicate sequences. Finally, the fasta file was indexed using BWA and used as the database for all DeconSeq filtering in this project.
Download vetebrateRRNA_trimmed.split.filtered.fasta (64.73 Mb)
Details View File Details
Title spleen/buffy coat combined trinity assembly
Downloaded 17 times
Description RNA was extracted from the spleen and buffy coat (white blood cell portion of a blood sample) of one and two individuals from the San Diego Zoo koala colony, respectively, via TRIzol (Life Technologies) and Direct-zol (Zymo Research) according to manufacturer's instructions. ~5 ug of RNA was used to create cDNA libraries for each sample according to the Illumina TruSeq RNA sample preparation manual and using random hexamer priming. The buffy coat libraries were barcoded, pooled, and sequenced on one lane of the Illumina HiSeq 2000 platform while the spleen library was sequenced on one lane of the Illumina HiSeq 2500 platform. The raw reads were then filtered for quality (PHRED Q score > 20) and length ( > 30 bp). Ribosomal RNA reads were reduced using DeconSeq (version 0.4.2) and a custom vertebrate rRNA database. The remaining reads were then assembled using Trinity (version r2013-02-25) and the assembled sequences are contained within.
Download trinity.fasta.gz (73.96 Mb)
Details View File Details
Title trinotate annotation report of spleen/buffy coat trinity transcripts
Downloaded 19 times
Description The Trinity transcripts (contained within spleen/buffy coat combined trinity assembly) were annotated using the Trinotate pipeline (part of the Trinity package version r2013-02-25). Briefly this pipeline searches the transcripts for open reading frames, BLASTs the resulting protein sequences to the NCBI's SwissProt database, and assigns GO terms off of the best hit. It also searches the transcripts for conserved PFAM domains.
Download 08-02-13 trinotate_annotation_report_copy.txt.gz (15.35 Mb)
Details View File Details
Title alignments of target protein sequences
Downloaded 7 times
Description The target gene open reading frame protein sequences (MHC, TLR, RLR, KoRV) from the Trinotate analysis were first aligned with eutherian, marsupial, and other vertebrate sequences using Jalview 2.8.0b1 and MUSCLE using default parameters. An intital neighbor joining tree was created (using MEGA 5.2.2) and used to guide the alignments of each individual gene clade via Jalview and MUSCLE (with the exception of MHC sequences which were aligned using HMMER 3.1b and PFAM seed alignments). These gene group alignments were then combined using COACH (version 11/21/2002 Linux) and additional sequences were added as needed. The ends of the alignments were trimmed, when necessary, and both the full and trimmed alignments (.fas format) are included in this file package. For more detailed methods please refer to the associated publication.
Download Target_gene_alignments.tar.gz (186.5 Kb)
Details View File Details
Title bootstrap trees (maximum likelihood and neighbor joining) for target protein sequences
Downloaded 4 times
Description The target protein alignments (MHC, TLR, RLR, KoRV) were used to build both maximum likelihood and neighbor joining trees using MEGA 5.2.2. The alignments and PROTEST 3.2 were used to determine the most appropriate substitution model to use while building the tree. For more detailed methods, please refer to the associated publication. Each tree is presented here in Newick format. The trees were run with 1,000 replicates and the branch lengths represent the bootstrap values. The accession numbers for the sequences used in these trees can be found in "Online Resource 1" of the associated publication.
Download Bootstrap_trees.tar.gz (4.045 Kb)
Details View File Details

When using this data, please cite the original publication:

Abts KC, Ivy JA, DeWoody JA (2015) Immunomics of the koala (Phascolarctos cinereus). Immunogenetics 67(5-6): 305-321.

Additionally, please cite the Dryad data package:

Abts KC, Ivy JA, DeWoody JA (2015) Data from: Immunomics of the koala (Phascolarctos cinereus ). Dryad Digital Repository.
Cite | Share
Download the data package citation in the following formats:
   RIS (compatible with EndNote, Reference Manager, ProCite, RefWorks)
   BibTex (compatible with BibDesk, LaTeX)

Search for data

Be part of Dryad

We encourage organizations to: