Data from: Transcriptome resources for the frogs Lithobates clamitans and Pseudacris regilla, emphasizing antimicrobial peptides and conserved loci for phylogenetics

 

Files in this package

Content in the Dryad Digital Repository is offered "as is." By downloading files, you agree to the Dryad Terms of Service. To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this data (CC0; Open Data).

Title Annotation spreadsheet for transcriptome assembly of Lithobates clamitans
Downloaded 135 times
Description An Excel spreadsheet of contig sequence similarity for the Lithobates clamitans assembly. BLASTX hits to Xenopus tropicalis, HMMER matches to the Pfam-A database (for predicted ORFs), and other annotation information are given. The column headings are as follows:
  Contig: contig name
  Length (bp): length in base pairs
  %GC content: percent of contig sequence that is C or G
  Longest ORF (min 50 amino acids): the longest ORF identified on the contig that was at least 50 amino acids long
  Pfam (E-value threshold 0.1): a colon-delimited list of matched Pfam accessions, E-values, and domain descriptions
  Xenopus.tropicalis BLASTX E-value: E-value of the best BLASTX match to X. tropicalis proteins
  bit: bit score of this match
  id%: percentage identity (protein sequence) of this match to the query contig
  description: descriptive text in the FASTA header of the best match
  RefSeq ID: associated RefSeq ID for best match
  Entrez gene ID: associated Entrez gene ID for best match
  UniProt ID: associated UniProt ID for best match
Download README (4.407 Kb)
Download Lithobates.clamitans.annotation.xls (10.21 Mb)
Title Annotation spreadsheet for transcriptome assembly of Pseudacris regilla
Downloaded 100 times
Description A spreadsheet of contig sequence similarity for the Pseudacris regilla assembly. BLASTX hits to Xenopus tropicalis, HMMER matches to the Pfam-A database (for predicted ORFs), and other annotation information are given. The column headings are as follows:
  Contig: contig name
  Length (bp): length in base pairs
  %GC content: percent of contig sequence that is C or G
  Longest ORF (min 50 amino acids): the longest ORF identified on the contig that was at least 50 amino acids long
  Pfam (E-value threshold 0.1): a colon-delimited list of matched Pfam accessions, E-values, and domain descriptions
  Xenopus.tropicalis BLASTX E-value: E-value of the best BLASTX match to X. tropicalis proteins
  bit: bit score of this match
  id%: percentage identity (protein sequence) of this match to the query contig
  description: descriptive text in the FASTA header of the best match
  RefSeq ID: associated RefSeq ID for best match
  Entrez gene ID: associated Entrez gene ID for best match
  UniProt ID: associated UniProt ID for best match
Download Pseudacris.regilla.annotation.xls (10.61 Mb)
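The Length (bp) and %GC content columns in the two annotation spreadsheets are simple per-contig statistics. The following is a minimal sketch of how such values could be recomputed from a contig sequence. The function name gc_percent is hypothetical, and counting every character (including any ambiguity codes such as N) in the denominator is an assumption; the package does not document how ambiguous bases were handled.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of characters in seq that are G or C.

    Assumption: every character counts toward the length, including
    ambiguity codes such as N.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy contig, not taken from the data package.
contig = "ATGCGC"
print(len(contig), round(gc_percent(contig), 2))
```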
Title antimicrobial-peptide-clusters
Downloaded 16 times
Description A FASTA-formatted file containing aligned representative sequences for each cluster of antimicrobial peptide reads and/or contigs. The sequence headers correspond to the phylogeny in Figure 1.
Download antimicrobial-peptide-clusters.fasta (11.90 Kb)
Title antimicrobial-peptide-cluster-reads
Downloaded 10 times
Description The raw reads that mapped to each sequence cluster given in antimicrobial-peptide-clusters.fasta. This file provides the raw data underlying our clustering of antimicrobial peptide transcripts.
Download antimicrobial-peptide-cluster-reads.fas (105.1 Kb)
Title Rana-assembly
Downloaded 47 times
Download Rana-assembly.fasta (17.36 Mb)
Title 3-way-orthologs
Downloaded 14 times
Description A simple list of contigs from each of three transcriptome assemblies that were reciprocal-best TBLASTX matches. Each row represents a triplet of putatively orthologous sequences that may be useful for comparative genomics, primer design, etc.
Download 3-way-orthologs.xls (174.0 Kb)
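To illustrate what a reciprocal-best triplet means, here is a minimal sketch that assumes the best hit of each contig in each direction is already available as a dictionary (for example, parsed from tabular TBLASTX output). The function names and the toy contig identifiers are hypothetical, and the sketch requires all three pairwise comparisons to agree, which may be stricter or looser than the criteria the authors actually applied.

```python
def reciprocal_best(best_xy: dict, best_yx: dict) -> dict:
    """Keep pairs (x, y) where y is x's best hit and x is y's best hit."""
    return {x: y for x, y in best_xy.items() if best_yx.get(y) == x}


def ortholog_triplets(ab, ba, bc, cb, ac, ca):
    """Return (a, b, c) triplets that are reciprocal best hits in all
    three pairwise comparisons (A-B, B-C, and A-C)."""
    rbh_ab = reciprocal_best(ab, ba)
    rbh_bc = reciprocal_best(bc, cb)
    rbh_ac = reciprocal_best(ac, ca)
    triplets = []
    for a, b in sorted(rbh_ab.items()):
        c = rbh_bc.get(b)
        # Require the A-C comparison to confirm the same partner.
        if c is not None and rbh_ac.get(a) == c:
            triplets.append((a, b, c))
    return triplets


# Toy best-hit tables (hypothetical contig names).
ab = {"a1": "b1", "a2": "b2"}
ba = {"b1": "a1", "b2": "a3"}   # b2's best hit is a3, so a2/b2 is not reciprocal
bc = {"b1": "c1"}
cb = {"c1": "b1"}
ac = {"a1": "c1"}
ca = {"c1": "a1"}
print(ortholog_triplets(ab, ba, bc, cb, ac, ca))
```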
Title 3-way-aligned-orthologous-segments
Downloaded 14 times
Description This FASTA-formatted file is a series of 56 sequence alignments. Each alignment contains one sequence from each frog transcriptome studied. The sequences were aligned at the protein level using MUSCLE (Edgar 2004) and then converted to nucleotide alignments. Annotation information for each set of contigs can be found in the two annotation.xls spreadsheets. This file provides the expected amplicons from each reference sequence for the primers listed in conserved-primer-candidates-for-orthologous-segments.xls, for the purposes of guiding the selection of sequences that may be useful for a given population-genetic or phylogenetic study.
Download 3-way-aligned-orthologous-segments.fas (74.98 Kb)
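Converting a protein-level alignment to a nucleotide alignment is commonly done by threading each sequence's codons onto its aligned protein, so that every gap in the protein becomes a three-base gap. The sketch below shows that general idea only; it is not the authors' script, the function name thread_codons is hypothetical, and it assumes the coding sequence is in frame with exactly one codon per aligned residue.

```python
def thread_codons(aligned_protein: str, cds: str) -> str:
    """Map an in-frame coding sequence onto a gapped protein alignment.

    Each residue consumes the next codon; each '-' becomes '---'.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    n_residues = sum(1 for ch in aligned_protein if ch != "-")
    if n_residues != len(codons):
        raise ValueError("residue count does not match codon count")
    codon_iter = iter(codons)
    return "".join("---" if ch == "-" else next(codon_iter)
                   for ch in aligned_protein)


# Toy example: a two-residue protein aligned with one internal gap.
print(thread_codons("M-K", "ATGAAA"))
```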
Title conserved-primer-candidates-for-orthologous-segments
Downloaded 12 times
Description A spreadsheet containing sets of forward and reverse PCR primers in successive rows, for each of the 56 segments in 3-way-aligned-orthologous-segments.fas. The primers were predicted with BatchPrimer3 (You et al. 2008), and the principal output, such as expected product size and Tm, is included. We also include the estimated dN/dS ratio for each pairwise comparison of the three frog transcriptomes, as an index of the overall conservation of each set of predicted cDNAs. The primer sets have not been systematically evaluated on either cDNA or genomic DNA, and may or may not bridge intronic sequence.
Download conserved-primer-candidates-for-orthologo...ts.xls (33.79 Kb)
Title Summary-figure-of-nucleotide-distance-conserved-regions
Downloaded 35 times
Description This figure summarizes the nucleotide distances from the spreadsheet "conserved-primer-candidates-for-orthologous-segments.xls" in order to better guide the selection of loci for molecular phylogenetics. Loci with greater nucleotide distances may be more informative for analysis of closely related species.
Download Summary-figure-of-nucleotide-distance-con...ns.pdf (54.99 Kb)
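The description does not say which nucleotide distance measure was used. As one simple illustration of the concept, the sketch below computes an uncorrected p-distance (the proportion of differing sites) between two aligned sequences, skipping any column where either sequence has a gap. The function name p_distance and the gap handling are assumptions made for this example, not the authors' method.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences,
    ignoring columns where either sequence has a gap ('-')."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    diffs = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a != b:
            diffs += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return diffs / compared


# Toy alignment: six comparable sites, one of which differs.
print(p_distance("ATG---AAA", "ATGCCCAAT"))
```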
Title Conserved signal peptides of AMPs
Downloaded 43 times
Description Sequence logos of the highly conserved signal peptide and acidic propiece region for each species, based on aligned cluster sequences. Standard amino-acid symbols are used, with dark font representing more acidic residues. The predicted cleavage site is C-terminal of the conserved cysteine residue. Position 24 of the P. regilla alignment is blank because the majority of sequences had a gap at this alignment position.
Download Conserved signal peptides of AMPs.pdf (157.7 Kb)

When using this data, please cite the original publication:

Robertson LS, Cornman RS (2014) Transcriptome resources for the frogs Lithobates clamitans and Pseudacris regilla, emphasizing antimicrobial peptides and conserved loci for phylogenetics. Molecular Ecology Resources 14(1): 178-183. http://dx.doi.org/10.1111/1755-0998.12164

Additionally, please cite the Dryad data package:

Robertson LS, Cornman RS (2013) Data from: Transcriptome resources for the frogs Lithobates clamitans and Pseudacris regilla, emphasizing antimicrobial peptides and conserved loci for phylogenetics. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.j6676