Skip to main content
Dryad

Data from: AmpuBase: a transcriptome database for eight species of apple snails (Gastropoda: Ampullariidae)

Cite this dataset

Ip, Jack C. H. et al. (2019). Data from: AmpuBase: a transcriptome database for eight species of apple snails (Gastropoda: Ampullariidae) [Dataset]. Dryad. https://doi.org/10.5061/dryad.117cf

Abstract

Background: Gastropoda, with approximately 80,000 living species, is the largest class of Mollusca. Among gastropods, apple snails (family Ampullariidae) have members that are widely distributed in tropical and subtropical freshwater ecosystems and are ecologically and economically important. They exhibit various morphological and physiological adaptations to their respective habitats, which make them ideal candidates for studying adaptation, population divergence, speciation, and larger-scale patterns of diversity, including biogeography of native and invasive populations. The limited availability of genomic data, however, hinders in-depth ecological and evolutionary studies of these non-model organisms. Results: Using Illumina Hiseq platforms, we sequenced 1,220 million reads for seven species of apple snails. Together with the RNA-Seq data of two apple snails, we conducted de novo transcriptome assembly of eight species covering five genera of Ampullariidae, including representatives of the Old World and New World lineages. There were 20,730 to 35,828 unigenes with predicted open read frames for the eight species, with N50 (shortest sequence length at 50% of the unigenes) ranging from 1,320 to 1,803 bp. 69.7 % to 80.2 % of these unigenes were functionally annotated by searching against databases of NCBI’s non-redundant, Gene Ontology and Kyoto Encyclopaedia of Genes and Genomes. With these data we developed AmpuBase, a relational database that features online BLAST for DNA/protein sequences, keyword search for unigenes/functional terms, and download functions for sequences and whole transcriptomes. Conclusions: In summary, we have generated comprehensive transcriptome data for multiple ampullariid genera and species, and created a publicly accessible database with a user-friendly interface to facilitate future basic and applied studies on ampullariids, and comparative molecular studies with other invertebrates.

Usage notes