Whole genome sequence of Paramecium endosymbiont, Chlorella sp. strain NIG-J1
Data files
Jan 05, 2026 version files 13.64 MB
Chlorella_sp_NIG-J1.zip (13.64 MB)
README.md (1.35 KB)
Abstract
Chlorella is a genus of unicellular green algae, and some species (strains) of Chlorella are known as endosymbionts of protists and animals. We performed whole genome sequencing of a Chlorella strain isolated from Paramecium bursaria for comparative transcriptome analyses. Long-read sequencing data generated with PacBio Sequel II were assembled using Canu and error-corrected using Pilon. The assembled contigs were used for gene prediction using BRAKER2. The data files included here are the BRAKER2 output: the nucleotide sequences of the predicted genes (Chlorella_sp_NIG-J1.cds), the amino acid sequences (Chlorella_sp_NIG-J1.pep), and the annotation file (Chlorella_sp_NIG-J1.gff3).
Dataset DOI: [10.5061/dryad.zkh1893nm](https://doi.org/10.5061/dryad.zkh1893nm)
Description of the data and file structure
Chlorella_sp_NIG-J1.zip contains the files listed below. The genome data and PacBio raw reads of Chlorella sp. strain NIG-J1 were deposited in DDBJ (BioProject: PRJDB19703, BioSample: SAMD00855393, Sequence Read Archive: DRR626671).
File contents
Chlorella_sp_NIG-J1.pep: Protein sequences predicted from genome sequences using the BRAKER2 pipeline.
Chlorella_sp_NIG-J1.gff3: Annotation of the assembled genome sequences in General Feature Format (GFF3), a standardized format for describing genomic features. Each feature line has 9 columns; column 1: Genome contig ID, column 2: Name of the software that generated the feature, column 3: Type of the feature, column 4: Genomic start of the feature, column 5: Genomic end of the feature, column 6: Score, column 7: Strand of the feature, column 8: Phase, column 9: Attributes, including the gene ID. The identifier (e.g., g1.t1) following "ID=" corresponds to the identifier following "ChlorellaJ1_" (e.g., ChlorellaJ1_g1.t1) in Chlorella_sp_NIG-J1.cds and Chlorella_sp_NIG-J1.pep.
Chlorella_sp_NIG-J1.cds: Coding DNA sequences predicted from genome sequences using the BRAKER2 pipeline.
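
As an illustration of the ID correspondence described for the .gff3 file, the following minimal Python sketch splits a GFF3 line into its 9 columns and builds the matching sequence name used in the .cds and .pep files. The example line is made up for illustration (the contig name, software name, coordinates, score, and the Parent attribute are assumptions, not values taken from the dataset), and the function names are hypothetical. It also assumes that attributes in column 9 are separated by semicolons, as is usual for GFF3.

```python
from typing import Dict, Iterable, Iterator, NamedTuple

# Prefix that the README describes for sequence names in the .cds and .pep files.
PREFIX = "ChlorellaJ1_"


class Feature(NamedTuple):
    contig: str      # column 1: genome contig ID
    source: str      # column 2: software that generated the feature
    type: str        # column 3: type of the feature
    start: int       # column 4: genomic start
    end: int         # column 5: genomic end
    score: str       # column 6: score ("." when absent)
    strand: str      # column 7: strand
    phase: str       # column 8: phase ("." when not applicable)
    attributes: Dict[str, str]  # column 9: key=value pairs such as ID=g1.t1


def parse_attributes(column9: str) -> Dict[str, str]:
    """Split 'ID=g1.t1;Parent=g1' into {'ID': 'g1.t1', 'Parent': 'g1'}."""
    attrs = {}
    for field in column9.strip().split(";"):
        if "=" in field:
            key, value = field.split("=", 1)
            attrs[key.strip()] = value.strip()
    return attrs


def parse_gff3_lines(lines: Iterable[str]) -> Iterator[Feature]:
    """Yield one Feature per 9-column line, skipping comments and blank lines."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 9:
            continue  # not a feature line
        yield Feature(cols[0], cols[1], cols[2], int(cols[3]), int(cols[4]),
                      cols[5], cols[6], cols[7], parse_attributes(cols[8]))


def sequence_name(feature_id: str) -> str:
    """Map a GFF3 ID (e.g. g1.t1) to the name used in the .cds/.pep files."""
    return PREFIX + feature_id


# Hypothetical feature line; every value here is illustrative only.
example = "contig_1\tAUGUSTUS\tmRNA\t100\t900\t0.95\t+\t.\tID=g1.t1;Parent=g1"

for feature in parse_gff3_lines([example]):
    print(sequence_name(feature.attributes["ID"]))  # prints ChlorellaJ1_g1.t1
```

To use this on the real annotation, the list `[example]` could be replaced with an open file handle for Chlorella_sp_NIG-J1.gff3 after unzipping the archive. Features without an ID attribute would need a guard before the lookup.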
