Skip to main content

Potential causes and consequences of rapid mitochondrial genome evolution in thermoacidophilic Galdieria (Rhodophyta)

Cite this dataset

Cho, Chung Hyun et al. (2020). Potential causes and consequences of rapid mitochondrial genome evolution in thermoacidophilic Galdieria (Rhodophyta) [Dataset]. Dryad.


The Cyanidiophyceae is an early-diverged red algal class that thrives in extreme conditions around acidic hot springs. Although this lineage has been highlighted as a model for understanding the biology of extremophilic eukaryotes, little is known about the molecular evolution of their mitochondrial genomes (mitogenomes).

To fill this knowledge gap, we sequenced five mitogenomes from representative clades of Cyanidiophyceae and identified two major groups, here referred to as Galdieria-type (G-type) and Cyanidium-type (C-type). G-type mitogenomes exhibit the following three features: (i) reduction in genome size and gene inventory, (ii) evolution of unique protein properties including charge, hydropathy, stability, amino acid composition, and protein size, and (iii) distinctive GC-content and skewness of nucleotides. Based on GC-skew-associated characteristics, we postulate that unidirectional DNA replication may have resulted in the rapid evolution of G-type mitogenomes.

The high divergence of G-type mitogenomes was likely driven by natural selection in the multiple extreme environments that Galdieria species inhabit combined with their highly flexible heterotrophic metabolism. We speculate that the interplay between mitogenome divergence and adaptation may help explain the dominance of Galdieria species in diverse extreme habitats.


Gene alignments were constructed using MAFFT version 7 with the default options. Using IQ-TREE with 1,000 ultrafast bootstrap replications, maximum likelihood (ML)-based phylogenetic analysis and bootstrap methods (MLB) were conducted. The model test option integrated in IQ-TREE automatically selected the evolutionary models.

            The ‘Independent-Samples Kruskal-Wallis Test’ was used to evaluate hypotheses. The null hypothesis was that the distribution of factor (e.g., amino acid composition) is identical among taxa from two ingroup clades (i.e., Cyanidium-type and Galdieria-type) and outgroup taxa. If the null hypothesis is rejected, pairwise comparisons among three groups were applied. Sixteen conserved mitochondrial genes (sdhC, atp4, atp6, atp8, atp9, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, nad6) that all encoded in mitogenomes of 13 species were chosen to test amino acid properties.

            The amino acid compositions between species of each gene were calculated using the CODEML program in the PAML4 package. The alignments of each gene and their ML trees were used for pairwise comparisons and all results were summarized in electronic supplementary material. Only conserved sites excluding any gaps were selected from the protein alignment of conserved genes for analysis. Total 3,845 sites were chosen from 4,825 sites of the concatenated protein alignment. The average proportions of each group’s 20 amino acids were measured, and values were visualized in bar plots with standard error bars.

            To compare protein properties, GRAVY (grand average of hydropathy), aliphatic index, and instability index were calculated by ProtParam implied in ExPASy, and the results were summarized in electronic supplementary material. The mean values of each feature were illustrated by a bar plot (‘ggplot2’ in R) with standard error bars.

            For the EGT genes and transit peptide prediction, nuclear genomes of G. sulphuraria 074W and Cz. merolae 10D were used as the reference genomes to find mitochondrion-derived nuclear-encoded genes. EGT genes were further verified based on homologous search by MMSeqs2 and KEGG pathway. Transit peptide sequences in EGT candidates were predicted by TargetP for their potential localizations.

Usage notes

Supplementary files: description of all files, alignment files (*.fasta, *.phy) and result files of phylogenetic analyses (*.treefile) used in figures are provided.



Collaborative Genome Program of the Korea Institute of Marine Science and Technology Promotion (KIMST), Ministry of Oceans and Fisheries (MOF), Award: 20180430

National Research Foundation of Korea, Award: NRF-2017R1A2B3001923

Rural Development Administration, Award: PJ01389003

National Institute of Food and Agriculture, Award: NJ01170

Ministry of Oceans and Fisheries, Award: Collaborative Genome Program of the KIMST (20180430)

Rural Development Administration, Award: Next-generation BioGreen21 Program [PJ01389003]

National Research Foundation of Korea, Award: NRF-2017R1A2B3001923

National Institute of Food and Agriculture, Award: Hatch grant [NJ01170]