Data from: Effect of cellular nutrient economy on the evolution of genome size in phytoplankton
Data files
May 22, 2026 version files (1.20 MB)
- Data_S1.csv (33.82 KB)
- Data_S2.csv (12.24 KB)
- Data_S3.csv (8.30 KB)
- Data_S4.csv (844.39 KB)
- README_Data_S1.txt (2.36 KB)
- README_Data_S2.txt (2.34 KB)
- README_Data_S3.txt (1.87 KB)
- README_Data_S4.txt (1.87 KB)
- README.md (1.42 KB)
- references.pdf (232.93 KB)
- references.txt (56.76 KB)
Abstract
The origin of genome size variation remains a central question in evolutionary biology. While energetic costs have been proposed to influence genome size through selection on insertions and deletions (indels), nutrient availability may be a more relevant constraint in primary producers such as phytoplankton. We derived an expression for the selection coefficient of indels based on the phosphorus and nitrogen costs of nucleotides and the cellular nutrient requirements. Selection coefficient estimates indicate that natural selection dominates over genetic drift and favors the fixation of mutations that reduce genome size in phytoplankton with low nutrient requirements. Model predictions are supported by comparative genomics and metagenomic analyses. Altogether, this model provides a rigorous quantitative framework for understanding genome size evolution, particularly in small cells and oligotrophic environments, highlighting how nutrient limitation drives genome streamlining.
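To make the "phosphorus and nitrogen costs of nucleotides" concrete, the following minimal sketch counts the P and N atoms in a double-stranded indel. It relies only on standard nucleotide chemistry (one phosphate per nucleotide; 5, 5, 3 and 2 nitrogen atoms in A, G, C and T). The function names and the `gc_fraction` parameter are illustrative assumptions, and this is not the selection coefficient expression derived by the authors, which is not reproduced here.

```python
AVOGADRO = 6.02214076e23  # atoms per mole

# Nitrogen atoms per Watson-Crick base pair: A (5) + T (2) = 7, G (5) + C (3) = 8.
N_PER_AT_PAIR = 7
N_PER_GC_PAIR = 8
# One phosphate group per nucleotide, so two phosphorus atoms per base pair.
P_PER_PAIR = 2


def indel_nutrient_atoms(length_bp, gc_fraction):
    """Return (P atoms, N atoms) in a double-stranded indel of length_bp base pairs.

    gc_fraction is the assumed fraction of G:C pairs in the indel (hypothetical input).
    """
    if length_bp < 0 or not 0.0 <= gc_fraction <= 1.0:
        raise ValueError("length_bp must be >= 0 and gc_fraction within [0, 1]")
    p_atoms = P_PER_PAIR * length_bp
    n_atoms = length_bp * (
        N_PER_GC_PAIR * gc_fraction + N_PER_AT_PAIR * (1.0 - gc_fraction)
    )
    return p_atoms, n_atoms


def atoms_to_mol(atoms):
    """Convert a number of atoms to moles, e.g. to compare with a cellular quota."""
    return atoms / AVOGADRO
```

Converting to moles is only a convenience for comparing such a cost with a minimum cellular quota; the units of the quotas should be checked against the corresponding README file before any comparison.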
The dataset comprises four separate CSV files, each accompanied by a corresponding README file, together with the complete list of references in PDF and plain text formats.
Authors: Carlos Caceres, Marc Krasovec, Olivier Crispi, Sebastien Gourbiere, Gwenael Piganeau.
Date: 2026-05-05
Version: 1.0
Files and brief description:
- Data_S1.csv: measurements of genomic features for phytoplankton species.
- Data_S2.csv: minimum cellular quotas for phosphorus (QminP) and nitrogen (QminN) for phytoplankton taxa, and analyses in which they were used.
- Data_S3.csv: cell volume values for phytoplankton taxa, and analyses in which they were used.
- Data_S4.csv: indel polymorphisms, obtained from metagenomes, for six phytoplankton species belonging to three genera of Mamiellales.
- README_Data_S1.txt: description of the structure and variables of Data_S1.csv.
- README_Data_S2.txt: description of the structure and variables of Data_S2.csv.
- README_Data_S3.txt: description of the structure and variables of Data_S3.csv.
- README_Data_S4.txt: description of the structure and variables of Data_S4.csv.
- references.txt: complete list of references in plain text format, including sources from which data were obtained.
- references.pdf: complete list of references in PDF format, including sources from which data were obtained.
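
As a starting point for working with the files listed above, here is a minimal sketch that reads whichever of the four CSV files are present in a directory and reports their row counts and column names. It assumes comma-separated files with a header row and UTF-8 encoding; the actual structure and variables are documented in the README_Data_S*.txt files, and the function names `load_tables` and `summarize` are hypothetical.

```python
import csv
from pathlib import Path

DATA_FILES = ("Data_S1.csv", "Data_S2.csv", "Data_S3.csv", "Data_S4.csv")


def load_tables(data_dir="."):
    """Read each data file found in data_dir into a list of row dictionaries.

    Assumes a header row and comma delimiter; files that are missing are skipped.
    """
    tables = {}
    for name in DATA_FILES:
        path = Path(data_dir) / name
        if not path.is_file():
            continue
        # utf-8-sig also accepts plain UTF-8 and drops a byte-order mark if present.
        with path.open(newline="", encoding="utf-8-sig") as handle:
            tables[name] = list(csv.DictReader(handle))
    return tables


def summarize(tables):
    """Map each file name to (number of rows, list of column names)."""
    return {
        name: (len(rows), list(rows[0].keys()) if rows else [])
        for name, rows in tables.items()
    }


if __name__ == "__main__":
    for name, (n_rows, columns) in summarize(load_tables(".")).items():
        print(f"{name}: {n_rows} rows, columns: {columns}")
```

Values are kept as strings here; numeric columns would need explicit conversion once their names and units are confirmed from the README files.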
