Data and code from: Phylogenomics and systematics of African Sesuvium (Aizoaceae)
Data files
May 26, 2026 version files 1.97 GB
-
00_original_fasta_files.zip
59.84 MB
-
01_final_homologs.zip
121.13 MB
-
02_final_orthologs.zip
390.32 MB
-
03_astral.zip
3.49 MB
-
04_phyparts.zip
2.37 MB
-
05_QS.zip
19.44 KB
-
06_Theta_ILS.zip
98.81 MB
-
07_Reticulation_index.zip
278.90 MB
-
08_gene_tree_estimation_error.zip
462.84 MB
-
09_condordance_factors.zip
292.71 MB
-
10_importance_decomposition.zip
101.71 KB
-
11_snp_analyses.zip
250.37 MB
-
12_cpDNA.zip
8.36 MB
-
README.md
4.72 KB
Abstract
Sesuvium (Sesuvioideae-Aizoaceae) comprises leaf succulent annual and perennial herbs distributed in coastal or saline sites of subtropical and tropical regions. Some species of the genus tolerate highly salinized or polluted soils and show soil-improving properties. Sesuvium shows a high variation in photosynthetic types, including and integrating C3, C4, and CAM photosynthesis. Previous molecular phylogenetic studies failed to resolve the infrageneric relationships of the genus. However, a better understanding of the systematics of this group is a prerequisite for further evolutionary studies. We explored the suitability of genome skimming, including mainly herbarium material, to produce a robust phylogeny of the presumably young African clade of Sesuvium. The approach generated an average of 6,289 orthologs per genome skimming sample and 30 complete to mostly complete plastomes. The coalescent-based species tree, as well as the plastome tree, shows stable resolution at the backbone of the African Sesuvium clade. An annual C3 species, here newly described, is sister to a C4 clade that is subdivided into two subclades, one comprising the annual S. hydaspicum (incl. S. nyasicum) and the other three perennial species, S. crithmoides, S. congense, and S. sesuvioides. Within each of these two subclades, high gene tree discordance was observed, which can be partly attributed to gene tree estimation error and, to a lesser extent, to ILS and hybridization.
Description of the data and file structure
DATA PACKAGE FROM KADEREIT ET AL
Phylogenomics and Systematics of African Sesuvium (Aizoaceae)
This package contains the data and software outputs (i.e., fasta files, alignments, trees, etc).
- Data
00_original_fasta_files.zip
-original_fasta_files.tar.gz - Original unaligned FASTA files (*.fa) of nuclear loci
01_final_homologs.zip
-original_fasta_files.tar.gz - unaligned FASTA files (*.fa)
-final_aln.tar.gz - OMM_MACSE-aligned and pxclsq-clean FASTA files (*.aln-cln)
-treefiles.tar.gz - IQtree tree files (*.treefile)
02_final_orthologs.zip
-original_fasta_files.tar.gz - unaligned FASTA files (*.fa)
-final_aln.tar.gz - OMM_MACSE-aligned and pxclsq-clean FASTA files (*.aln-cln)
-treefiles.tar.gz - IQtree tree files (*.treefile) with ultra-fast bootstrap (UFBoot) support
-treefiles_and_boots_npbs.tar.gz - IQtree tree files (*.treefile) with non-parametric bootstrap support and bootstrap trees replicates (*.boottrees)
03_astral.zip
-sesuvium_7138_orthologs.ASTRAL.tre - ASTRAL tree
-sesuvium_7138_orthologs.ASTRAL.tre.rr - ASTRAL tree rooted
-sesuvium_7138_orthologs.tre.gz - input file of all 7138 ortholog tree
Note: sesuvium_7138_orthologs.tre.gz is a plain text file, despite ending in the extension ".gz"
04_phyparts.zip
-Output files from the Phyparts discodance analyses
05_QS.zip
-Output files from the Quartet Sampling discodance analyses
06_Theta_ILS.zip
00_astral_brl_cu
-sesuvium_7138_orthologs.ASTRAL_brl.tre - ASTRAL tree with branch lengths in coalescent units
-sesuvium_7138_orthologs.ASTRAL_brl.tre.no_node_labels.rr - ASTRAL tree with branch lengths in coalescent units, rooted without node labels
-sesuvium_7138_orthologs.ASTRAL_brl.tre.rr - ASTRAL tree with branch lengths in coalescent units, rooted
-sesuvium_7138_orthologs.tre.gz - input file of all 7138 ortholog tree
01_ultrametric_tree/
-sesuvium_7138_ortho_concat.PAUP.nex.gz - Nexus file input for PAUP containing the concatenated alignment
-sesuvium_7138_orthologs.cln_0.85.ASTRAL5.7.1.ml.clock.paup.tre - ultrametric tree output from PAUP
02_theta
-input and output trees of theta calculation
07_Reticulation_index.zip
00_astral_boostrap_trees
-sesuvium_7138_orthologs.ASTRAL5.7.1.boottrees.tre - 100 ASTRAL bootstrap trees
-sesuvium_7138_orthologs.ASTRAL5.7.1.boottrees.tre.rr.fx.100.no_node_labels.tre - 100 ASTRAL bootstrap trees rooted without node labels
-sesuvium_7138_orthologs.ASTRAL5.7.1.boottrees.tre.rr.fx.100.tre - 100 ASTRAL bootstrap trees rooted
-sesuvium_7138_orthologs.tre.gz - input file of all 7138 ortholog tree
01_triplets
-input and output files from https://github.com/lmcai/Coalescent_simulation_and_gene_flow_detection pipeline
08_gene_tree_estimation_error.zip
00_pruned_sp_tree
- run_alisim.sh command file to run Alisim
- trees.tar.gz - pruned ASTRAL-III species trees matching the sample occurrence of each of the 7,138 ortholog alignments
01_simulations_with_pruned_sp_tree
-iqtree_files.tar.gz - IQtree file from Alisim
02_simulation_on_species_tree
-RAxML_* - RAxML files from the annotated tree
-run_raxml.sh - command to run RAxML
-sesuvium_713800_sim_gene_trees.tre.gz - output simulated trees (*.treefile)
-sesuvium_7138_orthologs.ASTRAL5.7.1_no_node_labels.tre.rr - ASTRAL tree reference tree for annotation
-simulation_inference.txt - simulation commands
-simulation_parameters.txt - parameter for simulation on each gene tree
09_condordance_factors.zip
-boottrees.tre.gz - bootstrap input trees
-concord* - output files from IQtree
-run_iqtree_gcf.sh - IQtree command
-sesuvium_7138_orthologs.ASTRAL5.7.1_no_node_labels.tre.rr - input ASTRAL tree
10_importance_decomposition.zip
-input and output files, and commands to run the importance decomposition analyses and correlations
11_snp_analyses.zip
01_bwa_mapping
-Commands (*.sh) for SNP calling with GATK
-Filtered final VCF files *.vcf.gz
02_dsuite
- Commands (*.sh) input and output files to run site pattern analyses with Dsuite
03_admixture
-Commands (*.sh) input and output files to run ADMIXTURE analyses.
04_PCA
-Commands (*.sh) input and output files to run PLINK for PCA analyses.
12_cpDNA.zip
00_tree_inference
-input and output files from IQtree inference of plastome data
01_QS
-command and output files from Quarter Sampling for the platome data.
If you have any questions about the data, please do not hesitate to contact Diego F. Morales-Briones at dfmoralesb@gmail.com
