Data from: Phylogenomic insights into the higher-level relationships within Culicomorpha (Diptera) revealed by whole-genome sequencing
Data files
Jun 03, 2026 version files 25.29 KB
-
Matrix1_LG_C20_F_R.treefile
2.47 KB
-
Matrix1-kpi_ASTRAL.treefile
1.90 KB
-
Matrix1-kpi_partitioning.treefile
2.47 KB
-
Matrix1-kpi_PMSF(H1.guide).treefile
2.48 KB
-
Matrix1-kpi_PMSF(H2.guide).treefile
2.48 KB
-
Matrix1-smart_ASTRAL.tre
1.75 KB
-
Matrix1-smart_partitioning.treefile
2.47 KB
-
Matrix1-smart_PMSF(H1.guide).treefile
2.48 KB
-
Matrix1-smart_PMSF(H2.guide).treefile
2.48 KB
-
Matrix2_partitioning.treefile
2.47 KB
-
README.md
1.85 KB
Abstract
The infraorder Culicomorpha is a monophyletic group within Diptera, comprising 8 families. However, the precise higher-level phylogenetic relationships between these families remain debatable. To clarify this, we utilized whole-genome sequencing to extract universal single-copy orthologues (USCOs) from all families from 46 species, including 32 transcriptome or genome databases downloaded from the NCBI database and 14 newly sequenced genome databases. By filtering the amino acid (AA) and nucleotide (NUC) sequences of USCOs from each species and evaluating their gene attributes, 2 amino acid matrices (Matrix1-kpi and Matrix1-smart) and 1 nucleotide matrix (Matrix2) were assembled. Then, based on the concatenation method under site-homogeneous and site-heterogeneous models, and the coalescent-based method under the MSC model, the phylogenetic relationships of this group were reconstructed. Additionally, we used topology tests to examine different hypotheses. The model evaluation revealed that the heterogeneous model, which accounts for variation in evolutionary rates across different genomic regions, proved to be superior to the homogeneous model for obtaining the observed topology, strongly supporting hypothesis H1, namely ((Chironomidae, Ceratopogonidae),((((Culicidae, Chaoboridae), Corethrellidae), Dixidae),(Thaumaleidae, Simuliidae))). Overall, our phylogenomic analysis based on this comprehensive data offers novel insights into the phylogeny of Culicomorpha.
https://doi.org/10.5061/dryad.3bk3j9kvs
Description of the data and file structure
A total of ten phylogenetic tree files were generated using four different matrices.
Files and variables
File: Matrix1_LG_C20_F_R.treefile
Description: The tree file was generated using Matrix1 under LG+C20+F+R MODEL.
File: Matrix1-kpi_ASTRAL.treefile
Description: The tree file was generated using Matrix1-kpi under ASTRAL MODEL.
File: Matrix1-kpi_partitioning.treefile
Description: The tree file was generated using Matrix1-kpi under partitioning MODEL.
File:the Matrix1-kpi_PMSF(H2.guide).treefile
Description: The tree file was generated using Matrix1-kpi under a partitioned site-homogeneous MODEL.
File: Matrix1-smart_ASTRAL.tre
Description: The tree file was generated using Matrix1-smart under ASTRAL MODEL.
File: Matrix1-smart_partitioning.treefile
Description: This one depicts the evolutionary relationships among dipteran species, with branch lengths indicating genetic divergence and node values representing clade support.
File: Matrix1-smart_PMSF(H1.guide).treefile
Description: The tree file was generated using Matrix1-smart under partitioning MODEL.
File:the Matrix1-kpi_PMSF(H1.guide).treefile
Description: The tree file was generated using Matrix1-kpi under MSC MODEL.
File: Matrix1-smart_PMSF(H2.guide).treefile
Description: The tree file was generated using Matrix1-smart under MSC MODEL.
File: Matrix2_partitioning.treefile
Description: The tree file was generated using Matrix1-smart under partitioned sisite-homogeneousODEL.
