Complex models of sequence evolution improve fit, but not gene tree discordance, for tetrapod mitogenomes
Data files
Mar 14, 2024 version files 20.78 MB
-
alignments.zip
962.44 KB
-
cloudforest_input.zip
18.55 MB
-
README.md
1.02 KB
-
summary_trees.zip
1.27 MB
Abstract
Variation in gene tree estimates is widely observed in empirical phylogenomic data and is often assumed to be the result of biological processes. However, a recent study using tetrapod mitochondrial genomes to control for biological sources of variation due to their haploid, uniparentally inherited, and non-recombining nature found that levels of discordance among mitochondrial gene trees were comparable to those found in studies that assume only biological sources of variation. Additionally, they found that several of the models of sequence evolution chosen to infer gene trees were doing an inadequate job of fitting the sequence data. These results indicated that significant amounts of gene tree discordance in empirical data may be due to poor fit of sequence evolution models and that more complex and biologically realistic models may be needed. To test how the fit of sequence evolution models relates to gene tree discordance, we analyzed the same mitochondrial datasets as the previous study using two additional, more complex models of sequence evolution that each model a different biologically realistic aspect of the evolutionary process: a covarion model to incorporate heterotachy, and a model partitioned model to incorporate variable evolutionary patterns by codon position. Our results show that both additional models fit the data better than the models used in the previous study, with the covarion being consistently and strongly preferred as tree size increases. However, even these more preferred models still inferred highly discordant mitochondrial gene trees, thus deepening the mystery around what we label the “Mito-Phylo Paradox” and leading us to ask whether the observed variation could be biological after all.
README: Complex models of sequence evolution improve fit, but not gene tree discordance, for tetrapod mitogenomes
https://doi.org/10.5061/dryad.rr4xgxdgd
This repository contains supplementary material for the associated manuscript, as well as all datasets used (folder: alignments), and example scripts for each analysis. The datasets provided were used in Bayesian MCMC, Stepping-Stone Marginal Likelihood Estimation, Non-Linear Dimensionality Reduction, and Bipartition-Covariance analysis.
Description of the data and file structure
The example RevBayes scripts provided can be used for both MCMC, as well as stepping-stone analyses. Non-linear Dimensionality Reduction and Bipartition-Covariance analysis were performed in CloudForest, and tutorials for each can be found on the CloudForest website.
Code/Software
All example scripts provided are for RevBayes, and additional details on setting up and running these analyses can be found on the RevBayes website.