Phylogenetic conflicts, combinability, and deep phylogenomics in plants
Smith, Stephen, University of Michigan–Ann Arbor
Walker, Joseph, University of Michigan–Ann Arbor
Brown, Joseph, University of Michigan–Ann Arbor
Hale, Nathanael, University of Cambridge
Published Jun 08, 2021 on Dryad.
Cite this dataset
Smith, Stephen; Walker, Joseph; Brown, Joseph; Hale, Nathanael (2021). Phylogenetic conflicts, combinability, and deep phylogenomics in plants [Dataset]. Dryad. https://doi.org/10.5061/dryad.8g68615
Studies have demonstrated that pervasive gene tree conflict underlies several important phylogenetic relationships where different species tree methods produce conflicting results. Here, we present a means of dissecting the phylogenetic signal for alternative resolutions within a dataset in order to resolve recalcitrant relationships and, importantly, identify what the dataset is unable to resolve. These procedures extend upon methods for isolating conflict and concordance involving specific candidate relationships and can be used to identify systematic error and disambiguate sources of conflict among species tree inference methods. We demonstrate these on a large phylogenomic plant dataset. Our results support the placement of Amborella as sister to the remaining extant angiosperms, Gnetales as sister to pines, and the monophyly of extant gymnosperms. Several other contentious relationships, including the resolution of relationships within the bryophytes and the eudicots, remain uncertain given the low number of supporting gene trees. To address whether concatenation of filtered genes amplified phylogenetic signal for relationships, we implemented a combinatorial heuristic to test combinability of genes. We found that nested conflicts limited the ability of data filtering methods to fully ameliorate conflicting signal amongst gene trees. These analyses confirmed that the underlying conflicting signal does not support broad concatenation of genes. Our approach provides a means of dissecting a specific dataset to address deep phylogenetic relationships while also identifying the inferential boundaries of the dataset.
Sequence files used for the analyses.
ML gene trees with SH values calculated.
Files from the combinability runs on the Wickett data.
National Science Foundation, Award: NSF DEB 1354048
National Science Foundation, Award: NSF AVATOL 1207915