Data from: Major revisions in pancrustacean phylogeny through improved taxon sampling with recommendations for resolving remaining difficult nodes
Bernot, James (2022), Data from: Major revisions in pancrustacean phylogeny through improved taxon sampling with recommendations for resolving remaining difficult nodes, Dryad, Dataset, https://doi.org/10.5061/dryad.dr7sqvb2h
The clade Pancrustacea, comprising crustaceans and hexapods, is the most diverse group of animals on earth, comprising more than 80% of animal species, and has dominated much of the planet for over 500 million years. It has been the subject of several recent phylogenomic analyses, but despite multiple analyses of hundreds of genes, relationships within Pancrustacea show a notable lack of stability. In this study, the pancrustacean phylogeny is estimated with expanded taxon sampling, particularly of malacostracans, using 106 transcriptomes and genomes and a precise tree-based approach to orthology inference. We completed more than 50 independent tree searches on continually expanded taxon sets using ML, BI, and coalescent-based methods, and our results show that outgroup selection and changes in taxon sampling have a large impact on the topologies recovered. We show that small differences in taxon sampling have a large impact on ortholog identification and phylogenetic estimation. By analyzing only shared orthologs between two slightly different taxon sets, we show that the differences in the resulting species trees are due to the effects of taxon sampling on the phylogenetic estimation, not on ortholog identification. In our final taxon set, we identify 576 protein-coding orthologs comprising 121,508 amino acid positions. We compare trees resulting from our analyses with others from the literature to explore the large tree space of pancrustacean phylogenetic estimates and find that statistical topology tests of our dataset reject the previously published trees in favor of the ML trees produced here. Our results reject a number of clades including Caridoida, Eucarida, Multicrustacea, Vericrustacea, and Syncarida but support other classically recognized taxa, such as the Peracarida, and suggest some new clades. In contrast to the traditional Eucarida, we recover a novel relationship between decapods, euphausiids, and two clades of syncarids that we refer to as the Syneucarida. Additionally, with denser taxon sampling, we find Stomatopoda as sister to this clade. In recognition of these new relationships, we propose the name Stomatocaridea for Stomatopoda, Eucarida, and the two separate clades of “Syncarida”, which divides Malacostraca into three clades: Leptostraca, Peracarida, and Stomatocaridea. A new Bayesian divergence time estimation is conducted using 13 vetted fossils. We review our results in the context of other pancrustacean analyses and highlight key taxa and strategies for future phylogenetic studies.
Phylogenetic trees and alignments from analyses in this study.
National Science Foundation, Award: 2010898
National Science Foundation, Award: 1856679