Skip to main content
Dryad

Data from: Detection of implausible phylogenetic inferences using posterior predictive assessment of model fit

Cite this dataset

Brown, Jeremy M. (2014). Data from: Detection of implausible phylogenetic inferences using posterior predictive assessment of model fit [Dataset]. Dryad. https://doi.org/10.5061/dryad.nc866

Abstract

Systematic phylogenetic error caused by the simplifying assumptions made in models of molecular evolution may be impossible to avoid entirely when attempting to model evolution across massive, diverse datasets. However, not all deficiencies of inference models result in unreliable phylogenetic estimates. The field of phylogenetics lacks a direct method to identify cases where model specification adversely affects inferences. Posterior predictive simulation is a flexible and intuitive approach for assessing goodness-of-fit of the assumed model and priors in a Bayesian phylogenetic analysis. Here, I propose new test statistics for use in posterior predictive assessment of model fit. These test statistics compare phylogenetic inferences from posterior predictive datasets to inferences from the original data. A simulation study demonstrates the utility of these new statistics. The new tests reject the plausibility of inferred tree lengths or topologies more often when data/model combinations produce biased inferences. I also apply this approach to exemplar empirical datasets, highlighting the value of the novel assessments.

Usage notes