Skip to main content
Dryad

Data from: In vivo functional phenotypes from a computational epistatic model of evolution

Abstract

Computational models of evolution are valuable for understanding the dynamics of sequence variation, to infer phylogenetic relationships or potential evolutionary pathways, and for biomedical and industrial applications. Despite these benefits, few have validated their propensities to generate outputs with in vivo functionality, which would enhance their value as accurate and interpretable evolutionary algorithms. Utilizing the Hamiltonian of the joint probability of sequences in the family as fitness metric, we sampled and experimentally tested for in vivo beta-lactamase activity in E. coli TEM-1 variants.  These variants retain family-like functionality while being more active than their WT predecessor. We found that depending on the inference method used to generate the epistatic constraints, different parameters simulate diverse selection strengths. Under weaker selection, local Hamiltonian fluctuations reliably predict relative changes to variant fitness, recapitulating neutral evolution. In this dataset, we include input datasets, simulation trajectories as well as experimental data to support the publication: "In vivo functional phenotypes from a computationa epistatic model of evolution".