Dryad

Epistasis plays a limited role in driving entrenchment during neutral protein evolution

Data files

Jun 05, 2026 version files 447.98 KB


Abstract

This dataset contains curated orthologous domain (“orthodomain”) multiple sequence alignments generated for large-scale analyses of protein evolution, epistasis, and evolutionary constraint. The alignments were derived from the protein domain family datasets analyzed in de la Paz et al. For each domain family, human domain sequences were identified by BLAST searches against the human protein sequences in the UCSC 100-vertebrate alignments. The corresponding vertebrate alignments were then trimmed to retain only the regions homologous to each query domain, preserving the complete domain length. Each resulting orthodomain alignment is a set of homologous vertebrate sequences for a single protein domain. The alignments were used to investigate evolutionary dynamics, sequence constraints, and epistatic interactions across proteins.
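
The trimming step described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `trim_alignment_to_domain`, the dictionary-based alignment representation, and the toy sequences are all hypothetical. It assumes the domain's start and end positions in the ungapped human sequence are already known (for example, from a BLAST hit) and keeps every alignment column between them, so the full domain length is preserved.

```python
def trim_alignment_to_domain(alignment, ref_id, start, end):
    """Keep only the alignment columns spanning a domain in the reference.

    alignment: dict mapping sequence id -> gapped sequence (equal lengths)
    ref_id:    id of the reference sequence (hypothetically, the human one)
    start/end: 0-based, half-open residue coordinates in the UNGAPPED
               reference sequence (assumed to come from a BLAST hit)
    """
    if len({len(seq) for seq in alignment.values()}) != 1:
        raise ValueError("all aligned sequences must have the same length")

    ref = alignment[ref_id]
    # Map each ungapped reference residue index to its alignment column.
    residue_columns = [i for i, ch in enumerate(ref) if ch != "-"]
    if not (0 <= start < end <= len(residue_columns)):
        raise ValueError("domain coordinates fall outside the reference")

    first = residue_columns[start]
    last = residue_columns[end - 1]
    # Slice every sequence to the same column range, gaps included.
    return {sid: seq[first:last + 1] for sid, seq in alignment.items()}


# Toy alignment (made-up sequences, for illustration only).
toy = {
    "human": "MK-TAYIAK",
    "mouse": "MKQTAYLAK",
    "chick": "-KQSAY-AK",
}
trimmed = trim_alignment_to_domain(toy, "human", start=2, end=6)
for name, seq in trimmed.items():
    print(name, seq)
```

Columns where the reference has a gap inside the domain are retained, so insertions in other species relative to the human sequence are not dropped; whether the original analysis handled such columns this way is not stated in the abstract.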