Data from: Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information
Data files
Jun 03, 2015 version files 316.13 MB
-
ecoli_paired_alignments.zip
177.36 MB
-
MDTP_MDTN.pdb
3.42 MB
-
METI_METQ.pdb
854.71 KB
-
PDB_benchmark_alignments_NDX.zip
47.53 MB
-
PDB_benchmark_alignments.zip
11.93 MB
-
PFLA_PFLB.pdb
1.28 MB
-
ribosome_50S_alignments.zip
72.65 MB
-
YIA_MNO.pdb
1.11 MB
Abstract
Do the amino acid sequence identities of residues that make contact across protein interfaces covary during evolution? If so, such covariance could be used to predict contacts across interfaces and assemble models of biological complexes. We find that residue pairs identified using a pseudo-likelihood based method to covary across protein-protein interfaces in the 50S ribosomal unit and 28 additional bacterial protein complexes with known structure are almost always in contact in the complex provided that the number of aligned sequences is greater than the average of the lengths of the two proteins. We use this method to make subunit contact predictions for an additional 36 protein complexes with unknown structures, and present models based on these predictions for the tripartite ATP-independent periplasmic (TRAP) transporter, the tripartite efflux system, the pyruvate formate lyase-activating enzyme complex, and the methionine ABC transporter.