Repeat proteins composed of tandem arrays of a short structural motif often mediate protein-protein interactions. Past efforts to design repeat protein-based molecular recognition tools have focused on the creation of templates from the consensus of individual repeats, regardless of their natural context. Such an approach assumes that all repeats are essentially equivalent. In this study we present the results of a ‘module-based’ approach, in which modules composed of tandem repeats are aligned to identify repeat-specific features. Using this approach to analyze tetratricopeptide repeat modules that contain 3 tandem repeats (3TPRs), we identify two classes of 3TPR modules with distinct structural signatures that are correlated with different sets of functional residues. Our analyses also reveal a high degree of correlation between positions across the entire ligand-binding surface, indicative of a coordinated, coevolving binding surface. Extension of our analyses to different repeat protein modules reveals more examples of repeat-specific features, especially in armadillio repeat (ARM) modules. In summary, the module-based analyses that we present effectively capture key repeat-specific features that will be important to include in future repeat protein design templates.
展开▼