PSI Structural Biology Knowledgebase

PSI | Structural Biology Knowledgebase
Header Icons
E-Collection

Related Articles
Signaling: A Platform for Opposing Functions
May 2015
Protein Folding and Misfolding: It's the Journey, Not the Destination
March 2015
Molecular Portraits of the Cell
February 2015
Nuclear Pore Complex: A Flexible Transporter
February 2015
Nuclear Pore Complex: Higher Resolution of Macromolecules
February 2015
Nuclear Pore Complex: Integrative Approach to Probe Nup133
February 2015
Piecing Together the Nuclear Pore Complex
February 2015
Updating ModBase
January 2015
Transmembrane Spans
December 2014
Mining Protein Dynamics
May 2014
Novel Proteins and Networks: Assigning Function
May 2014
Cancer Networks: Predicting Catalytic Residues from 3D Protein Structures
November 2013
The Immune System: A Brotherhood of Immunoglobulins
June 2013
The Immune System: Super Cytokines
June 2013
Infectious Diseases: Targeting Meningitis
May 2013
PDZ Domains
April 2013
Protein Interaction Networks: Adding Structure to Protein Networks
April 2013
Design and Discovery: Flexible Backbone Protein Redesign
February 2013
Pocket changes
July 2012
Predictive protein origami
July 2012
Refining protein structure prediction
March 2012
Metal mates
February 2012
Devil is in the details
January 2012
Playing while you work
November 2011
Docking and rolling
October 2011
Fit to serve
October 2011
Rosetta hone
July 2011
Structure from sequence
July 2011
An easier solution for symmetry
June 2011
Solutions in the solution
June 2011
Regulating nitrogen assimilation
January 2011
Guard cells pick up the SLAC
December 2010
Alpha/Beta Barrels
October 2010
Modeling RNA structures
May 2010
Deducing function from small structural clues
February 2010
Spot the pore
January 2010
Network coverage
November 2009
GPCR modeling: any good?
August 2009
Protein modeling made easy
July 2009
Model proteins in your lunch break
April 2009
Click for cancer-protein interactions
December 2008
Modeling with SAXS
October 2008
Designing activity
September 2008

Technology Topics Modeling

Cancer Networks: Predicting Catalytic Residues from 3D Protein Structures

SBKB [doi:10.1038/sbkb.2012.170]
Technical Highlight - November 2013
Short description: Identifying the structural features that can predict catalytic amino acids will enhance the functional assignment of unknown proteins in structural databases.

A best-case scenario of functional residue prediction. All six experimentally characterized catalytic residues in a thymidylate synthase (PDB 1LCB) were correctly predicted and are shown in red. Other residues predicted to be functional are in green. Figure courtesy of Andras Fiser.

The rapidly expanding number of structures emerging from structural genomics projects is far outpacing the rate of functional analysis. While the activities associated with new structures are sometimes evident from prior functional characterization of related proteins, 30% of the structures deposited in databases have no functional annotation, reinforcing the need for computational approaches to predict biological roles.

Sequence conservation is among the most powerful indicators of functional relevance, but its predictive power is limited by the extent of conservation and the number of related proteins identified. Because 75% of homologous proteins share less than 30% sequence identity, structural information is additionally required for reliable functional assignment. This type of “hybrid” approach would be strengthened by complementary methods that can identify functional residues within conserved regions. Toward that end, Fajardo and Fiser directly assessed the correlation of features used to distinguish functional residues from their nonfunctional counterparts in order to determine those that most reliably predict catalytic residues from sequence and structural data. This was accomplished by analyzing 439 structures of a training dataset and determining correlations between pairs of attributes ascribed to potential functional amino acids. The features analyzed, which included distance to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), sequence conservation and closeness, and other graph centrality measures, were then used to train neural networks to identify catalytic residues.

In agreement with previous reports, the authors found that sequence conservation displays the highest correlation with function, but additional parameters can be reliable guides to catalytic propensity. Both the distance of residues to the GCM and closeness could distinguish functional and nonfunctional residues; in contrast, RSA shows essentially no correlation to function. The best predictive performance was obtained from networks using distance to the GCM and amino acid type as inputs, and was optimal when residues were preselected based on sequence conservation. This approach out-performed structure-only prediction methods, and also compared favorably with currently employed sequence-based methods. The authors note that the rapidly changing composition of sequence databases requires that sequence conservation be regularly recalculated to ensure the usefulness of sequence profile-based methods. The expanded ability to annotate protein structures for which there are presently no known functions would appear to be worth this effort.

Beth Moorefield

References

  1. J.E. Fajardo and A. Fiser Protein structure based prediction of catalytic residues.
    BMC Bioinformatics. 14, 63 (2013). doi:10.1186/1471-2105-14-63

Structural Biology Knowledgebase ISSN: 1758-1338
Funded by a grant from the National Institute of General Medical Sciences of the National Institutes of Health