PSI Structural Biology Knowledgebase

PSI | Structural Biology Knowledgebase
Header Icons
E-Collection

Related Articles
Microbiome: Expanding the Gut Gene Catalog
November 2014
Complex Search
September 2014
Repairing a Rift
September 2014
iTRAQing the Ubiquitinome
July 2014
Immunity: Clustering Immunoglobulins
June 2014
Mining Protein Dynamics
May 2014
Design and Discovery: Identifying New Enzymes and Metabolic Pathways
January 2014
Epigenetics: Tracing Histone Demethylase Inhibitors
December 2013
Cancer Networks: Predicting Catalytic Residues from 3D Protein Structures
November 2013
Protein-Nucleic Acid Interaction: Inhibition Through Allostery
July 2013
Infectious Diseases: Targeting Meningitis
May 2013
Protein Interaction Networks: Reading Between the Lines
April 2013
Design and Discovery: A Cocktail for Proteins Without ID
February 2013
Targeting Enzyme Function with Structural Genomics
July 2012
More in one
June 2012
Disordered Proteins
February 2012
RNA Chaperone NMB1681
July 2011
Capsid assembly in motion
April 2011
One at a time
April 2011
A growing family
February 2011
Predicting functions within a superfamily
January 2011
Isoxanthopterin Deaminase
November 2010
Scaling up mutational scanning
November 2010
Alpha/Beta Barrels
October 2010
Mre11 Nuclease
May 2010
Assigning protein function: GeMMA
April 2010
Face off
October 2009

Technology Topics Annotation/Function

More in one

SBKB [doi:10.1038/sbkb.2011.83]
Technical Highlight - June 2012
Short description: FunTree integrates multiple kinds of data to address evolution of enzyme function in structurally defined superfamilies.

FunTree-generated tree for one superfamily. Figure provided by Nick Furnham.

In order to predict functions of newly identified enzymes or design new functions, an understanding of enzyme function evolution is necessary. This can be achieved by integrating data from structural genomics projects with literature curation and selected predictions. To date, however, this has not been realized on a sufficiently broad or detailed scale.

Now work funded by the Wellcome Trust from the PSI MCSG has yielded a pipeline for such integration and a resource, FunTree, which currently comprises data for 276 superfamilies that represent over 2 million sequences from UniProtKB.

In the FunTree pipeline, Furnham and colleagues account for possible enzyme function changes resulting from alterations in single or multiple domains. . Their workflow considers domains within superfamilies as structurally similar groups (SSGs), as well as multi-domain architecture (MDA). Starting with curated domain structure from CATH and MDA data from CATH-Gene3D, FunTree integrates mechanistic data from MACiE, sequence, Enzyme Commission (EC) number and taxonomic data from UniProtKB, Catalytic Site Atlas data and others for phylogenetic analysis. A subsequent metabolite analysis integrates small molecule and reaction information from KEGG.

FunTree output includes a sequence diversity summary, a similarity tree of small molecules involved, and EC number distribution. One can select SSGs to organize superfamily data by structural similarity, or MDA groups to view by overall domain composition. The next level has detailed information for SSGs and MDAs, including phylogenetic trees annotated with links to sequence, structure and mechanism data, which can be navigated via the Google Maps API.

The authors' analysis of trends across the 276 superfamilies revealed that most have few SSGs and MDAs, with a few notable exceptions. Most superfamilies have one or a few catalytic functions, as defined by different EC numbers. Exceptions here include one with 223 functions.

In addition to gaining insight into functional evolution, Furnham and colleagues envision the ability to input new sequences into FunTree to determine functional space. For now, the expansion of FunTree is being planned as more data are added to CATH/CATH-Gene3D.

Irene Kaganman

References

  1. N. Furnham et al. FunTree: a resource for exploring the functional evolution of structurally defined enzyme superfamilies.
    Nucleic Acids Res. 40, D776-D782 (2012). doi:10.1093/nar/gkr852

  2. N Furnham et al. Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies.
    PLoS Comp. Bio. 8 (2012). doi:10.1371/journal.pcbi.1002403

Structural Biology Knowledgebase ISSN: 1758-1338
Funded by a grant from the National Institute of General Medical Sciences of the National Institutes of Health