PSI Structural Biology Knowledgebase


Related Articles
Microbiome: Expanding the Gut Gene Catalog
November 2014
Complex Search
September 2014
Repairing a Rift
September 2014
iTRAQing the Ubiquitinome
July 2014
Immunity: Clustering Immunoglobulins
June 2014
Mining Protein Dynamics
May 2014
Design and Discovery: Identifying New Enzymes and Metabolic Pathways
January 2014
Epigenetics: Tracing Histone Demethylase Inhibitors
December 2013
Cancer Networks: Predicting Catalytic Residues from 3D Protein Structures
November 2013
Protein-Nucleic Acid Interaction: Inhibition Through Allostery
July 2013
Infectious Diseases: Targeting Meningitis
May 2013
Protein Interaction Networks: Reading Between the Lines
April 2013
Design and Discovery: A Cocktail for Proteins Without ID
February 2013
Targeting Enzyme Function with Structural Genomics
July 2012
More in one
June 2012
Disordered Proteins
February 2012
RNA Chaperone NMB1681
July 2011
Capsid assembly in motion
April 2011
One at a time
April 2011
A growing family
February 2011
Predicting functions within a superfamily
January 2011
Isoxanthopterin Deaminase
November 2010
Scaling up mutational scanning
November 2010
Alpha/Beta Barrels
October 2010
Mre11 Nuclease
May 2010
Assigning protein function: GeMMA
April 2010
Face off
October 2009


Scaling up mutational scanning

SBKB [doi:10.1038/sbkb.2010.51]
Technical Highlight - November 2010
Short description: Mutating a protein's sequence is a useful way of uncovering functionally important residues. A large-scale method tracking up to 600,000 variants at once will speed up this analysis.

A highly parallel assay for exploring protein sequence-function relationships.

The amino acid sequence of a protein is enough to determine its structure and function, yet how the sequence alone conveys this information continues to elude scientists. Mutating residues and then looking for a change in function has been the traditional, and often effective, way of probing this relationship. Building a detailed functional map this way has often been laborious, but now Stanley Fields and colleagues, writing in Nature Methods, present a method that speeds up the process and produces large-scale sequence-function analyses.

Purifying proteins with individual mutations to study their effects is, thankfully, a thing of the past for most large-scale analyses. Instead, libraries of thousands to millions of protein variants can be generated and displayed on the surface of phage, yeast or bacteria. These displayed mutants can be assayed simultaneously for a particular activity or function. The bottleneck, however, was sequencing by the Sanger method, which allowed at most a few thousand variants (generally those with the highest activity) to be analyzed.

Now Fields and his team show how 600,000 protein variants can be followed at once. One important development was to use high-throughput DNA sequencing, with an Illumina paired-end approach. The other innovation was to apply only moderate selection pressure to the pool of variants. In this case, the group looked at the WW (two-tryptophan) domain of human YAP65 and displayed it on the surface of T7 bacteriophage, selecting variants that bound to the cognate peptide GTPPPPYTVG. Because moderate selection keeps partially functional variants in the pool rather than only the strongest binders, they were able to study a wider range of mutations.
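As a rough illustration of how sequencing reads taken before and after selection can be turned into a per-variant measure, the sketch below computes a log2 enrichment ratio (frequency after selection divided by frequency in the input library). This is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `enrichment_ratios`, the `min_input_count` threshold, and the three-letter toy sequences are all hypothetical.

```python
import math
from collections import Counter


def enrichment_ratios(input_reads, selected_reads, min_input_count=1):
    """Return log2(selected frequency / input frequency) for each variant.

    input_reads and selected_reads are lists of variant sequences, one per
    sequencing read. Variants that are rarer than min_input_count in the
    input, or absent after selection, are skipped (an assumption of this
    sketch, made to avoid taking log2 of zero).
    """
    input_counts = Counter(input_reads)
    selected_counts = Counter(selected_reads)
    n_input = sum(input_counts.values())
    n_selected = sum(selected_counts.values())

    ratios = {}
    for variant, count_in in input_counts.items():
        count_sel = selected_counts.get(variant, 0)
        if count_in < min_input_count or count_sel == 0:
            continue
        freq_in = count_in / n_input
        freq_sel = count_sel / n_selected
        ratios[variant] = math.log2(freq_sel / freq_in)
    return ratios


# Toy reads (hypothetical): "AAA" is enriched by selection, "AAB" is depleted.
input_reads = ["AAA"] * 4 + ["AAB"] * 4
selected_reads = ["AAA"] * 6 + ["AAB"] * 2
ratios = enrichment_ratios(input_reads, selected_reads)
for variant, score in sorted(ratios.items()):
    print(variant, round(score, 3))
```

A positive score means the variant became more common after selection and a negative score means it became rarer. Under moderate selection, weakly binding variants remain in the pool and so still receive a score.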

Looking at position-averaged effects of mutations, they identified a distinct region of the WW domain that could tolerate sequence variation. And, as expected, the two conserved tryptophans (WW) had to be maintained for ligand binding.
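One way to picture a position-averaged effect is to average the scores of all single-substitution variants at each position. The sketch below does this for a toy three-residue sequence. The averaging rule, the function name `position_averaged_effects`, the zero-based positions, and the scores are assumptions made for illustration, since the exact procedure is not described here.

```python
from collections import defaultdict
from statistics import mean


def position_averaged_effects(wild_type, scores):
    """Average the scores of single-substitution variants at each position.

    scores maps a variant sequence to a numeric score (for example a log2
    enrichment ratio). Variants with zero or several substitutions relative
    to wild_type, or with a different length, are ignored in this sketch.
    """
    per_position = defaultdict(list)
    for variant, score in scores.items():
        if len(variant) != len(wild_type):
            continue
        diffs = [i for i, (w, v) in enumerate(zip(wild_type, variant)) if w != v]
        if len(diffs) == 1:
            per_position[diffs[0]].append(score)
    return {pos: mean(vals) for pos, vals in sorted(per_position.items())}


# Toy data (hypothetical): the flanking W positions do not tolerate change,
# while the middle position does.
wild_type = "WEW"
scores = {
    "AEW": -3.0,
    "FEW": -2.0,
    "WAW": 0.5,
    "WDW": -0.5,
    "WEA": -4.0,
    "AAW": -5.0,  # double mutant, ignored
}
effects = position_averaged_effects(wild_type, scores)
print(effects)
```

Positions with strongly negative averages would correspond to residues that must be maintained, while averages near zero would mark positions that tolerate variation.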

This method, combining protein display, low-intensity selection and very accurate high-throughput sequencing, allows simultaneous study of the activity of hundreds of thousands of protein variants. Modifications could lead to the mapping of sequence features that, for example, confer resistance to antibiotics or anticancer drugs.

Maria Hodges


  1. D. M. Fowler, C. L. Araya, S. J. Fleishman, E. H. Kellogg, J. J. Stephany et al. High-resolution mapping of protein sequence-function relationships. Nat. Meth. 7, 741-746 (2010). doi:10.1038/nmeth.1492

Structural Biology Knowledgebase ISSN: 1758-1338
Funded by a grant from the National Institute of General Medical Sciences of the National Institutes of Health