FASEB J. Cell Migration Consortium
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by FETROW, J. S.
Right arrow Articles by SKOLNICK, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by FETROW, J. S.
Right arrow Articles by SKOLNICK, J.
(The FASEB Journal. 1999;13:1866-1874.)
© 1999 FASEB

Structure-based functional motif identifies a potential disulfide oxidoreductase active site in the serine/threonine protein phosphatase-1 subfamily

JACQUELYN S. FETROW1, NAOMI SIEW and JEFFREY SKOLNICK

Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA

1Correspondence: GeneFormatics, Inc., 5830 Oberlin Dr., Ste. 200, San Diego, CA 92121, USA. E-mail: jacque{at}geneformatics.com


   ABSTRACT
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
In previous work, 3-dimensional descriptors of protein function (`fuzzy functional forms') were used to identify disulfide oxidoreductase active sites in high-resolution protein structures. During this analysis, a potential disulfide oxidoreductase active site in the serine/threonine protein phosphatase-1 (PP1) crystal structure was discovered. In PP1, the potential redox active site is located in close proximity to the phosphatase active site. This result is interesting in view of literature suggesting that serine/threonine phosphatases could be subject to redox control mechanisms within the cell; however, the actual source of this control is unknown. Additional analysis presented here shows that the putative oxidoreductase active site is highly conserved in the serine/threonine phosphatase-1 subfamily, but not in the serine/threonine phosphatase-2A or -2B subfamilies. These results demonstrate the significant advantages of using structure-based motifs for protein functional site identification. First, a putative disulfide oxidoreductase active site has been identified in serine-threonine phosphatases using a descriptor built from the glutaredoxin/thioredoxin family, proteins that have no apparent evolutionary relationship whatsoever to the PP1 proteins. Second, the proximity of the putative disulfide oxidoreductase active site to the phosphatase active site provides evidence toward a regulatory control mechanism. No sequence-based method could provide either piece of information.—Fetrow, J. S., Siew, N., Skolnick, J. Structure-based functional motif identifies a potential disulfide oxidoreductase active site in the serine/threonine protein phosphatase-1 subfamily.


Key Words: functional genomics • function prediction • structural genomics • structure-based function annotation • fuzzy functional form


   INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
ANNOTATION OF PROTEIN function in the genome sequence databases is becoming increasingly important. In the current era of functional genomics, a protein is often assigned the function of another protein if there is significant sequence identity between their two sequences. Several implicit assumptions underlie this annotation method. First, to assume that two proteins have a similar function, one must assume they are orthologously related, as described by Fitch (1) . Although the sequence similarity method for annotation is often used in today's genomic databases, the orthologous relationship between the two sequences is rarely proved. Second, the sequence similarity method for function annotation implicitly assumes that protein function can be adequately described by a single term such as aldolase or reductase or even serine/threonine phosphatase. This simple descriptor actually glosses over the many levels of function found in a cell. For example, for an enzyme, these levels range from the catalytic activity to the substrate specificity, to the binding of cofactors or modulators, to the interaction of the protein with other proteins or macromolecules present in the cell, to the enzyme's role in one or more pathways. A single term such as `aldolase' cannot describe the richness of phenomena associated with a given protein.

To rectify both shortcomings, we have developed a structure-based function identification method (2 3 4) . We use a geometric and residue-based descriptor of the active site associated with a given biochemical function (termed a `fuzzy functional form' or FFF2 ).If a protein structure satisfies this descriptor, then the protein is predicted to have the function of interest. Thus, we do not require any information about the evolutionary relationship of the protein of interest to other proteins that have the same function. Rather, the biochemical function is established based on the physical and chemical properties of the active site itself. Another advantage of using a structural model is that, in principle, one could address all the levels of function described above; in practice, however, only some levels have so far been addressed. Nevertheless, even at the present state of the art, if one finds two active sites ascribed to different biochemical functions to be in close spatial proximity, and if one of the functions has been suggested experimentally to exert control over the other, then knowledge of their spatial proximity would make a very strong circumstantial case for such control. When the relevant residues are far apart in sequence, no sequence-based approach can provide this type of information. On the other hand, a clear disadvantage of our approach is that the protein's structure needs to be known to apply the structural descriptors. However, we have shown that the descriptors can be applied not only to high-resolution X-ray and nuclear magnetic resonance structures, but also to inexact models produced by state-of-the-art threading, homology modeling, and ab initio folding programs (2) .

We have previously built a FFF for the thiol-disulfide oxidoreductase active site of the glutaredoxin/thioredoxin protein family (2 , 3) . Application of this FFF to a set of high-resolution structures from the Brookhaven Protein Data bank (5) yielded all known glutaredoxins, thioredoxins, and disulfide isomerase proteins. To our surprise, it also yielded the protein 1fjm (6) , a member of the serine/threonine phosphatase protein family. Subsequent analysis presented here shows that this protein might indeed have a disulfide oxidoreductase active site, suggesting a mechanism of redox control for these phosphatases. Identification of a putative regulatory active site in a well-studied protein whose structure was solved over three years ago demonstrates the multi-faceted nature of function in proteins and emphasizes the need for structural descriptors of protein function at all levels.


   MATERIALS AND METHODS
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
Sequence alignment and multiple sequence alignment
The proteins PP1 HUMAN, PP2A HUMAN, and PP2B HUMAN from the SwissProt database (7) were used as input sequences to the Web version of the Psi-BLAST program (8) . Default parameters were used and each of the three searches was done for one iteration only. From the resulting lists of serine/threonine phosphatases, those annotated as PP1, PP2A, or PP2B in SwissProt were selected for further analysis. To limit the list, we did not select those sequences labeled as `probable' or `putative' serine/threonine phosphatases. Lists of 31, 23, and 16 sequences annotated as belonging to the PP1, PP2A, and PP2B subfamilies, respectively, were created in this fashion. For each list, a multiple sequence alignment was created using the program Pileup (Wisconsin GCG sequence analysis software package, v9.1). Default parameters were used.

Sequence variability
Sequence variability was calculated using a program kindly provided by Dr. Peter Shenkin. This program takes the Pileup alignments as input and calculates the residue entropy and variability for each aligned residue in the sequence, as described previously (9) .

Cluster analysis
The 70 serine/threonine phosphatases, identified from the Psi-BLAST search described above, were aligned using the program Pileup. From the multiple alignment, a pairwise distance table was created using the program Distances. The Growtree program was used to create the tree figure from the Distances table. All programs are found in the Wisconsin GCG sequence analysis software package, v9.1. Default parameters were used.

Construction of a FFF for the thiol-disulfide oxidoreductase active site
By design, FFFs are meant to be `fuzzy' so that they can be applied to the inexact structures produced by current protein modeling techniques. Thus, they are built using only the alpha carbon coordinates of residues, that have been shown to be important. In the thiol-disulfide oxidoreductases, two cysteine residues are apparently critical for the complete functioning of the protein (10 11 12 13) . A cis-proline that is close in space, but not in sequence, is also structurally conserved in this diverse family (14) . An example of the active site in Escherichia coli thioredoxin (2trx; ref 15 ) is shown in Fig. 1 A. The FFF for the thiol-disulfide oxidoreductase active site of the glutaredoxin/thioredoxin family was built from the relative alpha carbon positions of these three active site residues (2 , 3) . The FFF also contains residues located on either side of these active site residues to create a structural motif of nine residues (4) . Use of these adjacent residues to describe the active site geometry specifies the location of the beta carbon of each of the three active site residues (16) . The FFF is described by the average distances between the alpha carbons of these nine residues plus or minus a small variance.



View larger version (49K):
[in this window]
[in a new window]
 
Figure 1. Comparison of the known thiol-disulfide oxidoreductase active site in thioredoxin and the putative oxidoreductase site in serine/threonine phosphatase-1. A) Structure of a representative active site from 2trx (15) , a thioredoxin from E. coli. B) The structure of the active site residues identified by the FFF in 1fjm (6) . C) The relative location of the putative thiol-disulfide oxidoreductase active site and the phosphatase active site identified from the 1fjm crystal structure (6) . In all three panels, the protein backbone appears as a white ribbon. The two cysteine and proline side chains are shown as black ball-and-stick models. The helix containing the two cysteines and the upside-down V-shaped structure containing the proline are colored as a gray ribbon. In panel C, the phosphatase active site histidine (His 125) is also shown as a black ball-and-stick model. Microcystin, a toxin that binds to the serine/threonine phosphatase-1 active site and was crystallized with the protein in the 1fjm structure, is shown in the active site as a black ball-and-stick figure.


   RESULTS
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
Application of the thiol-disulfide oxidoreductase FFF to experimentally determined protein structures
A set of 1501 high-resolution, single-chain proteins from the Brookhaven Protein Database (5) was screened for the thiol-disulfide oxidoreductase active site using the FFF described above. These 1501 proteins had less than 50% pairwise sequence identity among them. The FFF correctly identified eight glutaredoxins, thioredoxins, and disulfide isomerase proteins found in the set of 1501 proteins. It did not identify any other protein except 1fjm, a mammalian serine/threonine protein phosphatase-1 (PP1) (6) . Cys 155, Cys 158, and Pro 192 were identified as potential active-site residues in this phosphatase. Initially, this result was surprising, until the residues identified by the FFF in the serine/threonine phosphatase were analyzed.

Identification of a potential thiol-disulfide oxidoreductase active site in the serine/threonine phosphatase-1 subfamily
The thiol-disulfide oxidoreductase active site in the glutaredoxin/thioredoxin family exhibits some identifying characteristics (Fig. 1A ). The two active site cysteines are found at the amino terminus of an {alpha}-helix. The side chains of the two cysteines are found on one face of the helix and lie parallel to the conserved proline side chain. The proline is found in an upside-down V-shaped structure, where the proline is at the vertex of the V. In the eight structures belonging to the glutaredoxin/thioredoxin family in our database of 1501 proteins, the proline is always found in the cis configuration. None of these characteristics was encoded into the FFF, which is a simple structural descriptor that only uses distances between alpha carbons to define the active site.

Analysis of the putative thiol-disulfide oxidoreductase active site residues identified by the FFF in 1fjm (6) shows many of these same characteristics (Fig. 1B ). The two putative active site cysteines in 1fjm are found at a helix terminus. The proline is at the vertex of an upside-down V and the proline is in the cis configuration. The root mean square difference between the backbone atoms of the proline and the two residues on either side of it in 1fjm and 2trx (E. coli thioredoxin) is 0.58 Å.

There are two distinct structural differences between the potential redox site in serine/threonine phosphatase-1 (1fjm) and the actual disulfide oxidoreductase active site in thioredoxin (2trx). The first is that the putative active site in 1fjm is found at the carboxyl terminus of a helix, whereas the sites in the glutaredoxin/thioredoxin family are found at a helix amino terminus (Fig. 1) . Although the helix dipole has been suggested to be important for biological activity (13) , this contention is disputed (10) . Of course, the specific biological activity could be somewhat different in the serine/threonine phosphatases. The second difference between the active site structures is in the orientation of the helix relative to the proline. If the vertex of the V-shaped proline structure is oriented so that it is pointing up, then the cysteine side chains hang down from one face of the helix to lie parallel to the proline side chain in the glutaredoxin/thioredoxin structures (Fig. 1A ). In 1fjm, on the other hand, the cysteines point up from the face of the helix to lie next to the proline side chain when the vertex of the proline V is pointing up (Fig. 1B) . It should be noted that the cysteine side chains lie in virtually identical positions relative to the cis-proline, despite the orientation of the helix. However, the change in orientation causes the cysteines to be at the protein surface in the serine/threonine phosphatase-1, whereas they are somewhat more buried in the glutaredoxin and thioredoxin structures.

Structural analysis provides some evidence that the site identified in the serine/threonine phosphatase-1 1fjm might indeed be a thiol-disulfide oxidoreductase active site. Note that although the 1fjm structure has been solved (6) , the serine/threonine phosphatase-1 subfamily has been well studied (reviewed in refs 17 18 19 20 ) and reactive sulfhydryl groups are known to be present (21 , 22) , the actual disulfide oxidoreductase site in the PP1 subfamily has not been previously identified.

Analysis of the serine/threonine phosphatase sequences
To further investigate the putative redox site in 1fjm, we analyzed the serine/threonine phosphatase sequences. This family of proteins is divided into four subfamilies: PP1, of which 1fjm is a member; PP2A; PP2B (of which calcineurin is a member); and PP2C (19) . PP2C appears to be sequentially unrelated to PP1, PP2A, and PP2B, although members of this subfamily catalyze a similar reaction (19) and structural comparison suggests a similar catalytic reaction mechanism (23) . Members of subfamilies PP1, PP2A, and PP2B exhibit significant sequence similarity (Fig. 2 ). On average, there is 49% sequence identity between the catalytic domains of PP1 and PP2A sequences. Likewise, there is an average 40% sequence identity between catalytic domains of PP1 and PP2B sequences (6) . Although the sequences of PP1, PP2A, and PP2B are very similar, the proteins in these subfamilies differ in their substrate specificities and interactions with regulatory molecules.



View larger version (67K):
[in this window]
[in a new window]
 
Figure 2. Multiple sequence alignment of three representatives each of the phosphatase-1 (first three sequences), -2A (second three sequences), and -2B (third three sequences) subfamilies. The histidine residue found at the phosphatase active site is shown in boldface; the putative thiol-disulfide oxidoreductase residues are in boldface and italicized. The putative thiol-disulfide oxidoreductase active site is found only in the phosphatase-1 subfamily. The residue numbers correspond to the numbering of the 1fjm sequence in the Brookhaven database. Representative sequences were taken from Psi-BLAST searches of the SwissProt database. Multiple sequence analysis was done using the Pileup program (using its standard parameters) in the Wisconsin GCG sequence analysis software package. Amino- and carboxy-terminal sequences with extensive gaps were deleted from the figure.

To analyze the serine/threonine phosphatase subfamilies, we first used the PP1A HUMAN sequence (human serine/threonine phosphatase-1) as the probe sequence in a Psi-BLAST search (8) of the SwissProt sequence database (7) . This search revealed 31 sequences that are explicitly annotated as serine/threonine phosphatase-1 proteins. (To limit the search, sequences annotated as possible or putative PP1s were not considered.) A multiple sequence alignment of this subfamily shows that the putative redox active site cysteines and proline are almost invariant in this subfamily (Fig. 2 , Table 1 ). This initial analysis showed that one of the 31 sequences (P20604) contained none of the three residues. (Subsequent analysis shows that this sequence is unlikely to be a PP1 sequence; see below.) Of the remaining 30 sequences, the second cysteine and the proline are invariant. The first cysteine is conserved in 26 sequences and is replaced by a threonine in three sequences and by a serine in one sequence. A residue conservation analysis (9) shows that these three residues are better conserved than many residues in the family overall (Fig. 3 A). Such strong conservation suggests that these residues might indeed be important for function or structure of the proteins in this subfamily.


View this table:
[in this window]
[in a new window]
 
Table 1. A summary of the serine/threonine protein sequences and the residues that align to the putative disulfide oxidoreductase active sitea



View larger version (55K):
[in this window]
[in a new window]
 
Figure 3. Residue variability (k*) is plotted vs. the number of residues in the sequence for the PP1 (A), PP2A (B), and PP2B (C) subfamilies. A k* value of 1 indicates that the residue is invariant, while a higher value shows that more than one residue type occurs at a given position (9) . The positions of the putative disulfide oxidoreductase active site residues (Cys 155, Cys 158, and Pro 192) are shown as green bars. All three are conserved in the PP1 subfamily, but not the PP2A or PP2B subfamilies. The arginine and lysine residues (Arg 43 and Arg 191 in 1fjm) that have been identified as near the redox active site are shown as red bars. The glutamic and aspartic acid residues (Asp 154 and Glu 44 in 1fjm) near the putative site are shown as blue bars. The cysteine located close to the site (Cys 39 in 1fjm) is shown as a yellow bar. The residue numbering along the x axis corresponds to that found in the 1fjm protein sequence.

We also performed two other Psi-BLAST searches on the SwissProt database, using human serine/threonine phosphatase-2A (PP2A HUMAN) and a serine/threonine phosphatase-2B (PP2B HUMAN) as the probe sequences. These searches found 23 sequences explicitly annotated as serine/threonine phosphatase-2A and 16 sequences annotated as serine/threonine phosphatase-2B. (Again, proteins labeled as putative or `possible' were not considered.) Multiple alignments were created for each of the three sequence sets, and these alignments show that the putative oxidoreductase active site cysteines and proline are not conserved in the PP2A or PP2B families (Fig. 2) . Residue conservation analysis shows that the proline is rather unconserved in the PP2A subfamily and both cysteines are rather unconserved in the PP2B subfamily (Fig. 3) .

The above observations suggested some covariation among the putative redox active site cysteines and proline. Therefore, we performed a cluster analysis of the multiple sequence alignment created from the 70 PP1, PP2A, and PP2B sequences and created a tree diagram from the pairwise distances. As expected, the PP1, PP2A, and PP2B subfamilies are clearly separated in the tree (Fig. 4 ); however, the tree exhibited some interesting results. First, the one sequence (P20604) labeled as a PP1, but which did not contain either of the cysteines or the proline, does not belong in the PP1 subfamily, but rather belongs in the PP2A subfamily (red bar in Fig. 4 ). This strongly suggests that this sequence is incorrectly annotated in the SwissProt database. The second interesting result is that two of the four sequences that contain a threonine rather than the first cysteine in the PP1 sequences fall between the PP1 and PP2A subfamilies (cyan bars in Fig. 4 ). The two other sequences that contain a serine or threonine residue instead of the first cysteine are found as a subfamily of the PP1 subfamily (Fig. 4) . Thus, the second cysteine and the proline are invariant in 30 out of 30 PP1 sequences. The first cysteine is invariant in 26 out of 30 sequences, but two of the four sequences in which it is not a cysteine lie between the PP1 and PP2A subfamilies in the cluster analysis.



View larger version (17K):
[in this window]
[in a new window]
 
Figure 4. Cluster analysis of the multiple sequence alignment of PP1, PP2A, and PP2B sequences. The PP1 subfamily is shown as magenta bars, the PP2A subfamily as dark blue bars, and the PP2B subfamily as green bars. The sequence that is apparently improperly annotated as a PP1 protein is shown as a red bar. The sequences in the PP1 subfamily that contain serine or threonine in place of the first cysteine are shown as yellow or cyan bars, respectively.

This strong residue conservation in the PP1 subfamily contrasts with the sequences in the other subfamilies, where the identities of the residues that are sequentially homologous to the cysteines and proline are not conserved (Table 1) . In the PP2A subfamily, the second cysteine is replaced by an invariant tyrosine. The first cysteine is replaced by leucine, valine, or isoleucine, and the proline is replaced by either isoleucine, valine, leucine, alanine, lysine, or phenylalanine. In the PP2B subfamily, the position homologous to the proline is conserved as a phenylalanine, but the first cysteine is replaced by serine, threonine, or alanine whereas the second cysteine is replaced by asparagine, valine, cysteine, alanine, or serine. The strong conservation of the two cysteines and the proline in the PP1 subfamily, but not in the PP2A or PP2B subfamilies, strongly supports our contention that these residues are functionally important.

Proximity of putative disulfide oxidoreductase active site to the phosphatase active site
If the putative disulfide oxidoreductase active site is a location for activation or inactivation of the phosphatase, one would expect the redox site to be located somewhere near the phosphatase active site. This is indeed found to be the case. This fact points out the power of a structure-based approach to function prediction. Even if a sequence-based method had found the disulfide oxidoreductase active site (and none did), one could not tell if the two functions were independent or interdependent based on sequence alone. The active site residues in 1fjm are found to lie in a groove of the protein (6) . The inhibitor microcystin, which was cocrystallized with the phosphatase, lies along this groove, across the phosphatase active site residues (Fig. 1C ). The alpha carbon of the histidine in the phosphatase active site is an average of 14 Å from the three alpha carbons of the putative redox active site, which is about half the length of the groove along which the microcystin lies (Fig. 1C ). Thus, the putative disulfide oxidoreductase active site is in a location where it could potentially affect the phosphatase active site of the protein.

Analysis of other residues possibly involved in the disulfide exchange reaction
We then looked in the vicinity of the putative disulfide oxidoreductase active site for other residues that might be involved in the disulfide exchange mechanism. It has been reported that a buried aspartic acid residue (Asp 26 in E. coli thioredoxin) is responsible for decreasing the pKa of the active site cysteines, thus increasing the rate of the disulfide oxidoreductase reaction in vivo (10) . Assuming a similar mechanism might be at work here, we looked for aspartic acid and glutamic acid residues near the cysteines in the serine/threonine phosphatase-1 structure. We found two: Asp 154 and Glu 44. Asp 154 is adjacent to Cys 155, one of the putative active site cysteines, in the helix. This residue is well conserved and is either an aspartic acid or a glutamic acid in the 30 PP1 sequences analyzed (Figs. 2 and 4) . The residue that aligns with Asp 154 in the PP2A subfamily is aspartic acid in 20 out of 24 sequences and glutamic acid in two; the other two are Asn and Gln. In the PP2B subfamily in this position are glutamic acid, aspartic acid, histidine, glutamine, and arginine. Thus, the negative charge is invariant in the PP1 subfamily; it is conserved in the PP2A subfamily but is not conserved in the PP2B subfamily. Glu 44 is found in a helix that packs against and is parallel to the helix containing the potential active site cysteines. This residue is not as conserved in either of the subfamilies (Fig. 2) . The conservation of the negative charge on residue 154 suggests it might be involved in the disulfide exchange mechanism.

Positively charged side chains could also work to stabilize the negative thiolate anion that is thought to form during the disulfide exchange reaction. We found several arginine residues in close structural proximity to the cysteines, including Arg 43, whose side chain lies over the cysteine side chains, and Arg 191, which is adjacent to the cis-Pro 192. Arg 191 is strictly conserved in all the serine/threonine phosphatase subfamilies. The positive charge at residue 43 is conserved in the PP1 subfamily: it is an arginine in 21 sequences and a lysine in 8 sequences, but in the PP2A subfamily it can be arginine, lysine, valine, or glutamine. In the PP2B subfamily, there are no arginines, but alanine, threonine, or glycine can be found at this position (Fig. 2) .

In the 1fjm crystal structure, there is another cysteine, Cys 39, which lies in a position to potentially react with the putative redox Cys 155. This residue is usually a cysteine in PP1 sequences, but is replaced by a valine in two sequences: P23733 and P23734. These two sequences are the ones found between the PP1 and PP2A subfamilies in the cluster analysis (Fig. 4) . In these two sequences, the first cysteine is also replaced by a threonine. This residue covariation does suggest that all three cysteines might be involved in the redox reaction in the serine/threonine phosphatases. Cys 39 is conserved in the PP2A subfamily, except for one protein, where it is replaced by valine. In the PP2B subfamily, the residues found in this position are isoleucine, leucine, and valine.


   DISCUSSION
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 
Summary of results
By applying structure-based function identification methods to the known high-resolution structures found in the Brookhaven database, we have predicted the presence of a novel active site in the serine/threonine phosphatases. This particular active site had not previously been identified even though the protein has been well studied. Subsequent sequence analysis supports the prediction and demonstrates that the disulfide oxidoreductase site is found only in the PP1 subfamily. The prediction can now be tested experimentally. If it turns out to be correct, then a structure-based function identification method will have been shown to be crucial for identifying an important biological function of the well-studied serine/threonine protein phosphatase-1 subfamily.

Biological significance of a redox site in the serine/threonine phosphatase-1 subfamily
Serine/threonine protein phosphatases catalyze the removal of a phosphate group from a serine or threonine residue. They are found in all eukaryotic cell types and play a central role in the control of many cellular processes, including control of the cell cycle, metabolic regulation, and growth factor signaling pathways (reviewed in ref 24 ). Because of this central role, they are also subject to complex regulatory mechanisms (24) .

Control of serine/threonine protein phosphatases and other phosphatases by redox mechanisms has been hypothesized and in some cases demonstrated (17 , 22 , 25 26 27) ; however, the mechanisms are complex and not well understood. Serine/threonine protein phosphatases-1 and -2A have been shown to be inactivated by a variety of thiol group reagents, although PP1 is more rapidly inactivated by several of these compounds (21) . The in vivo significance of these observations is not understood. A redox-sensitive protein phosphatase is inactivated by tumor necrosis factor/interleukin-1 signal transduction (22) . More recent evidence has suggested that the particular redox-sensitive phosphatase in this case comes from the PP2A subfamily (28)

Given the multiplicity of pathways that the serine/threonine phosphatases regulate, there probably are multiple control mechanisms that act under many different specific conditions. Because PP1 proteins are more rapidly inactivated by some sulfhydryl reagents (21) and given the structural similarity of the site described here to the disulfide oxidoreductase active sites in the glutaredoxin/thioredoxin protein family, that this site is not conserved in the PP2A or PP2B subfamilies, and that the putative disulfide reductase active site is very close to the phosphatase active site, we propose that a specific pathway exists for redox regulation for the PP1 subfamily that involves this site.

Implications for functional genomics and function annotation
This prediction, if true, has some significant implications for the burgeoning field of functional genomics. First of all, by using structural descriptors of active sites, we do not require that proteins having the same function be evolutionary related. Indeed, in this method such information is not necessary. In the specific case of the serine/threonine phosphatases, we have identified a putative disulfide reductase active site using a descriptor based on proteins having no apparent evolutionary relationship whatsoever to this family. Furthermore, this study suggests that proteins with very similar sequences can gain additional functional sites during evolution. This has been shown in other proteins (see, for example, ref 29 ), but has been rarely predicted in advance of experiment. Thus, a protein can have more than one `function', and identification of a sequence as a serine/threonine phosphatase, or even as a serine/threonine phosphatase-1, is not enough to identify the full, multilevel biological function of the protein. The most useful annotation methods will explicitly identify all levels of function, including catalytic activities, substrate specificities, binding of cofactors, interaction with regulatory proteins, or interaction with other macromolecules. Structural descriptors of active sites, rather than linear sequence motifs, are especially well suited to this task.


   FOOTNOTES
 
2 FFF, fuzzy functional form; PPI, protein phosphatase-1.

Received for publication January 27, 1999. Accepted for publication July 29, 1999.


   REFERENCES
TOP
ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
REFERENCES
 

  1. Fitch, W. (1970) Distinguishing homologous from analogous proteins. Syst. Zool. 19,99-106[Medline]
  2. Fetrow, J. S., Skolnick, J. (1998) Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases. J. Mol. Biol. 281,949-968[Medline]
  3. Fetrow, J. S., Godzik, A., Skolnick, J. (1998) Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity. J. Mol. Biol. 282,703-711[Medline]
  4. Zhang, L., Godzik, A., Skolnick, J., Fetrow, J. S. (1998) Functional analysis of E. coli proteins for members of the {alpha}/ß hydrolase family. Folding Design 4,535-548
  5. Abola, E. E., Bernstein, F. C., Bryant, S. H., Koetzle, T. F., Weng, J. (1987) Allen, F. H. Bergerhoff, G. Sievers, R. eds. Protein Data Bank in Crystallographic Databases—Information Content, Software Systems, Scientific Application ,107-132 Data Commission of the International Union of Crystallography Bonn/Cambridge/Chester.
  6. Goldberg, J., Huang, H. B., Kwon, Y. G., Greengard, P., Nairn, A. C., Kuriyan, J. (1995) Three-dimensional structure of the catalytic subunit of protein serine/threonine phosphatase-1. Nature (London) 376,745-753[Medline]
  7. Bairoch, A., Apweiler, R. (1996) The SWISS-PROT protein sequence data bank and its new supplement TREMBL. Nucleic Acids Res 24,21-25[Abstract/Free Full Text]
  8. Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Shang, Z., Miller, W., Lipman, D. J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25,3389-3402[Abstract/Free Full Text]
  9. Shenkin, P. S., Erman, B., Mastrandrea, L. D. (1991) Information-theoretical entropy as a measure of sequence variability. Proteins Struct. Funct. Gen. 11,297-313
  10. Dyson, H. J., Jeng, M. F., Tennant, L. L., Slaby, I., Lindell, M., Dui, D. S., Kuprin, S., Holmgren, A. (1997) Effects of buried charged groups on cysteine thiol ionization and reactivity in Escherichia coli thioredoxin: structural and functional characterization of mutants of Asp 26 and Lys 57. Biochemistry 36,2622-2636[Medline]
  11. Yang, Y. F., Wells, W. W. (1991) Identification and characterization of the functional amino acids at the active center of pig liver thioltransferase by site-directed mutagenesis. J. Biol. Chem. 266,12759-12765[Abstract/Free Full Text]
  12. Yang, Y. F., Wells, W. W. (1991) Catalytic mechanism of thioltransferase. J. Biol. Chem. 266,12766-12771[Abstract/Free Full Text]
  13. Kortemme, T., Creighton, T. E. (1995) Ionisation of cysteine residues at the termini of model alpha-helical peptides. Relevance to unusual thiol pKa values in proteins of the thioredoxin family. J. Mol. Biol. 253,799-812[Medline]
  14. Chaohong, S., Holmgren, A., Bushweller, J. H. (1997) Complete 1H, 13C, and 15N NMR resonance assignments and secondary structure of human glutaredoxin in the fully reduced form. Protein Sci 6,383-390[Abstract]
  15. Katti, S. K., LeMaster, D. M., Eklund, H. (1990) Crystal structure of thioredoxin from Escherichia coli at 1.68 A resolution. J. Mol. Biol. 212,167-184[Medline]
  16. Rey, A., Skolnick, J. (1993) Computer modeling and folding of four-helix bundles. Proteins Struct. Funct. Gen. 16,8-28
  17. Holmes, C. F. B., Boland, M. P. (1993) Inhibitors of protein phosphatase-1 and -2A; two of the major serine/threonine protein phosphatases involved in cellular regulation. Curr. Opin. Struct. Biol. 3,934-943
  18. Hubbard, M. J., Cohen, P. (1993) On target with a new mechanism for the regulation of protein phosphorylation. Trends Biochem. Sci. 18,172-177[Medline]
  19. Jia, Z. (1997) Protein phosphatases: structures and implications. Biochem. Cell Biol. 75,17-26[Medline]
  20. Cohen, P. T. (1997) Novel protein serine/threonine phosphatases: variety is the spice of life. Trends Biochem. Sci. 22,245-251[Medline]
  21. Nemani, R., Lee, E. Y. C. (1993) Reactivity of sulfhydryl groups of the catalytic subunits of rabbit skeletal muscle protein phosphatases 1 and 2A. Arch. Biochem. Biophys. 300,24-29[Medline]
  22. Guy, G. R., Cairns, J., Ng, S. B., Tan, Y. H. (1993) Inactivation of a redox-sensitive protein phosphatase during the early events of tumour necrosis factor/interleukin-1 signal transduction. J. Biol. Chem. 268,2141-2148[Abstract/Free Full Text]
  23. Das, A. K., Helps, N. R., Cohen, P. T., Barford, D. (1996) Crystal structure of the protein serine/threonine phosphatase 2C at 2.0 A resolution. EMBO J. 15,6798-6809[Medline]
  24. Mumby, M. C., Walter, G. (1993) Protein serine/threonine phosphatases: structure, regulation, and functions in cell growth. Physiol. Rev. 73,673-699[Abstract/Free Full Text]
  25. Denu, J. M., Tanner, K. G. (1998) Specific and reversible inactivation of protein tyrosine phosphatases by hydrogen peroxide: evidence for a sulfenic acid intermediate and implications for redox regulation. Biochemistry 37,5633-5642[Medline]
  26. Soulie, J.-M., Buc, J., Riviere, M., Ricard, J. (1985) Equilibrium binding of thioredoxin fB to chloroplastic fructose-bisphosphatase: evidence for a thioredoxin site distinct from the active site. Eur. J. Biochem. 152,565-568[Medline]
  27. Jacquot, J.-P., Lopez-Jaramillo, J., Miginiac-Maslow, M., Lemaire, S., Cherfils, J., Chueca, A., Lopez-Gorge, J. (1997) Cysteine-153 is required for redox regulation of pea chloroplast fructose-1,6-bisphosphatase. FEBS Lett 401,143-147[Medline]
  28. Guy, G. R., Philp, R., Tan, Y. H. (1995) Activation of protein kinases and the inactivation of protein phosphatase 2A in tumour necrosis factor and interleukin-1 signal transduction pathway. Eur. J. Biochem. 229,503-511[Medline]
  29. Matthews, D. A., Smith, W. W., Ferre, R. A., Condon, B., Budahazi, G., Sisson, W., Villafranca, J. E., Janson, C. A., McElroy, H. E., Gribskov, C. L., et al (1994) Structure of human rhinovirus 3C protease reveals a trypsin-like polypeptide fold, RNA-binding site, and means for cleaving precursor polyprotein. Cell 77,761-771[Medline]



This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
K. M. Humphries, J. K. Pennypacker, and S. S. Taylor
Redox Regulation of cAMP-dependent Protein Kinase Signaling: KINASE VERSUS PHOSPHATASE INACTIVATION
J. Biol. Chem., July 27, 2007; 282(30): 22072 - 22079.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Goyal, D. Mohanty, and S. C. Mande
PAR-3D: a server to predict protein active site residues
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W503 - W505.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Physiol. Lung Cell. Mol. Physiol.Home page
S. Chu and T. J. Ferro
Identification of a hydrogen peroxide-induced PP1-JNK1-Sp1 signaling pathway for gene regulation
Am J Physiol Lung Cell Mol Physiol, November 1, 2006; 291(5): L983 - L992.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
S. Quevillon-Cheruel, N. Leulliot, M. Graille, N. Hervouet, F. Coste, H. Benedetti, C. Zelwer, J. Janin, and H. Van Tilbeurgh
Crystal structure of yeast YHR049W/FSH1, a member of the serine hydrolase family
Protein Sci., May 1, 2005; 14(5): 1350 - 1356.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. M. Baxter, J. S. Rosenblum, S. Knutson, M. R. Nelson, J. S. Montimurro, J. A. Di Gennaro, J. A. Speir, J. J. Burbaum, and J. S. Fetrow
Synergistic Computational and Experimental Proteomics Approaches for More Accurate Detection of Active Serine Hydrolases in Yeast
Mol. Cell. Proteomics, March 1, 2004; 3(3): 209 - 225.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
S. Goldsmith-Fischman and B. Honig
Structural genomics: Computational methods for structure analysis
Protein Sci., September 1, 2003; 12(9): 1813 - 1821.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
C.-K. Wu, T. A. Dailey, H. A. Dailey, B.-C. Wang, and J. P. Rose
The crystal structure of augmenter of liver regeneration: A mammalian FAD-dependent sulfhydryl oxidase
Protein Sci., May 1, 2003; 12(5): 1109 - 1118.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Physiol. Cell Physiol.Home page
L. Samavati, M. M. Monick, S. Sanlioglu, G. R. Buettner, L. W. Oberley, and G. W. Hunninghake
Mitochondrial KATP channel openers activate the ERK kinase by an oxidant-dependent mechanism
Am J Physiol Cell Physiol, July 1, 2002; 283(1): C273 - C281.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. J. Ondrechen, J. G. Clifton, and D. Ringe
THEMATICS: A simple computational predictor of enzyme function from structure
PNAS, October 12, 2001; (2001) 211436698.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
J. S. Fetrow, N. Siew, J. A. Di Gennaro, M. Martinez-Yamout, H. J. Dyson, and J. Skolnick
Genomic-scale comparison of sequence- and structure-based methods of function prediction: Does structure provide additional insight?
Protein Sci., May 1, 2001; 10(5): 1005 - 1014.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. J. Ondrechen, J. G. Clifton, and D. Ringe
THEMATICS: A simple computational predictor of enzyme function from structure
PNAS, October 23, 2001; 98(22): 12473 - 12478.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by FETROW, J. S.
Right arrow Articles by SKOLNICK, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by FETROW, J. S.
Right arrow Articles by SKOLNICK, J.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS