FASEB J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


FJ EXPRESS SUMMARY ARTICLE
The
Full-length version of this article is also available, published online July 24, 2001 as doi:10.1096/fj.00-0889fje.
Published as doi: 10.1096/fj.00-0889fje.
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
15/11/2054
00-0889fjev1    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by VOHRADSKY, J.
Right arrow Articles by RAMSDEN, J. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by VOHRADSKY, J.
Right arrow Articles by RAMSDEN, J. J.
(The FASEB Journal. 2001;15:2054-2056.)
© 2001 FASEB

Genome resource utilization during prokaryotic development 1

J. VOHRADSKY*,{dagger} and J. J. RAMSDEN{ddagger},§2

* Department of Microbiology, Biozentrum of the University, Basel, Switzerland;
{dagger} Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic;
{ddagger} Department of Biophysical Chemistry, Biozentrum of the University, Basel, Switzerland; and
§ Collegium Basilea, Institute of Advanced Study, Basel, Switzerland

2Correspondence: Hochstrasse 51, CH-4053 Basel, Switzerland. E-mail: J.Ramsden{at}unibas.ch

SPECIFIC AIM

From our previous demonstration that the simplified canonical law (scl) that relates protein synthesis rates to the rank of those rates in an ordered list characterizes the quantitative state of gene expression in a prokaryote, we extend our work to monitoring the development of a bacterial culture, making use of protein expression data obtained using 2-dimensional (2D) electrophoresis. We had previously examined such data obtained from arbitrarily chosen epochs for cultures of several different microorganisms; here, our aim was to examine how the two characteristic parameters of the scl evolve during the natural development of a culture of a single organism, Streptomyces coelicolor.

PRINCIPAL FINDINGS

1. The scl can be fitted to the distribution of protein synthesis rates throughout development
Protein synthesis rates were measured in S. coelicolor in liquid culture (data from C. J. Thompson’s laboratory at the Biozentrum, Basel University). Samples were pulse radiolabeled with 35S-met/cys at successive epochs, and the proteins extracted and separated on 2D gels according to isoelectric point (3<pI<8) and molecular mass (15,000<Mr<90,000). Autoradiographs of the gels were scanned digitally to determine integrated spot densities Ir, from which the normalized rates of synthesis pr were obtained:

where R is the total number of measured spots (up to 1238) on a given gel.

The pr were then ranked in decreasing order of rate of synthesis. According to our previous work, these rates can be predicted from their ranks (r) by using the scl:

where {rho} and {theta} are the parameters of the distribution. P is simply a normalizing coefficient, chosen to ensure that

To find the characteristic parameters {rho} and {theta}, they were allowed to vary while fitting Eq. 2 to the data (under the constraint of Eq. 3 ) using a Levenberg-Marquardt nonlinear least-squares algorithm; a typical fit is shown in Fig. 1 .



View larger version (13K):
[in this window]
[in a new window]
 
Figure 1. Representative plot of log10pr vs. log10r for S. coelicolor (data from a single gel after 30 h growth). Points: data; solid line: eq. 2 fitted only as far as r = 468 (marked by an arrow). The complete data set contained 978 points.

2. The growth hiatus associated with change in nutrient utilization is accompanied by marked changes in the theta parameter (which characterizes the extent to which the genome is used) and in the rho parameter (which characterizes the extent of functional redundancy among the expressed genes)
{theta} began at a fairly steady, moderately high value, suddenly jumped up to a peak of almost 1, plunged to a deep minimum, and then regained its previous value (as the culture became senescent, {theta} slowly declined). {rho}, on the other hand, remained fairly steady during growth except for a sudden jump to a high peak, just when {theta} sank to a trough, and almost equally abruptly fell back to its original value (Fig. 2 ). These dramatic changes in {rho} and {theta} coincided precisely with the exhaustion of the initially exploited nutrient (maltose), when the organism had to readapt to a different carbon source (glutamate).



View larger version (22K):
[in this window]
[in a new window]
 
Figure 2. Plot of {theta} ({circ}, means connected by the solid line) and {rho} (•, means connected by the dashed line) vs. growth time. Ranges are indicated by vertical lines. Each growth time was sampled and analyzed in triplicate.

3. The scl concept is a novel and revealing concept for exploiting the explosion of data generated from advances in proteomics. The parameters provide a very compact description of the quantitative state of the proteome
Two-dimensional gel electrophoresis has now advanced sufficiently to allow the simultaneous quantification of almost all the expressed proteins in a cell (the proteome), opening up extensive new possibilities (in principle) for understanding the organization of metabolism. Fitting the scl to a ranked list of protein synthesis rates allows the state of gene expression to be characterized by just two parameters with clear biological significance, and allows the parameters of the distribution of synthesis rates to be interpreted according to global features of gene expression.

CONCLUSIONS AND SIGNIFICANCE

The ultimate utility of the scl concept depends on being able to assign biological significance to the parameters, and to understand the underlying reason for the remarkably good fits of the protein synthesis rates to this law. A vital clue came from comparing the abrupt changes in {rho} and {theta} with simultaneous measurements of the overall growth rate of the culture, combined with principal component analysis (PCA) of the expression pattern determined from the 2D electrophoretograms.

For PCA, the spots were ranked according to image quality, and the best 128 spots (i.e., ~10% of the total) selected as a minimal representative set. Each epoch is thus a point in 128D space, the coordinates being given by the spot densities. To apply PCA, a much smaller set (the principal components) of orthogonal linear combinations of these original variables were found, which incorporate all the variance of the original data. The first three principal component axes alone incorporate 56% of the variance of the whole data set.

Four distinct consecutive growth phases could be identified. The values of the first principal axis (PC1, bearing 26% of the variance of the whole database) remain fairly constant during each phase, whereas the passages from one phase to the next one are accompanied by abrupt and marked changes. These four phases are:

1) Standard exponential growth;

2) The transition phase (cessation of growth), associated with exhaustion of the nutrition source exploited initially. On the second and third principal axis (PC2 and PC3), this phase is demarcated by points 8 and 9 (32–36 h growth), which exhibit exceptionally high values, whereas the values of the other points remain close to 0 and therefore have no significant influence on the variance borne by PC2. The extraordinarily high values at points 8 and 9 reflect marked changes in the distribution of proteins synthesized during this phase of development;

3) The second (post-transition) phase of exponential growth;

4) Onset of the stationary phase; growth of the culture is arrested.

Comparison of the plain growth rate data, together with the far more sophisticated PCA analysis, with the scl parameters, clearly indicates a close correlation between the peaks and troughs in the evolution of {rho} and {theta} and the growth phases. The transition phase associated with the exhaustion of nutrient is anticipated by the peak in {theta}, and the transition phase proper corresponds to the trough in {theta} and the peak in {rho}.

Our application of the scl (Eq. 2) is a generalization of a result originally derived in the context of communication theory. It was shown that when a message is transmitted word by word, the scl is the distribution of word usage frequency that minimizes the overall cost of transmitting a message containing a given quantity of information.

Distributions of word usage in many different languages, ranging from Chinese and Latin to English, have been shown to follow the scl. One of the motivations for the derivation of the scl was a large body of prior data showing that word usage distributions could be approximated by a simplified form of Eq. 2 , known as Zipf’s law, in which {theta} is always 1 and {rho} is always 0. On the one hand, it is remarkable that a power law was found for so many languages, suggesting some common fundamental mechanism; on the other hand, there were clearly significant deviations between Zipf’s law and the data, which were then eliminated by allowing {theta} and {rho} to take on different values for each data set.

The parallel between communication and the cell is that the genome is communicating information to the organism (i.e., specifying form and function) protein by protein while always operating under the overall constraint of minimizing energy consumption.

By analogy with the familiar thermodynamic temperature, {theta} is called the ‘informational temperature’. It is a measure of resource utilization: low values correspond to a state in which the most strongly expressed proteins are almost the only ones present; conversely, a value of 1 corresponds to maximum exploitation of the available genetic resources.

{rho} characterizes the degree of functional redundancy prevailing among the most abundantly expressed proteins. High {rho} corresponds to high diversity in the sense of implying an absence of functional redundancy (note that redundancy often plays a key role in maintaining the stability of natural processes). This is exactly what would be expected under the emergency condition in which the culture, having almost starved itself to extinction, is suddenly able to start metabolizing a hitherto unexploited food resource. There is an immediate and imperative need to survive, and only those proteins that are absolutely essential (and not already available) will be synthesized. There are not enough nutritional resources to allow the cell the shrewd ‘luxury’ of creating backup pathways.

The scl concept is rooted in a minimization (of energy) condition operating under a constraint of transmission of a certain amount of information in discrete (word) units. Our generalization of words to include proteins is not merely fortuitous, but is actually a deep analogy. In particular, the discrete (gene by gene, protein by protein) way the genome encodes and transmits its message to its corporeal host stands in sharp contrast to Shannon’s approach to the transmission of messages in which an entire message is optimized en bloc. A priori, this seems improbable for cell metabolism: it would result in a far too rigid response and be hopelessly inflexible in dealing with the vicissitudes of a naturally fluctuating environment. We ascertained that the Shannon entropy H of the protein distribution

declined slowly and more or less uniformly throughout the evolution of the culture without displaying any abrupt features.

Characterization of an ‘emergency mode’
After a new source of nutrition has been identified along with the necessary metabolic machinery, {theta} drops abruptly, indicating strong concentration of effort onto relatively few proteins. The cell is now in emergency mode: a new food source has just been identified, and the cell’s utmost priority is to get its metabolism running again as fast as it can before it starves. At the same time, {rho} rises to unprecedentedly high values, indicating the absence of ‘redundant’ backup metabolic pathways. Hence, the organism briefly lives in a highly fragile state, during which it may be expected to be very vulnerable to any sudden stress of a different nature from the nutritional one it is in the process of overcoming. The new stress would require yet another set of different enzymes for its neutralization that the organism momentarily lacks the resources to provide.

Our arguments are summarized in Fig. 3 . Chip technology and quantitative proteomics have advanced to the ability to provide a quantitative picture of gene expression in time for large numbers of genes working in parallel. The scl condenses the information about overall gene expression into a small number of variables by which the state of the organism can be characterized.



View larger version (12K):
[in this window]
[in a new window]
 
Figure 3. Schematic diagram showing environmental input to the cell and its metabolic decisions. Signals from the external environment trigger an emergency response. In anticipation of impending starvation, the cell searches through its entire genome in an attempt to find enzymes to metabolize other potential nutrients; meanwhile, growth ceases. Once new metabolizers are found, the dwindling energy resources of the cell must be concentrated on their synthesis in order to get metabolism onstream again. The abrupt changes in {rho} and {theta} allow us to identify an ‘emergency’ condition of the culture as it responds and seeks to adapt to a new environmental situation.

The relative amounts of protein resolved on 2D electrophoretograms are fingerprints of protein biosynthesis (i.e., the quantitative state of gene expression) at a given epoch in development.

We show that the distribution of protein synthesis rates always follows the canonical law and the parameters depend only on the state of gene expression, independent of other variables such as cell type and type of organism. The parameters of the canonical law represent ‘state variables’ of gene expression analogous to those known from thermodynamics.

Mathematical approaches using 2D electrophoresis data are, or are becoming, an integral part of the technology and make it possible to interpret the data provided beyond mere enumeration of all the expressed genes.

FOOTNOTES

1 To read the full text of this article, go to http://www.fasebj.org/cgi/doi/10.1096/fj.00-0889fje ; to cite this article, use FASEB J. (July 24, 2001) 10.1096/fj.00-0889fje





This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
15/11/2054
00-0889fjev1    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by VOHRADSKY, J.
Right arrow Articles by RAMSDEN, J. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by VOHRADSKY, J.
Right arrow Articles by RAMSDEN, J. J.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS