|
|
||||||||
|
FJ
EXPRESS SUMMARY ARTICLE The Full-length version of this article is also available, published online February 6, 2004 as doi:10.1096/fj.03-0475fje. |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Chemical Engineering and Material Science, Michigan State University, East Lansing, Michigan, USA
2Correspondence: Michigan State University, Department of Chemical Engineering and Material Science, 1257 EB, East Lansing, MI 48824, USA. E-mail: krischan{at}egr.msu.edu
SPECIFIC AIMS
The goal of this study was to illustrate a Bayesian based framework capable of inferring hepatocellular networks from metabolic data. Reverse engineering of pathways and networks provides a framework for predictive modeling and hypotheses testing to gain deeper insight into biological systems. We developed a framework that combines data-driven reverse engineering and model-driven statistical testing methods to infer biochemical networks. Evaluating this methodology initially against the known biochemical network(s) provides confidence in novel networks that may be uncovered from the data using this framework.
PRINCIPAL FINDINGS
1. Sub-networks such as the tricarboxylic acid (TCA) and urea cycles were inferred from metabolic data
Using Bayesian network analysis we reverse-engineered metabolic subnetworks from the flux data, namely, the TCA (Fig. 1
A) and urea (Fig. 1C
) cycles. The TCA cycle is important for producing cellular energy (ATP) from the oxidation of fuels and generating intermediates for the gluconeogenic pathway. The analysis was able to infer causal relationships (solid lines), but not links between pathways 1011 and 1213 (dashed lines) shown in Fig. 1B
. The analysis identified the coupling between the TCA cycle and oxidative phosphorylation (flux nos. 5153). This recognition is encouraging, since oxidative phosphorylation by which electron transfer from NADH and FADH2 to O2 forms ATP is coupled to the TCA cycle. It is noteworthy that in our model, pathway no. 1011 combines two pathways, namely, citrate - isocitrate and isocitrate -
-ketoglutarate; and likewise, citrate -
-ketoglutarate combines with pathway no. 1213. Another important hepatic function is removal of NH4+ given that elevated levels of NH4+ are toxic. NH4+ (derived typically from protein and amino acid metabolism) is converted to urea by the liver through the urea cycle. Bayesian network analysis was able to "learn" most of the pathways in the urea cycle. The link between pathways 16 and 15 was not learned by Bayesian network analysis (Fig. 1D
).
|
2. Intracellular TG sub-networks were postulated and compared with Bayesian metric score
Despite the significant amount of information known about the hepatic metabolic network, understanding is incomplete. Due to imperfect knowledge of the metabolic network, it is plausible that important latent variables not measured but revealed by the model may be linked to currently unassociated pathways and may indeed exist. The ability of Bayesian network analysis to infer the known metabolic subnetworks from the flux data lend credence to the latent relationships it may uncover. Previously, we found that intracellular triglyceride (TG) accumulation was not captured completely by the partial least squares (PLS) method. A possible explanation may be due to insufficient details of the metabolic pathways pertinent to intracellular TG.
Applying a constraint-based algorithm to infer a sub-network containing pathways linked to TG, we found that intracellular TG (flux no. 76) was influenced most by ß-hydroxybutyrate (BOH) dehydrogenase (flux no. 50), glutaminase (flux no. 38), and glutamate dehydrogenase I (flux no. 36); while extracellular TG (flux no. 70) was affected by lactate dehydrogenase (LDH) (flux no. 8), glyceraldehyde-3-P (G3P) (flux no. 5), free fatty acid uptake (flux no. 72), glutamate dehydrogenase I (flux no. 36), and asparaginase (flux no. 45). Latent variable detection algorithm was applied to the aforementioned pathways, along with several other pathways (flux nos. 2,26,54,61,64,73,74,75) and results suggest that latent variables may exist which influence intracellular TG and the glutamate related pathways (flux nos. 36 and 38). From currently known pathways related to TG metabolism and inferred connections, we postulated several alternative networks, shown in Fig. 2
. We tested the hypothesis that flux no. 50 may be a latent variable between flux no. 76 and flux no. 36 (Fig. 2A
) and between flux no. 76 and flux no. 38 (Fig. 2C
). Similarly, we tested the hypothesis that flux no. 36 was a latent variable between flux nos. 76 and 38 (Fig. 2B
) and flux no. 38 was a latent variable between flux nos. 76 and 36 (not shown, but will be denoted as 2B alternate). Finally, in Fig. 2D
, we tested the possibility that flux no. 50 was the latent variable influencing flux nos. 76 and 38, in combination with flux no. 38 as the latent variable for flux nos. 76 and 36. We evaluated the likelihood of these models with a Bayesian metric score and prediction accuracy of the postulated model. With the help of a metric score and the models prediction accuracy the model illustrated in Fig. 2D
is the most likely of the several hypothetical models postulated. As an indirect confirmation of the model in Fig. 2D
, we applied the Markov Chain Monte Carlo (MCMC) algorithm, which is a search and score method, to pathways illustrated in Fig. 2
. Since this is a computationally expensive method, application is not feasible for the entire metabolic network. On the other hand, we were able to obtain a solution for the sub-network after 500 iterations. This search and score method identified the connection between flux nos. 38 and 76, which we postulated as being the latent variable between flux nos. 36 and 76. This indirectly lends support to our hypothesis that flux no. 38 may be the latent variable linking flux no. 36 with flux no. 76.
|
3. Sensitivity analysis of Bayesian network identifies the important pathways in a network
A sensitivity analysis was applied to the intracellular TG sub-network, with intracellular TG (flux no. 76) as the response variable. The results indicate that the intracellular TG level is most sensitive to ß-hydroxybutyrate (flux no. 50) and glutaminase (flux no. 38), and least sensitive to glutamate dehydrogenase (flux no. 36) and cholesterol ester uptake (flux no. 75). Although the sensitivity coefficients are all small, the coefficients for ß-hydroxybutyrate (flux no. 50) and glutaminase (flux no. 38) are generally an order of magnitude larger than the other pathways, indicating that they have more impact on TG storage. Thus, changing flux nos. 50 and 38 will more likely alter the level of intracellular TG accumulation as oppose to altering other pathways.
4. Noise in the data and omitted measurements are two possible reasons for the missing relationships
In the reversed Bayesian network, several causal relations were missed, for example, pathways 1213 in Fig. 1B
and 1516 in Fig. 1D
. Two possible explanations could account for this, either noise in the data or omitted measurements. We tested the first hypothesis by adding noise to our data and examining its effect on the resulting network structure. We added 3 levels of noise (5%, 10%, and 20%) to the data. When 5% noise was added, the pathway 1617 disappeared and pathway 1516 was learned. This indicated the possibility that pathway 1516 may have been missed due to noise in the data. Adding 10% noise to the data resulted in pathway 1718 being replaced by a new pathway 1618; and adding 20% noise eliminated both pathways 1516 and 1718. Results suggested that Bayesian network analysis could tolerate a certain degree (5%
10%) of noise within the data to provide a satisfactory level of performance, represented by the number of causal relation inferred. Although when noise was increased to a higher level (e.g., 20%), the performance of Bayesian network approach decreased as reflected in the higher number of missed causal relations. We tested the second explanation by omitting one measurement from the data and examining the pathways inferred. To illustrate this idea, we removed the lactate measurement from the dataset, which resulted in the omission of two additional pathways, pathways 8
7 and 79 in Fig. 1B
, which were no longer inferred by the Bayesian network analysis. Therefore, both noise and missing measurements could contribute to an incomplete or incorrect reversed Bayesian network.
CONCLUSIONS
We developed a Bayesian-based framework, schematically shown in Fig. 3
, to infer network structures from metabolic data. From the resulting network, we can identify the relevant variables that most affect target function(s), thereby providing a framework to optimize target function(s) and insight into the underlying mechanisms that govern the metabolic state. Finally, the ability of this methodology to infer well-known metabolic structures from experimental data provides confidence in the ability of this methodology to infer other networks, such as, genetic regulatory networks from gene data.
|
FOOTNOTES
1 To read the full text of this article, go to http://www.fasebj.org/cgi/doi/10.1096/fj.03-0475fje; doi: 10.1096/fj.03-0475fje ![]()
This article has been cited by other articles:
![]() |
D. R. Boverhof and T. R. Zacharewski Toxicogenomics in Risk Assessment: Applications and Needs Toxicol. Sci., February 1, 2006; 89(2): 352 - 360. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. V. HarshaRani, S. J. Vayttaden, and U. S. Bhalla Electronic Data Sources for Kinetic Models of Cell Signaling J. Biochem., June 1, 2005; 137(6): 653 - 657. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, L. Lu, K. F. Manly, E. J. Chesler, L. Bao, J. Wang, M. Zhou, R. W. Williams, and Y. Cui Inferring gene transcriptional modulatory relations: a genetical genomics approach Hum. Mol. Genet., May 1, 2005; 14(9): 1119 - 1125. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Li and C. Chan Integrating Gene Expression and Metabolic Profiles J. Biol. Chem., June 25, 2004; 279(26): 27124 - 27137. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |