Flux sampling is a powerful tool to study metabolism under changing environmental conditions

Herrmann, Helena A.; Dyson, Beth C.; Vass, Lucy; Johnson, Giles N.; Schwartz, Jean-Marc

doi:10.1038/s41540-019-0109-0

Download PDF

Article
Open access
Published: 02 September 2019

Flux sampling is a powerful tool to study metabolism under changing environmental conditions

npj Systems Biology and Applications volume 5, Article number: 32 (2019) Cite this article

9085 Accesses
53 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The development of high-throughput ‘omic techniques has sparked a rising interest in genome-scale metabolic models, with applications ranging from disease diagnostics to crop adaptation. Efficient and accurate methods are required to analyze large metabolic networks. Flux sampling can be used to explore the feasible flux solutions in metabolic networks by generating probability distributions of steady-state reaction fluxes. Unlike other methods, flux sampling can be used without assuming a particular cellular objective. We have undertaken a rigorous comparison of several sampling algorithms and concluded that the coordinate hit-and-run with rounding (CHRR) algorithm is the most efficient based on both run-time and multiple convergence diagnostics. We demonstrate the power of CHRR by using it to study the metabolic changes that underlie photosynthetic acclimation to cold of Arabidopsis thaliana plant leaves. In combination with experimental measurements, we show how the regulated interplay between diurnal starch and organic acid accumulation defines the plant acclimation process. We confirm fumarate accumulation as a requirement for cold acclimation and further predict γ–aminobutyric acid to have a key role in metabolic signaling under cold conditions. These results demonstrate how flux sampling can be used to analyze the feasible flux solutions across changing environmental conditions, whereas eliminating the need to make assumptions which introduce observer bias.

Identification of flux trade-offs in metabolic networks

Article Open access 10 December 2021

Seirana Hashemi, Zahra Razaghi-Moghadam & Zoran Nikoloski

High-resolution 13C metabolic flux analysis

Article 30 August 2019

Christopher P. Long & Maciek R. Antoniewicz

High-throughput metabolomics for the design and validation of a diauxic shift model

Article Open access 07 April 2023

Daniel Brunnsåker, Gabriel K. Reder, … Ross D. King

Introduction

High-throughput technologies have resulted in a rapid increase in available ‘omic data sets.¹ Large-scale metabolic networks constructed using these data integrate known and predicted metabolic pathways.² These large-scale networks can be constrained using experimental data and the system behavior can be analyzed using metabolic modeling. Metabolism describes a cellular phenotype under given conditions, and changes in metabolite concentrations and reaction fluxes can be used to assess a cellular response to changing environmental conditions.

With the existence of large-scale metabolic networks comes the need to have appropriate modeling techniques available for their analysis. The majority of techniques for analyzing large-scale metabolic networks fall within the paradigm of constraint-based modeling (CBM).³ CBM imposes stoichiometric constraints on the metabolic reactions and analyzes the possible flux solutions at steady state. Because the system is assumed to be at steady state, even genome-scale models can be solved at little computational expense.

Two of the most widely used forms of CBM are flux balance analysis (FBA) and flux variability analysis (FVA).⁴ The key feature of FBA and FVA is that they compute the steady state of a model using an objective function. The objective function defines a reaction that is to be maximized or minimized when solving the system under the set constraints. A typical objective is “maximum biomass production”, whereby essential macromolecules such as proteins and lipids are defined at known ratios or quantities in an outgoing reaction of the metabolic system.^5,6 FBA computes single steady-state solutions, which satisfy the objective. However, often multiple solutions exist and their range can be computed using FVA.⁷ FVA provides no indication as to whether all single-point solutions within the range are feasible and which solutions are the most likely. In order to reduce the number of feasible solutions of FBA and to further constrain the feasible flux ranges returned by FVA, it is common practice to introduce multiple objective functions.⁸

However, defining one or multiple objective function(s) intrinsically introduces an observer bias as to what the main “goal” of the cell is, in the context of the analysis.⁹ Although biomass production as estimated by FBA was seen to match experimental data in Escherichia coli,¹⁰ this may not be an appropriate objective when studying short-term environmental changes. In the green algae Chlamydomonas reinhardtii, a ¹³C-metabolic flux analysis was shown to contradict the assumptions of a prior FBA analysis by demonstrating that maximum biomass and maximum ATP production cannot stand alone as cellular objectives.¹¹ Optimal growth conditions, to which most objective functions are tailored, are an exception in natural environments.⁵ Evolutionarily, metabolism is most likely optimized for overall robustness across many conditions, rather than a single condition-specific objective. Bacillus subtilis mutants, which outperform the wild-type in terms of biomass production in control conditions have been found experimentally; the wild-type, however, is more robust to both environmental and genetic perturbations and therefore holds an evolutionary advantage.¹²

Understanding the optimization process that allows an organism to tolerate changing environmental conditions is of particular interest in crop sciences, where increasing yields will need to be achieved, despite rapidly changing environmental conditions. Owing to their sessile nature, plants are exposed to frequent and sometimes extreme environmental fluctuations. Plant metabolism can therefore be presumed to hold an inherent robustness to changing environmental conditions. Furthermore, plant metabolism is highly intricate as it includes multi-cellular autotrophic and heterotrophic tissues with complex cellular compartmentalization. Analyses of plant metabolism and of plant metabolic strategies can therefore be considered a comprehensive case study of metabolism in general. Genetic modifications do not always produce a desired effect owing to network robustness:^13,14 in crop sciences, optimization for increased yield in control conditions rather than in changing environments is a likely explanation for discrepancies between laboratory and in field results.^15,16

If we wish to use CBM techniques to assess network robustness and phenotypic plasticity, we must be able to capture all alternative solutions and the probability with which they occur (the solutions space) of a metabolic network across different conditions. To calculate the exact properties of the solution space, we can use mathematical techniques such as convex analysis and vertex enumeration;¹⁷ however, owing to their computational intensiveness, these methods are only efficient when applied to small, simple networks¹⁸ or genome-scale models with one or more objective function.¹⁹ When wanting to analyze genome-scale metabolic networks without the use of an objective function, such approaches cannot practically be applied. Sampling of feasible flux solutions provides a realistic alternative.

Flux sampling generates a sequence of feasible solutions (called a chain) that satisfy the network constraints, until the entire solution space is analyzed. Enough samples need to be generated for the samples to provide an accurate representation of the feasible solution space.¹⁸ A chain of samples is said to have converged once it contains enough samples to give an accurate representation of the solution space.²⁰ Flux sampling provides information both on the range of feasible flux solutions (similar to FVA) but also on their probability. Importantly, unlike FBA, flux sampling does not require (but also does not exclude the option for) an objective function to be specified. Therefore, flux sampling methods hold great potential for analyzing optimization strategies that are not defined by clear objectives such as a simple biomass reactions.

When plants are exposed to a change in environmental conditions, such as temperature, which last only for a few days, optimizing biomass production during this period is arguably of secondary importance; sustaining metabolic function with minimal cost may be a higher priority to plants. For example, the allocation of carbon into different transient storage compounds accumulated to maintain cellular processes, has been shown to change when plants are exposed to environmental stresses.^{21,22,23,24,25} Starch, malate, and fumarate are the three major carbon storage compounds which accumulate during the day in leaves of the model plant Arabidopsis thaliana.^21,26,27 Increased cytosolic fumarate accumulation is a known cold response of A. thaliana leaves and has been linked to an increased photosynthetic capacity that sustains metabolism in cooler temperatures.²⁴ Evidently, tight regulation of carbon partitioning is required for successful cold acclimation. Given the large number of reactions and pathways involved in linking primary carbon assimilation to its downstream storage products starch, malate, and fumarate, CBM seems appropriate. Of the CBM methods available, flux sampling allows us to gain a detailed understanding of the solution space and the interdependence of the different carbon stores under different temperature conditions, without imposing the constraint of an objective function.

Multiple large-scale metabolic networks of the model plant A. thaliana have been constructed.^{28,29,30,31,32} Here, we used three of them to formally assess for the efficiency of existing and easily accessible flux sampling algorithms: the coordinate hit-and-run with rounding (CHRR),³³ the artificially centered hit-and-run (ACHR)³⁴ and the optimized general parallel (OPTGP)³⁵ algorithms. We identified the most efficient sampling method based on run-time and convergence, and applied it to study plant acclimation to cold. We experimentally measured diurnal CO2 uptake and organic carbon accumulation of A. thaliana in control and cold conditions. By constraining a leaf metabolic model to the two conditions and using an appropriate flux sampling algorithm, we were able to explore inherent metabolic robustness to temperature and predict the metabolic changes required to support a photosynthetic acclimation response to cold.

Although flux sampling has previously been applied as a technique for studying the solution space of metabolic networks,^29,36,37 this will, to our knowledge, be the first time that the available algorithms are formally compared with one another in the context of metabolic modeling, and that flux sampling is applied to study network robustness across changing environmental conditions.

Results

Both run-time and convergence are fastest when using CHRR in MATLAB

We compared the run-time and convergence of the CHRR, ACHR, and OPTGP algorithms using three metabolic models of A. thaliana. We tested the run-times of 500,000, 5,000,000, and 50,000,000 samples (S), of which 5000 were stored and the rest were discarded with constant measures of thinning. For S = 50,000,000, the CHRR algorithm was 2.5 times faster than the OPTGP and 5.3 times faster than the ACHR for the Arnold model (Fig. 2). This difference in speed increases with model complexity, such that, for the Poolman model, the CHRR was 3.3 times faster than the OPTGP and 8.0 times faster than the ACHR (Fig. 2). The OPTGP was run in two parallel processes; however, even when running it as a single process it is faster than the ACHR. Although we cannot exclude the fact that MATLAB may have a faster connection to Gurobi than Python, flux sampling is fastest when using the CHRR setup as available in the COBRA toolbox for MATLAB.

The number of reactions that did not satisfied the convergence criteria were assessed for each chain (Table 1). The longer run-times of ACHR and OPTGP implementations in Python are not outweighed by faster convergence. In fact, as the three pilot chains with varying thinnings show, the CHRR algorithm, across all model reactions, converges the fastest, with the lowest number of samples required for convergence, the least amount of autocorrelation, and the lowest discrepancy between chains (Table 1). We confirmed this difference in convergence by inspecting trace and auto-correlations plots of individual reactions, such as those for the biomass reaction of the Arnold model shown in Fig. 2 (C). CHRR shows little dependence between consecutive samples even with a thinning of T = 100, whereas OPTGP and ACHR show low levels of autocorrelation only when T = 10,000.

Table 1 Convergence diagnostics comparing three chains of 5000 samples run for all reactions in the Poolman, Arnold, and Dal’Molin models using the ACHR, OPTGP, and CHRR algorithm with the indicated thinning

Full size table

Our results show differences in the outcomes when convergence is reached according to the different convergence diagnostics. All convergence diagnostics agree that the CHRR performs best (Table 1). According to the Raftery & Lewis and the IPSRF diagnostics, all of the flux samples of reactions in the Arnold and Poolman models converge in < 5000 samples with a thinning constant of 10,000 when using CHRR. The fact that such large numbers of samples are required for model convergence shows that, owing to the irregular solution shape of genome-scale metabolic networks, autocorrelation in chains is a common problem and must be overcome and tested for using appropriate convergence diagnostics. Analyzing samples that have not achieved convergence can lead to incorrect conclusions about the metabolic fluxes under study. Currently, many applications of the ACHR, OPTGP, and CHRR algorithms to biological networks do not report whether convergence has actually been achieved.^29,37,38,39

Previous comparisons between the ACHR and CHRR algorithms³³ have been made using 15 different metabolic models but were based only on a single convergence diagnostic, the potential scale reduction factor (PSRF).⁴⁰ Notably, different convergence diagnostics test different features and may not always be in good agreement. Therefore, more than one diagnostic should be used to confirm that the sampling chain is likely to have reached convergence.^20,41 The ACHR and OPTGP algorithms have been compared using five genome-scale metabolic models and three different convergence diagnostics, including the PSRF.³⁵ However, the PSRF assumes a normal distribution of solutions,⁴² which is questionable given that sampling is most needed when the distribution of flux estimates is non-normal,⁴¹ as is often the case in metabolic modeling.

CHRR-based flux sampling generates verifiable hypotheses concerning plant acclimation to cold

When plants of A. thaliana are transferred from 20 °C to 4 °C, photosynthesis is inhibited.²⁴ Metabolism slows down in cooler temperatures and, in order to sustain normal metabolic functions, plants need to acclimate their CO₂ uptake, altering the concentrations of metabolic enzymes to achieve a new steady state.^24,43,44 After 7 days of cold, we observe that the A. thaliana wild-type Col-0 is able to achieve the same level of photosynthesis as measured in control conditions (Fig. 3). The allocation of carbon to the three main carbon storage compounds, malate, fumarate, and starch, shifts in the cold as part of the metabolic acclimation response. After 7 days of acclimation both photosynthesis and transient carbon accumulation attain a new cold acclimated state. Most notably, after 7 days of cold treatment a larger proportion of carbon is partitioned into fumarate (Fig. 3).

Using the experimental data shown in Fig. 3 to constrain the CO₂ input and the malate, fumarate, and starch accumulation reactions using the Arnold model (please see methods for further details), we were able to compute converged flux sampling distributions for all reactions. We did so for both control and cold conditions, which allowed us to overlay the sampling distributions of reaction fluxes and to assess changes required in plant metabolic behavior for acclimation (Fig. 4).

In order to demonstrate how the application of an objective function for an FBA analysis can lead to vastly different conclusions, we have overlaid FBA results for maximum biomass production (under the same model constraints as applied for the sampling) over the flux sampling distributions (Fig. 4). This further emphasizes how an objective function, if inappropriate for the analyses under consideration, can be misleading.

Both sucrose export to other tissues and cytosolic pyruvate production as a precursor for the tricarboxylic acid (TCA) cycle are predicted to be unchanged. The model suggests that, given equal carbon assimilation in control and cold conditions, cellular maintenance and export functions are supported equally in both conditions on the time scale considered. Although sucrose export is difficult to measure experimentally, we observed the rate of respiration in the cold to be the same as in control conditions (Fig. 3). Given that, for both conditions, the model shows equal fluxes from cytosolic pyruvate into the TCA cycle (Fig. 4), which feeds directly into respiration, this model prediction is in agreement with our experimental data.

For fumarate (and other metabolites not shown) the model predicts a shift in cellular compartmentalization with temperature. Model results show an increase in fumarate export from the mitochondrion into the cytosol in the cold. The reverse is shown for fumarate export from the chloroplast, where it is produced via the breakdown of arginosuccinate (Fig. 4). Leaves developed in the cold have increased cytoplasmic and decreased vacuolar volumes;⁴⁵ a reshuffling of metabolites across cellular compartments has therefore previously been proposed as an important temperature acclimation response.⁴⁶

Model results suggest an increase in flux from cytosolic malate to fumarate via cytosolic fumarase (FUM2) (Fig. 4). This is consistent with previous experimental results from mutant studies that show that the fum2.2 mutant of Col-0 is unable to acclimate to cold.²⁴ This reaction is thus essential for photosynthetic acclimation to cold.

Flux sampling distributions suggest a link between carbon and nitrogen metabolism to support cold acclimation

The sampling distributions suggest a trade off between increased carbon compound accumulation and decreased amino-acid production (Fig. 4), linking nitrogen and carbon metabolism. Synthesis of γ-aminobutyric acid (GABA), however, is predicted to increase in the cold. GABA has previously been reported to accumulate in response to environmental stresses, including cold treatment.⁴⁷ GABA has been suggested as a signaling molecule of the carbon to nitrogen status in plant leaves and evidence for its role in regulating nitrate uptake exists for both rapeseed and A. thaliana.^48,49,50 A. thaliana plants in the cold show increased nitrogen assimilation compared with those in control conditions.⁵¹

If GABA is indeed involved in carbon:nitrogen signaling,⁴⁸ it may be counteracted by the increased accumulation of malate. Increased levels of malate have previously been shown to suppress nitrate reductase expression and activity in tobacco leaves.⁵² Malate levels in Col-0 may be kept below a certain threshold, by redirecting carbon to fumarate, thereby supporting adequate nitrogen assimilation and an increased photosynthetic capacity. This hypothesis is supported by the observation that A. thaliana mutants, which show increased levels of malate and decreased levels of fumarate, grow significantly less well in high-nitrogen conditions than Col-0.²² Fumarate has few known metabolic functions in A. thaliana leaves,²⁶ and may thus serve the purpose of a carbon storage buffer in changing environmental conditions.

Discussion

Based on run-time and platform, CHRR is faster than OPTGP, which is itself faster than ACHR. CHRR also converges faster than both OPTGP and ACHR. Users with unrestricted access to MATLAB are therefore recommended to use CHRR. For those who wish to work using an open-source platform, OPTGP is recommended over ACHR; in general, OPTGP converges faster than ACHR, has a shorter run-time and allows for parallel processes.

When running sampling algorithms, sets of flux samples are produced for each reaction in the model. Here, we have tested convergence for all reactions of the models using three different sample chains. We have highlighted the importance of checking for convergence using different diagnostics when analyzing an irregular solution space of large networks. If only a subset of reaction fluxes are of interest, only those distributions will have to be checked for convergence. Flux sampling provides a powerful tool for exploratory analyses assessing metabolic differences across different environmental conditions. Our results further confirm the notion that it is not possible to fully automate convergence analyses using a single diagnostic²⁰ and that results should confirmed via manual inspections of trace and autocorrelation plots.

Flux sampling is currently an under-utilized technique in the metabolic modeling of large-scale networks. Using cold acclimation of the model plant A. thaliana, we demonstrate how flux sampling can be used effectively to analyze alternative feasible solutions across multiple conditions whilst eliminating the need to make assumptions that introduce observer bias. Given short-term environmental changes, adequately sustaining basic metabolic functions with minimal resource investment may be a more-likely cellular objective on that time scale than, for example, maximizing growth. We therefore did not set an objective function for flux sampling but used four experimentally measured flux values (CO₂ input and fumarate, malate and starch accumulation) to constrain a leaf metabolic model.²⁸ We further demonstrated how these flux sampling results can lead to different conclusion than traditional FBA analyses.

By constraining the model to both cold and control conditions we were able to select reactions that show different flux distributions across the two conditions. Our model highlights reactions that are essential to change with temperature (i.e., the flux distribution of the two temperature conditions do not overlap) such as the production of cytosolic fumarate via malate.²⁴ The model further demonstrates the properties of the flux distributions of GABA to differ in cold and control conditions, highlighting GABA as a plausible signaling molecule for supporting a shift in the nitrogen and carbon balance, required to sustain photosynthesis in the cold. Thus, through flux sampling, we were able to generate novel hypotheses about the roles of GABA, fumarate and malate in cold acclimation, which would have been unfeasible to detect using FBA and FVA methods.

By overlaying different FBA solutions onto flux sampling distributions obtained under condition-specific model constraints, FBA in combination with flux sampling, could, in future work, be used to determine plausible objective functions and help generate predictions about how cellular objectives might be changing in response to environmental changes.

Methods

COBRA methods for flux sampling

Constraint-based reconstruction and analysis (COBRA) methods for genome-scale metabolic networks are integrated in the COBRA toolbox^53,54 for the MATLAB programming language and the COBRApy package⁵⁵ for the open-source Python programming language. Three algorithms for flux sampling exist across the two platforms: CHRR (MATLAB), ACHR (MATLAB and Python), and OPTGP (Python). Further flux-sampling algorithms exist;^37,56,57,58 however, as they are not currently available in the COBRA packages, they are here not considered for comparison.^59,60

1.
The artificially centered hit-and-run (ACHR) sampler estimates the center of the solution space in a “warm-up” phase. This estimate is then continuously revised with further sampling. The center estimate is used to inform the direction of further sampling such that the full solution space is covered in fewer steps than in traditional hit-and-run sampling (where the direction of the next sample is chosen at random).³⁴ Although the Markovian nature of hit-and-run (i.e., the fact that each future sampling state is dependent only on the current sampling state) is lost in the ACHR, it overcomes the edge-trapping limitation of the standard hit-and-run algorithm (i.e., it no longer gets stuck at the bounds of a solution space if these are of an elongated shape, a frequent feature in metabolic models).
2.
The optimized general parallel sampler (OPTGP) is argued to be an improvement on the ACHR, because from the warm-up point it generates multiple short chains from the estimated center and only considers the last point in the chain as a sample.³⁵ It thereby increases the randomness and efficiency with which the total solution space is explored. Furthermore, it allows for parallel sampling. Larger samples can thus be generated in shorter run-times.
3.
The coordinate hit-and-run with rounding (CHRR) algorithm starts with a pre-processing step that rounds the solution space to a more regular, convex shape, and therefore a Markov chain can be used to explore the rounded solution space without the limitation of edge-trapping. After sampling, the solutions are back-transformed to match the original solution space in order to obtain the true value of the sampled points.^33,61

Assessing convergence

The aim of flux sampling is to generate enough consecutive samples (a long enough chain) in order to get an accurate depiction of the solution space. A sample chain is considered to have converged once it can be assumed that the sampled subset of solutions represent the properties of the true solutions obtained from an infinite amount of samples (i.e., when the shape of the flux distribution no longer changes with more samples).⁶² In flux sampling, an algorithm’s efficiency is defined by the run-time and the number of samples required for apparent convergence to the true flux distribution. Here, we apply three different diagnostics in order to test for convergence:

1.
Raftery & Lewis diagnostic: this diagnostic estimates the total number of samples, N_max, required for a set of samples to achieve convergence, based on a given subset of samples (pilot chain).⁶³ We estimated N_max required for convergence using the CODA package in R.⁶⁴ The diagnostic further returns a dependence factor, I, which is indicative of autocorrelation (the degree of dependence between consecutive samples). Chains with I > 5 are here considered to be problematic as this value suggests high dependence between samples or influential starting values (i.e., the chain was not run long enough).
2.
Interval-based potential scale reduction factor (IPSRF): adapted from the original Gelman-Rubin diagnostic, PSRF,⁴⁰ the IPSRF is based on comparing the differences between consecutive samples (within sequence interval length) with the total differences observed between all samples (between sequence interval length). Because interval length, rather than variance of the samples (on which the original version is based), is considered, a normal distribution of the samples is no longer a requirement.⁴² As with PSRF, the IPSRF approaches 1 with chain convergence. Here, we consider chains to have failed this convergence diagnostic if IPSRF < 0.9 or IPSRF > 1.1 as calculated using the MCMC Diagnostics toolbox in MATLAB.
3.
Gweke diagnostic: this diagnostic, as implemented in the R CODA package,⁶⁴ tests whether the mean of the first samples of the chain (10%) is significantly different from the mean of the last set of samples (50%) of the chain.⁶⁵ Assuming that the last set of samples has converged to the stationary distribution, if the two subsets are not significantly different, the entire chain can be considered to have converged. Here we consider chains to have failed this convergence diagnostic if z > 1.28.

Because each of the convergence diagnostics tests for different criteria, we use all of the above in order to assure sample convergence. Here, we consider the flux sampling distribution of a reaction to have converged if it passes all of the above criteria.

COBRA toolbox and COBRApy setup

COBRA Toolbox version 3.0.0 and COBRApy version 0.10.1 were installed on MATLAB R2017a and Python version 3.5.2, respectively.⁵⁵ The linear programming solver used was Gurobi, version 8.0.0. The samplers were used in accordance with COBRA documentation using default parameter settings unless otherwise specified. Although an ACHR implementation is available both in the COBRA toolbox and in COBRApy, the comparisons made here are based on the Python implementation of the ACHR sampler because it is open-source and because the OPTGP, which is available only in Python, directly builds on the general parallel sampler (GP) of the ACHR. We did collect preliminary convergence and run-time data of the ACHR algorithm in MATLAB; however, because its convergence was evidently slower than CHRR, we have chosen to omit its MATLAB implementation from further analyses. OPTGP and CHRR algorithms were run using two parallel processes; there is currently no option to run ACHR as multiple processes.

Metabolic models

To compare the performance of the samplers, three published genome-scale A. thaliana metabolic models were obtained in SBML format and are from here on referred to by their first author: the Poolman,³² Arnold,²⁸ and Dal’Molin² models. The Poolman model is based on the Aracyc database⁶⁶ and describes a non-compartmentalized heterotrophic culture using 1406 reactions. The Arnold model describes a compartmentalized, photoautotrophic system and is based on 549 manually curated A. thaliana specific reactions. The Dal’Molin model is a compartmentalized network reconstruction of 1601 reactions, applicable to both photosynthetic and non-photosynthetic tissues of plant metabolism.² The original model constraints^2,28,32 were used when comparing sampler performance across these models, such that the Arnold, Poolman, and Dal’Molin model had 270, 645, and 330 degrees of freedom respectively.

Samples that are close together within a sample chain can be autocorrelated (i.e., be similar to one another due to the way in which the algorithm works). In order to avoid the effects of autocorrelation and to ensure convergence, a large number of samples should be run and a technique called thinning can be applied.⁶⁷ Sample chains that are thinned store only every k^th sample in the chain, where T = k is called the thinning constant. In order to compare autocorrelation of the three algorithms, we applied three different thinning constants (T = 100, 1000, 10,000) storing 5000 samples for each chain. Three replicate chains were run for each value of T. Convergence diagnostics were calculated for sample chains produced for each of the model reactions. Run-times were obtained on a personal laptop (7.2 GB RAM, i5-6200U CPU processor, 4 cores, 2.3 GHz capacity, Ubuntu 16.04.4 OS).

Experimental data

Carbon assimilation and respiration (CO₂ influx and outflux) by the A. thaliana wild-type Columbia-0 (Col-0) was measured using infrared gas analysis at 100 μmol m⁻² s⁻¹ light under control conditions (20 °C) and after 1 week of cold treatment (4 °C) as described previously.²⁴ Fumarate, malate and starch concentrations at the onset and at the end of the photoperiod were measured for both control and cold conditions using enzyme assays.²⁴ Averages of 3–4 replicated were calculated. Measurements taken across the different conditions were tested for significant differences using an unpaired t test (p < 0.05) assuming unequal variances, as implemented in R.

Given that previously published data confirm an approximately constant rate of accumulation of transient carbon storage products during the day,^21,24 we subtracted the beginning of day metabolite concentrations from the end of day concentrations for each of the products in order to obtain a flux value for carbon storage over one photoperiod. These flux values were used to constrain the Arnold model (Fig. 1). This reduced the degrees of freedom of the model to 266.

Model constraints

Given that the Arnold model is leaf specific and manually curated, it suits the purpose of studying photosynthetic plant acclimation. In order to do so we constrained the model with our experimental diurnal flux data for malate, fumarate, and starch accumulation as well as CO₂ influx. We set the cytosolic fumarase reaction, which produces cytosolic fumarate from malate, to be reversible.²⁴ Outgoing fumarate, malate, and starch reactions were added to the model in order to simulate diurnal carbon storage (Fig. 1). Diurnal accumulation of the metabolites was calculated by subtracting average beginning of day concentrations from average end of day concentration values. Metabolite accumulation and CO₂ influx were converted to mmol (gFW)⁻¹ Day⁻¹. The resulting values were applied as model constraints. Upper and lower bounds were applied according to the calculated standard errors of three to four replicates as shown in Fig. 3. To ensure convergence of all flux sampling distributions, 100,000 flux samples with a thinning of 10,000 were generated using the CHRR algorithm in the COBRA Toolbox. A Kruskal–Wallis test, as implemented in the SciPy Python package, version 0.19.1, was used to assess whether flux samples generated using either the cold or the control constrained model stemmed from the same distribution.⁶⁸

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The authors declare that all data supporting the findings of this study are available within the paper and online or can be generated using the publicly available source code (https://github.com/HAHerrmann/FluxSamplingPlantModels; https://doi.org/10.5281/zenodo.3239075).

References

Fondi, M. & Lio, P. Multi-omics and metabolic modelling pipelines: challenges and tools for systems microbiology. Microbiol. Res. 171, 52–64 (2015).
Article CAS PubMed Google Scholar
Dal’Molin, G. O., Queck, L.-E., Palfreyman, R. W., Brumbley, S. M. & Nielsen, L. K. AraGEM, a genome-scale reconstruction of the primary metabolic network in Arabidopsis. Plant Physiol. 152, 579–589 (2010).
Article Google Scholar
Boardbar, A., Monk, J. M., King, Z. A. & Palsson, B. O. Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 15, 107–120 (2014).
Article Google Scholar
Orth, J. D., Thiele, I. & Palsson, B. O. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010).
Article CAS PubMed PubMed Central Google Scholar
Feist, A. M. & Palsson, B. O. The biomass objective function. Curr. Opin. Microbiol. 13, 344–349 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yuan, H., Cheung, M., Hilbers, P. A. J. & van Riel, N. A. W. Flux balance analysis of plant metabolism: the effect of biomass composition and model structure on model predictions. Front. Plant Sci. 7, 537 (2016).
PubMed PubMed Central Google Scholar
Antoniewicz, M. R. Methods and advances in metabolic flux analysis: a mini-review. J. Ind. Microbiol. Biotechnol. 42, 317–325 (2015).
Article CAS PubMed Google Scholar
Budinich, M., Bourdon, J., Larhlimi, A. & Eveillard, D. A multi-objective constraint-based approach for modeling genome-scale microbial ecosystems. PLoS ONE 12, e0171744 (2017).
Article PubMed PubMed Central Google Scholar
García Sánchez, C. E. & Torres Sáez, R. G. Comparison and analysis of objective functions in flux balance analysis. Biotechnol. Prog. 30, 985–991 (2014).
Article PubMed Google Scholar
Varma, A. & Palsson, B. O. Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia coli W3110. Appl. Environ. Microbiol. 60, 3724–3731 (1994).
CAS PubMed PubMed Central Google Scholar
Boyle, N. R., Sengupta, N. & Morgan, J. A. Metabolic flux analysis of heterotrophic growth in Chalmydomonas reinhardtii. PLoS ONE 12, e0177292 (2017).
Article PubMed PubMed Central Google Scholar
Fischer, E. & Sauer, U. Large-scale in vivo flux analysis shows rigidity and suboptimal performance of Bacillus subtilis metabolism. Nat. Genet. 6, 636–640 (2005).
Article Google Scholar
Kitano, H. Biological robustness. Nat. Rev. Genet. 11, 826–837 (2004).
Article Google Scholar
Kaneko, K. Phenotypic plasticity and robustness: evolutionary stability theory, gene expression dynamics model, and laboratory experiments. Adv. Exp. Med. Biol. 751, 249–278 (2012).
Article CAS PubMed Google Scholar
Long, S. P., Ainsworth, E. A., Leakey, A. D., Nösberger, J. & Ort, D. R. Food for thought: lower-than-expected crop yield stimulation with rising CO₂ concentrations. Science 30, 1918–1921 (2006).
Article Google Scholar
Lobell, D. B., Cassman, K. G. & Field, C. B. Crop yield gaps: their importance, magnitudes, and causes. Ann. Rev. Environ. Res. 34, 179–204 (2009).
Article Google Scholar
Schellenberger, J. & Palsson, B. O. Use of randomized sampling for analysis of metabolic networks. J. Biol. Chem. 27, 5457–5461 (2009).
Article Google Scholar
Wiback, S. J., Famili, I., Greenberg, H. J. & Palsson, B. O. Monte Carlo sampling can be used to determine the size and shape of the steady-state flux space. J. Theor. Biol. 228, 437–447 (2004).
Article PubMed Google Scholar
Maarleveld, T. R., Wortel, M. T., Olivier, B. G., Teusink, B. & Bruggeman, F. J. Interplay between constraints, objectives, and optimality for genome-scale stoichiometric models. PLoS Comput. Biol. 11, e1004166 (2015).
Article PubMed PubMed Central Google Scholar
Brooks, S. P. & Roberts, G. O. Convergence assessment techniques for Markov chain Monte Carlo. Stat. Comp. 8, 319–335 (1998).
Article Google Scholar
Smith, A. & Stitt, M. Coordination of carbon supply and plant growth. Plant Cell Environ. 30, 1126–1149 (2007).
Article CAS PubMed Google Scholar
Pracharoenwattana, I. et al. Arabidopsis has a cytosolic fumarase required for the massive allocation of photosynthate into fumaric acid and for rapid plant growth on high nitrogen. Plant J. 1, 785–795 (2010).
Article Google Scholar
Dyson, B. C. et al. Acclimation of metabolism to light in Arabidopsis thaliana: the glucose 6-phosphate/phosphate translocator GPT2 directs metabolic acclimation. Plant Cell Environ. 38, 1404–1417 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dyson, B. C. et al. FUM2, a cytosolic fumarase, is essential for acclimation to low temperature in Arabidopsis thaliana. Plant Physiol. 172, 118–127 (2016).
Article CAS PubMed PubMed Central Google Scholar
Küstner L., Nägele T. & Heyer A. G. Mathematical modeling of diurnal patterns of carbon allocation to shoot and root in Arabidopsis thaliana. Nat. Sys. Biol. Appl. 5 (2019).
Chia, D. W., Yoder, T. J., Reiter, W.-D. & Gibson, S. I. Fumaric acid: an overlooked form of fixed carbon in Arabidopsis. Planta 211, 743–751 (2000).
Article CAS PubMed Google Scholar
Zell, M. B. et al. Analysis of Arabidopsis with highly reduced levels of malate and fumarate sheds light on the role of these organic acids as storage molecules. Plant Physiol. 152, 1251–1562 (2010).
Article CAS PubMed PubMed Central Google Scholar
Arnold, A. & Nikoloski, Z. Bottom-up reconstruction of Arabidopsis and its application to determining the metabolic costs of enzyme production. Plant Physiol. 165, 1380–1391 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dal’Molin, C. G. O., Queck, L. E., Saa, P. A. & Nielsen, L. K. A multi-tissue genome-scale metabolic modeling framework for the analysis of whole plant systems. Front. Plant. Sci. 6, 4 (2015).
Google Scholar
Cheung, C. Y. M., Poolman, M. G., Fell, D. A., Ratcliffe, R. G. & Sweetlove, L. J. A diel flux balance model captures interactions between light and dark metabolism during day-night cycles in C3 and crassulacean acid metabolism leaves. Plant Physiol. 165, 917–929 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mintz-Oron, S. et al. Reconstruction of Arabidopsis metabolic network models accounting for subcellular compartmentalization and tissue-specificity. Proc. Natl. Acad. Sci. USA 109, 339–344 (2012).
Article CAS PubMed Google Scholar
Poolman, M. G., Miguet, L., Sweetlove, L. J. & Fell, D. A. A genome-scale metabolic model of Arabidopsis and some of its properties. Plant Physiol. 151, 1570–1581 (2009).
Article CAS PubMed PubMed Central Google Scholar
Haraldsdottir, H. S., Cousins, B., Thiele, I., Fleming, R. M. T. & Vempala, S. CHRR: coordinate hit-and-run with rounding for uniform sampling of constraint-based models. Bioinformatics 33, 1741–1743 (2017).
Article PubMed PubMed Central Google Scholar
Kaufman, D. E. & Smith, R. L. Direction choice for accelerated convergence in hit-and-run sampling. Oper. Res. 46, 1 (1998).
Article Google Scholar
Megchelenbrink, W., Huynen, M. & Marchiori, E. optGpSampler: an improved tool for uniformly sampling the solution-space of genome-scale metabolic networks. PLoS ONE 9, e86587 (2014).
Article PubMed PubMed Central Google Scholar
Price, N. D., Schellenberger, J. & Palsson, B. O. Uniform sampling of steady-state flux spaces: means to design experiments and to interpret enzymopathies. Biophys. J. 87, 2172–2186 (2004).
Article CAS PubMed PubMed Central Google Scholar
Bordel, S., Agren, R. & Nielsen, J. Sampling the solution space in genome-scale metabolic networks reveals transcriptional regulation in key enzymes. PLoS Comput. Biol. 6, e1000859 (2010).
Article PubMed PubMed Central Google Scholar
Mo, M. L., Palsson, B. O. & Herrgård, M. J. Connecting extracellular metabolomic measurements to intracellular flux states in yeast. BMC Syst. Biol. 3, 37 (2009).
Article PubMed PubMed Central Google Scholar
Shlomi, T., Benyamini, T., Gottlieb, E., Sharan, R. & Ruppin, E. Genome-scale metabolic modeling elucidates the role of proliferative adaptation in causing the Warburg effect. PLoS Comput. Biol. 7, e1002018 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gelman, A. et al. Bayesian data analysis, 3rd edn (London, UK: Chapman and Hall/CRC, 2013).
Cowles, M. K. & Carlin, B. P. Markov chain monte carlo convergence diagnostics: sa comparative review. J. Am. Stat. Assoc. 91, 883–904 (1996).
Article Google Scholar
Brooks, S. P. & Gelman, A. General methods for monitoring convergence of iterative simulations. J. Comp. Graph. Stat. 7, 434–455 (1996).
Google Scholar
Lundmark, M., Cavaco, A. M., Trevanion, S. & Hurry, V. Carbon partitioning and export in transgenic Arabidopsis thaliana with altered capacity for sucrose synthesis grown at low temperature: a role for metabolite transporters. Plant Cell. Environ. 29, 1703–1714 (2006).
Article CAS PubMed Google Scholar
Strand, A., Foyer, C. H., Gustafsson, P., Gardeström, P. & Hurry, V. Altering flux through the sucrose biosynthesis pathway in transgenic Arabidopsis thaliana modifies photosynthetic acclimation at low temperatures and the development of freezing tolerance. Plant Cell Environ. 26, 523–535 (2003).
Article CAS Google Scholar
Strand, A. et al. Acclimation of Arabidopsis leaves developing at low temperatures. Increasing cytoplamic volumes accompanies increased activities of enzymes in the Calvin cycle and in the sucrose-biosynthesis pathway. Plant Physiol. 119, 1387–1398 (1999).
Article CAS PubMed PubMed Central Google Scholar
Nägele, T. & Heyer, A. G. Approximating subcellular organisation of carbohydrate metabolism during cold acclimation in different natural accessions of Arabidopsis thaliana. New Phytol. 198, 777–787 (2013).
Article PubMed Google Scholar
Mazzucotelli, E., Tartari, A., Cattivelli, L. & Forlani, G. Metabolism of γ-aminobutyric acid during cold acclimation and freezing and its relationship to frost tolerance in barley and wheat. J. Exp. Bot. 57, 3755–3766 (2006).
Article CAS PubMed Google Scholar
Beuve, N. et al. Putative role of γ-aminobutyric acid (GABA) as a long-distance signal in up-regulation of nitrate uptake in Brassica napus L. Plant Cell Environ. 27, 1035–1046 (2004).
Article CAS Google Scholar
Michaeli, S. & Fromm, H. Closing the loop on the GABA shunt in plants: are GABA metabolism and signaling entwined? Front. Plant Sci. 6, 419 (2015).
Article PubMed PubMed Central Google Scholar
Barbosa, J. M., Singh, N. K., Cherry, J. H. & Locy, R. D. Nitrate uptake and utilization is modulated by exogenous γ-aminobutyric acid in Arabidopsis thaliana seedlings. Plant Physiol. Biochem. 48, 443–450 (2010).
Article CAS PubMed Google Scholar
Atkinson, L. J., Sherlock, D. J. & Atkin, O. K. Source of nitrogen associated with recovery of relative growth rate in Arabidopsis thaliana acclimated to sustained cold treatment. Plant Cell Environ. 38, 1023–1034 (2015).
Article CAS PubMed Google Scholar
Müller, C., Scheible, W.-R., Stitt, M. & Krapp, A. Influence of malate and 2-oxoglutarate on the NIA transcript level and nitrate reductase activity in tobacco leaves. Plant Cell Environ. 24, 191–203 (2001).
Article Google Scholar
Schellenberger, J. et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat. Protoc. 6, 1290–1307 (2011).
Article CAS PubMed PubMed Central Google Scholar
Heirendt, L. et al. Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0. Nat. Protoc. 14, 639–702 (2019).
Article CAS PubMed Google Scholar
Ebrahim, A., Lerman, J. A., Palsson, B. O. & Hyduke, D. R. COBRApy: COnstraints-based reconstruction and analysis for python. BMC Syst. Biol. 7, 74 (2013).
Article PubMed PubMed Central Google Scholar
Saa, P. A. & Nielsen, L. K. ll-ACHRB: a scalable algorithm for sampling the feasible solution space of metabolic networks. Bioinformatics 32, 2330–2337 (2016).
Article CAS PubMed Google Scholar
Becker, N. B., Allen, R. J. & ten Wolde, P. R. Non-stationary forward flux sampling. J. Chem. Phys. 136, 174118 (2012).
Article PubMed Google Scholar
Damiani, C. et al. An ensemble evolutionary constraint-based approach to understand the emergence of metabolic phenotypes. Nat. Comput. 13, 321–331 (2014).
Article Google Scholar
Agren, R. et al. The RAVEN toolbox and its use for generating a genome-scale metabolic model for Penicillium chrysogenum. PLoS Comput. Biol. 9, e1002980 (2013).
Article CAS PubMed PubMed Central Google Scholar
Damiani, C. et al. A metabolic core model elucidates how enhanced utilization of glucose and glutamine, with enhanced glutamine-dependent lactate production, promotes cancer cell growth: the WarburQ effect. PLoS Comput. Biol. 13, e1005758 (2017).
Article PubMed PubMed Central Google Scholar
De Martino, D., Mori, M. & Parisi, V. Uniform sampling of steady states in metabolic networks: heterogeneous scales and rounding. PLoS One 10, e0122670 (2015).
Article PubMed PubMed Central Google Scholar
Hamra, G., MacLehose, R. & Richardson, D. Markov chain Monte Carlo: an introduction for epidemiologists. Int. J. Epidemiol. 42, 627–634 (2013).
Article PubMed PubMed Central Google Scholar
Raftery A. E. & Lewis S. M. “How many iterations in the Gibbs sampler?“ Bernardo J. M., Berger J., Dawid A. P., Smith A. F. M. 4th edn, (Oxford: Bayesian Statistics 1992).
Plummer, M., Best, N., Cowles, K. & Vines, K. CODA: convergence diagnosis and output analysis for MCMC. R. News 6, 7–11 (2006).
Google Scholar
Gweke J. Evaluating the accuracy of sampling-based approaches to calculating posterior moments. Oxford: J. O. Berger, A. P. Dawid, Smith A. F. M. (ed. 4) Bayesian Statistics: (Clarendon Press 1991).
Mueller, L. A., Zhang, P. & Rhee, S. Y. AraCyc: a biochemical pathway database for arabidopsis. Plant Physiol. 132, 453–460 (2003).
Article CAS PubMed PubMed Central Google Scholar
Ray J., Pincar A. & Seshadhri C. Are We There Yet? When to Stop a Markov Chain while Generating Random Graphs. International Workshop on Algorithms and Models for the Web-Graph, WAW: Algorithms and Models for the Web Graph, pp 153–164 (2012).
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 49, 583–621 (1952).
Article Google Scholar

Download references

Acknowledgements

H.A.H. is supported by a Biotechnology and Biological Sciences Research Council (BBSRC) Doctoral Training Partnership stipend (BB/M011208/1). B.C.D. was supported by a BBSRC research grant (BB/J041013/1).

Author information

Beth C. Dyson
Present address: Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
Lucy Vass
Present address: Bristol Veterinary School and Department of Population Health Sciences, University of Bristol, Bristol, UK

Authors and Affiliations

Department of Earth and Environmental Sciences, University of Manchester, Manchester, UK
Helena A. Herrmann, Beth C. Dyson & Giles N. Johnson
School of Biological Sciences, University of Manchester, Manchester, UK
Lucy Vass & Jean-Marc Schwartz

Authors

Helena A. Herrmann
View author publications
You can also search for this author in PubMed Google Scholar
Beth C. Dyson
View author publications
You can also search for this author in PubMed Google Scholar
Lucy Vass
View author publications
You can also search for this author in PubMed Google Scholar
Giles N. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Marc Schwartz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.M.S., G.N.J. and H.A.H. conceived and planned the analyses and wrote the manuscript. H.A.H. and L.V. performed the computational analyses. B.C.D. carried out the metabolite assays. H.A.H. carried out the plant physiology measurements. All authors reviewed the final manuscript.

Corresponding author

Correspondence to Jean-Marc Schwartz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Herrmann, H.A., Dyson, B.C., Vass, L. et al. Flux sampling is a powerful tool to study metabolism under changing environmental conditions. npj Syst Biol Appl 5, 32 (2019). https://doi.org/10.1038/s41540-019-0109-0

Download citation

Received: 14 March 2019
Accepted: 06 August 2019
Published: 02 September 2019
DOI: https://doi.org/10.1038/s41540-019-0109-0

This article is cited by

LooplessFluxSampler: an efficient toolbox for sampling the loopless flux solution space of metabolic models
- Pedro A. Saa
- Sebastian Zapararte
- Lars K. Nielsen
BMC Bioinformatics (2024)
Flux sampling in genome-scale metabolic modeling of microbial communities
- Patrick E. Gelbach
- Handan Cetin
- Stacey D. Finley
BMC Bioinformatics (2024)
Machine learning identifies key metabolic reactions in bacterial growth on different carbon sources
- Hyunjae Woo
- Youngshin Kim
- Sung Ho Yoon
Molecular Systems Biology (2024)
Genome-scale metabolic modelling enables deciphering ethanol metabolism via the acrylate pathway in the propionate-producer Anaerotignum neopropionicum
- Sara Benito-Vaquerizo
- Ivette Parera Olm
- Maria Suarez-Diez
Microbial Cell Factories (2022)
Addressing uncertainty in genome-scale metabolic model reconstruction and analysis
- David B. Bernstein
- Snorre Sulheim
- Daniel Segrè
Genome Biology (2021)