[PDF] Improved estimations of stochastic chemical kinetics by finite state expansion

Abstract

Stochastic reaction networks are a fundamental model to describe interactions between species where random fluctuations are relevant. The master equation provides the evolution of the probability distribution across the discrete state space consisting of vectors of population counts for each species. However, since its exact solution is often elusive, several analytical approximations have been proposed. The deterministic rate equation (DRE) gives a macroscopic approximation as a compact system of differential equations that estimate the average populations for each species, but it may be inaccurate in the case of nonlinear interaction dynamics. Here we propose finite state expansion (FSE), an analytical method mediating between the microscopic and the macroscopic interpretations of a stochastic reaction network by coupling the master equation dynamics of a chosen subset of the discrete state space with the mean population dynamics of the DRE. An algorithm translates a network into an expanded one where each discrete state is represented as a further distinct species. This translation exactly preserves the stochastic dynamics, but the DRE of the expanded network can be interpreted as a correction to the original one. The effectiveness of FSE is demonstrated in models that challenge state-of-the-art techniques due to intrinsic noise, multi-scale populations, and multi-stability.

Full PDF

IImproved estimations of stochastic chemical kinetics by finitestate expansion

Tabea Waizmann , Luca Bortolussi , Andrea Vandin , Mirco Tribastone IMT School for Advanced Studies, Lucca, Italy Department of Mathematics and Geosciences, University of Trieste, Italy Sant’Anna School for Advanced Studies, Pisa, Italy* [email protected] article has been submitted to the journal PLOS Computational Biology.

Abstract

Quantitative mechanistic models based on reaction networks with stochastic chemicalkinetics can help elucidate fundamental biological process where random fluctuationsare relevant, such as in single cells. The dynamics of such models is described by themaster equation, which provides the time course evolution of the probabilitydistribution across the discrete state space consisting of vectors of population levels ofthe interacting biochemical species. Since solving the master equation exactly is verydifficult in general due to the combinatorial explosion of the state space size, severalanalytical approximations have been proposed. The deterministic rate equation (DRE)offers a macroscopic view of the system by means of a system of differential equationsthat estimate the average populations for each species, but it may be inaccurate in thecase of nonlinear interactions such as in mass-action kinetics. Here we propose finitestate expansion (FSE), an analytical method that mediates between the microscopicand the macroscopic interpretations of a chemical reaction network by coupling themaster equation dynamics of a chosen subset of the discrete state space with thepopulation dynamics of the DRE. This is done via an algorithmic translation of aJuly 6, 2020 1/33 a r X i v : . [ q - b i o . M N ] J u l hemical reaction network into a target expanded one where each discrete state isrepresented as a further distinct chemical species. The translation produces a networkwith stochastically equivalent dynamics, but the DRE of the expanded network can beinterpreted as a correction to the original ones. Through a publicly available softwareimplementation of FSE, we demonstrate its effectiveness in models from systems biologywhich challenge state-of-the-art techniques due to the presence of intrinsic noise,multi-scale population dynamics, and multi-stability. Author summary

Many biological systems exhibit random fluctuations which are of fundamentalimportance, for example, at the level of single cells. The elucidation of such behaviorcan be assisted by quantitative mechanistic models that are typically described byreaction networks with stochastic kinetics, yielding a microscopic description that tracksdiscrete changes in the populations of the interacting biochemical species. In manycircumstances, however, exact analytical solutions of these models are not available, andsimulation methods can be too demanding computationally. On the other hand,macroscopic descriptions that are based on deterministic approximations can yieldinaccurate estimates because they fail to account for the inherent stochasticity in thesystem. In this article we present finite state expansion, a method that interpolatesbetween the microscopic and macroscopic views by explicitly tracking a subset of theoriginal discrete configurations and consistently coupling their dynamics withdeterministic variables. Using a software implementation of our method on models fromthe literature that challenge other state-of-the-art approximation techniques, we showthat finite state expansion improves deterministic estimates when the stochasticity ofthe system cannot be neglected.

Introduction

Chemical reaction networks are a fundamental model to analyze species that interactstochastically through reaction channels according to dynamics governed by thewell-known master equation [1]. This provides a microscopic description in terms of aJuly 6, 2020 2/33et of coupled linear differential equations, each defining the time course of a discretestate of the system as a vector of population counts of the chemical species involved. Itis widely understood, however, that the analysis of the master equation is intractable ingeneral, since analytical solutions are available only in special cases and directnumerical integration is hindered by the combinatorial growth of the state space as afunction of the abundances of the species. Networks with large numbers of species andreactions, and the correspondingly huge state spaces that they typically subsume, alsohave a considerable impact on the computational cost of stochastic simulationmethods [2]. In addition, forgoing an analytical treatment in favor of simulation maypreclude other important studies such as stability, perturbation analysis, bifurcation,and parameter inference [3, 4]. In all these cases it is useful to consider analyticalapproximations of the master equation that trade off precision with cost.The deterministic rate equation (DRE) provides a macroscopic dynamical view of achemical reaction network by associating one ordinary differential equation with eachspecies. The DRE solution gives the exact mean population levels as a function of timeif each reaction’s propensity function is linear, as occurs, for instance, in monomolecularchemical reaction networks [2]. With nonlinear propensity functions, under mildconditions the DRE does give the true expectations only in the thermodynamic limit [5].Away from this asymptotic regime, nonlinear propensities lead to DREs that provideonly an approximation to the true mean dynamics. This is the case, for example, inmodels of cell regulation which depend on low-abundance species (in the order of a fewunits) to describe the behavior of genes [6]. Processes such as activation anddeactivation that vary with time as a result of various interactions may introducesignificant variability in gene expression [7], caused by inherent stochasticity in thebio-molecular processes involved [8, 9]. Since such forms of noise are not accounted forin the DRE, approximation errors may be large.Here we present finite state expansion (FSE), a method which offers a mediationbetween the discrete and continuous representations of the master equation and theDRE, respectively. The former can be interpreted as corresponding to a situation whereevery discrete state is tracked; the latter, on the other hand, corresponds to thesituation where no discreteness is kept and the chemical species are observed onlythrough their approximate average populations. In essence, FSE bridges these twoJuly 6, 2020 3/33escriptions by keeping track discretely of only a user-defined subset of the state space,while collapsing the rest as a continuous approximation. In particular, the state spaceto be kept discretely is determined by a parameter which specifies the maximumpopulation level for each species to be tracked explicitly.FSE is realized by means of a systematic translation of a chemical reaction network(with arbitrary propensity functions) into an expanded one which features additionalspecies and modified reactions. Specifically, each tracked discrete state is represented asa new auxiliary species; the original set of reactions is transformed such that thedynamics of the auxiliary species are coupled with those of the original species, whoserole is to buffer the probability mass that falls out of the state space that is tracked. Inthis respect, FSE can be seen as a mass-preserving variation to the well-known finitestate projection method, which truncates the state space [10].FSE enjoys two useful properties. The first concerns its soundness , in the sense thatany expanded chemical reaction network is stochastically equivalent to the original one.In other words, their master equations can be put in exact correspondence with eachother. This is formally done by proving a result of aggregation for Markov chains knownas ordinary lumpability [11]. It essentially states that the state space of the expandednetwork can be projected onto a lower-dimensional one which still satisfies the Markovproperty and which turns out to correspond to the original network. Importantly, suchexact correspondence does not carry over to the respective DREs of the original and theexpanded networks. Indeed, any expansion arising from a strict subset of the discretestate space will lead to a DRE with more equations, which can be interpreted as refiningterms for the mean estimates. We demonstrate this experimentally with a number ofcase studies taken from the systems biology literature. Our second theoreticalcontribution is a result of asymptotic correctness , stating that if every discrete state istracked then the DRE of the expanded network corresponds to the master equation.There are several approaches for improving the accuracy of the DRE. These includemoment-closure approximations [12], the effective mesoscopic rate equation, which addscorrection terms to van Kampen’s well-known system size expansion [13, 14], and hybridtechniques [15]; ref. [3] offers an up-to-date review. However, these methods areapplicable under certain assumptions such as smoothness of the propensityfunctions [13, 16, 17], mass-action kinetics [18–22], specific structure of the chemicalJuly 6, 2020 4/33eaction network, e.g., to describe gene regulatory systems [23], and species that can bepartitioned into low- and high-abundance classes [15, 24, 25]. FSE, instead, can beapplied to any chemical reaction network in principle. Additionally, the case studiespresented in this paper were chosen as representative instances that may challenge thequality of the approximations by state-of-the-art methods. With these models we showthat FSE improves mean estimates when implementations of moment-closureapproximations give unphysical results or when the hybrid method from ref. [15]experiences numerical difficulties. Otherwise, FSE may outperform the effectivemesoscopic rate equation, and provide more accurate approximations than finite stateprojection when both methods track the same subset of the discrete state space.

Results

In this section we use a number of case studies from the literature to show that FSE canrefine the accuracy of the approximation of mean estimates even with modestexpansions. Analytical solutions of the master equations for the chosen models are notknown, and numerical solutions are difficult because the models give rise to Markovchains with infinite state spaces. For these reasons, we considered ground-truth meantrajectories computed by stochastic simulation via Gillespie’s algorithm [2]. Thenumerical experiments herein reported were performed with an implementation of FSEpublicly available with the software tool ERODE [26].

Schl¨ogl model

The well-known Schl¨ogl system is an autocatalytic process for a single species X [27].The DRE of the original Schl¨ogl model has two equilibrium points, owing to its strong(cubic) nonlinearity [28], deterministically converging only to one [29]. Its discrepancywith respect to the average mean trajectory computed by stochastic simulation has beenobserved for a long time [30]. Fig 1 provides a fully worked application of our FSE as afunction of the upper bound for species X , denoted by O X , which determines thelargest population level to be tracked explicitly in the expansion. The solutions to theDRE of the expanded networks show that larger values of such upper boundincreasingly improve the accuracy of the mean estimates.July 6, 2020 5/33 R1) 2 X k ! X (R2) 3 X k ! X (R3) k ! X (R4) X k ! AAACknicbZFLTwIxEMe76xsf4OPmpRE1eiH7wGg8KIaLBw9qRElYQrqlQEP3kXZWJZv9QH4db34by0qiApO0+WfmN9PpjB8LrsCyvgxzYXFpeWV1rbC+sblVLG3vPKsokZQ1aCQi2fSJYoKHrAEcBGvGkpHAF+zFH9bH8ZdXJhWPwicYxawdkH7Ie5wS0K5O6eMYe8DeIT15tE8zfIyx09S39y55fwBEyugtHXbsDLtN7HmFX9rJaXcO7GTjGv9gN4dnSDfDU2D1NNMtzClazTqlslWxcsOzwp6IMprYfaf06XUjmgQsBCqIUi3biqGdEgmcCpYVvESxmNAh6bOWliEJmGqn+UgzfKQ9XdyLpD4h4Nz7NyMlgVKjwNdkQGCgpmNj57xYK4HeRTvlYZwAC+nPQ71EYIjweD+4yyWjIEZaECq57hXTAZGEgt5iQQ/Bnv7yrHh2KrZbcR6q5drhZByraB8doBNko3NUQ7foHjUQNYrGmXFlXJt75qV5Y9Z/UNOY5Oyif2befQOMi8Bc (R1.1) J n K f ( n ) ! J n K + X, n = O X (R1.2) J n K f ( n ) ! J n + 1 K ,  n < O X f ( n ) = J n K k ( X + n )( X + n / J n K f ( n ) ! J n K ,  n  O X (R2.2) J n K f ( n ) ! J n K , < n < X + J K f ( n ) ! J K , n = 0 f ( n ) = J n K k ( X + n )( X + n X + n / J n K f ( n ) ! J n K + X, n = O X (R3.2) J n K f ( n ) ! J n + 1 K ,  n < O X f ( n ) = J n K k (R4.1) J n K f ( n ) ! J n K , < n  O X (R4.2) X + J n K f ( n ) ! J n K , n = 0 f ( n ) = J n K k ( X + n ) AAAGwHiclZRbbxJBFMeXWrDipa2+aHyZiDQQbLu3qA81aTQmvlkvbUkYQoZhKBt2Z7czs7a42Re/pR/Bb+HZS7kUWugkwNkzZ87/nN8ephu4jlS6/rewdm+9WLq/8aD88NHjJ5tb209PpB8Kyo6p7/qi2SWSuQ5nx8pRLmsGghGv67LT7vBTsn/6iwnp+PynGgWs7ZEz7vQdShS4Otvrf6ro82VAeBKB+r5AQ6OMFbtUUe27sWfUY7Szg/APRRSLODwgfCmcs4EiQvgXUb9j1DjETAIaqPkGojh8PiDsg3ZSWvQ17jQRxlOpzTumbhhxklhH2GXnaf6DBfmrEwErE0A1c5fXm1DYUqm5SNCE2CnNA2QmKjuQNzuUNHmVdgieZoPXk69do76fhaZVXWdsjkGYKzA2Z0DsZiCsCYjUuJm1uQLrhRI6IM5AW7P5crRTqHRw3JZSjydTATgxyhkmQWgWojkNMf0x6/tvb2ZpjQuzVmBpLZjX5eNqrYDQuuO4jufImmdgoYWt2hmCq6rsFfq1b3uxSwbHvup6yb9nVmP6TY97tOd7tGtJWnB3tir6np4uNG8YuVHR8nXU2S5Ucc+noce4oi6RsmXogWpHRCiHuiwu41CygNAhOWMtMDnxmGxH6W0Zoyp4einPvs8VSr3TJyLiSTnyuhDpETWQ1/cS56K9Vqj679uRw4NQMU4zoX7oIuWj5OpFPUcwqtwRGIQKB2pFdEAEoQou6BkVqTwiRqIHnXB2QX3PI7wXZeziltGOMOMyFCypIcKu24UkQ6ZQxUBYiPwpjgGscR3jvHECVxDM9ze7cvg6R7yhvdReaTXN0N5ph9oX7Ug71uj6v+Jm8XnxReljaVDyS+dZ6FohP/NMm1ml3/8BhvMT0g== D R A F T where the ‘+’ symbol in the reaction denotes multiset union,and multisets ÷ , Â and o Õ œ N S are given componentwise by ÷ S = max(0 , ﬂ S ≠ o S ) Â S = max(0 , max(0 , o S ≠ ﬂ S ) + ﬁ S ≠ O S ) o Õ S = min( O S , max(0 , o S ≠ ﬂ S ) + ﬁ S ) . Intuitively, for each original reaction, Eq. (1) considers its behavior with respect to each observed conﬁguration J o K . Any expanded reaction maintains the same overall counts of educts and products as the originating reaction, with a target observed conﬁguration J o Õ K that results from the addition of products and removal of educts within the upper bound O . The multisets of original population classes ÷ and Â act as buer pools for conﬁgurations that are not explicitly observed. An example of such a construction is discussed in detail in

Fig. 1. Finally, the propensity function f o is derived from that of the original reaction f as f o : R S O æ R +0 , with f o ( x ) = x J o K · f ( o + x | S ) , [2] where, for a given x œ R S O , x | S denotes its projection onto the original set of population classes S . This modiﬁcation accounts for the fact that the observed state J o K encodes additional population counts, as given by the multiset o . Importantly, we prove that such a translation preserves the stochastic properties of the RN in the sense of ordinary lumpability of Markov chains (26) (see

SI Appendix, SI Text ). Denoting by ˆ P the probability distribution in the expanded RN, ordinary lumpability implies that P ‡ ( t ) = ÿ o + › = ‡ ˆ P J o K + › ( t ) , for all t and ‡ œ N S . [3] That is, the ME solution for a state ‡ in the original RN will exactly correspond to the sum of the ME solutions for all states in the expanded RN that track the same overall population levels. Furthermore, when the RN is fully expanded, i.e., when O = N S , we recover the original ME. Although the stochastic behavior of the source RN and any expansion are equivalent in this speciﬁc sense, their respective

DREs are not. The target RN has | O | + | S | variables: its solution can be interpreted as a corrected estimate of the solution of the | S | -variable source DRE. Applications

Schlögl’s model.

The Schögl model is an extensively studiedtri-molecular scheme (29), given by the RN A + 2 X k ≠æ X + A X k ≠æ X [4] B k ≠æ X + B X k ≠æ ÿ [5]Here, the parameters k , k , k , k are mass-action kinetic parameters. The associated propensity function is deﬁned in the usual way, by counting the total distinct individual reactions that can occur in every state: for a reaction with reagents ﬂ and kinetic parameter k , the propensity function for state ‡ is thus given by f k ( ‡ ) = k r S œ S ! ‡ S ﬂ S " . The Schlögl model describes an autocatalytic process for species X in the presence of reservoirs for chemical species A and B which we assume not to vary with time. Overall, the scheme results in a one-dimensional RN which only tracks Time X (a) Individual simulation traces

Time X (b) Deterministic estimates

Fig. 2.

Evaluation of the Schlögl model with scheme in Eqs. 4-5 using kinetic param-eters k = 3 · ≠ , k = 10 ≠ , k = 10 ≠ , k = 3 . , taken from ref. (28).A) Representative realizations of the stochastic process demonstrate bimodality, withsteady state populations approaching ca. 600 and ca. 100, respectively, when startingfrom an initial condition with 200 molecules of species X , molecules of species A , and · molecules of species B . B) The DRE converges to a single equilibrium(ca. 85.50, blue line), causing a noticeable discrepancy with respect to the true mean(dotted line, computed as the average of simulations). Finite state expansionachieves excellent agreement with an upper bound O X = 650 . Time M A / M B SIM DRE 1-5 2-10 2-150 100 200 300 400

Time P A / P B Fig. 3.

Numerical simulations of the genetic toggle switch in scheme (6) comparingstochastic simulation, DRE and ﬁnite state expansions ﬁxing O PA = O PB = 0 while using different upper bounds O M – O S for the number of copies of M A / M B and S A / S B (as indicated in the legend), respectively. Initial condition was the zerostate. The ODE system size for the tested choices of upper bounds is equal to ( O M + 1) · ( O S + 1) + 6 (corresponding to 150, 1095 and 2310 equations for O M – O S = 1 – , O M – O S = 2 – , and O M – O S = 2 – , respectively). Kineticparameters were chosen as follows: k = 0 . , k = 0 . , k = 1 . , k = 10 . , k = 0 . , k = 0 . , k = 20 . . Protein production (right plot) is controlled by alow population of precursor mRNA (left plot), which causes signiﬁcant underestimationerrors with DRE. Increasing the upper bounds of ﬁnite state expansion consistentlyimproves the accuracy of the mean estimate. The corrections for species S A and S B , not reported here (see SI Fig.

S5), are similar. the number of molecules of X , while the initial populations of molecules A and B are taken as further model parameters. The discrepancy between the stochastic kinetics and the DRE approximation has been observed for a long time (30). Under an appropriate choice of kinetic parameters, the DRE features two equilibrium points owing to the strong (cubic) nonlinearity in the ODEs (31). Analogously, the underlying inﬁnite-state stochastic process is well-known for the bimodality of the steady-state probability distribution of species X . In this case, the DRE may provide inaccurate estimates of the average population of species X because the DRE will deterministically converge only to one steady state (32). Finite state expansion can correct the mean estimates by expanding increasingly larger populations of species X , paying a linear cost in the resulting ODE system size (Figure 2). Genetic Toggle Switch.

The toggle switch network is a funda-mental regulatory system of two mutually repressing genes (33).Models of toggle-switch networks are mathematically challeng- et al.

PNAS |

December 12, 2019 | vol. XXX | no. XX | Original reaction network ··· i ··· Reaction network with finite state expansion

Observation bound D R A F T where the ‘+’ symbol in the reaction denotes multiset union,and multisets ÷ , Â and o Õ œ N S are given componentwise by ÷ S = max(0 , ﬂ S ≠ o S ) Â S = max(0 , max(0 , o S ≠ ﬂ S ) + ﬁ S ≠ O S ) o Õ S = min( O S , max(0 , o S ≠ ﬂ S ) + ﬁ S ) . Intuitively, for each original reaction, Eq. (1) considers its behavior with respect to each observed conﬁguration J o K . Any expanded reaction maintains the same overall counts of educts and products as the originating reaction, with a target observed conﬁguration J o Õ K that results from the addition of products and removal of educts within the upper bound O . The multisets of original population classes ÷ and Â act as buer pools for conﬁgurations that are not explicitly observed. An example of such a construction is discussed in detail in

DREs are not. The target RN has | O | + | S | variables: its solution can be interpreted as a corrected estimate of the solution of the | S | -variable source DRE. Applications

Schlögl’s model.

Time X (b) Deterministic estimates

Fig. 2.

Time P A / P B Fig. 3.

The toggle switch network is a funda-mental regulatory system of two mutually repressing genes (33).Models of toggle-switch networks are mathematically challeng- et al.

PNAS |

December 13, 2019 | vol. XXX | no. XX | Continuous-time Markov chains Mean approximationsStochastic simulations Deterministic Rate Equations

Original coupled mass-action propensityconditional probability O X AAAB9XicbVC7SgNBFL0bXzG+onbaDAbBKuzGQjsDWtgZwTwgWcPsZDYZMju7zMwqYcl/2FgoYusH+BcWgt+h9s4mKTTxwMDhnHu4d44Xcaa0bX9Ymbn5hcWl7HJuZXVtfSO/uVVTYSwJrZKQh7LhYUU5E7Sqmea0EUmKA4/Tutc/Tf36DZWKheJKDyLqBrgrmM8I1ka6boXGTLPJxbDdaOcLdtEeAc0SZ0IKJ5/v32evO1+Vdv6t1QlJHFChCcdKNR070m6CpWaE02GuFSsaYdLHXdo0VOCAKjcZXT1E+0bpID+U5gmNRurvRIIDpQaBZyYDrHtq2kvF/7xmrP1jN2EiijUVZLzIjznSIUorQB0mKdF8YAgmkplbEelhiYk2ReVMCc70l2dJrVR0DoulS6dQRjBGFnZhDw7AgSMowzlUoAoEJNzBAzxat9a99WQ9j0cz1iSzDX9gvfwAqaOXrQ== J K , J K , ··· J K , J K , ······ ··· ··· J O X K , J O X K , ··· J O X K , J O X K , ··· AB CD

Original Finite state expansion E Finite state expansion F d J n K dt = I n { f ( n ) + f ( n ) } + I  n  O X f ( n I  n  O X f ( n + 1)+ I  n  O X f ( n I  n  O X f ( n + 1) dXdt = f ( O X ) + f ( O X ) f (0) f (0) AAAEKHicnVNbb9MwFHYTLqNctsEjLxaFaVXVKm4rsQc2TeKFPTEE3SrVJXIcp43qOMFxQZXlP8SvgSe0V34JzmUa7QYPsxTly/H5zuU7J0HG41x53kXDce/cvXd/60Hz4aPHT7Z3dp+e5elSUjaiKU/lOCA547FgIxUrzsaZZCQJODsPFm+L+/OvTOZxKj6pVcamCZmJOIopUdbk7zZ+4kgSqkP8URHFtDBGh8rAPXgIuxCfiFAL+Abi1AYpcuj3xh8biHXko33Rhh0Y+QMLsLXh5h7E2ZwIlSb60Fzxj6BXUKxr/5IzLMBNpE5FQhBz9gWK6rWZvkzeRW34T773H34XmaqUjo1w2wIGVQG3yW+FQaaUoC6gHsH4SvqiwXXSpdSbVhus6MVrl2Bogb/T8npeeeB1gGrQAvU5tUtwhMOULhMmFOUkzyfIy9RUE6liyplp4mXOMkIXZMYmFgqSsHyqy+Uz8JW1hDBKpX2EgqX1b4YmSZ6vksB6JkTN8827wnjjXa4SIlcy3MivooOpjkW2VEzQKn205FClsNhvGMaSUcVXFhAqY9sBpHNi9VX2L2hiwb7RNEmIHVG18WaCphpzHliXBVOwhbCU9YdZ97dzrbyLWoNInxhft5AxTas42tT3Ojjr99Cg1/8wbB2/rLXfAs/BC7APEHgNjsE7cApGgDoHzmdn5szd7+4P95d7Ubk6jZrzDKwd9/cfOftNfw== dXdt = k X / k X / k k X AAACGXicbZDLSsNAFIYn9VbrrerSzWAVBLFN0qJuhIIblxVsG+glTCaTduhkEmYmQgl5DTe+ihsXirjUlW/j9LLQ6g8DH/85hzPn92JGpTLNLyO3tLyyupZfL2xsbm3vFHf3WjJKBCZNHLFIOB6ShFFOmooqRpxYEBR6jLS90fWk3r4nQtKI36lxTHohGnAaUIyUttyi2Q0EwqnvZKmvMngFR67l9O2KDc802k6/WjmHpxqrU6PmuMWSWTangn/BmkMJzNVwix9dP8JJSLjCDEnZscxY9VIkFMWMZIVuIkmM8AgNSEcjRyGRvXR6WQaPtePDIBL6cQWn7s+JFIVSjkNPd4ZIDeVibWL+V+skKrjspZTHiSIczxYFCYMqgpOYoE8FwYqNNSAsqP4rxEOko1I6zIIOwVo8+S+07LJVLdu3tVL9aB5HHhyAQ3ACLHAB6uAGNEATYPAAnsALeDUejWfjzXifteaM+cw++CXj8xtMDZyQ Fig 1. The FSE method applied to the Schl¨ogl system [27]. (A) Mass-actionreactions with kinetic parameters taken from ref. [31]: k = 0 . k = 0 . k = 200, k = 3 .

5. (B) Stochastic simulations show the well-known bimodality of the steady-stateprobability distribution of species X . (C) For a given upper bound O X on thepopulation of X to be tracked explicitly, FSE yields the auxiliary species denoted by (cid:74) (cid:75) , (cid:74) (cid:75) , . . . , (cid:74) O X (cid:75) . The original species X acts as buffer which collects untrackedpopulations levels. For example, reaction R1.1 derives from reaction R1 when theautocatalytic formation of a new molecule occurs when the system tracks the discretestate (cid:74) O X (cid:75) , thus requiring to increase the buffer species X by one element. Even whenthe system tracks a discrete state which does not require buffering (R1.2), thepropensity function f ( n ) of the reaction effectively considers an overall kinetics ofmass-action type, since the factor k ( X + n )( X + n − / X + n indistinguishable molecules.Intuitively, the factor (cid:74) n (cid:75) conditions these events to the system tracking n discretemolecules. (D) The original state space counts the number of copies of X . The statespace in the expanded network consists of the pair tracked discrete state/populationlevel of the buffer species. Here, our result of soundness (see Methods) states that thesum of the probabilities across all pairs that have the same overall population matchesthe corresponding probability in the original Markov chain (as exemplified by matchingcolors of the states). (E) The single-dimensional DRE of the original Schl¨ogl model isexpanded into a DRE with O X + 1 variables (where I denotes the indicator function);an estimate of the total mean population at time t can be computed as X ( t ) + (cid:80) n n · (cid:74) n (cid:75) ( t ). F) Starting from a population of 200 elements of X , the originalbi-stable DRE converges to one equilibrium at ca. 85.50 (blue line). FSE achievesexcellent agreement with an upper bound O X = 650 (with respect to the averagecomputed by stochastic simulation with 100000 repetitions).July 6, 2020 6/33 Time P r o t e i n P Time G ene bound D b Fig 2. Genetic feedback switch in Eq 1.

Numerical simulations comparingstochastic simulation (1E+06 repetitions), DRE and FSEs for fixed O D u = O D b = 1and different upper bounds O P . The resulting DRE from FSE has 2 · O P + 2 equations.Kinetic parameters were set as follows: r u = 1 . r b = 0 . k f = 0 . k b = 1 . s b = 10 . s u = 0 .

5. The initial state is (

P, D u , D b ) = (0 , , Genetic feedback switch

Let us now consider a model for a genetic feedback switch taken from Refs. [32, 33]: D u r u −→ D u + P D b s u −→ D u + PD b r b −→ D b + P D u + P s b −→ D b (1) D b k b −→ D u P k f −→ Species D u and D b represent the state of a single gene when its promoter region isunbound (respectively, bound) to a protein P . The reaction propensities obey the law ofmass action. This is a basic model for negative autoregulation, a well-known motifappearing in more than 40% of the known transcription factors in E.coli [34]. Here, anatural choice of upper bounds for the gene species is O D u = O D b = 1, by which theDRE of the expanded network can be interpreted as the solution of the conditionalexpectation of the protein population based on the gene state. Small values of O P yielda significant correction of the protein levels as well as of the marginal probabilitydistribution of the gene state (Fig 2).July 6, 2020 7/33 enetic toggle switch The toggle switch network is a fundamental regulatory system of two mutuallyrepressing genes [35]. Its mathematical modeling is challenging because ofmultimodality [23, 36], as well as stochastic noise due to the species such as mRNApresent in low molecular abundances [37]. Here we study the reaction scheme analyzedin ref. [38], consisting of a mass-action variant presented in ref. [35]: k −→ M i M i k −→ M i k −→ S i S i k −→ S i + P i S i k −→ P i k −→ ∅ , (2) S i + M j k −→ S i , i, j ∈ { A, B } , i (cid:54) = j, where M i and S i denote the precursor mRNA and the mRNA for target protein P i .The last two reactions model mutual inhibition by means of a precursor of one proteinrepressing the mRNA of the other.When protein production is controlled by low populations of precursor mRNA, thestochastic fluctuations are not adequately approximated with DRE. By explicitlyobserving few copies of mRNA (up to tens) our method provides precise estimates ofthe time courses of the mean populations (Fig 3). The resulting equations, of size atmost 2310 in our tests, can be analyzed effectively, as opposed to time-consumingstochastic simulations using hybrid approaches such as those reported in ref. [38]. Comparison with related work

Using the models presented in the previous section, here we compare FSE withstate-of-the-art techniques which can be used to obtain approximate estimates of meanpopulation levels in stochastic reaction networks. Specifically, we considered thefollowing methods: • Moment-closure approximation (MCA). We considered the second-orderlow-dispersion moment closure [39, 40], in which variance and covariance are thehighest observed moments and all higher-order central moments are set to zero; inall models considered in this paper, computing approximations with higher-ordermoments did not improve the quality of the approximation.July 6, 2020 8/33

100 200 300 400

Time M A / M B SIM DRE 1-5 2-10 2-150 100 200 300 400

Time P A / P B Fig 3. Toggle switch network in Eq 2 . Numerical simulations comparingstochastic simulation (500000 repetitions), DRE and FSE by fixing O P A = O P B = 0while using different upper bounds O M and O S ( O M – O S in short) for the number ofcopies of M A / M B and S A / S B (as indicated in the legend), respectively. Initialcondition was the zero state. The size of the DRE for the tested choices of upperbounds is equal to ( O M + 1) · ( O S + 1) + 6 (corresponding to 150, 1095 and 2310equations for O M – O S = 1–5, O M – O S = 2–10, and O M – O S = 2–15, respectively).Kinetic parameters were chosen as follows: k = 0 . k = 0 . k = 1 . k = 10 . k = 0 . k = 0 . k = 20 .

0. Protein production (right plot) is controlled by a lowpopulation of precursor mRNA (left plot), which causes significant underestimationerrors with DRE. Increasing the upper bounds of FSE improves the accuracy of themean estimate. Corrections for species S A and S B , not reported here, are similar. • The effective mesoscopic rate equation (EMRE), which adds mean-correctionterms to the linear-noise approximation under the assumption of an underlyingGaussian process [13]. • The method of conditional moments (MCM), a hybrid analytical techniquecombining a discrete representation of low-abundance species and a moment-basedapproximation of high-abundance ones [15]. • Finite state projection (FSP), which truncates the state space of a Markov chainby redirecting transitions toward unobserved states into an absorbing state withprovable bounds [10].For this study, we used an implementation of the techniques as available on the softwaretool CERENA [39].The Schl¨ogl model is known to stress MCA because of their reported difficulties withmultimodal distributions [41, 42]. Fig 4 shows that MCA behaves similarly to DRE inthis case, while EMRE tends to overestimate the mean population of species X atlonger time horizons. Similar results were obtained on the toggle switch network (Fig 5).July 6, 2020 9/33 Time X SIMDREMCAEMREFSE

Fig 4. Comparison with related techniques on the Schl¨ogl model.

Theaverage population of X computed by stochastic simulation (100000 repetitions) iscompared against DRE, MCA, EMRE and FSE with O X = 650. Time P A / P B SIMDREMCAEMREFSE

Fig 5. Comparison with related techniques on the toggle switch network.

Stochastic simulation to compute the average populations of species P A /P B (500000repetitions) is compared against DRE, MCA, EMRE, and FSE. FSE is run with upperbounds O P = 0, O M = 2 and O S = 10. MCA estimates population levels approaching75000 (out of scale in this plot to improve readability) before dropping to zero.Here we additionally confirm physically meaningless moment-closure estimates due tothe presence of low-abundance species, as already reported in Ref. [21].In the genetic feedback switch model, species D u and D b describe the distinct binarystates of a single gene. Hence they represent the natural candidates of thelow-abundance class when applying MCM. On this model, however, the method couldnot return valid results as early as time point 0 .

36. We further tested a gene regulatorymodel with an inhibition feedback loop taken from Ref. [43], which however showedsimilar difficulties, thus confirming already reported numerical issues. [3] (Thenumerical results of this analysis, not reported here, are replicable using the supportingdata of this article.)July 6, 2020 10/33

Time X SIMFSP 300FSP 450FSP 650FSP 700FSP 750

Fig 6. Analysis by FSP of the Schl¨ogl model.

With parameter settings as inFig 1, for state spaces of equal size as in Fig 1F (450 and 650), FSE estimates theaverage population of X more precisely than finite state projection (stochasticsimulation performed with 100000 repetitions). In particular, while FSE requiresrequires a state space with 650 states to accurately match stochastic simulations, FSPneeds 750 states.Defining incoming and outgoing transitions with respect to the buffer speciesmaintained in the expanded network represents a crucial difference with FSP, wheretransitions toward unobserved state are collapsed into a sink state that absorbs theprobability mass. Experimentally, this results in increased accuracy of mean estimatesby FSE when tracking the same subset of the state space in both methods (Fig 6).The solution by FSP is a lower bound on the true probability distribution, andincreasing the set of observed states tightens that bound. Instead, although FSEensures that the expansion coincides with the master equation when the whole statespace is tracked, it does not give theoretical guarantees on the degree of accuracy, nordoes it guarantee monotonically increasing accuracy with larger observation bounds.Indeed, experimentally we confirmed that monotonicity of the error is model dependent.For instance, the relative percentage error between the mean population predicted byFSE and the estimated mean by stochastic simulation is not monotonic in the Schl¨oglsystem (Fig 7), while it is monotonic in the case of the genetic feedback switch andtoggle switch models (Figs. 8-9).July 6, 2020 11/33 Time X % E rr o r aga i n s t SSA

Fig 7. Sensitivity analysis of FSE in the Schl¨ogl model.

Using parameters as inFig 1, the relative percentage error on the estimate of the mean population of X computed by stochastic simulation (100000 repetitions) shows non-monotonic behaviorof the accuracy of FSE with increasing observation bound O X . Methods

Preliminaries

In order to introduce the FSE method, we first require some preliminary mathematicaltheory and notation. Consider a set of species S . Then, N S and R S are the sets of allinteger and real-valued vectors, respectively, with coordinates represented by theelements in S . For a given vector σ ∈ R S (or σ ∈ N S ), we denote by σ S the value ofthe component corresponding to species S ∈ S . We define the following operations forany two σ, µ ∈ R S . Minimum σ ∧ µ is such that ( σ ∧ µ ) S = min( σ S , µ S ) for all S ∈ S . Saturated subtraction σ (cid:9) µ is such that ( σ (cid:9) µ ) S = max(0 , σ S − µ S ) for all S ∈ S . Projection

Given P ⊆ S , σ | P ∈ R P is such that ( σ | P ) P = σ P for all P ∈ P . Mapping

Given P ⊆ S , and a function m : S → P , σ m ∈ R P is such that σ mP = (cid:80) m ( S )= P σ S , for all P ∈ P .We generalize binary operations to the case where operands σ and µ are such that σ ∈ R S and µ ∈ R S , with S (cid:54) = S : each binary operation treats them as elements of R S ∪ S .July 6, 2020 12/33 P % E rr o r aga i n s t SSA Time

020 41002030 6 P % E rr o r aga i n s t SSA

Fig 8. Sensitivity analysis of FSE in the genetic feedback switch model. (Top) Monotonic behavior of the time-dependent accuracy of FSE against stochasticsimulation (1E+06 repetitions) for increasing observation bounds of O P , fixing O D b = O D u = 1. (Bottom) Error behavior in the steady state (estimated at time point t = 50) shows a significant impact of explicitly tracking the discrete states D b , D u ofthe gene.July 6, 2020 13/33 P % E rr o r

50 560 17080 10 15 2

Fig 9. Sensitivity analysis of FSE in the genetic toggle switch model.

Monotonic behavior of the accuracy of FSE against stochastic simulation (500000repetitions) for increasing observation bounds of O M and O S at time point t = 400,representative of steady-state, fixing O P = 0.Formally we denote a reaction network as a pair ( S , R ), where R is a set ofreactions. Each reaction is provided as a triple denoted by ρ f −→ π, (3)where ρ ∈ N S are the reactants , π ∈ N S are the products , and f is the propensityfunction , f : R S → R +0 , with arbitrary form. We will use the standard notation forreactants and products whereby only the nonzero components are written out,separated by the plus sign. For instance, given the species S = { A, B, C } , the reaction A + 2 B f −→ C (4)corresponds to Eq 3 with ρ A = 1, ρ B = 2, ρ C = 0 and π A = π B = 0, π C = 1.A discrete state of a reaction network is described by a vector σ ∈ N S , where σ S denotes the population of species S in that state. Then, f ( σ ) is a non-negative realwhich gives the parameter of the exponential distribution that governs the firing time ofthat reaction. Upon firing, the system may transition from state σ to σ + π − ρ , thusdefining a stochastic dynamics in terms of a Markov jump process.Stochastically, the behavior of a reaction network is defined by the (chemical) masterJuly 6, 2020 14/33quation. It gives the probability P σ ( t ) of finding the Markov chain in state σ at time t : dP σ ( t ) dt = (cid:88) ρ f −−→ π − f ( σ ) P σ ( t ) + f ( σ + ρ − π ) P σ + ρ − π ( t ) . It is worth remarking that the master equation is defined for all σ ∈ N S . However,it solution will be nonzero only for those states that are reachable from the states thathave nonzero probability at time t = 0. The reachable set of states, also called the statespace , can be defined as the smallest set such that the following hold:1. σ is in the reachable set if P σ (0) > σ is in the reachable set if σ (cid:48) is in the reachable set and there exists a reaction ρ f −→ π such that σ (cid:48) + π − ρ = σ .For simplicity (and without loss of generality) we can consider networks where theinitial probability distribution P σ (0) is concentrated in one state only, which is calledthe initial state . Additionally we shall restrict to well-defined reaction networks whereeach propensity function evaluates to zero for all multisets that do not have theminimum population counts described by the reactants. Formally, a reaction network iswell-defined if every reaction ρ f −→ π is such that f ( σ ) = 0 if ρ > σ (the inequalities shallbe intended component-wise from now on). This guarantees that the Markov chain doesnot reach states with negative population counts.The state space, hence the number of equations required for stochastic analysis, maybe finite or infinite depending on the network stoichiometries. Even in the case of finitestate spaces, its size may grow combinatorially large with the population counts of theinitial state. This may practically preclude exact analysis in most models of interest.The DRE provides a compact model with | S | variables. Each variable approximatesthe expected population level of each species at time t , denoted by the vector X ( t ) ∈ R S , as the solution of the following system of differential equations: dX ( t ) dt = (cid:88) ρ f −−→ π f ( X ( t ))( π − ρ ) . Notice that the true expected population counts, denoted by E [ Y ], are known to satisfyJuly 6, 2020 15/33he system d E [ Y ( t )] dt = (cid:88) ρ f −−→ π E [ f ( Y ( t ))]( π − ρ ) , which is not self-consistent because there are no equations for the expected values of thepropensity functions appearing in the right-hand sides. Essentially, the DRE closes thetrue equations for the expected values by replacing E [ f ( Y ( t ))] with f ( E [ Y ( t )]),introducing an approximation error if the propensity functions are not linear. Sucherror is known to vanish asymptotically when the initial population levels go to infinityand the DRE is understood as a system of re-scaled equations for the concentrations ofspecies, rather than absolute population counts [5]. Finite state expansion

The FSE approach rests on the idea of obtaining DRE-type equations that mediatebetween the discreteness of the state space in the master equation and the continuousinterpretation of the DRE, through an expansion of the reaction network where thesetwo representations can be seen as “limit cases”. Specifically, given a reaction network,the original set of species S is meant to represent the continuous dynamics. This isexpanded with a set of auxiliary species, each representing a specific discrete state ofthe reaction network. The reaction set R is then modified by replacing each originalreaction with a set of reactions that account for the interaction between the continuousand the discrete parts. The core idea of FSE is that the standard DRE for theexpanded system carry additional information that enables a more precise estimate ofthe mean. In the following, we discuss how to formally extend the reaction network andsome consistency results of the transformation. All proofs are reported in S1 Appendix.The auxiliary set of species is defined by the user through a vector of parametersthat stipulate an upper bound to the population count to be tracked discretely for eachspecies. Thus, in effect FSE yields a lattice of expansions depending on the choice of theupper bounds. Let us denote by O ∈ N S the upper bound, where each component O S gives the maximum abundance of the species S that is to be tracked in the expansion.For each discrete state o ≤ O , we denote by (cid:74) o (cid:75) the corresponding auxiliary species thatis considered in the expansion. Thus we may define S O to be the set of species in theJuly 6, 2020 16/33xpanded network as S O = S ∪ (cid:8) (cid:74) o (cid:75) | o ≤ O (cid:9) . For example, in a network with the single reaction as in Eq 4, one may choose O A = O B = O C = 1. Then, the expanded network will have auxiliary species (cid:74) A (cid:75) , (cid:74) B (cid:75) , (cid:74) C (cid:75) , (cid:74) A + B (cid:75) , (cid:74) A + C (cid:75) , (cid:74) B + C (cid:75) , (cid:74) A + B + C (cid:75) , and (cid:74) (cid:75) , where the last species denotesthe zero vector being tracked. We remark that, similarly to the definition of the masterequation, it is convenient to consider all states within the upper bound when describingthe theory. However, also in this case, not all discrete states may be reached dependingon the stoichiometry of the reaction network, hence they can be removed in practiceduring the analysis.The expanded set of reactions is built by replacing each reaction ρ f −→ π in theoriginal network with a set of reactions for each tracked state (cid:74) o (cid:75) as follows: (cid:74) o (cid:75) + η f o −−→ (cid:74) o (cid:48) (cid:75) + ψ, for o ≤ O, (5)where η , ψ and o (cid:48) ∈ N S are defined componentwise by η = ρ (cid:9) o, (6) ψ = (( o (cid:9) ρ ) + π ) (cid:9) O,o (cid:48) = O ∧ (( o (cid:9) ρ ) + π ) . Intuitively, for each original reaction as in Eq 3, Eq 5 conditions its dynamics withrespect to (cid:74) o (cid:75) being the discrete state being tracked. Any expanded reaction maintainsthe same overall counts of reactants and products as the originating reaction, with aproduct tracked state (cid:74) o (cid:48) (cid:75) that results from the addition of products and removal ofreactants within the upper bound O ; η and ψ are vectors of original species of S .Essentially, they act as buffer species for populations that are not explicitly tracked.The propensity function f o is derived from the original one f as f o : R S O → R +0 , with f o ( x ) = x (cid:74) o (cid:75) f ( o + x | S ) . (7)This modification accounts for the fact that the tracked species (cid:74) o (cid:75) encodes additionalJuly 6, 2020 17/33opulation counts, as given by o .For example, let us consider an expansion for the reaction in Eq 4 assuming that itevolves with mass-action kinetics. In general, for a reaction with reagents ρ and kineticparameter k >

0, the propensity function by mass-action kinetics for state σ is given by f k ( σ ) = k (cid:81) S (cid:0) σ S ρ S (cid:1) . Here the propensity function reads: f ( x ) = kx A x B ( x B − / . Assuming that the upper bounds are O A = O B = O C = 1, the expansion for thetracked state (cid:74) A + B (cid:75) is given by (cid:74) A + B (cid:75) + B f (cid:48) −→ (cid:74) C (cid:75) . Since the tracked state (cid:74) A + B (cid:75) does not have enough copies of B , one further copy isused by the buffer species. The product of this reaction does not involve any of thebuffer species because (cid:74) C (cid:75) is within the chosen bounds. By Eq 7, the propensityfunction in the expanded reaction becomes f (cid:48) ( x ) = kx (cid:74) A + B (cid:75) (1 + x A )(1 + x B ) (cid:0) (1 + x B ) − (cid:1) / kx (cid:74) A + B (cid:75) (1 + x A )(1 + x B ) x B / . We denote by R O the set of reactions in the expanded networks where thetransformation in Eq 5 is applied to every reaction in R .Every expansion is stochastically equivalent to the original network, in the sense thatthere is a unique marginal probability distribution for the overall population of eachspecies at every time point. This equivalence can be stated in the sense of ordinarylumpability for Markov chains [11]. Using the master equation, we show that theprobability of being in a state in the original reaction network equals the sum of theprobabilities across all states in the expanded network with the same overall abundancesfor each species. This relation holds at all time points, provided that it is satisfied forthe respective probability distributions at time 0. Theorem 1.

Let P and ˆ P denote the solutions of the master equation in the original July 6, 2020 18/33 nd expanded network, respectively. Then it holds that (cid:88) o + ξ = σ ˆ P (cid:74) o (cid:75) + ξ (0) = P σ (0) = ⇒ (cid:88) o + ξ = σ ˆ P (cid:74) o (cid:75) + ξ ( t ) = P σ ( t ) , for all t . The main idea behind the proof is to show the following equivalence of thederivatives of the master equation: (cid:88) o + ξ = σ d ˆ P (cid:74) o (cid:75) + ξ dt = dP σ dt for all σ ∈ N S . The previous result says that when O is finite any expansion is stochasticallyequivalent to the original reaction network. By construction, if O = then the originaland expanded networks coincide. Now we consider the other limit case, namely whenthe auxiliary set of species contains all discrete states, corresponding to a fullyexpanded reaction network. In this case, the DRE of the expanded network correspondsto the master equation of the original network, hence no approximation occurs. In orderto do so, we state two preliminary results. Lemma 2.

The expansion of a well-defined reaction network is well-defined.

The following statement proves that the expansion preserves the overall populationjumps. That is, for each original reaction, every expanded reaction is such that eachspecies is subject to the same change of its abundance level.

Lemma 3.

Let ρ f −→ π be a reaction and (cid:74) o (cid:75) + η f o −→ (cid:74) o (cid:48) (cid:75) + ψ its expansion according toEq 5. Then it holds that:1. ( o + η ) (cid:9) ( o (cid:48) + ψ ) = ρ (cid:9) π ;2. ( o (cid:48) + ψ ) (cid:9) ( o + η ) = π (cid:9) ρ ;3. σ (cid:9) ( o + η ) + ( o (cid:48) + ψ ) = σ (cid:9) ρ + π , for all σ ∈ N S such that ( o + η ) ⊆ σ . With these two lemmata, we are now ready to state and prove our main result ofasymptotic correctness for a fully expanded reaction network.July 6, 2020 19/33 heorem 4.

Consider a well-defined reaction network ( S , R ) and let ( S O , R O ) be itsexpansion where S O = S ∪ (cid:8) (cid:74) o (cid:75) | o ∈ N S (cid:9) . Let X ( t ) be the DRE solution of the expanded network and P ( t ) the solution of themaster equation of the original network at time t . Then it holds that:i) if X S (0) = 0 then X S ( t ) = 0 for all t and S ∈ S ;ii) if X (cid:74) o (cid:75) (0) = P o (0) then X (cid:74) o (cid:75) ( t ) = P o ( t ) , for all t and o ∈ N S . The proof of this theorem is based on showing that the right hand side of the DREassociated with a fully expanded reaction network coincides with the master equation,based on the fact that the DRE for an expanded network reads dX S dt = (cid:88) ρ fo −→ π ∈ R O ( π S − ρ S ) · f o ( X ) , for all S ∈ S . (8) Conclusion

We have presented finite state expansion (FSE) as a novel analytical method that offersa trade off between the exactness of the solution of the master equation and theapproximation errors introduced by the deterministic rate equation (DRE) for chemicalreaction networks. FSE maintains a user-defined subset of the discrete state space andcouples this with whole-population continuous dynamics to account for the behavior ofstates that are not explicitly tracked. By an algorithmic translation of a chemicalreaction network into an expanded one with auxiliary species and modified reactions,FSE leads to equations that can be interpreted as a refinements of the original DRE. Atheoretical result of asymptotic correctness increases the confidence as to theeffectiveness of the method, since it shows that the DRE of the expanded networkcorresponds to the original master equation.The performance of FSE in correcting the original DRE when tracking a strictsubset of the discrete state space has been shown numerically in models that turn out tobe challenging for related state-of-the-art techniques. The effective mesoscopic rateequation relies on perturbation arguments around the linear-noise approximation, henceit inherently assumes a limiting regime, unlike FSE. Experimentally, we found that thisJuly 6, 2020 20/33esulted in less accurate mean estimates than FSE. With respect to analyticalapproximations of the master equation based on moment closure, the case studiesproved difficult since the analyses returned unphysical results or exhibited numericalissues, as also reported in the literature. The use of buffer species makes it possible forFSE to outperform finite state projection when using the same observation bounds forthe tracked state space, because with FSE the probability mass does not absorb into asink state. However, this inhibits the possibility to obtain error bounds between theFSE solution and the original master equation, unlike with finite state projection. Still,numerically we found excellent accuracy when the observed state space is large enough,both during the transient evolution and in the steady state. Overall, these findingsmake FSE a useful tool to study chemical reaction networks for which exact stochasticanalysis through the master equation is not accessible.Despite these encouraging results, the applicability of FSE may not always befeasible. Since it is based on an enumeration of the discrete state space—albeit up tothe given observation bound—it too may suffer from combinatorial complexity, suchthat the number of equations can grow rapidly large. If significant probability mass fallsoutside the tracked state space, the performance of FSE may not be adequate, as theSchl¨ogl system shows when small enough bounds are used.There are a number of methods that are worth investigating in the future in order totackle these challenges. Model-reduction techniques could help relieve thecomputational cost of the analysis of the DRE by providing a lower-orderapproximation that preserves the dynamics of interest [44, 45]. In principle, there mightbe other expansions than the one presented here, which give rise to different correctionbehavior of the DRE while still preserving the stochastic dynamics. A further line ofimprovement might consist in devising variants of FSE where the tracked state spacecan be arbitrarily fixed, instead of being dependent on an upper bound for thepopulation counts. This might allow the fine-tuning of the choice of the discrete regionwhere the probability mass is mostly concentrated. For such expansions, smallerobservation bounds (hence lower computational cost) may suffice to obtain the samedegree of accuracy as in this paper, thus potentially extending the practical applicabilityof FSE to models of higher complexity.July 6, 2020 21/33 upporting information

S1 Appendix. Proofs of the results presented in the paper.

Acknowledgments

This work has been partially supported by the Italian Ministry for Education andResearch under grant SEDUCE no. 2017TWRCNB and by the IMT School forAdvanced Studies Lucca under grant PROCOPE.

References

1. Van Kampen NG. Stochastic Processes in Physics and Chemistry. Elsevier; 2007.2. Gillespie DT. Exact Stochastic Simulation of Coupled Chemical Reactions.Journal of Physical Chemistry. 1977;81(25):2340–2361.3. Schnoerr D, Sanguinetti G, Grima R. Approximation and inference methods forstochastic biochemical kinetics—a tutorial review. Journal of Physics A:Mathematical and Theoretical. 2017;50(9):093001.4. Fr¨ohlich F, Thomas P, Kazeroonian A, Theis FJ, Grima R, Hasenauer J.Inference for Stochastic Chemical Kinetics Using Moment Equations and SystemSize Expansion. PLOS Computational Biology. 2016;12(7):1–28.doi:10.1371/journal.pcbi.1005030.5. Kurtz TG. The Relationship between Stochastic and Deterministic Models forChemical Reactions. The Journal of Chemical Physics. 1972;57(7):2976–2978.6. Guptasarma P. Does replication-induced transcription regulate synthesis of themyriad low copy number proteins of Escherichia coli? BioEssays.1995;17(11):987–997. doi:10.1002/bies.950171112.7. Elowitz MB, Levine AJ, Siggia ED, Swain PS. Stochastic Gene Expression in aSingle Cell. Science. 2002;297(5584):1183–1186.July 6, 2020 22/33. Paulsson J. Models of stochastic gene expression. Physics of Life Reviews.2005;2(2):157–175. doi:https://doi.org/10.1016/j.plrev.2005.03.003.9. Swain PS, Elowitz MB, Siggia ED. Intrinsic and extrinsic contributions tostochasticity in gene expression. Proceedings of the National Academy ofSciences. 2002;99(20):12795–12800. doi:10.1073/pnas.162041399.10. Munsky B, Khammash M. The finite state projection algorithm for the solutionof the chemical master equation. The Journal of Chemical Physics.2006;124(4):044104. doi:10.1063/1.2145882.11. Buchholz P. Exact and Ordinary Lumpability in Finite Markov Chains. Journalof Applied Probability. 1994;31(1):59–75.12. Kuehn C. Moment Closure—A Brief Review. In: Sch¨oll E, Klapp SHL, H¨ovel P,editors. Control of Self-Organizing Nonlinear Systems. Springer InternationalPublishing; 2016. p. 253–271.13. Grima R. An effective rate equation approach to reaction kinetics in smallvolumes: Theory and application to biochemical reactions in nonequilibriumsteady-state conditions. The Journal of Chemical Physics. 2010;133(3):035101.doi:10.1063/1.3454685.14. Thomas P, Matuschek H, Grima R. Computation of biochemical pathwayfluctuations beyond the linear noise approximation using iNA. In: IEEEInternational Conference on Bioinformatics and Biomedicine; 2012. p. 1–5.15. Hasenauer J, Wolf V, Kazeroonian A, Theis FJ. Method of conditional moments(MCM) for the Chemical Master Equation. Journal of Mathematical Biology.2014;69(3):687–735. doi:10.1007/s00285-013-0711-5.16. G´omez-Uribe CA, Verghese GC. Mass fluctuation kinetics: Capturing stochasticeffects in systems of chemical reactions through coupled mean-variancecomputations. The Journal of Chemical Physics. 2007;126(2):024109.doi:10.1063/1.2408422.July 6, 2020 23/337. Ale A, Kirk P, Stumpf MPH. A general moment expansion method for stochastickinetic models. The Journal of Chemical Physics. 2013;138(17):174101.doi:10.1063/1.4802475.18. Lee CH, Kim KH, Kim P. A moment closure method for stochastic reactionnetworks. The Journal of Chemical Physics. 2009;130(13):134107.doi:10.1063/1.3103264.19. Sotiropoulos V, Kaznessis YN. Analytical derivation of moment equations instochastic chemical kinetics. Chemical Engineering Science. 2011;66(3):268 – 277.doi:https://doi.org/10.1016/j.ces.2010.10.024.20. Gillespie CS. Moment-closure approximations for mass-action models. IETSystems Biology. 2009;3(1):52–58. doi:10.1049/iet-syb:20070031.21. Singh A, Hespanha JP. Approximate Moment Dynamics for Chemically ReactingSystems. IEEE Transactions on Automatic Control. 2011;56(2):414–418.doi:10.1109/TAC.2010.2088631.22. Smadbeck P, Kaznessis YN. A closure scheme for chemical master equations.Proceedings of the National Academy of Sciences. 2013;110(35):14261–14265.doi:10.1073/pnas.1306481110.23. Thomas P, Popovi´c N, Grima R. Phenotypic switching in gene regulatorynetworks. Proceedings of the National Academy of Sciences.2014;111(19):6994–6999. doi:10.1073/pnas.1400049111.24. Jahnke T. On Reduced Models for the Chemical Master Equation. MultiscaleModeling & Simulation. 2011;9(4):1646–1676. doi:10.1137/110821500.25. Menz S, Latorre JC, Sch¨utte C, Huisinga W. Hybrid Stochastic-DeterministicSolution of the Chemical Master Equation. SIAM Interdisciplinary JournalMultiscale Modeling and Simulation. 2012;10(4):1232–1262.26. Cardelli L, Tribastone M, Vandin A, Tschaikowski M. ERODE: A Tool for theEvaluation and Reduction of Ordinary Differential Equations. In: Tools andAlgorithms for the Construction and Analysis of Systems — 23rd InternationalJuly 6, 2020 24/33onference, TACAS; 2017.Available from: http://cse.lab.imtlucca.it/~mirco.tribastone/papers/tacas2017.pdf .27. Schl¨ogl F. Chemical reaction models for non-equilibrium phase transitions.Zeitschrift f¨ur Physik. 1972;253(2):147–161.28. Bishop LM, Qian H. Stochastic Bistability and Bifurcation in a MesoscopicSignaling System with Autocatalytic Kinase. Biophysical Journal.2010;98(1):1–11. doi:https://doi.org/10.1016/j.bpj.2009.09.055.29. Vellela M, Qian H. Stochastic dynamics and non-equilibrium thermodynamics ofa bistable chemical system: the Schl¨ogl model revisited. Journal of The RoyalSociety Interface. 2009;6(39):925–940. doi:10.1098/rsif.2008.0476.30. Zheng Q, Ross J. Comparison of deterministic and stochastic kinetics fornonlinear systems. The Journal of Chemical Physics. 1991;94(5):3644–3648.doi:10.1063/1.459735.31. Li H, Cao Y, Petzold LR, Gillespie DT. Algorithms and Software for StochasticSimulation of Biochemical Reacting Systems. Biotechnology Progress.2008;24(1):56–61. doi:10.1021/bp070255h.32. Hornos JEM, Schultz D, Innocentini GCP, Wang J, Walczak AM, Onuchic JN,et al. Self-regulating gene: An exact solution. Phys Rev E. 2005;72:051907.doi:10.1103/PhysRevE.72.051907.33. Grima R, Schmidt DR, Newman TJ. Steady-state fluctuations of a geneticfeedback loop: An exact solution. The Journal of Chemical Physics.2012;137(3):035104. doi:10.1063/1.4736721.34. Shen-Orr SS, Milo R, Mangan S, Alon U. Network motifs in the transcriptionalregulation network of Escherichia coli. Nature Genetics. 2002;31(1):64–68.doi:10.1038/ng881.35. Gardner TS, Cantor CR, Collins JJ. Construction of a genetic toggle switch inEscherichia coli. Nature. 2000;403(6767):339–342.July 6, 2020 25/336. Tian T, Burrage K. Stochastic models for regulatory networks of the genetictoggle switch. Proceedings of the National Academy of Sciences.2006;103(22):8372–8377. doi:10.1073/pnas.0507818103.37. Kærn M, Elston TC, Blake WJ, Collins JJ. Stochasticity in gene expression:from theories to phenotypes. Nature Reviews Genetics. 2005;6:451 EP –.38. Hepp B, Gupta A, Khammash M. Adaptive hybrid simulations for multiscalestochastic reaction networks. The Journal of Chemical Physics.2015;142(3):034118. doi:10.1063/1.4905196.39. Kazeroonian A, Fr¨ohlich F, Raue A, Theis FJ, Hasenauer J. CERENA:ChEmical REaction Network Analyzer—A Toolbox for the Simulation andAnalysis of Stochastic Chemical Kinetics. PLOS ONE. 2016;11(1):1–15.doi:10.1371/journal.pone.0146732.40. Schnoerr D, Sanguinetti G, Grima R. Comparison of different moment-closureapproximations for stochastic chemical kinetics. The Journal of Chemical Physics.2015;143(18):185101. doi:10.1063/1.4934990.41. Lakatos E, Ale A, Kirk PDW, Stumpf MPH. Multivariate moment closuretechniques for stochastic kinetic models. The Journal of Chemical Physics.2015;143(9):094107. doi:10.1063/1.4929837.42. Schnoerr D, Sanguinetti G, Grima R. Validity conditions for moment closureapproximations in stochastic chemical kinetics. The Journal of Chemical Physics.2014;141(8):084103.43. Winkelmann S, Sch¨utte C. Hybrid models for chemical reaction networks:Multiscale theory and application to gene regulatory systems. The Journal ofChemical Physics. 2017;147(11):114115. doi:10.1063/1.4986560.44. Cardelli L, Tribastone M, Tschaikowski M, Vandin A. Maximal aggregation ofpolynomial dynamical systems. Proceedings of the National Academy of Sciences.2017;114(38):10029–10034. doi:10.1073/pnas.1702697114.July 6, 2020 26/335. Kim JK, Sontag ED. Reduction of multiscale stochastic biochemical reactionnetworks using exact moment derivation. PLOS Computational Biology.2017;13(6):1–24. doi:10.1371/journal.pcbi.1005571.July 6, 2020 27/33

Theorem 1

Let P and ˆ P denote the solutions of the master equation in the original and expandednetwork, respectively. Then it holds that (cid:88) o + ξ = σ ˆ P (cid:74) o (cid:75) + ξ (0) = P σ (0) = ⇒ (cid:88) o + ξ = σ ˆ P (cid:74) o (cid:75) + ξ ( t ) = P σ ( t ) , for all t . Proof.

We prove the following equivalence for the derivatives of the solutions of therespective master equations (cid:88) o + ξ = σ d ˆ P (cid:74) o (cid:75) + ξ dt = dP σ dt for all σ ∈ N S , July 6, 2020 28/33rom which the statement holds under the assumption of consistent initial conditions. (cid:88) o + ξ = σ d ˆ P (cid:74) o (cid:75) + ξ dt = (cid:88) o + ξ = σ (cid:88) ( (cid:74) (cid:15) (cid:75) + η ) f(cid:15) −→ ( (cid:74) o (cid:48) (cid:75) + ψ ) ∈ R O (cid:16) f (cid:15) ( (cid:74) o (cid:75) + ξ + (cid:74) (cid:15) (cid:75) + η − (cid:74) o (cid:48) (cid:75) − ψ ) ˆ P (cid:74) o (cid:75) + ξ + (cid:74) (cid:15) (cid:75) + η − (cid:74) o (cid:48) (cid:75) − ψ − f (cid:15) ( (cid:74) o (cid:75) + ξ ) · ˆ P (cid:74) o (cid:75) + ξ (cid:17) = (cid:88) o + ξ = σ  (cid:88) ( (cid:74) (cid:15) (cid:75) + η ) f(cid:15) −→ ( (cid:74) o (cid:75) + ψ ) ∈ R O f (cid:15) ( (cid:74) (cid:15) (cid:75) + ξ + η − ψ ) · ˆ P (cid:74) (cid:15) (cid:75) + ξ + η − ψ − (cid:88) ( (cid:74) o (cid:75) + η ) fo −→ ( (cid:74) o (cid:48) (cid:75) + ψ ) ∈ R O f o ( (cid:74) o (cid:75) + ξ ) · ˆ P (cid:74) o (cid:75) + ξ  = (cid:88) ( (cid:74) (cid:15) (cid:75) + η ) f(cid:15) −→ ( (cid:74) o (cid:75) + ψ ) ∈ R O o + ξ = σ f (cid:15) ( (cid:74) (cid:15) (cid:75) + ξ + η − ψ ) · ˆ P (cid:74) (cid:15) (cid:75) + ξ + η − ψ − (cid:88) ( (cid:74) o (cid:75) + η ) fo −→ ( (cid:74) o (cid:48) (cid:75) + ψ ) ∈ R O o + ξ = σ f o ( (cid:74) o (cid:75) + ξ ) · ˆ P (cid:74) o (cid:75) + ξ = (cid:88) ( (cid:74) (cid:15) (cid:75) + η ) f(cid:15) −→ ( (cid:74) o (cid:75) + ψ ) ∈ R O (cid:15) + ξ + η − ψ = σ − ( o + ψ )+( (cid:15) + η ) f ( (cid:15) + ξ + η − ψ ) · ( (cid:74) (cid:15) (cid:75) + ξ + η − ψ ) (cid:74) (cid:15) (cid:75) (cid:124) (cid:123)(cid:122) (cid:125) =1 · ˆ P (cid:74) (cid:15) (cid:75) + ξ + η − ψ + − (cid:88) ( (cid:74) o (cid:75) + η ) fo −→ ( (cid:74) o (cid:48) (cid:75) + ψ ) ∈ R O o + ξ = σ f ( o + ξ ) · ( (cid:74) o (cid:75) + ξ ) (cid:74) o (cid:75) (cid:124) (cid:123)(cid:122) (cid:125) =1 · ˆ P (cid:74) o (cid:75) + ξ = (cid:88) ρ f −→ π ∈ R (cid:15) + ξ + η − ψ = σ − π + ρ f ( (cid:15) + ξ + η − ψ ) · ˆ P (cid:74) (cid:15) (cid:75) + ξ + η − ψ − (cid:88) ρ f −→ π ∈ R o + ξ = σ f ( σ ) · ˆ P (cid:74) o (cid:75) + ξ = (cid:88) ρ f −→ π ∈ R f ( σ − π + ρ ) · P σ − π + ρ − (cid:88) ρ f −→ π ∈ R f ( σ ) · P σ = (cid:88) ρ f −→ π ∈ R (cid:16) f ( σ − π + ρ ) · P σ − π + ρ − f ( σ ) · P σ (cid:17) = dP σ dt . Lemma 2

The expansion of a well-defined reaction network is well-defined.

Proof.

Let ρ f −→ π be a reaction of a well-defined network and (cid:74) o (cid:75) + η f o −→ (cid:74) o (cid:48) (cid:75) + ψ itsexpansion according to Eq 5 in the main text. Let us take z ∈ N S O such that( (cid:74) o (cid:75) + η ) (cid:42) z and separately consider the two cases for which this holds. If (cid:74) o (cid:75) / ∈ z , thenpropensity function f o in the expanded reaction is equal to zero by definition, keepingwith the requirement for the reaction being well-defined. If η (cid:42) z , then we have that( ρ (cid:9) o ) (cid:42) z | S because ρ and o are both members of N S . This implies thatJuly 6, 2020 29/33 (cid:42) ( o + z | S ), and since, the reaction is well-defined we have that f ( o + z | S ) = 0, fromwhich f o ( z ) = 0. Lemma 3

Let ρ f −→ π be a reaction and (cid:74) o (cid:75) + η f o −→ (cid:74) o (cid:48) (cid:75) + ψ its expansion according to Eq 5 in themain text. Then it holds that:1. ( o + η ) (cid:9) ( o (cid:48) + ψ ) = ρ (cid:9) π ;2. ( o (cid:48) + ψ ) (cid:9) ( o + η ) = π (cid:9) ρ ;3. σ (cid:9) ( o + η ) + ( o (cid:48) + ψ ) = σ (cid:9) ρ + π , for all σ ∈ N S such that ( o + η ) ⊆ σ . Proof.

For case (1):( o + η ) (cid:9) ( o (cid:48) + ψ ) =( o + ( ρ (cid:9) o )) (cid:9) (( O ∧ ( o (cid:9) ρ + π )+ (( o (cid:9) ρ + π ) (cid:9) O ))=( o + ( ρ (cid:9) o )) (cid:9) ( o (cid:9) ρ + π )=( ρ + ( o (cid:9) ρ )) (cid:9) (( o (cid:9) ρ ) + π )= ρ (cid:9) π. For case (2): ( o (cid:48) + ψ ) (cid:9) ( o + η ) =(( O ∧ ( o (cid:9) ρ + π ) + (( o (cid:9) ρ + π ) (cid:9) O )) (cid:9) ( o + ( ρ (cid:9) o ))=( o (cid:9) ρ + π ) (cid:9) ( o + ( ρ (cid:9) o ))=(( o (cid:9) ρ ) + π ) (cid:9) ( ρ + ( o (cid:9) ρ ))= π (cid:9) ρ. July 6, 2020 30/33or case (3): σ (cid:9) ρ + π = σ (cid:9) ( ρ (cid:9) π ) + ( π (cid:9) ρ )= σ (cid:9) (( o + η ) (cid:9) ( o (cid:48) + ψ )) + ( o (cid:48) + ψ ) (cid:9) ( o + η )= σ (cid:9) (( o + η ) (cid:9) (( o + η ) ∧ ( o (cid:48) + ψ )))+ ( o (cid:48) + ψ ) (cid:9) (( o + η ) ∧ ( o (cid:48) + ψ )) (SE9)= σ (cid:9) ( o + η ) + (( o + η ) ∧ ( o (cid:48) + ψ ))+ ( o (cid:48) + ψ ) (cid:9) (( o + η ) ∧ ( o (cid:48) + ψ )) (SE10)= σ (cid:9) ( o + η ) + ( o (cid:48) + ψ ) + (( o + η ) ∧ ( o (cid:48) + ψ )) (cid:9) (( o + η ) ∧ ( o (cid:48) + ψ ))= σ (cid:9) ( o + η ) + ( o (cid:48) + ψ ) , where Eq. (SE10) follows from Eq. (SE9) because of the relations: o + η ≤ σ and ( o + η ) ≥ ( o + η ) ∧ ( o (cid:48) + ψ ) ≤ ( o (cid:48) + ψ ) . Theorem 4