Paper The following article is Open access

Liquid–liquid phase separation driven compartmentalization of reactive nucleoplasm

and

Published 30 December 2020 © 2020 The Author(s). Published by IOP Publishing Ltd
, , Special Issue on Building Models for Single Cell Biology Citation Rabia Laghmach and Davit A Potoyan 2021 Phys. Biol. 18 015001 DOI 10.1088/1478-3975/abc5ad

1478-3975/18/1/015001

Abstract

The nucleus of eukaryotic cells harbors active and out of equilibrium environments conducive to diverse gene regulatory processes. On a molecular scale, gene regulatory processes take place within hierarchically compartmentalized sub-nuclear bodies. While the impact of nuclear structure on gene regulation is widely appreciated, it has remained much less clear whether and how gene regulation is impacting nuclear order itself. Recently, the liquid–liquid phase separation emerged as a fundamental mechanism driving the formation of biomolecular condensates, including membrane-less organelles, chromatin territories, and transcriptional domains. The transience and environmental sensitivity of biomolecular condensation are strongly suggestive of kinetic gene-regulatory control of phase separation. To better understand kinetic aspects controlling biomolecular phase-separation, we have constructed a minimalist model of the reactive nucleoplasm. The model is based on the Cahn–Hilliard formulation of ternary protein–RNA–nucleoplasm components coupled to non-equilibrium and spatially dependent gene expression. We find a broad range of kinetic regimes through an extensive set of simulations where the interplay of phase separation and reactive timescales can generate heterogeneous multi-modal gene expression patterns. Furthermore, the significance of this finding is that heterogeneity of gene expression is linked directly with the heterogeneity of length-scales in phase-separated condensates.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Phase separation is a fundamental mechanism for the emergent order in an ordinary and biological matter [1, 2]. Recently, phase separation of bio-molecules has also become a cornerstone physical mechanism for understanding the intracellular organization [24]. A wide range of membrane-less compartments are found to form through biomolecular phase separation, including nucleoli [57], stress granules [810], chromatin domains [1113] and transcriptional centers [14, 15]. The primary components driving the intra-cellular phase separation are proteins and nucleic acids with multi-valent interaction centers [16, 17].

The stickers-and-spacers framework has emerged as a viable model explaining the existence of a broad class of sequence encoded driving forces of disordered proteins which serve as nucleating centers for biomolecular condensates [3, 18]. These newly appreciated abilities of proteins and nucleic acids for forming large-scale liquid bodies is offering fresh avenues for understanding mechanisms of the coordinated action of biomolecules in gene regulation and cellular organization that go beyond single-molecule action. It is well known that eukaryotic nuclei are rich in disordered proteins linked with transcription and chromatin architecture reorganization activities [19, 20]. There have been several proposals of the functional roles that may include catalysis of biochemical reactions, noise buffering and inducing ultra-sensitive signals [2123]. However, understanding the mechanistic picture that links phase separation to the functional gene regulatory processes inside the nucleus has remained elusive [22] due to heterogeneous, multi-scale, and non-equilibrium nature of the nuclear environment [24].

In this work, we propose a minimal model of a reactive nucleoplasm (figure 1) with an objective to illustrate, in a proof of principle manner; (i) how spatially resolved non-equilibrium reactive en-events generate qualitatively distinct from equilibrium phase behavior and (ii) how the interplay of various kinetic timescales in the system impacts gene expression patterns. The minimal reactive nucleoplasm model consists of a ternary solution filled with incompressible fluid consisting of 'active' protein, RNA components, and 'passive' nucleoplasmic buffer. The mathematical formulation of the model is based on a generic ternary diffuse interface model of one-step transcription/translation and phase-separation inside the nucleoplasm. The model resolves the formation and dissolution of protein–RNA droplets as well as reactive events of influx/creation and outflux/degradation of proteins and RNA. By an extensive set of simulations exploring the interplay of timescales in the system, we found a broad kinetic regime dominated by length-scale heterogeneity of phase-separated droplets which correspond to heterogeneous gene expression patterns.

Figure 1.

Figure 1. (A) Phase diagram of the symmetric ternary protein–RNA–nucleoplasm mixture and the bulk free energy governing the solution thermodynamics fbulk(φ1, φ2, φ3) for χ = 3. The solid-line and dash-line correspond to the spinodal and binodal curves, respectively. (B) The schematic of the minimal reactive nucleoplasm model. Shown are the main reactive components (protein, RNA, DNA-seed, nucleoplasm reservoir), the corresponding reactive processes involving each components as well as their spatial generation profiles.

Standard image High-resolution image

Various simple mathematical models have been used to explore generic aspects of equilibrium and non-equilibrium phase-separation as well as the physical properties of resulting condensates [2533]. The most relevant to the present contribution are the work by Berry et al [29] , Tang and Yang [33], and Glotzer [32], where authors have considered a binary and ternary fluid model of nucleoplasm, which couples phase separation with first-order exchange among soluble and insoluble components. Authors have simulated different stages of spinodal decomposition and explored its impact on Ostwald ripening of droplets. It was shown that kinetics of the RNA flux accelerates the ripening of droplets, thereby showing a link between the thermodynamics of phase separation and kinetics of droplet formation. Other recent notable studies worth highlighting here include the work by Yamamoto et al [30], which have studied the non-spatial model of architectural RNA phase separation coupled with non-equilibrium production. Finally, an important theoretical framework by Ilker and Joanny [34] establishes and equivalence of phase-separation kinetics with the Cahn–Hilliard effective temperature.

Some of the key distinctions of the present model from previous studies include (i) spatially resolved formulation of gene expression and phase separation (ii) explicit connection of phase-separated patterns with transcription and translation models of gene expression (iii) exploring the impact of dynamic turnover in RNA–protein–nucleoplasm ternary diagram on global patterning. Thus, the present work clarifies as a first step the dual nature of nuclear order and gene expression, and provides a useful theoretical framework for understanding equilibrium and non-equilibrium origins of intra-nuclear patterning.

2. Minimal reactive nucleoplasm model

In order to explore the kinetics of phase separation of RNA–proteins–nucleoplasm reacting mixture, we use a local thermodynamic approach that describes the phase separation of a ternary mixture based on the Flory–Huggins model [35, 36] for polymer solutions coupled with chemical reaction-diffusion equations (figure 1). In this section, we describe the mathematical formulation of the model of a 'minimal reactive nucleoplasm': where the liquid–liquid phase separation of single RNA and protein components is coupled with reactive events of transcription, translation, and degradation.

A minimal one-step model of independent, unregulated transcription and translation [37] is used to set the lifetimes of protein and RNA components comprising the nucleoplasmic milieu defined by the following the chemical reactions:

Equation (1)

where the kR, kP and kd are the rate coefficients for transcription, translation and degradation, respectively; The ∅ symbol is used to denote the degradation of RNA and protein. The kinetics of phase separation of ternary reactive protein–RNA–nucleoplasm mixture is given by the following set of reaction–diffusion equations [32]:

Equation (2)

Equation (3)

where φi (i = 1, 2, 3) are the order parameter variables associated with the local concentration of ith-component forming the mixture, Mi are their corresponding mobility coefficients, and F is the free energy functional describing the thermodynamics of ternary mixture. The total local density in the system is considered constant, respecting the incompressibility condition expressed as: ${\sum }_{i=1}^{3}{\varphi }_{i}=1$. The indices 1, 2, and 3 denote RNA, protein, and nucleoplasm, respectively. The function f1(r) and f2(r) are used to define the spatially heterogeneous distribution of components mimicking RNA expression inside nucleus and protein translation and flow into nucleus. The RNA is created in the center of nucleoplasm thereby mimicking a transcriptional process [38], herein we used the function defined as ${f}_{1}\left(r\right)=\mathrm{exp}\left(-{\left(r-{r}_{\mathrm{c}}\right)}^{2}/a\right)$ where r is the distance from the center of the domain located at rc. The coefficient a is a localization lengthscale which is set to a = 1. The protein component is flown into the nucleus from the nuclear boundaries thereby mimicking translation [38], herein we used the function defined as f2(r) = 1 on the domain boundary ∂Ω and 0 otherwise. Heterogeneous mobility is an interesting aspect of nucleoplasm [39], however, and is undoubtedly deserving of a separate investigation. For simplicity, we assume the mobility coefficients for all components to be constant M1({φi }) = M2({φi }) = M.

The free energy functional taken in this model is based on the Flory–Huggins energy formulation of the ternary mixture system [40, 41], which is given by:

Equation (4)

with

where fbulk(φ1, φ2, φ3) is the local free energy density, and the square-gradient coefficients ${\kappa }_{{\varphi }_{i}}$ are positive constants controlling the interfacial free energies. χij are the interaction parameters between species ij, and Ni is the degree of polymerization.

We note here that there also exist other mathematical formulations of the local free energy for describing the phase separation of ternary polymeric mixtures. For instance, the generalized multiwell potential used by Yang et al [42] to study multiphase systems where the local free energy density is the summation of all the double-well potential for each phase-field variable complemented by a polynomial term that connects all the variables altogether.

The dynamic equations are brought to a dimensionless form, which reveals the essential time- and length-scales of the problem. Let us introduce l, the characteristic length of the system, and τ is the characteristic time. The dimensionless form of the kinetic equations yields four dimensionless parameters that control the dynamics of phase-separation where components are undergoing chemical reactions. The τD/τ = l2/(M × τ) = 1 is the ratio between diffusion timescale and characteristic time which is fixed to be a unit. The τ/τT = τ × kT is the ratio between characteristic time and time-scale associated with transcription. The τ/τP = τ × kP is the ratio between characteristic time and time-scale associated with protein formation. The τ/τd = τ × kd is ratio between characteristic time and time-scale associated with degradation. To solve the dynamic equations (2) and (3) in the dimensionless form, we use a fully implicit finite element C++ library from the multiphysics object-oriented simulation environment (MOOSE) [43]. The simulations were performed on a rectangular domain of dimensions Lx × Ly : (50 × 50), with periodic boundary conditions. A quadrilateral element QUAD4 with four nodes was used for domain meshing with the refinement. The total number of elements used for the fine mesh is 10 000. The time step of integration Δt is fixed at 0.05 (a.u.). In this work, we only consider the case of a symmetric ternary mixture undergoing a chemical reaction with χ12 = χ23 = χ13 = 3 that ensures phase-separation of protein–RNA droplets in the absence of chemical reactions χ > χc, where χc denote the critical point. We also assumed that the chemical reaction takes place at the comparable scales with the phase separation process. Near the critical point χc, the mean-field Flory–Huggins free energy can be approximated through Taylor expansion via Landau form [32]. It is important to construct representative free energy functional that accounts for fluctuations of the phase-field variables that are dominant near the critical point. To express the critical behavior a new formulation of the free energy of mixing renormalized by the spatial variation has been proposed by Yamamoto et al [44, 45]. The chain lengths Ni , or degree of polymerization, are assumed to be the same: N1 = N2 = N3 = 1. It is noted here that the entropic term of the local free energy is proportional to the inverse polymeric lengths Ni . As a consequence of the reduction of entropy for long-chain lengths, the phase diagrams and the spinodal curve will be modified in the way to increase the separating-phase region of the phase diagram. In this case, a small variation of the interaction parameter between species will lead easily to phase separation. The impact of phase-separation of small and large chains such as genomic regions and proteins/RNA could be an interesting study to address in future investigations. The other parameters of simulations were set to ${\kappa }_{{\phi }_{1}}={\kappa }_{{\phi }_{2}}=0.031\enspace 25$. The initial configuration of phase-field variables is generated by φi (r) = ⟨φi ⟩ + δφi , where ⟨φi ⟩ is an initial average concentration associated with φi , and δφi is a small random perturbation amplitude. For all simulations presented here, the initial average of species i are set to ⟨φ1⟩ = 0.3 and ⟨φ2⟩ = 0.09, with δφ1 ∈ [−0.05, 0.05] and δφ2 ∈ [−0.01, 0.01].

3. Results

Here we report the findings obtained by analyzing the results of simulations with a minimal reactive nucleoplasm model. We have organized the results in three broad kinetic regimes, which are described by three orders of magnitude in degradation time-scale τd = τ, 10τ, 100τ with respect to the diffusion time-scale τ. These time-scales correspond to different rates of turnover of nucleoplasmic components from fast to slow. For each regime, we have carried out extensive grid-based kinetic parameter sweeps exploring the coupling of transcriptional and transnational time-scales on the backdrop of fixed phase-separating free energy landscape of RNA–protein–nucleoplasm components. Despite the incredible simplicity of the minimal reactive nucleoplasm model, the time course of simulations (figures 24) has revealed a non-trivial patterning of nucleoplasm which are a dramatic departure from equilibrium thermodynamics of ternary phase separation in the absence of spatially non-uniform reaction–diffusion.

Figure 2.

Figure 2. Evolution of phase-field variables φi for three phase mixture undergoing the chemical reaction with a degradation time-scale fixed at the same of diffusion (τd = τ). From up to bottom: snapshots corresponding to simulation results with (A) τR = τP = τ; (B) τR = 100τP = 100τ; (C) τP = 100τR = 100τ; and (D) τR = τP = 100τ. The color code in blue, green, and red indicates the RNA, protein, and nucleoplasm regions, respectively.

Standard image High-resolution image
Figure 3.

Figure 3. Evolution of phase-field variables φi for three phase mixture undergoing the chemical reaction with a degradation time-scale fixed at the same of diffusion (τd = 10τ). From up to bottom: snapshots corresponding to simulation results with (A) τR = τP = τ; (B) τR = 50τP = 50τ; (C) τP = 100τR = 100τ; and (D) τR = τP = 50τ. The color code in blue, green, and red indicates the RNA, protein, and nucleoplasm regions, respectively.

Standard image High-resolution image
Figure 4.

Figure 4. Evolution of phase-field variables φi for three phase mixture undergoing the chemical reaction with a degradation time-scale fixed at the same of diffusion (τd = 100τ). From up to bottom: snapshots corresponding to simulation results with (A) τP = 10τR = 10τ; (B) τR = 10τP = 50τ; (C) τR = 50τP = 10τ; and (D) τR = τP = 100τ. The color code in blue, green, and red indicates the RNA, protein, and nucleoplasm regions, respectively.

Standard image High-resolution image

We start by investigating the fast dynamical regime corresponding to τd = τ. Figure 2 shows the spatial and temporal evolution of RNA–protein components in a reactive nucleoplasm environment. Four interesting cases are highlighted in figure 2, corresponding to different time-scale separation between translation τP and transcription τR processes. When τP = τR = τ we observe rapid coarsening dynamics (droplet growth kinetics) and emergence of RNA droplets with stable 'protein front' by which we refer to the formation of protein layer surrounding nucleoplasm domain edges [figure 2(A)]. Coarsening dynamics for either protein front or RNA droplets are also observed for fast translation/slow transcription or slow translation/fast transcription, respectively [figures 2(B) and (C)]. Naturally, it is expected that the more rapid degradation timescales for RNA and protein lead to the disappearance of both RNA droplets and protein fonts [figure 2(D)]. Thus, we can conclude that in the fast dynamical regime, nucleoplasm patterns are entirely set by the kinetic parameters with thermodynamic free energy landscape taking the back seat.

The formation of RNA–protein patterns is more intricate in the intermediate dynamical regime corresponding to τd = 10τ (figure 3). In the figure 3, we show the patterns arising from competing transcription, translation, and degradation with different transcription and translation timescales. When both transcription and translation have comparable time scales [figure 3(A)] to that of diffusion, we find a rapid progression of protein front on the one hand and a rapid RNA seed droplet growth at on the other. In this regime, once the nucleoplasm reaches a non-equilibrium steady state, droplet patterning is now becomes dictated by the thermodynamics of Flory–Huggins interaction parameters, which set the extend of mixing between protein and RNA components. When both the transcription and translation have the same time scale but are now significantly slower than diffusion [figure 3(D)], then the system displays heterogeneity in protein RNA droplet distribution. The situation with faster transcription and the faster translation is predictable: [figures 3(B) and (C)] the system tends to increase in protein front and RNA nucleating center, respectively. We have only highlighted the most interesting cases for each dynamical regime. The simulation results for the full set of time-scales are presented in the supporting information (https://stacks.iop.org/PB/18/015001/mmedia) figure S1–S10. The temporal evolution of average concentration for different kinetic regimes with τR = τP = 100τ is shown in figure S14.

We now turn to the analysis of the slow dynamical regime corresponding to a degradation timescale τd = 100τ. In this regime, we once again highlight four interesting cases [figures 4(A)–(D)]. When the system has comparable timescales for transcription, translation, and diffusion τP = τR = τ, the nucleoplasmic component gets eliminated, and the system rapidly reaches a balance between protein front and RNA seed. Slowing down the translation τP = 100τR = 100τ by two orders of magnitude leads to a non-trivial patterning with the RNA seed, nucleoplasm and RNA droplets all con-existing at a non-equilibrium steady-state. On the other hand, slowing down the transcription [figure 4(C)] leads to a steady-state with significantly downsized RNA-seed at the expense of the peripheral protein front. Slowing down both transcription and translation [figure 4(D)] leads to a non-trivial patterning with protein front, RNA droplets, and nucleoplasmic environment, but this time with no dominant RNA-seed.

To quantify the emergence of different droplet patterns, we have summarized the phase behavior of a reactive nucleoplasm model via. Global kinetic phase diagram, figure 5. Here one can see a global picture of how the interplay of kinetic timescales is favoring uniform vs binary vs ternary phases.

Figure 5.

Figure 5. Phase diagram: phase diagram showing the dominant steady-state phase and summarizing various patterns that arise at three kinetic regimes (A) τd = τ; (B) τd = 10τ; (C) τd = 100τ). The blue star indicates that the RNA domain becomes the dominant phase in the long timescale limit. The green star means that the proteins-droplets become the dominant phase in the long timescale limit. The pink star indicates three coexisting phases of the ternary mixture. In contrast, the red star indicates that RNA–proteins turn totally into the nucleoplasm.

Standard image High-resolution image

In order to quantify the length-scales of emergent patterns, we have analyzed the azimuthally averaged dynamic structure factors S(k, t) = ∫dkΩ S(k, t) associated with each dynamical regime (see figure 6 and supporting information, figure S11–S13). Analyzing dynamic structure factors reveals characteristic length-scales of protein/RNA droplets as well as emergent dynamical heterogeneity manifesting in the fast or slow coarsening of the droplets. The computed structure factors show a broad range of kinetically controlled states where one has bi-modality of RNA (or protein) components. This bimodality or more broadly heterogeneity of distribution is directly linked to the heterogeneity of droplet sizes and shapes that one can quantify from simulation images (figures 3 and 4). Multi-modal gene expression is a feature often linked with phenotypic heterogeneity and which has been mostly explained by citing the underlying non-linear dynamics of dichotomous switching noise in a spatially uniform master equation formalism [4648]. The results of figure 6 clearly show that transcriptional heterogeneity can originate purely from the spatially non-uniform nature of gene expression, which is being modulated by timescales of phase separation, transcription, and translation.

Figure 6.

Figure 6. The dynamic structure factors for the three representative kinetic regimes with significant time-scale disparity between transcription and translation τP = 100τR (see supporting information for all the results). The inset shows the scaling and power law exponent. The three panels stand for three dynamical regimes: (A) τd = τ (B) τd = 10τ (C) τd = 100τ.

Standard image High-resolution image

Finally, we have computed the length scale of patterns summarizing transitional heterogeneity in three kinetic regimes of the minimal reactive nucleoplasm model (figure 7). The length-scales patterns are quantified by using pre-computed azimuthally averaged dynamic structure factors $R\left(t\right)=\frac{\int \mathrm{d}kS\left(\mathbf{\text{k}},t\right){k}^{-2}}{\int \mathrm{d}kS\left(\mathbf{\text{k}},t\right){k}^{-1}}$ [49]. The length-scale patterns R(t) quantify the dominant length-scales which emerge during the time-evolution of a reactive nucleoplasm. Analyzing the evolution of length-scale patterns for three dynamical regimes, we see clearly that for the fast dynamical regime, there is only one dominating length-scale, which is dictated by the rapid degradation kinetic timescale. For the intermediate and slow dynamical regimes, however, we find heterogeneity of length-scales, which emerges from the disparity in transcription and translational timescales. We note that this heterogeneity has both structural and dynamical manifestations, as one can see by analyzing the power low of length-scale patterns R(t) ∼ Atα . We find two exponents, one which is characteristic for the early coarsening stage (α ∼ 1/3) and second (α ∼ 3/8) for the later accelerated phase separating evolution toward a steady state. We note that power laws in the dynamical variables of the nucleoplasmic environment have been detected and characterized in a large number of experiments [39, 50]. These power-law dependencies, however, are hard to disentangle in terms of distinct contributions since their origin may come from any combination of phase-separation, polymeric effects, confinement, and non-equilibrium motorized activities in the nucleus. In this work, we have only managed to scratch at the surface of the fascinating dynamical patterning potential of the active nucleoplasmic environment. For future studies, it would be interesting to investigate the impact of differential mobility, hydro-dynamical coupling, and as well as investigate gene expression beyond a simple one-step model of unregulated reactions that have not been done in the present contribution.

Figure 7.

Figure 7. The dominant length-scale patterns for the three representative kinetic regimes. Each panel shows a combination of transcription and translation time-scales sorted with an increasing time-scale disparity from blue to green to orange. (A) τd = τ (B) τd = 10τ (C) τd = 100τ.

Standard image High-resolution image

4. Conclusion

Recent in vitro experiments with binary and ternary protein and RNA mixtures [5153] have shown the rich complexity of phases which can emerge through liquid–liquid phase separation via modulation of stoichiometry of components and point mutations. These experiments show potentially novel regulatory strategies which when combined with non-equilibrium cellular processes such as transcription and translation could produce coordinated gene regulatory and signaling actions.

To this end, in the present contribution we introduce a minimal reactive nucleoplasm model combining spatially dependent transcription and translation to liquid–liquid phase separation of RNA and protein components embedded in the backdrop of passive nucleoplasm buffer. We use the minimal reactive-nucleoplasm model to cleanly dissect how the interplay of transcription, translation, and degradation time scales couples with the liquid–liquid phase separation of RNA and protein components in a model ternary solution. By carrying out extensive grid-based sweeps of kinetic parameter space, we uncover various non-trivial patterning and length-scale heterogeneity compared to the classic Flory–Huggins thermodynamic picture for ternary polymeric solutions. Our central finding is the existence of a broad kinetic regime characterized by a slow turnover of components and timescale disparity between transcription and translation under which a phase separating system can display bi-modal distribution. The significance of this finding is that the observed heterogeneity of gene expression is linked directly with the heterogeneity of length-scales in phase-separated condensates. The main findings and the minimal reactive nucleoplasm model thus establishes a useful framework with which one can further elucidate the emergence of nucleoplasm patterns and phenotype heterogeneity from first principles modeling of phase-separation and reaction–diffusion processes.

Acknowledgments

Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award No. R35GM138243. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation Grant No. ACI-1548562 [54] on the Stampede2 machine at the Texas Advanced Computing Center (TACC) through allocation CTS190023. The authors acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC resources that have contributed to the research results reported within this paper. The authors also acknowledge financial support from Iowa State University.

Please wait… references are loading.
10.1088/1478-3975/abc5ad