Distributional observability of probabilistic Boolean networks

https://doi.org/10.1016/j.sysconle.2021.105001Get rights and content

Abstract

A probabilistic Boolean network (PBN) is a discrete network composed of a family of Boolean networks together with a set of probabilities governing the selection of a Boolean network at each time step. We introduce in this paper a novel observability problem for PBNs. Specifically, we assume that the values of the initial state of the PBN are not known with certainty, but can be described by probability distributions. We ask ourselves under which conditions it is possible to uniquely determine the probability distribution of initial states when giving only knowledge of the evolution of the probability distributions of network outputs. We propose a complete answer to this problem using a linear algebra approach. Several examples, both artificial and real-world, are given and illustrate the viability of the proposed theoretical results.

Introduction

Mathematical modeling of biological systems is an attractive way for systematically studying the functions of biological processes and their relationship with the environment. One of the frequently used frameworks for modeling biological systems is that of Boolean models. After assembling individual components and interactions involved in a system, Boolean network (BN) models can properly capture the qualitative temporal behavior of the system without the need for much kinetic details [1]. Typically, a BN is presented in the form of a (deterministic) nonlinear system, while it was shown in [2] that such nonlinear dynamics can always be mapped invertably to a standard discrete-time linear dynamics using an algebraic state representation approach. The main idea behind is that a BN with n variables has 2n possible states, and if such states are represented by canonical vectors of length 2n, then the one-step transitions between states can be equivalently specified by a 2n×2n matrix. This formal simplicity makes it comparatively easy to formulate and solve control theoretic problems for BNs, and thereby has stimulated a significant increase in the number of studies in this area [3], [4], [5], [6], [7], [8], [9], [10], [11], [12]. For some recent work on the analysis and control of BNs based on other approaches, see, e.g., [13], [14], [15].

A probabilistic Boolean network (PBN) is a stochastic extension of the classical (deterministic) BN. It may be viewed as a collection of BNs endowed with a probability structure determining the likelihood with which a constituent network is active. PBNs retain the appealing properties of BNs such as requiring few kinetic parameters, but are more robust in the face of uncertainty [16]. The algebraic state representation has also proved a nice framework for addressing control-related problems in PBNs [17], [18], [19], [20], [21], [22], [23].

Observability is one of the main topics in control theory. The classical definition of observability, roughly speaking, says that a system is observable if any two distinct initial states can be distinguished by observing the output of the given system [24]. Observability and related problems for classical BNs have been extensively studied by many authors; see, e.g., [25], [26], [27], [28], and more recently [29], [30], [31], [32], [33], [34], and references therein. Compared with the many studies focusing on BNs, the investigation of the observability of PBNs is currently at an early stage. The authors of [35] considered a type of observability for PBNs, which requires that for any two distinct initial states there is a nonzero probability that different output sequences are produced so that the two states become distinguishable. Such an observability notion was revisited in a recent study [36] and was referred to as finite-time observability in probability. Two slightly different notions of observability were also explored in that study. The first one is termed finite-time observability with probability one, and requires that for any two distinct initial states the corresponding two outputs on a finite time window are distinct with probability one. The second is termed asymptotical observability in distribution and it corresponds to the case where for any pair of distinct initial states, the probability that the outputs are distinct tends to one as the length of the observation window grows to infinity. The authors of [37] suggested an alternative characterization of the property of observability with probability one, and proposed set-theoretic algorithms for determining all possible initial (or final) states compatible with a given output sequence. A necessary and sufficient condition for a multiplex PBN (i.e., a set of PBNs linked through interconnected layers) to be finite-time observable with probability one was given in [38].

We consider in this paper a novel observability problem for PBNs. The problem is to distinguish between the probability distributions of initial states in a PBN from knowledge of the probability distributions of outputs over time. In many existing research works on PBNs, both theoretical and practical, the initial state of the PBN is assumed to be randomly distributed according to a certain probability distribution [39], [40], [41], [42]. This is not only because such an assumption is in better agreement with the stochastic nature of the model, but also due to the fact that there are many real situations in which the initial state of the model cannot be known exactly. We therefore consider initial probability distributions rather than deterministic initial states for starting a PBN (see Remark 1 below for more comments on the motivation). The fact that the initial state is a random variable and that the updating rule of the network is randomly chosen at each time step leads to the fact that the output of a PBN at any given time is also a random variable. Since every stochastic process induces a family of finite dimensional distributions that determines the stochastic process uniquely (see, e.g., [43]), it is then quite reasonable to think that two initial state distributions of a PBN are indistinguishable statistically if the corresponding output processes have the same finite dimensional distributions. Observability based on such indistinguishability is then a direct extension of the classical definition when dealing with PBNs, and is to some extent more stochastic in nature compared to the aforementioned three existing observability notions. Clearly, this type of distributional observability is stronger than the property of finite-time observability in probability, but neither implies nor is implied by the other two observability notions. The main question addressed in this paper is under which conditions a PBN is initial-distribution observable. We provide a complete answer to this question. As in previous studies [35], [36], [37], [38], the theoretical framework developed in this paper is based on the powerful algebraic representation approach.

The remainder of this paper is organized as follows. Section 2 contains the basic notation and briefly reviews PBNs. The main contributions, establishing conditions for distributional observability, are presented in Section 3. Several examples, both contrived and biological, are given in Section 4. A summary of the paper is provided in the final section.

Section snippets

Preliminaries

Let δki denote the ith k×1 canonical basis vector, and Δk the set consisting of the canonical vectors δk1,,δkk. The set of all k×r matrices whose columns are canonical vectors is denoted by Lk×r. Let A and B be k1×r1 and k2×r2 matrices, respectively, and let l be the least common multiple of r1 and k2. The (left) semitensor product [44] of A and B is a (k1l/r1)×(r2l/k2) matrix, denoted by AB, and defined by AB=(AIl/r1)(BIl/k2), where is the Kronecker product of matrices and Il/r1 and Il/k

Main results

Let us consider a PBN with output, described by the algebraic representation2 x(t+1)=Fθ(t)u(t)x(t),y(t)=Hx(t),xΔN,uΔM,yΔQ.As assumed above, {θ(t)} is a Bernoulli process with outcome space {1,,S} and probability pi for the occurrence of outcome i, FiLN×NM for each 1iS, and HLQ×N. We denote by y|μ,u the output stochastic process generated by (6) corresponding to the initial state distribution

Examples

This section presents some examples that illustrate the use of our results.

Example 1

Consider a PBN as in (6), with N=4, M=Q=2, F1=[δ41δ44δ44δ43δ43δ44δ44δ43], F2=[δ41δ42δ42δ44δ43δ44δ44δ44], H=[δ21δ22δ21δ22], and the selection probabilities given by p1=0.6 and p2=0.4. The matrices Pu’s and Jz’s are Pδ21=100000.40.400000.600.60.60.4,Pδ22=000000001000.60110.4,Jδ21=diag(1,0,1,0),Jδ22=diag(0,1,0,1), and the subspace V0 is V0=imH=span[1,0,1,0],[0,1,0,1]. Direct calculation shows that Pδ21V0=Pδ22V0=span{[1,

Summary

We have addressed in this paper the problem about observability of a PBN. We assume that the initial states of the PBN are not explicitly known but are instead described by probability distributions. The problem is about whether one is able to distinguish between different initial state distributions of the network using only the probability distributions of outputs over time. We have provided a complete solution to this problem, by giving a necessary and sufficient condition for a PBN to be

CRediT authorship contribution statement

Rui Li: Conceptualization, Methodology, Writing – original draft, Writing – review & editing. Qi Zhang: Conceptualization, Writing – original draft, Writing – review & editing. Jianlei Zhang: Methodology, Writing – original draft, Writing – review & editing. Tianguang Chu: Conceptualization, Methodology, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors gratefully acknowledge constructive and insightful feedback received from the anonymous reviewers and the Associate Editor.

This research was supported by the National Natural Science Foundation of China under Grants 61503375, 61673027, 62073174, and 62073175, the Humanity and Social Science Youth Foundation of Ministry of Education of China under Grant 20YJCZH228, and the Excellent Young Scholars Funding Project in UIBE under Grant 19YQ10.

References (48)

  • MengM. et al.

    Controllability of Markovian jump Boolean control networks

    Automatica

    (2019)
  • LiuY. et al.

    Logical matrix factorization towards topological structure and stability of probabilistic Boolean networks

    Systems Control Lett.

    (2021)
  • ChengD. et al.

    Controllability and observability of Boolean control networks

    Automatica

    (2009)
  • LaschovD. et al.

    Observability of Boolean networks: a graph-theoretic approach

    Automatica

    (2013)
  • ChengD. et al.

    Observability of Boolean networks via set controllability approach

    Systems Control Lett.

    (2018)
  • GuoY. et al.

    Redefined observability matrix for Boolean networks and distinguishable partitions of state space

    Automatica

    (2018)
  • YuY. et al.

    Observability of Boolean networks via matrix equations

    Automatica

    (2020)
  • ZhouR. et al.

    Set reachability and observability of probabilistic Boolean networks

    Automatica

    (2019)
  • WangR.-S. et al.

    Boolean modeling in systems biology: an overview of methodology and applications

    Phys. Biol.

    (2012)
  • ChengD. et al.

    A linear representation of dynamics of Boolean networks

    IEEE Trans. Automat. Control

    (2010)
  • LuJ. et al.

    On pinning controllability of Boolean control networks

    IEEE Trans. Automat. Control

    (2016)
  • ZhangX. et al.

    Output tracking of Boolean control networks

    IEEE Trans. Automat. Control

    (2020)
  • LiY. et al.

    A necessary and sufficient graphic condition for the original disturbance decoupling of Boolean networks

    IEEE Trans. Automat. Control

    (2020)
  • LiR. et al.

    Controllability and observability of Boolean networks arising from biology

    Chaos

    (2015)
  • Cited by (7)

    • Linear system identifiability from single-cell data

      2022, Systems and Control Letters
      Citation Excerpt :

      They also use higher moments of the measurement distribution, but restrict to steady-state distributions. A related problem of state observability has been studied by [19] for systems governed by differential equations and by [20] for Boolean models. Assumption (i) could potentially be relaxed, whereas (ii) and (iii) are necessary conditions.

    • Minimal observability of Boolean control networks

      2022, Systems and Control Letters
      Citation Excerpt :

      It can multiply two matrices with any dimension and easily transform the logical functions into the algebraic form and has an ability to convert BNs into standard discrete-time linear systems [8]. Based on algebraic form of BNs, many fundamental results have been obtained including observability [9–23], detectability [24,25], synchronization [26–28], controllability [29–33], stabilization [34–37], as well as biological systems [38], etc. Observability is a fundamental concept in system science and control theory.

    • Finite-Time Observability of Probabilistic Logical Control Systems

      2023, Journal of Systems Science and Complexity
    View all citing articles on Scopus
    View full text