Elsevier

Neurocomputing

Volume 436, 14 May 2021, Pages 103-113
Neurocomputing

Soft-sensing of Wastewater Treatment Process via Deep Belief Network with Event-triggered Learning

https://doi.org/10.1016/j.neucom.2020.12.108Get rights and content

Abstract

Due to the complex dynamic behavior of a Wastewater Treatment Process (WWTP), the existing soft-sensing models usually fail to efficiently and accurately predict its effluent water quality. Especially when a lot of practical data is provided and we do not know which data-pair is more valuable, WWTP modeling becomes a time-consuming process. The main reason is that the existing soft-sensing models update their parameters at each data-pair in one iteration, while some update operations are meaningless. To address this thorny problem, this paper proposes a Deep Belief Network with Event-triggered Learning (DBN-EL) to improve the efficiency and accuracy of soft-sensing model in WWTP. First, some events are defined according to different running condition during the process of training DBN-based soft-sensing model. The different running condition is dominated by the fluctuation of error-reduction rate. Second, an event-triggered learning strategy is designed to construct DBN-EL, whose parameters are updated only when a positive event is triggered. Thirdly, we present the convergence analysis of DBN-EL based on the optimization in a Markov process. Finally, the effectiveness of DBN-EL is demonstrated on soft-sensing of total phosphorus concentration in a practical WWTP system. In experiment, DBN-EL is compared with nine different models on soft-sensing of WWTP. The experimental results show that the efficiency of DBN-EL is 27.6%–64.9% higher than that of nine competitive models, which indicates that the proposed model is readily available for industrial deployment.

Introduction

Wastewater treatment is absolutely essential for municipal civilization. Nowadays, biological wastewater treatment is the most feasible one, which is widely applied to a municipal Wastewater Treatment Process (WWTP). Due to its complex nonlinear dynamics with large disturbances and uncertain time-delay, WWTP modeling is a challenging task [26], [9]. Under the strict environmental regulations, soft-sensing models for effluent quality of wastewater are so vital [6], [30], [19]. A fast and accurate soft-sensing model of effluent quality not only gives an explicit description of dynamics, but also provides an alert reference for the next action in advance if abnormal operation occurs. Therefore, how to obtain excellent modeling performance of effluent quality is indispensable for the optimal operation and control of WWTP.

In existing soft-sensing models of effluent quality in WWTP, artificial neural networks (ANNs) have been widely applied to model and predict effluent quality. In particular, Canete et al. develop an ANN-based soft sensor for the online prediction of effluent chemical oxygen demand [1], which has been proved to be effective. Li et al. propose a self-organizing cascade ANN based on random weights to approximate the effluent total phosphorous (TP) [11], which achieves desired performances in practical WWTP. In Han et al. [8], a self-organizing recurrent neural network is proposed to predict the dissolved oxygen in WWTP, where a spiking-based self-adjusting algorithm is developed to tune the structure and parameters. Qiao et al. propose an incremental radial basis function neural network to model and predict effluent TP in WWTP [20], where a modified second-order algorithm is utilized to train the neuronal activity. Additionally, some other improved ANNs are applied to model and predict effluent quality and achieve desired performances [4]. These ANN-based soft-sensing models have achieved success in applications of WWTP modeling. However, they face two main challenges:

  • 1)

    Poor Capability of Extracting Deep Features. The existing ANN-based soft-sensing models are of shallow learning structures, which cannot mine and extract effective features from practical-complex data in WWTP. For some practical applications, however, effective features from a dataset are helpful to enable an efficient learning process. Therefore, their absence not only leads to a high computational need, but also results in unsatisfactory modeling performance.

  • 2)

    Time-Consuming Learning Process. The existing ANN-based soft-sensing models update their weight parameters at each data-pair in one iteration. When much practical data is provided and we do not know which one is more valuable for a training process, the operation of updating weight parameters at each data-pair tend to be useless. In this case, a time-consuming learning process is inevitable.

To solve these two problems, many efforts have been made in recent years [13], [14], [15], [16], [17], [18], [34], [35], [36]. Deep belief network (DBN), as a deep learning model, has been widely studied and applied to extract deep features and model various industrial processes [27], [37], [38], [39]. DBN can encode raw complex data into feature-vectors with different levels by training several stacked restricted Boltzmann machines (RBMs). The performance of extracting effective features can be guaranteed by minimizing reconstruction error. With its hierarchical learning capability, DBN-based learning models can achieve outstanding performance by using more effective features of practical-complex data. Sun et al. propose a deep belief echo-state network (DBESN) for nonlinear system modeling [22], where DBN is considered as a features extractor to provide echo-state network with effective input. The experimental results of DBEN indicate that DBN is effective in extracting deep features. In Wang et al. [28], DBN is first used to extract effective features from raw data, then its output is considered as the input of a fuzzy neural network (FNN). The resulting SDB-FNN model achieves desired modeling accuracy. In recent years, event-triggered optimization is used to reduce the computational complexity of modeling and control of industrial processes. Because event-triggered optimization updates parameters only when certain events occur, it is more efficient than time-triggered methods, which is demonstrated by extensive experiments in Luo and Zhou [12]. Wang et al. present a dynamic event-triggered optimization for nonlinear stochastic systems, and its effectiveness is demonstrated by a class of numerical examples [32]. Ramesh et al. propose a feasible method that gives comprehensive analysis and theoretical proof for the effectiveness of event-triggered optimization [21]. However, there is no unified method that can effectively address the two problems.

How to effectively address both problems is what we concern in this work. Considering the effluent TP concentration modeling in WWTP as a specific application background, we propose an Event-Triggered DBN (DBN-EL) model to pursue faster and more accurate modeling performance than traditional neural network. In the proposed DBN-EL model, updating weight parameters is conducted only when a certain event is triggered. First, the triggering event is defined according to error-reduction rate and corresponding number of iterations. If the fluctuation of error-reduction rate goes beyond or falls within a predefined threshold, a corresponding event is triggered. Second, once some events are triggered, and means that the current data-pair is valuable, the weight parameters are updated at the current iteration. Compared with the existing methods, the main contributions of this work are:

  • 1)

    It proposes an event-triggered DBN learning model, DBN-EL, which can extract effective feature from practical-complex data of WWTP. More importantly, DBN-EL is trained on the extracted effective feature and updates its weight parameters only when a positive event is triggered. In this case, the real dynamics of TP concentration in WWTP system can be efficiently and accurately approximated.

  • 2)

    It proposes an effective method to improve the effectiveness of data by using event-triggered strategy. The event-triggered strategy is actually a selective response mechanism that does not respond to any outlier data.

  • 3)

    It presents the convergence and stability analysis of event-triggered learning algorithm based on the optimization in Markov process, which can improve interpretability and transferability of DBN-EL.

The rest of this work is organized as follows. We present problem formulations and preliminaries in Section 2, provide DBN-EL and its computational complexity analysis in Section 3, give convergence analysis in Section 4, give experimental results in Section 5, present the discussion in Section 6, and conclude this work in Section 7.

Section snippets

WWTP characteristics

In many practical applications, the most common method for WWTP is a biologically activated sludge process as shown in Fig. 1, where inputs are influent flow rate and initial concentration of influent. The output are the effluent flow rate and the effluent concentration. The middle variables are the component concentrations, such as TP, biochemical oxygen demand (BOD), chemical oxygen demand (COD), ammonium nitrogen NH4-N and nitrate nitrogen NO3-N, and so on. However, it has some

Deep Belief Network with Event-triggered Learning

This section presents a DBN with event-triggered learning (DBN-EL) to further solving the two prior-mentioned problem, which are also highlighted in Remarks 2 and 3.

Convergence analysis

As analyzed above, the convergence of DBN-EL is very important to its successful applications. As a result, this section presents a theoretical analysis for its convergence, which includes two parts: 1) error minimization process; and 2) event-triggered training process.

Experimental results

This section verifies the performance of DBN-EL on total phosphorus (TP) modeling and prediction of a practical WWTP. The experiment framework is shown in Fig. 6, including three parts: data acquisition, input features selection and DBN-EL modeling and prediction.

Discussion

With respect to the experimental results, we draw the following conclusions:

  • 1)

    Efficient learning process: The TP data effectiveness is low because of the complexity and external disturbance of WWTP. Fortunately, the event-triggered selective learning algorithm of DBN-EL can ignore the ineffective data-pairs successfully. In this way, DBN-EL achieves an efficient learning process, which is indicated by Fig. 8 and Table 2.

  • 2)

    Advantages of Event-Triggered Learning: The dynamics of WWTP are dominated by

Conclusion

In this work, a deep belief network with n event-triggered learning (DBN-EL) is proposed to model and predict effluent quality in a wastewater Treatment Process (WWTP). This soft-sensing model integrates the event-triggered response strategy into the DBN learning process, which is more suitable for WWTP modeling with event properties. Unlike the traditional models whose parameters are updated at each data-pair iteration, DBN-EL takes action to update parameters only when a certain event is

CRediT authorship contribution statement

Gongming Wang: Methodology, Software, Validation, Formal analysis, Writing - original draft. : . Qing-Shan Jia: Conceptualization, Writing - review & editing, Supervision, Funding acquisition. MengChu Zhou: Writing - review & editing, Supervision. Jing Bi: Formal analysis, Writing - review & editing, Resources, Project administration, Funding acquisition. Junfei Qiao: Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work was supported in part by the National Natural Science Foundation of China (62003185 and 62073182), in part by the National Science and Technology Major Project (2018ZX07111005). No conflict of interest exits in this manuscript and it has been approved by all authors for publication.

Gongming Wang (S'17-M'19) received the M.S. and Ph.D. degrees in control science and engineering from Shenyang University of Chemical Technology, Shenyang, China, in 2015, and Beijing University of Technology Beijing, China, in 2019, respectively. He was a Visiting Scholar with the department of Computer and Information Science, University of Oregon (UO), Eugene, OR, USA, from 2017 to 2018. He is currently a Post-Doctoral Fellow with Center for Intelligent and Networked Systems, Department of

References (40)

  • de J.F. Canete et al.

    Soft-sensing estimation of plant effluent concentrations in a biological wastewater treatment plant using an optimal neural network

    Expert Syst. Appl.

    (2016)
  • S. Gao et al.

    Dendritic neuron model with effective learning algorithms for classification, approximation, and prediction

    IEEE Trans. Neural Networks Learn. Syst.

    (2019)
  • G. Hinton et al.

    A fast learning algorithm for deep belief nets

    Neural Comput.

    (2006)
  • H. Han et al.

    Nonlinear model-predictive control for industrial processes: an application to wastewater treatment process

    IEEE Trans. Ind. Electron.

    (2014)
  • H.G. Han et al.

    Nonlinear model predictive control based on a self-organizing recurrent neural network

    IEEE Trans. Neural Networks Learn. Syst.

    (2016)
  • A. Iratni et al.

    Advances in control technologies for wastewater treatment processes: status, challenges, and perspectives

    IEEE/CAA J. Autom. Sin.

    (2019)
  • Y. LeCun et al.

    Deep learning

    Nature

    (2015)
  • J. Luo et al.

    Petri-net controller synthesis for partially controllable and observable discrete event systems

    IEEE Trans. Autom. Control

    (2016)
  • X. Luo et al.

    Efficient and high-quality recommendations via momentum-incorporated parallel stochastic gradient descent-based learning

    IEEE/CAA J. Autom. Sin.

    (2020)
  • X. Luo et al.

    An inherently nonnegative latent factor model for high-dimensional and sparse matrices from industrial applications

    IEEE Trans. Ind. Inf.

    (2018)
  • Cited by (28)

    • Attention mechanism-aided data- and knowledge-driven soft sensors for predicting blast furnace gas generation

      2023, Energy
      Citation Excerpt :

      Ge [29] introduced an active learning strategy into a Gaussian process regression modeling framework, which maximized the representativeness of samples and predicted the butane content in the production process with high accuracy. In the chemical industry, Wang et al. [30] optimized the characteristic parameters and proposed a deep belief network driven by events and data to predict the total phosphorus concentration in industrial wastewater. Zhao et al. [31] applied soft-sensing technology to the field of sewage treatment, reduced the dimension of auxiliary variables by sparse principal component analysis, and proposed an ELM soft-sensing method based on the lion swarm optimizer to predict the oxygen content in sewage.

    • Applications of deep learning in water quality management: A state-of-the-art review

      2022, Journal of Hydrology
      Citation Excerpt :

      The resulted hybrid model outperformed the base DBN and LSTM models in forecasting algae density. To improve the model training efficiency, Wang et al. (2021) proposed the DBN with an event-triggered learning algorithm (DBN-EL) to ignore ineffective data pairs such as outliers. The DBN-EL converged much faster without noticeable fluctuations as it benefited from the selective response mechanism.

    • Development of a wide-range soft sensor for predicting wastewater BOD<inf>5</inf> using an eXtreme gradient boosting (XGBoost) machine

      2022, Environmental Research
      Citation Excerpt :

      Some examples include applications in predicting the concentration of novel pollutants and pathogens of interest (Abdeldayem et al., 2022). It can also capture a broad range of operating conditions, whereas traditional mechanistic modelling is typically limited to steady state analysis (Wang et al., 2021). Currently, the most popular machine learning models applied in wastewater treatment are artificial neural networks (ANN) and support vector machines (SVM) (Ye et al., 2020).

    • Detection of sludge bulking using adaptive fuzzy neural network and mechanism model

      2022, Neurocomputing
      Citation Excerpt :

      Recently, with the increasingly complex treatment technology and upgrading scale of activated sludge process (ASP), safe and stable operation becomes essentially important [1,2].

    View all citing articles on Scopus

    Gongming Wang (S'17-M'19) received the M.S. and Ph.D. degrees in control science and engineering from Shenyang University of Chemical Technology, Shenyang, China, in 2015, and Beijing University of Technology Beijing, China, in 2019, respectively. He was a Visiting Scholar with the department of Computer and Information Science, University of Oregon (UO), Eugene, OR, USA, from 2017 to 2018. He is currently a Post-Doctoral Fellow with Center for Intelligent and Networked Systems, Department of Automation, Tsinghua University. His current research interests include deep learning, structure design and optimization of neural networks, knowledge-based process control.

    Qing-Shan Jia (S'02-M'06-SM'11) received the B.E. and Ph.D. degrees in automation and control science and engineering from Tsinghua University, Beijing, China, in 2002 and 2006, respectively. He was a Visiting Scholar with Harvard University, Cambridge, MA, USA, The Hong Kong University of Science and Technology, Hong Kong, and the Massachusetts Institute of Technology, Cambridge, MA, USA, in 2006, 2010, and 2013, respectively. He is currently a tenured Associate Professor with the Center for Intelligent and Networked Systems, Department of Automation, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University. His current research interests include theories and applications of discrete event dynamic systems, simulation-based performance evaluation, and optimization of cyber-physical energy systems.

    MengChu Zhou (S'88-M'90-SM'93-F'03) received the B.S. degree from the Nanjing University of Science and Technology, Nanjing, China, in 1983, the M.S. degree from the Beijing Institute of Technology, Beijing, China, in 1986, and the Ph.D. degree from the Rensselaer Polytechnic Institute, Troy, NY, USA, in 1990. In 1990, he joined the New Jersey Institute of Technology (NJIT), Newark, NJ, USA, where he is currently a Distinguished Professor of electrical and computer engineering. He has over 800 publications, including 12 books, over 500 journal articles (over 400 in the IEEE transactions), 26 patents, and 29 book chapters. His research interests include Petri nets, intelligent automation, the Internet of Things, and intelligent transportation. Dr. Zhou is a fellow of the International Federation of Automatic Control (IFAC), the American Association for the Advancement of Science (AAAS), and the Chinese Association of Automation (CAA).

    Jing Bi (M'13--SM'16) received her B.S., and Ph.D. degrees in Computer Science from Northeastern University, Shenyang, China, in 2003 and 2011, respectively. She was a Post-doc researcher at Department of Automation, Tsinghua University, Beijing, China. She was a research scientist at the Beijing Research Institute of Electronic Engineering Technology, Beijing, China. She was a research assistant and participated in research on cloud computing at the IBM Research, Beijing, China. She was a Visiting Research Scholar with the Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ, USA. She is currently an Associate Professor with the Faculty of Information Technology, School of Software Engineering, Beijing University of Technology, Beijing, China. She has over 80 publications including journal and conference papers. Her research interests are in distributed computing, cloud computing, large-scale data analytics, machine learning and performance optimization. Dr. Bi was the recipient of the IBM Fellowship Award, and the recipient of the Best Paper Award in the 17th IEEE International Conference on Networking, Sensing and Control. She is now an Associate Editor of IEEE ACCESS. She is a senior member of the IEEE.

    Junfei Qiao (M'11-SM'20) received the B.E. and M.E. degrees in control engineering from Liaoning Technical University, Huludao, China, in 1992 and 1995, respectively, and the Ph.D. degree from Northeast University, Shenyang, China, in 1998. He was a Post-Doctoral Fellow with the School of Automatics, Tianjin University, Tianjin, China, from 1998 to 2000. He joined Beijing University of Technology, 2000, where he is currently a Professor and Vice-President. He is also the Director of the Intelligence Systems Laboratory. His interests include neural networks, intelligent systems and wastewater treatment process control. Prof. Qiao is a scholar of Yangtze River and the winner of National Science Fund for Outstanding Young.

    View full text