Echo State Network Based Soft Sensor for Monitoring and Fault Detection of Industrial Processes

https://doi.org/10.1016/j.compchemeng.2021.107512Get rights and content

Highlights

  • An ESN-based system is developed for fault detection of industrial processes

  • The system can attest whether the predictions can be used to replace measurements

  • ESNs are used for the first time to detect faults in real assets of O&G industry

  • Successful performance, confirmed potential to use at real industry in real time

  • Wide anticipation in fault detection, enabling operational and economic gains

Abstract

In this paper a semi-automatic computationally inexpensive system is developed and implemented for monitoring and fault detection of industrial processes. The system uses a soft sensor based on Echo State Networks (ESN) and is able to capture the non-linear dynamic relationships in the process data, making it convenient for real-time monitoring applications. The soft sensor is set to simulate normal operating conditions, so that when the process is governed by other causes, possibly in failure, high residues occur and allow the failure identification. In addition, the system monitors the reliability of the model predictions by tracking the internal states of the ESN dynamic reservoir, indicating whether the model predictions can be used instead of the measured data. The system is successfully applied to the Mackey-Glass Anomaly Benchmark (MGAB) and to the monitoring of critical pieces of equipment of a real oil and gas plant.

Introduction

The management of industrial processes has been carried out in the last decades based mostly on corrective and preventive maintenance programs (CM/PM). These strategies, even though well established in the industrial environment, are quite expensive, since, in most cases, they require the operation to be stopped in order to perform the maintenance tasks, compromising the process availability. With the increasing complexity of the industries in the 21st century, inserted in an ever-growing competitive scenario, where the operation must be clean, safe and of high productivity, more modern and efficient technologies for of industrial assets management must be adopted (Jardine et al., 2006).

This is particularly important for the oil and gas industry, where ecological issues related to global warming and the consequent need in reducing CO2 emissions, with forecasts showing the increasing replacement of oil by other cleaner energy sources. One can also cite the operational safety issues, in which faults can put both the health of operators and the process itself at risk. Finally, there is a strong geopolitical influence in this sector, which is prevalent in the well-known and cyclical oil crises (Hwang et al., 2018).

To achieve such results, companies are investing in proactive maintenance schemes, through the continuous prognosis of the processes condition, so that the monitoring can provide conditions for decision making about maintenance in a more reasonable time frame, before or as soon as the faults happen (Vachtsevanos et al., 2007; Muller et al., 2008). These strategies are generally referred to as conditional-based monitoring (CBM) and can maximize the useful life of the equipment, reduce maintenance costs, avoiding faults and increasing the operational availability of the process. In this context, digital monitoring tools based on data and machine learning are increasingly occupying the application space in industrial environments. In general, this monitoring approach is based on transforming the data collected by sensors distributed throughout the process into statistical indicators for the integrity of the monitored process (Karpenko et al., 2001; Wang, 2007; Du et al., 2013).

It is common to build monitoring systems based on the use of Non-linear AutoRegressive Moving Average with eXogenous input (NARMAX) models, using delayed inputs to consider the dynamic behavior of the systems, or on recurrent neural networks (RNN), known as universal approximators of dynamic systems (Funahashi & Nakamura, 1993; Du et al., 2013). However, the use of RNN in real-time applications is often hampered by training methods, based on gradient descent, generally implemented as backpropagation-through-time technique (BPTT) (Werbos, 1990), which presents very slow convergence features and no guaranteed global convergence, suffering from the known problems of bifurcations (Vega et al., 2008) and vanishing gradient (Hochreiter, 1998).

Reservoir Computing (RC), when only the output layer is fitted, with the internal synaptic weights of the network model being selected at random, was developed to overcome these training problems (Lukoševičius & Jaeger, 2009). A particular type of RC, the Echo State Networks (ESN) (Jaeger, 2001) have become popular due to its the ability to capture the dynamic relationships among the data with closed solutions for training, making them simpler and computationally cheaper than traditional RNN (Antonelo et al., 2017; Jordanou et al. 2019; Dias et al., 2019).

As usual with other types of neural networks, the major limitation on the use of ESN is the lack of methods to select the large number of hyperparameters. The most used methods for adjustment of hyperparameters (Grid Search and Random Search) suffer with the multidimensionality of the search space and often provide models with poor fit quality, generated without the appropriate selection of the hyperparameters, discouraging the application of ESN to solve real-world problems (Jaeger, 2005; Behar et al., 2013).

Some works published in different areas proposed the use of ESNs for monitoring and fault detection. Nevertheless, this area is still incipient, as most of these works have not been implemented in real world industrial environments, using synthetic data or controlled data collected in pilot experimental units. For example, Morando et al. (2013, 2015) used ESNs to create a fuel cell aging model, while Fan et al. (2016) used ESNs to predict faults in air compressor equipment based on synthetic data. Xie & Zhang (2017) applied ESNs for fault detection in rotating machinery and showed that the technique was quite robust and can provide good results even when the available data set is small. Particularly, Ribeiro et al. (2018) applied ESNs to detect leaks from a pilot unit piping. More recently, ESNs were used to monitor the performances of wind turbine gearboxes (Wu et al., 2019) and the transmission condition of 3D printers (Zhang et al., 2019).

In the scenario of chemical and petrochemical industries, the implemented CBM generally uses more traditional techniques, such as Principal Component Analysis (PCA) and Canonical Correlate Analysis (CCA) (Thorsen & Dalva, 1995; Ly et al., 2009).

Particularly, the use of ESNs in this industrial segment is even more incipient. These applications involve the use of ESN models to compose predictive control structures or to predict some relevant output variable of the process, which frequently cannot be measured regularly in line due to the aggressiveness of the environment or even the inexistence of appropriate sensors. Antonelo et al. (2017) used ESN models to estimate the downhole pressure using real data obtained from sensors available in an operating oil well. The measurements of this pressure are useful to map regions of instability (slugging) in the oil lift process. Jordanou et al. (2019) employed an online adaptive controller based on ESNs in diverse scenarios for control of an oil production platform. As a matter of fact, ESN models proved useful for derivation of model-based control laws, having been used effectively for control of complex dynamic systems. For instance, Dias et al. (2019) showed that ESN models are able to model the gas lift process in oil wells very closely and reported the use of ESNs as the internal models of predictive controllers that were able to deal with measurement noise, unmeasured disturbances and complex dynamic transitions. However, it must be emphasized that, to the best of our knowledge, published reports have not described the used of ESN models for purposes of monitoring and fault detection in chemical processes using real data.

Based on the previous paragraphs, in the present manuscript an ESN-based system is developed and implemented for process monitoring and fault detection, and some characteristics and relative advantages of using these models are discussed. The system uses known procedures for data cleaning, variable and hyperparameters selection to build ESN models that are employed as soft sensors to describe and monitor the regular operation of the analyzed process. For the hyperparameters selection, the Tree Parzen Estimator (TPE) (Bergstra et al., 2011) is used, which is a Bayesian-inspired sequential optimization method that reduces the amount of objective function evaluations in order to reduce the computational effort of the method, making real-time implementations possible.

Based on the formulation of the soft sensors, the proposed methodology, in addition to monitoring and detecting faults, also monitors the reliability of the model predictions, attesting, for example, whether the data predicted by the model in a faulty situation can be used to replace the measured data. In order to do that, the internal states of the ESN reservoir are tracked, so that the modification of the internal state values provides valuable information about the occurrence of process changes while the stability of the internal state values provides information about the reliability of the operation. Additionally, tracking of internal state values also provides indications about the need to retrain the model, which can be very important since the complex and real industrial processes tend to evolve naturally to other operational conditions, without necessarily indicating a fault behavior, either due to aging of equipment or changes in the quality of raw materials, among many other possible reasons (Kruger & Xie, 2012).

The proposed monitoring system is initially tested with synthetic time series datasets generated by the Mackey-Glass Equation (Mackey & Glass, 1977), which is a widely used benchmark in the field of nonlinear dynamic analysis, more specifically in time series forecasting, being especially used in the context of ESNs (Jaeger & Haas, 2004; Wang & Yan, 2015; Løkse et al., 2017; Liu & Zhang, 2020). In addition, ESNs are used here for the first time to detect faults in real assets of the oil and gas industry, monitoring the operation conditions of equipment installed in a plant of Petrobras (Petróleo Brasileiro S.A.). As described in previous studies (Clavijo et al., 2019, 2021), the monitored pieces of equipment are key components of an oil and gas fiscal metering station and are indeed very important, because the measurements they provide are related to allocation and custody transfer of product streams of the platform.

The proposed methodology is presented in Section 2, as well as the ESN structure and TPE Optimization procedure, which are the key parts of the proposed monitoring procedure. In addition, the monitored systems, MGAB model and Fiscal Metering Station are presented in detail. The obtained monitoring results are discussed in Section 3, while the conclusions are presented in Section 4.

Section snippets

Echo State Network (ESN) and Tree Parzen Estimators Optimization (TPE)

ESNs (Jaeger, 2001) are composed of three layers, similar to other popular neural network: an input layer, where the patterns are fed into the network; a hidden layer, called dynamic reservoir, consisting of a large number of sparsely connected neurons with a high degree of recurrence; and an output layer, with the values predicted by the network, as one can see in Fig. 1.

Given a vector of inputs u(n), where n is the discrete time, a vector of reservoir states x(n) and an output vector y^(n),

Mackey-Glass Anomaly Benchmark

The selected model presented the following hyperparametric configuration: N = 316, SR 4.3955.10−1, υ = 3.4749.10−4 and Wback term = True. The selection process performed with the TPE procedure evaluated the objective function 4000 times; however, no significant improvement could be detected after iteration number 1414, with MSE = 1.0066.10−4. Fig. 5A shows a comparison of the values of the MG time series (originals and predicted by the ESN model), as well as the values of the SPE and

Conclusions

The present manuscript described a system for monitoring and identifying processes in real time through the construction of Echo State Networks-based soft sensors. This development is relevant and can assist the identification of processes and support decision making in industrial environments, thus allowing to obtain safer and more efficient processes. The system can be implemented in common computing frameworks without the need to use sophisticated numerical techniques, where the critical

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001. The authors also thank CNPq-Conselho Nacional de Desenvolvimento Científico e Tecnológico, and Petrobras (Petróleo Brasileiro SA), for the financial support of this work.

CRediT authorship contribution statement

Tiago Lemos: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing – original draft, Writing – review & editing, Visualization, Project administration. Luiz Felipe Campos: Validation, Formal analysis, Investigation, Data curation, Visualization, Project administration. Afrânio Melo: Validation, Formal analysis, Investigation, Visualization. Nayher Clavijo: Validation, Formal analysis, Investigation, Visualization. Rafael Soares: Validation,

Declaration of Competing Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; nor in the decision to publish the results.

References (58)

  • H. Wang et al.

    Optimizing the echo state network with a binary particle swarm optimization algorithm

    Knowledge-Based Syst.

    (2015)
  • J. Behar et al.

    An Echo State Neural Network for Foetal ECG Extraction Optimised by Random Search

    Nips

    (2013)
  • A. Bekraoui et al.

    Uncertainty study of fiscal orifice meter used in a gas Algerian field

    Flow Meas. Instrum.

    (2019)
  • J. Bergstra et al.

    Algorithms for hyper-parameter optimization

  • J. Bergstra et al.

    Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures

  • N. Clavijo et al.

    Development and application of a data-driven system for sensor fault diagnosis in an oil processing plant

    Processes

    (2019)
  • N. Clavijo et al.

    Variable Selection for Fault Detection Based on Causal Discovery Methods: Analysis of an Actual Industrial Case

    Processes

    (2021)
  • A.C.S.R. Dias et al.

    Extracting valuable information from big data for machine learning control: An application for a gas lift process

    Processes

    (2019)
  • X. Dutoit et al.

    Pruning and regularization in reservoir computing

    Neurocomputing

    (2009)
  • K.ichi Funahashi et al.

    Approximation of dynamical systems by continuous time recurrent neural networks

    Neural Networks

    (1993)
  • J.E. Gallaghe

    Natural Gas Measurement Handbook

    (2006)
  • A. Géron

    Hands-on machine learning with Scikit-Learn, Keras and TensorFlow: concepts, tools, and techniques to build intelligent systems

    (2019)
  • R.S. Halinski et al.

    The Selection of Variables in Multiple Regression Analysis

    J. Educ. Meas.

    (1970)
  • S. Hochreiter

    The vanishing gradient problem during learning recurrent neural nets and problem solutions

    Int. J. Uncertainty, Fuzziness Knowlege-Based Syst

    (1998)
  • H.J. Hwang et al.

    A study of the development of a condition-based maintenance system for an LNG FPSO

    Ocean Eng

    (2018)
  • H. Jaeger

    The “echo state” approach to analysing and training recurrent neural networks

    GMD Rep

    (2001)
  • H. Jaeger et al.

    Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication

    Science (80-.)

    (2004)
  • H. Jaeger

    Reservoir riddles: Suggestions for echo state network research (extended abstract)

  • A.K.S. Jardine et al.

    A review on machinery diagnostics and prognostics implementing condition-based maintenance

    Mech. Syst. Signal Process.

    (2006)
  • Cited by (16)

    • Artificial intelligence modeling of ultrasonic fatigue test to predict the temperature increase

      2022, International Journal of Fatigue
      Citation Excerpt :

      The random search algorithm is an alternative intended to address the computational cost issue but has been found unreliable for training complex models [35]. One of the main issues of hyperparameter optimization is that each time a new set of hyperparameters is evaluated, it is necessary to call the objective function that trains a model on the training data, make predictions on the validation data and then calculate the validation score [36]. Bayesian optimization suits well for hyperparameter tuning problems, since it keeps track of past results and proposes better candidate hyperparameters, leading to fewer overall evaluations of the objective function [37].

    View all citing articles on Scopus
    View full text