Generative adversarial network-based semi-supervised learning for real-time risk warning of process industries
Introduction
Industrial risk management often depends on two strategies: decreasing the frequency of hazardous events, or reducing the consequence of accidents. The former strategy mostly applies quantitative risk methods to identify critical sub-events and preform risk-based designs (Khan & Abbasi, 1998). Another strategy often reduces the loss of accidents through strengthening device redundancies, designing precursor warnings, and implementing personnel emergencies. Present industries enormously rely on failure data to monitor operating performance of processes, and thus the required improvement or change can only be identified after an incident has occurred (Wang, Khan, Ahmed & Imtiaz, 2016). The United States Center for Chemical Process Safety (CCPS) suggests that: “Facilities should monitor the real-time performance of management system activities rather than wait for accidents to happen” (CCPS, 2007). Investigations of industrial accidents reveal that almost in all cases, a variety of risk symptoms were observable before the crashes but were unfortunately ignored by managers and regulators (Zheng, Chen, Xue & Xue, 2017). As a result, risk-based early warning is a vital topic for improving system security in both academics and practices.
Risk warning system integrates both risk analysis and process alarm for the assessment of process deviations to convey impending hazard information in real-time (Hashemi, 2016). Most of the related researchers have focused on the application in financial risk, earthquake prediction, and flood forecasting (Oliveira, Sa, Lopes, Ferreira & Pais, 2015; Pappenberger, Cloke, Parker, Wetterhall & Richardson, 2015; Sevim, Oztekin, Bali, Gumus & Guresen, 2014). Often, risk warning procedures include three steps: analyzing deviation, establishing identification models, and estimating risk. For instance, Chang, Khan and Ahmed (2011) proposed a risk-based alarm management system with the consideration of hazard probability, hazard impact, and process safety time. Salzano, Agreda, Carluccio and Fabbrocino (2009) constructed an early warning system by predicting the fragility of industrial equipment. Chen, Zhou, Zhang, Du and Zhou (2015) presented a multi-index fuzzy method based on the measured data to accomplish risk early warning. Recently, loss function provides a specific representation of the dynamic relationship between industrial process deviation and system risk. Based on the loss function, Hashemi, Ahmed and Khan (2015) proposed an operational risk-based design approach for early warning systems. Wang et al. (2016) estimated the continuously updated probabilities of undesirable events by using dynamic loss functions and detecting multiple vital variables. Also, loss function has been integrated with the Bayesian theory to construct risk-based online warning systems under uncertainties (Ali & Riaz, 2019; Economou, Stephenson, Rougier, Neal & Mylne, 2016). Since process plants handle hazardous materials in daily operations, the risk warning system is critical to monitor the state of a process in real-time to identify any unsafe conditions before deviation leads to a more severe event (Wang, Khan & Ahmed, 2015). Risk analysis is a systematic and scientific method to predict the risk of possible accidents in industrial systems. However, traditional loss-based risk warning technology requires accident probabilities, which dynamic feature generally takes a month or year as a period, so that it cannot cope with emergencies. Besides, it is difficult to judge situations and make decisions due to the uncertainty, imprecision, and inconsistency of risk symptoms. With the rapid development of timeliness and complexity in process industries, traditional risk analysis technology is increasingly challenging to meet the needs of current business to reduce the accidents. In order to provide risk decision-making in the operational stage in case of the failure of critical protection layers, intelligent risk assessment and online early warning systems need to be developed and deployed (Dai, Wang, Khan & Zhao, 2016).
Machine learning has powerful data processing capabilities and real-time computing functions, which are suitable for offsetting the shortcomings of traditional risk methods. Different from model-based approaches, machine learning focuses on driving potential information to process multidimensional and time-varying data features from process variables. Typically, support vector machine (SVM), artificial neural network (ANN), and hidden Markov model (HMM) provide considerable performance for predictive warning (Dabrowski, Beyers & Villiers, 2016; Xu, Yang & Wang, 2017; Yang, Li, Ji & Xu, 2001). As can be seen, machine learning has unique potentialities in developing real-time early warning systems. More recently, the computational complexity of machine learning methods caused by a large amount of process data can be improved by deep learning techniques (LeCun, Bengio & Hinton, 2015). Zhang and Zhao (2017) proposed an extensible deep belief network (DBN) based fault diagnosis model for fault classification and early warning. Wu and Zhao (2018) developed a convolutional neural network (CNN) for chemical process fault warning. Zheng et al. (2017) integrated significant accident warning features and proposed Pythagorean-type fuzzy deep denoising autoencoder (PFDDAE) technology to achieve high accuracy for risk classification and accident warning. However, in terms of the early warning system, a critical problem is the lack of labeled faulty-case data. Numerous unbalanced data contains a small amount of accident data and a large amount of fault-free data, which cannot be directly trained by machine learning. Risk analysis techniques can be employed to identify risks and perform risk-based data classification from a set of process data. However, the risk analysis process is too complicated to label corresponding risk levels for thousands of process data from plants. Therefore, due to the unbalanced information and scarce labeled data, numerous deep supervised learning methods are difficult to be exploited for warning system modeling in the real application.
To address the above problems, a semi-supervised deep learning model was first proposed for soft sensing (Yao & Ge, 2018). Traditional semi-supervised learning methods including generative models, mixture models and graph-based methods, utilize unlabeled data to either modify or reprioritize hypotheses obtained from labeled data alone, and thus the representation of past models is constrained by the high computational complexity affected by the generated model (Zhu, 2008). As a result, a generative adversarial network (GAN) was proposed to overcome the shortcoming of approximate calculation for thorny probabilities and to provide a super presentation for semi-supervised learning (Goodfellow, Abadie, Mirza, Xu & Farley, 2014). However, the emphasis of most GAN-based application is placed on the generator rather than the discriminator. That is, the purpose of GAN-based applications is to guide the generator to generate data or pictures that are close to the real data (Douzas & Bacao, 2018; Wang & Liu, 2020; Lian, Jia, Zareapoor, Zheng & Luo, 2019; Chen, Lv & Wang, 2019). By contrast, we focus on the data analysis capability of GAN with a small amount of labeled data. Several related works can be found in the field of bearing fault diagnosis, wind turbine fault detection, and gear reliability classification (Guo, Li, Song, Wang & Chen, 2019; Li, He, Li & Chen, 2019; Liu, Qu, Hong & Zhang, 2019). GAN could be trained in an enormous precision with limited labeled data and a large amount of unlabeled data. It means that a large number of unlabeled process data from distributed control systems (DCSs) can be exploited reasonably.
Present paper focuses on the construction of real-time risk warning systems by using GAN. The significant contribution of present works is twofold: first, based on the process data, a complete fuzzy HAZOP-based risk analysis procedure is proposed to perform hazard identification, risk analysis, and decision making; second, GAN is developed to address the problem of the scarce labeled data for real-time risk assessment. Besides, deep network architecture integrated with CNN is exploited to codify multi-dimensional data and enhance the generalization of warning models. Proposed real-time risk warning framework not only can deeply extract the feature information from scarce labeled data but also can apply extra numerous unlabeled process samples to improve the model performance as well.
The remaining papers are organized as follows. In Section 2, the mechanism of risk warning methodology including risk-based pretreatment process and GAN-based semi-supervised learning is presented. A case study of a multizone circulating reactor is manifested in Section 3. Finally, conclusions are made in Section 4.
Section snippets
Real-time risk warning framework
Integrating with HAZOP analysis and DCS monitoring, we propose a semi-supervised learning based industrial risk warning framework in this section. The training process mainly consists of two parts. At first, qualitative factors such as human factors, environmental factors, and historical information are codified as one-dimensional one-channel data x*(t)∈ Rk for the training of a former network, in which t is the present time point, k is the number of qualitative information, and the input
Risk analysis of MZCR
Gas-phase multizone circulating reactor (MZCR) is the most central production unit in the polyolefin process (Covezzi & Mei, 2001). In the MZCR, the organic polymer granule is continuously circulated between two polymerization zones: upward by fast fluidization, in the “riser” leg and downward through gravity, in the “downer” leg. The multiple short passes of the organic particle between the two zones lead to intimate and adequate mixing of very different polymers (Severn, Chadwick, Duchateau &
Conclusions
In the present paper, a novel methodology of GAN-based semi-supervised learning is utilized for real-time risk warning of process industries. Fuzzy HAZOP is applied to perform risk-based pretreatment. Based on deep convolutional network structure and adversarial learning algorithm, a deep neural network of GAN for the accident risk warning that is of crucial importance in industrial operations is constructed. The integration of unsupervised feature extraction and semi-supervised learning
CRediT authorship contribution statement
Rui He: Investigation, Methodology, Writing - original draft, Software, Writing - review & editing. Xinhong Li: Conceptualization, Methodology. Guoming Chen: Conceptualization, Funding acquisition, Methodology. Guoxing Chen: Writing - review & editing, Writing - original draft. Yiwei Liu: Writing - review & editing.
Declaration of Competing Interest
No conflict of interest exists in the submission of this manuscript, and all authors approve the manuscript for publication. We would like to declare on behalf of co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Acknowledgments
The authors gratefully acknowledge the financial support provided by China National Key Research and Development Program (No: 2016YFC0802305).
References (59)
- et al.
Fuzzy-based HAZOP study for process industry
Journal of Hazardous Materials
(2016) - et al.
A risk-based approach to design warning system for processing facilities
Process Safety and Environmental Protection
(2011) - et al.
Urban flood risk warning under rapid urbanization
Environmental Research
(2015) - et al.
The multizone circulating reactor technology
Chemical Engineering Science
(2001) - et al.
Systemic banking crisis early warning systems using dynamic Bayesian networks
Expert Systems with Applications
(2016) - et al.
Abnormal situation management for smart chemical process operation
Current Opinion in Chemical Engineering
(2016) - et al.
Effective data generation for imbalanced learning using conditional generative adversarial networks
Expert Systems with Applications
(2018) - et al.
Risk assessment of a compound feed process based on HAZOP analysis and linguistic terms
Journal of Loss Prevention in the Process Industries
(2016) - et al.
HAZOP analysis based on sensitivity evaluation
Safety Science
(2016) - et al.
Techniques and methodologies for risk analysis in chemical process industries
Journal of Loss Prevention in the Process Industries
(1998)
Fuzzy risk matrix
Journal of Hazardous Materials
The monetary benefit of early flood warnings in Europe
Environmental Science & Policy
Risk assessment and early warning systems for industrial facilities in seismic zones
Reliability Engineering & System Safety
Developing an early warning system to predict currency crises
European Journal of Operational Research
Dynamic quantitative operational risk assessment of chemical processes
Chemical Engineering Science
Data supplement for a soft sensor using a new generative model based on a variational autoencoder and Wasserstein GAN
Journal of Process Control
Deep convolutional neural network model based chemical process fault diagnosis
Computers and Chemical Engineering
Air quality early-warning system for cities in China
Atmospheric Environment
An early warning system for loan risk assessment using artificial neural networks
Knowledge-Based Systems
A deep belief network based fault diagnosis model for complex chemical processes
Computers and Chemical Engineering
On designing a new Bayesian dispersion chart for process monitoring
Arabian Journal for Science and Engineering
Guidelines for risk based process safety
Traffic flow imputation using parallel data and generative adversarial networks
IEEE Transactions on Intelligent Transportation Systems
Good semi-supervised learning that requires a bad GAN
Semi-supervised learning with context-conditional generative adversarial networks
Hazard and operability (HAZOP) analysis: A literature review
Journal of Hazardous Materials
On the use of Bayesian decision theory for issuing natural hazard warnings
Proceedings of the Royal Society A: Mathematical, Physical and Engineering
Generative adversarial nets
Intelligent fault diagnosis method based on full 1D convolutional generative adversarial network
IEEE Transactions on Industrial Informatics
Cited by (36)
Domain adaptation with label-aligned sampling (DALAS) for cross-domain fault diagnosis of rotating machinery under class imbalance
2024, Expert Systems with ApplicationsSemi-supervised learning for industrial fault detection and diagnosis: A systemic review
2023, ISA TransactionsGas turbine failure classification using acoustic emissions with wavelet analysis and deep learning
2023, Expert Systems with ApplicationsDiscriminating the default risk of small enterprises: Stacking model with different optimal feature combinations
2023, Expert Systems with ApplicationsAn unsupervised domain adaptation approach with enhanced transferability and discriminability for bearing fault diagnosis under few-shot samples
2023, Expert Systems with ApplicationsThe role of artificial intelligence-driven soft sensors in advanced sustainable process industries: A critical review
2023, Engineering Applications of Artificial Intelligence