Generative adversarial network-based semi-supervised learning for real-time risk warning of process industries

doi:10.1016/j.eswa.2020.113244

Expert Systems with Applications

Volume 150, 15 July 2020, 113244

https://doi.org/10.1016/j.eswa.2020.113244 Get rights and content

Highlights

•
Deep learning methods are developed for building real-time risk warning systems.
•
GAN-based semi-supervised learning requires scarce labeled data.
•
Semi-supervised model incorporates numerous unlabeled samples into evaluations.
•
CNN architectures handle multi-dimensional HAZOP data to enhance warning accuracy.
•
Semi-supervised model has better performance for industrial data training.

Abstract

Due to the non-cognition of real-time data, rare loss-based risk warning methods can effectively respond to unexpected emergencies. Machine learning has powerful data processing capabilities and real-time computing functions and thus is suitable for offsetting the shortcomings of traditional risk methods. Risk analysis can be easily employed to perform risk-based data classification for a set of process data. However, the risk analysis process is too complicated to label risk levels for all processes, which is hard to satisfy the requirements of the amount of data for supervised learning. Therefore, the present paper focuses on developing semi-supervised learning methods for the construction of real-time risk-based early warning systems. By using fuzzy HAZOP, we estimate the risk of systems quantitatively based on the process data. With the consideration of scarce labeled data and numerous unlabeled information, we develop the generative adversarial network (GAN)-based semi-supervised learning method to identify the process risk timely. Besides, deep network architecture integrated with the convolutional neural network (CNN) is used for the codification of multi-dimensional process data to enhance the generalization of warning models. Finally, the effectiveness of the proposed method is evaluated through a comparative study with different algorithms on a case of multizone circulating reactor (MZCR).

Introduction

Industrial risk management often depends on two strategies: decreasing the frequency of hazardous events, or reducing the consequence of accidents. The former strategy mostly applies quantitative risk methods to identify critical sub-events and preform risk-based designs (Khan & Abbasi, 1998). Another strategy often reduces the loss of accidents through strengthening device redundancies, designing precursor warnings, and implementing personnel emergencies. Present industries enormously rely on failure data to monitor operating performance of processes, and thus the required improvement or change can only be identified after an incident has occurred (Wang, Khan, Ahmed & Imtiaz, 2016). The United States Center for Chemical Process Safety (CCPS) suggests that: “Facilities should monitor the real-time performance of management system activities rather than wait for accidents to happen” (CCPS, 2007). Investigations of industrial accidents reveal that almost in all cases, a variety of risk symptoms were observable before the crashes but were unfortunately ignored by managers and regulators (Zheng, Chen, Xue & Xue, 2017). As a result, risk-based early warning is a vital topic for improving system security in both academics and practices.

Risk warning system integrates both risk analysis and process alarm for the assessment of process deviations to convey impending hazard information in real-time (Hashemi, 2016). Most of the related researchers have focused on the application in financial risk, earthquake prediction, and flood forecasting (Oliveira, Sa, Lopes, Ferreira & Pais, 2015; Pappenberger, Cloke, Parker, Wetterhall & Richardson, 2015; Sevim, Oztekin, Bali, Gumus & Guresen, 2014). Often, risk warning procedures include three steps: analyzing deviation, establishing identification models, and estimating risk. For instance, Chang, Khan and Ahmed (2011) proposed a risk-based alarm management system with the consideration of hazard probability, hazard impact, and process safety time. Salzano, Agreda, Carluccio and Fabbrocino (2009) constructed an early warning system by predicting the fragility of industrial equipment. Chen, Zhou, Zhang, Du and Zhou (2015) presented a multi-index fuzzy method based on the measured data to accomplish risk early warning. Recently, loss function provides a specific representation of the dynamic relationship between industrial process deviation and system risk. Based on the loss function, Hashemi, Ahmed and Khan (2015) proposed an operational risk-based design approach for early warning systems. Wang et al. (2016) estimated the continuously updated probabilities of undesirable events by using dynamic loss functions and detecting multiple vital variables. Also, loss function has been integrated with the Bayesian theory to construct risk-based online warning systems under uncertainties (Ali & Riaz, 2019; Economou, Stephenson, Rougier, Neal & Mylne, 2016). Since process plants handle hazardous materials in daily operations, the risk warning system is critical to monitor the state of a process in real-time to identify any unsafe conditions before deviation leads to a more severe event (Wang, Khan & Ahmed, 2015). Risk analysis is a systematic and scientific method to predict the risk of possible accidents in industrial systems. However, traditional loss-based risk warning technology requires accident probabilities, which dynamic feature generally takes a month or year as a period, so that it cannot cope with emergencies. Besides, it is difficult to judge situations and make decisions due to the uncertainty, imprecision, and inconsistency of risk symptoms. With the rapid development of timeliness and complexity in process industries, traditional risk analysis technology is increasingly challenging to meet the needs of current business to reduce the accidents. In order to provide risk decision-making in the operational stage in case of the failure of critical protection layers, intelligent risk assessment and online early warning systems need to be developed and deployed (Dai, Wang, Khan & Zhao, 2016).

Machine learning has powerful data processing capabilities and real-time computing functions, which are suitable for offsetting the shortcomings of traditional risk methods. Different from model-based approaches, machine learning focuses on driving potential information to process multidimensional and time-varying data features from process variables. Typically, support vector machine (SVM), artificial neural network (ANN), and hidden Markov model (HMM) provide considerable performance for predictive warning (Dabrowski, Beyers & Villiers, 2016; Xu, Yang & Wang, 2017; Yang, Li, Ji & Xu, 2001). As can be seen, machine learning has unique potentialities in developing real-time early warning systems. More recently, the computational complexity of machine learning methods caused by a large amount of process data can be improved by deep learning techniques (LeCun, Bengio & Hinton, 2015). Zhang and Zhao (2017) proposed an extensible deep belief network (DBN) based fault diagnosis model for fault classification and early warning. Wu and Zhao (2018) developed a convolutional neural network (CNN) for chemical process fault warning. Zheng et al. (2017) integrated significant accident warning features and proposed Pythagorean-type fuzzy deep denoising autoencoder (PFDDAE) technology to achieve high accuracy for risk classification and accident warning. However, in terms of the early warning system, a critical problem is the lack of labeled faulty-case data. Numerous unbalanced data contains a small amount of accident data and a large amount of fault-free data, which cannot be directly trained by machine learning. Risk analysis techniques can be employed to identify risks and perform risk-based data classification from a set of process data. However, the risk analysis process is too complicated to label corresponding risk levels for thousands of process data from plants. Therefore, due to the unbalanced information and scarce labeled data, numerous deep supervised learning methods are difficult to be exploited for warning system modeling in the real application.

To address the above problems, a semi-supervised deep learning model was first proposed for soft sensing (Yao & Ge, 2018). Traditional semi-supervised learning methods including generative models, mixture models and graph-based methods, utilize unlabeled data to either modify or reprioritize hypotheses obtained from labeled data alone, and thus the representation of past models is constrained by the high computational complexity affected by the generated model (Zhu, 2008). As a result, a generative adversarial network (GAN) was proposed to overcome the shortcoming of approximate calculation for thorny probabilities and to provide a super presentation for semi-supervised learning (Goodfellow, Abadie, Mirza, Xu & Farley, 2014). However, the emphasis of most GAN-based application is placed on the generator rather than the discriminator. That is, the purpose of GAN-based applications is to guide the generator to generate data or pictures that are close to the real data (Douzas & Bacao, 2018; Wang & Liu, 2020; Lian, Jia, Zareapoor, Zheng & Luo, 2019; Chen, Lv & Wang, 2019). By contrast, we focus on the data analysis capability of GAN with a small amount of labeled data. Several related works can be found in the field of bearing fault diagnosis, wind turbine fault detection, and gear reliability classification (Guo, Li, Song, Wang & Chen, 2019; Li, He, Li & Chen, 2019; Liu, Qu, Hong & Zhang, 2019). GAN could be trained in an enormous precision with limited labeled data and a large amount of unlabeled data. It means that a large number of unlabeled process data from distributed control systems (DCSs) can be exploited reasonably.

Present paper focuses on the construction of real-time risk warning systems by using GAN. The significant contribution of present works is twofold: first, based on the process data, a complete fuzzy HAZOP-based risk analysis procedure is proposed to perform hazard identification, risk analysis, and decision making; second, GAN is developed to address the problem of the scarce labeled data for real-time risk assessment. Besides, deep network architecture integrated with CNN is exploited to codify multi-dimensional data and enhance the generalization of warning models. Proposed real-time risk warning framework not only can deeply extract the feature information from scarce labeled data but also can apply extra numerous unlabeled process samples to improve the model performance as well.

The remaining papers are organized as follows. In Section 2, the mechanism of risk warning methodology including risk-based pretreatment process and GAN-based semi-supervised learning is presented. A case study of a multizone circulating reactor is manifested in Section 3. Finally, conclusions are made in Section 4.

Section snippets

Real-time risk warning framework

Integrating with HAZOP analysis and DCS monitoring, we propose a semi-supervised learning based industrial risk warning framework in this section. The training process mainly consists of two parts. At first, qualitative factors such as human factors, environmental factors, and historical information are codified as one-dimensional one-channel data x*(t)∈ R^k for the training of a former network, in which t is the present time point, k is the number of qualitative information, and the input

Risk analysis of MZCR

Gas-phase multizone circulating reactor (MZCR) is the most central production unit in the polyolefin process (Covezzi & Mei, 2001). In the MZCR, the organic polymer granule is continuously circulated between two polymerization zones: upward by fast fluidization, in the “riser” leg and downward through gravity, in the “downer” leg. The multiple short passes of the organic particle between the two zones lead to intimate and adequate mixing of very different polymers (Severn, Chadwick, Duchateau &

Conclusions

In the present paper, a novel methodology of GAN-based semi-supervised learning is utilized for real-time risk warning of process industries. Fuzzy HAZOP is applied to perform risk-based pretreatment. Based on deep convolutional network structure and adversarial learning algorithm, a deep neural network of GAN for the accident risk warning that is of crucial importance in industrial operations is constructed. The integration of unsupervised feature extraction and semi-supervised learning

CRediT authorship contribution statement

Rui He: Investigation, Methodology, Writing - original draft, Software, Writing - review & editing. Xinhong Li: Conceptualization, Methodology. Guoming Chen: Conceptualization, Funding acquisition, Methodology. Guoxing Chen: Writing - review & editing, Writing - original draft. Yiwei Liu: Writing - review & editing.

Declaration of Competing Interest

No conflict of interest exists in the submission of this manuscript, and all authors approve the manuscript for publication. We would like to declare on behalf of co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Acknowledgments

The authors gratefully acknowledge the financial support provided by China National Key Research and Development Program (No: 2016YFC0802305).

References (59)

J. Ahn et al.
Fuzzy-based HAZOP study for process industry
Journal of Hazardous Materials
(2016)
Y. Chang et al.
A risk-based approach to design warning system for processing facilities
Process Safety and Environmental Protection
(2011)
Y. Chen et al.
Urban flood risk warning under rapid urbanization
Environmental Research
(2015)
M. Covezzi et al.
The multizone circulating reactor technology
Chemical Engineering Science
(2001)
J.J. Dabrowski et al.
Systemic banking crisis early warning systems using dynamic Bayesian networks
Expert Systems with Applications
(2016)
Y.Y. Dai et al.
Abnormal situation management for smart chemical process operation
Current Opinion in Chemical Engineering
(2016)
G. Douzas et al.
Effective data generation for imbalanced learning using conditional generative adversarial networks
Expert Systems with Applications
(2018)
J.L. Fuentes-Bargues et al.
Risk assessment of a compound feed process based on HAZOP analysis and linguistic terms
Journal of Loss Prevention in the Process Industries
(2016)
J. Kang et al.
HAZOP analysis based on sensitivity evaluation
Safety Science
(2016)
F. Khan et al.
Techniques and methodologies for risk analysis in chemical process industries
Journal of Loss Prevention in the Process Industries
(1998)

A.S. Markowski et al.

Fuzzy risk matrix

Journal of Hazardous Materials

(2008)

F. Pappenberger et al.

The monetary benefit of early flood warnings in Europe

Environmental Science & Policy

(2015)

E. Salzano et al.

Risk assessment and early warning systems for industrial facilities in seismic zones

Reliability Engineering & System Safety

(2009)

C. Sevim et al.

Developing an early warning system to predict currency crises

European Journal of Operational Research

(2014)

H.Z. Wang et al.

Dynamic quantitative operational risk assessment of chemical processes

Chemical Engineering Science

(2016)

X. Wang et al.

Data supplement for a soft sensor using a new generative model based on a variational autoencoder and Wasserstein GAN

Journal of Process Control

(2020)

H. Wu et al.

Deep convolutional neural network model based chemical process fault diagnosis

Computers and Chemical Engineering

(2018)

Y. Xu et al.

Air quality early-warning system for cities in China

Atmospheric Environment

(2017)

B. Yang et al.

An early warning system for loan risk assessment using artificial neural networks

Knowledge-Based Systems

(2001)

Z.P. Zhang et al.

A deep belief network based fault diagnosis model for complex chemical processes

Computers and Chemical Engineering

(2017)

S. Ali et al.

On designing a new Bayesian dispersion chart for process monitoring

Arabian Journal for Science and Engineering

(2019)

Guidelines for risk based process safety

(2007)

Y. Chen et al.

Traffic flow imputation using parallel data and generative adversarial networks

IEEE Transactions on Intelligent Transportation Systems

(2019)

Z.H. Dai et al.

Good semi-supervised learning that requires a bad GAN

E. Denton et al.

Semi-supervised learning with context-conditional generative adversarial networks

J. Dunjóa et al.

Hazard and operability (HAZOP) analysis: A literature review

Journal of Hazardous Materials

(2010)

T. Economou et al.

On the use of Bayesian decision theory for issuing natural hazard warnings

Proceedings of the Royal Society A: Mathematical, Physical and Engineering

(2016)

L. Goodfellow et al.

Generative adversarial nets

Q. Guo et al.

Intelligent fault diagnosis method based on full 1D convolutional generative adversarial network

IEEE Transactions on Industrial Informatics

(2019)

Cited by (36)

Domain adaptation with label-aligned sampling (DALAS) for cross-domain fault diagnosis of rotating machinery under class imbalance
2024, Expert Systems with Applications
Though existing cross-domain fault-diagnosis methods have shown promising results under domain shift conditions, existing approaches are only valid for class-balanced data. However, situations of class imbalance are inevitable in industrial fields, due to the difficulty in acquiring fault data from in-use machinery. Thus, if existing cross-domain fault diagnosis approaches are directly applied when domain shift and class imbalance coexist, performance degradation can occur. Thus, this research develops a domain adversarial learning network for class-imbalanced data to address situations where class imbalance and domain shift coexist; this situation is referred to as the problem of class imbalance domain adaptation (CIDA). In the proposed method, domain adversarial training is implemented for learning domain-invariant features by reducing the domain shift, and a label-aligned sampling strategy is utilized to deal with the class imbalance. In addition, for further performance enhancement of label-aligned sampling by increasing the accuracy of pseudo labels, metric learning is introduced to enhance the feature distinctiveness by expanding the distance of samples from different classes while decreasing the distance of samples from the same class. The efficiency of the proposed method is validated by applying it to various circumstances using two bearing datasets. The proposed method demonstrates superior performance compared to conventional algorithms in addressing the CIDA problem, according to quantitative and qualitative evaluations.
Semi-supervised learning for industrial fault detection and diagnosis: A systemic review
2023, ISA Transactions
The automation of Fault Detection and Diagnosis (FDD) is a central task for many industries today. A myriad of methods are in use, although the most recent leading contenders are data-driven approaches and especially Machine Learning (ML) methods. ML algorithms fall into two main categories: supervised and unsupervised methods, depending on whether or not the instances are labeled with the expected outputs. However, a new approach called Semi-Supervised Learning (SSL) has recently emerged that uses a few labeled instances together with other unlabeled instances for the training process. This new approach can significantly improve the accuracy of conventional ML models for industrial environments where labeled data are scarce. SSL has been tested as a promising solution over the past few years for several FDD problems, although there have been no systemic reviews of this sort of approach up until the present review. In this study, an attempt to organize the existing literature on SSL for FDD using the taxonomy of van Engelen & Hoos is reported. The most and the least frequently used SSL algorithms are identified and considered in terms of different fault detection tasks and their most common dataset structure. Moreover, a set of best practices are proposed in the conclusions of this work for implementation under real industrial conditions, so as to avoid some of the most common faults.
Gas turbine failure classification using acoustic emissions with wavelet analysis and deep learning
2023, Expert Systems with Applications
Compared to vibration monitoring, acoustic emission (AE) monitoring in gas turbines is highly sensitive to changes that do not involve whole-body motion, such as wear, rubbing, and fluid-induced faults. AE signals captured by suitably mounted sensors can potentially provide early indications of abnormal turbine operation before such abnormalities manifest in structural vibration or emitted airborne noise. However, developing an online fault detection system requires extensive real-time data treatment to extract appropriate features and indicators from raw AE records. To build such a system for industrial turbines, researchers need to understand the AE-generating mechanisms associated with turbine operation and the sources of background noise. In this study, we aim to develop such an understanding using a small-scale turbine whose operational conditions can be modified safely to reflect both normal and faulty conditions. Our signal processing approach involves first extracting a time-series envelope using an averaging time selected to enhance major features and eliminate irrelevant noise. We then generate time–frequency features using a continuous wavelet transform, which are used to train a deep convolutional neural network to classify gas turbine conditions. The resulting model demonstrates high accuracy in classifying two normal running conditions and two faulty conditions at various turbine speeds. Overall, the proposed methodology offers a powerful tool for gas turbine condition monitoring, and we make all associated data available in open-source format to facilitate further research in this field.⁴
Discriminating the default risk of small enterprises: Stacking model with different optimal feature combinations
2023, Expert Systems with Applications
Small enterprise default discrimination establishes a risk discriminant model based on financial data, non-financial data, and external macro conditions of small enterprises to obtain their default discriminant. This study constructs the final default discriminant model, taking the predicted default probability vectors from the first modelling as independent variables. From the 2^e feature combinations composed of e features, it maximises the default discriminant precision of the same training sample. Accordingly, the study inversely infers three optimal feature combinations corresponding to three models, including logistic regression, support vector machine, and linear discriminant analysis, ensuring the discrimination accuracy of the first modelling by the stacking method. Moreover, five features—industry prosperity index, EBITDA margin, current assets turnover ratio, net profit, and per capita disposable income of urban residents—account for 11% of the features in the optimal feature combination, but their importance accounts for 63%; thus, they are critical to the default risk of small enterprises. Further, the macro features significantly influence the default risk of small enterprises. For example, the importance of four macro features—industry prosperity index, consumer price index, per capita disposable income of urban residents, and Engel coefficient—accounts for 26.26% of the features in the optimal feature combination. Notably, the importance of the ‘industry prosperity index’, which is the greatest influencing factor in the feature combination, accounts for 17.63%. Ultimately, the discrimination accuracy of the proposed model is better than that of the seven classical default discriminant models.
An unsupervised domain adaptation approach with enhanced transferability and discriminability for bearing fault diagnosis under few-shot samples
2023, Expert Systems with Applications
As a key component widely used in electric multiple units (EMU), fault diagnosis of EMU bearing is an important link. Typically, labeled data from different conditions provides the most usable domain knowledge. However, many devices face the bottleneck of lacking sufficient labeled data under special conditions, known as few-shot samples distribution. Although unsupervised domain adaptation (UDA) can solve the above problems, existing models achieve sample transfer mainly by learning domain-invariant features in the source and target domains. Moreover, learning domain-invariant features does not necessarily guarantee sufficient discriminability and transferability of the sample. In turn, the samples transfer and the fault discrimination will be greatly affected. In this study, we propose an unsupervised domain adaptation approach with enhanced transferability and discriminability (ETDS-UDA) for bearing fault diagnosis of EMU under few-shot samples. First, we construct an efficient feature extractor (MiniNet) for fault feature extraction. Then, we construct ETDS-UDA based on UDA model by designing strategies that enhance simultaneously transferability and discriminability. Finally, we also propose a balanced strategy and a discriminative feature learning strategy to further optimize the final fault diagnosis. Ultimately, multiple results verify the performance of ETDS-UDA in EMU bearing fault diagnosis under few-shot samples.
The role of artificial intelligence-driven soft sensors in advanced sustainable process industries: A critical review
2023, Engineering Applications of Artificial Intelligence
With the predicted depletion of natural resources and alarming environmental issues, sustainable development has become a popular as well as a much-needed concept in modern process industries. Hence, manufacturers are quite keen on adopting novel process monitoring techniques to enhance product quality and process efficiency while minimizing possible adverse environmental impacts. Hardware sensors are employed in process industries to aid process monitoring and control, but they are associated with many limitations such as disturbances to the process flow, measurement delays, frequent need for maintenance, and high capital costs. As a result, soft sensors have become an attractive alternative for predicting quality-related parameters that are ‘hard-to-measure’ using hardware sensors. Due to their promising features over hardware counterparts, they have been employed across different process industries. This article attempts to explore the state-of-the-art artificial intelligence (Al)-driven soft sensors designed for process industries and their role in achieving the goal of sustainable development. First, a general introduction is given to soft sensors, their applications in different process industries, and their significance in achieving sustainable development goals. AI-based soft sensing algorithms are then introduced. Next, a discussion on how AI-driven soft sensors contribute toward different sustainable manufacturing strategies of process industries is provided. This is followed by a critical review of the most recent state-of-the-art AI-based soft sensors reported in the literature. Here, the use of powerful AI-based algorithms for addressing the limitations of traditional algorithms, that restrict the soft sensor performance is discussed. Finally, the challenges and limitations associated with the current soft sensor design, application, and maintenance aspects are discussed with possible future directions for designing more intelligent and smart soft sensing technologies to cater the future industrial needs.

View all citing articles on Scopus

View full text

Generative adversarial network-based semi-supervised learning for real-time risk warning of process industries

Highlights

Abstract

Introduction

Section snippets

Real-time risk warning framework

Risk analysis of MZCR

Conclusions

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Journal of Hazardous Materials

Process Safety and Environmental Protection

Environmental Research

Chemical Engineering Science

Expert Systems with Applications

Current Opinion in Chemical Engineering

Expert Systems with Applications

Journal of Loss Prevention in the Process Industries

Safety Science

Journal of Loss Prevention in the Process Industries

Journal of Hazardous Materials

Environmental Science & Policy

Reliability Engineering & System Safety

European Journal of Operational Research

Chemical Engineering Science

Journal of Process Control

Computers and Chemical Engineering

Atmospheric Environment

Knowledge-Based Systems

Computers and Chemical Engineering

On designing a new Bayesian dispersion chart for process monitoring

Arabian Journal for Science and Engineering

Guidelines for risk based process safety

Traffic flow imputation using parallel data and generative adversarial networks

IEEE Transactions on Intelligent Transportation Systems

Good semi-supervised learning that requires a bad GAN

Semi-supervised learning with context-conditional generative adversarial networks

Hazard and operability (HAZOP) analysis: A literature review

Journal of Hazardous Materials

On the use of Bayesian decision theory for issuing natural hazard warnings

Proceedings of the Royal Society A: Mathematical, Physical and Engineering

Generative adversarial nets

Intelligent fault diagnosis method based on full 1D convolutional generative adversarial network

IEEE Transactions on Industrial Informatics