Review
An analysis of process fault diagnosis methods from safety perspectives

https://doi.org/10.1016/j.compchemeng.2020.107197Get rights and content

Highlights

  • This paper analyzes the interconnection of fault detection and diagnosis, risk assessment and abnormal situation management.

  • The paper reviews the existing methods and models related to fault detection and diagnosis, abnormal situation management, and risk.

  • The paper also present authors’ thoughts on the research direction in the area of process safety in industrial 4.0.

  • This paper aims to serve as a road map for process safety research to enable safer and sustainable development.

Abstract

Industry 4.0 provides substantial opportunities to ensure a safer environment through online monitoring, early detection of faults, and preventing the faults to failures transitions. Decision making is an important step in abnormal situation management. Assigning risk based on the consequences may provide additional information for abnormal situation management decisions to prevent the accident before it occurs. This paper analyzes the interconnections between the three essential aspects of process safety: fault detection and diagnosis (FDD), risk assessment (RA), and abnormal situation management (ASM) in the context of the current and next generation of process systems. The authors present their thoughts on research directions in process safety in Industry 4.0. This article aims to serve as a road map for the next generation of process safety research to enable safer and sustainable process operations and development.

Introduction

Modern process plants are becoming more complicated due to process units’ interconnectivity resulting from plantwide control and optimization. In such a plant, control systems connect with many sensors and actuators to control plant operations. The sensors aid in monitoring process conditions while the actuators control the process by physically adjusting the system's variables. Despite these monitoring and control measures, processes can drift beyond their safe operating range due to actuator, sensor, or system faults. These faults may lead to a system failure and ultimately cause a plant accident.

In process systems, potential accidents are prevented using layers of protection. Failures in such layers increase accident probability and lead to its consequences. Hazard identification, probability assessment, and the consequences of hazardous incidents, considering the layers of protection in a system, provide an understanding of the system safety status.

Hazard identification, FDD, RA, and mitigation action play vital roles in maintaining plant safety. Many researchers have reviewed hazard identification approaches [Dunjo et al. (2010); Cameron et al. (2017); Willey (2014)]. Also, there are several review articles on FDD methods, by [Gao et al. (2015); Zhong et al. (2018); Puncochar and Skach (2018); Md Nor et al. (2019); Hoang and Kang (2019)], and on RA in process systems, reviewed by [Khan and Abbasi (1998); Khan et al. (2010); Swuste et al. (2016); Amin et al. (2019)]. However, these articles focus on different process safety elements without defining the interrelation and the overall ASM process. For process safety management, the methods and models used must be analyzed in combination to obtain a holistic view of the safety management framework. Therefore, this article attempts to review and analyze process safety elements’ methods and models, focusing on their interrelations. Specifically, the article focuses on addressing the following questions:

  • 1.

    How can risk be used for fault diagnosis and abnormal situation management (process safety perspective)?

    • a.

      How can fault detection and diagnosis be used from the process safety perspective?

    • b.

      How is abnormal situation management practised from the process safety perspective?

    • c.

      How are FDD and ASM integrated with the safety system?

    • d.

      What are the key knowledge and technological gaps in the preceding areas?

  • 2.

    How could operational risk be a tool for process safety management for Industry 4.0?

    • a.

      What are the available approaches for operational risk assessment?

    • b.

      What are the potential uses of machine learning techniques in assessing operational risk?

    • c.

      What are the key knowledge and technological gaps in implementing novel machine learning tools in process safety management?

  • 3.

    What is the way forward with Industry 4.0 to make a smart process plant a safe environment?

According to Iserman and Balle (1997), based on the SAFEPROCESS committee's definition, ASM is a centralized, continuous, and comprehensive process to prevent and control the potential hazards in process systems. Moreover, ASM should identify the deviation from normal operation to faulty and failure conditions and bring the system back to normal operation.

In the process industry, determining the risk margin, using appropriate modeling such as failure models, accident models, and risk models, helps to provide information to prevent the fault from becoming a failure condition.

A failure model evaluates the accident probability by determining system failure based on a data-driven or physical model approach. Similarly, the accident model relates to the causes and effects to address the consequences. However, to develop the failure model and the accident model, hazard identification will be an initial step. When fault leads to failure, the failure model and accident model can evaluate the possible hazards and consequences.

In the process industry, FDD, RA, and ASM may apply in a closed loop. FDDs approaches to determine the fault condition and are initiated to identify the possible hazard. The failure model and accident model evaluate the probability and consequences of the system hazard when the process systems fail to identify and control the system's fault condition by ASM. Assessing a risk margin using risk models gives feedback to ASM regarding the hazardous event. With the feedback information, ASM changes the decision to control the operation.

From the Industry 4.0 perspective, interconnecting FDD, ASM, and RA help to develop an intelligent safety system by learning the risk and taking necessary action autonomously to prevent the hazard.

Several review articles on the recent methods and models for FDD and RA have been published and are available in the open literature. However, these methods have been limited in scope, mainly focusing on FDD methods. This review comprehensively studies ASM from a holistic perspective, discusses the past and present methods and techniques, and directs the future trend of process safety for Industry 4.0. The review's scope includes the topics directly related to system faults, failure analysis, RA, mitigation action, and process safety, published in journal papers. However, to discuss the FDD and ASM standards, some industrial standards and conference papers are used.

The rest of the paper is organized as follows: Sections 2 and 3 focus on past and present methods and models for FDD, and ASM's role to prevent hazards in process systems. Section 4 summarizes the past and present risk assessment models, failure models, and hazard identification to protect the plant from accident consequences. Finally, the last section highlights the next generation research needed for Industry 4.0 and its challenges.

Section snippets

FDD models and ASM methods from a safety perspective

Investigation of past accidents reveals that more than 70% of process accidents have been caused by technical and design failures, including piping system failure, deterioration of construction materials, corrosion and erosion, mass and heat transfer, and failure of the control system[Khan and Abbasi (1999); Duguid (2001) ; Kidam and Hurme (2013)]. Moreover, the AIChE center for chemical process safety investigation reports that almost all accidents are the ultimate result of deviation from

ASM approaches to protect the hazard using FDD

Process systems can be protected from hazards by providing appropriate prevention and control by ASM. This section discusses the approaches and safety standards used in the process industry in ASM.

Review of risk assessment approaches

A process accident generally follows the sequence of initiation (starting event for the accident), propagation (maintaining or expanding an event to prolong an accident scenario), and termination (an event that stops the accident). Hazardous events continuously change in each stage of accident scenarios. Therefore, appropriately investigating the accident scenario at each stage is important in process safety management to identify the hazardous environment. System failures cause process systems

Next-generation process safety and risk management based on process system failure

With extended technology, process system plants become more complex and advanced. Hence, process safety will be a challenging topic in the upcoming years. According to Kamble et al. (2018); Lee et al. (2019); Angelopoulos et al. (2020), Industry 4.0 and smart industrial development process system confer several effects on the design and development of the largest plant industry. This will profoundly affect FDD methods and models, risk assessment approaches, and ASM strategies.

Based on Industry

Conclusions

This review's main objective is to illustrate the safety framework for the process industry by integrating fault detection and diagnosis, abnormal situation management, and risk assessment. The review's main scope was restricted to published journal articles on topics directly related to these three areas. Limited conference papers and industrial standards were also reviewed to discuss the safety levels and standard industrial approaches related to process safety. Overall, this review article

CRediT authorship contribution statement

Rajeevan Arunthavanathan: Conceptualization, Methodology, Software, Investigation, Writing - original draft, Writing - review & editing. Faisal Khan: Conceptualization, Methodology, Formal analysis, Writing - review & editing, Supervision, Project administration, Funding acquisition. Salim Ahmed: Methodology, Validation, Formal analysis, Writing - review & editing, Supervision, Funding acquisition. Syed Imtiaz: Methodology, Writing - review & editing, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors acknowledges financial support from the Natural Sciences and Engineering Research Council (NSERC) of Canada through the Discovery grant and Canada Research Chair Tier I program in Offshore Safety and Risk Engineering.

References (268)

  • P.T. Bullemer et al.

    Common procedural execution failure modes during abnormal situations

    J. Loss Prevent. Process Ind.

    (2011)
  • P. Bullemer et al.

    Common operations failure modes in the process industries

    J. Loss Prevent. Process Ind.

    (2010)
  • I. Cameron et al.

    Process hazard analysis, hazard identification, and scenario definition: Are the conventional tools sufficient, or should and can we do much better?

    Process Saf. Environ. Protect.

    (2017)
  • M. Catelani et al.

    Sensitivity analysis with MC simulation for the failure rate evaluation and reliability assessment

    Measurement

    (2015)
  • M. Caterino et al.

    Appraisal of a new safety assessment method using the petri nets for the machines safety

    IFAC-Pap. Online

    (2018)
  • Y. Chen et al.

    A novel deep learning method based on attention mechanism for bearing remaining useful life prediction

    Appl. Soft Comput.

    (2020)
  • Y. Cheng et al.

    Pattern matching of alarm flood sequences by a modified Smith-Waterman algorithm

    Chem. Eng. Res. Des.

    (2013)
  • J.E. Cockshott

    Probability bow-ties a transparent risk management tool

    Process Saf. Environ. Protect.

    (2005)
  • Y. Dai et al.

    Abnormal situation management for smart chemical process operation

    Curr. Opin. Chem. Eng.

    (2016)
  • J. Dunjo et al.

    Hazard and operability (HAZOP) analysis. A literature review

    J. Hazard. Mater.

    (2010)
  • F. Eljack et al.

    Process safety and abnormal situation management

    Curr. Opin. Chem. Eng.

    (2016)
  • G. Fan et al.

    A hybrid fuzzy evaluation method for curtain grouting efficiency assessment based on an AHP method extended by D numbers

    Expert Syst. Appl.

    (2016)
  • P.M. Frank

    Fault diagnosis in dynamic systems using analytical and knowledge-based redundancy: a survey and some new results

    Automatica

    (1990)
  • P.M. Frank

    Analytical and qualitative model-based fault diagnosis – a survey and some new results

    Eur. J. Control

    (1996)
  • G. Fu et al.

    The development history of accident causation models in the past 100 years: 24model, a more modern accident causation model

    Process Saf. Environ. Protect.

    (2020)
  • Z. Ge

    Review on data-driven modeling and monitoring for plant-wide industrial processes

    Chemometr. Intell. Lab. Syst.

    (2017)
  • P. Goel et al.

    Industrial alarm systems: challenges and opportunities

    J. Loss Prevent. Process Ind.

    (2017)
  • P. Goel et al.

    A data-driven alarm and event management framework

    J. Loss Prevent. Process Ind.

    (2019)
  • L. Guo et al.

    A recurrent neural network-based health indicator for remaining useful life prediction of bearings

    Neurocomputing

    (2017)
  • D.H. Ham et al.

    The effects of presenting functionally abstracted information in fault diagnosis tasks

    Reliab. Eng. Syst. Saf.

    (2001)
  • T.J. Harris et al.

    A review of performance monitoring and assessment techniques for univariate and multivariate control systems

    J. Process Control

    (1999)
  • F. Harrou et al.

    An unsupervised monitoring procedure for detecting anomalies in photovoltaic systems using a one-class support vector machine

    Solar Energy

    (2019)
  • J. Hegde et al.

    Applications of machine learning methods for engineering risk assessment – a review

    Saf. Sci.

    (2020)
  • D.T. Hoang et al.

    A survey on Deep Learning based bearing fault diagnosis

    Neurocomputing

    (2019)
  • E. Hollnagel

    Cognitive Reliability and Error Analysis Method (CREAM)

    (1998)
  • E.S. Hong et al.

    Quantitative risk evaluation based on event tree analysis technique: application to the design of shield TBM

    Tunnel. Undergr. Space Technol.

    (2009)
  • J.C. Hoskins et al.

    Artificial neural network models of knowledge representation in chemical engineering

    Comput. Chem. Eng.

    (1988)
  • G. Hu et al.

    A deep Boltzmann machine and multi-grained scanning forest ensemble collaborative method and its application to industrial fault diagnosis

    Comput. Ind.

    (2018)
  • J. Hu et al.

    A two-level intelligent alarm management framework for process safety

    Saf. Sci.

    (2016)
  • K.C. Hyun et al.

    Risk analysis using fault-tree analysis (FTA) and analytic hierarchy process (AHP) applicable to shield TBM tunnels

    Tunnel. Undergr. Space Technol.

    (2015)
  • M. Ahmed et al.

    Fault detection and diagnosis using principal component analysis of vibration data from a reciprocating compressor

  • M. Alauddin et al.

    A bibliometric review and analysis of data-driven fault detection and diagnosis methods for process systems

    Ind. Eng. Chem. Res.

    (2018)
  • J.S. Alford et al.

    Alarm management for regulated industries

    Chem. Eng. Progress

    (2005)
  • M.H. Amoozgar et al.

    Experimental test of a two-stage Kalman filter for actuator fault detection and diagnosis of an unmanned quadrotor helicopter

    J. Intell. Robot. Syst.: Theory Appl.

    (2013)
  • N. Amruthnath et al.

    A research study on unsupervised machine learning algorithms for early fault detection in predictive maintenance

  • A. Angelopoulos et al.

    Tackling faults in the industry 4.0 era—a survey of machine-learning solutions and key aspects

    Sens. (Switzerland)

    (2020)
  • J. Antonio et al.

    Fault-tree analysis: a knowledge-engineering approach

    IEEE Trans. Reliab.

    (1995)
  • A. Arvanitogeorgos

    Risk analysis in industry. greek institute of health and safety in the work

    ELINYAE

    (1999)
  • S. Ballı et al.

    Operating system selection using fuzzy AHP and TOPSIS methods

    Math. Comput. Appl.

    (2009)
  • L. Beckman

    Easily assess complex safety loops

    Chem. Eng. Progress

    (2001)
  • Cited by (75)

    • Industrial process fault diagnosis based on domain adaptive broad echo network

      2024, Journal of the Taiwan Institute of Chemical Engineers
    • A novel triage-based fault diagnosis method for chemical process

      2024, Process Safety and Environmental Protection
    View all citing articles on Scopus
    View full text