A consistency regularization based semi-supervised learning approach for intelligent fault diagnosis of rolling bearing
Introduction
Rolling bearings are among the most important mechanical components and are widely used in rotating machinery. Because a rolling bearing usually carries a large dynamic load during its service life, it is prone to unexpected surface failures [1], [2], [3], [4]. Such failures directly affect the performance of the whole production line and can lead to large economic losses. It is therefore essential to develop a bearing fault diagnosis method that accurately detects a bearing defect at an early stage [5], [6], [7].
Data-driven fault diagnosis methods have become a research trend owing to their high level of automation and fast implementation [8], [9], [10]. The data-driven bearing fault diagnosis methods proposed so far can be broadly classified into two categories: (1) shallow learning based diagnostic methods [11], [12], and (2) deep learning based diagnostic methods [13], [14], [15]. Shallow learning based diagnostic methods use no hidden layers, or only a few, in the network structure and therefore lack the capacity to handle complicated fault classification cases. Moreover, they require a separate feature extraction step, which increases the computational complexity and reduces the level of automation of these techniques [11], [12]. Deep learning based diagnostic methods, on the other hand, integrate feature extraction and fault classification into a single model, which largely addresses these issues and improves the accuracy of the fault diagnosis task. Various deep architectures have been proposed in the literature for fault diagnosis applications [16], [17], [18], [19], [20], [21]. For example, Kong et al. [16] integrated multiple deep auto-encoder architectures into an ensemble learning framework for bearing fault diagnosis. Jing et al. [17] proposed a convolutional neural network (CNN) that extracts discriminative features directly from Fourier-transformed data to realize automatic gearbox fault diagnosis. Jiao et al. [18] proposed a deep coupled CNN architecture to fuse the feature streams from various information sources. Lei et al. [19] proposed a long short-term memory (LSTM) network that extracts discriminative features from multivariate time series while accounting for long-term dependencies, and applied it to the fault diagnosis of wind turbines. An et al. [20] adopted an LSTM network to eliminate the effect of the nonstationary characteristics of vibration data on learning performance.
Deep learning based diagnostic methods can effectively classify fault conditions from recorded data samples without manual intervention. However, most existing deep learning based diagnostic methods assume that the recorded samples are labeled. This ignores the fact that most recorded samples carry no label information, because labeling samples in industrial settings requires considerable manpower and resources. Semi-supervised learning (SSL) methods, which make use of both labeled and unlabeled samples, are a powerful approach to this issue. A series of SSL approaches have recently been proposed to cope with the limited number of labeled samples in industrial fields [22], [23], [24], [25], [26], [27]. For example, Zhang et al. [22] proposed a deep semi-supervised method of multiple association layers networks, built on the ladder network, for the fault diagnosis of planetary gearboxes. Jiang et al. [23] performed process monitoring with limited labeled data and sufficient unlabeled data in a two-step manner. Razavi-Far et al. [24] integrated the statistical features from multiple sensory streams into a semi-supervised deep ladder network for gearbox fault diagnosis. A semi-supervised smooth alpha layering algorithm was proposed in Ref. [25] to realize bearing fault identification with limited labeled samples. Li et al. [26] proposed a semi-supervised gear fault diagnosis method that combines an augmented auto-encoder with augmented monitoring data. Liang et al. [27] transformed one-dimensional monitoring data into two-dimensional time–frequency images and fed them into a semi-supervised generative adversarial network for the fault diagnosis of rotating machinery. Although these methods improve identification ability in the SSL task, their tedious procedures make it difficult to carry out fault diagnosis efficiently.
Data augmentation (DA) is an effective approach to enrich the data library of recorded data, and it has been widely used in object recognition and image processing [28]. The basic idea of data augmentation is to change part of the structure of the recorded data to generate variant versions while the label information of the recorded data remains unchanged [29], [30]. When augmented data are used to update the parameters of a network, they can be viewed as an extra perturbation imposed on the current network. Because the label information of the augmented data remains unchanged, the trained network is expected to become less sensitive to deformations of the recorded data. This idea improves the generalization ability of the network and coincides with the principle of consistency regularization based SSL methods. In these methods, an extra perturbation is first imposed on the input samples or on the hidden states of the network architecture, and a regularization term is then introduced to make the model predictions invariant to that perturbation. Various SSL techniques have been proposed under this framework, such as virtual adversarial training (VAT) [31], MixMatch [32], unsupervised data augmentation (UDA) [33], the ladder network [34] and mean teacher [35]. The main difference among them is how and where the extra perturbation is imposed. In VAT [31], an additive perturbation that maximally changes the output distribution is imposed on the input samples, and the model prediction is enforced to be less sensitive to the imposed perturbation. In MixMatch [32], random horizontal flips and crops are first imposed on the input image data, and MixUp is then used to further mix the labeled and unlabeled samples. Correspondingly, two consistency costs are designed for the labeled data and the unlabeled data to regularize the model. In UDA [33], a consistency regularization framework is first proposed and the effectiveness of various perturbation strategies is then investigated. In the ladder network [34], two parallel encoders are built for labeled and unlabeled data simultaneously. The encoder for labeled data is corrupted by adding isotropic Gaussian noise to its hidden states, whereas the encoder for unlabeled data is noise free, and a regularization term is formulated to enforce consistency between the hidden states of the two encoders. In mean teacher [35], although various perturbations are imposed on the input samples, the output distributions of the student model and the teacher model are expected to be consistent. Most of the perturbations in these works were designed for two-dimensional image data and cannot be applied directly to bearing fault diagnosis, because bearing condition monitoring data are typically one-dimensional.

To address this issue, a new SSL method based on the principle of consistency regularization is proposed in this study for bearing fault diagnosis with limited labeled samples. In the proposed method, a DA method (DAM) designed for one-dimensional bearing fault data imposes the extra perturbation on the original samples, which effectively enriches the data library for both labeled and unlabeled samples. An appreciable label distribution for the unlabeled samples is then formulated from the model predictions for the augmented unlabeled samples. After that, two consistency loss terms are formulated to regularize the model predictions to be invariant to the extra perturbations. A supervised cross entropy loss enforces the model predictions for the augmented labeled samples to be consistent with the true labels of the original labeled samples, and an unsupervised consistency loss minimizes the discrepancy between the label distribution of each original unlabeled sample and its appreciable label distribution. A schematic illustration of the proposed method is given in Fig. 1. The proposed DAM greatly improves the diversity of the original samples, which leads to high identification performance with limited labeled samples. Moreover, the consistency regularization on the unlabeled samples helps the classifier determine the class membership of each unlabeled sample, which pushes the decision boundary away from the high-density regions of the marginal data distribution. The classification results on an experimental bearing dataset demonstrate that the proposed method provides accurate bearing fault diagnosis with limited labeled samples.
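The idea of label-preserving perturbations for one-dimensional vibration data can be illustrated with a short NumPy sketch. This excerpt does not list the strategies that make up the DAM, so the four transformations below (additive noise, amplitude scaling, circular shifting and segment masking), their parameter values and all function names are assumptions chosen for illustration only, not the authors' implementation.

```python
import numpy as np

def add_gaussian_noise(x, rng, snr_db=10.0):
    """Add white Gaussian noise at an assumed signal-to-noise ratio (dB)."""
    signal_power = np.mean(x ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    return x + rng.normal(0.0, np.sqrt(noise_power), size=x.shape)

def amplitude_scale(x, rng, low=0.8, high=1.2):
    """Multiply the whole signal by one random gain drawn from [low, high)."""
    return x * rng.uniform(low, high)

def circular_shift(x, rng):
    """Rotate the signal in time by a random number of samples."""
    return np.roll(x, int(rng.integers(0, x.shape[-1])))

def random_mask(x, rng, max_fraction=0.1):
    """Zero out one randomly placed contiguous segment of the signal."""
    n = x.shape[-1]
    width = int(rng.integers(1, max(2, int(n * max_fraction))))
    start = int(rng.integers(0, n - width + 1))
    out = x.copy()
    out[..., start:start + width] = 0.0
    return out

# Hypothetical strategy pool; the paper's DAM is not specified in this excerpt.
AUGMENTATIONS = (add_gaussian_noise, amplitude_scale, circular_shift, random_mask)

def augment(x, rng, k=2):
    """Return k perturbed copies of x, each produced by one random strategy."""
    choices = rng.integers(0, len(AUGMENTATIONS), size=k)
    return np.stack([AUGMENTATIONS[c](x, rng) for c in choices])

rng = np.random.default_rng(0)
x = np.sin(np.linspace(0.0, 20.0 * np.pi, 1024))  # stand-in for a vibration segment
views = augment(x, rng, k=4)
print(views.shape)  # (4, 1024)
```

Each augmented copy keeps the label of the sample it was generated from, which is what allows the same perturbations to be used for both the labeled and the unlabeled data.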
The main contributions of this study are highlighted as follows:
1) An effective SSL method based on the principle of consistency regularization is proposed for automatic bearing fault diagnosis with limited labeled samples. In the proposed method, a DAM designed for one-dimensional vibration data acts as the extra perturbation imposed on the samples. Correspondingly, two consistency loss terms, one for the labeled samples and one for the unlabeled samples, regularize the model predictions to be invariant to the extra perturbations. The effectiveness and necessity of both consistency terms are validated in the experimental study.
2) Although the labeled samples are limited in the SSL task, the proposed data augmentation strategies greatly improve their diversity. Because the label information of the labeled samples is preserved under augmentation, a standard cross entropy loss on the augmented labeled samples greatly improves the identification performance in the SSL task.
3) The appreciable label information for the unlabeled samples is estimated by averaging and highlighting the one-hot labels of the augmented unlabeled samples. An unsupervised consistency loss is then formulated to minimize the discrepancy between the label distribution of each original unlabeled sample and its appreciable label distribution. The experimental results show that this regularization further improves the identification performance to a certain extent.
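The label estimation and the two loss terms described in the contributions can be sketched as follows. Several points are assumptions, since this excerpt does not give the formulas: "highlight" is read here as temperature sharpening in the style of MixMatch [32], the discrepancy is measured with a squared error, and the names `guess_labels`, `temperature` and `weight` are hypothetical.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def guess_labels(probs_aug, temperature=0.5):
    """Average predictions over K augmented views, then sharpen them.

    probs_aug has shape (K, B, C): K views, B unlabeled samples, C classes.
    Sharpening with temperature < 1 is an assumed reading of "highlight".
    """
    mean = probs_aug.mean(axis=0)
    sharpened = mean ** (1.0 / temperature)
    return sharpened / sharpened.sum(axis=-1, keepdims=True)

def supervised_loss(probs_aug_labeled, onehot):
    """Cross entropy between true labels and predictions on augmented labeled samples."""
    log_p = np.log(probs_aug_labeled + 1e-12)
    return float(-np.mean(np.sum(onehot * log_p, axis=-1)))

def consistency_loss(probs_unlabeled, target):
    """Squared-error discrepancy (an assumed choice) between predictions for the
    original unlabeled samples and their estimated label distributions."""
    return float(np.mean(np.sum((probs_unlabeled - target) ** 2, axis=-1)))

def total_loss(sup, unsup, weight=1.0):
    """Weighted sum of the two terms; the weight is a hypothetical hyperparameter."""
    return sup + weight * unsup

rng = np.random.default_rng(1)
probs_aug = softmax(rng.normal(size=(3, 4, 5)))   # 3 views, 4 samples, 5 classes
target = guess_labels(probs_aug)                  # estimated label distributions
probs_orig = softmax(rng.normal(size=(4, 5)))     # predictions on original samples
print(total_loss(0.0, consistency_loss(probs_orig, target)))
```

Sharpening keeps the most likely class of the averaged prediction but concentrates more probability on it, so the target used by the consistency term is more confident than the raw average.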
The rest of this paper is organized as follows. The principle of the proposed method is given in Section 2. An experimental case study is then presented in Section 3 to validate the effectiveness of the proposed method. Finally, conclusions are drawn in Section 4.
The algorithm of the proposed semi-supervised method
The SSL task in the field of mechanical fault diagnosis is investigated in this study. Taking an assumption that the labeled dataset is represented as , where means the i-th labeled data and N is the time length of the input data, denotes the m-th healthy condition for the i-th labeled data, is the number of the labeled samples. Correspondingly, the unlabeled dataset is expressed as , where is the unlabeled data and is
Experimental setup
The experimental bearing data that constitute the dataset in this study were acquired from the bearing test rig shown in Fig. 4. A single-row angular contact ball bearing of type ER-16K (its geometrical parameters are given in Table 3) was used as the test bearing. Five bearing operating conditions, namely healthy (H), inner race fault (IRF), ball fault (BF), outer race fault (ORF) and a compound fault (CF), were simulated in the experiment. The simulated
Conclusion
In this study, a new SSL method for intelligent fault diagnosis of rolling bearings is presented, based on the principle of consistency regularization. In the proposed method, a DAM consisting of eight DA strategies is proposed to impose extra perturbations on both labeled and unlabeled samples. A label prediction process is then put forward to produce the appreciable label distributions for the unlabeled samples from the model predictions for the augmented unlabeled samples. At the same time, two
CRediT authorship contribution statement
Kun Yu: Conceptualization, Methodology, Writing - original draft. Hui Ma: Supervision. Tian Ran Lin: Validation, Writing - review & editing. Xiang Li: Resources, Supervision, Data curation.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgment
This project is supported by the National Natural Science Foundation (Grant nos. 11972112, 11772089), the Fundamental Research Funds for the Central Universities (Grant nos. N170308028, N2003014, N180708009 and N180306005) and the LiaoNing Revitalization Talents Program (Grant No. XLYC1807008). This project is also supported by the “Taishan Scholar” program of the Shandong provincial government of the People’s Republic of China. We would like to express our deepest appreciation for the valuable
References (53)
- et al. Calculation of mesh stiffness of spur gears considering complex foundation types and crack propagation path. Mech. Syst. Signal Process. (2019)
- et al. Time-varying mesh stiffness calculation of cracked spur gears. Eng. Fail. Anal. (2014)
- et al. Fault diagnosis of rolling bearing based on feature reduction with global-local margin Fisher analysis. Neurocomputing (2018)
- et al. A multi-ensemble method based on deep auto-encoders for fault diagnosis of rolling bearings. Measurement (2020)
- et al. A convolutional neural network based feature learning and fault diagnosis method for the condition monitoring of gearbox. Measurement (2017)
- et al. Fault diagnosis of wind turbine based on Long Short-term memory networks. Renewable Energy (2019)
- et al. A novel bearing intelligent fault diagnosis framework under time-varying working conditions using recurrent neural network. ISA Trans. (2020)
- et al. Digital currency forecasting with chaotic meta-heuristic bio-inspired signal processing techniques. Chaos, Solitons Fractals (2019)
- et al. Fault diagnosis of planetary gearbox using a novel semi-supervised method of multiple association layers networks. Mech. Syst. Signal Process. (2019)
- et al. Semi-supervised fault classification based on dynamic sparse stacked auto-encoders model. Chemom. Intell. Lab. Syst. (2017)
- Semi-supervised gear fault diagnosis using raw vibration signal based on deep learning. Chin. J. Aeronaut.
- A data augmentation method based on cycle-consistent adversarial networks for fluorescence encoded microsphere image analysis. Signal Process.
- Rolling element bearing diagnostics-A tutorial. Mech. Syst. Signal Process.
- Mechanical model development of rolling bearing-rotor systems: A review. Mech. Syst. Signal Process.
- Deep residual learning-based fault diagnosis method for rotating machinery. ISA Trans.
- Model-based uneven loading condition monitoring of full ceramic ball bearings in starved lubrication. Mech. Syst. Signal Process.
- A two-stage method based on extreme learning machine for predicting the remaining useful life of rolling-element bearings. Mech. Syst. Signal Process.
- Application of weighted contribution rate of nonlinear output frequency response functions to rotor rub-impact. Mech. Syst. Signal Process.
- Mechanism and method for outer raceway defect localization of ball bearings. IEEE Access
- Model predictive control of three-axis gimbal system mounted on UAV for real-time target tracking under external disturbances. Mech. Syst. Signal Process.
- Correlation clustering imputation for diagnosing attacks and faults with missing power grid data. IEEE Trans. Smart Grid
- Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process.
- Deep laplacian auto-encoder and its application into imbalanced fault diagnosis of rotating machinery. Measurement