Transfer fault diagnosis based on local maximum mean difference and K-means

doi:10.1016/j.cie.2022.108568

Computers & Industrial Engineering

Volume 172, Part A, October 2022, 108568

https://doi.org/10.1016/j.cie.2022.108568

Highlights

• A novel transfer fault diagnosis framework is proposed.
• Local maximum mean difference acts on a sparse auto-encoder to achieve subdomain alignment of features.
• The K-means-based method is put forward to explore the structure information of unlabeled target samples.

Abstract

Existing feature-based transfer learning methods have achieved strong performance in transfer fault diagnosis with unlabeled data. However, most of them are global alignment methods based on maximum mean difference (MMD), which ignore the differences between faults and pay little attention to the structural information in the unlabeled target samples. This paper proposes a transfer sparse auto-encoder (SAE) based on local maximum mean difference (LMMD) and K-means to solve these problems. First, we build a deep network based on SAE and LMMD to learn a common latent feature space in which source and target subdomains are aligned. Subsequently, to fully explore the target domain information, we put forward a K-means-based method that obtains the final diagnosis results by synthesizing the source and target domain information in the latent feature space. Lastly, a case study is conducted to verify the robustness and effectiveness of the proposed methods. The experimental results demonstrate that the proposed methods outperform MMD-based methods in the transfer fault diagnosis problem.

Introduction

Bearings are essential but easily damaged components of rotating machinery and often work in complex and varied environments (Lin et al., 2020). Once a bearing fault occurs, it substantially reduces the efficiency of the machine, thereby causing economic losses (Patel & Upadhyay, 2020). Thus, bearing fault diagnosis methods have emerged to address these concerns (Fernández-Francos et al., 2013, Wang and Chen, 2011).

Recently, deep learning-based methods have been applied successfully to fault diagnosis, such as the deep belief network (DBN) (Yu and Liu, 2020, Zhong et al., 2021), deep sparse auto-encoder (SAE) (Li et al., 2019b; Wang et al., 2019), and convolutional neural network (CNN) (Chen et al., 2020a; Souza et al., 2021). However, these methods are data greedy and need a large amount of labeled data for training (Guo et al., 2019). In the real world, bearings often operate under numerous working conditions, and it is very difficult to record bearing failures under each working condition because of the huge consumption of human and financial resources. As a result, only some unlabeled operating data can be collected under some working conditions. Moreover, the collected vibration signals vary linearly or nonlinearly across working conditions, so models trained under other working conditions cannot be directly applied to these working conditions owing to the different data distributions.

Transfer learning (TL) (Chen et al., 2020b; Li et al., 2020) offers an efficient means of handling these problems by using knowledge learned from one working condition to solve problems in different but related working conditions. Most recent works are statistical moment matching-based methods (Si et al., 2021, Wei et al., 2021). They seek a common feature space in which the difference between the target and source domains is minimized. To achieve this goal, several metrics have been proposed to measure the differences between domains (Che et al., 2020, Jin et al., 2020, Lu et al., 2020). As an effective nonparametric distance measure, the maximum mean difference (MMD) has become one of the most popular methods in transfer fault diagnosis (Li et al., 2019c; Yang et al., 2019). For example, Wen et al. (2019) successfully applied the SAE combined with an MMD penalty term to the transfer diagnosis task on the CWRU dataset.
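To make the MMD criterion concrete, the following is a minimal sketch of a biased empirical estimate of the squared MMD with a Gaussian kernel. The function names, the bandwidth `sigma`, and the synthetic data are illustrative assumptions and are not taken from the cited works.

```python
import numpy as np

def gaussian_kernel(a, b, sigma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    # Pairwise squared Euclidean distances, shape (len(a), len(b)).
    sq_dist = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq_dist / (2.0 * sigma ** 2))

def mmd2(source, target, sigma=1.0):
    """Biased estimate of the squared MMD between two sample sets."""
    k_ss = gaussian_kernel(source, source, sigma).mean()
    k_tt = gaussian_kernel(target, target, sigma).mean()
    k_st = gaussian_kernel(source, target, sigma).mean()
    return k_ss + k_tt - 2.0 * k_st

# Synthetic stand-ins for features from two working conditions (assumption).
rng = np.random.default_rng(0)
xs = rng.normal(0.0, 1.0, size=(200, 4))        # source features
xt_same = rng.normal(0.0, 1.0, size=(200, 4))   # target with the same distribution
xt_shift = rng.normal(1.5, 1.0, size=(200, 4))  # target with a shifted distribution

mmd_same = mmd2(xs, xt_same)
mmd_shift = mmd2(xs, xt_shift)
print(f"same distribution: {mmd_same:.4f}, shifted distribution: {mmd_shift:.4f}")
```

The estimate stays close to zero when both sample sets come from the same distribution and grows when the target is shifted, which is the quantity a global alignment penalty would drive down during training.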

However, these MMD-based methods are global alignment approaches that do not consider the differences in distribution between faults. As a result, they align the domains but also blur the differences between faults, leading to negative transfer. To take into account the differences between domains and between categories simultaneously, Long et al. (2013) proposed joint maximum mean difference (JMMD), which achieves conditional probability alignment by generating pseudo-labels on the target domain data. This pseudo-label-based method is not robust enough, since misalignment accumulates with pseudo-label errors. For robust subdomain alignment, Zhu et al. (2020) designed the local maximum mean discrepancy (LMMD), which achieves subdomain alignment by assigning different weights to samples. The sample weights are generated iteratively according to the output probabilities of the classifier.
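The weighting idea behind LMMD can be sketched as follows: each class gets its own weighted MMD term, with source weights derived from one-hot labels and target weights derived from class probabilities, and the terms are averaged over classes. This is a minimal sketch under stated assumptions: the function names, the Gaussian kernel with bandwidth `sigma`, and the toy data are hypothetical, and the demo feeds one-hot vectors in place of real classifier outputs for the target.

```python
import numpy as np

def gaussian_kernel(a, b, sigma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    sq_dist = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq_dist / (2.0 * sigma ** 2))

def class_weights(probs):
    """Normalize each class column so that the weights of a class sum to one."""
    probs = np.asarray(probs, dtype=float)
    col_sum = probs.sum(axis=0, keepdims=True)
    # Classes with no mass get zero weight instead of a division by zero.
    return np.divide(probs, col_sum, out=np.zeros_like(probs), where=col_sum > 0)

def lmmd2(xs, ys_onehot, xt, yt_prob, sigma=1.0):
    """Class-averaged, weighted squared MMD (biased estimate)."""
    ws = class_weights(ys_onehot)  # (n_s, C), from source labels
    wt = class_weights(yt_prob)    # (n_t, C), from target class probabilities
    k_ss = gaussian_kernel(xs, xs, sigma)
    k_tt = gaussian_kernel(xt, xt, sigma)
    k_st = gaussian_kernel(xs, xt, sigma)
    total = 0.0
    for c in range(ws.shape[1]):
        s, t = ws[:, c], wt[:, c]
        total += s @ k_ss @ s + t @ k_tt @ t - 2.0 * (s @ k_st @ t)
    return total / ws.shape[1]

# Toy data: two fault classes with identical class-conditional distributions
# in both domains (assumption for illustration).
rng = np.random.default_rng(0)
n = 100
labels = np.repeat([0, 1], n)
centers = np.array([[-2.0, 0.0], [2.0, 0.0]])
xs = centers[labels] + 0.3 * rng.normal(size=(2 * n, 2))
xt = centers[labels] + 0.3 * rng.normal(size=(2 * n, 2))
onehot = np.eye(2)[labels]

lmmd_matched = lmmd2(xs, onehot, xt, onehot)           # target weights agree with the true classes
lmmd_swapped = lmmd2(xs, onehot, xt, onehot[:, ::-1])  # target weights point at the wrong class
print(f"matched: {lmmd_matched:.4f}, swapped: {lmmd_swapped:.4f}")
```

In this toy setting the two domains have the same overall distribution, so a global MMD term would be near zero in both cases; only the class-weighted term separates the correctly matched subdomains from the swapped ones.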

Nevertheless, since LMMD generates weights iteratively, it is often affected by the initial weights. It is therefore worthwhile to design an effective starting point for LMMD. The SAE provides a robust way to generate the initial parameters of the network by capturing the distribution characteristics of the raw data, and the previous combination of SAE and MMD worked well. Therefore, a neural network constructed by fusing LMMD and SAE is expected to achieve better results.

In addition, most existing methods in transfer fault diagnosis obtain the final classification results through a trained softmax classifier (Wu et al., 2020) or support vector machine (SVM) (Zhang et al., 2020). These methods use only the data in the source domain and ignore the structure information in the unlabeled target domain. However, these unlabeled data often contain meaningful information that can further enhance diagnostic accuracy. To fully explore this valuable information, we resort to the K-means-based method (Tian et al., 2020), which has proved to be an efficient and useful method for pseudo-label propagation.

In summary, the majority of the aforementioned studies are presented in Table 1.

Inspired by the preceding challenges, we propose a novel transfer SAE based on LMMD and K-means (SAE-LK) for transfer fault diagnosis. First, we establish a deep network based on SAE and LMMD (SAE-LMMD) to train a transfer encoder that obtains domain-adaptive features by aligning subdomains. Then, we map the source and target domain data into a common latent feature space with the trained encoder. Moreover, to fully mine the structural information in the target domain, we employ the K-means-based method to generate fault centers of the target domain and obtain the final diagnosis results by measuring the distance between samples and the fault centers. An extensive experiment on a bearing data set was conducted to show the superiority of the proposed SAE-LK.
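The fault-center step can be illustrated with a small sketch. The text here does not spell out the procedure, so the following is one plausible reading and not the authors' implementation: centers are initialized from the class means of labeled source features, refined by K-means iterations on the unlabeled target features, and each target sample is then diagnosed by its nearest center. The function names, the iteration cap, and the synthetic latent features are hypothetical.

```python
import numpy as np

def nearest_center(z, centers):
    """Index of the closest center (Euclidean distance) for each row of z."""
    dist = np.linalg.norm(z[:, None, :] - centers[None, :, :], axis=-1)
    return dist.argmin(axis=1)

def kmeans_diagnose(zs, ys, zt, n_iter=20):
    """Refine source-initialized fault centers on target features, then classify."""
    classes = np.unique(ys)
    # Assumed initialization: per-class means of the labeled source features.
    centers = np.stack([zs[ys == c].mean(axis=0) for c in classes])
    for _ in range(n_iter):
        assign = nearest_center(zt, centers)
        new_centers = centers.copy()
        for k in range(len(classes)):
            if np.any(assign == k):  # keep the old center if a cluster is empty
                new_centers[k] = zt[assign == k].mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return classes[nearest_center(zt, centers)], centers

# Synthetic latent features: three fault classes, with a small residual
# domain shift of +0.5 in the target domain (assumption for illustration).
rng = np.random.default_rng(0)
true_t = np.repeat([0, 1, 2], 50)
mu = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]])
zs = mu[true_t] + 0.3 * rng.normal(size=(150, 2))
ys = true_t.copy()
zt = mu[true_t] + 0.5 + 0.3 * rng.normal(size=(150, 2))

pred, centers = kmeans_diagnose(zs, ys, zt)
accuracy = (pred == true_t).mean()
print(f"target accuracy: {accuracy:.2f}")
```

Because the centers start at the source class means, each cluster index keeps its fault label, while the refinement moves the centers toward the structure of the target data.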

The remainder of this paper is organized as follows. Section 2 introduces the basic theory of SAE and LMMD. Section 3 presents the details of the proposed SAE-LK. Section 4 conducts six transfer tasks on a bearing data set to show the efficiency of the proposed SAE-LK. Lastly, Section 5 draws the conclusion.

Section snippets

Basic theory of sparse auto-encoder

Auto-encoders (AE) (Hinton & Salakhutdinov, 2006) have been viewed as an extremely beneficial base model in fault diagnosis. As shown in Fig. 1, each AE consists of an encoder and a decoder. The encoder aims to learn a hidden representation from the input data, while the decoder is used to reconstruct the input data from the hidden representation. The basic definition is as follows.

Suppose $\{x_{k}\}_{k = 1}^{K}$ is the input data, where each sample has $N$ dimensions and $K$ is the number of samples. Hence, the hidden representation
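The encoder and decoder structure described above can be sketched as a single forward pass. This is a minimal illustration under stated assumptions: sigmoid activations, a squared reconstruction error, and a KL-divergence sparsity penalty with target activation `rho` and weight `beta` are common choices for sparse auto-encoders, but none of them is confirmed by this snippet, and all names and sizes are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sae_loss(x, w_enc, b_enc, w_dec, b_dec, rho=0.05, beta=3.0):
    """Forward pass of a one-hidden-layer sparse auto-encoder and its loss."""
    h = sigmoid(x @ w_enc + b_enc)      # encoder: hidden representation, shape (K, H)
    x_hat = sigmoid(h @ w_dec + b_dec)  # decoder: reconstruction, shape (K, N)
    recon = 0.5 * np.mean(np.sum((x - x_hat) ** 2, axis=1))
    # Mean activation of each hidden unit, clipped to keep the logarithms finite.
    rho_hat = np.clip(h.mean(axis=0), 1e-8, 1.0 - 1e-8)
    kl = np.sum(rho * np.log(rho / rho_hat)
                + (1.0 - rho) * np.log((1.0 - rho) / (1.0 - rho_hat)))
    return recon + beta * kl, h, x_hat

# K = 8 samples with N = 6 dimensions, scaled to [0, 1] to match the sigmoid decoder.
rng = np.random.default_rng(0)
x = rng.uniform(size=(8, 6))
w_enc, b_enc = 0.1 * rng.normal(size=(6, 3)), np.zeros(3)  # H = 3 hidden units
w_dec, b_dec = 0.1 * rng.normal(size=(3, 6)), np.zeros(6)

loss, h, x_hat = sae_loss(x, w_enc, b_enc, w_dec, b_dec)
print(f"loss: {loss:.4f}, hidden shape: {h.shape}, reconstruction shape: {x_hat.shape}")
```

Training would minimize this loss with respect to the four parameter arrays; the sketch only evaluates it once with random weights.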

Problem description

Under some working conditions, the labels of the fault data are completely unavailable owing to huge consumption and difficulty of collection. Some basic definitions are as follows.

The source and target domain data can be represented as $D^{S} = \{x_{i}^{S}, y_{i}^{S}\}_{i = 1}^{n_{S}}$ and $D^{T} = \{x_{i}^{T}, y_{i}^{T}\}_{i = 1}^{n_{T}}$, respectively. Moreover, $x_{i}^{S}$ with label information $y_{i}^{S}$ is the $i$th sample in $D^{S}$. Similarly, $x_{i}^{T}$ with label information $y_{i}^{T}$ is the $i$th sample of $D^{T}$, but $y_{i}^{T}$ is unavailable. The sample numbers in $D^{S}$ and $D^{T}$ are $n_{S}$ and $n_{T}$,

Experimental results and comparative analysis

In this section, the experiment is conducted based on the bearing data set, which is collected on the Spectra Quest rotor experimental platform by Li et al. (2019a).

Conclusion

To fully reduce the difference between the source and target domains and mine the structure information in unlabeled target samples, a novel transfer diagnosis framework is proposed in this paper. First, we built an SAE-LMMD network based on SAE and LMMD to train an encoder that maps source and target data to a common feature space. Based on the trained encoder, we put forward a K-means method to synthesize the source domain and target domain information and produce the final diagnosis results.

CRediT authorship contribution statement

Xue-yang Zhang: Conceptualization, Methodology, Investigation, Funding acquisition, Writing – original draft, Writing – review & editing. Lang He: Data curation, Visualization, Software, Methodology. Xiao-kang Wang: Software, Validation, Visualization. Jian-qiang Wang: Supervision, Software, Validation, Visualization. Peng-fei Cheng: Investigation, Validation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This work was supported by the Basic and Applied Basic Research Foundation of Guangdong Province (No. 2020A1515110576).

References (35)

C. Che et al. (2020). Domain adaptive deep belief network for rolling bearing fault diagnosis. Computers & Industrial Engineering.
S. Chen et al. (2020). One-dimensional convolutional auto-encoder-based feature learning for fault diagnosis of multivariate processes. Journal of Process Control.
Y. Chen et al. (2020). Deep conditional adaptation networks and label correlation transfer for unsupervised domain adaptation. Pattern Recognition.
D. Fernández-Francos et al. (2013). Automatic bearing fault diagnosis based on one-class ν-SVM. Computers & Industrial Engineering.
X. Jin et al. (2020). Joint distribution matching embedding for unsupervised domain adaptation. Neurocomputing.
C. Li et al. (2020). A systematic review of deep transfer learning for machinery fault diagnosis. Neurocomputing.
K. Li et al. (2019). A novel fault diagnosis algorithm for rotating machinery based on a sparsity and neighborhood preserving deep extreme learning machine. Neurocomputing.
X. Li et al. (2019). Multi-Layer domain adaptation method for rolling bearing fault diagnosis. Signal Processing.
H. Lin et al. (2020). Rolling bearing fault diagnosis using impulse feature enhancement and nonconvex regularization. Mechanical Systems and Signal Processing.
C. Lu et al. (2020). Deep transfer neural network using hybrid representations of domain discrepancy. Neurocomputing.
S.P. Patel et al. (2020). Euclidean distance based feature ranking and subset selection for bearing fault diagnosis. Expert Systems with Applications.
J. Si et al. (2021). Unsupervised deep transfer learning with moment matching: A new intelligent fault diagnosis approach for bearings. Measurement.
R.M. Souza et al. (2021). Deep learning for diagnosis and classification of faults in industrial rotating machinery. Computers & Industrial Engineering.
H. Wang et al. (2011). Intelligent diagnosis method for rolling element bearing faults using possibility theory and neural network. Computers & Industrial Engineering.
X. Wang et al. (2019). ReLTanh: An activation function with vanishing gradient resistance for SAE-based DNNs and its application to rotating machinery fault diagnosis. Neurocomputing.
D. Wei et al. (2021). Weighted domain adaptation networks for machinery fault diagnosis. Mechanical Systems and Signal Processing.
Z. Wu et al. (2020). An adaptive deep transfer learning method for bearing fault diagnosis. Measurement.
