1 Introduction

In recent years, equipment fault diagnosis, as a new technology that crosses various disciplines, has been developed rapidly and has produced huge economic benefits [1,2,3,4,5,6]. The centrifugal pump is an important energy conversion and liquid transmission device in the process industry and its working state directly affects the production of the entire operating equipment. A slight damage to the impeller will shorten the running time of the centrifugal pump and disturb the operation of the equipment. When the impeller fails, it will cause damage to the centrifugal pump components or personal injury accidents, which will cause significant economic losses [7, 8]. The normal operation and failure of the centrifugal pump will cause the equipment to vibrate. The vibration signal contains rich information of the pump body running state and is easy to be collected, which can be used to monitor and diagnose the running state of the centrifugal pump. Fault diagnosis is generally divided into three steps: first, we collect the relevant vibration signal of the diagnostic object; then, the signal is analyzed and processed to acquire the characteristics of the vibration signal; finally, pattern recognition and fault diagnosis are performed through the corresponding extracted special diagnosis [9, 10]. The core content is to obtain the effective characteristics of the vibration signal. Due to the complex structure of the centrifugal pump, many excitation sources and mutual interference, the vibration signal of the centrifugal pump is a non-linear and non-stationary signal. Researchers have proposed various effective diagnostic methods to process the collected raw vibration signals of the centrifugal pump, extract effective information, and improve the accuracy of diagnosis. Wenjian Huang et al. [11] extracted the characteristic parameters of the vibration signal through time-domain signal analysis, then the PCA was used to reduce the amount of data, and finally the main component with the largest contribution rate was used as the input signal of SPRT to evaluate the proposed algorithm. Literature [12] proposed an improved deep convolutional neural network (CNN) to identify defects in centrifugal pumps by using sound and image recognition. A feature extraction method based on empirical mode decomposition (EMD) was developed to detect the gravity of cavitation in the centrifugal pump by Azizi et al. [1]. Liu Yang et al. [13] proposed the new method for analysis of big data based on particle swarm optimization wavelet neural network for diagnosis in the gearbox. Literature [8] applies variational mode decomposition (VMD) with different input parameters to fault diagnosis of multi-stage pumps. Signal processing combined with empirical mode decomposition (EMD) and fuzzy c-means clustering is used for monitoring piston pump defects in literature [14]. The traditional decomposition method of processing high-dimensional data will not only lose the integrity of the data but also increase the amount of calculation and introduce redundancy [15,16,17,18]. These methods of extracting time-frequency characteristics from single-channel signals, such as Fourier transform, cannot reflect the internal relationship of non-linear changes between multi-source channel characteristic signals, nor can they eliminate information interference.

Mechanical non-linear multi-fault mode multi-source dynamic feature identification is a technical bottleneck and difficult problem encountered in the application of fault diagnosis in process industry production lines. It not only needs to extract the time-frequency characteristics of multi-source fault signals, but also to ensure the correspondence between non-linear variables and multi-fault modes and multi-source fault features in time, frequency, and space after feature extraction. Parallel factor analysis proposed by Carroll, Chang [19], and Harshman [20] in 1970 is a three-dimensional or multi-dimensional signal processing algorithm that uses iterative least squares to resolve the decomposition and identification of multi-dimensional matrices. The general time-frequency decomposition ignores the spatial information of the vibration signal and cannot handle multi-channel data [21,22,23]. Data framing in the form of a three-way array indexed by channel, frequency and time allows the application of a unique decomposition called Parallel Factor Analysis (PARAFAC). The decomposition uniqueness of PARAFAC model can obtain its model parameters without ambiguity so that the PARAFAC model has important application value. As a data processing algorithm, PARAFAC model has been successfully applied in fluorescence spectroscopy, psychology, signal processing, food science, and other fields. Multi-channel electroencephalogram EEG data can usually be expressed as an M×N×P three-way data set and the components of the three-way data array correspond to the channel (electrodes at different positions), time (data samples) and frequency components. Schmitz, S. applied PARAFAC to analyze the temporal and spatial patterns of functional connections between neurons, which were revealed in the sequence of peaks recorded in the cat’s main visual cortex (area 18) [24]. This parallel factor analysis was applied for decomposing EEG data into space-time-frequency components during the resting state and mental arithmetic by Miwakeichi, Fumikazu et al. [25]. Rost'akova compares non-negative Tucker decomposition with parallel factor analysis to identify and measure human brain electrical rhythms [26]. In the literature [27], Choi, Ji Yeh proposed a new extension function PARAFAC for processing response to three-dimensional data arranged along a two-dimensional domain and one-dimensional parameters. Technically, this method combines PARAFAC with basis function expansion approximation and is applied to EEG data to prove its empirical usefulness. A parallel factor analysis study showed that the frontal lobe area with higher frequency response is the main feature of laser evoked potential in rats with chronic inflammatory pain [28].

Parallel decomposition has attracted great attention, because parallel factorization can process the constructed high-dimensional data as a whole, which not only retains the overall structure information of the data, but also makes the structure more compact and easy to understand. In the literature [29], parallel factor analysis was used as the diagnostic tool through decomposing centrifugal pump diagnostic signal into time-frequency-space modes. Considering the difficulty of extracting fault features from rolling bearings under strong background noise, Yang Cheng [30] proposed a new method based on variable mode decomposition (VMD) and phase space parallel factor analysis to detect weak fault signals of rolling bearings. In order to overcome the inability to extract sparse and interpretable latent variables from batch data, literature [31] proposed a batch three-way data array sparse model based on sparse parallel factor (SPARAFAC) decomposition. Sparse factor matrices have the potential advantage of being easy to interpret because they eliminate redundant data information and show significant variable correlation. In chemistry, medicine, and food science commonly used fluorescence excitation and emission data typically contain several chemical components at different concentrations. Fluorescence spectroscopy can generate a three-way data set with the mode “sample × excitation × emission.” The main purpose of the analysis of this data type is to determine which chemicals are present in each sample and their relative concentrations. Reference [32] conducted a comparison between parallel factor analysis (PARAFAC) and support vector machine (SVM) to identify and distinguish the fluorescence spectrum of coconut water brands. The above results indicate that fluorescence spectroscopy combined with PARAFAC and SVM method has been proved to be a simple and rapid detection method for coconut water and other beverages. This study [33] aims to determine whether the composition or distribution of humus in lake sediments can be characterized by chemometric spectral data. This method determines the three-dimensional excitation emission matrix in the extracted humus and performs spectral analysis of the data by using parallel factor analysis (PARAFAC) with classification tree and regression tree (CART).

The theory of sequential probability ratio test (SPRT) that is a branch of mathematical statistics was proposed by Abraham Wald in 1947 in order to solve the problem of sampling and acceptance of valuable military products. This method provides an approximate formula for the critical value of accepting the null hypothesis H0 or accepting the alternative hypothesis H1 based on the sample values obtained from each observation, and also provides the average sampling times and power function of this test method. In 1948, Abraham Wald and American statistician Wolfowitz proved that the above-mentioned sequential probability is the smallest number of sampling times required for the test in all the two types of tests whose error probability does not exceed α and β, respectively. The sequential probability ratio test is the most fundamental sequential test in sequential analysis proposed by Abraham Wald, and it has subsequently been widely developed in various fields. Almost all the hypothesis testing problems of SPRT in mechanical fault diagnosis, such as signal detection, model variable point detection, life data analysis of centrifugal pump, and crack detection of gearbox, can be well applied. This research [34] were performed on actual faults in a laboratory-scale distillation plan based on artificial neural network-multi-layer perceptron (ANN-MLP) and the Wald sequential probability ratio test (SPRT). In the literature [35], Guo Peng proposed Gaussian process and SPRT wind turbine power curve modeling and monitoring. The modeling and monitoring method proposed in this paper successfully identified two wind anemometer failures and pitch system failures. Literature [36] proposed a fault detection algorithm based on the sequential probability ratio test (SPRT) and chi-square test for redundant multi-sensor navigation systems for supersonic cruise ships (HCV).

The rest of this paper is organized as follows. In Section 2, the parallel factorization model and simulation is described. The multi-scale parallel factorization optimization algorithm for non-linear multi-source fault characteristic signal extraction is established. The characteristic factor signal is successfully obtained from the matrix factor, and the “loading” factor and “component” factor are defined. In Section 3, we studied the multi-channel data decision theory based on SPRT, and established an adaptive optimization diagnosis method for tracking and identifying the non-linear multi-dimensional dynamic optimal characteristic signal. To research the validity of the multi-scale parallel factor analysis and SPRT for multi-channel signal in actual complex industrial production, the centrifugal pump fault diagnosis experimental system was designed and implemented in Section 4. Following that, multi-source dynamic feature extraction based on parallel factorization and SPRT for the multi-source condition monitoring of centrifugal pump are presented in Section 5. Finally, the conclusions are drawn in Section 6.

2 The model and simulation

2.1 Parallel factorization model

In a two-dimensional matrix, xi, j is generally used to represent the elements in the two-dimensional matrix (i represents any row, j represents any column). Similarly, in the three-dimensional matrix, we use xi, j, kto represent any element in the three-dimensional matrix. At present, there is no definite name naming subscript k, let us call it “page” [37]. The subscript of the three-dimensional matrix consists of three index value row, column, and page composition. The left picture in Fig. 1 shows the three-dimensional matrix and the right picture shows its sub-matrix. When a certain dimension in a three-dimensional matrix is fixed, it constitutes a sub-matrix of the three-dimensional matrix that is called a slice of the three-dimensional matrix along a certain dimension.

Fig. 1
figure 1

Three-dimensional matrix and its sub-matrix

The expansion of the three-dimensional matrix is actually to rearrange the slices of the three-dimensional matrix to constitute the new two-dimensional matrix. For example, we fixed the rows and columns of a three-dimensional matrix and rearranged its pages to formulate the new two-dimensional matrix. At this time, the number of rows is equivalent to the number of rows I of the original matrix and the number of columns changes from the original J to J × K, denoted asXI × JK. It is expressed as shown in Eq. (1).

$$ {X}^{I\times JK}=\left[{X}_{k=1},{X}_{k=2}\cdots {X}_{k=K}\right] $$
(1)

Of course, it can also be expanded by column, such as XIK × J, which is defined as formula (2).

$$ {X}^{IK\times J}=\left[\begin{array}{c}{X}_{k=1}\\ {}{X}_{k=2}\\ {}\vdots \\ {}{X}_{k=K}\end{array}\right] $$
(2)

After expanding by columns, we acknowledge that I × K is displayed as the number of rows of the new matrix and parameter “J” is the number of columns.

The symbol \( {x}_{i,j,k}=\sum \limits_{f=1}^F{a}_{i,f}{b}_{j,f}{c}_{k,f} \) can be used to express any element in a three-dimensional or larger than three-dimensional matrix, the variables i, j, and k in the formula can be any natural numbers. The elements in the i-th row, j-th column, and k-th page of the matrix X can be represented by xi, j, k. According to the definition, the low-rank decomposition of a two-dimensional matrix can be popularized to the low-rank decomposition of a three-dimensional matrix. Let the element X ∈ CI × J × Kof the three-dimensional matrix be defined as xijk, the variables i, j, and k in the formula can be any natural numbers. Similarly, it can be seen that the three-dimensional matrix can be indicated as the modality of the vector outer product of the following formula (3).

$$ X={a}_1\circ {b}_1\circ {c}_1+,\dots, +{a}_R\circ {b}_R\circ {c}_R=\sum \limits_{r=1}^R{a}_r\circ {b}_r\circ {c}_r $$
(3)

It gives the process of a three-dimensional matrix low rank decomposition in formula (3) and the symbol R of the formula is indicated as the rank of the three-dimensional matrix X, where cr ∈ CK, br ∈ CJ,ar ∈ CI, r = 1,...,R. Harshman names the low-rank decomposition model of three-dimensional matrix given by formula (3) as the general model of parallel factor. The general forms of the parallel factorization model are interpreted in Fig. 2, in which X can be displayed by the popular geometric cube.

Fig. 2
figure 2

The general form of the parallel factorization model

TheseC = [c1, …, cR];B = [b1, …, bR];A = [a1, …, aR] are any given three two-dimensional matrix definitions. We define A, B, and C as the three loading matrices of the parallel factorization model of the general form. Equation (4) is shown as the scalar form of the general parallel factorization model. They are labeled as ckr = [C]k, r, bjr = [B]j, r,air = [A]i, r, and \( {\underline{x}}_{ijk}={\left[\underline{X}\right]}_{i,j,k} \).

$$ {\underline{x}}_{ijk}=\sum \limits_{r=1}^R{a}_{ir}{b}_{jr}{c}_{kr} $$
(4)

The general form of the parallel factorization model can be viewed as the low-rank decomposition of a two-dimensional matrix extending to a three-dimensional matrix. The formula (4) indicates that these subitems \( {\underline{x}}_{ijk} \)of the three-dimensional matrix X can also be denoted as the sum of the products of R elements a, b, and c. Compared with the matrix elements xij in PCA, \( {\underline{x}}_{ijk} \) has three independently changing dimensions called “mode A,” “mode B,” and “mode C.”

2.2 Matrix essential equalization

There is a matrix A ∈ CI × J. If the matrix satisfies \( \overline{A}=A\Pi \Delta \), it is said that A and \( \overline{A} \) are matrix essential equalization, denoted as \( A\cong \overline{A} \). Among them, ∏ is the column exchange matrix and △ is the diagonal scale matrix. There is one and only one non-zero element “1” in each row and each column of the column exchange matrix ∏. The function of the column exchange matrix is to rearrange the column vector of A in the order of ∏ without changing the value of the elements in the vector. The diagonal scale matrix △ is a J×J diagonal matrix with non-zero diagonal elements. The function of △ is to multiply each column of matrix A by a non-zero amplitude.

According to the concept of matrix essential equalization, we take a two-dimensional matrixX = ABTas an example, where A ∈ CM × F,B ∈ CN × F. For any matrix \( \overline{A}\in {C}^{M\times F},\overline{B}\in {C}^{N\times F} \), if \( X={AB}^T=\overline{A}{\overline{B}}^T \)is satisfied, then we can get to formula (5).

$$ \overline{A}=A{\prod}_1{\Delta}_1,\overline{B}=B{\prod}_2{\Delta}_2 $$
(5)

∏2 and ∏2 in the formula are column exchange matrices, which means to rearrange the columns of the A and B matrices. △1 and △2 are diagonal scale matrices, which means that each column of matrix A and B is multiplied by the non-zero coefficient. At this time, the matrix decomposition is said to be unique.

When the matrix factorization is unique, the matrix \( \overline{A},\overline{B} \)obtained by the matrix factorization is not completely equal to the original matrices A and B, they are only the essential equality relationship of the matrix. The essential equal relationship of matrix A and B is shown in the following formula (6) (7).

$$ \overline{A}=A{\prod}_A{\Delta}_A\cong A $$
(6)
$$ \overline{B}=B{\prod}_B{\Delta}_B\cong B $$
(7)

Due to the existence of column exchange matrices ∏A, ∏B and diagonal scale matrices △A, △B, the order and magnitude of the column vectors in matrix \( \overline{A},\overline{B} \) can be different from those of A and B. In matrix theory, they are used to be called column blur and scale blur, which are represented by column exchange matrix ∏ and diagonal scale matrix △, respectively. In the process of matrix decomposition, if there is no structural constraint on the matrices A and B, column blur and scale blur are unavoidable. The above problem can be explained in the vector form of matrix decomposition, which is denotedA = [a1, ⋯, aF], B = [b1, ⋯, bF], whereaf ∈ CI × 1bf ∈ CJ × 1 (f=1,...,F) is the column vector of A and B, respectively. The above formula can be expressed as the following vector form.

$$ X={AB}^T={\mathrm{a}}_1{\mathrm{b}}_1^T+{\mathrm{a}}_2{\mathrm{b}}_2^T+\cdots +{\mathrm{a}}_F{\mathrm{b}}_F^T $$
(8)

In formula (8), \( {\mathrm{a}}_1{\mathrm{b}}_1^T,\cdots, {\mathrm{a}}_F{\mathrm{b}}_F^T \) are F matrices with rank 1. At this time, if the order of \( {\mathrm{a}}_1{\mathrm{b}}_1^T \),...,\( {\mathrm{a}}_F{\mathrm{b}}_F^T \) is changed arbitrarily, the value of matrix X is unchanged. Similarly, if the vector af is multiplied by the non-zero coefficient λf and the corresponding bf is multiplied by a non-zero coefficient 1/λf, the value of X will not change either. Assuming that the order of \( {\mathrm{a}}_1{\mathrm{b}}_1^T \)and \( {\mathrm{a}}_2{\mathrm{b}}_2^T \) in formula (8) is exchanged, it can be rewritten into the following form:

$$ {\displaystyle \begin{array}{c}X={AB}^T={\mathrm{a}}_1{\mathrm{b}}_1^T+{\mathrm{a}}_2{\mathrm{b}}_2^T+\cdots +{\mathrm{a}}_F{\mathrm{b}}_F^T\\ {}={\lambda}_2{a}_2\frac{1}{\lambda_2}{b}_2^T+{\lambda}_1{a}_1\frac{1}{\lambda_1}{b}_1^T+\cdots +{\lambda}_F{a}_F\frac{1}{\lambda_F}{b}_F^T\\ {}=A{\prod}_A{\Delta}_A{\left(B{\prod}_B{\Delta}_B\right)}^T\\ {}=\overline{A}{\overline{B}}^T\end{array}} $$
(9)

The result \( X={AB}^T=\overline{A}{\overline{B}}^T \) can be obtained, the code in the equation is specifically expressed as follows \( \overline{A}=A{\prod}_A{\Delta}_A \),\( \overline{B}=B{\prod}_B{\Delta}_B \).

It can be seen that there are always \( \overline{A} \) and \( \overline{B} \) to achieve matrix decomposition, but they are only essentially equal to A and B. Therefore, column ambiguity and scale ambiguity are inherent ambiguities in the matrix decomposition process. Without additional constraints, the order and magnitude of loading matrix columns cannot be determined through matrix decomposition. Therefore, the uniqueness of matrix factorization given by the definition can also be called “essential uniqueness.” In the actual application process, some methods can be used to eliminate column blur and proportion blur caused by matrix decomposition.

2.3 Recognizability and uniqueness of parallel factorization

The essential feature of the parallel factorization model is the uniqueness of the model. When there is no array blur, the matrices A, B, and C can be identified. The following conclusions can be obtained. When Xi = BDi(A)CT,i=1,2,...,I, A ∈ CI × F,B ∈ CJ × F,C ∈ CK × F is given, if kA + kB + kC ≥ 2F + 2, then these matrices A, B, and C are uniqueness for column exchange and plurality transformation or scale transformation.

The matrix composed of relatively independent columns taken from the absolute continuous distribution has full k-rank. If all three matrices meet this condition, the sufficient condition for recognizability is shown in formula (10).

$$ \min \left(I,F\right)+\min \left(J,F\right)+\min \left(K,F\right)\ge 2F+2 $$
(10)

If the matrices A, B, and C have other structural constraints, better identifiable results may be obtained. PARAFAC uniqueness theorem can be used to obtain the ith submatrix of the X-axis of PARAFAC model:

$$ {X}_i^{J\times K}={BD}_i(A){C}^T,i=1,\dots, I $$
(11)

The matrix in formula (11) satisfies the following A ∈ RI × R, B ∈ RJ × R, C ∈ RK × R. If the following conditions are met in Eq. (12).

$$ {k}_A+{k}_B+{k}_C\ge 2\left(R+1\right) $$
(12)

Even if there is column blur and scale blur, the matrix A, B, and C are unique. In mathematical language, when formula (13) is satisfied

$$ {X}_i^{J\times K}=\hat{B}{D}_i\left(\hat{A}\right){\hat{C}}^T,i=1,\dots, I $$
(13)

The relation shown in formula (14) can be obtained.

$$ \hat{A}=A\prod {\Delta}_1,\hat{B}=B\prod {\Delta}_2,\hat{C}=C\prod {\Delta}_3 $$
(14)

Equation (14) shows that ∏ is a column fuzzy matrix, Δ1Δ2 and Δ3 are the scale fuzzy matrix and the equation of Δ1Δ2Δ3 = I needs to be satisfied.

2.4 Trilinear alternating least square for parallel factor analysis

There are many methods to achieve the decomposition of PARAFAC, and the trilinear alternate least squares (TALS) algorithm is the most widely adopted methodology for data detection of parallel factor trilinear models. The fundamental principle of the TALS is to update the matrix in each step. First of all, TALS is updated by employing least squares (LS) to renovate the residual matrix based on the results of the previous estimate; then, it continues to update other matrices; finally, stop running until the result converges or reaches the set number of iterations after repeating the above steps. The trilinear model of three-dimensional data set X has the configuration shown in formula (15) below.

$$ {x}_{i,j,k}=\sum \limits_{f=1}^F{a}_{i,f}{b}_{j,f}{c}_{k,f}+{e}_{ijk},i=1\dots I;j=1\dots J;k=1\dots K $$
(15)

Where F is the number of factors, ai,f is the i-th element in vector af, bj,f is the j-th element in vector bf, and ck,f is the k-th element in vector cf. The data set X of third-order tensor I × J × K is indicated as “xi,j,k.” The “eijk” represents the error set E of the third-order tensor I × J × K. Equation A = [a1, a2, ⋯, aI] is defined as the I × F matrix; the B = [b1, b2, ⋯, bJ], and C = [c1, c2, ⋯, cK] are defined as the J × F matrix and the K × F matrix.

  1. (1)

    First, matrix A is calculated by formula (16).

$$ \left[\begin{array}{l}{X}_{\mathrm{K}1}\\ {}{X}_{\mathrm{K}2}\\ {}\mathrm{M}\\ {}{X}_{\mathrm{K}\ K}\end{array}\right]=\left[\begin{array}{l} BdiagC\left(1,:\right)\\ {} BdiagC\left(2,:\right)\\ {}\mathrm{M}\\ {} BdiagC\left(K,:\right)\end{array}\right]{A}^T+{E}_K $$
(16)

Formula (16) satisfies XK k = Bdiag(C(k,:))AT +EK k. The error is expressed in terms of EK. The least squares (LS) estimate of AT is calculated by Eq. (17).

$$ {A}^T={\left[\begin{array}{c} BdiagC\left(1,:\right)\\ {} BdiagC\left(2,:\right)\\ {}M\\ {} BdiagC\left(2,:\right)\end{array}\right]}^{+}\left[\begin{array}{c}{X}_{\mathrm{K}1}\\ {}{X}_{\mathrm{K}2}\\ {}\mathrm{M}\\ {}{X}_{\mathrm{K}K}\end{array}\right] $$
(17)

The generalized inverse in formula (17) is represented by []+.

  1. (2)

    Secondly, matrix B is calculated by formula (18).

$$ \left[\begin{array}{c}{Y}_{\mathrm{K}1}\\ {}{Y}_{\mathrm{K}2}\\ {}\mathrm{M}\\ {}{Y}_{\mathrm{K}\ I}\end{array}\right]=\left[\begin{array}{c} CdiagA\left(1,:\right)\\ {} CdiagA\left(2,:\right)\\ {}M\\ {} CdiagA\left(I,:\right)\end{array}\right]{B}^T+{E}_I $$
(18)

Formula (18) satisfies the following YK i = Cdiag(A(i,:))BT +EK i, The error is expressed in terms ofEI. The least squares (LS) estimate of BT is calculated by Eq. (19).

$$ {B}^T={\left[\begin{array}{c} CdiagA\left(1,:\right)\\ {} CdiagA\left(2,:\right)\\ {}\mathrm{M}\\ {} CdiagA\left(I,:\right)\end{array}\right]}^{+}\left[\begin{array}{c}{Y}_{\mathrm{K}1}\\ {}{Y}_{\mathrm{K}2}\\ {}\mathrm{M}\\ {}{Y}_{\mathrm{K}I}\end{array}\right] $$
(19)
  1. (3)

    Thirdly, matrix C is calculated by formula (20).

$$ \left[\begin{array}{l}{Z}_{\mathrm{K}1}\\ {}{Z}_{\mathrm{K}2}\\ {}\mathrm{M}\\ {}{Z}_{\mathrm{K}J}\end{array}\right]=\left[\begin{array}{l} AdiagB\left(1,:\right)\\ {} AdiagB\left(2,:\right)\\ {}\mathrm{M}\\ {} AdiagB\left(J,:\right)\end{array}\right]{C}^T+{E}_J $$
(20)

Where ZKj = Adiag(B(j,:))CT + EKj, j = 1, 2, ⋯, J. The error is expressed in terms of EJ. The least squares estimate of CT is calculated by Eq. (21).

$$ {C}^T={\left[\begin{array}{c} AdiagB\left(1,:\right)\\ {} AdiagB\left(2,:\right)\\ {}\vdots \\ {} AdiagB\left(J,:\right)\end{array}\right]}^{\div}\left[\begin{array}{c}{Z}_{\dots 1}\\ {}{Z}_{\dots 2}\\ {}\vdots \\ {}{Z}_{\dots J}\end{array}\right] $$
(21)
  1. (4)

    Finally, stop running until the result converges or reaches the set number of iterations after repeating the above steps (1)–(3).

Multi-channel vibration signals are collected in this paper to research the fault state of equipment, and a third-order tensor is constructed through continuous wavelet transform. Figure 3 shows the basic structure of the parallel factor analysis decomposition model for fault diagnosis.

Fig. 3
figure 3

The model of multi-scale parallel factor decomposition for fault diagnosis

The Nt, Nd, and Nf of the data matrix \( {\mathrm{S}}_{\left({\mathrm{N}}_{\mathrm{d}}\times {\mathrm{N}}_{\mathrm{f}}\times {\mathrm{N}}_{\mathrm{t}}\right)} \) are the number of data points, the number of channels, and the frequency step size, respectively.

$$ {\hat{S}}_{dft}=\sum \limits_{k=1}^{N_k}{a}_{dk}{b}_{fk}{c}_{ik}+{e}_{dft} $$
(22)

The key issue of this parallel factorization model is to obtain the matrices A, B, and C. The adk, bfk, and ctk are their elements, in which component k represents an atom. These spatial signals, spectral signals, and temporal signals for each atom are indicated as the ak = {adk}, bk = {bfk} and ck = {ctk}. The “eijk” is the error, which forms error set E of the third order tensor I × J × K. The uniqueness of the solution of parallel factor decomposition for fault diagnosis is guaranteed through rank(A) + rank(B) + rank(C) ≥ 2Nk + 2. The decomposition of formula (22) is achieved by solving \( \underset{a_{dk}{b}_{jk}{c}_{ik}}{\min}\left\Vert {\hat{S}}_{dft}-{\sum}_{k=1}^{N_k}{a}_{dk}{b}_{fk}{c}_{tk}\right\Vert \). The main advantage of this method is that the spectrum decomposition of time-varying vibration signal is unique and the best model can be obtained under the principle of minimum square deviation.

The basic steps for implementing the multi-scale parallel factorization model of fault diagnosis are as follows .

  1. (1).

    After the vibration signal is collected, the third order tensor is constructed by continuous wavelet transform.

  2. (2).

    The number of factor F is determined by the principle of consistency in MATLAB.

  3. (3).

    Initialization for load matrix B and C.

  4. (4).

    After initializing and running the matrices B and C, the matrix A is estimated by the least square regression algorithm. A = XZ'(ZZ')−1, Z = (b ⊗ c).

  5. (5).

    Similarly, the matrices B and C are estimated.

  6. (6).

    Continue from step (3) until the result converges or reaches the set number of iterations.

  7. (7).

    Corresponding results obtained.

2.5 Numerical simulation based on parallel factor analysis

Simulation experiments can investigate the characteristic of the results of input signals with different parameters after parallel factor analysis for fault diagnosis. Therefore, the simulation signals are used to simulate the running state of the centrifugal pump to test the method proposed in this article. The simulation signal is shown in the following formula (23).

$$ \mathrm{y}\left(\mathrm{t}\right)=0.01\ast \cos \left(2\ast \mathrm{pi}\ast 400\ast \mathrm{t}-5\right)\ast \exp \left(-0.5\ast \left(\left(\left(\mathrm{t}-0.03\right)/0.003\right)\hat{\mkern6mu} 2\right)\right) $$
(23)

Figure 4 shows the time-domain diagram of the simulated signal. An impact signal appears in the graph at 0.03 μs, which simulates the signal generated when the system fails. It is shown in Fig. 5 that the time-frequency diagram of the simulation signal is obtained by the continuous wavelet transform. It can be seen from the time-frequency diagram of the simulation signal that the dominant frequency of the signal is 400 Hz and the impact signal in the simulation signal appears in the frequency range of 180–400 Hz. For the time-frequency diagram of the simulated signal in Fig. 5, it can also be seen that the impact signal appears at 0.03 μs.

Fig. 4
figure 4

Time-domain diagram of the original simulated signal

Fig. 5
figure 5

Time-frequency diagram of original simulation signal

The simulation signal is constructed into a third-order tensor after continuous wavelet transformation, and then the third-order tensor is decomposed by parallel factors to obtain the result shown in Fig. 6. After parallel factor analysis, we can get the loading value and residual error corresponding to frequency, time, and channel. Comparing the loading value of frequency and time after parallel factor analysis with the time-frequency diagram, we can find their corresponding relationship. The hypothetical frequency curve in the graph fluctuates in the range of 180–400 Hz, which is a contrast relationship between the fluctuations of the simultaneous frequency graph. The time curve fluctuates at 0.03 μs and has the maximum value of the third component and the minimum value of the second component. It can be seen from the simulation signal corresponding to the ground that the simulation signal also has an impact signal at 0.03 μs. This indicates that the parallel factor analysis for high-dimensional data of multi-source feature factors can well detect the characteristics of the shock signal generated by the simulated fault.

Fig. 6
figure 6

Parallel factor analysis of the original simulation signal

The vibration signals collected in engineering are generally mixed with various noise signals. In order to check on the effectiveness in complex conditions, we add the noise signal to the original simulation signal and perform parallel factor analysis on it. Figures 7 and 8, respectively, show the time-domain diagram of the original simulation signal after adding noise to the signal and the time-frequency diagram obtained through continuous wavelet transform. After adding the noise signal to the original simulation signal, it can be seen that the waveform of the noisy simulation signal is similar to the original simulation signal in Fig. 4 and the impact signal is almost covered by the noise signal. The waveform of the noisy simulation signal in Fig. 8 is steeper and more rapid, and there is a larger blurred signal at 10–20 Hz.

Fig. 7
figure 7

Time-domain diagram of noise-added simulation signal

Fig. 8
figure 8

Time-frequency diagram of noise-added simulation signal

The simulation signal with noise is transformed into a third-order tensor after continuous wavelet transformation. The result of the parallel factor analysis of the third-order tensor is shown in Fig. 9. After parallel factor analysis, we can get the loading value and residual parameters corresponding to frequency, time, and channel. Comparing the loading value of frequency and time after parallel factor analysis with the time-frequency diagram, we can find the correspondence between them relationship. The hypothetical frequency curve in the graph fluctuates in the range of 180–400 Hz, which is a contrast relationship between the fluctuations of the simultaneous frequency graph. The time curve fluctuates at 0.03 μs and has a maximum value. We get the normal probability plot and the residual variance corresponding to the data in mode 1, mode 2, and mode 3. This shows that the parallel factor analysis proposed can well detect the characteristics of the impact signal in this paper even when the collected signal contains noise.

Fig. 9
figure 9

Parallel factor analysis of noise added simulation signal

3 Proposed method

Likelihood function is a function of statistical model parameters, which plays a great role in statistical inference. The general method of using likelihood ratio test statistics was proposed by Neyman-Pearson in 1982 [38]. Its basic idea is similar to the maximum likelihood method of parameter estimation theory, which is called likelihood ratio test. For hypothesis H0 : θ = θ0, alternative hypothesis H1 : θ = θ1, x is a set of random variables. When H0 is true, the probability density function of the random variable x is expressed as f(x, θ0). When H1 is true, the probability density function of the random variable x is expressed as f(x, θ1). The likelihood function of the sample is the following formula (24).

$$ L\left(\theta \right)=\prod \limits_{i=1}^nf\left({x}_i,\theta \right) $$
(24)

Therefore, the likelihood ratio test is performed to obtain the statistic L in the following formula (25).

$$ L=\frac{L\left({\theta}_1\right)}{L\left({\theta}_0\right)}=\frac{\prod \limits_{i=1}^nf\left({x}_i,{\theta}_1\right)}{\prod \limits_{i=1}^nf\left({x}_i,{\theta}_0\right)} $$
(25)

If the likelihood ratio L is larger, the parameter θ is more likely to be θ1; it shows that the result may tend to negate H0. On the contrary, if the ratio is smaller, the parameter θ is more likely to be θ0, which indicates that the result may be inclined to accept H0. For a certain limit k, L is defined as shown in the following formula (26).

$$ \varphi (x)=\left\{\begin{array}{cc}1& l>k\\ {}0& l\le k\end{array}\right. $$
(26)

Test φ(x) is called the likelihood ratio test of the above test problem.

Neyman-Pearson proposes a principle to determine the optimal test method: parameter α satisfies formula (27).

$$ \beta \left(\theta \right)\le \alpha \kern0.5em \forall \theta \in {\Theta}_0 $$
(27)

In formula (27), β(θ) is the power function of the test, Θ0 is the parameter space of the null hypothesis H0, and θ is the test parameter. Look for a test that satisfies the above formula so that β(θ) is as large as possible when θ ∈ Θ0.To ensure that the probability of making two types of errors is very small, the sample size must be increased. For field testing, the smaller the sample size, the better when ensuring the reliability of the conclusion. The sequential method proposed by A. Wald solves the problem of optimal selection of sample size and play an important milestone in the history of statistical development.

The probability function f(x, θ) represents the distribution of the random variable x, H0(θ = θ0) and H1(θ = θ1) are the null hypothesis and alternative hypothesis of the random variable x, respectively. When accepting H1, the probability of the sample x1, …, xm for any positive integer m is given by P1m = f(x1, θ1), …, f(xm, θ1), and the probability is given by P0m = f(x1, θ0), …, f(xm, θ0) when accepting H0.The definition of the sequential probability ratio test is as follows: select two normal numbers A and B (B < A) and calculate the probability ratio P1m/P0m at each stage of the test.

  1. (a)

    If p1m/p0m ≥ A, the sequential probability ratio test ends, H1is accepted and H0 is discarded.

  2. (b)

    If p1m/p0m ≤ B, the sequential probability ratio test ends, H0 is accepted and H1 is discarded.

  3. (c)

    If B < p1m/p0m < A, we continue to observe the sequential probability ratio test until the requirement is met.

When SPRT is applied to target recognition, it is first assumed that one of the M alternative hypotheses is the initial hypothesis. The signal propagation waveform is denoted as s(t). When a signal is transmitted, one of the possible waveforms is received and recorded as follows:

$$ y(t)=s(t)\ast {h}_i(t)+n(t)\kern0.5em i\in \left\{1,2,\mathrm{K},M\right\} $$
(28)

Where n(t) is additive white Gaussian noise; the impulse response of the target hypothetical channel is expressed as hi(t) and “*” is the convolution factor.

The signal channel receiving data is defined in formula (29), where Qi represents the target convolution matrix defined in the literature.

$$ y={Q}_is+n $$
(29)

The M target hypotheses are denoted as H1, H2, …, HM, respectively. The parameter αi,j is the probability (i ≠ j) when the true hypothesis Hi is wrongly selected as Hi. After obtaining k-th observations, suppose the likelihood ratio of i and j can be defined as shown in formula (30)

$$ {\Lambda}_{i,j}^k=\frac{p_{i1}\left({\mathrm{y}}_1\right){p}_{i2}\left({\mathrm{y}}_2\right)\Lambda \kern0.5em {p}_{ik}\left({\mathrm{y}}_k\right)\kern0.5em {P}_i}{p_{j1}\left({\mathrm{y}}_1\right){p}_{j2}\left({\mathrm{y}}_2\right)\Lambda \kern0.5em {p}_{j2}\left({\mathrm{y}}_k\right)\kern0.5em {P}_j} $$
(30)

Where pik(yk) is the probability density function (PDF) with k-th data under the i-th hypothesis and yk is the k-th observation data. When the likelihood ratio satisfies formula (31), accept the assumption Hm.

$$ {\Delta}_{i,j}^k>\frac{1-{\theta}_{i,j}}{\theta_{i,j}}\kern0.5em j\ne i $$
(31)

When the likelihood ratio satisfies the formula (31), stop the loop. If the likelihood ratio does not meet the stopping condition, continue to the next iteration. In fact, the probability density function of the observed data is constant and satisfies pi1(y) = pi2(y) = … = pik(y). The intensity waveform is updated with the number of iterations, so the probability density function of the observation data under the condition of additive white Gaussian noise can be defined as formula (32).

$$ {p}_{ik}\left({y}_k\right)=\frac{1}{{\left(\sqrt{2{\pi \sigma}_n^2}\right)}^{L_y}}\times \exp \left[-\frac{1}{2{\sigma}_n^2}{\left({y}_k-{Q}_i{s}_k\right)}^T\kern0.5em \left({y}_k-{Q}_i{s}_k\right)\right] $$
(32)

4 Experiments

4.1 Slurry pump fault test system and experimental design

The experimental system to be established in this project is required to operate the slurry pump under controlled conditions of speed, flow rate, slurry density, and inlet pressure, and to use and replace the impeller of the slurry pump of different grades and wear. Common failure parts of centrifugal pumps include rotor impeller, rolling bearing, seal, coupling, etc., of which impeller and rolling bearing failure account for a large proportion. The schematic diagram of the slurry pump fault diagnosis test system is shown in Fig. 10. The figure shows the three-dimensional schematic diagram of the test circuit and identifies the key components. It mainly includes motors, centrifugal pumps, data acquisition systems, control instruments, glycol cooling tanks, pressure gauges, flow meters, conveyor belts, sand tanks, pipelines, pressure control tanks, density meters, and sampling ports. First, the normal impeller is used in the centrifugal pump to run the slurry pump fault test system for collecting and testing the signal data of the slurry pump vibration, flow, slurry density, motor speed, and pump inlet and outlet pressure. Then impeller perforation, impeller edge damage and blade damage, and its impellers with different degrees of damage were selected to replace the original centrifugal pump impeller. After running the slurry pump fault diagnosis and test system, the data of the vibration, speed, and pump speed of the slurry pump experiment system were collected.

Fig. 10
figure 10

Schematic diagram of slurry pump fault diagnosis test system

Figure 11 shows the process flow chart of the slurry pump fault diagnosis test system. The arrow direction in the figure is the flow direction of the mud when the mud pump fault diagnosis experiment system is running. It is the basis for establishing and running the centrifugal pump fault diagnosis experiment system in this article. The serial number and related schematic diagram in Figure 11 indicate the following meanings: 1—centrifugal pump, 2—motor, 3—inverter, 4—power meter sensor, 5—accelerometer, 6—pressure sensor, 7—flow meter, 8—hole plate, 9—heat exchanger, 10—cooler, 11—temperature sensor, 12—sand, 13—suction pressure control tank, and 14—suction pressure sensor. The fault diagnosis test system for slurry pump contains a Weir/Warman 3/2 CAH slurry pump (40 HP) with impeller C2147(8.4"). The process flow chart of fault diagnosis test system for slurry pump covers the key issues mentioned in this article, but does not cover all aspects of the design of the experimental system. The key issues include that the medium of the cooler in the pipeline is ethylene glycol, the process water is municipal water, and the heat exchanger medium is steam. Microphone means for sound collector.

Fig. 11
figure 11

Process flow chart of fault diagnosis test system for slurry pump

In order for the experiment to run successfully, the designer first needs to design the system after engineering calculation and determine the components. The main equipment required for the experiment includes the centrifugal pump, data acquisition system for vibration data acquisition, sensors, and a laptop computer. Auxiliary equipment including storage tanks, valves, instruments, and drive motors are used to control various functions. The data collected by vibration accelerometers is used to analyze the centrifugal pump system in this experiment. The detailed explanation of the vibration sensor for the system signal acquisition is shown below. In the experiment, three three-axis vibration accelerometers are used. Two of the PCB three-axis ICP (Integrated Circuit Piezoelectric) accelerometers have the sensitivity of 100 mV/g and the frequency range of 2–5 kHz. Another PCB three-axis ICP accelerometer has the range of 0.5–3 kHz and the sensitivity of 1000 mV/g.

4.2 Slurry pump experimental equipment and signal acquisition system

To research the validity of the multi-scale parallel factor analysis and sequential probability ratio test proposed in this paper for multi-channel signal in actual complex industrial production, the centrifugal pump fault diagnosis experimental system was designed. The general Fig. 12a shows the centrifugal pump fault diagnosis experimental system. The data acquisition system is shown in Fig. 12b based on a combination of PC measurement hardware and software, which can input electrical signals from sensors and other instruments into a computer for processing. NI LabView 7.0 was chosen as the measurement standard application software because it is easy to build a graphical measurement interface with the help of a large number of tools and objects. The selected hardware is provided by NI DAQ and is highly compatible with our software applications. In order to collect the vibration signals of the centrifugal pump in three directions for each state, it is necessary to install a short-range but high-sensitivity sensor at the key position. Figure 12d is a schematic diagram of the position of the accelerometer. The standard accelerometer and the high-sensitivity accelerometer are installed on the pump casing near the pump suction port, where they will be close to the parts that are prone to failure. Another standard accelerometer is mounted on the shaft bearing because this location is sensitive to vibrations transmitted from the stuffing box. Real-time signals such as flow rate, pressure, speed, and vibration, can be collected synchronously by the experimental data acquisition system. By commanding the pressure and the flow of the equipment’s loop, we simulate the non-linear operating state of the industrial process of the mechanical system to establish the non-linear multi-fault mode, synchronously collect multi-channel signals, and obtain multi-source signals. The internal interaction mechanism between fluid excitation and vibration response under non-linear operation mechanism can be analyzed.

Fig. 12
figure 12

Experiment system of slurry pump. a Centrifugal pump experimental equipment. b Signal acquisition system. c Sensor for flow rate. d Installation diagram of acceleration sensor

The liquid transported in this experiment is set as mud to better collect vibration signals. In this experiment, normal impeller, and three types of faulty impellers, including impeller perforation, impeller edge damage and blade damage, were set to simulate failures in industrial production. Among them, these three failure modes have clear differences and the typical failures of centrifugal pump impellers can be represented well. The impeller in the normal state is denoted as S1, and the three types of impellers with impeller perforation, impeller edge damage, and blade damage are denoted as S2, S3, and S4. In order to avoid aliasing, the sampling frequency in this experiment is 9009 Hz according to the Nyquist sampling theorem and the data acquisition time is 20 s for each group.

In the experiment, different impellers were replaced to collect the vibration signals of the centrifugal pump under different operation conditions. The steps of the whole experiment are summarized as follows:

  1. (1)

    Establish the experimental device according to the schematic diagram of slurry pump test system shown in Fig. 11. The normal impeller shown in Fig. 13 is used as the impeller of the pump and sediment is added as the pumping medium. After starting the motor, we adjust the motor speed to 1200 rpm, 1600 rpm, 1800 rpm, 2200 rpm, and 2600 rpm through the known voltage, motor power, motor efficiency, and other coefficients and instructions. According to the sampling time of 20 s and sampling frequency of 9009 Hz shown in the previous article, NI LabView 7.0 application software and NI DAQ signal acquisition system were run to collect the three sets of three-dimensional vibration signals of the corresponding pump.

  2. (2)

    The normal impeller in the original centrifugal pump is replaced by the impeller perforation of the fault S2 in Fig. 13, and the other parts remain unchanged. Follow the previous steps to start the centrifugal pump and collect data. When a set of data is collected, the speed is set to 1400 rpm, 1600 rpm, and 2600 rpm and the above steps are repeated to collect data.

  3. (3)

    The S3 of impeller edge damage in Fig. 13c is selected to replace the impeller of S2 in the original centrifugal pump and other parts remain unchanged. Similarly, follow the previous steps to start the centrifugal pump and collect data.

  4. (4)

    The S4 of blade damage in Fig. 13d is selected to replace the impeller of S3 in the original centrifugal pump and other parts remain unchanged. Similarly, follow the previous steps to start the centrifugal pump and collect data.

  5. (5)

    After the experiment, the outlet valve of the pump was closed. Close the inlet valve after turning off the motor. Store experimental data to prepare for subsequent vibration signal analysis.

Fig. 13
figure 13

Four failure modes of the impeller in the experiment. a S1 normal impeller. b S2 impeller perforation. c S3 impeller edge damage. d S4 blade damage

5 Results and discussion

5.1 Multi-source dynamic feature extraction based on parallel factorization

This multi-scale parallel factorization method for the extraction of characteristic signals in non-linear multi-source and multi-fault modes is proposed in the article. Parallel factorization can not only perform high-dimensional data processing, but also has the uniqueness of the decomposition. This property makes the results of parallel factorization more realistic and has specific physical meanings. The third-order tensor constructed by multi-channel vibration signals through continuous wavelet transform is decomposed into the series of different modes of channel/frequency/time by the multi-scale parallel factor analysis algorithm. The spatial information is introduced into the time-frequency analysis of signals to form the three-dimensional spatial/time/frequency characteristic analysis of each factor. The simulation results show that the parallel factor decomposition for the tensor built by multi-channel signal has the compatibility of decomposition path and overall consistency. As a result, the topographic map, spectrum, and time contour of the multi-source fault signal in the centrifugal pump experiment are acquired. The multi-scale parallel decomposition method for extracting multi-source feature signals of non-linear failure modes is applied in the fault diagnosis of centrifugal pumps. It analyzes the internal connection between the optimal decomposition paths of multi-source signal feature factors. The optimal non-linear correspondence relationship between failure modes and characteristic signals in time, frequency, and space are constructed. Based on the correspondence and overall consistency of multi-source feature factor decomposition paths, we remodeled three-dimensional fault feature models such as the frequency spectrum and time profile of the fault feature factors, successfully extracted non-linear multi-dimensional dynamic fault feature signals. Finally, the corresponding fluctuation regularities of the homologous non-linear failure mode in the multi-source signals were displayed.

Figure 14 shows the time-frequency diagram obtained of the vibration signals collected in the X-axis direction of the three vibration signal collection points by continuous wavelet transformation when the slurry pump is in normal operation. Figure 15 shows the result of the parallel factor analysis of vibration signal of slurry pump after continuous wavelet transform in normal state. In this experiment, three-dimensional vibration sensors are set up at three measuring points. We analyze the vibration signals of these three measuring points to explore the three-dimensional spatial distribution and characteristic propagation path of dynamic characteristics on the mechanical structure of the slurry pump. Three groups of original vibration signals are transformed by continuous wavelet to obtain three-dimensional time-frequency signals to construct a third-order matrix. After multi-scale parallel factor analysis for the third-order tensor, the loading values and residual variance of the aisle, time, and frequency factors are obtained.

Fig. 14
figure 14

Time-frequency diagram of 3 vibration signals in normal state S1

Fig. 15
figure 15

Parallel factor analysis when the centrifugal pump in normal state S1

Figure 16 shows the time-frequency diagram of the vibration signals collected in the X-axis direction of the three vibration signal collection points by continuous wavelet transformation when the slurry pump is in S2 impeller perforation. In the S2 state, the third-order tensor of 3 × 126 × 4096 is constructed by continuous wavelet transform. Figure 17 indicates the result of the loading values and residual variance of the aisle, time, and frequency modes by the parallel factor analysis for the third-order tensor of slurry pump in state S2.

Fig. 16
figure 16

Time-frequency diagrams of 3 vibration signals under fault state S2

Fig. 17
figure 17

Parallel factor analysis when the centrifugal pump fails state S2

Figure 18 shows the time-frequency diagram of the vibration signals collected in the X-axis direction of the three vibration signal collection points by continuous wavelet transformation when the slurry pump is in S3 impeller edge damage. In the S3 state, the third-order tensor of 3 × 126 × 4096 is constructed by continuous wavelet transform. Figure 19 indicates the result of the loading values and residual variance of the aisle, time, and frequency modes by the parallel factor analysis for the third-order tensor of slurry pump in state S3.

Fig. 18
figure 18

Time-frequency diagrams of 3 vibration signals under fault state S3

Fig. 19
figure 19

Parallel factor analysis when the centrifugal pump fails state S3

The operating state of the slurry pump system in our experimental device system has normal operation and three failure states. The failure states include impeller holes, leading edge damage, and propeller blade damage. Similarly, Fig. 20 shows the time-frequency diagram when the slurry pump is in S4 propeller blade damage. Figure 21 indicates the result of the loading values and residual variance of the aisle, time and frequency modes by the parallel factor analysis for state S4.

Fig. 20
figure 20

Time-frequency diagrams of 3 vibration signals under fault state S4

Fig. 21
figure 21

Parallel factor analysis when the centrifugal pump fails state S4

We analyze the vibration signals of these three measuring points to discuss the three-dimensional spatial distribution and characteristic propagation path of dynamic characteristics on the mechanical structure of the slurry pump. By comparing the decomposition results of parallel factor analysis in the normal and fault state, there are obvious difference in the time loading factor and frequency loading factor component. Due to the phenomenon of characteristic coupling and aliasing of mechanical multi-source signals, the parallel factor analysis can optimize the independent characteristics of each channel on the surface of the mechanical structure and eliminate the mutual interference, overlap, and redundancy of the characteristic signals between the channels. Therefore, the parallel factor analysis are effective in providing a basis for subsequent diagnosis of SPRT and the fault identification can be successfully implemented.

5.2 SPRT for the multi-source condition monitoring of centrifugal pump

The proportions of the standard deviation “σ” and mean “μ” of the test signal sequences have significant influence on the likelihood ratio in the sequential probabilistic ratio test. Therefore, the mean value and standard deviation of the frequency loading value after the parallel factor decomposition should be calculated first for the test signal sequence. Assuming the probability distribution of the frequency load value sequence of one set of signals under the multi-source condition monitoring of the centrifugal pump meets the null hypothesis Hi : μ = μi, and the probability distribution of the frequency load value sequence of the other set of signals satisfies the alternative hypothesis Hj : μ = μj [39]. Their corresponding standard deviation σ remains unchanged. When the original hypothesis and the alternative hypothesis are both true, the joint probability density functions of these two sets of sequences are shown below.

$$ {p}_{ik}\left({y}_k\right)=\frac{1}{\sigma \sqrt{2\pi }}\exp \left(-\frac{1}{2{\sigma}^2}{\left({y}_k-{\mu}_i\right)}^2\right) $$
(33)
$$ {p}_{jk}\left({y}_k\right)=\frac{1}{\sigma \sqrt{2\pi }}\exp \left(-\frac{1}{2{\sigma}^2}{\left({y}_k-{\mu}_i\right)}^2\right) $$
(34)

In formula (33), Pik(yk) is the probability density function null hypothesis. Pjk(yk) in formula (34) is the probability density function under the alternative hypothesis. The SPRT probability ratio is calculated in formula (35).

$$ {\lambda}_{i,j}\left({Y}_{Sm}\right)=\frac{\prod \limits_{k=1}^n{P}_{jk}}{\prod \limits_{k=1}^n{P}_{ik}}=\frac{P_{j1}\left({y}_1\right){P}_{j2}\left({y}_2\right)\Lambda \kern0.5em {P}_{jk}\left({y}_k\right)}{P_{i1}\left({y}_1\right){P}_{i2}\left({y}_2\right)\Lambda \kern0.5em {P}_{ik}\left({y}_k\right)}\times \frac{P_{j0}}{P_{i0}} $$
(35)

In order to make the calculation easier in practical applications, the likelihood ratio formula is further derived and simplified to obtain the formula (36). Where YSi and YSj are the to-be-checked sequences of vibration signals Si and Sj, respectively, Δi, j(YSi) and Δi, j(YSj) are the likelihood ratios of the sequence to be tested YSi and YSj, respectively.

$$ {\Delta}_{i,j}\left({Y}_{Sm}\right)=1\mathrm{n}{\lambda}_{i,j}\left({Y}_{Sm}\right)=1\mathrm{n}\frac{\prod \limits_{k=1}^n{P}_{jk}}{\prod \limits_{k=1}^n{P}_{ik}}=\sum \limits_{k=1}^n1\mathrm{n}\frac{P_{jk}}{P_{ik}}\kern0.5em m=i,j $$
(36)

Referring to the sequential probability ratio test algorithm, we compare the likelihood ratio with the thresholds A and B to identify different forms of failure of the centrifugal pump. The size of A and B are closely related to the probability α of type I error and the probability β of type II error. The variables α, β, A, and B are satisfied with the following relationship:

$$ a=\ln A=\ln \frac{1-\beta }{\alpha } $$
(37)
$$ b=\ln B=\ln \frac{\beta }{1-\alpha } $$
(38)

For S1, S2, S3, and S4 four different impeller fault states of the centrifugal pump experimental system, Figure 22 shows the process of using the sequential probability ratio test algorithm to identify the fault. Vibration signals at three different positions collected under two different impeller fault conditions (sj) and (sj) are decomposed by parallel factors to obtain frequency loading values, which are calculated according to formulas (33)–(36) to obtain the likelihood ratio Δi, j of the sequential probability ratio test. The process of centrifugal pump fault identification is shown below. (1) If Δi, j =  ln (λi, j) ∈ (−∞, b], accept Hj, the centrifugal pump system is under the condition (sj). (2) If Δi, j =  ln (λi, j) ∈ [a, ∞), accept Hi, the centrifugal pump is under the condition (si). (3) If Δi, j ∈ [a,  b], the likelihood ratio of sequential probabilistic ratio test continues to be calculated by extracting the next data in the test sequence according to formulas (33)–(36). The likelihood ratio will continue to be compared with the threshold value until the condition (1) or (2) is met or the number of iterations is reached. After the test is stopped and the probability parameters λ1, 2(YS1), λ1, 2(YS2), λ1, 3(YS1), λ1, 3(YS3), λ1, 4(YS1) andλ1, 4(YS4) are obtained, the conditions of centrifugal pump will be distinguished.

Fig. 22
figure 22

Diagnostic process based on binary SPRT for centrifugal pump

The means of the signal S1, S2, S3 and S4 under the four conditions parameters areμ1, μ2, μ3, μ4. Then, the likelihood ratio is calculated and analyzed according to formulas (33)~(36). SPRT probability ratios λi, j(YSi) and λi, j(YSj) are calculated by importing the testing data (YSi, YSj) of the signal waveform for slurry pump Si and Sj conditions to Eq. (35). Compare the likelihood ratios Δi, j(YSi) and Δi, j(YSj) with the threshold to determine the state Si and Sj of the centrifugal pump.

The difference in the variables λi, j(YSi) and λi, j(YSj) is used to distinguish different condition (Si,Sj) of the centrifugal pump. When the iteration periods are determined, the condition S1 is compared with S2, S3, S4 in the relation between SPRT probability ratios. Figure 23a shows the fluctuation curve between the likelihood ratios Δ1, 2(YS1) and Δ1, 2(YS2) of the signals S1 and S2 during the determined number of iteration. It can be seen from the curve in Fig. 23a that Δ1, 2(YS1) < b displays that the centrifugal pump is in the normal state of S1; Δ1, 2(YS2) > a indicates that the centrifugal pump is in a fault state of the S2 impeller perforation. From the curve in Fig. 23b, it can be seen that Δ1, 3(YS1) < b indexes that the centrifugal pump is in the normal state of S1; Δ1, 3(YS3) > a indicates that the centrifugal pump is in the fault state of S3 impeller edge damage. In Fig. 23c, the inequality (Δ1, 4(YS1)) < b is satisfied, the condition is S1. The inequality(Δ1, 4(YS4)) > a is satisfied, the condition of the centrifugal pump is S4 impeller blade damage. The SPRT parameters Δ1, m, m = 2, 3, 4 in Fig. 22a–c are adopted to distinguish that the normal condition (S1) of the centrifugal pump from the conditions (S2,S3,S4).

Fig. 23
figure 23

Result of SPRT in variables λ1, 2, λ1, 3, λ1, 4 with calculation iteration

We found that the different fault states of the centrifugal pump in the experiment can also be distinguished by this method of SPRT. Figure 24a indicates the SPRT parameters λ2, 3 in Eqs. (33–36) of the testing sequences YS2 and YS3. The SPRT inequality (Δ2, 3(YS2)) < b is satisfied, then the condition of the centrifugal pump is S2. The inequality (Δ2, 3(YS3)) > a is satisfied, then the condition of the slurry pump is S3 (impeller edge damage). It can be seen from the curve in Fig. 24b that when (Δ2, 4(YS2)) < b is satisfied, the centrifugal pump is fault S2 (impeller perforation); when (Δ2, 4(YS4)) > a is satisfied, the centrifugal pump is fault S4 (impeller blade damage). When the inequality (Δ3, 4(YS3)) < b is satisfied in Fig. 24c, the condition is S3 (impeller edge damage). When (Δ3, 4(YS4)) > a is satisfied, the condition of the centrifugal pump is S4 (impeller blade damage). The parameters (λi, j(YSi), λi, j(YSj)) are effective indicator to monitor the different conditions of the multi-fault and multi-source centrifugal pump.

Fig. 24
figure 24

Result of SPRT in variables λ2, 3, λ2, 4, λ3, 4 with calculation cycles

6 Conclusion

Parallel factorization can not only perform high-dimensional data processing, but also has the uniqueness of the decomposition. This property makes the results of parallel factorization more realistic and has specific physical meanings. Through numerical simulation, the parallel factorization is used to explore the non-linear correspondence relationship of multi-source signal characteristic factors in time, frequency, and space under different simulation states. By adjusting the different frequency, time, phase, and amplitude of the analog signal, the loading values of the three modes are captured after parallel factorization to research the corresponding relationship between the analog signals. Then, the parallel factor analysis is applied to the centrifugal pump fault diagnosis experimental system to analyze the state characteristics under multiple fault modes. The non-linear multi-dimensional actional fault characteristic parameter of the impellers with different faults of the centrifugal pump was triumphantly acquired, and the corresponding fluctuation regularities of the homologous fault mode characteristics in the multi-source signals were displayed.

The analysis of the comprehensive result graph shows that the centrifugal pump fault diagnosis methodology based on parallel factor analysis of multiple scales and sequential probability ratio test is effective and reliable. This method first analyzes the collected vibration signals through parallel factor analysis and then conducts sequential probability ratio testing. It identifies different failure modes by comparing likelihood ratios and thresholds. Not only normal conditions and fault conditions can be identified, but also different fault conditions can be distinguished. Therefore, the methodology proposed in these contents of article is very suitable for non-linear multi-source and multi-fault signal analysis and processing. The PARAFAC theory proposed in this paper can also be used in the blind separation of mechanical multiple faults.