Skip to content
BY 4.0 license Open Access Published by De Gruyter Open Access November 7, 2020

On-line detection algorithm of ore grade change in grinding grading system

  • Jianjun Zhao EMAIL logo and Junwu Zhou
From the journal Open Physics

Abstract

In process industry control, process data is critical for both control and fault diagnosis. Timely detection of outliers and mutation point in process data can quickly adjust control parameters or discover potential failures throughout the system. Aiming at the shortcomings of the traditional mutation point detection method, such as the detection time delay and not being suitable for the process industrial data that mixed outliers, this paper proposes a mutation point and outliers detection method that is suitable for the grinding grading system using the wavelet method to analyze the “Efficient Scoring Vector.” In this algorithm, to distinguish between outliers and mutation points in the detection process, we propose a detection framework based on the relationship between Lipschitz index and wavelet coefficients. Under this framework, the detection algorithm proposed in this paper can detect outliers and mutation points simultaneously. The advantage of this is that the accuracy of the mutation point detection is not affected by the outliers. This means that the method can process grinding grading system process data containing outliers and mutation points under actual operating conditions and is more suitable for practical applications. Finally, the effectiveness and practicability of the proposed detection method are proved by simulation results.

1 Introduction

In process industry control, process data is critical for both control parameter adjustment and fault diagnosis [1]. Timely detection of outliers and trends in process data can quickly adjust control parameters or discover potential failures throughout the system [2]. For example, in the grinding grading system shown in Figure 1, factors such as changes in ore composition, changes in the production side, and wear of the steel ball can cause changes in the condition of the ball mill, resulting in a change in the control effect of the grinding grading process [3]. Whether the change of the ore composition or the wear of the steel ball will first be reflected in the migration of process data, so the rapid and accurate capture of small changes in process data is essential for the grinding and classification process control industry.

Figure 1 
               Configuration of typical SABC grinding and classification process.
Figure 1

Configuration of typical SABC grinding and classification process.

Since the characteristics of the mutation point and the abnormal point have similarity in a short time, it is difficult to distinguish the mutation point and the abnormal point in the initial stage of the process data change in the grinding classification system, so that the change in the ore grade or the potential cannot be found in time. It can be seen that for a process industry such as grinding grading system, it is necessary and important to detect and distinguish abnormal points and change points in a short time. From the current research situation, the detection of abnormal points and change points is almost independent of each other, but in the actual industry, these two kinds of data exist at the same time. In summary, in the research of the process control industry, it is necessary to simultaneously detect and distinguish these two types of data in one detection algorithm.

At present, there are relatively many researches on the problem of abnormal data detection, and some mature and practical methods have been formed. For example, statistical detection algorithms [4], batch-based detection algorithms based on distance or density [5,6], intelligent methods represented by neural networks [7], support vector methods suitable for small sample analysis [8], and cluster analysis method [9]. Since wavelet analysis has good local characteristics of the signal in both the frequency domain and the time domain, it is usually used for signal analysis [10]. In the field of anomaly data detection, the most representative one is the time series anomaly detection method based on the wavelet transform modulus maxima principle proposed by Mallatand and Hwang [11]. This method is suitable for extracting unstable changes from still signals. For unstable process signals, this method is no longer applicable because the modulus maximal principle method does not distinguish between the unstable state (i.e., the change point) and the abnormal change (i.e., the abnormality) of the signal.

In the field of research on mutation point detection, detection methods mainly focus on statistical theoretical methods. For example, in 1996, a method of mutation point detection using the idea of edge likelihood probability was proposed in ref. [12]. The method separates all the data from different positions in the middle and compares the maximum probability of the two parts before and after the segmentation to determine the location of the mutation point. Therefore, the calculation amount is large, which is not suitable for online mutation of a large number of time series in the process industry. Ref. [13] used the Gustafsson theory to determine the location of the mutation, which has the advantage of iterative calculation, so it is very suitable for the solution of the mutation point detection problem of the growth time series. However, the problem with this method is that its computational complexity is closely related to the frequency of occurrence of the abrupt point, which means that the amount of calculation of the method becomes abnormally large without a point of change for a long time. Therefore, in fact, it is only applicable to the problem of sequence data detection in which frequent mutations occur, and when the sequence data to be detected is stable for a long time, the method will have a catastrophic calculation amount. Ref. [14] proposed a change point detection method based on a wavelet footprint. The advantage of this method is that it has good detection accuracy and performance, but it is not suitable for the process industry because it is also a batch detection method. Ref. [15] proposes a method for detecting a sudden change point by comparing the difference between the reference window and the data in the sliding window. The advantage of this method is to achieve on-line detection of the point of mutation. The disadvantages are as follows: first, as the data in the reference window increases, the characterization value of the mutation is also increased, which means that if there is no sudden occurrence for a long time, the characterization value will increase indefinitely. This indicates that the selection of the mutation point characterization value in the algorithm is not ideal. Second, the accuracy of the detection method is proportional to the length of the sliding window, so that when the mutation amplitude is small, the detection algorithm has a large detection delay. For the grinding grading process system, this detection delay is very undesirable.

Considering the two shortcomings of two kinds of window structure detection methods and the necessity of detecting anomalies and change points at the same time, in this paper, we propose a method which can run on-line and detect abnormal points and change-points in time and at the same time – an algorithm for detecting abnormal points and change-points based on an effective fractional vector of wavelet analysis. The advantages and innovations of this method are summarized as follows: (1) in a framework, it can simultaneously detect the abnormal data and change-points, more in line with the actual application needs. (2) The detection algorithm proposed in this paper can realize the on-line real-time monitoring of the change-point and outliers, not the batch method. (3) This method does not have a large delay problem for the detection of the change-point, and the statistics in the detection process do not grow with time.

2 The ESV (effective score vector) detected method [16]

2.1 The efficient score vector

The ESV expression is defined as follows [17]:

(1) V k ( θ , η ) = i = 1 k ξ log f ( x i ; θ , η ) , ξ = ( θ , η ) ,

where f ( x ; θ , η ) is the density function, θ Ω 1 d , d 1 denotes the parameter variable of “interested” in the algorithm; η Ω 2 p , p 0 denotes the parameter variable “not interested” in the algorithm, also called the redundant parameter; Ω = Ω 1 × Ω 2 is the space in which the parameter variable is located; and η ˆ k is the estimated value of the redundant parameter η .

Then, ESV can be expressed as follows:

(2) V k ( θ 0 , η ˆ k ) = i = 1 k ξ log f ( x i ; θ 0 , η ˆ k ) = i = 1 k θ log f ( x i ; θ 0 , η ˆ k ) .

Here, we give two assumptions [16]:

(3) H 0 : θ = θ 0 , η unknown, for all observations; H A : x 1 , , x τ 1 have f ( x ; θ 0 , η ) , η unknown , x τ , x τ + 1 , have f ( x ; θ A , η ) , η , θ A unknown .

In which, θ 0 is the parameter in the distribution of the data before the occurrence of the change-point, θ A is the parameter in the distribution of the data after the occurrence of the change-point, and τ is the moment of the change-point. It can be seen from the assumption that the redundant parameters are unknown.

It can be seen from the ESV expression { V k , k > 1 } in equation (2): when the assumption of H 0 is true, the mean value of the sequence value { V k , k > 1 } is zero, When H A is to be true, the value of the sequence value { V k , k > 1 } of ESV will gradually increase, and its increase will increase linearly with the increase in the data after the change-points.

2.2 Change-point detection algorithm-based ESV [19]

2.2.1 For finite data

Let the total amount of data be n, first give two functions as follows:

(4) a ( x ) = ( 2 log x ) 1 / 2 , b ( x ) = 2 log x + 1 2 log log x 1 2 log π

According to equation (4) and the Lemmas 1 and 2 in ref. [18], the principle of unilateral detection and the principle of bilateral detection are as follows:

  1. Unilateral detection: as the number of data k continues to increase

    (5) k 1 / 2 W k C 1 ( α ) ,

    then the data sequence has a change-point near k , where W k is an expression that contains ESV:

    (6) W k = Γ 1 / 2 ( θ 0 , η ) V k ( θ 0 , η ˆ k ) .

    In which, the expression for Γ ( θ , η ) is

    (7) Γ ( θ , η ) = I 11 I 12 I 22 1 I 21 ,

    where I = I 11 I 12 I 21 I 22 is the information matrix and α = [ α 1 , α i , α d ] is the confidence vector.

  2. Bilateral detection: as the number of data k continues to increase

(8) k 1 / 2 W k C 1 ( α ) ,

then the data sequence has a change-point near k . Likewise, α is the confidence vector.

2.2.2 For infinite data

Here, we introduce a method based on bilateral tests only for infinite data.

In bilateral testing, as the number of data t increases

(9) W ( t ) C 2 ( α ) ,

then the data sequence has a change-point near t . Likewise, α is the confidence vector. And each component in C 2 ( α ) can be calculated according to equation (19):

(10) C 2 i ( α ) = [ 2 log α i + log ( t + 1 ) ] 1 / 2 [ t + 1 ] 1 / 2

3 Improved outlier and change-point detection algorithm

3.1 Solutions for data distribution problems

In the ESV algorithm, it is assumed that the detected data sequence obeys the distribution f . For the grinding analysis system process data, if we want to detect the ore grade value, we can think that it obeys the Gaussian distribution, N ( μ , σ 2 ) .

In this case, we can see that the variance of the Gaussian distribution has changed. Based on the above analysis, we re-give the following assumptions:

(11) H 0 : σ 2 = σ 0 2 , μ unknown , for all observations . H A : e 1 , , e τ 1 have N ( e ; μ , σ 0 2 ) , μ unknown , e τ , e τ + 1 , have N ( e ; μ , σ A 2 ) , μ , σ A 2 unknown .

3.2 Solution for detecting delay problems

For the traditional ESV-based detection algorithm, the disadvantage is that there is a change-point detection delay problem: the smaller the amplitude of the data sequence changed, the more serious the detection delay. The above description can be seen from the simulation (Figure 2).

Figure 2 
                  The chart of 
                        
                           
                           
                              
                                 
                                    W
                                 
                                 
                                    k
                                 
                              
                           
                           {W}_{k}
                        
                      for change-point.
Figure 2

The chart of W k for change-point.

In this paper, the wavelet transform theory is introduced to improve this shortcoming in the ESV algorithm. The detailed contents are the following paragraphs.

In Figure 2, it can be seen that the W k value before the change-point is almost zero, and the wavelet coefficient obtained by the decomposition of the data sequence should be near zero. From the point of the change, the W k value becomes the slope function form, the wavelet coefficients at this change-point will appear to be a modulus maximum. According to the above analysis, we can determine the position of the mutation point by using the wavelet decomposition W k value curve according to the relationship between the Lipschitz index of the function and the maximum value of the wavelet transform modulus [11].

Considering a large amount of data in the data sequence of the grinding classification system, the online recursive wavelet decomposition method proposed in ref. [20] is used to analyze the W k value by online wavelet analysis.

3.3 Change-point, anomaly uniform detection algorithm

3.3.1 ESV representation of abnormal point

First, we use the zero-mean white noise sequence as an example to illustrate the different representations of ESV statistics for change points and outliers. Add an outlier at step 200 of this set of data and start data changes at step 500. The image is shown in Figure 3.

Figure 3 
                     The values of 
                           
                              
                              
                                 
                                    
                                       W
                                    
                                    
                                       k
                                    
                                 
                              
                              {W}_{k}
                           
                         for outlier and change-point.
Figure 3

The values of W k for outlier and change-point.

As can be seen from Figure 3, the W k value of the anomaly point and the change-point is different, the W k curve at the anomaly point is the step function form, and the W k curve of the change-point is expressed as the slope function form.

3.3.2 Change-point and anomaly point detection algorithm based on wavelet analysis

Because of the different characteristics of the ESV statistic of the outliers and the mutation points under the wavelet transform, we propose to use the characterization relationship between the modulus maxima principle of the wavelet transform and the Lipschitz exponent [21] to detect and distinguish the anomaly points and change-points:

When υ > 0 , the wavelet coefficients increase with an increase in the wavelet scale.

When υ = 0 , the wavelet coefficients are independent of the scale.

Use this relationship to detect and distinguish between outliers and change-point. The specific algorithm is as follows:

Step 1. Wavelet decomposition of the fitting residual E ( t ) is carried out at two wavelet scales.

Step 2. Calculate the modulus of the wavelet decomposition coefficients at two scales and make the difference to get the value of E k .

Step 3. According to the following principles for outlier and change-point detection and differentiation:

Case 1. In step 1, there is no modulus max point, and the value of E k in step 2 does not change, indicating that the value of W k curve is always kept near zero, and there is no change, indicating that there is no abnormal point and no change-point here.

Case 2. In step 1, the modulus maxima point appears, and in step 2, E k has no modulus max point, indicating that the wavelet coefficients are the same at the two scales here, and should be where the anomaly is.

Case 3. In step 1 and step 2, there are modulus maxima points, which indicate that the wavelet coefficients are different at two scales and should be the point where the change-points are.

4 Simulation verification

4.1 Verification

To verify the effectiveness of the algorithm, we perform the mean shift processing of the ore grade data in the grinding classification system, which turns it into a set of zero-mean data sequences. As can be seen from the previous analysis, when the ore grade remains unchanged, the grade data processed by the mean shifting should obey the zero-mean normal distribution [22,23]. When the ore product changes, it no longer obeys the zero-mean Gaussian distribution. To simulate the presence of outliers in the data, we randomly add extremum data to the ore grade data. The following two sets of algorithm validation data are obtained:

  1. At 500 steps, the ore grade changes slightly and an abnormal value is added in step 200 to form the first set of data.

  2. The data sequence at 500 steps, the ore grade changes slightly, and the abnormal value is increased in 200 steps to form a second set of data.

The above two sets of data are shown in Figure 4, and the detected results are shown in Figures 5 and 6.

Figure 4 
                  Two groups of data for detection. (a) White noise data, (b) white noise data.
Figure 4

Two groups of data for detection. (a) White noise data, (b) white noise data.

Figure 5 
                  Detection results for the first group of data. (a) The chart of the statistics W
                     
                        k
                     , (b) wavelet decomposition for the statistics W
                     
                        k
                     , (c) the chart of the statistics E
                     
                        k
                     .
Figure 5

Detection results for the first group of data. (a) The chart of the statistics W k , (b) wavelet decomposition for the statistics W k , (c) the chart of the statistics E k .

Figure 6 
                  Detection results for the second group of data. (a) The chart of the statistics W
                     
                        k
                     , (b) wavelet decomposition for the statistics W
                     
                        k
                     , (c) the chart of the statistics E
                     
                        k
                     .
Figure 6

Detection results for the second group of data. (a) The chart of the statistics W k , (b) wavelet decomposition for the statistics W k , (c) the chart of the statistics E k .

In Figures 5(a) and 6(a), the curve for the statistics W k is marked as the outliers. Since the step of the W k curve at the outlier is not obvious, we will enlarge the step here and see it in the small window in Figures 5(a) and 6(a). In conjunction with Figures 5(a) and 6(a), we can see that the W k curve does exhibit a step change at the outliers, and after the change-point at 500 steps, the W k curve exhibits a slope curve whose slope is proportional to the degree of changing in the variance of the data sequence.

Figures 5(b) and 6(b) show the wavelet decomposition coefficient images of the statistical W k at two scales, where the dotted line is the wavelet coefficient at f = 13 and the solid line is the wavelet coefficient under f = 15 . From Figures 5(b) and 6(b), it can be seen that the wavelet coefficients exhibit a modulus maxima at both the outliers and the change-points.

In addition, we calculate the wavelet decomposition coefficient difference of the statistical curves of the two scales, and the results are shown in Figures 5(c) and 6(c). It can be seen from the figure that the wavelet coefficients of the two scales where the outliers are located are almost the same, and the difference is close to zero, indicating that the wavelet modulus maxima are independent of the wavelet scale, that is, the Lipschitz exponent of the step function is zero. Near the 500 steps, the difference between the wavelet coefficients of the two-scale curves increases, that is, for the slope function, the wavelet coefficients will increase with the wavelet scale as they increase [24,25]. It can be seen from the above simulation results that the outlier and change point detection algorithms proposed in this paper have certain effectiveness.

4.2 Algorithm comparison

To prove the advantage of the proposed mutation detection algorithm in detecting the delay, we use the ESV algorithm described in Section 3 to perform the mutation detection to facilitate comparison. Because the traditional ESV algorithm can only detect the abrupt change point and cannot detect the abnormal value, we adjust the two sets of data to be detected, that is, remove the abnormal value [26]. The results of detecting the mutation points of the above two sets of data based on the conventional ESV algorithm are shown in Figure 7.

Figure 7 
                  Detection results by the conventional ESV method. (a) The detection result by ESV method for the data in Figure 3a, (b) the detection result by ESV method for the data in Figure 3b.
Figure 7

Detection results by the conventional ESV method. (a) The detection result by ESV method for the data in Figure 3a, (b) the detection result by ESV method for the data in Figure 3b.

Figure 7(a and b) shows the results of the two sets of data. In Figure 7(a) and (b), the slope curve after 500 steps is a statistical w curve; the broken line is a detection threshold curve calculated by equation (10). The other solid line (always zero) is the zero reference line, indicating that the statistical value before the change point is almost zero.

It can be seen from Figure 7 that the traditional ESV-based change point detection algorithm judges the position of the change point by finding the intersection position of the detection threshold curve and the statistical w curve, but the time for detecting the sudden change point according to this detection mode is far delayed. At the location where the mutation occurred (500 steps). Moreover, this delay is inversely proportional to the magnitude of the mutation, i.e., the smaller the magnitude of the mutation, the later the detected time. From the detection results obtained by the detection method proposed in this paper (see Figures 5 and 6), the detection delay is smaller than the delay of the conventional ESV detection method regardless of the magnitude of the change point. The results show that the proposed detection method has advantages over the traditional ESV algorithm in detecting the change point.

5 Conclusion

In this paper, the grinding grading system is used as the application background. For the characteristics of process industrial process data similar to the grinding grading system and the real-time testing requirements, a method for detecting and distinguishing outliers and sudden points is proposed. The main ideas of the method are as follows: first, the wavelet value is used to analyze the ESV statistical value, which overcomes the shortcoming of the traditional ESV algorithm based on the large delay in detecting the sudden change point. Second, using the difference between the ESV values of the mutation points and outliers and the relationship between the wavelet modulus maxima and the Lipschitz exponent, a method of wavelet decomposition of the ESV curve is proposed to detect and distinguish outliers and mutations point. This makes it possible to detect and distinguish outliers and abrupt points within the same framework and greatly reduces the detection delay of the point of mutation. Simulation results show that the method has certain effectiveness and practicability.

References

[1] Raveendran R, Huang B. Variational Bayesian approach for causality and contemporaneous correlation features inference in industrial process data. IEEE Trans Cybern. 2019;49(7):2580–90.10.1109/TCYB.2018.2829440Search in Google Scholar PubMed

[2] Bao SY, Luo LJ, Mao JF. Robust monitoring of industrial processes in the presence of outliers in training data. Ind Eng Chem Res. 2018;57(24):38.10.1021/acs.iecr.8b00464Search in Google Scholar

[3] Weng XQ, Li HQ, Song SX. Reducing the entrainment of gangue fines in low grade microcrystalline graphite ore flotation using multi-stage grinding-flotation process. Minerals. 2017;7(3):7234–48.10.3390/min7030038Search in Google Scholar

[4] Gumedze FN. Use of likelihood ratio tests to detect outliers under the variance shift outlier model. J Appl Stat. 2019;46(4):598–620.10.1080/02664763.2018.1508559Search in Google Scholar

[5] Jones PR. A note on detecting statistical outliers in psychophysical data. Attention Percept Psychophys. 2019;81:1189–96.10.3758/s13414-019-01726-3Search in Google Scholar PubMed PubMed Central

[6] Ramaswamy S, Rastogi R, Shim K. Efficient algorithms for mining outliers from large data sets. In: Proc of the ACM SIGMOD. New York: ACM; 2000. p. 427–38.10.1145/342009.335437Search in Google Scholar

[7] Markou M, Singh S. Novelty detection: a review – part 2: neural network based approaches. Signal Process. 2003;83(12):2499–521.10.1016/j.sigpro.2003.07.019Search in Google Scholar

[8] Mourao-Miranda J, Hardoon DR, Hahn T, Marquand AF, Williams SCR, Shawe-Taylor J, et al. Patient classification as an outlier detection problem: an application of the one-class support vector machine. Neuroimage. 2011;58(3):793–804.10.1016/j.neuroimage.2011.06.042Search in Google Scholar PubMed PubMed Central

[9] Wang JS, Chiang JC. A cluster validity measure with outlier detection for support vector clustering. IEEE Trans Syst Man Cybern Part B Cybernet. 2008;38(1):78–89.10.1109/TSMCB.2007.908862Search in Google Scholar PubMed

[10] Percival DB, Walden AT. Wavelet Methods for Time Series Analysis. Cambridge: Cambridge University Press; 2006.Search in Google Scholar

[11] Mallat S, Hwang WL. Singularity detection and processing with wavelets. IEEE Trans Inf Theory. 1992;38(2):617–42.10.1109/18.119727Search in Google Scholar

[12] Gustafsson F. The marginalized likelihood ratio test for detecting abrupt changes. IEEE Trans Autom Control. 1996;41(1):66–78.10.1109/9.481608Search in Google Scholar

[13] Guralnik V, Srivastava J. Event detection from time series data. In: Proc of the 5th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM; 1999. p. 33–42.10.1145/312129.312190Search in Google Scholar

[14] Sharifzadeh M, Azmoodeh F, Shahabi C. Change detection in time series data using wavelet footprints. In: Proc of the 9th int Conf on Advances in Spatial and Temporal Databases. Berlin: Springer; 2005. p. 127–44.10.1007/11535331_8Search in Google Scholar

[15] Alarcon-aquino V, Barria JA. Change detection in time series using the maximal overlap discrete wavelet transform. Lat Am Appl Res. 2009;39(2):145–52.Search in Google Scholar

[16] Gombay E, Serban D. Monitoring parameter change in AR(p) time series models. J Multivar Anal. 2009;100(4):715–25.10.1016/j.jmva.2008.08.005Search in Google Scholar

[17] Gombay E. Parametric sequential tests in the presence of nuisance parameters. Theory Stoch Process. 2002;8(24):106–18.Search in Google Scholar

[18] Gombay E. Change detection in utoregressive time series. J Multivar Anal. 2008;99(3):451–64.10.1016/j.jmva.2007.01.003Search in Google Scholar

[19] Gombay E. Sequential change-point detection and estimation. Sequential Anal. 2003;22(3):203–22.10.1081/SQA-120025028Search in Google Scholar

[20] Chaari O, Meunier M, Brouaye F. Wavelets: a new tool for the resonant grounded power distribution systems relaying. IEEE Trans Pover Delivery. 1996;11(3):1301–8.10.1109/61.517484Search in Google Scholar

[21] Pittner S, Kamarthi SV. Feature extraction from wavelet coefficients for pattern recognition tasks. IEEE Trans Pattern Anal Mach Intell. 1999;21(1):83–8.10.1109/ICNN.1997.614016Search in Google Scholar

[22] Atangana A, Gomez-Aguilar JF. A new derivative with normal distribution kernel: Theory, methods and applications. Phys A. 2017;476:1–14.10.1016/j.physa.2017.02.016Search in Google Scholar

[23] Atangana A. Extension of the Sumudu homotopy perturbation method to an attractor for one-dimensional Keller–Segel equations. Appl Math Model. 2015;39(10–11):2909–16.10.1016/j.apm.2014.09.029Search in Google Scholar

[24] Baig AQ, Naeem M, Gao W. Revan and hyper-Revan indices of Octahedral and icosahedral networks. Appl Math Nonlinear Sci. 2018;3(1):33–40.10.21042/AMNS.2018.1.00004Search in Google Scholar

[25] Dewasurendra M, Vajravelu K. On the method of inverse mapping for solutions of coupled systems of nonlinear differential equations arising in nanofluid flow, heat and mass transfer. Appl Math Nonlinear Sci. 2018;3(1):1–14.10.21042/AMNS.2018.1.00001Search in Google Scholar

[26] Qin Y, Lou Y, Zhao Y, et al. Research on relationship between tourism income and economic growth based on meta-analysis. Appl Math Nonlinear Sci. 2018;3:105–14.10.21042/AMNS.2018.1.00008Search in Google Scholar

Received: 2020-04-02
Accepted: 2020-08-24
Published Online: 2020-11-07

© 2020 Jianjun Zhao and Junwu Zhou, published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 26.4.2024 from https://www.degruyter.com/document/doi/10.1515/phys-2020-0142/html
Scroll to top button