Abstract
Prognostic health management minimizes system downtime and improves overall equipment effectiveness. Accurate prediction of remaining useful life (RUL) is key to prognostics. Prominent machine learning algorithms implement handcrafted feature extraction to improve RUL prediction. Deep learning automates feature extraction from raw data but requires large datasets and computationally expensive fine-tuning. Data-specific handcrafting and fine-tuning limit the generalization capability of existing models. Proposed framework addresses these challenges using Temporal Multivariate 3D Convolutional Network (TM3C) and Kernel-based Transformation (KT) of features. KT generates 3D features that incorporate trendable degradation patterns from multivariate temporal relationship among sensor data. TM3C implements 3D convolutional layers with temporal filters for RUL prediction. KT is generalizable and improves feature relevance. Full-width filters in TM3C reduce number of tunable parameters and convolution operations. Proposed TM3C-KT capitalizes on the strength of deep learning while lowering the cost for feature discovery, parameter learning, and model fine-tuning. TM3C-KT is evaluated on three prognostics applications, (1) RUL prediction for turbofan engines, (2) Failure state estimation for hydraulic pumps, and (3) Component wear prediction for milling machines. Performance of the framework is comparable and better than benchmark methods in literature. Characteristics of the framework are reviewed on generalizability, prognosability and versatility metrics. Results and corresponding analysis demonstrate suitability of TM3C-KT for industrial applications of machine health prognostics.
Similar content being viewed by others
References
Agogino A, Goebel K (2007) Milling data set. http://ti.arc.nasa.gov/project/prognostic-data-repository, (visited on 2019-07-15)
Babu, G. S., Zhao, P., & Li, X. L. (2016). Deep convolutional neural network based regression approach for estimation of remaining useful life. Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9642, 214–228. https://doi.org/10.1007/978-3-319-32025-0_14.
Bengio, Y. (2012). Practical recommendations for gradient-based training of deep architectures. In Neural networks: Tricks of the trade (pp. 437–478). Germany: Springer.
Chandrashekar, G., & Sahin, F. (2014). A survey on feature selection methods. Computers and Electrical Engineering, 40(1), 16–28. https://doi.org/10.1016/j.compeleceng.2013.11.024.
Coble JB (2010) Merging data sources to predict remaining useful life - an automated method to identify prognostic parameters. PhD thesis, University of Tennessee
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition
Friedman, M. (1937). The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the American Statistical Association, 32(200), 675–701. https://doi.org/10.1080/01621459.1937.10503522.
Gamboa JCB (2017) Deep learning for time-series analysis. http://arxiv.org/abs/170101887
Genton, M. G. (2002). Classes of kernels for machine learning: a statistics perspective. Journal of Machine Learning Research, 2, 299–312.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction. Germany: Springer Science & Business Media.
Helwig N, Pignanelli E, Schutze A (2015) Condition monitoring of a complex hydraulic system using multivariate statistics. Conference Record - IEEE Instrumentation and Measurement Technology Conference 2015-July:210–215, https://doi.org/10.1109/I2MTC.2015.7151267
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Huang B, Di Y, Jin C, Lee J (2017) Review of data-driven prognostics and health management techniques: Lessons learned from PHM data challenge competitions. MFPT 2017 Annual Conference: 50 Years of Failure Prevention Technology Innovation pp 1–17
Huang, Z., Zhu, J., Lei, J., Li, X., & Tian, F. (2020). Tool wear predicting based on multi-domain feature fusion by deep convolutional neural network in milling operations. Journal of Intelligent Manufacturing, 31(4), 953–966. https://doi.org/10.1007/s10845-019-01488-7.
Jayasinghe L, Samarasinghe T, Yuen C, Low JCN, Ge SS (2018) Temporal convolutional memory networks for remaining useful life estimation of industrial machinery. arXiv:181005644
Ji, S., Xu, W., Yang, M., & Yu, K. (2013). 3D convolutional neural networks for human action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(1), 221–231. https://doi.org/10.1109/TPAMI.2012.59.
Khalid S, Khalil T, Nasreen S (2014) A survey of feature selection and feature extraction techniques in machine learning. In: Science and Information Conference, pp 1–8
Khan, S., & Yairi, T. (2018). A review on the application of deep learning in system health management. Mechanical Systems and Signal Processing, 107, 241–265. https://doi.org/10.1016/j.ymssp.2017.11.024.
Kim, T. S., & Sohn, S. Y. (2020). Multitask learning for health condition identification and remaining useful life prediction: Deep convolutional neural network approach. Journal of Intelligent Manufacturing,. https://doi.org/10.1007/s10845-020-01630-w.
Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple layers of features from tiny images. Citeseer: Tech. rep.
Kumar S, Torres M, Chan YC, Pecht M (2008) A hybrid prognostics methodology for electronic products. Proceedings of the International Joint Conference on Neural Networks pp 3479–3485, https://doi.org/10.1109/IJCNN.2008.4634294
Laredo, D., Chen, Z., Schütze, O., & Sun, J. Q. (2019). A neural network-evolutionary computational framework for remaining useful life estimation of mechanical systems. Neural Networks, 116, 178–187. https://doi.org/10.1016/j.neunet.2019.04.016.
Lei, Y., Li, N., Guo, L., Li, N., Yan, T., & Lin, J. (2018). Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mechanical Systems and Signal Processing, 104, 799–834. https://doi.org/10.1016/j.ymssp.2017.11.016.
Li, X., Ding, Q., & Sun, J. Q. (2017). Remaining useful life estimation in prognostics using deep convolution neural networks. Reliability Engineering and System Safety, 172, 1–11. https://doi.org/10.1016/j.ress.2017.11.021.
Li, X., Zhang, W., & Ding, Q. (2018). Deep learning-based remaining useful life estimation of bearings using multi-scale feature extraction. Reliability Engineering and System Safety, 182, 208–218. https://doi.org/10.1016/j.ress.2018.11.011.
Liao, L., Jin, W., & Pavel, R. (2016). Prognosability regularization for prognostics and health assessment. IEEE Transactions on Industrial Electronics, 63, 7076–7083.
Listou Ellefsen, A., Bjørlykhaug, E., Æsøy, V., Ushakov, S., & Zhang, H. (2018). Remaining useful life predictions for turbofan engine degradation using semi-supervised deep architecture. Reliability Engineering and System Safety, 183, 240–251. https://doi.org/10.1016/j.ress.2018.11.027.
Liu, Q., Basu, S., Ganguly, S., Mukhopadhyay, S., DiBiano, R., Karki, M., et al. (2020). DeepSat V2: Feature augmented convolutional neural nets for satellite image classification. Remote Sensing Letters, 11(2), 156–165. https://doi.org/10.1080/2150704X.2019.1693071.
Malhotra P, TV V, Ramakrishnan A, Anand G, Vig L, Agarwal P, Shroff G (2016) Multi-sensor prognostics using an unsupervised health index based on LSTM encoder-decoder. arXiv:160806154
Marwan, N., Carmen Romano, M., Thiel, M., & Kurths, J. (2007). Recurrence plots for the analysis of complex systems. Physics Reports, 438(5–6), 237–329. https://doi.org/10.1016/j.physrep.2006.11.001.
Naduvil-Vadukootu S, Angryk RA, Riley P (2017) Evaluating preprocessing strategies for time series prediction using deep learning architectures. FLAIRS 2017 - Proceedings of the 30th International Florida Artificial Intelligence Research Society Conference pp 520–525
Nemenyi, P. B. (1963). Distribution-free multiple comparisons (doctoral dissertation, princeton university, 1963). Dissertation Abstracts International, 25(2), 1233.
Poggio, T., Mhaskar, H., Rosasco, L., Miranda, B., & Liao, Q. (2017). Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review. International Journal of Automation and Computing, 14(5), 503–519. https://doi.org/10.1007/s11633-017-1054-2.
Ramasso E, Saxena A (2014) Review and analysis of algorithmic approaches developed for prognostics on CMAPSS dataset. PHM 2014 - Proceedings of the Annual Conference of the Prognostics and Health Management Society 2014 pp 612–622
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2015). ImageNet large scale visual recognition challenge. International Journal of Computer Vision, 115(3), 211–252. https://doi.org/10.1007/s11263-015-0816-y.
Sadouk, L. (2018). CNN approaches for time-series classification. In: Time series analysis-data, methods, and applications. London: IntechOpen.
Saxena A, Goebel K, Simon D, Eklund N (2008) Damage propagation modeling for aircraft engine run-to-failure simulation. In: 2008 International Conference on Prognostics and Health Management, https://doi.org/10.1109/PHM.2008.4711414
Schneider, T., Helwig, N., & Schütze, A. (2017). Automatic feature extraction and selection for classification of cyclical time series data. Technisches Messen, 84(3), 198–206. https://doi.org/10.1515/teme-2016-0072.
Sharma, A., Vans, E., Shigemizu, D., Boroevich, K. A., & Tsunoda, T. (2019). DeepInsight: A methodology to transform a non-image data to an image for convolution neural network architecture. Scientific Reports, 9(1), 1–7. https://doi.org/10.1038/s41598-019-47765-6.
Shi X, Chen Z, Wang H, Yeung DY, Wong WK, Woo WC (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Advances in Neural Information Processing Systems 2015-January:802–810
Souza C (2010) Kernel Functions for Machine Learning Applications. http://crsouza.com/2010/03/17/kernel-functions-for-machine-learning-applications/, (visited on 2020-01-25)
Sun L, Jia K, Yeung DY, Shi BE (2015) Human action recognition using factorized spatio-temporal convolutional networks. Proceedings of the IEEE International Conference on Computer Vision pp 4597–4605, https://doi.org/10.1109/ICCV.2015.522
Tran D, Bourdev L, Fergus R, Torresani L, Paluri M (2015) Learning spatiotemporal features with 3d convolutional networks. In: Proceedings of the IEEE international conference on computer vision, pp 4489–4497
Vogl, G. W., Weiss, B. A., & Helu, M. (2019). A review of diagnostic and prognostic capabilities and best practices for manufacturing. Journal of Intelligent Manufacturing, 30(1), 79–95. https://doi.org/10.1007/s10845-016-1228-8.
Wienberger K (2018) Lecture 12: bias-variance tradeoff. http://www.cs.cornell.edu/courses/cs4780/2018fa/lectures/lecturenote12.html, (visited on 2020-03-12)
Wu G, Chang EY, Panda N (2005) Formulating distance functions via the kernel trick. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pp 703–709
Wu, Q., Ding, K., & Huang, B. (2020). Approach for fault prognosis using recurrent neural network. Journal of Intelligent Manufacturing, 31(7), 1621–1633. https://doi.org/10.1007/s10845-018-1428-5.
Yoon AS, Lee T, Lim Y, Jung D, Kang P, Kim D, Park K, Choi Y (2017) Semi-supervised Learning with Deep Generative Models for Asset Failure Prediction.
Yuan M, Wu Y, Lin L (2016) Fault diagnosis and remaining useful life estimation of aero engine using LSTM neural network. AUS 2016 - 2016 IEEE/CSAA International Conference on Aircraft Utility Systems pp 135–140, https://doi.org/10.1109/AUS.2016.7748035
Zhang, C., Lim, P., Qin, A. K., & Tan, K. C. (2017a). Multiobjective deep belief networks ensemble for remaining useful life estimation in prognostics. IEEE Transactions on Neural Networks and Learning Systems, 28(10), 2306–2318. https://doi.org/10.1109/TNNLS.2016.2582798.
Zhang, W., Min-Ping Jia, B., Lin Zhu, B., & Xiao-An Yan, B. (2017b). Comprehensive overview on computational intelligence techniques for machinery condition monitoring and fault diagnosis. Chinese Journal of Mechanical Engineering,. https://doi.org/10.1007/s10033-017-0150-0.
Zhang, X., Xiao, P., Yang, Y., Cheng, Y., Chen, B., Gao, D., et al. (2019). Remaining useful life estimation using CNN-XGB with extended time window. IEEE Access, 7, 154386–154397. https://doi.org/10.1109/ACCESS.2019.2942991.
Zhao, R., Yan, R., Wang, J., & Mao, K. (2017). Learning to monitor machine health with convolutional Bi-directional LSTM networks. Sensors (Switzerland), 17(2), 1–18. https://doi.org/10.3390/s17020273.
Zhao, R., Yan, R., Chen, Z., Mao, K., Wang, P., & Gao, R. X. (2019). Deep learning and its applications to machine health monitoring. Mechanical Systems and Signal Processing, 115, 213–237. https://doi.org/10.1016/j.ymssp.2018.05.050.
Zheng S, Ristovski K, Farahat A, Gupta C (2017) Long short-term memory network for remaining useful life estimation. 2017 IEEE International Conference on Prognostics and Health Management, ICPHM 2017 pp 88–95, https://doi.org/10.1109/ICPHM.2017.7998311
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest in the research work presented.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
A TM3C model parameter configuration for different datasets
Data-specific fine-tuning of TM3C model parameters aids in improvement of prediction performance. Table 6 provides list of key parameter values for C-MAPSS subsets, UCI Hydraulics, and milling machine data.
B An example illustrating the effect of model complexity and number of features on prediction performance
Example below demonstrates the effect of model complexity and feature transformation on the prediction performance of a neural network based model. A bivariate regression problem is defined where the output is a sinusoidal function of the sum of inputs (Fig. 10a, c). An area of the input data range is selected from which 200 samples are randomly picked for training the model. Different model variants are generated by modifying the number of neurons and layers. New features are generated by applying transformation functions on input variables (Fig. 10b). It is observed that, increasing model complexity and number of features help improve model performance (M2-F2 in Fig. 10). However, overly complex models or too many features lead to overfitting (M3-F3 in Fig. 10). (Table 6)
Rights and permissions
About this article
Cite this article
Pillai, S., Vadakkepat, P. Deep learning for machine health prognostics using Kernel-based feature transformation. J Intell Manuf 33, 1665–1680 (2022). https://doi.org/10.1007/s10845-021-01747-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10845-021-01747-6