A new graph-based semi-supervised method for surface defect classification

doi:10.1016/j.rcim.2020.102083

Robotics and Computer-Integrated Manufacturing

Volume 68, April 2021, 102083

https://doi.org/10.1016/j.rcim.2020.102083

Highlights

• Semi-supervised learning is more suitable for surface defect classification.
• Poor class separation causes semi-supervised algorithms to perform poorly.
• A graph-based semi-supervised method is proposed to improve class separation.
• Multiple micrographs replace one large graph when performing graph convolution.

Abstract

Vision-based defect classification is an important technology for controlling product quality in manufacturing systems. Because it is very hard to obtain enough labeled samples for model training in real-world production, semi-supervised learning, which learns from both labeled and unlabeled samples, is more suitable for this task. However, the intra-class variations and inter-class similarities of surface defects, referred to here as poor class separation, may cause semi-supervised methods to perform poorly when few labeled samples are available. Graph-based methods, such as the graph convolutional network (GCN), can address this problem well. Therefore, this paper proposes a new graph-based semi-supervised method, named multiple micrographs graph convolutional network (MMGCN), for surface defect classification. Firstly, MMGCN performs graph convolution on multiple micrographs instead of one large graph, and labels unlabeled samples by propagating label information from labeled samples to unlabeled samples within the micrographs to obtain multiple labels. The final label is obtained by weighting these labels, which overcomes the computation complexity and practicality limitations of the original GCN. Secondly, MMGCN divides the unlabeled dataset into multiple batches and sets an accuracy threshold. When the model accuracy reaches the threshold, the unlabeled data are labeled batch by batch. A well-known benchmark case is used to evaluate the performance of the proposed method. The experimental results demonstrate that the proposed MMGCN achieves better computation complexity and practicality than GCN. In terms of accuracy, MMGCN also obtains the best performance and the best class separation in comparison with other semi-supervised surface defect classification methods.

Introduction

Surface defects are a common problem in industrial production [1], and their causes are various [2]. Correctly classifying surface defects helps identify machine failures and prevent further losses [3]. However, manual inspection is time-consuming and labor-intensive, so automatic surface inspection (ASI) has become a research hotspot [4]. An ASI system consists of a data collection device and a data classification system [5]. With the rise of Big Data [6], collecting defect samples has become convenient. Therefore, improving the accuracy of the defect classification system is becoming more and more important.

With the improvement of computing power and the convenience of data acquisition, more and more deep learning methods have been applied to surface defect classification [7], [8], [9], [10], [11], [12], [13]. Zhou et al. used a convolutional neural network (CNN) to classify surface defects [7]. Ren et al. proposed a transfer method for steel surface defect classification [8]. Most of these methods are based on supervised learning. Since labeling defect samples requires specialized knowledge, it is very difficult to obtain a large number of labeled samples. Semi-supervised learning, which trains a model with both labeled and unlabeled samples, therefore provides a good way to solve this problem. Gao et al. [14] proposed the pseudo-label CNN, which uses pseudo-labels to train the model for semi-supervised steel surface defect classification. He et al. [15] trained two classifiers based on different learning strategies to use labeled and unlabeled samples for semi-supervised learning. Di et al. [16] trained a convolutional autoencoder on unlabeled samples as a feature extractor to form a new classifier for defect classification. However, some defect classes are difficult to classify because of the intra-class variations and inter-class similarities of surface defects [17]. For convenience, this situation is called poor class separation in this paper. Poor class separation can degrade the performance of semi-supervised methods, and training with few labeled samples further exacerbates the problem [17]. Therefore, improving class separation for semi-supervised methods is important.

The graph convolutional network (GCN), a recently developed method that is widely used in semi-supervised learning, can solve this problem [18]. GCN constructs a graph in which images are nodes and their relationships are edges, and propagates feature information between connected nodes. As a result, in feature space, connected nodes are closer together and disconnected nodes are further apart. This is why GCN can improve class separation. Kipf and Welling [19] proposed the concept of GCN and applied it to semi-supervised learning. Li et al. [20] proposed dimension-wise separable graph convolution, which can reduce the intra-class variance of node features. Li et al. [21] proposed a feature-fusing graph neural network to address the intercommunication of an image with other images. Li et al. [22] proposed a graph-like attention network, which uses global features and local information to extract discriminative features. Sun et al. [23] proposed learning the graph structure to address large intra-class variability and high inter-class ambiguity. GCN methods perform well at improving class separation, but it is hard to apply GCN directly to semi-supervised surface defect classification because of its computation complexity. In terms of space complexity, constructing a graph from all samples occupies too much memory. In terms of time complexity, the constructed graph is too large for training on large-scale datasets, so training takes longer.
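The smoothing effect described above can be illustrated with a minimal sketch, assuming the symmetrically normalized propagation step commonly used with GCN and no learned weights. The toy graph, the feature values, and the helper names `normalized_adjacency` and `pair_distance` are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def normalized_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2 for a symmetric adjacency matrix A."""
    a_tilde = adj + np.eye(adj.shape[0])      # add self-loops
    deg = a_tilde.sum(axis=1)                 # node degrees including self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def pair_distance(x, i, j):
    """Euclidean distance between the feature vectors of nodes i and j."""
    return float(np.linalg.norm(x[i] - x[j]))

# Hypothetical toy graph: nodes 0-1 are connected, nodes 2-3 are connected.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 0, 0],
                [0, 0, 0, 1],
                [0, 0, 1, 0]], dtype=float)

# Hypothetical 2-D features; node 2 starts out close to node 0.
features = np.array([[0.0, 0.0],
                     [2.0, 0.0],
                     [1.0, 0.0],
                     [5.0, 0.0]])

# One propagation step: each node averages over itself and its neighbors.
smoothed = normalized_adjacency(adj) @ features

print("connected pair (0, 1):   ", pair_distance(features, 0, 1),
      "->", pair_distance(smoothed, 0, 1))
print("disconnected pair (0, 2):", pair_distance(features, 0, 2),
      "->", pair_distance(smoothed, 0, 2))
```

In this toy case one propagation step pulls each connected pair onto a common point, while nodes 0 and 2, which belong to different components, end up further apart than they started. The second effect depends on these particular feature values; in general the step only averages features over neighbors.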

To solve these problems, this paper proposes the multiple micrographs graph convolutional network (MMGCN) for semi-supervised surface defect classification. The motivation is to apply GCN to semi-supervised surface defect classification in order to improve class separation, with some improvements that adapt GCN to the task. Firstly, MMGCN embeds images as nodes and their relationships as edges to construct micrographs, and performs graph convolution to improve class separation. Secondly, MMGCN constructs multiple micrographs instead of one large graph to reduce memory usage and training time. Thirdly, to make full use of the unlabeled dataset, MMGCN divides it into multiple batches and trains the model until it reaches a threshold, then labels the unlabeled data batch by batch and adds them to the labeled dataset.

The main contributions are summarized in the following three points. Firstly, a new graph-based method, named MMGCN, is proposed to reduce memory usage and training time by constructing multiple micrographs instead of one large-scale graph. Secondly, the proposed method is applied to semi-supervised surface defect classification to improve the class separation of surface defects. Thirdly, the proposed method is applied to a surface defect dataset, where it is verified to improve class separation and to achieve the best performance compared with other semi-supervised surface defect classification methods.

To evaluate the performance of MMGCN for semi-supervised surface defect classification, the proposed method is tested on a well-known benchmark dataset, NEU-CLS. Firstly, the influence of micrograph size is studied with different micrograph sizes and different numbers of labeled samples, to find how the size of the micrographs affects the performance of MMGCN under different numbers of labeled samples. Secondly, the effectiveness of the improvements in MMGCN is examined, covering the performance of the graph-based method in class separation and the improvement of MMGCN over GCN in computation complexity and practicality. Thirdly, MMGCN is compared with other semi-supervised deep learning methods, to find whether MMGCN achieves better class separation and overall accuracy than other well-performing methods.
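The batch-wise labeling strategy can be sketched roughly as follows. This is a sketch under stated assumptions, not the authors' implementation: a nearest-centroid classifier stands in for the MMGCN model, the accuracy is measured on a small held-out labeled split (the text does not say on which data the threshold is checked), and the function names, the threshold value of 0.9, and the synthetic two-class data are all hypothetical.

```python
import numpy as np

def fit_centroids(x, y, n_classes):
    """Stand-in 'model' (assumption): one mean feature vector per class."""
    return np.stack([x[y == c].mean(axis=0) for c in range(n_classes)])

def predict(centroids, x):
    """Assign each sample to the class of its nearest centroid."""
    dists = np.linalg.norm(x[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)

def label_in_batches(x_lab, y_lab, x_val, y_val, batches, n_classes, threshold=0.9):
    """Add pseudo-labeled batches to the labeled set while accuracy >= threshold."""
    for batch in batches:
        centroids = fit_centroids(x_lab, y_lab, n_classes)
        accuracy = (predict(centroids, x_val) == y_val).mean()
        if accuracy < threshold:
            break  # a trainable model would keep training here instead of stopping
        pseudo_labels = predict(centroids, batch)
        x_lab = np.concatenate([x_lab, batch])
        y_lab = np.concatenate([y_lab, pseudo_labels])
    return x_lab, y_lab

def make_blobs(rng, n_per_class):
    """Synthetic two-class data (assumption): two well-separated Gaussian blobs."""
    x0 = rng.normal(0.0, 0.5, size=(n_per_class, 2))
    x1 = rng.normal(5.0, 0.5, size=(n_per_class, 2))
    y = np.array([0] * n_per_class + [1] * n_per_class)
    return np.concatenate([x0, x1]), y

rng = np.random.default_rng(0)
x_lab, y_lab = make_blobs(rng, 2)        # 4 labeled samples
x_val, y_val = make_blobs(rng, 10)       # 20 held-out labeled samples
x_unl, _ = make_blobs(rng, 30)           # 60 unlabeled samples
batches = np.array_split(x_unl, 3)       # three unlabeled batches

x_all, y_all = label_in_batches(x_lab, y_lab, x_val, y_val, batches, n_classes=2)
print("labeled set grew from", len(x_lab), "to", len(x_all), "samples")
```

The threshold guards against adding low-quality pseudo-labels early: a batch is only labeled once the current model is accurate enough on data whose labels are known.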

The rest of this paper is organized as follows. Section 2 illustrates the background research about GCN. Section 3 introduces the structure of the proposed method. Section 4 presents the experimental results and discussion. Section 5 shows the conclusion and future work.

Background research

The graph convolutional network (GCN), proposed by Kipf and Welling [19], is a form of Laplacian smoothing [24]. GCN convolves a graph consisting of labeled and unlabeled nodes, and then propagates labels from the labeled nodes to the unlabeled nodes. In this procedure, GCN needs to propagate label information and feature information between connected nodes, which makes connected nodes similar and disconnected nodes dissimilar.
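For reference, the layer-wise propagation rule in the formulation of Kipf and Welling [19] can be written as below. The notation is the one commonly used for that formulation and is an assumption here; the symbols used in the full text of this paper may differ.

```latex
\tilde{A} = A + I_N, \qquad
\tilde{D}_{ii} = \sum_j \tilde{A}_{ij}, \qquad
\hat{A} = \tilde{D}^{-1/2}\,\tilde{A}\,\tilde{D}^{-1/2}

H^{(l+1)} = \sigma\!\left(\hat{A}\, H^{(l)}\, W^{(l)}\right), \qquad H^{(0)} = X

Z = \operatorname{softmax}\!\left(\hat{A}\,\operatorname{ReLU}\!\left(\hat{A}\, X\, W^{(0)}\right) W^{(1)}\right)
```

Here $A$ is the adjacency matrix of the graph with $N$ nodes, $X$ holds the node features, $W^{(l)}$ are trainable weights, and $\sigma$ is an activation function. Multiplying by $\hat{A}$ replaces each node's features with a degree-weighted average over the node and its neighbors, which is the smoothing step; the last line is the two-layer model used for semi-supervised node classification.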

The procedure of GCN can be divided into two layers, including Aggregate

Proposed MMGCN for semi-supervised surface defect classification

In this paper, a new graph-based method, named MMGCN, is proposed and applied to semi-supervised surface defect classification.

The purpose of MMGCN is to construct multiple micrographs from images to improve the adaptability of the original GCN. It includes three modules: Sample, Embedding, and Micrographs construction. MMGCN randomly samples from the dataset in the Sample module, embeds the samples as nodes in the Embedding module, and constructs micrographs from the nodes in the Micrographs construction module. Finally, graph
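The three modules named above can be sketched as follows. The snippet does not specify how images are embedded or how edges are defined, so this sketch assumes a fixed random linear projection in place of a learned embedding and a symmetric k-nearest-neighbor rule for the edges; the function names, image size, micrograph size, and number of micrographs are hypothetical.

```python
import numpy as np

def sample_indices(rng, n_samples, micrograph_size, n_micrographs):
    """Sample module: draw a random subset of sample indices for each micrograph."""
    return [rng.choice(n_samples, size=micrograph_size, replace=False)
            for _ in range(n_micrographs)]

def embed(images, projection):
    """Embedding module stand-in (assumption): flatten each image, then project."""
    return images.reshape(len(images), -1) @ projection

def knn_adjacency(nodes, k):
    """Micrographs construction stand-in (assumption): symmetric k-NN graph."""
    dists = np.linalg.norm(nodes[:, None, :] - nodes[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)                 # never pick a node as its own neighbor
    neighbors = np.argsort(dists, axis=1)[:, :k]    # k closest nodes per row
    adj = np.zeros_like(dists)
    rows = np.repeat(np.arange(len(nodes)), k)
    adj[rows, neighbors.ravel()] = 1.0
    return np.maximum(adj, adj.T)                   # make edges undirected

rng = np.random.default_rng(0)
images = rng.random((200, 8, 8))                    # hypothetical 8x8 grayscale images
projection = rng.normal(size=(64, 16))              # hypothetical embedding weights

subsets = sample_indices(rng, n_samples=200, micrograph_size=20, n_micrographs=5)
micrographs = [knn_adjacency(embed(images[idx], projection), k=3) for idx in subsets]

# Dense adjacency entries: one graph over all samples vs. all micrographs together.
large_entries = len(images) ** 2
micro_entries = sum(a.size for a in micrographs)
print("one large graph:", large_entries, "entries; micrographs:", micro_entries, "entries")
```

The entry counts show why small graphs help with memory: a dense adjacency matrix grows quadratically with the number of nodes, so several small graphs need far fewer entries than one graph over all samples. With these hypothetical settings the random subsets do not cover every sample; how the paper chooses the number and size of micrographs is not stated in this snippet.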

Experimental results and discussion

In this section, the proposed method is tested on a well-known benchmark dataset [30] to evaluate the improvements of MMGCN over GCN and the performance of MMGCN compared with other well-performing deep learning methods in semi-supervised surface defect classification.

Conclusion and future work

This paper proposes a new graph-based semi-supervised method, named MMGCN, for surface defect classification. The proposed method constructs multiple micrographs instead of one large graph, and the micrographs are then convolved to improve class separation. Firstly, when applying graph-based methods to semi-supervised surface defect classification, using multiple micrographs is more suitable than using one large graph because it requires less computation time and memory. And using the

Author statement

Yucheng Wang: Writing-Original draft preparation, Methodology, Software. Liang Gao: Conceptualization, Supervision. Yiping Gao: Visualization, Investigation. Xinyu Li: Conceptualization, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work was supported by the National Key R&D Program of China [Grant Number 2018AAA0101700], National Natural Science Foundation of China [Grant Number 51775216], and the Program for HUST Academic Frontier Youth Team [Grant Number 2017QYTD04].

References (36)

P. Muñoz-Escalona et al. Influence of cutting environments on surface integrity and power consumption of austenitic stainless steel. Robot. Comput. Integr. Manuf. (2015)
I. Vilček et al. Residual stresses evaluation in precision milling of hardened steel based on the deflection-electrochemical etching technique. Robot. Comput. Integr. Manuf. (2017)
Q. Luo et al. A cost-effective and automatic surface defect inspection system for hot-rolled flat steel. Robot. Comput. Integr. Manuf. (2016)
S. Zhou et al. Classification of surface defects on steel sheet using convolutional neural networks. Mater. Tehnol. (2017)
Y. Gao et al. A multi-level information fusion-based deep learning method for vision-based defect recognition. IEEE Trans. Instrum. Meas. (2019)
A. Bustillo et al. Using artificial intelligence models for the prediction of surface wear based on surface isotropy levels. Robot. Comput. Integr. Manuf. (2018)
Y. Gao et al. A semi-supervised convolutional neural network-based method for steel surface defect recognition. Robot. Comput. Integr. Manuf. (2020)
Y. He et al. Semi-supervised defect classification of steel surface based on multi-training and generative adversarial network. Opt. Lasers Eng. (2019)
H. Di et al. Surface defect classification of steels with a new semi-supervised learning method. Opt. Lasers Eng. (2019)
D. He et al. Design of multi-scale receptive field convolutional neural network for surface inspection of hot rolled steels. Image Vis. Comput. (2019)
K. Song et al. A noise robust method based on completed local binary patterns for hot-rolled steel strip surface defects. Appl. Surf. Sci. (2013)
S. Ghorai et al. Automatic defect detection on hot-rolled flat steel products. IEEE Trans. Instrum. Meas. (2013)
J. Gan et al. Online rail surface inspection utilizing spatial consistency and continuity. IEEE Trans. Syst. Man, Cybern. Syst. (2018)
Y. Gao et al. A zero-shot learning method for fault diagnosis under unknown working loads. J. Intell. Manuf. (2019)
R. Ren et al. A generic deep-learning-based approach for automated surface inspection. IEEE Trans. Cybern. (2018)
J. Masci et al. Steel defect classification with max-pooling convolutional neural networks.
S. Mei et al. An unsupervised-learning-based approach for automated defect inspection on textured surfaces. IEEE Trans. Instrum. Meas. (2018)
Y. Li et al. Deformable patterned fabric defect detection with fisher criterion-based deep learning. IEEE Trans. Autom. Sci. Eng. (2017)
