Dual attention deep learning network for automatic steel surface defect segmentation,Computer-Aided Civil and Infrastructure Engineering

当前位置： X-MOL 学术 › Comput. Aided Civ. Infrastruct. Eng. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Dual attention deep learning network for automatic steel surface defect segmentation
Computer-Aided Civil and Infrastructure Engineering ( IF 8.5 ) Pub Date : 2021-11-19 , DOI: 10.1111/mice.12792
Y. Pan ₁ , L. Zhang ₂

Affiliation

A dual attention deep learning network is developed to classify three types of steel defects, locate their positions, and depict their shapes on the steel surface in an automatic and accurate manner. The novel pixel-level detection algorithm called DAN-DeepLabv3+ integrates a dual attention module into the DeepLabv3+ framework in pursue of more precise segmentation results. For one thing, the dual parallel attention module helps to explicitly model rich contextual dependencies over local feature representations in the spatial and channel dimensions. For another, the popular DeepLabv3+ in an encoder-decoder architecture is useful in capturing multi-scale contextual information and sharp object boundaries. The DAN-DeepLabv3+ is applied to an available dataset containing 6666 images, where three types of steel defects are taken by high-frequency cameras and have been annotated manually. Experimental results show that, compared with other deep learning models, DAN-DeepLabv3+ based on the Xception backbone exhibits the best segmentation performance under the mean intersection over union (IoU) of 89.95% and the frequency-weighted IoU of 97.34%. Besides, the F1-score for the three kinds of defects can reach 86.90%, 99.20%, and 92.81%. From the comparative study, it has been found that the adoption of the dual attention module and DeepLabv3+ contributes to boosting the segmentation performance. The significance of the proposed hybrid model lies in the enhancement in accurately detecting single or multiple steel defects, which has proven to outperform other classical methods.

中文翻译：

用于自动钢表面缺陷分割的双注意力深度学习网络

开发了一种双注意力深度学习网络，以自动和准确的方式对三种类型的钢缺陷进行分类、定位它们的位置并在钢表面上描绘它们的形状。新的像素级检测算法 DAN-DeepLabv3+ 将双重注意力模块集成到 DeepLabv3+ 框架中，以追求更精确的分割结果。一方面，双并行注意模块有助于显式地对空间和通道维度中局部特征表示的丰富上下文依赖关系进行建模。另一方面，编码器-解码器架构中流行的 DeepLabv3+ 可用于捕获多尺度上下文信息和清晰的对象边界。DAN-DeepLabv3+ 应用于包含 6666 张图像的可用数据集，其中三种类型的钢缺陷由高频相机拍摄并已手动注释。实验结果表明，与其他深度学习模型相比，基于 Xception 主干的 DAN-DeepLabv3+ 在 89.95% 的平均交叉联合（IoU）和 97.34% 的频率加权 IoU 下表现出最佳的分割性能。此外，三种缺陷的F1-score分别可以达到86.90%、99.20%和92.81%。从比较研究中发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。实验结果表明，与其他深度学习模型相比，基于 Xception 主干的 DAN-DeepLabv3+ 在 89.95% 的平均交叉联合（IoU）和 97.34% 的频率加权 IoU 下表现出最佳的分割性能。此外，三种缺陷的F1-score分别可以达到86.90%、99.20%和92.81%。从比较研究中发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。实验结果表明，与其他深度学习模型相比，基于 Xception 主干的 DAN-DeepLabv3+ 在 89.95% 的平均交叉联合（IoU）和 97.34% 的频率加权 IoU 下表现出最佳的分割性能。此外，三种缺陷的F1-score分别可以达到86.90%、99.20%和92.81%。从比较研究中发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。基于 Xception 主干的 DAN-DeepLabv3+ 在 89.95% 的平均交叉联合 (IoU) 和 97.34% 的频率加权 IoU 下表现出最佳分割性能。此外，三种缺陷的F1-score分别可以达到86.90%、99.20%和92.81%。从比较研究中发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。基于 Xception 主干的 DAN-DeepLabv3+ 在 89.95% 的平均交叉联合 (IoU) 和 97.34% 的频率加权 IoU 下表现出最佳分割性能。此外，三种缺陷的F1-score分别可以达到86.90%、99.20%和92.81%。从比较研究中发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。已经发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。已经发现，采用双注意力模块和 DeepLabv3+ 有助于提高分割性能。所提出的混合模型的意义在于提高了准确检测单个或多个钢缺陷的能力，这已被证明优于其他经典方法。

更新日期：2021-11-19

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11