Lightweight Oriented Object Detection Using Multiscale Context and Enhanced Channel Attention in Remote Sensing Images,IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

当前位置： X-MOL 学术 › IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Lightweight Oriented Object Detection Using Multiscale Context and Enhanced Channel Attention in Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ( IF 5.5 ) Pub Date : 2021-05-13 , DOI: 10.1109/jstars.2021.3079968
Qiong Ran , Qing Wang , Boya Zhao , Yuanfeng Wu , Shengliang Pu , Zijin Li

Object detection is a focal point in remote sensing applications. Remote sensing images typically contain a large number of small objects and a wide range of orientations across objects. This results in great challenges to small object detection approaches based on remote sensing images. Methods directly employ channel relations with equal weights to construct information features leads to inadequate feature representation in complex image small object detection tasks. Multiscale detection methods improve the speed and accuracy of detection, while small objects themselves contain limited information, and the features are easily lost following down-sampling. During the detection, the feature images are independent across scales, resulting in a discontinuity at the detection scale. In this article, we propose the multiscale context and enhanced channel attention (MSCCA) model. MSCCA employs PeleeNet as the backbone network. In particular, the feature image channel attention is enhanced and the multiscale context information is fused with multiscale detection methods to improve the characterization ability of the convolutional neural network. The proposed MSCCA method is evaluated on two real datasets. Results show that for 512 × 512 input images, MSCCA was able to achieve 80.4% and 94.4% mAP on the DOTA and NWPU VHR-10, respectively. Meanwhile, the model size of MSCCA is 21% smaller than that of its predecessor. MSCCA can be considered as a practical lightweight oriented object detection model in remote sensing images.

中文翻译：

在遥感图像中使用多尺度上下文和增强的通道注意力进行轻量级定向目标检测

物体检测是遥感应用的焦点。遥感图像通常包含大量小物体和跨物体的广泛方向。这给基于遥感图像的小物体检测方法带来了巨大挑战。方法直接利用等权重的通道关系来构建信息特征，导致复杂图像小目标检测任务中的特征表示不足。多尺度检测方法提高了检测的速度和准确度，而小物体本身包含的信息有限，下采样后特征容易丢失。在检测过程中，特征图像跨尺度独立，导致检测尺度不连续。在本文中，我们提出了多尺度上下文和增强通道注意力（MSCCA）模型。MSCCA 采用 PeleeNet 作为骨干网络。特别是增强了特征图像通道attention，将多尺度上下文信息与多尺度检测方法融合，提高了卷积神经网络的表征能力。建议的 MSCCA 方法在两个真实数据集上进行评估。结果表明，对于 512 × 512 的输入图像，MSCCA 在 DOTA 和 NWPU VHR-10 上分别能够实现 80.4% 和 94.4% 的 mAP。同时，MSCCA 的模型尺寸比其前身小 21%。MSCCA 可以被认为是遥感图像中一种实用的面向轻量级的物体检测模型。增强特征图像通道注意力，将多尺度上下文信息与多尺度检测方法融合，提高卷积神经网络的表征能力。建议的 MSCCA 方法在两个真实数据集上进行评估。结果表明，对于 512 × 512 的输入图像，MSCCA 在 DOTA 和 NWPU VHR-10 上分别能够实现 80.4% 和 94.4% 的 mAP。同时，MSCCA 的模型尺寸比其前身小 21%。MSCCA 可以被认为是遥感图像中一种实用的面向轻量级的物体检测模型。增强特征图像通道注意力，将多尺度上下文信息与多尺度检测方法融合，提高卷积神经网络的表征能力。建议的 MSCCA 方法在两个真实数据集上进行评估。结果表明，对于 512 × 512 的输入图像，MSCCA 在 DOTA 和 NWPU VHR-10 上分别能够实现 80.4% 和 94.4% 的 mAP。同时，MSCCA 的模型尺寸比其前身小 21%。MSCCA 可以被认为是遥感图像中一种实用的面向轻量级的物体检测模型。结果表明，对于 512 × 512 的输入图像，MSCCA 在 DOTA 和 NWPU VHR-10 上分别能够实现 80.4% 和 94.4% 的 mAP。同时，MSCCA 的模型尺寸比其前身小 21%。MSCCA 可以被认为是遥感图像中一种实用的面向轻量级的物体检测模型。结果表明，对于 512 × 512 的输入图像，MSCCA 在 DOTA 和 NWPU VHR-10 上分别能够实现 80.4% 和 94.4% 的 mAP。同时，MSCCA 的模型尺寸比其前身小 21%。MSCCA 可以被认为是遥感图像中一种实用的面向轻量级的物体检测模型。

更新日期：2021-06-18

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>