PBNet: Part-based convolutional neural network for complex composite object detection in remote sensing imagery,ISPRS Journal of Photogrammetry and Remote Sensing

当前位置： X-MOL 学术 › ISPRS J. Photogramm. Remote Sens. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

PBNet: Part-based convolutional neural network for complex composite object detection in remote sensing imagery
ISPRS Journal of Photogrammetry and Remote Sensing ( IF 12.7 ) Pub Date : 2021-01-16 , DOI: 10.1016/j.isprsjprs.2020.12.015
Xian Sun , Peijin Wang , Cheng Wang , Yingfei Liu , Kun Fu

In recent years, deep learning-based algorithms have brought great improvements to rigid object detection. In addition to rigid objects, remote sensing images also contain many complex composite objects, such as sewage treatment plants, golf courses, and airports, which have neither a fixed shape nor a fixed size. In this paper, we validate through experiments that the results of existing methods in detecting composite objects are not satisfying enough. Therefore, we propose a unified part-based convolutional neural network (PBNet), which is specifically designed for composite object detection in remote sensing imagery. PBNet treats a composite object as a group of parts and incorporates part information into context information to improve composite object detection. Correct part information can guide the prediction of a composite object, thus solving the problems caused by various shapes and sizes. To generate accurate part information, we design a part localization module to learn the classification and localization of part points using bounding box annotation only. A context refinement module is designed to generate more discriminative features by aggregating local context information and global context information, which enhances the learning of part information and improve the ability of feature representation. We selected three typical categories of composite objects from a public dataset to conduct experiments to verify the detection performance and generalization ability of our method. Meanwhile, we build a more challenging dataset about a typical kind of complex composite objects, i.e., sewage treatment plants. It refers to the relevant information from authorities and experts. This dataset contains sewage treatment plants in seven cities in the Yangtze valley, covering a wide range of regions. Comprehensive experiments on two datasets show that PBNet surpasses the existing detection algorithms and achieves state-of-the-art accuracy.

中文翻译：

PBNet：基于零件的卷积神经网络，用于遥感影像中的复杂复合物检测

近年来，基于深度学习的算法为刚性物体检测带来了巨大的进步。除刚性物体外，遥感影像还包含许多复杂的合成物体，例如污水处理厂，高尔夫球场和机场，它们既没有固定的形状，也没有固定的尺寸。在本文中，我们通过实验验证了现有方法在检测复合物体方面的结果不够令人满意。因此，我们提出了一种基于零件的统一卷积神经网络（PBNet），该网络是专门为遥感影像中的复合目标检测而设计的。PBNet将组合对象视为一组零件，并将零件信息合并到上下文信息中以改善组合对象检测。正确的零件信息可以指导复合对象的预测，从而解决了各种形状和尺寸引起的问题。为了生成准确的零件信息，我们设计了零件定位模块，以仅使用边界框注释来学习零件点的分类和定位。上下文优化模块设计为通过聚合局部上下文信息和全局上下文信息来生成更多区分特征，从而增强零件信息的学习并提高特征表示的能力。我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，我们设计了零件定位模块，仅使用边界框注释来学习零件点的分类和定位。上下文优化模块设计为通过聚合局部上下文信息和全局上下文信息来生成更多区分特征，从而增强零件信息的学习并提高特征表示的能力。我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，我们设计了零件定位模块，仅使用边界框注释来学习零件点的分类和定位。上下文优化模块设计为通过聚合局部上下文信息和全局上下文信息来生成更多区分特征，从而增强零件信息的学习并提高特征表示的能力。我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，上下文优化模块设计为通过聚合局部上下文信息和全局上下文信息来生成更多区分特征，从而增强零件信息的学习并提高特征表示的能力。我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，上下文优化模块设计为通过聚合局部上下文信息和全局上下文信息来生成更多区分特征，从而增强零件信息的学习并提高特征表示的能力。我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，我们从公共数据集中选择了三种典型的复合对象类别进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，我们从公共数据集中选择了三种典型的复合对象类别来进行实验，以验证该方法的检测性能和泛化能力。同时，我们针对典型的复杂复合对象建立了更具挑战性的数据集，即污水处理厂。它引用了来自权威机构和专家的相关信息。该数据集包含长江流域七个城市的污水处理厂，涉及范围广泛。在两个数据集上的综合实验表明，PBNet超越了现有的检测算法，并达到了最新的准确性。

更新日期：2021-01-18

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>