Mirror Detection With the Visual Chirality Cue,IEEE Transactions on Pattern Analysis and Machine Intelligence

当前位置： X-MOL 学术 › IEEE Trans. Pattern Anal. Mach. Intell. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Mirror Detection With the Visual Chirality Cue
IEEE Transactions on Pattern Analysis and Machine Intelligence ( IF 20.8 ) Pub Date : 6-10-2022 , DOI: 10.1109/tpami.2022.3181030
Xin Tan ₁ , Jiaying Lin ₂ , Ke Xu ₂ , Pan Chen ₃ , Lizhuang Ma ₁ , Rynson W.H. Lau ₂

Affiliation

Mirror detection is challenging because the visual appearances of mirrors change depending on those of their surroundings. As existing mirror detection methods are mainly based on extracting contextual contrast and relational similarity between mirror and non-mirror regions, they may fail to identify a mirror region if these assumptions are violated. Inspired by a recent study of applying a CNN to help distinguish whether an image is flipped or not based on the visual chirality property, in this paper, we rethink this image-level visual chirality property and reformulate it as a learnable pixel level cue for mirror detection. Specifically, we first propose a novel flipping-convolution-flipping (FCF) transformation to model visual chirality as learnable commutative residual. We then propose a novel visual chirality embedding (VCE) module to exploit this commutative residual in multi-scale feature maps, to embed the visual chirality features into our mirror detection model. Besides, we also propose a visual chirality-guided edge detection (CED) module to integrate the visual chirality features with contextual features for detection refinement. Extensive experiments show that the proposed method outperforms state-of-the-art methods on three benchmark datasets.

中文翻译：

使用视觉手性线索进行镜像检测

镜子检测具有挑战性，因为镜子的视觉外观会根据周围环境的变化而变化。由于现有的镜像检测方法主要基于提取镜像和非镜像区域之间的上下文对比度和关系相似性，因此如果违反这些假设，它们可能无法识别镜像区域。受到最近一项应用 CNN 根据视觉手性属性来帮助区分图像是否翻转的研究的启发，在本文中，我们重新思考了这种图像级视觉手性属性，并将其重新表述为镜像的可学习像素级线索检测。具体来说，我们首先提出了一种新颖的翻转卷积翻转（FCF）变换，将视觉手性建模为可学习的交换残差。然后，我们提出了一种新颖的视觉手性嵌入（VCE）模块，以利用多尺度特征图中的交换残差，将视觉手性特征嵌入到我们的镜像检测模型中。此外，我们还提出了一种视觉手性引导边缘检测（CED）模块，将视觉手性特征与上下文特征相结合以进行检测细化。大量实验表明，所提出的方法在三个基准数据集上优于最先进的方法。

更新日期：2024-08-28

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11