Content-Aware Unsupervised Deep Homography Estimation and its Extensions,IEEE Transactions on Pattern Analysis and Machine Intelligence

当前位置： X-MOL 学术 › IEEE Trans. Pattern Anal. Mach. Intell. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Content-Aware Unsupervised Deep Homography Estimation and its Extensions
IEEE Transactions on Pattern Analysis and Machine Intelligence ( IF 23.6 ) Pub Date : 2022-05-10 , DOI: 10.1109/tpami.2022.3174130
Shuaicheng Liu ₁ , Nianjin Ye ₂ , Chuan Wang ₃ , Kunming Luo ₄ , Jue Wang ₅ , Jian Sun ₆

Affiliation

Homography estimation is a basic image alignment method in many applications. It is usually done by extracting and matching sparse feature points, which are error-prone in low-light and low-texture images. On the other hand, previous deep homography approaches use either synthetic images for supervised learning or aerial images for unsupervised learning, both ignoring the importance of handling depth disparities and moving objects in real-world applications. To overcome these problems, in this work, we propose an unsupervised deep homography method with a new architecture design. In the spirit of the RANSAC procedure in traditional methods, we specifically learn an outlier mask to only select reliable regions for homography estimation. We calculate loss with respect to our learned deep features instead of directly comparing image content as did previously. To achieve the unsupervised training, we also formulate a novel triplet loss customized for our network. We verify our method by conducting comprehensive comparisons on a new dataset that covers a wide range of scenes with varying degrees of difficulties for the task. Experimental results reveal that our method outperforms the state-of-the-art, including deep solutions and feature-based solutions.

中文翻译：

内容感知无监督深度单应性估计及其扩展

单应性估计是许多应用中的基本图像对齐方法。它通常通过提取和匹配稀疏特征点来完成，这些特征点在低光和低纹理图像中容易出错。另一方面，以前的深度单应性方法要么使用合成图像进行监督学习，要么使用航拍图像进行无监督学习，两者都忽略了在实际应用中处理深度差异和移动物体的重要性。为了克服这些问题，在这项工作中，我们提出了一种具有新架构设计的无监督深度单应性方法。本着传统方法中 RANSAC 过程的精神，我们专门学习了一个离群值掩码来只选择可靠的区域进行单应性估计。我们根据学习到的深度特征计算损失，而不是像以前那样直接比较图像内容。为了实现无监督训练，我们还制定了为我们的网络定制的新型三元组损失。我们通过对一个新数据集进行全面比较来验证我们的方法，该数据集涵盖了任务难度不同的各种场景。实验结果表明，我们的方法优于最先进的方法，包括深度解决方案和基于特征的解决方案。

更新日期：2022-05-10

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>