Unsupervised Deep Learning of Compact Binary Descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence


Unsupervised Deep Learning of Compact Binary Descriptors
IEEE Transactions on Pattern Analysis and Machine Intelligence (IF 20.8), Pub Date: 5-8-2018, DOI: 10.1109/tpami.2018.2833865
Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie Zhou, Ming-Ting Sun

Binary descriptors have been widely used for efficient image matching and retrieval. However, most existing binary descriptors are designed with hand-crafted sampling patterns or learned with label annotations provided by datasets. In this paper, we propose a new unsupervised deep learning approach, called DeepBit, to learn compact binary descriptors for efficient visual object matching. We enforce three criteria on the binary descriptors, which are learned at the top layer of the deep neural network: 1) minimal quantization loss, 2) evenly distributed codes, and 3) transformation-invariant bits. We then estimate the parameters of the network by optimizing the proposed objectives with back-propagation. Extensive experimental results on various visual recognition tasks demonstrate the effectiveness of the proposed approach. We further demonstrate that the proposed approach can be realized on a simplified deep neural network, enabling efficient image matching and retrieval with very competitive accuracy.
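
The abstract names the three criteria but gives no formulas, so the following is only a minimal sketch of how such objectives could be written, not the authors' implementation. It assumes that the top layer outputs activations in [0, 1], that bits are obtained by thresholding at 0.5, and that each criterion is a mean squared error. All function names and the weights `alpha`, `beta`, and `gamma` are hypothetical.

```python
from typing import List, Sequence

Batch = Sequence[Sequence[float]]  # one row of top-layer activations per image


def binarize(activations: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Threshold real-valued activations into bits (threshold is an assumption)."""
    return [1 if a > threshold else 0 for a in activations]


def quantization_loss(batch: Batch) -> float:
    """Criterion 1: mean squared gap between activations and their bits."""
    total, count = 0.0, 0
    for row in batch:
        for a, b in zip(row, binarize(row)):
            total += (a - b) ** 2
            count += 1
    return total / count


def even_distribution_loss(batch: Batch) -> float:
    """Criterion 2: each bit's mean activation over the batch should be near 0.5."""
    n_images, n_bits = len(batch), len(batch[0])
    loss = 0.0
    for j in range(n_bits):
        mean_j = sum(row[j] for row in batch) / n_images
        loss += (mean_j - 0.5) ** 2
    return loss / n_bits


def invariance_loss(batch: Batch, transformed: Batch) -> float:
    """Criterion 3: activations should change little when the image is transformed."""
    total, count = 0.0, 0
    for row, row_t in zip(batch, transformed):
        for a, a_t in zip(row, row_t):
            total += (a - a_t) ** 2
            count += 1
    return total / count


def total_loss(batch: Batch, transformed: Batch,
               alpha: float = 1.0, beta: float = 1.0, gamma: float = 1.0) -> float:
    """Weighted sum of the three terms; the weights are hypothetical."""
    return (alpha * quantization_loss(batch)
            + beta * even_distribution_loss(batch)
            + gamma * invariance_loss(batch, transformed))


def hamming_distance(bits_a: Sequence[int], bits_b: Sequence[int]) -> int:
    """Number of differing bits, the usual matching cost for binary descriptors."""
    return sum(x != y for x, y in zip(bits_a, bits_b))


if __name__ == "__main__":
    original = [[0.9, 0.1, 0.8], [0.2, 0.7, 0.4]]
    rotated = [[0.8, 0.2, 0.9], [0.1, 0.6, 0.5]]  # made-up activations of transformed inputs
    print(total_loss(original, rotated))
    print(hamming_distance(binarize(original[0]), binarize(original[1])))
```

In a real training setup these terms would be differentiable tensor operations minimized by back-propagation; plain Python lists are used here only to keep the sketch self-contained. Once descriptors are binarized, matching reduces to Hamming distance, which is what makes binary codes cheap to compare.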


Updated: 2024-08-22
