Skip to main content
Log in

A General Framework for Deep Supervised Discrete Hashing

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

With the rapid growth of image and video data on the web, hashing has been extensively studied for image or video search in recent years. Benefiting from recent advances in deep learning, deep hashing methods have shown superior performance over the traditional hashing methods. However, there are some limitations of previous deep hashing methods (e.g., the semantic information is not fully exploited). In this paper, we develop a general deep supervised discrete hashing framework based on the assumption that the learned binary codes should be ideal for classification. Both the similarity information and the classification information are used to learn the hash codes within one stream framework. We constrain the outputs of the last layer to be binary codes directly, which is rarely investigated in deep hashing algorithms. Besides, both the pairwise similarity information and the triplet ranking information are exploited in this paper. In addition, two different loss functions are presented: \({l_2}\) loss and hinge loss, which are carefully designed for the classification term under the one stream framework. Because of the discrete nature of hash codes, an alternating minimization method is used to optimize the objective function. Experimental results have shown that our approach outperforms current state-of-the-art methods on benchmark datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Cao, Y., Long, M., Wang, J., Zhu, H., & Wen, Q. (2016). Deep quantization network for efficient image retrieval. In AAAI conference on artificial intelligence, pp. 3457–3463.

  • Chatfield, K., Simonyan, K., Vedaldi, A., & Zisserman, A. (2014). Return of the devil in the details: Delving deep into convolutional nets. In British machine vision conference

  • Cui, Y., Jiang, J., Lai, Z., Hu, Z., & Wong, W. (2018). Supervised discrete discriminant hashing for image retrieval. Pattern Recognition, 78, 79–90.

    Google Scholar 

  • Gionis, A., Indyk, P., Motwani, R., et al. (1999). Similarity search in high dimensions via hashing. In International conference on very large data bases, Vol. 99, pp. 518–529.

  • Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In IEEE conference on computer vision and pattern recognition, pp 580–587

  • Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2916–2929.

    Google Scholar 

  • Gorisse, D., Cord, M., & Precioso, F. (2012). Locality-sensitive hashing for chi2 distance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(2), 402–409.

    Google Scholar 

  • Gui, J., Liu, T., Sun, Z., Tao, D., & Tan, T. (2016). Supervised discrete hashing with relaxation. IEEE Transactions on Neural Networks and Learning Systems, 29(3), 608–617.

    Google Scholar 

  • He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In IEEE conference on computer vision and pattern recognition, pp .770–778.

  • Hsu, C. W., & Lin, C. J. (2002). A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks, 13(2), 415–425.

    Google Scholar 

  • Ji, J., Li, J., Yan, S., Zhang, B., & Tian, Q. (2012). Super-bit locality-sensitive hashing. In Advances in neural information processing systems, pp. 108–116.

  • Kang, W. C., Li, W. J., & Zhou, Z. H. (2016). Column sampling based discrete supervised hashing. In AAAI conference on artificial intelligence, pp. 1230–1236.

  • Koutaki, G., Shirai, K., & Ambai, M. (2016). Fast supervised discrete hashing and its analysis. arXiv:1611.10017.

  • Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pp. 1097–1105.

  • Kulis, B., & Darrell, T. (2009). Learning to hash with binary reconstructive embeddings. In Advances in neural information processing systems, pp. 1042–1050.

  • Kulis, B., & Grauman, K. (2009). Kernelized locality-sensitive hashing for scalable image search. In International conference on computer vision, pp. 2130–2137.

  • Kulis, B., Jain, P., & Grauman, K. (2009). Fast similarity search for learned metrics. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(12), 2143–2157.

    Google Scholar 

  • Lai, H., Pan, Y., Liu, Y., & Yan, S. (2015). Simultaneous feature learning and hash coding with deep neural networks. In IEEE conference on computer vision and pattern recognition, pp. 3270–3278.

  • Li, K., Qi, G. J., Ye, J., & Hua, K. A. (2017a). Linear subspace ranking hashing for cross-modal retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(9), 1825–1838.

    Google Scholar 

  • Li, Q., Sun, Z., He, R., & Tan, T. (2017b). Deep supervised discrete hashing. In Advances in neural information processing systems, pp. 2479–2488.

  • Li, Q., Sun, Z., Lin, Z., He, R., & Tan, T. (2016a). Transformation invariant subspace clustering. Pattern Recognition, 59, 142–155.

    Google Scholar 

  • Li, W. J., Wang, S., & Kang, W. C. (2016b). Feature learning based deep supervised hashing with pairwise labels. In International joint conference on artificial intelligence, pp. 1711–1717.

  • Lin, G., Shen, C., Shi, Q., van den Hengel, A., & Suter, D. (2014). Fast supervised hashing with decision trees for high-dimensional data. In IEEE conference on computer vision and pattern recognition, pp. 1963–1970.

  • Lin, K., Yang, H. F., Hsiao, J. H., & Chen, C. S. (2015). Deep learning of binary hash codes for fast image retrieval. In IEEE conference on computer vision and pattern recognition workshops, pp. 27–35.

  • Liong, V. E., Lu, J., Duan, L., & Tan, Y. (2018). Deep variational and structural hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(3), 580–595.

  • Liong, V. E., Lu, J., Tan, Y. P., & Zhou, J. (2017). Cross-modal deep variational hashing. In IEEE conference on computer vision and pattern recognition, pp. 4077–4085.

  • Liu, B., Cao, Y., Long, M., Wang, J., & Wang, J. (2018). Deep triplet quantization. In ACM international conference on multimedia, pp. 755–763.

  • Liu, H., Wang, R., Shan, S., & Chen, X. (2016). Deep supervised hashing for fast image retrieval. In IEEE conference on computer vision and pattern recognition, pp. 2064–2072.

  • Liu, L., Chen, J., Fieguth, P., Zhao, G., Chellappa, R., & Pietikäinen, M. (2019a). From bow to cnn: Two decades of texture representation for texture classification. International Journal of Computer Vision, 127(1), 74–109.

    Google Scholar 

  • Liu, L., Ouyang, W., Wang, X., Fieguth, P., Chen, J., Liu, X., et al. (2020). Deep learning for generic object detection: A survey. International Journal of Computer Vision, 128(2), 261–318.

  • Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In IEEE conference on computer vision and pattern recognition, pp. 2074–2081.

  • Liu, W., Wang, J., Kumar, S., & Chang, S. F. (2011). Hashing with graphs. In International conference on machine learning, pp. 1–8.

  • Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In IEEE conference on computer vision and pattern recognition, pp. 3431–3440.

  • Mu, Y., & Yan, S. (2010). Non-metric locality-sensitive hashing. In AAAI Conference on Artificial Intelligence, pp. 539–544.

  • Qiu, Z., Pan, Y., Yao, T., & Mei, T. (2017). Deep semantic hashing with generative adversarial networks. In International ACM SIGIR conference on research and development in information retrieval, pp. 225–234.

  • Raginsky, M., & Lazebnik, S. (2009). Locality-sensitive binary codes from shift-invariant kernels. In Advances in neural information processing systems, pp. 1509–1517.

  • Rumelhart, D. E., Hinton, G. E., Williams, R. J., et al. (1988). Learning representations by back-propagating errors. Cognitive Modeling, 5(3), 1.

    MATH  Google Scholar 

  • Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal of Approximate Reasoning, 50(7), 969–978.

    Google Scholar 

  • Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In IEEE conference on computer vision and pattern recognition, pp. 37–45.

  • Shen, F., Xu, Y., Liu, L., Yang, Y., Huang, Z., & Shen, H. T. (2018). Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(12), 3034–3044.

    Google Scholar 

  • Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. In International conference on learning representations

  • Song, J. (2018). Binary generative adversarial networks for image retrieval. In AAAI conference on artificial intelligence, pp. 394–401.

  • Su, S., Zhang, C., Han, K., & Tian, Y. (2018). Greedy hash: towards fast optimization for accurate hash coding in cnn. In Advances in neural information processing systems, pp. 798–807.

  • Tomar, V. S., & Rose, R. C. (2013). Locality sensitive hashing for fast computation of correlational manifold learning based feature space transformations. In Interspeech, pp. 1776–1780.

  • Wang, J., Kumar, S., & Chang, S. F. (2010). Sequential projection learning for hashing with compact codes. In International conference on machine learning, pp. 1127–1134.

  • Wang, J., Wang, J., Yu, N., & Li, S. (2013). Order preserving hashing for approximate nearest neighbor search. In ACM international conference on multimedia, pp. 133–142.

  • Wang, J., & Zhang, T. (2019). Composite quantization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(6), 1308–1322.

    Google Scholar 

  • Wang, J., Zhang, T., Sebe, N., & Shen, H. T. (2018). A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(4), 769–790.

    Google Scholar 

  • Wang, W., Huang, Y., & Wang, L. (2019). Language-driven temporal activity localization: A semantic matching reinforcement learning model. In IEEE conference on computer vision and pattern recognition, pp. 334–343.

  • Wang, X., Shi, Y., & Kitani, K. M. (2016). Deep supervised hashing with triplet labels. In Asian conference on computer vision, pp. 70–84.

  • Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems, pp. 1753–1760.

  • Xia, R., Pan, Y., Lai, H., Liu, C., & Yan, S. (2014). Supervised hashing for image retrieval via image representation learning. In AAAI conference on artificial intelligence, Vol. 1, p. 2.

  • Yang, E., Liu, T., Deng, C., Liu, W., & Tao, D. (2019). Distillhash: Unsupervised deep hashing by distilling data pairs. In IEEE conference on computer vision and pattern recognition, pp. 2946–2955.

  • Yang, H. F., Lin, K., & Chen, C. S. (2018). Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(2), 437–451.

    Google Scholar 

  • Yao, T., Long, F., Mei, T., & Rui, Y. (2016). Deep semantic-preserving and ranking-based hashing for image retrieval. In International joint conference on artificial intelligence, pp .3931–3937.

  • Yelamarthi, S. K., Reddy, S. K., Mishra, A., & Mittal, A. (2018). A zero-shot framework for sketch based image retrieval. In European conference on computer vision, pp. 316–333.

  • Yu, T., Yuan, J., Fang, C., & Jin, H. (2018). Product quantization network for fast image retrieval. In European conference on computer vision, pp. 186–201.

  • Zhang, H., Li, Q., Sun, Z., & Liu, Y. (2018a). Combining data-driven and model-driven methods for robust facial landmark detection. IEEE Transactions on Information Forensics and Security, 13(10), 2409–2422.

    Google Scholar 

  • Zhang, J., & Peng, Y. (2017). Ssdh: semi-supervised deep hashing for large scale image retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 29(1), 212–225.

    MathSciNet  Google Scholar 

  • Zhang, J., Peng, Y., & Yuan, M. (2018b). Unsupervised generative adversarial cross-modal hashing. In AAAI conference on artificial intelligence, pp. 539–546.

  • Zhang, P., Zhang. W., Li, WJ., & Guo, M. (2014) Supervised hashing with latent factor models. In International ACM SIGIR conference on research and development in information retrieval, pp. 173–182.

  • Zhang, R., Lin, L., Zhang, R., Zuo, W., & Zhang, L. (2015). Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Transactions on Image Processing, 24(12), 4766–4779.

    MathSciNet  MATH  Google Scholar 

  • Zhang, Z., Chen, Y., & Saligrama, V. (2016). Efficient training of very deep neural networks for supervised hashing. In IEEE conference on computer vision and pattern recognition, pp. 1487–1495.

  • Zhao, F., Huang, Y., Wang, L., & Tan, T. (2015). Deep semantic ranking based hashing for multi-label image retrieval. In IEEE conference on computer vision and pattern recognition, pp. 1556–1564.

  • Zhao, X., Ding, G., Guo, Y., Han, J., & Gao, Y. (2017). Tuch: Turning cross-view hashing into single-view hashing via generative adversarial nets. In International joint conferences on artificial intelligence, pp. 3511–3517.

  • Zhu, H., Long, M., Wang, J., & Cao, Y. (2016). Deep hashing network for efficient similarity retrieval. In AAAI conference on artificial intelligence, pp. 2415–2421.

Download references

Acknowledgements

This work was partially supported by the Natural Science Foundation of China (Grant Nos. U1836217, 61702513, 61721004, and 61427811). This work was also partially supported by CAS-AIR and Shandong Provincial Key Research and Development Program (Major Scientific and Technological Innovation Project ) (No. 2019JZZY010119).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenan Sun.

Additional information

Communicated by Li Liu, Matti Pietikäinen, Jie Qin, Jie Chen, Wanli Ouyang, Luc Van Gool.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Q., Sun, Z., He, R. et al. A General Framework for Deep Supervised Discrete Hashing. Int J Comput Vis 128, 2204–2222 (2020). https://doi.org/10.1007/s11263-020-01327-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11263-020-01327-w

Keywords

Navigation