Gesture recognition using deep-learning in single-pixel-imaging with high-frame-rate display with latent random dot patterns

Takatsuka, Hiroki; Yasugi, Masaki; Suyama, Shiro; Yamamoto, Hirotsugu

doi:10.1007/s10043-023-00848-2

Gesture recognition using deep-learning in single-pixel-imaging with high-frame-rate display with latent random dot patterns

Special Section: Regular Paper
Laser Display and Lighting Conference (LDC’ 23), Yokohama, Japan
Published: 11 December 2023

Volume 31, pages 116–125, (2024)
Cite this article

Optical Review Aims and scope Submit manuscript

96 Accesses
Explore all metrics

Abstract

Gesture recognition using cameras capable of capturing detailed images for gesture recognition is not feasible in many places due to concerns regarding privacy and information leakage. To address this problem, we have proposed a method of capturing shadow pictures using single-pixel-imaging to realize privacy-conscious gesture recognition. As an implementation method of single-pixel-imaging in public spaces, we have studied using a high-frame-rate LED display as a light source. By using a high-frame-rate LED display, random patterns can be latent while the observer perceives an apparent image. However, the image reconstructed by single-pixel-imaging using a high-frame-rate LED display is influenced by the apparent image, making gesture recognition difficult. In this study, we show that the influence of the apparent image can be removed by restoring the restored image using deep learning with a convolutional network called U-Net, and high classification accuracy with a small number of illuminations by using LeNet to classify restored images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

A review of object detection based on deep learning

Article 12 June 2020

Convolutional neural network: a review of models, methodologies and applications to object detection

Article 20 December 2019

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Mitra, S., Acharya, T.: Gesture recognition: a survey. IEEE Trans Syst Man Cybern Part C 37(3), 311–324 (2007)
Article Google Scholar
Yasui YM, Alvissalim MS, Takahashi M, Tomiyama Y, Suyama S, Ishikawa M. Floating display screen formed by AIRR (Aerial imaging by retro-reflection) for interaction in 3D space. In: 2014 International Conference on 3D Imaging (IC3D) (IEEE, 2014), pp. 1–5.
Rossol, N., Cheng, I., Basu, A.: A Multisensor technique for gesture recognition through intelligent skeletal pose analysis. IEEE Trans Hum Mach Syst 46, 350–359 (2016)
Article Google Scholar
Nishihori, M., Izumi, T., Nagano, Y., Sato, M., Tsukada, T., Kropp, A.E., Wakabayashi, T.: Development and clinical evaluation of a contactless operating interface for three-dimensional image-guided navigation for endovascular neurosurgery. Int J Comput Assist Radiol Surg 16, 663–671 (2021)
Article PubMed PubMed Central Google Scholar
Dai J, Wu J, Saghafi B, Konrad J, Ishwar P. Towards privacy-preserving activity recognition using extremely low temporal and spatial resolution cameras. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (IEEE, 2015), pp. 68–76.
Wu Z, Wang Z, Wang Z, Jin H. Towards privacy-preserving visual recognition via adversarial training: A pilot study. In: Proceedings of the European Conference on Computer Vision (ECCV) (Springer, 2018), pp. 606–624.
Mukojima, N., Yasugi, M., Mizutani, Y., Yasui, T., Yamamoto, H.: Deep-learning-assisted single-pixel imaging for gesture recognition in consideration of privacy. IEICE Trans Electron E105-C. 2, 79–85 (2022)
Article ADS Google Scholar
Gibson, G.M., Johnson, S.D., Padgett, M.J.: Single-pixel imaging 12 years on: a review. Opt Express 28, 28190–28208 (2020)
Article ADS PubMed Google Scholar
Onose, S., Takahashi, M., Mizutani, Y., Yasui, T., Yamamoto, H.: Single pixel imaging with a high-frame-rate LED digital signage. Proc Int Display Worksh 23, 1495–1498 (2016)
Google Scholar
Mukojima, N., Talatsuka, H., Yasugi, M., Suyama, S., Yamamoto, H.: Reconstruction of gesture images by using banner as illumination of single-pixel imaging. Proc. IDW 29, 1039–1042 (2022)
Google Scholar
Takahashi M, Yamamoto H. Encryption by spatiotemporal scrambling on a high-frame-rate display. In: The 63rd JSAP Spring Meeting, 21a-S224–5. 2016. [in Japanese].
Mukojima N, Yasugi M, Suyama S, Yamamoto H. The possibility of using banner images as the mask pattern of single-pixel imaging. In: 2022 Information Photonics (IP) (OSJ, 2022) IPp-09.
Takatsuka H, Yasugi M, Suyama S, Yamamoto H. Reconstruction performance of U-Net in single-pixel-imaging with random-dot-embedded apparent images. In: The 12th laser display and lighting conference 2023, p. LDC7–05. 2023.
Shibuya, K., Minamikawa, T., Mizutani, Y., Yamamoto, H., Minoshima, K., Yasui, T., Iwata, T.: Scan-less hyperspectral dual-comb single-pixel-imaging in both amplitude and phase. Opt Express 25, 21947–21957 (2017)
Article ADS CAS PubMed Google Scholar
Takatsuka H, Yasugi M, Mukojima N, Suyama S, Yamamoto H. Elimination of apparent image on single-pixel-imaging by use of high-frame-rate display with latent random dot patterns. In: Proc. IDW 29, 1035–1038. 2022.
Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. 2015. arXiv:1505.04597.
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc IEEE 86(11), 2278–2324 (1998)
Article Google Scholar

Download references

Funding

A part of this work was supported by JSPS KAKENHI (20H05702).

Author information

Authors and Affiliations

Utsunomiya University, Utsunomiya, Tochigi, Japan
Hiroki Takatsuka, Shiro Suyama & Hirotsugu Yamamoto
Fukui Prefectural University, Obama, Fukui, Japan
Masaki Yasugi

Authors

Hiroki Takatsuka
View author publications
You can also search for this author in PubMed Google Scholar
Masaki Yasugi
View author publications
You can also search for this author in PubMed Google Scholar
Shiro Suyama
View author publications
You can also search for this author in PubMed Google Scholar
Hirotsugu Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

HT contributed for this paper as first author. He conducted the experiments, analyzed the data, and wrote the original draft. MY and SS and HY designed the experiments and edited the manuscript.

Corresponding author

Correspondence to Hirotsugu Yamamoto.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest associated with this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Takatsuka, H., Yasugi, M., Suyama, S. et al. Gesture recognition using deep-learning in single-pixel-imaging with high-frame-rate display with latent random dot patterns. Opt Rev 31, 116–125 (2024). https://doi.org/10.1007/s10043-023-00848-2

Download citation

Received: 31 May 2023
Accepted: 01 November 2023
Published: 11 December 2023
Issue Date: February 2024
DOI: https://doi.org/10.1007/s10043-023-00848-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Gesture recognition using deep-learning in single-pixel-imaging with high-frame-rate display with latent random dot patterns

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

A review of object detection based on deep learning

Convolutional neural network: a review of models, methodologies and applications to object detection

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Gesture recognition using deep-learning in single-pixel-imaging with high-frame-rate display with latent random dot patterns

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

A review of object detection based on deep learning

Convolutional neural network: a review of models, methodologies and applications to object detection

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation