Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny

Xiang, Xuezhi; Meng, Fanda; Lv, Ning; Yin, Hang

doi:10.1007/s11063-022-10982-8

Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny

Published: 04 August 2022

Volume 55, pages 2743–2759, (2023)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Xuezhi Xiang ORCID: orcid.org/0000-0003-3488-4548^1,2,
Fanda Meng¹,
Ning Lv¹ &
…
Hang Yin¹

349 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

The engineering vehicle detection is a key issue for the raw material warehouse scenes. Through the engineering vehicle detection, the working conditions of engineering vehicles in the raw material warehouse can be intelligently managed to prevent large-scale smoke pollution and the danger of smoke and dust. In this paper, we propose an intelligent method based on the framework of YOLOv4-Tiny for locating and identifying the engineering vehicles. In our detection task, the monitoring scenes are complex with a lot of interference. And the scope of monitoring is large. In order to solve these challenging problems, we introduce the Split-attention module to the network, which can adaptively extract important information of the image and improve the receptive field of detection. In addition, we introduce the Dynamic ReLU function to the network, which allow the network to adaptively learn more suitable ReLU parameters based on the input. We also collect a large number of images obtained from the front-end cameras and create a self-built dataset of engineering vehicles. In this paper, we test our method on the COCO dataset and the self-built engineering vehicle dataset. Experimental results show that our method proposed in this paper can detect engineering vehicles with higher accuracy and faster speed, which can be used for engineering vehicle detection in the scenes of raw material storage warehouses.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Conv-Attention: A Low Computation Attention Calculation Method for Swin Transformer

Article Open access 24 February 2024

Zhehang Lou, Suyun Luo, … Dan Wei

3D Object Detection for Autonomous Driving: A Comprehensive Survey

Article 27 April 2023

Jiageng Mao, Shaoshuai Shi, … Hongsheng Li

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

References

Bochkovskiy A, Wang C-Y, Liao H-YM (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint arXiv:2004.10934
Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic ReLU. arXiv preprint arXiv:2003.10027
Choudhury S, Chattopadhyay SP, Hazra TK (2017) Vehicle detection and counting using haar feature-based classifier 106-109
Dai J, Li Y, He K, Sun J (2016) R-fcn: Object detection via region-based fully convolutional networks. Adv in neural inf process syst 29:379–387
Google Scholar
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, vol. 1. IEEE, pp 886–893
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, pp 1–8
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Hahnloser RH, Sarpeshkar R, Mahowald MA, Douglas RJ, Seung HS (2000) Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature 405(6789):947–951
Article Google Scholar
Harjoko A, Candradewi I, Bakhtiar AAA (2017) Intelligent Traffic Monitoring Systems: Vehicles Detection, Tracking, And Counting using Haar Cascade Classifier And Optical Flow. ICVIP 2017: Proceedings of the International Conference on Video and Image Processing
He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision. pp 346–361. Springer
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing humanlevel performance on imagenet classification. In: ICCV
Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal Deep Autoencoder for Human Pose Recovery. IEEE Trans on Image Process 24(12):5659–5670. https://doi.org/10.1109/TIP.2015.2487860
Article MathSciNet MATH Google Scholar
Hong C, Yu J, Zhang J, Jin X, Lee K (2019) Multimodal Face-Pose Estimation With Multitask Manifold Deep Learning. IEEE Trans on Industrial Inf 15(7):3952–3961. https://doi.org/10.1109/TII.2018.2884211
Article Google Scholar
Hong C, Yu J, Chen X (2013) Image-based 3D human pose recovery with locality sensitive sparse retrieval. In: 2013 IEEE international conference on systems, man, and cybernetics, pp 2103–2108. https://doi.org/10.1109/SMC.2013.360
Huansheng S, Xiangqing Z, Baofeng Z, Teng Y (2018) Vehicle detection based on deep learning in complex scene. Application Research of Computers
Iandola F, Moskewicz M, Karayev S, Girshick R, Keutzer K (2014) Densenet: implementing efficient convnet descriptor pyramids. Eprint Arxiv
Lienhart R, Maydt J (2002) An extended set of Haar-like features for rapid object detection. Proceedings. International Conference on Image Processing. IEEE
Lin TY, Maire M, Belongie S, Hays J, Zitnick CL (2014) Microsoft coco: common objects in context
Lin T-Y, Dollar P, Girshick RB, He K, Hariharan B, Belongie SJ (2017) Feature pyramid networks for object detection. CVPR 1(2):4
Google Scholar
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: Single shot multibox detector. European conference on computer vision. Springer, pp 21–37
Liu X, Zhang Y, Zhang SY, Wang Y, Liang ZY, Ye XZ (2015) Detection of engineering vehicles in high-resolution monitoring images. Frontiers of Information Technology & Electronic Engineering
Liu Z, Zheng T, Xu G, et al. (2019) Training-Time-Friendly Network for Real-Time Object Detection. arXiv preprint arXiv:1909.00700
Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. ICML Workshop on Deep Learning for Audio, Speech and Language Processing
Qin et al. Z (2019) ThunderNet: Towards Real-Time Generic Object Detection on Mobile Devices. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp 6717-6726. https://doi.org/10.1109/ICCV.2019.00682
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Redmon J, Farhadi A (2017) YOLO9000: Better, Faster, Stronger. IEEE Conference on Computer Vision & Pattern Recognition, IEEE
Redmon J, Farhadi, A (2018) YOLOv3: An Incremental Improvement. arXiv preprint arXiv:1804.02767
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv in neural inf process syst 28:91–99
Google Scholar
Suhao L, Jinzhao L, Guoquan L, Tong B, Huiqian W, Yu P (2018) Vehicle type detection based on deep learning in traffic scene. Procedia Comput Sci 131:564–572
Article Google Scholar
Wang CY, Liao HYM, Yeh IH et al. (2019) CSPNet: A New Backbone that can Enhance Learning Capability of CNN
Wang RJ, Li X, Ling CX (2018) Pelee: A Real-Time Object Detection System on Mobile Devices. arXiv preprint arXiv:1804.06882
Xuezhi X, Ning L, Xinli G, Shuai W, Abdulmotaleb ES (2018) Engineering vehicles detection based on modified faster r-cnn for power grid surveillance. Sensors 18(7):2258
Article Google Scholar
Yang J, Fu X, Hu Y, et al. (2017) PanNet: A Deep Network Architecture for Pan-Sharpening. 2017 IEEE International Conference on Computer Vision (ICCV)
Yan Z, Jun Z, Wei G (2020) Research on Object Detection of Traffic Scene Based on Deep Learning. AICSconf ’20: 2020 Artificial Intelligence and Complex Systems Conference
Yu J, Rui Y, Tao D (2014) Click Prediction for Web Image Reranking Using Multimodal Sparse Coding. IEEE Trans on Image Process 23(5):2019–2032. https://doi.org/10.1109/TIP.2014.2311377
Article MathSciNet MATH Google Scholar
Yu J, Tao D, Wang M, Rui Y (2015) Learning to Rank Using User Clicks and Visual Features for Image Retrieval. IEEE Trans on Cybern 45(4):767–779. https://doi.org/10.1109/TCYB.2014.2336697
Article Google Scholar
Yu J, Zhu C, Zhang J, Huang Q, Tao D (2020) Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition. IEEE Trans on Neural Networks and Learning Syst 31(2):661–674. https://doi.org/10.1109/TNNLS.2019.2908982
Article Google Scholar
Yu J, Tan M, Zhang H, Tao D, Rui Y Hierarchical Deep Click Feature Prediction for Fine-grained Image Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2019.2932058
Zhang J, Yu J, Tao D (2018) Local Deep-Feature Alignment for Unsupervised Dimension Reduction. IEEE Trans on Image Process 27(5):2420–2432. https://doi.org/10.1109/TIP.2018.2804218
Article MathSciNet MATH Google Scholar
Zhou JJ, Duan JM (2015) Moving object detection for intelligent vehicles based on occupancy grid map. Syst Eng & Electronics 37(2):436–442
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information and Communication Engineering, Harbin Engineering University, Harbin, 150001, China
Xuezhi Xiang, Fanda Meng, Ning Lv & Hang Yin
Key Laboratory of Advanced Marine Communication and InformationTechnology, Ministry of Industry and Information Technology, Harbin, China
Xuezhi Xiang

Authors

Xuezhi Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Fanda Meng
View author publications
You can also search for this author in PubMed Google Scholar
Ning Lv
View author publications
You can also search for this author in PubMed Google Scholar
Hang Yin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xuezhi Xiang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by the National Natural Science Foundation of China under Grant 61401113, in part by the Natural Science Foundation of Heilongjiang Province of China under Grant LC201426, in part by the Fundamental Research Funds for the Central Universities of China under Grant 3072020CF0807.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xiang, X., Meng, F., Lv, N. et al. Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny. Neural Process Lett 55, 2743–2759 (2023). https://doi.org/10.1007/s11063-022-10982-8

Download citation

Accepted: 21 July 2022
Published: 04 August 2022
Issue Date: June 2023
DOI: https://doi.org/10.1007/s11063-022-10982-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny

Abstract

Access this article

Similar content being viewed by others

Conv-Attention: A Low Computation Attention Calculation Method for Swin Transformer

3D Object Detection for Autonomous Driving: A Comprehensive Survey

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny

Abstract

Access this article

Similar content being viewed by others

Conv-Attention: A Low Computation Attention Calculation Method for Swin Transformer

3D Object Detection for Autonomous Driving: A Comprehensive Survey

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation