Abstract
The engineering vehicle detection is a key issue for the raw material warehouse scenes. Through the engineering vehicle detection, the working conditions of engineering vehicles in the raw material warehouse can be intelligently managed to prevent large-scale smoke pollution and the danger of smoke and dust. In this paper, we propose an intelligent method based on the framework of YOLOv4-Tiny for locating and identifying the engineering vehicles. In our detection task, the monitoring scenes are complex with a lot of interference. And the scope of monitoring is large. In order to solve these challenging problems, we introduce the Split-attention module to the network, which can adaptively extract important information of the image and improve the receptive field of detection. In addition, we introduce the Dynamic ReLU function to the network, which allow the network to adaptively learn more suitable ReLU parameters based on the input. We also collect a large number of images obtained from the front-end cameras and create a self-built dataset of engineering vehicles. In this paper, we test our method on the COCO dataset and the self-built engineering vehicle dataset. Experimental results show that our method proposed in this paper can detect engineering vehicles with higher accuracy and faster speed, which can be used for engineering vehicle detection in the scenes of raw material storage warehouses.
Similar content being viewed by others
References
Bochkovskiy A, Wang C-Y, Liao H-YM (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint arXiv:2004.10934
Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic ReLU. arXiv preprint arXiv:2003.10027
Choudhury S, Chattopadhyay SP, Hazra TK (2017) Vehicle detection and counting using haar feature-based classifier 106-109
Dai J, Li Y, He K, Sun J (2016) R-fcn: Object detection via region-based fully convolutional networks. Adv in neural inf process syst 29:379–387
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, vol. 1. IEEE, pp 886–893
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, pp 1–8
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Hahnloser RH, Sarpeshkar R, Mahowald MA, Douglas RJ, Seung HS (2000) Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature 405(6789):947–951
Harjoko A, Candradewi I, Bakhtiar AAA (2017) Intelligent Traffic Monitoring Systems: Vehicles Detection, Tracking, And Counting using Haar Cascade Classifier And Optical Flow. ICVIP 2017: Proceedings of the International Conference on Video and Image Processing
He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision. pp 346–361. Springer
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing humanlevel performance on imagenet classification. In: ICCV
Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal Deep Autoencoder for Human Pose Recovery. IEEE Trans on Image Process 24(12):5659–5670. https://doi.org/10.1109/TIP.2015.2487860
Hong C, Yu J, Zhang J, Jin X, Lee K (2019) Multimodal Face-Pose Estimation With Multitask Manifold Deep Learning. IEEE Trans on Industrial Inf 15(7):3952–3961. https://doi.org/10.1109/TII.2018.2884211
Hong C, Yu J, Chen X (2013) Image-based 3D human pose recovery with locality sensitive sparse retrieval. In: 2013 IEEE international conference on systems, man, and cybernetics, pp 2103–2108. https://doi.org/10.1109/SMC.2013.360
Huansheng S, Xiangqing Z, Baofeng Z, Teng Y (2018) Vehicle detection based on deep learning in complex scene. Application Research of Computers
Iandola F, Moskewicz M, Karayev S, Girshick R, Keutzer K (2014) Densenet: implementing efficient convnet descriptor pyramids. Eprint Arxiv
Lienhart R, Maydt J (2002) An extended set of Haar-like features for rapid object detection. Proceedings. International Conference on Image Processing. IEEE
Lin TY, Maire M, Belongie S, Hays J, Zitnick CL (2014) Microsoft coco: common objects in context
Lin T-Y, Dollar P, Girshick RB, He K, Hariharan B, Belongie SJ (2017) Feature pyramid networks for object detection. CVPR 1(2):4
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: Single shot multibox detector. European conference on computer vision. Springer, pp 21–37
Liu X, Zhang Y, Zhang SY, Wang Y, Liang ZY, Ye XZ (2015) Detection of engineering vehicles in high-resolution monitoring images. Frontiers of Information Technology & Electronic Engineering
Liu Z, Zheng T, Xu G, et al. (2019) Training-Time-Friendly Network for Real-Time Object Detection. arXiv preprint arXiv:1909.00700
Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. ICML Workshop on Deep Learning for Audio, Speech and Language Processing
Qin et al. Z (2019) ThunderNet: Towards Real-Time Generic Object Detection on Mobile Devices. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp 6717-6726. https://doi.org/10.1109/ICCV.2019.00682
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Redmon J, Farhadi A (2017) YOLO9000: Better, Faster, Stronger. IEEE Conference on Computer Vision & Pattern Recognition, IEEE
Redmon J, Farhadi, A (2018) YOLOv3: An Incremental Improvement. arXiv preprint arXiv:1804.02767
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv in neural inf process syst 28:91–99
Suhao L, Jinzhao L, Guoquan L, Tong B, Huiqian W, Yu P (2018) Vehicle type detection based on deep learning in traffic scene. Procedia Comput Sci 131:564–572
Wang CY, Liao HYM, Yeh IH et al. (2019) CSPNet: A New Backbone that can Enhance Learning Capability of CNN
Wang RJ, Li X, Ling CX (2018) Pelee: A Real-Time Object Detection System on Mobile Devices. arXiv preprint arXiv:1804.06882
Xuezhi X, Ning L, Xinli G, Shuai W, Abdulmotaleb ES (2018) Engineering vehicles detection based on modified faster r-cnn for power grid surveillance. Sensors 18(7):2258
Yang J, Fu X, Hu Y, et al. (2017) PanNet: A Deep Network Architecture for Pan-Sharpening. 2017 IEEE International Conference on Computer Vision (ICCV)
Yan Z, Jun Z, Wei G (2020) Research on Object Detection of Traffic Scene Based on Deep Learning. AICSconf ’20: 2020 Artificial Intelligence and Complex Systems Conference
Yu J, Rui Y, Tao D (2014) Click Prediction for Web Image Reranking Using Multimodal Sparse Coding. IEEE Trans on Image Process 23(5):2019–2032. https://doi.org/10.1109/TIP.2014.2311377
Yu J, Tao D, Wang M, Rui Y (2015) Learning to Rank Using User Clicks and Visual Features for Image Retrieval. IEEE Trans on Cybern 45(4):767–779. https://doi.org/10.1109/TCYB.2014.2336697
Yu J, Zhu C, Zhang J, Huang Q, Tao D (2020) Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition. IEEE Trans on Neural Networks and Learning Syst 31(2):661–674. https://doi.org/10.1109/TNNLS.2019.2908982
Yu J, Tan M, Zhang H, Tao D, Rui Y Hierarchical Deep Click Feature Prediction for Fine-grained Image Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2019.2932058
Zhang J, Yu J, Tao D (2018) Local Deep-Feature Alignment for Unsupervised Dimension Reduction. IEEE Trans on Image Process 27(5):2420–2432. https://doi.org/10.1109/TIP.2018.2804218
Zhou JJ, Duan JM (2015) Moving object detection for intelligent vehicles based on occupancy grid map. Syst Eng & Electronics 37(2):436–442
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was supported in part by the National Natural Science Foundation of China under Grant 61401113, in part by the Natural Science Foundation of Heilongjiang Province of China under Grant LC201426, in part by the Fundamental Research Funds for the Central Universities of China under Grant 3072020CF0807.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xiang, X., Meng, F., Lv, N. et al. Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny. Neural Process Lett 55, 2743–2759 (2023). https://doi.org/10.1007/s11063-022-10982-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-022-10982-8