Skip to main content

Advertisement

Log in

Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

The engineering vehicle detection is a key issue for the raw material warehouse scenes. Through the engineering vehicle detection, the working conditions of engineering vehicles in the raw material warehouse can be intelligently managed to prevent large-scale smoke pollution and the danger of smoke and dust. In this paper, we propose an intelligent method based on the framework of YOLOv4-Tiny for locating and identifying the engineering vehicles. In our detection task, the monitoring scenes are complex with a lot of interference. And the scope of monitoring is large. In order to solve these challenging problems, we introduce the Split-attention module to the network, which can adaptively extract important information of the image and improve the receptive field of detection. In addition, we introduce the Dynamic ReLU function to the network, which allow the network to adaptively learn more suitable ReLU parameters based on the input. We also collect a large number of images obtained from the front-end cameras and create a self-built dataset of engineering vehicles. In this paper, we test our method on the COCO dataset and the self-built engineering vehicle dataset. Experimental results show that our method proposed in this paper can detect engineering vehicles with higher accuracy and faster speed, which can be used for engineering vehicle detection in the scenes of raw material storage warehouses.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Bochkovskiy A, Wang C-Y, Liao H-YM (2020) YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint arXiv:2004.10934

  2. Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z (2020) Dynamic ReLU. arXiv preprint arXiv:2003.10027

  3. Choudhury S, Chattopadhyay SP, Hazra TK (2017) Vehicle detection and counting using haar feature-based classifier 106-109

  4. Dai J, Li Y, He K, Sun J (2016) R-fcn: Object detection via region-based fully convolutional networks. Adv in neural inf process syst 29:379–387

    Google Scholar 

  5. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, vol. 1. IEEE, pp 886–893

  6. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, pp 1–8

  7. Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448

  8. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

  9. Hahnloser RH, Sarpeshkar R, Mahowald MA, Douglas RJ, Seung HS (2000) Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature 405(6789):947–951

    Article  Google Scholar 

  10. Harjoko A, Candradewi I, Bakhtiar AAA (2017) Intelligent Traffic Monitoring Systems: Vehicles Detection, Tracking, And Counting using Haar Cascade Classifier And Optical Flow. ICVIP 2017: Proceedings of the International Conference on Video and Image Processing

  11. He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional networks for visual recognition. In: European conference on computer vision. pp 346–361. Springer

  12. He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing humanlevel performance on imagenet classification. In: ICCV

  13. Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal Deep Autoencoder for Human Pose Recovery. IEEE Trans on Image Process 24(12):5659–5670. https://doi.org/10.1109/TIP.2015.2487860

    Article  MathSciNet  MATH  Google Scholar 

  14. Hong C, Yu J, Zhang J, Jin X, Lee K (2019) Multimodal Face-Pose Estimation With Multitask Manifold Deep Learning. IEEE Trans on Industrial Inf 15(7):3952–3961. https://doi.org/10.1109/TII.2018.2884211

    Article  Google Scholar 

  15. Hong C, Yu J, Chen X (2013) Image-based 3D human pose recovery with locality sensitive sparse retrieval. In: 2013 IEEE international conference on systems, man, and cybernetics, pp 2103–2108. https://doi.org/10.1109/SMC.2013.360

  16. Huansheng S, Xiangqing Z, Baofeng Z, Teng Y (2018) Vehicle detection based on deep learning in complex scene. Application Research of Computers

  17. Iandola F, Moskewicz M, Karayev S, Girshick R, Keutzer K (2014) Densenet: implementing efficient convnet descriptor pyramids. Eprint Arxiv

  18. Lienhart R, Maydt J (2002) An extended set of Haar-like features for rapid object detection. Proceedings. International Conference on Image Processing. IEEE

  19. Lin TY, Maire M, Belongie S, Hays J, Zitnick CL (2014) Microsoft coco: common objects in context

  20. Lin T-Y, Dollar P, Girshick RB, He K, Hariharan B, Belongie SJ (2017) Feature pyramid networks for object detection. CVPR 1(2):4

    Google Scholar 

  21. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: Single shot multibox detector. European conference on computer vision. Springer, pp 21–37

  22. Liu X, Zhang Y, Zhang SY, Wang Y, Liang ZY, Ye XZ (2015) Detection of engineering vehicles in high-resolution monitoring images. Frontiers of Information Technology & Electronic Engineering

  23. Liu Z, Zheng T, Xu G, et al. (2019) Training-Time-Friendly Network for Real-Time Object Detection. arXiv preprint arXiv:1909.00700

  24. Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. ICML Workshop on Deep Learning for Audio, Speech and Language Processing

  25. Qin et al. Z (2019) ThunderNet: Towards Real-Time Generic Object Detection on Mobile Devices. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp 6717-6726. https://doi.org/10.1109/ICCV.2019.00682

  26. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788

  27. Redmon J, Farhadi A (2017) YOLO9000: Better, Faster, Stronger. IEEE Conference on Computer Vision & Pattern Recognition, IEEE

  28. Redmon J, Farhadi, A (2018) YOLOv3: An Incremental Improvement. arXiv preprint arXiv:1804.02767

  29. Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Adv in neural inf process syst 28:91–99

    Google Scholar 

  30. Suhao L, Jinzhao L, Guoquan L, Tong B, Huiqian W, Yu P (2018) Vehicle type detection based on deep learning in traffic scene. Procedia Comput Sci 131:564–572

    Article  Google Scholar 

  31. Wang CY, Liao HYM, Yeh IH et al. (2019) CSPNet: A New Backbone that can Enhance Learning Capability of CNN

  32. Wang RJ, Li X, Ling CX (2018) Pelee: A Real-Time Object Detection System on Mobile Devices. arXiv preprint arXiv:1804.06882

  33. Xuezhi X, Ning L, Xinli G, Shuai W, Abdulmotaleb ES (2018) Engineering vehicles detection based on modified faster r-cnn for power grid surveillance. Sensors 18(7):2258

    Article  Google Scholar 

  34. Yang J, Fu X, Hu Y, et al. (2017) PanNet: A Deep Network Architecture for Pan-Sharpening. 2017 IEEE International Conference on Computer Vision (ICCV)

  35. Yan Z, Jun Z, Wei G (2020) Research on Object Detection of Traffic Scene Based on Deep Learning. AICSconf ’20: 2020 Artificial Intelligence and Complex Systems Conference

  36. Yu J, Rui Y, Tao D (2014) Click Prediction for Web Image Reranking Using Multimodal Sparse Coding. IEEE Trans on Image Process 23(5):2019–2032. https://doi.org/10.1109/TIP.2014.2311377

    Article  MathSciNet  MATH  Google Scholar 

  37. Yu J, Tao D, Wang M, Rui Y (2015) Learning to Rank Using User Clicks and Visual Features for Image Retrieval. IEEE Trans on Cybern 45(4):767–779. https://doi.org/10.1109/TCYB.2014.2336697

    Article  Google Scholar 

  38. Yu J, Zhu C, Zhang J, Huang Q, Tao D (2020) Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition. IEEE Trans on Neural Networks and Learning Syst 31(2):661–674. https://doi.org/10.1109/TNNLS.2019.2908982

    Article  Google Scholar 

  39. Yu J, Tan M, Zhang H, Tao D, Rui Y Hierarchical Deep Click Feature Prediction for Fine-grained Image Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2019.2932058

  40. Zhang J, Yu J, Tao D (2018) Local Deep-Feature Alignment for Unsupervised Dimension Reduction. IEEE Trans on Image Process 27(5):2420–2432. https://doi.org/10.1109/TIP.2018.2804218

    Article  MathSciNet  MATH  Google Scholar 

  41. Zhou JJ, Duan JM (2015) Moving object detection for intelligent vehicles based on occupancy grid map. Syst Eng & Electronics 37(2):436–442

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xuezhi Xiang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by the National Natural Science Foundation of China under Grant 61401113, in part by the Natural Science Foundation of Heilongjiang Province of China under Grant LC201426, in part by the Fundamental Research Funds for the Central Universities of China under Grant 3072020CF0807.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiang, X., Meng, F., Lv, N. et al. Engineering Vehicles Detection for Warehouse Surveillance System Based on Modified YOLOv4-Tiny. Neural Process Lett 55, 2743–2759 (2023). https://doi.org/10.1007/s11063-022-10982-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-022-10982-8

Keywords

Navigation