A novel convolutional neural network method for crowd counting,Frontiers of Information Technology & Electronic Engineering

当前位置： X-MOL 学术 › Front. Inform. Technol. Electron. Eng. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

A novel convolutional neural network method for crowd counting
Frontiers of Information Technology & Electronic Engineering ( IF 3 ) Pub Date : 2020-08-20 , DOI: 10.1631/fitee.1900282
Jie-hao Huang , Xiao-guang Di , Jun-de Wu , Ai-yue Chen

Crowd density estimation, in general, is a challenging task due to the large variation of head sizes in the crowds. Existing methods always use a multi-column convolutional neural network (MCNN) to adapt to this variation, which results in an average effect in areas with different densities and brings a lot of noise to the density map. To address this problem, we propose a new method called the segmentation-aware prior network (SAPNet), which generates a high-quality density map without noise based on a coarse head-segmentation map. SAPNet is composed of two networks, i.e., a foreground-segmentation convolutional neural network (FS-CNN) as the front end and a crowd-regression convolutional neural network (CR-CNN) as the back end. With only the single dot annotation, we generate the ground truth of segmentation masks in heads. Then, based on the ground truth, FS-CNN outputs a coarse head-segmentation map, which helps eliminate the noise in regions without people in the density map. By inputting the head-segmentation map generated by the front end, CR-CNN performs accurate crowd counting estimation and generates a high-quality density map. We demonstrate SAPNet on four datasets (i.e., ShanghaiTech, UCF-CC-50, WorldExpo’10, and UCSD), and show the state-of-the-art performances on ShanghaiTech part B and UCF-CC-50 datasets.

中文翻译：

一种新的卷积神经网络人群计数方法

通常，人群密度估计是一项艰巨的任务，因为人群中头部的大小差异很大。现有方法始终使用多列卷积神经网络（MCNN）来适应这种变化，从而在具有不同密度的区域中产生平均效果，并给密度图带来很多噪声。为了解决此问题，我们提出了一种称为分割感知先验网络（SAPNet）的新方法，该方法会基于粗略的头部分割图生成高质量的无噪声的密度图。SAPNet由两个网络组成，即前端分段卷积神经网络（FS-CNN）作为前端，而人群回归卷积神经网络（CR-CNN）作为后端。仅使用单点注释，就可以生成头部中的分割蒙版的地面真相。然后，基于地面事实，FS-CNN输出一个粗略的头部分割图，这有助于消除密度图中没有人的区域中的噪声。通过输入由前端生成的头部分割图，CR-CNN可以进行准确的人群计数估计并生成高质量的密度图。我们在四个数据集（即ShanghaiTech，UCF-CC-50，WorldExpo'10和UCSD）上演示了SAPNet，并展示了ShanghaiTech部分上的最新性能B和UCF-CC-50数据集。

更新日期：2020-08-20

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>