Counting congested crowds under wild conditions with a multi-task Inception network
Communications in Information and Systems
Counting of congested crowds has been widely applied in surveillance event detection, public safety control, and traffic monitoring. Early studies mainly focus on designing hand-crafted features. However, counting performance based on hand-crafted features maybe easily influenced by issues such as partial occlusion, scale variation, and illumination change. Convolutional neural network (CNN) has shown great success in visual crowd counting. In this work, a multi-task Inception network is
... d for crowd counting in congested crowds. Crowd count is predicted through summing up a density map estimated by the proposed network. Three Inception blocks are employed to automatically extract multi-scale features from different patches cropped from the crowd image. The network can jointly estimate the density map, crowd density level, and background / foreground separation. Counting performance obtained through multi-task learning is superior to that obtained through only estimating density map. Contrastive evaluations based on three benchmarking datasets are implemented with several state-of-the-art CNN-based crowd counting approaches. Results indicate the accuracy and robustness of our network in counting congested crowds. The multi-task Inception network almost outperforms the state-of-the-art counting approaches in terms of mean absolute error and mean squared error.