Crowd Counting Network with Self-attention Distillation

Yaoyao Li, Li Wang, Huailin Zhao, Zhen Nie
2020 Journal of Robotics, Networking and Artificial Life (JRNAL)  
A B S T R A C T Context information is essential for crowd counting network to estimate crowd numbers, especially in the congested scene accurately. However, shallow layers of common crowd counting networks (i.e., congested scene recognition network) do not own large receptive filed so that they can't efficiently utilize context information from the crowd scene. To solve this problem, in this paper, we propose a crowd counting network with self-attention distillation. Each input image is first
more » ... ent to the visual geometry group (VGG)-16 network for feature extracting. Then, the extracted features are processed by the dilated convolutional part for the final crowd density estimation. Specially, we apply self-attention distillation strategy at different locations of the dilated convolutional part to use the global context information from the deeper layers to guide the shallower layers to learn. We compare our method with the other state-of-the-art works on the UCF-QNRF dataset, and the experiment results demonstrate the superiority of our method.
doi:10.2991/jrnal.k.200528.009 fatcat:gh6hsa5snrbq7azl7wnj5usdbm