Mask Guided GAN for Density Estimation and Crowd Counting

Hai-yan Yao, Wang-gen Wan, Xiang Li
2020 IEEE Access  
Density estimation aims to predict the spatial distribution of a crowd scene, and crowd counting aims to automatically check the number of heads as close as the ground truth. We propose a mask guided GAN (Generative Adversarial Network) architecture to solve these two problems synthetically. Step one is generating a segmentation mask, separating the crowd region from the background and redundant information. Step two is predicting the density map with an adversarial learning process guided by
more » ... process guided by the former mask information. Moreover, we branch out from the base network and feed into a counting regressor, which is solely to provide more accurate counting results. The whole scheme is trained collaboratively by compositing density loss (a weighted loss to balance the influence of different data) and counting loss (an MSE loss for counting regression branch). Through the mask information guidance, GAN likely finds its way training to capture more distinguishing features. Therefore, we also achieve more accurate prediction results. Experimental results on different datasets including collected from the internet and the actual scene in Shanghai, the second-most populous city located on China's southeast coast, indicate the validity and robustness by comparable counting numbers and high-quality density maps focusing on crowd area. INDEX TERMS Adversarial learning, mask guided network, density estimation, crowd counting. 31432 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/ VOLUME 8, 2020
doi:10.1109/access.2020.2973333 fatcat:vhofxih45zda7ok6kzwpfdhfhi