Journal of Frontiers of Computer Science and Technology ›› 2021, Vol. 15 ›› Issue (8): 1469-1475.DOI: 10.3778/j.issn.1673-9418.2005066

• Artificial Intelligence • Previous Articles     Next Articles

Crowd Density Detection Technology Based on Deep Semantic Segmentation

MA Yu, DU Huimin, MAO Zhili, ZHANG Xia   

  1. School of Electronic Engineering, Xi'an University of Posts & Telecommunications, Xi'an 710121, China
  • Online:2021-08-01 Published:2021-08-02



  1. 西安邮电大学 电子工程学院,西安 710121


With the development of society, people are going out more and more, which leads to more and more crowded scenes. The detection of crowd density is particularly important. Aiming at the multi-scale problem of different human scales caused by camera angles in the crowd, a crowd density detection method based on deep semantic segmentation is proposed. The front-end of the network uses an improved VGG network to extract the crowd characteristics, so that the output feature map is 1/8 of the original image to improve the accuracy of the predicted density map. The back-end designs two atrous convolution modules with different array dilation rates to capture the multi-scale features of the crowd. The multi-scale features of the network enable the network to capture more scale details and edge information. The network finally uses 1×1 convolution to cascade the output to obtain a high-quality prediction density map. At the same time, in order to solve the grid effect caused by atrous convolution, a zigzag network structure is designed, so that every pixel in the convolution operation after zero filling is calculated to ensure the continuity of the information, thereby improving the accuracy of the network. The network performance is tested on the ShanghaiTech and UCF_CC_50 datasets. The test results are better than the current mainstream crowd density detection methods. The MAE value obtained from the test is 42.4% and 38.1% higher than that of the MCNN network. Compared with SANet, network performance is increased by 5.3% and 9.6%.

Key words: crowd density detection, deep learning, atrous convolution, density map, zigzag network



关键词: 人群密度检测, 深度学习, 空洞卷积, 密度图, 锯齿状网络