Atrous convolutions spatial pyramid network for crowd counting and density estimation

Scale variation because of perspective distortion is still a challenge for crowd analysis. To address this problem, an atrous convolutions spatial pyramid network (ACSPNet) is proposed to perform crowd counts and density maps for both sparse and congested scenarios. Atrous Convolutions sequenced wit...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Ma, Junjie, Dai, Yaping, Tan, Yap Peng
مؤلفون آخرون: School of Electrical and Electronic Engineering
التنسيق: مقال
اللغة:English
منشور في: 2021
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/151340
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة: Nanyang Technological University
اللغة: English
الوصف
الملخص:Scale variation because of perspective distortion is still a challenge for crowd analysis. To address this problem, an atrous convolutions spatial pyramid network (ACSPNet) is proposed to perform crowd counts and density maps for both sparse and congested scenarios. Atrous Convolutions sequenced with increasing atrous rates are utilized to exaggerate the receptive field and maintain the resolution of extracted features. Different rates of atrous convolution blocks in the pyramid are skip-connected to integrate multi-scale information and extent scale perception ability. Atrous Spatial Pyramid Pooling (ASPP) is employed to resample information at different scales and contain global context. We evaluate our ACSPNet on five challenging benchmark crowd counting datasets and our method achieves state-of-the-art mean absolute error (MAE) and mean squared error (MSE) performances.