Object counting using machine learning
In this thesis, to improve the accuracy of multi-modal crowd count estimation, a three-stream adaptive fusion network (TAFNet) and a scale-aware self-differential attention network (SDANet) are proposed. The proposed TAFNet is adopted to adaptively extract and fuse the optical information with therm...
محفوظ في:
المؤلف الرئيسي: | |
---|---|
مؤلفون آخرون: | |
التنسيق: | Thesis-Master by Research |
اللغة: | English |
منشور في: |
Nanyang Technological University
2022
|
الموضوعات: | |
الوصول للمادة أونلاين: | https://hdl.handle.net/10356/162531 |
الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
المؤسسة: | Nanyang Technological University |
اللغة: | English |
الملخص: | In this thesis, to improve the accuracy of multi-modal crowd count estimation, a three-stream adaptive fusion network (TAFNet) and a scale-aware self-differential attention network (SDANet) are proposed. The proposed TAFNet is adopted to adaptively extract and fuse the optical information with thermal information, increasing the effectiveness of multi-modal information fusing. The proposed SDANet utilizes multi-scale features to estimate the density map and predict crowd number, which solves the scale variation problem of crowds.
Several novel modules are proposed to highlight the scale information and avoid information redundancy. The experiments on RGBT-CC benchmark show the effectiveness of proposed methods for RGB-T crowd counting compared with state-of-the-art methods.
The experiments on ShanghaitechRGBD benchmark demonstrate that proposed networks are capable of RGB-D crowd counting. In addition, the estimated density maps have high quality and are close to the ground truth density maps. |
---|