Mask-guided deformation adaptive network for human parsing
Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, we propose to introduce the auxiliary task o...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2022
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/7873 https://ink.library.smu.edu.sg/context/sis_research/article/8876/viewcontent/3467889.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-8876 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-88762024-02-16T08:57:55Z Mask-guided deformation adaptive network for human parsing MAO, Aihua LIANG, Yuan JIAO, Jianbo LIU, Yongtuo HE, Shengfeng Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, we propose to introduce the auxiliary task of human mask and edge detection to facilitate human parsing. Different from human parsing, which exploits the discriminative features of each category, human mask and edge detection emphasizes the boundaries of semantic parsing regions and the difference between foreground humans and background clutter, which benefits the parsing predictions of crowd scenes and small human parts. Specifically, we extract human mask and edge labels from the human parsing annotations and train a shared encoder with three independent decoders for the three mutually beneficial tasks. Furthermore, the decoder feature maps of the human mask prediction branch are further exploited as attention maps, indicating human regions to facilitate the decoding process of human parsing and human edge detection. In addition to these auxiliary tasks, we further alleviate the problem of deformed clothing items under various human poses by tracking the deformation patterns with the deformable convolution. Extensive experiments show that the proposed method can achieve superior performance against state-of-the-art methods on both single and multiple human parsing datasets. Codes and trained models are available https://github.com/ViktorLiang/MGDAN. 2022-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7873 info:doi/10.1145/3467889 https://ink.library.smu.edu.sg/context/sis_research/article/8876/viewcontent/3467889.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Human parsing multi-task learning deformable convolution Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Human parsing multi-task learning deformable convolution Graphics and Human Computer Interfaces |
spellingShingle |
Human parsing multi-task learning deformable convolution Graphics and Human Computer Interfaces MAO, Aihua LIANG, Yuan JIAO, Jianbo LIU, Yongtuo HE, Shengfeng Mask-guided deformation adaptive network for human parsing |
description |
Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, we propose to introduce the auxiliary task of human mask and edge detection to facilitate human parsing. Different from human parsing, which exploits the discriminative features of each category, human mask and edge detection emphasizes the boundaries of semantic parsing regions and the difference between foreground humans and background clutter, which benefits the parsing predictions of crowd scenes and small human parts. Specifically, we extract human mask and edge labels from the human parsing annotations and train a shared encoder with three independent decoders for the three mutually beneficial tasks. Furthermore, the decoder feature maps of the human mask prediction branch are further exploited as attention maps, indicating human regions to facilitate the decoding process of human parsing and human edge detection. In addition to these auxiliary tasks, we further alleviate the problem of deformed clothing items under various human poses by tracking the deformation patterns with the deformable convolution. Extensive experiments show that the proposed method can achieve superior performance against state-of-the-art methods on both single and multiple human parsing datasets. Codes and trained models are available https://github.com/ViktorLiang/MGDAN. |
format |
text |
author |
MAO, Aihua LIANG, Yuan JIAO, Jianbo LIU, Yongtuo HE, Shengfeng |
author_facet |
MAO, Aihua LIANG, Yuan JIAO, Jianbo LIU, Yongtuo HE, Shengfeng |
author_sort |
MAO, Aihua |
title |
Mask-guided deformation adaptive network for human parsing |
title_short |
Mask-guided deformation adaptive network for human parsing |
title_full |
Mask-guided deformation adaptive network for human parsing |
title_fullStr |
Mask-guided deformation adaptive network for human parsing |
title_full_unstemmed |
Mask-guided deformation adaptive network for human parsing |
title_sort |
mask-guided deformation adaptive network for human parsing |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2022 |
url |
https://ink.library.smu.edu.sg/sis_research/7873 https://ink.library.smu.edu.sg/context/sis_research/article/8876/viewcontent/3467889.pdf |
_version_ |
1794549701817663488 |