Tackling background ambiguities in multi-class few-shot point cloud semantic segmentation


Bibliographic Details
Main Authors: Lai, Lvlong; Chen, Jian; Zhang, Chi; Zhang, Zehong; Lin, Guosheng; Wu, Qingyao
Other Authors: School of Computer Science and Engineering
Format: Article
Language: English
Published: 2022
Online Access: https://hdl.handle.net/10356/163370
Description
Summary: Few-shot point cloud semantic segmentation learns to segment novel classes from scarce labeled samples. Within an episode, a novel target class is defined by a few support samples with corresponding binary masks, where only the points of this class are labeled as foreground and all others are regarded as background. In tasks involving multiple target classes, the meaning of background differs from one target class to another, so background ambiguities appear: some points labeled as background in one support sample may belong to other target classes. This results in incorrect guidance and damages the model's segmentation performance. However, previous methods in the literature do not consider this problem. In this paper, we propose a simple yet effective approach to tackle background ambiguities, which adds the entropy of predictions on query samples to the training objective function as an additional regularization term. In addition, we design a feature transformation operation to reduce the feature differences between support and query samples. With our proposed approach, fine-tuning, a weak baseline method for few-shot segmentation, gains a significant performance improvement (e.g., 7.48% and 7.04% in the 2-way-1-shot and 3-way-1-shot tasks of S3DIS, respectively) and outperforms current state-of-the-art methods in all task settings of the S3DIS and ScanNet benchmark datasets.
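
The summary describes adding the entropy of query predictions to the training objective as a regularizer. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes the supervised term is a per-point cross-entropy on the labeled support points and that the two terms are combined with a scalar weight. The function names and the `weight` value are hypothetical and do not come from the record.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one point's class logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - lse for z in logits]


def cross_entropy(support_logits, support_labels):
    """Mean cross-entropy over labeled support points (assumed supervised term)."""
    losses = [-log_softmax(z)[y] for z, y in zip(support_logits, support_labels)]
    return sum(losses) / len(losses)


def mean_entropy(query_logits):
    """Mean Shannon entropy of per-point predictions on unlabeled query points."""
    total = 0.0
    for z in query_logits:
        log_p = log_softmax(z)
        total += -sum(math.exp(lp) * lp for lp in log_p)
    return total / len(query_logits)


def regularized_loss(support_logits, support_labels, query_logits, weight=0.1):
    """Supervised loss plus an entropy penalty on query predictions.

    `weight` is a hypothetical hyperparameter, not a value from the paper.
    """
    supervised = cross_entropy(support_logits, support_labels)
    return supervised + weight * mean_entropy(query_logits)
```

Minimizing the entropy term pushes the model toward confident predictions on query points, which needs no query labels and therefore does not rely on the ambiguous background labels in the support masks. A uniform prediction over three classes gives the maximum entropy, `math.log(3)`, while a near one-hot prediction gives an entropy close to zero.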