Language-guided object segmentation

Language-guided Video Object Segmentation (LVOS) is a multi-modal AI task that segments objects in videos based on natural language expressions. Although there has been significant research on Referring-Video Object Segmentation (R-VOS), which enables LVOS, these methods still face limitations that...

全面介紹

Saved in:
書目詳細資料
主要作者: John Benedict, Remelia Shirlley
其他作者: Chen Change Loy
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/175326
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!