An empirical study on adaptation methods for large-scale vision-language models

Since the rise of powerful large-scale pre-trained Vision-Language (VL) models, such as CLIP and ALIGN, pre-training and fine-tuning have become promising paradigms to build transferable models for different downstream tasks. However, it is often prohibitive to fine-tune the whole pre-trained VL mod...

全面介紹

Saved in:
書目詳細資料
主要作者: Wang, Annan
其他作者: Chen Change Loy
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2023
主題:
在線閱讀:https://hdl.handle.net/10356/165970
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English