Using pre-trained models for vision-language understanding tasks
In recent years, remarkable progress has been made in Artificial Intelligence (AI), with an increasing focus on integrating AI systems into people’s daily lives. In the context of our diverse world, research attention has shifted towards applying AI to multimodal understanding tasks. This thesis spe...
Main Author: | CAO, Rui |
---|---|
Format: | text |
Language: | English |
Published: | Institutional Knowledge at Singapore Management University 2024 |
Online Access: | https://ink.library.smu.edu.sg/etd_coll/595 https://ink.library.smu.edu.sg/context/etd_coll/article/1593/viewcontent/Rui_Thesis_PTMs_VLU.pdf |
Institution: | Singapore Management University |
Similar Items
- Enhancing visual grounding in vision-language pre-training with position-guided text prompts
  by: WANG, Alex Jinpeng, et al.
  Published: (2024)
- Injecting descriptive meta-information into pre-trained language models with hypernetworks
  by: DUAN, Wenying, et al.
  Published: (2021)
- Multimedia question answering
  by: NIE LIQIANG
  Published: (2013)
- On the transferability of pre-trained language models for low-resource programming languages
  by: CHEN, Fuxiang, et al.
  Published: (2022)
- Disentangling hate in online memes
  by: LEE, Ka Wei, Roy, et al.
  Published: (2021)