INDONESIAN IMAGE CAPTIONING USING VISION-LANGUAGE MODEL
The success of pre-train and fine-tune schemes in the fields of computer vision and natural language processing has led to the increase of research exploring Vision-Language Models, commonly known as VL Models. Previous research on Indonesian language image captioning generally relied on limited...
Saved in:
Main Author: | Astrada Fathurrahman, Raihan |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/78303 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
Similar Items
-
Aligning vision and language for image captioning using deep learning
by: Cai, Chen
Published: (2024) -
IMAGE CAPTIONING WITH SENTIMENT FOR INDONESIAN
by: Khumaeni -
INDONESIAN IMAGE CAPTIONING USING SEMANTIC COMPOSITIONAL NETWORKS
by: Andrew Obaja Sinurat, Ray -
An Empirical Study of Language CNN for Image Captioning
by: Gu J., et al.
Published: (2018) -
Mitigating fine-grained hallucination by fine-tuning large vision-language models with caption rewrites
by: WANG, Lei, et al.
Published: (2024)