INDONESIAN IMAGE CAPTIONING USING VISION-LANGUAGE MODEL

The success of the pre-train and fine-tune approach in computer vision and natural language processing has driven a growing body of research on Vision-Language Models, commonly known as VL models. Previous research on Indonesian-language image captioning generally relied on limited...

Bibliographic Details
Main Author:	Astrada Fathurrahman, Raihan
Format:	Final Project
Language:	Indonesian
Online Access:	https://digilib.itb.ac.id/gdl/view/78303
Institution:	Institut Teknologi Bandung

Similar Items

Aligning vision and language for image captioning using deep learning
by: Cai, Chen
Published: (2024)

IMAGE CAPTIONING WITH SENTIMENT FOR INDONESIAN
by: Khumaeni

INDONESIAN IMAGE CAPTIONING USING SEMANTIC COMPOSITIONAL NETWORKS
by: Andrew Obaja Sinurat, Ray

An Empirical Study of Language CNN for Image Captioning
by: Gu J., et al.
Published: (2018)

Mitigating fine-grained hallucination by fine-tuning large vision-language models with caption rewrites
by: WANG, Lei, et al.
Published: (2024)

Automated image captioning
by: Teo, Sabrina Jingya
Published: (2017)

Text-based image retrieval using image captioning
by: Tan, Kah Hwa
Published: (2019)

Neural image and video captioning
by: Lam, Ting En
Published: (2024)

The Language Principles Used in Writing Captions of Instagram Posts at Malang Tourism
by: Rara Wienda Nautica
Published: (2021)

Generative image captioning in Urdu using deep learning
by: Afzal M.K.
Published: (2023)

Deep learning-based image captioning
by: Chong, Kaydon
Published: (2019)

Incorporating additional knowledge into image captioners
by: Xu, Yang
Published: (2021)

Neural image and video captioning (NIVC)
by: Lee, Jeremy Kian Kiat
Published: (2022)

Learning transferable perturbations for image captioning
by: WU, Hanjie, et al.
Published: (2022)

Visual search using artificial intelligence (deep learning models for image caption)
by: Qiao, Guanheng
Published: (2020)

MOTION-BASED IMAGE CAPTIONING WITH INJECTION METHOD
by: Wibisono Haryadi, Husnulzaki

Evaluations of training paradigms in neural image captioning
by: Lee, Si Min
Published: (2019)

Improved image captioning techniques with comparative study
by: He, Cari
Published: (2021)

Deconfounded image captioning: a causal retrospect
by: Yang, Xu, et al.
Published: (2022)

Image captioning via semantic element embedding
by: ZHANG, Xiaodan, et al.
Published: (2020)

IMAGE CAPTIONING WITH EMOTION USING ENCODER-DECODER FRAMEWORK LSTM AND FACTORED LSTM
by: Rahman Ahaddienata, Dery

IMAGE CAPTIONING ON GEOLOGICAL ROCKS WITH TRANSFORMER ARCHITECTURE AND DATA AUGMENTATION
by: Iqbal Sigid, Muhammad

Learning to collocate Visual-Linguistic Neural Modules for image captioning
by: Yang, Xu, et al.
Published: (2023)

CgT-GAN: CLIP-guided text GAN for image captioning
by: YU, Jiarui, et al.
Published: (2023)

PENGAPLIKASIAN MODEL NEURAL IMAGE CAPTION PADA PEMBANGKITAN TEKS JUDUL UNTUK GAMBAR PRODUK
by: Ihsanul Amal, Irfan

IMAGE CAPTIONING WITH TEXT AUGMENTATION AND TRANSFORMER CASE STUDY: TOURISM DATA
by: Thoriq Ahmada, Marsa

The Writing Principles Used In Globalunair’s Instagram Caption
by: Oppie Agasta
Published: (2020)

Stack-VS : stacked visual-semantic attention for image caption generation
by: Cheng, Ling, et al.
Published: (2021)

Context-aware visual policy network for fine-grained image captioning
by: Zha, Zheng-Jun, et al.
Published: (2022)

Keyword-driven image captioning via Context-dependent Bilateral LSTM
by: ZHANG, Xiaodan, et al.
Published: (2017)

A Qualitative Study of Closed Captions in English Language Teaching (ELT) YouTube Videos
by: Hernandez, Queenie Mae G., et al.
Published: (2024)

PERSONALIZED VISUAL INFORMATION CAPTIONING
by: WU SHUANG
Published: (2023)

Are vision language models multimodal learners?
by: Lee, Gyeonggeon
Published: (2024)

Interactive change-aware transformer network for remote sensing image change captioning
by: Cai, Chen, et al.
Published: (2024)

Using pre-trained models for vision-language understanding tasks
by: CAO, Rui
Published: (2024)

More is better : precise and detailed image captioning using online positive recall and missing concepts mining
by: Zhang, Mingxing, et al.
Published: (2020)

PEMBANGKITAN CAPTION BERKONTEKS DARI GAMBAR
by: Donnyson, Jessin

Insert caption: A study on whether to provide a new limitation in favor of the deaf with regards to closed captioning
by: Lago, Veronica Marie M., et al.
Published: (2013)

Multimodal fashion knowledge extraction as captioning
by: YUAN, Yifei, et al.
Published: (2023)

MULTI-LABEL CLASSIFICATION OF HATE SPEECH AND ABUSIVE LANGUAGE IN INDONESIAN TWITTER
by: Raihan Asyraf Desanto, Muhammad