Paired cross-modal data augmentation for fine-grained image-to-text retrieval

Paired cross-modal data augmentation for fine-grained image-to-text retrieval

This paper investigates an open research problem of generating text-image pairs to improve the training of fine-grained image-to-text cross-modal retrieval task, and proposes a novel framework for paired data augmentation by uncovering the hidden semantic information of StyleGAN2 model. Specific...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wang, Hao, Lin, Guosheng, Hoi, Steven C. H., Miao, Chunyan
Other Authors:	School of Computer Science and Engineering
Format:	Conference or Workshop Item
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Image-to-Text Retrieval Computing Methodologies
Online Access:	https://hdl.handle.net/10356/164145
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Learning structural representations for recipe generation and food retrieval
by: Wang, Hao, et al.
Published: (2022)

Adaptation of language models via text augmentation
by: Prachaseree, Chaiyasait
Published: (2023)

Cycle-consistent inverse GAN for text-to-image synthesis
by: Wang, Hao, et al.
Published: (2022)

Deep learning-based text augmentation for named entity recognition
by: Surana, Tanmay
Published: (2023)

Imaged document text retrieval without OCR
by: Tan, C.L., et al.
Published: (2013)

Structure-aware generation network for recipe generation from images
by: Wang, Hao, et al.
Published: (2021)

Decomposing generation networks with structure prediction for recipe generation
by: Wang, Hao, et al.
Published: (2022)

Discovery of interesting phrases from text streams
by: Pang, Jeffrey Jian Hao
Published: (2011)

Event detection for biomedical text
by: Pham, Nguyen Minh Thu
Published: (2022)

Korean jamo-level byte-pair encoding for neural machine translation
by: Lee, Junyoung
Published: (2023)

Cocktail: mixing multi-modality controls for text-conditional image generation
by: Hu, Minghui, et al.
Published: (2023)

Knowledge graph construction from text
by: Yong, Shan Jie
Published: (2021)

Efficient text classification
by: Tan, Cheryl Qian Ru.
Published: (2010)

Fine-grained image classification using deep learning
by: Sun, Deguang
Published: (2022)

Personality detection from text, based on the MBTI model
by: Christienne Grace Regodon, Visco
Published: (2020)

Topical analysis of text streams
by: He, Qi
Published: (2009)

Image-based document vectors for text retrieval
by: Yu, Z., et al.
Published: (2013)

Sentiment-oriented metric learning for text-to-image retrieval
by: TRUONG, Quoc Tuan, et al.
Published: (2021)

Text mining with minimum human supervision
by: Lim, Kewin Hong Kwan.
Published: (2012)

Paraphrase detection of semantically equivalent text
by: Lim, Linus Ji Wei
Published: (2017)

Developing web crawler and categorization of newspaper text
by: Singh, Rakhi
Published: (2015)

An object-oriented, logic based approach to document retrieval
by: Tan, Nam Beng.
Published: (2009)

Master-proxy host structure for efficient document retrieval
by: Soh, Ying Kwang.
Published: (2008)

Exploiting text mining for Java package mappings
by: Ong, Kent Long Xiong
Published: (2017)

Deep learning techniques for text classification
by: Raihan, Diardano
Published: (2021)

Ontology building for concept indexing on multimedia information retrieval system
by: Liu, Xiao
Published: (2015)

TFIDF meets deep document representation : a re-visit of co-training for text classification
by: Chen, Zhiwei
Published: (2020)

Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
by: Xue, Chuhui, et al.
Published: (2022)

Privacy-Preserving Similarity-Based Text Retrieval
by: PANG, Hwee Hwa, et al.
Published: (2010)

Chinese text retrieval system
by: Lim, Hong Koon.
Published: (2008)

Keyword and named entity recognition on air traffic control text
by: Tay, Nikole Qiwei
Published: (2020)

Text retrieval from document images based on word shape analysis
by: Tan, C.L., et al.
Published: (2013)

SIRE: A Social Image Retrieval Engine
by: HOI, Steven C. H., et al.
Published: (2011)

Selecting training samples from large and noisy corpora for efficient text classification
by: Wong, Daji
Published: (2011)

String Processing and Information Retrieval
Published: (2017)

Training deep network models for accurate recognition of texts in scene images
by: Chen, Pengfei
Published: (2021)

Chinese text segmentation for information retrieval
by: Li, Hui
Published: (2008)

Online multi-modal distance metric learning with application to image retrieval
by: WU, Pengcheng, et al.
Published: (2014)

Online multi-modal distance metric learning with application to image retrieval
by: WU, Pengcheng, et al.
Published: (2016)

Gated convolutional neural network for fine-grained automatic essay scoring
by: Lee, Xing Zhao
Published: (2017)