Learning decoupled models for cross-modal generation
Cross-modal generation plays an important role in translating information between different data modalities, such as image, video, and text. Two representative tasks under the cross-modal generation umbrella are visual-to-text generation and text-to-visual generation. For the visual-to-text gene...
Main Author: Wang, Hao
Other Authors: Miao, Chun Yan
Format: Thesis (Doctor of Philosophy)
Language: English
Published: Nanyang Technological University, 2023
Online Access: https://hdl.handle.net/10356/169609
Institution: Nanyang Technological University
Similar Items
- A decoupled learning framework for contrastive learning
  by: Xu, Yicheng
  Published: (2022)
- Cross-modal graph with meta concepts for video captioning
  by: Wang, Hao, et al.
  Published: (2022)
- Paired cross-modal data augmentation for fine-grained image-to-text retrieval
  by: Wang, Hao, et al.
  Published: (2023)
- Audio captioning and retrieval with improved cross-modal objectives
  by: Koh, Andrew Jin Jie
  Published: (2023)
- JALAD: joint accuracy- and latency-aware deep structure decoupling for edge-cloud execution
  by: Li, Hongshan, et al.
  Published: (2020)