Multimodal fashion knowledge extraction as captioning

Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which ty...

Full description

Saved in:

Bibliographic Details
Main Authors:	YUAN, Yifei, ZHANG, Wenxuan, DENG, Yang, LAM, Wai
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Fashion industry Fashion knowledge extraction Knowledge extraction Multi-modal Multi-modal data Multimodal data mining Product images Rich texts Social media Social media analysis Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/9170 https://ink.library.smu.edu.sg/context/sis_research/article/10173/viewcontent/MultimodalFashionK_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10173
record_format	dspace
spelling	sg-smu-ink.sis_research-101732024-10-30T03:21:48Z Multimodal fashion knowledge extraction as captioning YUAN, Yifei ZHANG, Wenxuan DENG, Yang LAM, Wai Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which typically consists of the occasion, person attributes, and fashion item information, can be effectively represented as a set of tuples. Most previous studies on fashion knowledge extraction are based on the fashion product images without considering the rich text information in social media posts. Existing work on fashion knowledge extraction in social media is classification-based and requires to manually determine a set of fashion knowledge categories in advance. In our work, we propose to cast the task as a captioning problem to capture the interplay of the multimodal post information. Specifically, we transform the fashion knowledge tuples into a natural language caption with a sentence transformation method. Our framework then aims to generate the sentence-based fashion knowledge directly from the social media post. Inspired by the big success of pre-trained models, we build our model based on a multimodal pre-trained generative model and design several auxiliary tasks for enhancing the knowledge extraction. Since there is no existing dataset which can be directly borrowed to our task, we introduce a dataset consisting of social media posts with manual fashion knowledge annotation. Extensive experiments are conducted to demonstrate the effectiveness of our model. 2023-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9170 info:doi/10.1145/3624918.3625315 https://ink.library.smu.edu.sg/context/sis_research/article/10173/viewcontent/MultimodalFashionK_av.pdf http://creativecommons.org/licenses/by/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Fashion industry Fashion knowledge extraction Knowledge extraction Multi-modal Multi-modal data Multimodal data mining Product images Rich texts Social media Social media analysis Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Fashion industry Fashion knowledge extraction Knowledge extraction Multi-modal Multi-modal data Multimodal data mining Product images Rich texts Social media Social media analysis Databases and Information Systems
spellingShingle	Fashion industry Fashion knowledge extraction Knowledge extraction Multi-modal Multi-modal data Multimodal data mining Product images Rich texts Social media Social media analysis Databases and Information Systems YUAN, Yifei ZHANG, Wenxuan DENG, Yang LAM, Wai Multimodal fashion knowledge extraction as captioning
description	Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which typically consists of the occasion, person attributes, and fashion item information, can be effectively represented as a set of tuples. Most previous studies on fashion knowledge extraction are based on the fashion product images without considering the rich text information in social media posts. Existing work on fashion knowledge extraction in social media is classification-based and requires to manually determine a set of fashion knowledge categories in advance. In our work, we propose to cast the task as a captioning problem to capture the interplay of the multimodal post information. Specifically, we transform the fashion knowledge tuples into a natural language caption with a sentence transformation method. Our framework then aims to generate the sentence-based fashion knowledge directly from the social media post. Inspired by the big success of pre-trained models, we build our model based on a multimodal pre-trained generative model and design several auxiliary tasks for enhancing the knowledge extraction. Since there is no existing dataset which can be directly borrowed to our task, we introduce a dataset consisting of social media posts with manual fashion knowledge annotation. Extensive experiments are conducted to demonstrate the effectiveness of our model.
format	text
author	YUAN, Yifei ZHANG, Wenxuan DENG, Yang LAM, Wai
author_facet	YUAN, Yifei ZHANG, Wenxuan DENG, Yang LAM, Wai
author_sort	YUAN, Yifei
title	Multimodal fashion knowledge extraction as captioning
title_short	Multimodal fashion knowledge extraction as captioning
title_full	Multimodal fashion knowledge extraction as captioning
title_fullStr	Multimodal fashion knowledge extraction as captioning
title_full_unstemmed	Multimodal fashion knowledge extraction as captioning
title_sort	multimodal fashion knowledge extraction as captioning
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/sis_research/9170 https://ink.library.smu.edu.sg/context/sis_research/article/10173/viewcontent/MultimodalFashionK_av.pdf
_version_	1814777823925108736

Multimodal fashion knowledge extraction as captioning

Similar Items