Online multimodal co-indexing and retrieval of social media data

Effective indexing of social media data is key to searching for information on the social Web. However, the characteristics of social media data make it a challenging task. The large-scale and streaming nature is the first challenge, which requires the indexing algorithm to be able to efficiently up...

Full description

Saved in:
Bibliographic Details
Main Authors: MENG, Lei, TAN, Ah-hwee, WUNSCH, Donald C.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9809
https://ink.library.smu.edu.sg/context/sis_research/article/10809/viewcontent/Online_Multimodal.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10809
record_format dspace
spelling sg-smu-ink.sis_research-108092024-12-18T05:14:14Z Online multimodal co-indexing and retrieval of social media data MENG, Lei TAN, Ah-hwee WUNSCH, Donald C. Effective indexing of social media data is key to searching for information on the social Web. However, the characteristics of social media data make it a challenging task. The large-scale and streaming nature is the first challenge, which requires the indexing algorithm to be able to efficiently update the indexing structure when receiving data streams. The second challenge is utilizing the rich meta-information of social media data for a better evaluation of the similarity between data objects and for a more semantically meaningful indexing of the data, which may allow the users to search for them using the different types of queries they like. Existing approaches based on either matrix operations or hashing usually cannot perform an online update of the indexing base to encode upcoming data streams, and they have difficulty handling noisy data. This chapter presents a study on using the Online Multimodal Co-indexing Adaptive Resonance Theory (OMC-ART) for an effective and efficient indexing and retrieval of social media data. More specifically, two types of social media data are considered: (1) the weakly supervised image data, which is associated with captions, tags and descriptions given by the users; and (2) the e-commerce product data, which includes product images, titles, descriptions and user comments. These scenarios make this study related to multimodal web image indexing and retrieval. Compared with existing studies, OMC-ARTonline multimodal co-indexing adaptive resonance theory has several distinct characteristics. First, OMC-ART is able to perform online learning of sequential data. Second, instead of a plain indexing structure, OMC-ART builds a two-layer one, in which the first layer co-indexes the images by the key visual and textual features based on the generalized distributions of the clusters they belong to; while in the second layer, the data objects are co-indexed by their own feature distributions. Third, OMC-ART enables flexible multimodal searching by using either visual features, keywords, or a combination of both. Fourth, OMC-ART employs a ranking algorithm that does not need to go through the whole indexing system when only a limited number of images need to be retrieved. Experiments on two publicly accessible image datasets and a real-world e-commerce dataset demonstrate the efficiency and effectiveness of OMC-ART. 2019-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9809 info:doi/10.1007/978-3-030-02985-2_7 https://ink.library.smu.edu.sg/context/sis_research/article/10809/viewcontent/Online_Multimodal.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Social Media Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Social Media
Theory and Algorithms
spellingShingle Databases and Information Systems
Social Media
Theory and Algorithms
MENG, Lei
TAN, Ah-hwee
WUNSCH, Donald C.
Online multimodal co-indexing and retrieval of social media data
description Effective indexing of social media data is key to searching for information on the social Web. However, the characteristics of social media data make it a challenging task. The large-scale and streaming nature is the first challenge, which requires the indexing algorithm to be able to efficiently update the indexing structure when receiving data streams. The second challenge is utilizing the rich meta-information of social media data for a better evaluation of the similarity between data objects and for a more semantically meaningful indexing of the data, which may allow the users to search for them using the different types of queries they like. Existing approaches based on either matrix operations or hashing usually cannot perform an online update of the indexing base to encode upcoming data streams, and they have difficulty handling noisy data. This chapter presents a study on using the Online Multimodal Co-indexing Adaptive Resonance Theory (OMC-ART) for an effective and efficient indexing and retrieval of social media data. More specifically, two types of social media data are considered: (1) the weakly supervised image data, which is associated with captions, tags and descriptions given by the users; and (2) the e-commerce product data, which includes product images, titles, descriptions and user comments. These scenarios make this study related to multimodal web image indexing and retrieval. Compared with existing studies, OMC-ARTonline multimodal co-indexing adaptive resonance theory has several distinct characteristics. First, OMC-ART is able to perform online learning of sequential data. Second, instead of a plain indexing structure, OMC-ART builds a two-layer one, in which the first layer co-indexes the images by the key visual and textual features based on the generalized distributions of the clusters they belong to; while in the second layer, the data objects are co-indexed by their own feature distributions. Third, OMC-ART enables flexible multimodal searching by using either visual features, keywords, or a combination of both. Fourth, OMC-ART employs a ranking algorithm that does not need to go through the whole indexing system when only a limited number of images need to be retrieved. Experiments on two publicly accessible image datasets and a real-world e-commerce dataset demonstrate the efficiency and effectiveness of OMC-ART.
format text
author MENG, Lei
TAN, Ah-hwee
WUNSCH, Donald C.
author_facet MENG, Lei
TAN, Ah-hwee
WUNSCH, Donald C.
author_sort MENG, Lei
title Online multimodal co-indexing and retrieval of social media data
title_short Online multimodal co-indexing and retrieval of social media data
title_full Online multimodal co-indexing and retrieval of social media data
title_fullStr Online multimodal co-indexing and retrieval of social media data
title_full_unstemmed Online multimodal co-indexing and retrieval of social media data
title_sort online multimodal co-indexing and retrieval of social media data
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/9809
https://ink.library.smu.edu.sg/context/sis_research/article/10809/viewcontent/Online_Multimodal.pdf
_version_ 1819113151219630080