Multimedia information fusion.
Information in the ubiquitous media age is typically fragmented and appears in various unstructured and unlabelled forms as data, text, image, audio, and video. For transforming raw information content into knowledge, there is a need to develop various cross-media and media-specific technologies fo...
Main Author: | Woon, Kia Yan. |
---|---|
Other Authors: | Tan Ah Hwee |
Format: | Theses and Dissertations |
Language: | English |
Published: | 2010 |
Subjects: | DRNTU::Social sciences::Mass media |
Online Access: | http://hdl.handle.net/10356/41506 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-41506 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-41506 2019-12-10T12:03:18Z Multimedia information fusion. Woon, Kia Yan. Tan Ah Hwee Wee Kim Wee School of Communication and Information DRNTU::Social sciences::Mass media Information in the ubiquitous media age is typically fragmented and appears in various unstructured and unlabelled forms as data, text, image, audio, and video. For transforming raw information content into knowledge, there is a need to develop various cross-media and media-specific technologies for modeling and working with text, audio, image, and video data, as well as for their unification and association at the semantic level. As part of the research endeavor of the I2R-SCE, NTU joint project, "Intelligent Technologies for Media Analysis, Representation and Fusion (Intelligent Media)", this dissertation aims to contribute techniques for information fusion. Following a thorough review of the literature on related work, this dissertation presents a self-organizing network model known as fusion Adaptive Resonance Theory (fusion ART) for the fusion of multimedia information. By synchronizing the encoding of information across multiple media channels, the fusion ART model generates clusters that encode the associative mappings among multimedia information in a real-time and continuous manner. The fusion ART's functionalities are illustrated through experiments on two multimedia data sets, namely the terrorist domain data set and the Corel data set. The experiments using the terrorist domain data set demonstrate that, by incorporating a semantic category channel, fusion ART further enables multimedia information to be fused into predefined themes or semantic categories. In the experiments using the Corel data set, the results suggest the viability of the proposed approach in comparison with prior work in image annotation, image classification and image-text fusion.
Master of Science (Information Studies) 2010-07-16T01:29:29Z 2010-07-16T01:29:29Z 2008 2008 Thesis http://hdl.handle.net/10356/41506 en Nanyang Technological University 79 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Social sciences::Mass media |
spellingShingle |
DRNTU::Social sciences::Mass media Woon, Kia Yan. Multimedia information fusion. |
description |
Information in the ubiquitous media age is typically fragmented and appears in various unstructured and unlabelled forms as data, text, image, audio, and video. For transforming raw information content into knowledge, there is a need to develop various cross-media and media-specific technologies for modeling and working with text, audio, image, and video data, as well as for their unification and association at the semantic level. As part of the research endeavor of the I2R-SCE, NTU joint project, "Intelligent Technologies for Media Analysis, Representation and Fusion (Intelligent Media)", this dissertation aims to contribute techniques for information fusion. Following a thorough review of the literature on related work, this dissertation presents a self-organizing network model known as fusion Adaptive Resonance Theory (fusion ART) for the fusion of multimedia information. By synchronizing the encoding of information across multiple media channels, the fusion ART model generates clusters that encode the associative mappings among multimedia information in a real-time and continuous manner. The fusion ART's functionalities are illustrated through experiments on two multimedia data sets, namely the terrorist domain data set and the Corel data set. The experiments using the terrorist domain data set demonstrate that, by incorporating a semantic category channel, fusion ART further enables multimedia information to be fused into predefined themes or semantic categories. In the experiments using the Corel data set, the results suggest the viability of the proposed approach in comparison with prior work in image annotation, image classification and image-text fusion. |
author2 |
Tan Ah Hwee |
author_facet |
Tan Ah Hwee; Woon, Kia Yan. |
format |
Theses and Dissertations |
author |
Woon, Kia Yan. |
author_sort |
Woon, Kia Yan. |
title |
Multimedia information fusion. |
title_sort |
multimedia information fusion. |
publishDate |
2010 |
url |
http://hdl.handle.net/10356/41506 |
_version_ |
1681045723312816128 |
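
The description in this record says that fusion ART synchronizes the encoding of information across multiple media channels and generates clusters that associate them. The following is a minimal sketch of that general idea, not the dissertation's implementation: it assumes the usual fuzzy ART operations (element-wise minimum, a per-channel vigilance test, complement-coded inputs in [0, 1]). The class name `FusionARTSketch`, the helper `complement_code`, and all parameter names and default values are hypothetical and chosen only for illustration.

```python
import numpy as np


def complement_code(v):
    """Complement coding [v, 1 - v]; assumes every entry of v lies in [0, 1]."""
    v = np.asarray(v, dtype=float)
    return np.concatenate([v, 1.0 - v])


class FusionARTSketch:
    """Toy multi-channel ART clusterer; names and defaults are illustrative."""

    def __init__(self, rho, gamma=None, alpha=0.01, beta=1.0):
        self.rho = list(rho)            # one vigilance threshold per channel
        k = len(self.rho)
        # Channel contribution weights; equal weighting is an assumption.
        self.gamma = list(gamma) if gamma else [1.0 / k] * k
        self.alpha = alpha              # small choice parameter
        self.beta = beta                # learning rate (1.0 = fast learning)
        self.nodes = []                 # each node holds one weight vector per channel

    def _choice(self, xs, ws):
        # Weighted sum over channels of |x AND w| / (alpha + |w|).
        return sum(g * np.minimum(x, w).sum() / (self.alpha + w.sum())
                   for g, x, w in zip(self.gamma, xs, ws))

    def _resonates(self, xs, ws):
        # Every channel must pass its own vigilance test.
        # x.sum() is nonzero because inputs are complement coded.
        return all(np.minimum(x, w).sum() / x.sum() >= r
                   for x, w, r in zip(xs, ws, self.rho))

    def learn(self, xs):
        """Present one input per channel; return the index of the cluster used."""
        xs = [np.asarray(x, dtype=float) for x in xs]
        # Try existing clusters from highest to lowest choice value.
        order = sorted(range(len(self.nodes)),
                       key=lambda j: self._choice(xs, self.nodes[j]),
                       reverse=True)
        for j in order:
            ws = self.nodes[j]
            if self._resonates(xs, ws):
                # Move each channel's weights toward the fuzzy AND of input and weights.
                self.nodes[j] = [(1 - self.beta) * w + self.beta * np.minimum(x, w)
                                 for x, w in zip(xs, ws)]
                return j
        # No cluster matched on all channels: commit a new one.
        self.nodes.append([x.copy() for x in xs])
        return len(self.nodes) - 1


# Usage: two channels (made-up "image" and "text" feature vectors).
net = FusionARTSketch(rho=[0.8, 0.8])
first = net.learn([complement_code([0.9, 0.1]), complement_code([1, 0])])
second = net.learn([complement_code([0.88, 0.12]), complement_code([1, 0])])
third = net.learn([complement_code([0.1, 0.9]), complement_code([0, 1])])
```

In this sketch a cluster is reused only when every channel passes its vigilance test, which is one simple way to tie the channels together; the feature vectors in the usage lines are invented and do not come from the data sets named in the record.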