Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts

An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in asse...

Full description

Saved in:
Bibliographic Details
Main Authors: WU, Xiao, HAUPTMANN, Alexander G., NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6380
https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7383
record_format dspace
spelling sg-smu-ink.sis_research-73832021-11-23T02:42:48Z Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in assessing the novelty and redundancy among them. In this paper, we explore the novelty and redundancy detection with visual duplicates and speech transcripts for cross-lingual news stories. News stories are represented by a sequence of keyframes in the visual track and a set of words extracted from speech transcript in the audio track. A major difference to pure text documents is that the number of keyframes in one story is relatively small compared to the number of words and there exist a large number of non-near-duplicate keyframes. These features make the behavior of similarity measures different compared to traditional textual collections. Furthermore, the textual features and visual features complement each other for news stories. They can be further combined to boost the performance. Experiments on the TRECVID-2005 cross-lingual news video corpus show that approaches on textual features and visual features demonstrate different performance, and measures on visual features are quite effective. Overall, the cosine distance on keyframes is still a robust measure. Language models built on visual features demonstrate promising performance. The fusion of textual and visual features improves overall performance. 2007-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6380 info:doi/10.1145/1291233.1291274 https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Cross-lingual information retrieval Language model Multimodality Near-duplicate keyframes News videos Novelty and redundancy detection Similarity measure Programming Languages and Compilers Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Cross-lingual information retrieval
Language model
Multimodality
Near-duplicate keyframes
News videos
Novelty and redundancy detection
Similarity measure
Programming Languages and Compilers
Theory and Algorithms
spellingShingle Cross-lingual information retrieval
Language model
Multimodality
Near-duplicate keyframes
News videos
Novelty and redundancy detection
Similarity measure
Programming Languages and Compilers
Theory and Algorithms
WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
description An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in assessing the novelty and redundancy among them. In this paper, we explore the novelty and redundancy detection with visual duplicates and speech transcripts for cross-lingual news stories. News stories are represented by a sequence of keyframes in the visual track and a set of words extracted from speech transcript in the audio track. A major difference to pure text documents is that the number of keyframes in one story is relatively small compared to the number of words and there exist a large number of non-near-duplicate keyframes. These features make the behavior of similarity measures different compared to traditional textual collections. Furthermore, the textual features and visual features complement each other for news stories. They can be further combined to boost the performance. Experiments on the TRECVID-2005 cross-lingual news video corpus show that approaches on textual features and visual features demonstrate different performance, and measures on visual features are quite effective. Overall, the cosine distance on keyframes is still a robust measure. Language models built on visual features demonstrate promising performance. The fusion of textual and visual features improves overall performance.
format text
author WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_facet WU, Xiao
HAUPTMANN, Alexander G.
NGO, Chong-wah
author_sort WU, Xiao
title Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
title_short Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
title_full Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
title_fullStr Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
title_full_unstemmed Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
title_sort novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/6380
https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf
_version_ 1770575945248276480