Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts
An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in asse...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2007
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6380 https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7383 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-73832021-11-23T02:42:48Z Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in assessing the novelty and redundancy among them. In this paper, we explore the novelty and redundancy detection with visual duplicates and speech transcripts for cross-lingual news stories. News stories are represented by a sequence of keyframes in the visual track and a set of words extracted from speech transcript in the audio track. A major difference to pure text documents is that the number of keyframes in one story is relatively small compared to the number of words and there exist a large number of non-near-duplicate keyframes. These features make the behavior of similarity measures different compared to traditional textual collections. Furthermore, the textual features and visual features complement each other for news stories. They can be further combined to boost the performance. Experiments on the TRECVID-2005 cross-lingual news video corpus show that approaches on textual features and visual features demonstrate different performance, and measures on visual features are quite effective. Overall, the cosine distance on keyframes is still a robust measure. Language models built on visual features demonstrate promising performance. The fusion of textual and visual features improves overall performance. 2007-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6380 info:doi/10.1145/1291233.1291274 https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Cross-lingual information retrieval Language model Multimodality Near-duplicate keyframes News videos Novelty and redundancy detection Similarity measure Programming Languages and Compilers Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Cross-lingual information retrieval Language model Multimodality Near-duplicate keyframes News videos Novelty and redundancy detection Similarity measure Programming Languages and Compilers Theory and Algorithms |
spellingShingle |
Cross-lingual information retrieval Language model Multimodality Near-duplicate keyframes News videos Novelty and redundancy detection Similarity measure Programming Languages and Compilers Theory and Algorithms WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
description |
An overwhelming volume of news videos from different channels and languages is available today, which demands automatic management of this abundant information. To effectively search, retrieve, browse and track cross-lingual news stories, a news story similarity measure plays a critical role in assessing the novelty and redundancy among them. In this paper, we explore the novelty and redundancy detection with visual duplicates and speech transcripts for cross-lingual news stories. News stories are represented by a sequence of keyframes in the visual track and a set of words extracted from speech transcript in the audio track. A major difference to pure text documents is that the number of keyframes in one story is relatively small compared to the number of words and there exist a large number of non-near-duplicate keyframes. These features make the behavior of similarity measures different compared to traditional textual collections. Furthermore, the textual features and visual features complement each other for news stories. They can be further combined to boost the performance. Experiments on the TRECVID-2005 cross-lingual news video corpus show that approaches on textual features and visual features demonstrate different performance, and measures on visual features are quite effective. Overall, the cosine distance on keyframes is still a robust measure. Language models built on visual features demonstrate promising performance. The fusion of textual and visual features improves overall performance. |
format |
text |
author |
WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah |
author_facet |
WU, Xiao HAUPTMANN, Alexander G. NGO, Chong-wah |
author_sort |
WU, Xiao |
title |
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
title_short |
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
title_full |
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
title_fullStr |
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
title_full_unstemmed |
Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
title_sort |
novelty detection for cross-lingual news stories with visual duplicates and speech transcripts |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2007 |
url |
https://ink.library.smu.edu.sg/sis_research/6380 https://ink.library.smu.edu.sg/context/sis_research/article/7383/viewcontent/1291233.1291274.pdf |
_version_ |
1770575945248276480 |