Moving-object detection, association, and selection in home videos

Due to the prevalence of digital video camcorders, home videos have become an important part of life-logs of personal experiences. To enable efficient video parsing, a critical step is to automatically extract objects, events and scene characteristics present in videos. This paper addresses the prob...

Full description

Saved in:
Bibliographic Details
Main Authors: PAN, Zailiang, NGO, Chong-wah
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6332
https://ink.library.smu.edu.sg/context/sis_research/article/7335/viewcontent/10.1.1.495.1914.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7335
record_format dspace
spelling sg-smu-ink.sis_research-73352021-11-23T04:55:45Z Moving-object detection, association, and selection in home videos PAN, Zailiang NGO, Chong-wah Due to the prevalence of digital video camcorders, home videos have become an important part of life-logs of personal experiences. To enable efficient video parsing, a critical step is to automatically extract objects, events and scene characteristics present in videos. This paper addresses the problem of extracting objects from home videos. Automatic detection of objects is a classical yet difficult vision problem, particularly for videos with complex scenes and unrestricted domains. Compared with edited and surveillant videos, home videos captured in uncontrolled environment are usually coupled with several notable features such as shaking artifacts, irregular motions, and arbitrary settings. These characteristics have actually prohibited the effective parsing of semantic video content using conventional vision analysis. In this paper, we propose a new approach to automatically locate multiple objects in home videos, by taking into account of how and when to initialize objects. Previous approaches mostly consider the problem of how but not when due to the efficiency or real-time requirements. In home-video indexing, online processing is optional. By considering when, some difficult problems can be alleviated, and most importantly, enlightens the possibility of parsing semantic video objects. In our proposed approach, the how part is formulated as an object detection and association problem, while the when part is a saliency measurement to determine the best few locations to start multiple object initialization. 2007-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6332 info:doi/10.1109/TMM.2006.887992 https://ink.library.smu.edu.sg/context/sis_research/article/7335/viewcontent/10.1.1.495.1914.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
Graphics and Human Computer Interfaces
spellingShingle Computer Sciences
Graphics and Human Computer Interfaces
PAN, Zailiang
NGO, Chong-wah
Moving-object detection, association, and selection in home videos
description Due to the prevalence of digital video camcorders, home videos have become an important part of life-logs of personal experiences. To enable efficient video parsing, a critical step is to automatically extract objects, events and scene characteristics present in videos. This paper addresses the problem of extracting objects from home videos. Automatic detection of objects is a classical yet difficult vision problem, particularly for videos with complex scenes and unrestricted domains. Compared with edited and surveillant videos, home videos captured in uncontrolled environment are usually coupled with several notable features such as shaking artifacts, irregular motions, and arbitrary settings. These characteristics have actually prohibited the effective parsing of semantic video content using conventional vision analysis. In this paper, we propose a new approach to automatically locate multiple objects in home videos, by taking into account of how and when to initialize objects. Previous approaches mostly consider the problem of how but not when due to the efficiency or real-time requirements. In home-video indexing, online processing is optional. By considering when, some difficult problems can be alleviated, and most importantly, enlightens the possibility of parsing semantic video objects. In our proposed approach, the how part is formulated as an object detection and association problem, while the when part is a saliency measurement to determine the best few locations to start multiple object initialization.
format text
author PAN, Zailiang
NGO, Chong-wah
author_facet PAN, Zailiang
NGO, Chong-wah
author_sort PAN, Zailiang
title Moving-object detection, association, and selection in home videos
title_short Moving-object detection, association, and selection in home videos
title_full Moving-object detection, association, and selection in home videos
title_fullStr Moving-object detection, association, and selection in home videos
title_full_unstemmed Moving-object detection, association, and selection in home videos
title_sort moving-object detection, association, and selection in home videos
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/6332
https://ink.library.smu.edu.sg/context/sis_research/article/7335/viewcontent/10.1.1.495.1914.pdf
_version_ 1770575936033390592