Key frame extraction from a big dataset

An autonomous vehicle is an automobile platform capable of sensing and reacting to its immediate environment in an attempt to eliminate the need for human drivers. In autonomous driving, taking decisions like overtaking a vehicle or defining a route requires environmental perception, localization, a...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Agarwal, Mehal
مؤلفون آخرون:	Guan Yong Liang
التنسيق:	Final Year Project
اللغة:	English
منشور في:	Nanyang Technological University 2021
الموضوعات:	Engineering::Electrical and electronic engineering
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/153862
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Nanyang Technological University
اللغة:	English

id	sg-ntu-dr.10356-153862
record_format	dspace
spelling	sg-ntu-dr.10356-1538622023-07-07T18:12:18Z Key frame extraction from a big dataset Agarwal, Mehal Guan Yong Liang School of Electrical and Electronic Engineering Continental Automotive Singapore EYLGuan@ntu.edu.sg Engineering::Electrical and electronic engineering An autonomous vehicle is an automobile platform capable of sensing and reacting to its immediate environment in an attempt to eliminate the need for human drivers. In autonomous driving, taking decisions like overtaking a vehicle or defining a route requires environmental perception, localization, and planning. In particular, object detection is one of the core modules in an autonomous vehicle as perception plays a central role in many tasks ranging from localization to obstacle avoidance and general motion planning. To enable multi-modal perception, autonomous vehicles use several on-board sensors such as cameras, radars, and lidars which provide data which are utilized by deep-learning based object detectors. A large, diverse and accurately labeled dataset is essential for perception tasks like object detection. However, the problem encountered is that the costs pertaining to human annotation of a large dataset is very expensive even for large companies and results in diminishing returns. This project aims to solve this problem by designing a keyframe filter package to extract keyframes from a big dataset and the extracted useful images are sent to annotators for manual labeling. The approach to perform keyframe extraction explored in this project uses heuristics to determine keyframes in a dataset by performing 2D multi-label tagging of images. The multi-label tagging of images is implemented using one of the state-of-the-art object detection frameworks, Faster Region-based Convolutional Neural Network (Faster-RCNN). This project proposes a novel addition to improve the Faster-RCNN model by including the visibility detection feature. The keyframe filter package permits the use of a subset of the raw data for annotation while optimizing the model performance and reducing the costs incurred. Bachelor of Engineering (Electrical and Electronic Engineering) 2021-12-15T12:35:19Z 2021-12-15T12:35:19Z 2021 Final Year Project (FYP) Agarwal, M. (2021). Key frame extraction from a big dataset. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153862 https://hdl.handle.net/10356/153862 en B3365-202 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Agarwal, Mehal Key frame extraction from a big dataset
description	An autonomous vehicle is an automobile platform capable of sensing and reacting to its immediate environment in an attempt to eliminate the need for human drivers. In autonomous driving, taking decisions like overtaking a vehicle or defining a route requires environmental perception, localization, and planning. In particular, object detection is one of the core modules in an autonomous vehicle as perception plays a central role in many tasks ranging from localization to obstacle avoidance and general motion planning. To enable multi-modal perception, autonomous vehicles use several on-board sensors such as cameras, radars, and lidars which provide data which are utilized by deep-learning based object detectors. A large, diverse and accurately labeled dataset is essential for perception tasks like object detection. However, the problem encountered is that the costs pertaining to human annotation of a large dataset is very expensive even for large companies and results in diminishing returns. This project aims to solve this problem by designing a keyframe filter package to extract keyframes from a big dataset and the extracted useful images are sent to annotators for manual labeling. The approach to perform keyframe extraction explored in this project uses heuristics to determine keyframes in a dataset by performing 2D multi-label tagging of images. The multi-label tagging of images is implemented using one of the state-of-the-art object detection frameworks, Faster Region-based Convolutional Neural Network (Faster-RCNN). This project proposes a novel addition to improve the Faster-RCNN model by including the visibility detection feature. The keyframe filter package permits the use of a subset of the raw data for annotation while optimizing the model performance and reducing the costs incurred.
author2	Guan Yong Liang
author_facet	Guan Yong Liang Agarwal, Mehal
format	Final Year Project
author	Agarwal, Mehal
author_sort	Agarwal, Mehal
title	Key frame extraction from a big dataset
title_short	Key frame extraction from a big dataset
title_full	Key frame extraction from a big dataset
title_fullStr	Key frame extraction from a big dataset
title_full_unstemmed	Key frame extraction from a big dataset
title_sort	key frame extraction from a big dataset
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/153862
_version_	1772825940538163200

Key frame extraction from a big dataset

مواد مشابهة