Automatic generation of keywords from images

Every year, there are approximately 700 movies released into cinema. As a result, movies contribute a large portion of database in the entertainment industries. While movie genre is an important tool for film industries, simply classifying huge collection of movies without automation is not an easy...

Full description

Saved in:
Bibliographic Details
Main Author: Ho, Galvin Yuan Hao
Other Authors: Teoh Eam Khwang
Format: Final Year Project
Language:English
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10356/67656
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-67656
record_format dspace
spelling sg-ntu-dr.10356-676562023-07-07T17:14:39Z Automatic generation of keywords from images Ho, Galvin Yuan Hao Teoh Eam Khwang School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering Every year, there are approximately 700 movies released into cinema. As a result, movies contribute a large portion of database in the entertainment industries. While movie genre is an important tool for film industries, simply classifying huge collection of movies without automation is not an easy task. A commonly used method by film industries is classifying genre based on movie synopsis. This approach is considered to be semi-automatic as it requires manual process of collecting synopsis from movie viewers. While this method yields a decent performance of 64.34%, 54.42% and 79.77% for Recall, Precision and Accuracy respectively, it suffers from the need of manual work and may potentially prone to human error. Hence there is a demand for fully automation to speed up classifying process and eliminate human error. This report presents MovieNet, a designed framework for automatic movie genre classification. MovieNet preprocess raw movie film into image frame via scene change detection. Image classifier, consist of image captioning and object recognition is used to give caption and labels to every single image frame. These sentences are combined together simulating a movie synopsis which is later convert into feature of vector using bag-of-words. Classifier such as SVM and ELM are used for classifying the features to predict genres. The robustness of MovieNet was tested with 396 movies, consisting of 10 popular movie genres which are Action, Animated, Comedy, Crime, Epic, Horror, Romance, Science Fiction, War and Western. Results show MovieNet yield a 57.57%, 51.26% and 76.04% for Recall, Precision and Accuracy respectively. Even though it is slightly underperformed to first approach, it enjoys the benefit of automation and human free error. Final studies also shows that MovieNet is capable of recommending and identify existing human error made in current database on genre. Possible recommendations such as retraining image classifier, implement spatial localization or detection, noise removal, extracting closed caption or subtitle methods and exploiting audio features are consider to be feasible and beneficial to the current MovieNet framework in improving its performance significantly. Bachelor of Engineering 2016-05-19T02:19:54Z 2016-05-19T02:19:54Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/67656 en Nanyang Technological University 145 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Ho, Galvin Yuan Hao
Automatic generation of keywords from images
description Every year, there are approximately 700 movies released into cinema. As a result, movies contribute a large portion of database in the entertainment industries. While movie genre is an important tool for film industries, simply classifying huge collection of movies without automation is not an easy task. A commonly used method by film industries is classifying genre based on movie synopsis. This approach is considered to be semi-automatic as it requires manual process of collecting synopsis from movie viewers. While this method yields a decent performance of 64.34%, 54.42% and 79.77% for Recall, Precision and Accuracy respectively, it suffers from the need of manual work and may potentially prone to human error. Hence there is a demand for fully automation to speed up classifying process and eliminate human error. This report presents MovieNet, a designed framework for automatic movie genre classification. MovieNet preprocess raw movie film into image frame via scene change detection. Image classifier, consist of image captioning and object recognition is used to give caption and labels to every single image frame. These sentences are combined together simulating a movie synopsis which is later convert into feature of vector using bag-of-words. Classifier such as SVM and ELM are used for classifying the features to predict genres. The robustness of MovieNet was tested with 396 movies, consisting of 10 popular movie genres which are Action, Animated, Comedy, Crime, Epic, Horror, Romance, Science Fiction, War and Western. Results show MovieNet yield a 57.57%, 51.26% and 76.04% for Recall, Precision and Accuracy respectively. Even though it is slightly underperformed to first approach, it enjoys the benefit of automation and human free error. Final studies also shows that MovieNet is capable of recommending and identify existing human error made in current database on genre. Possible recommendations such as retraining image classifier, implement spatial localization or detection, noise removal, extracting closed caption or subtitle methods and exploiting audio features are consider to be feasible and beneficial to the current MovieNet framework in improving its performance significantly.
author2 Teoh Eam Khwang
author_facet Teoh Eam Khwang
Ho, Galvin Yuan Hao
format Final Year Project
author Ho, Galvin Yuan Hao
author_sort Ho, Galvin Yuan Hao
title Automatic generation of keywords from images
title_short Automatic generation of keywords from images
title_full Automatic generation of keywords from images
title_fullStr Automatic generation of keywords from images
title_full_unstemmed Automatic generation of keywords from images
title_sort automatic generation of keywords from images
publishDate 2016
url http://hdl.handle.net/10356/67656
_version_ 1772826106755284992