Image aesthetic style classification and region detection using Convolutional Neural Network

Convolutional Neural Network (CNN) becomes popular in recent years, especially in the field of image processing. This algorithm has been successfully applied on object image classification, object detection, video analysis and so on with good results. Due to good feature extraction performance of CN...

Full description

Saved in:
Bibliographic Details
Main Author: Xue, Chuhui
Other Authors: Chia Liang Tien
Format: Final Year Project
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/70171
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Convolutional Neural Network (CNN) becomes popular in recent years, especially in the field of image processing. This algorithm has been successfully applied on object image classification, object detection, video analysis and so on with good results. Due to good feature extraction performance of CNN, research on automatically aesthetic analysis of images by deep learning has started. However, previous work for image aesthetic analysis like [5] are mainly about image aesthetic rating or image aesthetic binary classification. Therefore, our project aims at learning the image aesthetic styles using CNN as well as generating the bounding box of region for corresponding styles. This project comprises of two main parts, which are image aesthetic style classification and image aesthetic style region detection. We firstly build the network based on [5] and train an image aesthetic style classification model on AVA Dataset [4] with some selected style classes after data cleaning. By using this pre-trained model, we then apply Faster R-CNN [1] algorithm on image aesthetic style region detection. This is implemented by firstly manually labeling image aesthetic style region in selected images in AVA Dataset, building corresponding Region Proposal Network and Fast R-CNN Network [1] based on RAPID Network [5] and training on these labeled images with pre-trained image aesthetic style classification model.