Study of fine-tuning the pre-trained deep convolutional neural networks for image recognition

In this era, machine learning and deep learning has become very ubiquitous and dominant in our society and it is starting to ingrain itself in our day to day lives whether we realise it or not. From the emergence of smartphones, to smart TVs and smart watches, all the small everyday items have been...

Full description

Saved in:
Bibliographic Details
Main Author: Nur Azila Azman
Other Authors: Jiang Xudong
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/145298
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In this era, machine learning and deep learning has become very ubiquitous and dominant in our society and it is starting to ingrain itself in our day to day lives whether we realise it or not. From the emergence of smartphones, to smart TVs and smart watches, all the small everyday items have been utilizing a certain kind of artificial intelligence that is easily overlooked as just technology. In reality, the technological sphere is vastly broad and AI is only the tip of an iceberg. Deep Learning is a branch of AI that is growing at an accelerating rate in the tech industry. In this paper, we will be riding on the trends of training a Convolutional Neural Network (CNN), more specifically, we will be focusing our premise on a single pre-trained network called MobileNet. MobileNet is a very popular, robust and lightweight pre-trained model that is available in Keras. We aim to study and understand the parameters that will allow us to increase the accuracy of the pretrained model, MobileNet, through a process called ‘fine-tuning’. We hope to be able to produce and infer from our experiments whereby we confirm if or not these parameters do affect the accuracy of the output model and if so, the degree of significance that a parameter holds in affecting the models’ accuracy. Furthermore, we will be learning how to prepare and process the dataset and samples collected to aid in our study. Our experiments were able to show that fine-tuning by removing the last 5 layers of the pretrained model and retraining them yielded that best result with an accuracy of 99%. We also measured that by increasing the learning rate by tenfold and increasing the trainable layers to 20 on two separate experiments whilst keeping other parameters constant; both yielded a poor performance of similar accuracy, approximately 56% .