A novel deep learning approach for instance segmentation under rainy conditions

In recent years, artificial intelligence (AI) has become one of the hottest topics. In particular, with the rapid development of deep learning algorithm, object detection and image segmentation can be realized with high accuracy and high speed. These outcomes almost allow machine to possess their ow...

Full description

Saved in:
Bibliographic Details
Main Author: Du, Kaiwen
Other Authors: Soong Boon Hee
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141145
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141145
record_format dspace
spelling sg-ntu-dr.10356-1411452023-07-04T16:42:42Z A novel deep learning approach for instance segmentation under rainy conditions Du, Kaiwen Soong Boon Hee School of Electrical and Electronic Engineering EBHSOONG@ntu.edu.sg Engineering::Electrical and electronic engineering In recent years, artificial intelligence (AI) has become one of the hottest topics. In particular, with the rapid development of deep learning algorithm, object detection and image segmentation can be realized with high accuracy and high speed. These outcomes almost allow machine to possess their own “vision”. However, many approaches are affected under rainy conditions. The rain streaks are likely to leading to wrong classification and wrong location. In order to solve this problem, a common approach is to remove the rain streaks first, followed by the detection operation. Nevertheless, we wanted to pursue novel deep learning approaches, processing the images directly. We used two different approaches and focused on detecting the objects on the roads, such as cars, trucks, traffic signs and pedestrians. The first approach was to use Mask R-CNN framework, an excellent framework for the task of instance segmentation. The key features of this model are the mask prediction branch and RoIAlign. We did lots of hand annotations for our rainy image dataset, and then used it as the training dataset. The experimental results were really excellent. This model did the correct classification with high confidence level and generated fine masks. My Final Year Project (FYP) was also about the detection under rainy conditions. In my FYP, this model removed the rain streaks from the image first, followed by the detection. Even though the operation of de-rain improved the accuracy, the final confidence level was merely 75% approximately. By contrast, the confidence level of our first approach was 99% approximately. In addition, our second approach was to use PayAttention network model with the attention mechanism, which means that the relevant objects would be assigned the bright color in the final output. The key feature of this model is the three estimators, which combine the local feature vectors with the global feature vector. The estimators can generate three different level attention maps. Even though the masks generated by the second approach were coarser than those generated by the first approach, the second approach did not require a lot of hand annotations, which could save much time. We wish that more researches about the models can be done in the future, aiming to further improve them. For example, if some relevant coordinates of the attention maps generated by PayAttention network model can be obtained, they can be used as the training dataset of Mask-RCNN model. As a result, the accuracy can be kept high without lots of hand annotations. Master of Science (Computer Control and Automation) 2020-06-04T07:06:01Z 2020-06-04T07:06:01Z 2020 Thesis-Master by Coursework https://hdl.handle.net/10356/141145 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Du, Kaiwen
A novel deep learning approach for instance segmentation under rainy conditions
description In recent years, artificial intelligence (AI) has become one of the hottest topics. In particular, with the rapid development of deep learning algorithm, object detection and image segmentation can be realized with high accuracy and high speed. These outcomes almost allow machine to possess their own “vision”. However, many approaches are affected under rainy conditions. The rain streaks are likely to leading to wrong classification and wrong location. In order to solve this problem, a common approach is to remove the rain streaks first, followed by the detection operation. Nevertheless, we wanted to pursue novel deep learning approaches, processing the images directly. We used two different approaches and focused on detecting the objects on the roads, such as cars, trucks, traffic signs and pedestrians. The first approach was to use Mask R-CNN framework, an excellent framework for the task of instance segmentation. The key features of this model are the mask prediction branch and RoIAlign. We did lots of hand annotations for our rainy image dataset, and then used it as the training dataset. The experimental results were really excellent. This model did the correct classification with high confidence level and generated fine masks. My Final Year Project (FYP) was also about the detection under rainy conditions. In my FYP, this model removed the rain streaks from the image first, followed by the detection. Even though the operation of de-rain improved the accuracy, the final confidence level was merely 75% approximately. By contrast, the confidence level of our first approach was 99% approximately. In addition, our second approach was to use PayAttention network model with the attention mechanism, which means that the relevant objects would be assigned the bright color in the final output. The key feature of this model is the three estimators, which combine the local feature vectors with the global feature vector. The estimators can generate three different level attention maps. Even though the masks generated by the second approach were coarser than those generated by the first approach, the second approach did not require a lot of hand annotations, which could save much time. We wish that more researches about the models can be done in the future, aiming to further improve them. For example, if some relevant coordinates of the attention maps generated by PayAttention network model can be obtained, they can be used as the training dataset of Mask-RCNN model. As a result, the accuracy can be kept high without lots of hand annotations.
author2 Soong Boon Hee
author_facet Soong Boon Hee
Du, Kaiwen
format Thesis-Master by Coursework
author Du, Kaiwen
author_sort Du, Kaiwen
title A novel deep learning approach for instance segmentation under rainy conditions
title_short A novel deep learning approach for instance segmentation under rainy conditions
title_full A novel deep learning approach for instance segmentation under rainy conditions
title_fullStr A novel deep learning approach for instance segmentation under rainy conditions
title_full_unstemmed A novel deep learning approach for instance segmentation under rainy conditions
title_sort novel deep learning approach for instance segmentation under rainy conditions
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/141145
_version_ 1772825687532503040