A novel deep learning approach for instance segmentation under rainy conditions
In recent years, artificial intelligence (AI) has become one of the hottest topics. In particular, with the rapid development of deep learning algorithms, object detection and image segmentation can be realized with high accuracy and high speed. These outcomes almost allow machines to possess their ow...
Main Author: | Du, Kaiwen |
---|---|
Other Authors: | Soong Boon Hee |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: | Nanyang Technological University 2020 |
Subjects: | Engineering::Electrical and electronic engineering |
Online Access: | https://hdl.handle.net/10356/141145 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-141145 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-141145 2023-07-04T16:42:42Z A novel deep learning approach for instance segmentation under rainy conditions Du, Kaiwen Soong Boon Hee School of Electrical and Electronic Engineering EBHSOONG@ntu.edu.sg Engineering::Electrical and electronic engineering Master of Science (Computer Control and Automation) 2020-06-04T07:06:01Z 2020-06-04T07:06:01Z 2020 Thesis-Master by Coursework https://hdl.handle.net/10356/141145 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Du, Kaiwen A novel deep learning approach for instance segmentation under rainy conditions |
description |
In recent years, artificial intelligence (AI) has become one of the hottest topics. In particular, with the rapid development of deep learning algorithms, object detection and image segmentation can be performed with high accuracy and high speed. These advances come close to giving machines their own “vision”. However, many approaches degrade under rainy conditions: rain streaks are likely to cause incorrect classification and localization.
To solve this problem, a common approach is to remove the rain streaks first and then run detection. Instead, we pursued novel deep learning approaches that process the rainy images directly. We used two different approaches and focused on detecting objects on the road, such as cars, trucks, traffic signs and pedestrians. The first approach used the Mask R-CNN framework, an excellent framework for instance segmentation whose key features are the mask prediction branch and RoIAlign. We hand-annotated our rainy image dataset extensively and used it as the training dataset. The experimental results were excellent: the model classified objects correctly with a high confidence level and generated fine masks. My Final Year Project (FYP) also addressed detection under rainy conditions; there, the model first removed the rain streaks from the image and then performed detection. Even though de-raining improved the accuracy, the final confidence level was only about 75%. By contrast, the confidence level of our first approach was about 99%.
The second approach used the PayAttention network model, which relies on an attention mechanism: relevant objects are assigned bright colors in the final output. The key feature of this model is its three estimators, which combine the local feature vectors with the global feature vector and generate attention maps at three different levels. Although the masks generated by the second approach were coarser than those generated by the first, the second approach did not require extensive hand annotation, which saves considerable time.
We hope that more research on these models will be done in the future to improve them further. For example, if relevant coordinates can be obtained from the attention maps generated by the PayAttention network model, they could be used as training data for the Mask R-CNN model. As a result, accuracy could be kept high without extensive hand annotation. |
author2 |
Soong Boon Hee |
author_facet |
Soong Boon Hee Du, Kaiwen |
format |
Thesis-Master by Coursework |
author |
Du, Kaiwen |
author_sort |
Du, Kaiwen |
title |
A novel deep learning approach for instance segmentation under rainy conditions |
title_sort |
novel deep learning approach for instance segmentation under rainy conditions |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/141145 |
_version_ |
1772825687532503040 |
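
The abstract's closing suggestion, obtaining coordinates from the attention maps so they can serve as Mask R-CNN training data, can be illustrated with a minimal sketch. It is not taken from the thesis: the record does not say how the estimators combine local and global features, so the dot-product scoring and softmax normalization below are assumptions, and the function names `attention_map` and `attention_to_box` and the relative threshold of 0.5 are hypothetical choices made for illustration only.

```python
import math

def attention_map(local_feats, global_feat):
    """Score each spatial position against the global feature vector.

    local_feats[r][c] is a feature vector with the same length as global_feat.
    ASSUMPTION: dot-product compatibility followed by a softmax over all
    positions; the catalog record does not specify the actual combination.
    """
    scores = [[sum(l * g for l, g in zip(vec, global_feat)) for vec in row]
              for row in local_feats]
    peak = max(max(row) for row in scores)  # subtract the peak for numerical stability
    weights = [[math.exp(s - peak) for s in row] for row in scores]
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]

def attention_to_box(att, rel_threshold=0.5):
    """Return (row_min, col_min, row_max, col_max), inclusive, of all positions
    whose attention is at least rel_threshold times the map's maximum.

    ASSUMPTION: a single relative threshold is a hypothetical way to turn an
    attention map into a coarse box annotation.
    """
    cutoff = rel_threshold * max(max(row) for row in att)
    hits = [(r, c) for r, row in enumerate(att)
            for c, a in enumerate(row) if a >= cutoff]
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    return min(rows), min(cols), max(rows), max(cols)

# Toy 6x8 feature grid: positions in rows 2-3, columns 3-5 align with the global vector.
feats = [[[1.0] * 4 if 2 <= r <= 3 and 3 <= c <= 5 else [0.0] * 4
          for c in range(8)] for r in range(6)]
att = attention_map(feats, [1.0] * 4)
box = attention_to_box(att)
```

A box obtained this way is only as precise as the attention map itself, which the abstract describes as coarser than Mask R-CNN masks, so any such annotations would likely need checking before being used for training.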