Real time semantic segmentation by fully convolutional network for UAV localization in urban environment
Deep learning is an improvement over neural networks that includes more layers of computation, allowing for higher levels of abstraction and prediction in the data. To date, it has become the leading machine learning tool for general imaging and computer vision. Current research trends also indicate...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/78780 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-78780 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-787802023-07-04T16:40:40Z Real time semantic segmentation by fully convolutional network for UAV localization in urban environment Jiang, Muyun Wang Dan Wei School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Deep learning is an improvement over neural networks that includes more layers of computation, allowing for higher levels of abstraction and prediction in the data. To date, it has become the leading machine learning tool for general imaging and computer vision. Current research trends also indicate that deep convolutional neural networks (DCNN) are very effective for automatically analyzing images. Therefore, machine learning has a wide range of uses in the perceptual positioning of robots. This project aims to localize UAV equipped with a camera automatically flying among urban streets. The UAV navigation suffers from day-night luminance and weather change,so the UAV should only remember the invariant features in the environment such as road boundaries and building structures, and ignore those may change over time, such as cars, trees. And because UAV on-board computer is a computational power limited platform, so the network design should leverage speed and accuracy. This work proposes an effective and efficient Full Convolutional Network based End-to-End Encoder-Decoder Architecture for automatically semantic segmentation in urban environment. Because of the computational power limit of the UAV platform, the network is designed to be a 12-layer FCN with Dilated Convolution to to enlarge the field of view and extract multiscale information, and depthwise and pointwise separable convolution to reduce the number of parameters and cut down the computation cost, without causing degrading performance. In case of insufficient dataset images, we use data augmentation and Drop Block to provide sufficient and training data and improve generalization capability. The proposed model can achieve real-time operation standard of 11Hz image processing frequency on NVIDIA TX2 platform with over 86.5% mean IOU. Master of Science (Computer Control and Automation) 2019-06-27T02:21:19Z 2019-06-27T02:21:19Z 2019 Thesis http://hdl.handle.net/10356/78780 en 96 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Jiang, Muyun Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
description |
Deep learning is an improvement over neural networks that includes more layers of computation, allowing for higher levels of abstraction and prediction in the data. To date, it has become the leading machine learning tool for general imaging and computer vision. Current research trends also indicate that deep convolutional neural networks (DCNN) are very effective for automatically analyzing images. Therefore, machine learning has a wide range of uses in the perceptual positioning of robots. This project aims to localize UAV equipped with a camera automatically flying among urban streets. The UAV navigation suffers from day-night luminance and weather change,so the UAV should only remember the invariant features in the environment such as road boundaries and building structures, and ignore those may change over time, such as cars, trees. And because UAV on-board computer is a computational power limited platform, so the network design should leverage speed and accuracy. This work proposes an effective and efficient Full Convolutional Network based End-to-End Encoder-Decoder Architecture for automatically semantic segmentation in urban environment. Because of the computational power limit of the UAV platform, the network is designed to be a 12-layer FCN with Dilated Convolution to to enlarge the field of view and extract multiscale information, and depthwise and pointwise separable convolution to reduce the number of parameters and cut down the computation cost, without causing degrading performance. In case of insufficient dataset images, we use data augmentation and Drop Block to provide sufficient and training data and improve generalization capability. The proposed model can achieve real-time operation standard of 11Hz image processing frequency on NVIDIA TX2 platform with over 86.5% mean IOU. |
author2 |
Wang Dan Wei |
author_facet |
Wang Dan Wei Jiang, Muyun |
format |
Theses and Dissertations |
author |
Jiang, Muyun |
author_sort |
Jiang, Muyun |
title |
Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
title_short |
Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
title_full |
Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
title_fullStr |
Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
title_full_unstemmed |
Real time semantic segmentation by fully convolutional network for UAV localization in urban environment |
title_sort |
real time semantic segmentation by fully convolutional network for uav localization in urban environment |
publishDate |
2019 |
url |
http://hdl.handle.net/10356/78780 |
_version_ |
1772826796303056896 |