Scene understanding based on visual and acoustic data
Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various app...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/77672 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-77672 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-776722023-07-07T17:34:10Z Scene understanding based on visual and acoustic data Feng, Nijing Mao Kezhi School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various applications in real life. New models of CNNs are being developed continuously. In this report, the basic principles of CNN will be discussed, experiments performed will be described, attempts to improve CNN performance with additional acoustic data will be explained. Bachelor of Engineering (Electrical and Electronic Engineering) 2019-06-04T02:20:15Z 2019-06-04T02:20:15Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/77672 en Nanyang Technological University 67 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Feng, Nijing Scene understanding based on visual and acoustic data |
description |
Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various applications in real life. New models of CNNs are being developed continuously. In this report, the basic principles of CNN will be discussed, experiments performed will be described, attempts to improve CNN performance with additional acoustic data will be explained. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Feng, Nijing |
format |
Final Year Project |
author |
Feng, Nijing |
author_sort |
Feng, Nijing |
title |
Scene understanding based on visual and acoustic data |
title_short |
Scene understanding based on visual and acoustic data |
title_full |
Scene understanding based on visual and acoustic data |
title_fullStr |
Scene understanding based on visual and acoustic data |
title_full_unstemmed |
Scene understanding based on visual and acoustic data |
title_sort |
scene understanding based on visual and acoustic data |
publishDate |
2019 |
url |
http://hdl.handle.net/10356/77672 |
_version_ |
1772825563388444672 |