Scene understanding based on visual and acoustic data

Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various app...

Full description

Saved in:
Bibliographic Details
Main Author: Feng, Nijing
Other Authors: Mao Kezhi
Format: Final Year Project
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10356/77672
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-77672
record_format dspace
spelling sg-ntu-dr.10356-776722023-07-07T17:34:10Z Scene understanding based on visual and acoustic data Feng, Nijing Mao Kezhi School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various applications in real life. New models of CNNs are being developed continuously. In this report, the basic principles of CNN will be discussed, experiments performed will be described, attempts to improve CNN performance with additional acoustic data will be explained. Bachelor of Engineering (Electrical and Electronic Engineering) 2019-06-04T02:20:15Z 2019-06-04T02:20:15Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/77672 en Nanyang Technological University 67 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Feng, Nijing
Scene understanding based on visual and acoustic data
description Convolutional Neural Networks (CNN) is the latest development of neural network. It is a deep learning network having multiple layers of convolution operations to extract image or video features and make predictions. With recent years of rapid development, CNN can be seen everywhere with various applications in real life. New models of CNNs are being developed continuously. In this report, the basic principles of CNN will be discussed, experiments performed will be described, attempts to improve CNN performance with additional acoustic data will be explained.
author2 Mao Kezhi
author_facet Mao Kezhi
Feng, Nijing
format Final Year Project
author Feng, Nijing
author_sort Feng, Nijing
title Scene understanding based on visual and acoustic data
title_short Scene understanding based on visual and acoustic data
title_full Scene understanding based on visual and acoustic data
title_fullStr Scene understanding based on visual and acoustic data
title_full_unstemmed Scene understanding based on visual and acoustic data
title_sort scene understanding based on visual and acoustic data
publishDate 2019
url http://hdl.handle.net/10356/77672
_version_ 1772825563388444672