Audio and visual tracking system in indoor environments
An audio and visual tracking system incorporates existing speech recognition technologies and visual aids to form a seamless autonomous system using Raspberry Pi 4. This project explores existing algorithms in both audio and visual formats and uses these algorithms via Raspberry Pi. While this proje...
Main Author: | Aathiq, M. N. M. |
---|---|
Other Authors: | Gan Woon Seng |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University, 2021 |
Subjects: | Engineering::Electrical and electronic engineering::Computer hardware, software and systems |
Online Access: | https://hdl.handle.net/10356/149921 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-149921 |
---|---|
record_format | dspace |
spelling | sg-ntu-dr.10356-149921; 2023-07-07T18:29:42Z; Audio and visual tracking system in indoor environments; Aathiq, M. N. M.; Gan Woon Seng; School of Electrical and Electronic Engineering; EWSGAN@ntu.edu.sg; Engineering::Electrical and electronic engineering::Computer hardware, software and systems; Bachelor of Engineering (Electrical and Electronic Engineering); 2021-06-10T09:14:47Z; 2021; Final Year Project (FYP); Aathiq, M. N. M. (2021). Audio and visual tracking system in indoor environments. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149921; en; A3090-201; application/pdf; Nanyang Technological University |
institution | Nanyang Technological University |
building | NTU Library |
continent | Asia |
country | Singapore |
content_provider | NTU Library |
collection | DR-NTU |
language | English |
topic | Engineering::Electrical and electronic engineering::Computer hardware, software and systems |
spellingShingle | Engineering::Electrical and electronic engineering::Computer hardware, software and systems; Aathiq, M. N. M.; Audio and visual tracking system in indoor environments |
description | An audio and visual tracking system combines existing speech recognition technologies and visual aids to form a seamless autonomous system running on a Raspberry Pi 4. This project explores existing audio and visual algorithms and runs them on the Raspberry Pi. While the system can be applied in many real-life situations, the project focuses on conferences, where speech recognition can be used to pinpoint the current speaker and keep visual tracking on that speaker. In that scenario, a Raspberry Pi provides the computing power, a microphone array determines the direction and intensity of the speech, and a camera connected to the Raspberry Pi turns toward the direction from which the microphone array senses the speech. I took over a project that had been in progress the previous year. The previous student had identified a connection between the three components and connected them through a Python program on the Raspberry Pi. My task was to continue and improve on that work and to add enhancements where needed. During the project, I tested two types of microphone arrays, carried out 3D design and printing, prototyping, and iOS development, and tested the combined system. |
author2 | Gan Woon Seng |
author_facet | Gan Woon Seng; Aathiq, M. N. M. |
format | Final Year Project |
author | Aathiq, M. N. M. |
author_sort | Aathiq, M. N. M. |
title | Audio and visual tracking system in indoor environments |
title_short | Audio and visual tracking system in indoor environments |
title_full | Audio and visual tracking system in indoor environments |
title_fullStr | Audio and visual tracking system in indoor environments |
title_full_unstemmed | Audio and visual tracking system in indoor environments |
title_sort | audio and visual tracking system in indoor environments |
publisher | Nanyang Technological University |
publishDate | 2021 |
url | https://hdl.handle.net/10356/149921 |
_version_ | 1772828371761233920 |
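
Illustrative note: the abstract describes a camera that turns toward the direction from which a microphone array senses speech. The record does not include the project's code, so the following is only a minimal sketch of how such a mapping might look, not the author's implementation. Everything named here is an assumption made for illustration: the function names, the convention that the array reports a direction of arrival of 0 to 360 degrees with 0 meaning straight ahead of the camera, a normalised speech level between 0 and 1, a pan range of plus or minus 90 degrees, and the threshold values.

```python
def wrap_to_signed(angle_deg: float) -> float:
    """Map any angle in degrees onto the interval [-180, 180)."""
    return (angle_deg + 180.0) % 360.0 - 180.0


def next_pan_angle(
    doa_deg: float,
    level: float,
    current_pan_deg: float,
    level_threshold: float = 0.2,   # assumed: ignore sounds quieter than this
    deadband_deg: float = 10.0,     # assumed: ignore small direction changes
    pan_limit_deg: float = 90.0,    # assumed: mechanical pan range of the mount
) -> float:
    """Return the pan angle the camera should move to (hypothetical helper).

    doa_deg is the direction of arrival reported by the microphone array,
    level is the measured speech intensity normalised to 0..1.
    """
    # Too quiet to count as speech: keep the camera where it is.
    if level < level_threshold:
        return current_pan_deg

    # Convert 0..360 to a signed angle, then clamp to the pan range.
    target = wrap_to_signed(doa_deg)
    target = max(-pan_limit_deg, min(pan_limit_deg, target))

    # Avoid jitter: only move when the change exceeds the deadband.
    if abs(target - current_pan_deg) < deadband_deg:
        return current_pan_deg
    return target


if __name__ == "__main__":
    pan = 0.0
    # (direction of arrival in degrees, normalised level) pairs, made up for the demo
    for doa, level in [(30, 0.8), (33, 0.7), (340, 0.9), (200, 0.05)]:
        pan = next_pan_angle(doa, level, pan)
        print(f"doa={doa:3d} level={level:.2f} -> pan={pan:.1f}")
```

The level threshold and deadband are design choices for keeping the camera steady when the room is quiet or the speaker moves only slightly. On real hardware the returned angle would still need to be converted into a servo or motor command.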