Audio and visual tracking system in indoor environments

An audio and visual tracking system incorporates existing speech recognition technologies and visual aids to form a seamless autonomous system using Raspberry Pi 4. This project explores existing algorithms in both audio and visual formats and uses these algorithms via Raspberry Pi. While this proje...

Bibliographic Details
Main Author: Aathiq, M. N. M.
Other Authors: Gan Woon Seng
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2021
Subjects: Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Online Access: https://hdl.handle.net/10356/149921
Institution: Nanyang Technological University
id sg-ntu-dr.10356-149921
record_format dspace
spelling sg-ntu-dr.10356-149921; 2023-07-07T18:29:42Z; Audio and visual tracking system in indoor environments; Aathiq, M. N. M.; Gan Woon Seng; School of Electrical and Electronic Engineering; EWSGAN@ntu.edu.sg; Engineering::Electrical and electronic engineering::Computer hardware, software and systems; Bachelor of Engineering (Electrical and Electronic Engineering); 2021-06-10T09:14:47Z; 2021; Final Year Project (FYP); Aathiq, M. N. M. (2021). Audio and visual tracking system in indoor environments. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/149921; en; A3090-201; application/pdf; Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering::Computer hardware, software and systems
description An audio and visual tracking system incorporates existing speech recognition technologies and visual aids to form a seamless autonomous system using Raspberry Pi 4. This project explores existing algorithms in both audio and visual formats and runs these algorithms on a Raspberry Pi. While the system can be applied in many real-life settings, this project focuses on conferences. Speech recognition can be used to pinpoint a speaker so that the camera visually tracks whoever is currently speaking. In that scenario, a Raspberry Pi provided the computing power, while a microphone array determined the direction and intensity of the speech. A camera connected to the Raspberry Pi changes direction according to where the microphone array senses the speech is coming from. To that end, I took over a project that had been in progress the previous year. The previous student had identified a connection between the three components and linked them via a Python program on the Raspberry Pi. My project was to continue and improve on that progress and add advancements where needed. During my project, I tested two types of microphone arrays and carried out 3D design and printing, prototyping, iOS development, and further testing of the combined system.
author2 Gan Woon Seng
author_facet Gan Woon Seng
Aathiq, M. N. M.
format Final Year Project
author Aathiq, M. N. M.
author_sort Aathiq, M. N. M.
title Audio and visual tracking system in indoor environments
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/149921
_version_ 1772828371761233920