Towards audio-assist cognitive computing : algorithms and applications
Meaningful information hidden in the acoustic signals can be utilized by cognitive computing algorithms. The algorithms use them to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two c...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/136992 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-136992 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1369922020-10-28T08:29:21Z Towards audio-assist cognitive computing : algorithms and applications Liu, Ziyuan Wen Yonggang School of Computer Science and Engineering YGWEN@ntu.edu.sg Engineering::Computer science and engineering::Computer systems organization::Computer system implementation Meaningful information hidden in the acoustic signals can be utilized by cognitive computing algorithms. The algorithms use them to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two cognitive computing algorithms are developed: Audio Tag and Audio Fingerprint algorithms. The implementation and experiment results of the algorithms suggest that the information hidden in acoustic signals, either manually implanted or innate, can be utilized by proper techniques. The experiment results demonstrate that the audio tag and audio fingerprint algorithm have high accuracy and low time cost. The audio tag algorithm achieves 100\% accuracy (recognition under 5 seconds), with loud noises existing in specific experiment environments. The audio fingerprint algorithm achieves over 95\% accuracy(recognition under 5 seconds), with proper parameter settings. Based on the two core algorithms, two android applications are developed: Hey!Shake and Parking Loud application. They utilize these algorithms in the TV watching and parking lot access control scenarios and provide services with better quality, less hardware cost, and more convenience for users. The results of this research project confirm the possibility that we can improve the quality of multimedia services by digging into the often-overlooked acoustic information. Master of Engineering 2020-02-11T01:23:09Z 2020-02-11T01:23:09Z 2019 Thesis-Master by Research Liu, Z. (2019). Towards audio-assist cognitive computing : algorithms and applications. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/136992 10.32657/10356/136992 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computer systems organization::Computer system implementation |
spellingShingle |
Engineering::Computer science and engineering::Computer systems organization::Computer system implementation Liu, Ziyuan Towards audio-assist cognitive computing : algorithms and applications |
description |
Meaningful information hidden in the acoustic signals can be utilized by cognitive computing algorithms. The algorithms use them to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two cognitive computing algorithms are developed: Audio Tag and Audio Fingerprint algorithms. The implementation and experiment results of the algorithms suggest that the information hidden in acoustic signals, either manually implanted or innate, can be utilized by proper techniques. The experiment results demonstrate that the audio tag and audio fingerprint algorithm have high accuracy and low time cost. The audio tag algorithm achieves 100\% accuracy (recognition under 5 seconds), with loud noises existing in specific experiment environments. The audio fingerprint algorithm achieves over 95\% accuracy(recognition under 5 seconds), with proper parameter settings. Based on the two core algorithms, two android applications are developed: Hey!Shake and Parking Loud application. They utilize these algorithms in the TV watching and parking lot access control scenarios and provide services with better quality, less hardware cost, and more convenience for users. The results of this research project confirm the possibility that we can improve the quality of multimedia services by digging into the often-overlooked acoustic information. |
author2 |
Wen Yonggang |
author_facet |
Wen Yonggang Liu, Ziyuan |
format |
Thesis-Master by Research |
author |
Liu, Ziyuan |
author_sort |
Liu, Ziyuan |
title |
Towards audio-assist cognitive computing : algorithms and applications |
title_short |
Towards audio-assist cognitive computing : algorithms and applications |
title_full |
Towards audio-assist cognitive computing : algorithms and applications |
title_fullStr |
Towards audio-assist cognitive computing : algorithms and applications |
title_full_unstemmed |
Towards audio-assist cognitive computing : algorithms and applications |
title_sort |
towards audio-assist cognitive computing : algorithms and applications |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/136992 |
_version_ |
1683494345674588160 |