Towards audio-assist cognitive computing : algorithms and applications

Acoustic signals carry meaningful hidden information that cognitive computing algorithms can use to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two c...

Bibliographic Details
Main Author: Liu, Ziyuan
Other Authors: Wen Yonggang
Format: Thesis-Master by Research
Language: English
Published: Nanyang Technological University 2020
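Subjects: Engineering::Computer science and engineering::Computer systems organization::Computer system implementation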
Online Access: https://hdl.handle.net/10356/136992
Institution: Nanyang Technological University
id sg-ntu-dr.10356-136992
record_format dspace
spelling sg-ntu-dr.10356-136992 2020-10-28T08:29:21Z
Towards audio-assist cognitive computing : algorithms and applications
Liu, Ziyuan
Wen Yonggang
School of Computer Science and Engineering
YGWEN@ntu.edu.sg
Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
Master of Engineering
2020-02-11T01:23:09Z 2020-02-11T01:23:09Z
2019
Thesis-Master by Research
Liu, Z. (2019). Towards audio-assist cognitive computing : algorithms and applications. Master's thesis, Nanyang Technological University, Singapore.
https://hdl.handle.net/10356/136992
10.32657/10356/136992
en
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
application/pdf
Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
spellingShingle Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
Liu, Ziyuan
Towards audio-assist cognitive computing : algorithms and applications
description Acoustic signals carry meaningful hidden information that cognitive computing algorithms can use to improve the quality of services and applications. Inspired by this idea, we develop and optimize a series of applications based on cognitive computing algorithms. Two cognitive computing algorithms are developed: the Audio Tag and Audio Fingerprint algorithms. Their implementation and experimental results suggest that information hidden in acoustic signals, whether manually implanted or innate, can be exploited with proper techniques. The experimental results demonstrate that both algorithms have high accuracy and low time cost. The audio tag algorithm achieves 100% accuracy (recognition in under 5 seconds), even with loud noise present in specific experimental environments. The audio fingerprint algorithm achieves over 95% accuracy (recognition in under 5 seconds) with proper parameter settings. Based on the two core algorithms, two Android applications are developed: Hey!Shake and Parking Loud. They apply these algorithms to TV-watching and parking-lot access control scenarios and provide services with better quality, lower hardware cost, and more convenience for users. The results of this research project confirm that it is possible to improve the quality of multimedia services by digging into often-overlooked acoustic information.
author2 Wen Yonggang
author_facet Wen Yonggang
Liu, Ziyuan
format Thesis-Master by Research
author Liu, Ziyuan
author_sort Liu, Ziyuan
title Towards audio-assist cognitive computing : algorithms and applications
title_short Towards audio-assist cognitive computing : algorithms and applications
title_full Towards audio-assist cognitive computing : algorithms and applications
title_fullStr Towards audio-assist cognitive computing : algorithms and applications
title_full_unstemmed Towards audio-assist cognitive computing : algorithms and applications
title_sort towards audio-assist cognitive computing : algorithms and applications
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/136992
_version_ 1683494345674588160