Generic community-powered meta speech recognition platform

In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled...

Full description

Saved in:
Bibliographic Details
Main Author: M. Mugunth Kumar
Other Authors: Theng Yin Leng
Format: Theses and Dissertations
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/41796
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-41796
record_format dspace
spelling sg-ntu-dr.10356-417962019-12-10T14:02:05Z Generic community-powered meta speech recognition platform M. Mugunth Kumar Theng Yin Leng Wee Kim Wee School of Communication and Information DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology. Master of Science (Information Systems) 2010-08-12T08:11:32Z 2010-08-12T08:11:32Z 2008 2008 Thesis http://hdl.handle.net/10356/41796 en Nanyang Technological University 91 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
M. Mugunth Kumar
Generic community-powered meta speech recognition platform
description In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology.
author2 Theng Yin Leng
author_facet Theng Yin Leng
M. Mugunth Kumar
format Theses and Dissertations
author M. Mugunth Kumar
author_sort M. Mugunth Kumar
title Generic community-powered meta speech recognition platform
title_short Generic community-powered meta speech recognition platform
title_full Generic community-powered meta speech recognition platform
title_fullStr Generic community-powered meta speech recognition platform
title_full_unstemmed Generic community-powered meta speech recognition platform
title_sort generic community-powered meta speech recognition platform
publishDate 2010
url http://hdl.handle.net/10356/41796
_version_ 1681038050681946112