Generic community-powered meta speech recognition platform

In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled...

Full description

Saved in:

Bibliographic Details
Main Author:	M. Mugunth Kumar
Other Authors:	Theng Yin Leng
Format:	Theses and Dissertations
Language:	English
Published:	2010
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Online Access:	http://hdl.handle.net/10356/41796
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-41796
record_format	dspace
spelling	sg-ntu-dr.10356-417962019-12-10T14:02:05Z Generic community-powered meta speech recognition platform M. Mugunth Kumar Theng Yin Leng Wee Kim Wee School of Communication and Information DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology. Master of Science (Information Systems) 2010-08-12T08:11:32Z 2010-08-12T08:11:32Z 2008 2008 Thesis http://hdl.handle.net/10356/41796 en Nanyang Technological University 91 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing M. Mugunth Kumar Generic community-powered meta speech recognition platform
description	In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology.
author2	Theng Yin Leng
author_facet	Theng Yin Leng M. Mugunth Kumar
format	Theses and Dissertations
author	M. Mugunth Kumar
author_sort	M. Mugunth Kumar
title	Generic community-powered meta speech recognition platform
title_short	Generic community-powered meta speech recognition platform
title_full	Generic community-powered meta speech recognition platform
title_fullStr	Generic community-powered meta speech recognition platform
title_full_unstemmed	Generic community-powered meta speech recognition platform
title_sort	generic community-powered meta speech recognition platform
publishDate	2010
url	http://hdl.handle.net/10356/41796
_version_	1681038050681946112

Generic community-powered meta speech recognition platform

Similar Items