Generic community-powered meta speech recognition platform
In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/41796 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-41796 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-417962019-12-10T14:02:05Z Generic community-powered meta speech recognition platform M. Mugunth Kumar Theng Yin Leng Wee Kim Wee School of Communication and Information DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing In today's world, for speech enabling applications, a main problem faced by researchers is having to rework from the start or "re-invent the wheel" owing to various reasons like existing speech recognition engines being too inefficient or inaccurate, or that existing speech-enabled applications are simply too specific for consideration. To alleviate this problem, one approach that can be used is a platform-based approach, engaging community of programmers to write applications for this platform. This dissertation, hence, aims at developing a community-supported, open and generic platform allowing other speech-enabled windows based applications to be developed using this platform. The dissertation aims at bridging two disparate areas of computer science, namely speech recognition and automation. Speech recognition remains a research area for at least 50 years and automation, which was used primarily to automatically run scheduled maintenance, updating corporate systems on the same network, but little research was done on using it for improving the usability of a system. In the course of the dissertation, different possible ways of recognizing speech and of automating Windows applications are explored. A prototype is being developed in this research, in which a broad aspect of the platform's capabilities was demonstrated rather than a single aspect in detail. As a part of evaluation, a focus group study was conducted to brainstorm on futuristic scenarios that could benefit using this platform. Finally, the dissertation concludes with advantages and disadvantages of the developed technology. Master of Science (Information Systems) 2010-08-12T08:11:32Z 2010-08-12T08:11:32Z 2008 2008 Thesis http://hdl.handle.net/10356/41796 en Nanyang Technological University 91 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing M. Mugunth Kumar Generic community-powered meta speech recognition platform |
description |
In today's world, for speech enabling applications, a main problem faced by
researchers is having to rework from the start or "re-invent the wheel" owing to
various reasons like existing speech recognition engines being too inefficient or
inaccurate, or that existing speech-enabled applications are simply too specific for
consideration. To alleviate this problem, one approach that can be used is a
platform-based approach, engaging community of programmers to write
applications for this platform.
This dissertation, hence, aims at developing a community-supported, open
and generic platform allowing other speech-enabled windows based applications to
be developed using this platform.
The dissertation aims at bridging two disparate areas of computer science,
namely speech recognition and automation. Speech recognition remains a research
area for at least 50 years and automation, which was used primarily to
automatically run scheduled maintenance, updating corporate systems on the
same network, but little research was done on using it for improving the usability
of a system.
In the course of the dissertation, different possible ways of recognizing
speech and of automating Windows applications are explored. A prototype is being
developed in this research, in which a broad aspect of the platform's capabilities
was demonstrated rather than a single aspect in detail.
As a part of evaluation, a focus group study was conducted to brainstorm on
futuristic scenarios that could benefit using this platform. Finally, the dissertation
concludes with advantages and disadvantages of the developed technology. |
author2 |
Theng Yin Leng |
author_facet |
Theng Yin Leng M. Mugunth Kumar |
format |
Theses and Dissertations |
author |
M. Mugunth Kumar |
author_sort |
M. Mugunth Kumar |
title |
Generic community-powered meta speech recognition platform |
title_short |
Generic community-powered meta speech recognition platform |
title_full |
Generic community-powered meta speech recognition platform |
title_fullStr |
Generic community-powered meta speech recognition platform |
title_full_unstemmed |
Generic community-powered meta speech recognition platform |
title_sort |
generic community-powered meta speech recognition platform |
publishDate |
2010 |
url |
http://hdl.handle.net/10356/41796 |
_version_ |
1681038050681946112 |