Implementing a statistical method for automatic speech recognition

Speech recognition centers on the use of natural speech for human-computer interaction providing computers an ear to listen to what human beings intend to say. In addition to speech recognition as being the most natural method of communication, it offers several advantages like ease of access, speed...

Full description

Saved in:
Bibliographic Details
Main Authors: Gochuico, Stephany, Lee, Shirlane, Marcos, Nelson, Yu, Yau Pang
Format: text
Language:English
Published: Animo Repository 1990
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/16405
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
Description
Summary:Speech recognition centers on the use of natural speech for human-computer interaction providing computers an ear to listen to what human beings intend to say. In addition to speech recognition as being the most natural method of communication, it offers several advantages like ease of access, speed, manual freedom, and remote access. The Automatic Speech Recognition system is a prototype speaker-independent, isolated speech recognition system consisting of hardware and software components necessary in performance delivery. It was implemented using a statistical method that analyzes speech parameters to recognize sequence of words spoken by a user with pauses in-between words. Words uttered by the user are compared against the words trained and stored in the vocabulary file by computing likelihood probabilities based on speech characteristics extracted from the corresponding speech signals. The vocabulary word with the highest measure of likelihood is selected to be the most probable word uttered by the user. The accuracy of recognition depends primarily on the distinctiveness and the number of words in the vocabulary and the clarity with which the user says the words. The ASR as well as other speech recognition systems provide room for future applications. These applications include: (1) Clinical-Medical records, services for the handicapped (2) Entertainment and Education - Voice-controlled toys, interactive video games (3) Manufacturing Process Control - Machine operation, package sorting (4) Office Automation - Data entry, automatic dictation, automatic transcription and (5) Security - Voiceprint identification, building access.