Automatic speech transcription from DVD

Speech transcription is the process of converting speech into text, that is, mapping spoken language onto written symbols. Spoken language is a continuous phenomenon made up of a potentially unlimited number of components. Transcription is an application of speech recognition. Speech Recognition (SR) is...

Bibliographic Details
Main Author: Ranjit Monisha Deva Belley
Other Authors: Soon Ing Yann
Format: Theses and Dissertations
Language: English
Published: 2014
Subjects: DRNTU::Engineering::Electrical and electronic engineering
Online Access: http://hdl.handle.net/10356/55245
Institution: Nanyang Technological University
id sg-ntu-dr.10356-55245
record_format dspace
spelling sg-ntu-dr.10356-55245 2023-07-04T15:35:30Z Automatic speech transcription from DVD Ranjit Monisha Deva Belley Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Speech transcription is the process of converting speech into text, that is, mapping spoken language onto written symbols. Spoken language is a continuous phenomenon made up of a potentially unlimited number of components. Transcription is an application of speech recognition (SR), the process of translating speech into text. When this is done automatically, it is called Automatic Speech Recognition (ASR). ASR is one of the challenging problems of modern mankind. A speech recognizer must be trained on transcribed data, and producing such data manually is expensive, so this project attempts to produce it automatically. The subtitles and their timing information are extracted from a DVD. The speech corresponding to each subtitle is then taken from the audio track using that subtitle's timing information. The speech and the subtitle may not line up exactly with the timing information. Time-domain methods are used to overcome this problem, by computing the signal energy for each timing interval. The subtitles are extracted by converting the graphical information into text. The corresponding speech signal is segmented using the timing information. The result is then converted into a format suitable for training a speech recognizer. The whole process is carried out automatically in MATLAB. Master of Science (Signal Processing) 2014-01-06T08:47:52Z 2014-01-06T08:47:52Z 2013 2013 Thesis http://hdl.handle.net/10356/55245 en 51 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Ranjit Monisha Deva Belley
Automatic speech transcription from DVD
author2 Soon Ing Yann
author_facet Soon Ing Yann
Ranjit Monisha Deva Belley
format Theses and Dissertations
author Ranjit Monisha Deva Belley
author_sort Ranjit Monisha Deva Belley
title Automatic speech transcription from DVD
title_short Automatic speech transcription from DVD
title_full Automatic speech transcription from DVD
title_fullStr Automatic speech transcription from DVD
title_full_unstemmed Automatic speech transcription from DVD
title_sort automatic speech transcription from dvd
publishDate 2014
url http://hdl.handle.net/10356/55245
_version_ 1772828097989574656
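
Illustrative note: the abstract says that mismatches between subtitle timing and speech are handled with time-domain methods based on signal energy, but this record gives no further detail. The sketch below is one plausible reading of that idea, not the thesis's actual method. It is written in Python rather than the MATLAB used in the thesis, and the function names, the 20 ms frame length, the 0.5 s search margin, and the 10% relative threshold are all assumptions made for illustration.

```python
import math


def short_time_energy(samples, frame_len):
    """Return the energy (sum of squares) of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len])
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def refine_segment(samples, rate, start_s, end_s,
                   frame_ms=20, margin_s=0.5, rel_threshold=0.1):
    """Snap a subtitle interval (in seconds) to the energetic part of the audio.

    The search window is the subtitle interval widened by margin_s on each
    side. The refined interval runs from the first to the last frame whose
    energy exceeds rel_threshold times the largest frame energy in the window.
    All parameter defaults are illustrative assumptions.
    """
    frame_len = max(1, int(rate * frame_ms / 1000))
    lo = max(0, int((start_s - margin_s) * rate))
    hi = min(len(samples), int((end_s + margin_s) * rate))
    energy = short_time_energy(samples[lo:hi], frame_len)
    if not energy or max(energy) == 0:
        # Nothing to align against: keep the subtitle timing unchanged.
        return start_s, end_s
    threshold = rel_threshold * max(energy)
    active = [i for i, e in enumerate(energy) if e > threshold]
    new_start = (lo + active[0] * frame_len) / rate
    new_end = (lo + (active[-1] + 1) * frame_len) / rate
    return new_start, new_end


if __name__ == "__main__":
    # Synthetic audio: 1 s of silence, 1 s of a 50 Hz tone, 1 s of silence.
    rate = 1000
    tone = [math.sin(2 * math.pi * 50 * n / rate) for n in range(rate)]
    audio = [0.0] * rate + tone + [0.0] * rate
    # A subtitle whose timing is 0.25 s early relative to the "speech".
    print(refine_segment(audio, rate, 0.75, 1.75))
```

In this sketch a subtitle that is slightly early or late is pulled toward the nearest stretch of audible signal, to within one frame. A fixed relative threshold is the simplest choice; real film audio with music or background noise would likely need something more robust.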