Automatic speech transcription from DVD

Speech Transcription is a process of converting the speech into text. That is mapping a spoken language onto written symbols. Spoken language is a continuous phenomenon, made up of a potentially unlimited number of components. This is an application of Speech Recognition. Speech Recognition (SR) is...

Full description

Saved in:

Bibliographic Details
Main Author:	Ranjit Monisha Deva Belley
Other Authors:	Soon Ing Yann
Format:	Theses and Dissertations
Language:	English
Published:	2014
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	http://hdl.handle.net/10356/55245
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-55245
record_format	dspace
spelling	sg-ntu-dr.10356-552452023-07-04T15:35:30Z Automatic speech transcription from DVD Ranjit Monisha Deva Belley Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Speech Transcription is a process of converting the speech into text. That is mapping a spoken language onto written symbols. Spoken language is a continuous phenomenon, made up of a potentially unlimited number of components. This is an application of Speech Recognition. Speech Recognition (SR) is a process to translate the speech into text format. The above process, when done automatically it is called as Automatic Speech Recognition (ASR). ASR is the challenging problems of modem man-kind. The speech recognizer should be trained with the transcribed data. Doing this process manually is expensive. Hence it is tried to be done automatically. In this project, the subtitles and its respective time information are extracted from DVD. Then the speech from its particular time information for its respective subtitle is taken from the audio information. There may be mismatch for the speech and the subtitle with the time information. Here the time domain methods are used to overcome this problem. That is done by taking energy for each time information. The proposed project does the extraction of the subtitles and the speech information for each time information from DVD automatically. The extraction of the subtitles can be performed by converting the graphical information into text information. The corresponding speech signal must also be segmented using the timing information. The information will then be converted into a format that is suitable for training a speech recognizer. This process is done automatically in MATLAB. Master of Science (Signal Processing) 2014-01-06T08:47:52Z 2014-01-06T08:47:52Z 2013 2013 Thesis http://hdl.handle.net/10356/55245 en 51 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Ranjit Monisha Deva Belley Automatic speech transcription from DVD
description	Speech Transcription is a process of converting the speech into text. That is mapping a spoken language onto written symbols. Spoken language is a continuous phenomenon, made up of a potentially unlimited number of components. This is an application of Speech Recognition. Speech Recognition (SR) is a process to translate the speech into text format. The above process, when done automatically it is called as Automatic Speech Recognition (ASR). ASR is the challenging problems of modem man-kind. The speech recognizer should be trained with the transcribed data. Doing this process manually is expensive. Hence it is tried to be done automatically. In this project, the subtitles and its respective time information are extracted from DVD. Then the speech from its particular time information for its respective subtitle is taken from the audio information. There may be mismatch for the speech and the subtitle with the time information. Here the time domain methods are used to overcome this problem. That is done by taking energy for each time information. The proposed project does the extraction of the subtitles and the speech information for each time information from DVD automatically. The extraction of the subtitles can be performed by converting the graphical information into text information. The corresponding speech signal must also be segmented using the timing information. The information will then be converted into a format that is suitable for training a speech recognizer. This process is done automatically in MATLAB.
author2	Soon Ing Yann
author_facet	Soon Ing Yann Ranjit Monisha Deva Belley
format	Theses and Dissertations
author	Ranjit Monisha Deva Belley
author_sort	Ranjit Monisha Deva Belley
title	Automatic speech transcription from DVD
title_short	Automatic speech transcription from DVD
title_full	Automatic speech transcription from DVD
title_fullStr	Automatic speech transcription from DVD
title_full_unstemmed	Automatic speech transcription from DVD
title_sort	automatic speech transcription from dvd
publishDate	2014
url	http://hdl.handle.net/10356/55245
_version_	1772828097989574656

Automatic speech transcription from DVD

Similar Items