Automatic closed caption generation from video files

The idea of speech recognition using computers and software is not new. However, for years, its rather low accuracy and constantly changing variables, such as a speaker’s accent, background noise, etc. has resulted in a low adoption rate (Challenges in adopting speech recognition, 2004), until a boo...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Tan, Kenneth Chengwei
مؤلفون آخرون:	Chng Eng Siong
التنسيق:	Final Year Project
اللغة:	English
منشور في:	2014
الموضوعات:	DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
الوصول للمادة أونلاين:	http://hdl.handle.net/10356/59631
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

id	sg-ntu-dr.10356-59631
record_format	dspace
spelling	sg-ntu-dr.10356-596312019-12-10T11:33:26Z Automatic closed caption generation from video files Tan, Kenneth Chengwei Chng Eng Siong School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation The idea of speech recognition using computers and software is not new. However, for years, its rather low accuracy and constantly changing variables, such as a speaker’s accent, background noise, etc. has resulted in a low adoption rate (Challenges in adopting speech recognition, 2004), until a boom in the medical industry with the adoption of electronic health records (EHRs) (Speech Recognition Booms As EHR Adoption Grows, 2013). Speech recognition has generally been kept for specialized and educational purposes, but are now hitting the mainstream industries and is permeating into the everyday lives of its users, for example – voice commands for smartphones, dictation software for personal computers and even home automation. (Say What? Google Works to Improve YouTube Auto-Captions for the Deaf, 2011) Through the use of speech recognition, much effort and time spent in traditionally entering large amounts of text manually into a computer can now be cut down drastically. The process of creating a closed caption file for a video manually, requires effort to listen, enter and finally synchronize the close captions to the audio track of a video, from the transcriber, and it can be tedious and time consuming. Using speech recognition, this process can be automated to produce closed captions, at a fraction of that time and effort previously required. This final year project report documents the development of an application that automatically generates closed captions from the input of a video file. It also discusses about the current speech recognition technology, and potential improvements and modifications that may be added to the application in future. The project commenced in August 2013 and completed in February 2014. Bachelor of Engineering (Computer Science) 2014-05-09T04:33:23Z 2014-05-09T04:33:23Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/59631 en Nanyang Technological University 47 p. application/msword
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
spellingShingle	DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation Tan, Kenneth Chengwei Automatic closed caption generation from video files
description	The idea of speech recognition using computers and software is not new. However, for years, its rather low accuracy and constantly changing variables, such as a speaker’s accent, background noise, etc. has resulted in a low adoption rate (Challenges in adopting speech recognition, 2004), until a boom in the medical industry with the adoption of electronic health records (EHRs) (Speech Recognition Booms As EHR Adoption Grows, 2013). Speech recognition has generally been kept for specialized and educational purposes, but are now hitting the mainstream industries and is permeating into the everyday lives of its users, for example – voice commands for smartphones, dictation software for personal computers and even home automation. (Say What? Google Works to Improve YouTube Auto-Captions for the Deaf, 2011) Through the use of speech recognition, much effort and time spent in traditionally entering large amounts of text manually into a computer can now be cut down drastically. The process of creating a closed caption file for a video manually, requires effort to listen, enter and finally synchronize the close captions to the audio track of a video, from the transcriber, and it can be tedious and time consuming. Using speech recognition, this process can be automated to produce closed captions, at a fraction of that time and effort previously required. This final year project report documents the development of an application that automatically generates closed captions from the input of a video file. It also discusses about the current speech recognition technology, and potential improvements and modifications that may be added to the application in future. The project commenced in August 2013 and completed in February 2014.
author2	Chng Eng Siong
author_facet	Chng Eng Siong Tan, Kenneth Chengwei
format	Final Year Project
author	Tan, Kenneth Chengwei
author_sort	Tan, Kenneth Chengwei
title	Automatic closed caption generation from video files
title_short	Automatic closed caption generation from video files
title_full	Automatic closed caption generation from video files
title_fullStr	Automatic closed caption generation from video files
title_full_unstemmed	Automatic closed caption generation from video files
title_sort	automatic closed caption generation from video files
publishDate	2014
url	http://hdl.handle.net/10356/59631
_version_	1681041005448527872

Automatic closed caption generation from video files

مواد مشابهة