Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation

Machine transliteration is automatic generation of the phonetic equivalents in a target language given a source language term, which is useful in many cross language applications. Transliteration between far distant languages, e.g. English and Chinese, is challenging because their phonological dissi...

Full description

Saved in:

Bibliographic Details
Main Authors:	GAO, Wei, WONG, Kam-Fai
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2006
Subjects:	Machine Transliteration Finite State Transducer Noisy Channel Maximum Entropy Phoneme Fertility Evaluation Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/4608
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-5611
record_format	dspace
spelling	sg-smu-ink.sis_research-56112019-12-26T06:12:03Z Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation GAO, Wei WONG, Kam-Fai Machine transliteration is automatic generation of the phonetic equivalents in a target language given a source language term, which is useful in many cross language applications. Transliteration between far distant languages, e.g. English and Chinese, is challenging because their phonological dissimilarities are significant. Existing techniques are typically rule-based or statistically noisy channel-based. Their accuracies are very low due to their intrinsic limitations on modeling transcription details. We propose direct statistical approaches on transliterating phoneme sequences for English–Chinese name translation. Aiming to improve performance, we propose two direct models: First, we adopt Finite State Automata on a process of direct mapping from English phonemes to a set of rudimentary Chinese phonetic symbols plus mapping units dynamically discovered from training. An effective algorithm for aligning phoneme chunks is proposed. Second, contextual features of each phoneme are taken into consideration by means of Maximum Entropy formalism, and the model is further refined with the precise alignment scheme using phoneme chunks. We compare our approaches with the noisy channel baseline that applies IBM SMT model, and demonstrate their superiority. 2006-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/4608 info:doi/10.1142/S0219427906001396 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Machine Transliteration Finite State Transducer Noisy Channel Maximum Entropy Phoneme Fertility Evaluation Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Machine Transliteration Finite State Transducer Noisy Channel Maximum Entropy Phoneme Fertility Evaluation Databases and Information Systems
spellingShingle	Machine Transliteration Finite State Transducer Noisy Channel Maximum Entropy Phoneme Fertility Evaluation Databases and Information Systems GAO, Wei WONG, Kam-Fai Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
description	Machine transliteration is automatic generation of the phonetic equivalents in a target language given a source language term, which is useful in many cross language applications. Transliteration between far distant languages, e.g. English and Chinese, is challenging because their phonological dissimilarities are significant. Existing techniques are typically rule-based or statistically noisy channel-based. Their accuracies are very low due to their intrinsic limitations on modeling transcription details. We propose direct statistical approaches on transliterating phoneme sequences for English–Chinese name translation. Aiming to improve performance, we propose two direct models: First, we adopt Finite State Automata on a process of direct mapping from English phonemes to a set of rudimentary Chinese phonetic symbols plus mapping units dynamically discovered from training. An effective algorithm for aligning phoneme chunks is proposed. Second, contextual features of each phoneme are taken into consideration by means of Maximum Entropy formalism, and the model is further refined with the precise alignment scheme using phoneme chunks. We compare our approaches with the noisy channel baseline that applies IBM SMT model, and demonstrate their superiority.
format	text
author	GAO, Wei WONG, Kam-Fai
author_facet	GAO, Wei WONG, Kam-Fai
author_sort	GAO, Wei
title	Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
title_short	Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
title_full	Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
title_fullStr	Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
title_full_unstemmed	Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation
title_sort	experimental studies using statistical algorithms on transliterating phoneme sequences for english-chinese name translation
publisher	Institutional Knowledge at Singapore Management University
publishDate	2006
url	https://ink.library.smu.edu.sg/sis_research/4608
_version_	1770574928665378816

Experimental studies using statistical algorithms on transliterating phoneme sequences for English-Chinese name translation

Similar Items