Automatic lexicon extraction from comparable, non-parallel corpora

An automated approach to extracting a bilingual lexicon (or dictionary) from comparable, non-parallel corpora is developed, implemented, and tested. The corpora used are of bilingual domains, containing 381,553 English and 92,610 Tagalog terms, with a corresponding 4,817 and 3,421 distinct root words, res...

Bibliographic Details
Main Author: Tiu, Eileen Pamela K.
Format: text
Language: English
Published: Animo Repository 2004
Online Access: https://animorepository.dlsu.edu.ph/etd_masteral/3173
Institution: De La Salle University