Constituent structure for Filipino: Induction through probabilistic approaches

Bibliographic Details
Main Authors: Alcantara, Danniel L.; Borra, Allan
Format: text
Published: Animo Repository 2008
Online Access: https://animorepository.dlsu.edu.ph/faculty_research/2898
Institution: De La Salle University
Description
Summary: The current state of Philippine linguistic resources, which include formal grammars, electronic dictionaries, and corpora, is not yet sufficient to support industrial-strength language technologies. This paper discusses a computational approach to automatically estimating constituent structures from a corpus using unsupervised probabilistic approaches. Two models are presented, and results show an F1 measure greater than 69%. Issues and phenomena of the Filipino language are identified and discussed. © 2008 by Danniel Alcantara and Allan Borra.
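
The record does not say how the reported F1 measure was computed. As a rough illustration only, the sketch below assumes the common grammar-induction convention of scoring unlabeled constituent brackets, with F1 as the harmonic mean of bracket precision and recall. The function name `bracket_f1` and the example spans are hypothetical and are not taken from the paper.

```python
def bracket_f1(predicted, gold):
    """Return (precision, recall, F1) for unlabeled constituent brackets.

    Each bracket is a (start, end) pair of word offsets. This scoring
    convention is an assumption; the record does not describe the
    authors' exact evaluation setup.
    """
    predicted, gold = set(predicted), set(gold)
    matched = len(predicted & gold)  # brackets proposed AND present in gold
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical bracketings of a five-word sentence (illustrative data only).
gold = {(0, 5), (0, 2), (2, 5), (3, 5)}
predicted = {(0, 5), (0, 2), (1, 5), (3, 5)}

precision, recall, f1 = bracket_f1(predicted, gold)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
```

Under this convention, an F1 above 69% would mean that the induced brackets and the reference brackets largely agree, with neither precision nor recall being very low.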