Bootstrapping a Tagalog LFG F-structure Bank
Developing a wide coverage grammar is an expensive and time-consuming process. Automatic induction methods use treebanks to produce F-structures which will then be used to induce the grammar. However, this method of automatic induction for creating a Filipino LFG grammar is not possible because curr...
محفوظ في:
المؤلفون الرئيسيون: | , , , |
---|---|
التنسيق: | text |
اللغة: | English |
منشور في: |
Animo Repository
2012
|
الوصول للمادة أونلاين: | https://animorepository.dlsu.edu.ph/etd_bachelors/11933 |
الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
الملخص: | Developing a wide coverage grammar is an expensive and time-consuming process. Automatic induction methods use treebanks to produce F-structures which will then be used to induce the grammar. However, this method of automatic induction for creating a Filipino LFG grammar is not possible because currently, there are no existing treebanks for Filipino. Bootstrapping was used to solve the aforesaid problem. From existing parallel English-Filipino corpora, cross-lingual projection was used to project F-structure from English. Using state of the art and self-developed tools, equivalent Tagalog words from the parallel corpora were mapped into the projected F-structures to produce Tagalog F-structures. The final output of the research is a collection of Tagalog F-structures induced from English F-structure. |
---|