Sentence-level morphological and phonological analyzer for Filipino (filSPAM)

Morphological analysis is an important process in natural language processing. It deals with the identification of a root word and its affixes (morphemes) from a morphed word. Phonology is another facet of morphology that has to do with how a word is voiced or sounded out. There are various approach...

Full description

Saved in:
Bibliographic Details
Main Authors: Alina, Angelo Nico C., Cambaliza, Carlo Benigno Romulo, Sosa, Judd Philip F., Sta. Ana, Xedric Mikai J.
Format: text
Language:English
Published: Animo Repository 2011
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/11167
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-11812
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-118122022-03-03T02:34:36Z Sentence-level morphological and phonological analyzer for Filipino (filSPAM) Alina, Angelo Nico C. Cambaliza, Carlo Benigno Romulo Sosa, Judd Philip F. Sta. Ana, Xedric Mikai J. Morphological analysis is an important process in natural language processing. It deals with the identification of a root word and its affixes (morphemes) from a morphed word. Phonology is another facet of morphology that has to do with how a word is voiced or sounded out. There are various approaches and systems that exist and are used in morphological analysis for generating rules for different languages such as MACTag. These differ in each of their methods in identification and classification of morphemes as well as handling ambiguity. Although there are systems which handle morphology for Filipino, most of these are limited in that they are only word-level and they do not cover rules for phonology. Part-of-Speech tagging is an integrated part in sentence analysis that is concerned with annotating the part-of-speech of a particular word in a sentence. There are existing tools for part-of-speech tagging such as HATPOST. These components, namely the morphological analyzer and part-of-speech tagger, function independently from one another. However, they have their own individual limitations that need to be addressed. The research constructs a sentence-level morphological and phonological analyzer for the Filipino language that integrate the aforementioned components in order to identify the part-of-speech of a Filipino word in the sentence and generate the root word and phonology of the identified words. filSPAM (Sentence-level Phonological and Morphological Analyzer for Filipino) analyzes a given Filipino sentence input and generate the corresponding part-of-speech, root word, and phonology of this sentence. The system has four modules: POS tagger which has 54% accuracy, the morphological analyzer which has 73.02% accuracy, the phonological analyzer is corpus-based and unknown handler which has two functions, the automaton and the generalized tree which has 67% accuracy and 64% respectively. 2011-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11167 Bachelor's Theses English Animo Repository Grammar, Comparative and general--Morphology Natural language processing (Computer science) Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Grammar, Comparative and general--Morphology
Natural language processing (Computer science)
Computer Sciences
spellingShingle Grammar, Comparative and general--Morphology
Natural language processing (Computer science)
Computer Sciences
Alina, Angelo Nico C.
Cambaliza, Carlo Benigno Romulo
Sosa, Judd Philip F.
Sta. Ana, Xedric Mikai J.
Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
description Morphological analysis is an important process in natural language processing. It deals with the identification of a root word and its affixes (morphemes) from a morphed word. Phonology is another facet of morphology that has to do with how a word is voiced or sounded out. There are various approaches and systems that exist and are used in morphological analysis for generating rules for different languages such as MACTag. These differ in each of their methods in identification and classification of morphemes as well as handling ambiguity. Although there are systems which handle morphology for Filipino, most of these are limited in that they are only word-level and they do not cover rules for phonology. Part-of-Speech tagging is an integrated part in sentence analysis that is concerned with annotating the part-of-speech of a particular word in a sentence. There are existing tools for part-of-speech tagging such as HATPOST. These components, namely the morphological analyzer and part-of-speech tagger, function independently from one another. However, they have their own individual limitations that need to be addressed. The research constructs a sentence-level morphological and phonological analyzer for the Filipino language that integrate the aforementioned components in order to identify the part-of-speech of a Filipino word in the sentence and generate the root word and phonology of the identified words. filSPAM (Sentence-level Phonological and Morphological Analyzer for Filipino) analyzes a given Filipino sentence input and generate the corresponding part-of-speech, root word, and phonology of this sentence. The system has four modules: POS tagger which has 54% accuracy, the morphological analyzer which has 73.02% accuracy, the phonological analyzer is corpus-based and unknown handler which has two functions, the automaton and the generalized tree which has 67% accuracy and 64% respectively.
format text
author Alina, Angelo Nico C.
Cambaliza, Carlo Benigno Romulo
Sosa, Judd Philip F.
Sta. Ana, Xedric Mikai J.
author_facet Alina, Angelo Nico C.
Cambaliza, Carlo Benigno Romulo
Sosa, Judd Philip F.
Sta. Ana, Xedric Mikai J.
author_sort Alina, Angelo Nico C.
title Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
title_short Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
title_full Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
title_fullStr Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
title_full_unstemmed Sentence-level morphological and phonological analyzer for Filipino (filSPAM)
title_sort sentence-level morphological and phonological analyzer for filipino (filspam)
publisher Animo Repository
publishDate 2011
url https://animorepository.dlsu.edu.ph/etd_bachelors/11167
_version_ 1728621034367614976