Memory-based part-of-speech tagging of Tagalog

We explore the application of memory-based learning to the tagging of Tagalog text. Memory-based learning is a supervised, classification-based learning method that relies on similarity-based reasoning. It entails building a set of cases in memory based on feature-value patterns extracted from a manual...

Bibliographic Details
Main Authors: Trogo-Oblena, Rhia S., Raga, Rodolfo C., Jr.
Format: text
Published: Animo Repository 2006
Online Access: https://animorepository.dlsu.edu.ph/faculty_research/12872
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-13891
record_format eprints
building De La Salle University Library
continent Asia
country Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Natural language processing (Computer science)
Text processing (Computer science)
Computational linguistics—Philippines
Computer Sciences
Physical Sciences and Mathematics
Software Engineering
description We explore the application of memory-based learning to the part-of-speech tagging of Tagalog text. Memory-based learning is a supervised, classification-based learning method that relies on similarity-based reasoning. It entails building a set of cases in memory from feature-value patterns extracted from a manually tagged corpus, and extrapolating the part-of-speech tag of a new word in a particular context from the most similar cases in memory. In this paper, we discuss the architecture of MBTPOST, a memory-based Tagalog POS tagger, and present the results of an experiment using MBTPOST on a small training corpus. We look in particular at the benefits of shifting window sizes and of assigning positional weight values to words occurring in different window positions, in order to address the fluidity of word structure in Tagalog sentences.
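The description above outlines the general memory-based idea: store feature-value cases from a tagged corpus, then tag a new word from its most similar stored cases. The following is a minimal Python sketch of that general idea only, not the MBTPOST implementation, whose details are not given in this record. The function names, the weighted-overlap similarity, the window size of one word on each side, the positional weights, the toy corpus, and the tag labels are all assumptions made for illustration.

```python
from collections import Counter

PAD = "_"  # placeholder for window positions that fall outside the sentence


def window_features(words, i, left=1, right=1):
    """Return the words in a window around position i, padded at the edges."""
    padded = [PAD] * left + list(words) + [PAD] * right
    return tuple(padded[i : i + left + 1 + right])


def build_memory(tagged_sentences, left=1, right=1):
    """Store one (window, tag) case per token of the tagged corpus."""
    memory = []
    for sentence in tagged_sentences:
        words = [word for word, _ in sentence]
        for i, (_, tag) in enumerate(sentence):
            memory.append((window_features(words, i, left, right), tag))
    return memory


def similarity(a, b, weights):
    """Weighted overlap: sum the weights of the window positions that match."""
    return sum(w for x, y, w in zip(a, b, weights) if x == y)


def tag_word(memory, words, i, weights, left=1, right=1, k=1):
    """Tag words[i] by majority vote among the k most similar stored cases."""
    query = window_features(words, i, left, right)
    ranked = sorted(
        memory,
        key=lambda case: similarity(query, case[0], weights),
        reverse=True,
    )
    votes = Counter(tag for _, tag in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy corpus with illustrative tags (not the paper's data or tagset).
TOY_CORPUS = [
    [("kumain", "VERB"), ("ang", "DET"), ("bata", "NOUN")],
    [("natulog", "VERB"), ("ang", "DET"), ("aso", "NOUN")],
]

# Assumed positional weights: (previous word, focus word, next word).
WEIGHTS = (1.0, 2.0, 1.0)

memory = build_memory(TOY_CORPUS)
sentence = ["tumakbo", "ang", "pusa"]
tags = [tag_word(memory, sentence, i, WEIGHTS) for i in range(len(sentence))]
```

In this sketch, changing `left` and `right` varies the window size, and changing `WEIGHTS` changes how much each window position contributes to the similarity score. Those are the two kinds of setting the description says the experiment examines, although the values used here are arbitrary.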