Use of word and character N-grams for low-resourced local languages
Language identification is a text classification task for identifying the language of a given text. Several works use this as a preprocessing technique prior to sentiment analysis, mood analysis, and named entity recognition among others. Thus, building an accurate language identification engine is...
Saved in:
Main Authors: | Regalado, Ralph Vincent, Agarap, Abien Fred, Baliber, Renz Iver, Yambao, Arian, Cheng, Charibeth |
---|---|
Format: | text |
Published: |
Animo Repository
2019
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/3924 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Similar Items
-
Bridging Philippine languages with multilingual neural machine translation
by: Baliber, Renz Iver D.
Published: (2021) -
Self-organizing cooperative neural network experts
by: Agarap, Abien Fred
Published: (2022) -
Incorporation of WordNet features to n-gram features in a language modeler
by: Go, Kathleen L., et al.
Published: (2008) -
Incorporation of WordNet features to n-gram features in a language modeler
by: Go, Kathleen L.
Published: (2008) -
Sentiment analysis of the burmese language using the distributed representation of n-gram-based words
by: Myat lay phyu
Published: (2023)