Measuring language similarity using trigrams: Limitations of language identification
Computational approaches in language identification often result in high number of false positives and low recall rates, especially if the languages involved come from the same subfamily. In this paper, we aim to determine the cause of this problem by measuring language similarity through trigrams....
Saved in:
Main Authors: | Oco, Nathaniel, Ilao, Joel P., Roxas, Rachel Edita, Syliongka, Leif Romeritch |
---|---|
Format: | text |
Published: |
Animo Repository
2013
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/2738 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Similar Items
-
Dice's coefficient on trigram profiles as metric for language similarity
by: Oco, Nathaniel, et al.
Published: (2013) -
Ang paggamit ng trigram ranking bilang panukat sa pagkakahalintulad at pagkakapangkat ng mga wika / Trigram ranking: Metric for language similarity and clustering
by: Oco, Natahaniel, et al.
Published: (2014) -
Building online corpora of Philippine languages
by: Dita, Shirley N., et al.
Published: (2009) -
A multilingual machine translation system for Philippine languages that exploits structural similarities
by: Roxas, Rachel Edita O., et al.
Published: (2004) -
Automatic bilingual lexicon extraction for a minority target language
by: Tiua, Eileen Pamela, et al.
Published: (2008)