Dice's coefficient on trigram profiles as metric for language similarity
In this study, we present Dice's coefficient on trigram profiles as metric for language similarity. As testbed, we focused on eight Philippine languages. No known language similarity value for these languages exists. Documents containing transcribed audio recordings, news articles, religious an...
Saved in:
Main Authors: | Oco, Nathaniel, Syliongka, Leif Romeritch, Roxas, Rachel Edita, Ilao, Joel P. |
---|---|
Format: | text |
Published: |
Animo Repository
2013
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/2737 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Similar Items
-
Measuring language similarity using trigrams: Limitations of language identification
by: Oco, Nathaniel, et al.
Published: (2013) -
Ang paggamit ng trigram ranking bilang panukat sa pagkakahalintulad at pagkakapangkat ng mga wika / Trigram ranking: Metric for language similarity and clustering
by: Oco, Natahaniel, et al.
Published: (2014) -
Pattern matching refinements to dictionary-based code-switching point detection
by: Oco, Nathaniel, et al.
Published: (2012) -
Building online corpora of Philippine languages
by: Dita, Shirley N., et al.
Published: (2009) -
Automatic target word disambiguation using syntactic relationships
by: Domingo, Ebony, et al.
Published: (2006)