NON-FORMAL AND NON-SHORTENED WORD NORMALIZATION WITH EDIT DISTANCE

Voice assistant technology is growing rapidly now. Its use has begun to spread to daily use. However, voice assistants are still limited to the use of standard conversation languages. Meanwhile, Indonesian people are accustomed to saying non-formal language in everyday conversation. The execution...

全面介紹

Saved in:
書目詳細資料
主要作者: Dwi Rizqullah, Rafi
格式: Final Project
語言:Indonesia
在線閱讀:https://digilib.itb.ac.id/gdl/view/35827
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Voice assistant technology is growing rapidly now. Its use has begun to spread to daily use. However, voice assistants are still limited to the use of standard conversation languages. Meanwhile, Indonesian people are accustomed to saying non-formal language in everyday conversation. The execution of this Final Project includes solutions to overcome the problem of voice assistants with non-formal words or not included in the formal word dictionary. The approach used as a solution is to normalize the text using Levenshtein distance and Jaro-Winkler distance. Test result shows that normalization using Levenshtein distance outperform the normalization using LCS distance with accuracy difference of 8.34 percent.