Developing a new statistical method for Chinese text segmentation

A new statistical formula for Chinese text segmentation called Contextual Information Formula (OF) was developed empirically for identifying 2 and 3-character words. It was developed by performing stepwise logistic regression using a sample of sentences that had been manually segmented. 300 sentence...

全面介紹

Saved in:
書目詳細資料
主要作者: Dai, Yubin
其他作者: Khoo, Christopher Soo Guan
格式: Theses and Dissertations
出版: 2008
主題:
在線閱讀:http://hdl.handle.net/10356/2614
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!