Developing a new statistical method for Chinese text segmentation
A new statistical formula for Chinese text segmentation, called the Contextual Information Formula (CIF), was developed empirically for identifying 2- and 3-character words. It was developed by performing stepwise logistic regression on a sample of sentences that had been manually segmented. 300 sentence...
Main Author: | Dai, Yubin |
---|---|
Other Authors: | Khoo, Christopher Soo Guan |
Format: | Theses and Dissertations |
Published: | 2008 |
Online Access: | http://hdl.handle.net/10356/2614 |
Institution: | Nanyang Technological University |
Similar Items
- Multi-classifier system for robust pattern recognition
  by: Ng, Geok See.
  Published: (2008)
- Power analytics of power data
  by: Chew, Jia Yong
  Published: (2014)
- Machine compatible script for fast text entry of text using handwriting
  by: Ma, Yang
  Published: (2008)
- Speaker segmentation and verification
  by: Zhong, Haishan
  Published: (2008)
- Evolutionary computation for statistical pattern recognition
  by: Wang, Xiao
  Published: (2010)