Word-based approach is the mainstream on Japanese word segmentation.
基于词的方法是日语分词的主流方法。
Aiming at the dissatisfied effect of Chinese word segmentation to Email texts, an improved Maximum Match Based Approach is presented.
针对邮件文本分词效果较差的特点,提出采用一种改进的最大匹配法来进行中文分词的方法。
In the corpus-based approach, concordance data is extracted from a corpus on a random basis and examined within a colligational framework to generalize the collocational behaviour of a key word.
基于数据的方法以语料库索引为基本依据,在传统的句法框架内对词项的搭配进行检查与概括;
应用推荐