【Key words】 speech processing/automatic words segmentation; machine translation; maximum matching method; ambiguity partition
基于6个网页-相关网页
切分歧义 Ambiguition ; segmenting ambiguousness
Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on the contextual information.
组合型歧义切分字段一直是汉语自动分词的难点,难点在于消歧依赖其上下文语境信息。
参考来源 - 基于语境信息的汉语组合型歧义消歧方法 in C·2,447,543篇论文数据,部分数据来源于NoteExpress
组合型歧义切分字段一直是汉语自动分词的难点,难点在于消歧依赖其上下文语境信息。
Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on the contextual information.
汉语不同于英语,词之间没有间隔标记。而汉语分词是文本分析的第一步,且存在歧义切分,因此分词问题成为汉语分析的首要难题。
Different from English, there are no interval marks between words in Chinese, so it is difficult for word segmentation to identify ambiguous words.
切分歧义是影响汉语自动分词系统精度的一个重要因素。
Segmentation Ambiguity is an important factor influencing accuracy of Chinese auto-segmentation system.
应用推荐