基于既定词表的自适应汉语分词技术研究 - 田野的羽毛 - 博客园 关键词] 自动分词 新词识别 未登录词 [gap=735]Keywords] automatic segmentation; new word identification; unlisted words
基于14个网页-相关网页
The dissertation puts forward an experiment based on the N-gram model. And the result indicates the N-gram be feasible in neologism distinguishing.
提出了基于N-gram的新词识别过程,并进行了评测,结果表明N-gram技术在中文新词识别上是可行的。
参考来源 - N·2,447,543篇论文数据,部分数据来源于NoteExpress
新词识别和模糊性解决信息检索精度有重要的影响。
New words recognition and ambiguity resolving have vital effect on information retrieval precision.
前者涉及到词法、句法、语义分析,包括汉语分词、词性标注、注音、命名实体识别、新词发现、句法分析、词义消歧等。
The former includes Chinese word segmentation, part - of - speech tagging, pinyin tagging, named entity recognition, new word detection, syntactic parsing, word sense disambiguation, etc.
前者涉及到词法、句法、语义分析,包括汉语分词、词性标注、注音、命名实体识别、新词发现、句法分析、词义消歧等。
The former includes Chinese word segmentation, part-of-speech tagging, pinyin tagging, named entity recognition, new word detection, syntactic parsing, word sense disambiguation, etc .
应用推荐