...名识别;未登录词识别;角色标注;Viterbi算法 [gap=1439]
Keywords: organization name recognition; unknown word recognition; role tagging; Viterbi algorithm.
Ambiguity processing and unknown word recognition are the two major difficulties of Chinese word segmentation.
其中歧义处理技术和未登录词识别技术是中文分词技术的两大难点。
Source: 基于词典的中文分词歧义算法研究
In Chinese word segmentation, unknown word recognition errors, especially proper noun recognition errors, are one of the main causes of automatic segmentation errors.
对于中文分词,未登录词识别错误尤其是专有名词识别错误是导致自动分词错误的主要原因之一。
Source: 中文文本姓名识别的研究
Ambiguity resolution and unknown word identification are the two major technical difficulties in Chinese word segmentation (CWS).
歧义消除和未登录词识别是分词的两大技术难点。
Source: 面向大规模信息检索的中文分词技术研究
它同时解决了模糊的短语边界的问题和未登录词识别问题。
It simultaneously solves the problems of ambiguous phrase boundaries and unknown word identification.
系统包括初切分,词性标注、歧义字段处理、模型平滑、未登录词识别等功能模块。
The system includes functional modules such as initial segmentation, POS tagging, ambiguous-field processing, model smoothing, and unknown word recognition.
在未登录词识别中,我们分别对数词短语、叠字词、名字的识别提出了不同的识别方法。
For unknown word recognition, we propose different methods for recognizing numeral phrases, reduplicated words, and names.