它同时解决了模糊的短语边界的问题和未登录词识别问题。
It simultaneously solves the problems of ambiguous phrase boundaries and unknown word identification.
系统包括初切分,词性标注、歧义字段处理、模型平滑、未登录词识别等功能模块。
The system includes functional modules for initial segmentation, POS tagging, ambiguous segment processing, model smoothing, and unknown word recognition.
在未登录词识别中,我们分别对数词短语、叠字词、名字的识别提出了不同的识别方法。
For unknown word recognition, we propose different methods for recognizing numeral phrases, reduplicated words, and names.
介绍英汉机译中识别未登录词的一种新方法。
A new method for recognizing unknown words in English-Chinese machine translation is introduced.
为扩展分词词典,提高分词的准确率,本文提出了一种基于信息熵的中文高频词抽取算法,其结果可以用来识别未登录词并扩充现有词典。
To extend the word segmentation dictionary and improve segmentation accuracy, this paper proposes an information-entropy-based algorithm for extracting high-frequency Chinese words; its results can be used to identify unknown words and to expand existing dictionaries.
这样浪费了大量的人力,并且难以很好的解决未登录词的识别问题。
This wastes a great deal of labor, and it still fails to solve the unknown word recognition problem well.
通过对新词召回率和分词准确率两个指标,证明本文设计的未登录词自动识别新方法是可行的。
Two metrics, new-word recall and segmentation accuracy, show that the new method for automatic unknown word recognition designed in this paper is feasible.