本文提出了一种快速汉语自动分词算法。
A fast algorithm for Chinese words automatic segment is put forward in this paper.
切分歧义是影响汉语自动分词系统精度的一个重要因素。
Segmentation Ambiguity is an important factor influencing accuracy of Chinese auto-segmentation system.
交集型分词歧义是汉语自动分词中的主要歧义类型之一。
Overlapping ambiguity is a major type of ambiguity in Chinese word segmentation.
西方姓名译名的自动识别为汉语自动分词不可或缺的组成部分。
Transliterated person names identification is the necessary part of Chinese word segmentation.
未登录词的存在严重影响了汉语自动分词与自动标引的准确率和速率。
The existence of unlisted words seriously affects the accuracy and speed of automatic segmentation of Chinese words.
论文的核心工作是设计并实现了一个基于多步处理策略的汉语自动分词系统。
The core work of the paper is designing and implementing a Chinese auto-segmentation system based on a multi-step processing strategy.
汉语自动分词分句的自动评判系统的研究已经成为一项亟待解决的紧迫课题。
Researching the automatic judge of Chinese word-split and sentence-split becomes an urgent task.
汉语自动分词中组合歧义是难点问题,难在两点:组合歧义字段的发现和歧义的消解。
One of challenges in Chinese Word Segmentation is the combinational ambiguity problem with two main obstacles: the detection of combinational ambiguities and ambiguity resolution.
将基于汉语自动分词的综合信息抽取技术应用于信息检索,具有实际应用意义和价值。
The extraction information technology based on Chinese Automatic segmentation with the information index system has practic...
组合型歧义切分字段一直是汉语自动分词的难点,难点在于消歧依赖其上下文语境信息。
Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on the contextual information.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic segmentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic seg mentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
为扩展分词知识库,提高自动分词能力,本文提出了一种基于自学习机制的汉语自动分词系统。
To extend word segmentation repository and enhance word segmentation capacity, a Chinese word segmentation system based on automatic learning is proposed in this paper.
本文介绍了目前采用的几种汉语自动分词技术,包括:最大匹配法、改进的最大匹配法、全切分法等。
This paper introduces many technology of segmentation, such as maximum matching, improved maximum matching, full segmentation, and so on.
本文给出了为汉语自动分词而提出的机械匹配法、特征词库法、约束矩阵法、语法分析法和理解切分法。
This paper presents methods of mechanical matching, feature lexicon, binding matrix, grammar analysis and semantic understanding for the Chinese language automatic word segmentation.
根据以上分析,我们提出了一种基于记忆的处理策略,可有效改善实用型非受限汉语自动分词系统的精度。
As a consequence, we propose a memory-based strategy that is expected to improve the performance of practical Chinese word segmenters significantly.
在本文中,我们提出了一种统一的统计语言模型方法用来汉语自动分词和中文命名实体识别,这种方法对基于词的三元语言模型进行了很好的扩展。
In this paper, we extend a word-based trigram modeling to Chinese word segmentation and Chinese named entity recognition, by proposing a unified approach to SLM.
汉语的自动分词,是计算机中文信息处理领域中一个基础而困难的课题。
Automatic word segmentation for the Chinese language is a fundamental and difficult problem in the field of computer Chinese language information processing.
根据汉语中二字词较多的特点,给出一种改进的自动分词词典机制,该机制在词典数据结构中增加二字词检测位图表。
According to the characteristics of more two-word words in Chinese, provide an improved dictionary mechanism, which add two-word-bitmap into the data structure of the dictionary.
作战指令的自动化生成主要涉及到汉语分词技术、语法分析和代码生成。
The automatic generation of military instruction is mainly concentrated with Chinese lexical analysis, grammar analysis, and instruction coding.
现代汉语文本自动分词是中文信息处理的重要基石,为此提供一个通用的分词接口是非常重要的。
Automatic word segmentation of modern Chinese text is the base of Chinese information processing. So a general purpose application interface for word segmentation is important.
单汉字标引法是在基于汉语分词的自动标引研究遇到不可克服的困难之后,而产生的一种新的自动标引方法。
The single Chinese character indexing is a new automatic indexing method produced as automatic indexing research of Chinese word segmenting meets some difficulties that can't be overcomed.
单汉字标引法是在基于汉语分词的自动标引研究遇到不可克服的困难之后,而产生的一种新的自动标引方法。
The single Chinese character indexing is a new automatic indexing method produced as automatic indexing research of Chinese word segmenting meets some difficulties that can't be overcomed.
应用推荐