本文提出了一种快速汉语自动分词算法。
A fast algorithm for Chinese words automatic segment is put forward in this paper.
自动分词技术是自然语言处理的基础工程。
切分歧义是影响汉语自动分词系统精度的一个重要因素。
Segmentation Ambiguity is an important factor influencing accuracy of Chinese auto-segmentation system.
交集型分词歧义是汉语自动分词中的主要歧义类型之一。
Overlapping ambiguity is a major type of ambiguity in Chinese word segmentation.
本文主要研究了两个方面:自动分词方法和自动标引方法。
This paper's research focuses on the methods of automatic separation of terms and automatic indexing.
西方姓名译名的自动识别为汉语自动分词不可或缺的组成部分。
Transliterated person names identification is the necessary part of Chinese word segmentation.
使用自动分词知识可以进一步提高自动切分精度,满足高标准的需求。
Knowledge of Chinese words automatic segmentation can raise the precision of automatic segmentation, and it can satisfy high precision requirements.
在文本挖掘中,中国自动分词技术是计算机科学所要面对的一个难题。
In the text mining, the technology of Chinese automatic word segmentation is a difficult problem that the computer science has to face.
未登录词的存在严重影响了汉语自动分词与自动标引的准确率和速率。
The existence of unlisted words seriously affects the accuracy and speed of automatic segmentation of Chinese words.
汉语的自动分词,是计算机中文信息处理领域中一个基础而困难的课题。
Automatic word segmentation for the Chinese language is a fundamental and difficult problem in the field of computer Chinese language information processing.
中文搜索引擎的重点在于中文关键信息提取,其中的难点就是中文自动分词。
The core of Chinese Search Engine is the key content extracting, and the bottleneck is Chinese Word Automatic Segmentation.
论文的核心工作是设计并实现了一个基于多步处理策略的汉语自动分词系统。
The core work of the paper is designing and implementing a Chinese auto-segmentation system based on a multi-step processing strategy.
汉语自动分词分句的自动评判系统的研究已经成为一项亟待解决的紧迫课题。
Researching the automatic judge of Chinese word-split and sentence-split becomes an urgent task.
本文介绍了一个已研制成功的新闻语料自动分词系统—NEWS的结构和功能。
News, the structure and function of the automatically segmenting system, for news corpora, has been developed.
本文在分析目前自动分词各种方法的基础上,提出了一种新的神经网络分析方法。
The paper presents a new method of neural network based on analysing Current methods in automatic words segmentation.
自然语言理解采用基于两字词簇的自动分词技术,该方法简洁、速度快,易于实现。
Natural language processing used Automatic Segment based on Two-letters-word-family structure, which is succinct, high speed and easy to achieve.
词典是中文自动分词的基础,分词词典机制的优劣直接影响到中文分词的速度和效率。
As a basic component of Chinese word segmentation system, the dictionary mechanism influences the speed and the efficiency of segmentation significantly.
汉语自动分词中组合歧义是难点问题,难在两点:组合歧义字段的发现和歧义的消解。
One of challenges in Chinese Word Segmentation is the combinational ambiguity problem with two main obstacles: the detection of combinational ambiguities and ambiguity resolution.
将基于汉语自动分词的综合信息抽取技术应用于信息检索,具有实际应用意义和价值。
The extraction information technology based on Chinese Automatic segmentation with the information index system has practic...
分词词典机制是影响自动分词的重要因素,而查找速度是衡量一个词典好坏的重要标准。
The dictionary mechanism is an important factor affecting automatic word segmentation systems and the finding speed is an important criterion to determine the performance of a dictionary.
组合型歧义切分字段一直是汉语自动分词的难点,难点在于消歧依赖其上下文语境信息。
Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on the contextual information.
通过全面的比较研究,本文认为自动分词方法可以分为五个类型,自动标引的方法分为七种。
Through fully comparative study on automatic separation of terms and automatic indexing, this paper presents five methods of automatic separation of terms and seven ways of automatic indexing.
首先是建立法律语料库,主要是法律书面语的语料库,并对其进行机器自动分词和词性标注。
It is mainly a corpus base of the legal written language to set up legal corpus base at first, and mark automatic participle of the machine and morphological feature to it.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic seg mentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
汉语自动分词是计算机中文信息处理中的难题,也是文献内容分析中必须解决的关键问题之一。
Chinese automatic segmentation is one of the most difficult problems in computer Chinese information disposal and the key problem that document content analysis must resolve.
为扩展分词知识库,提高自动分词能力,本文提出了一种基于自学习机制的汉语自动分词系统。
To extend word segmentation repository and enhance word segmentation capacity, a Chinese word segmentation system based on automatic learning is proposed in this paper.
现代汉语文本自动分词是中文信息处理的重要基石,为此提供一个通用的分词接口是非常重要的。
Automatic word segmentation of modern Chinese text is the base of Chinese information processing. So a general purpose application interface for word segmentation is important.
现代汉语文本自动分词是中文信息处理的重要基石,为此提供一个通用的分词接口是非常重要的。
Automatic word segmentation of modern Chinese text is the base of Chinese information processing. So a general purpose application interface for word segmentation is important.
应用推荐