中文自动分词是中文信息处理的基础。
Automatic Chinese word segmentation is the basis of Chinese information processing.
本文提出了一种快速汉语自动分词算法。
A fast algorithm for Chinese words automatic segment is put forward in this paper.
汉语自动分词是进行中文信息处理的基础。
Automatic Chinese word segmentation is the basis of Chinese information processing.
汉语自动分词是进行中文信息处理的基础。
Automatic Chinese segmentation is the basis of Chinese information processing.
汉语自动分词是中文信息处理的首要工作。
Chinese automatic word segmentation is the first work in Chinese information processing.
自动分词技术是自然语言处理的基础工程。
汉语自动分词是中文信息处理的重要基石。
汉语自动分词是中文信息处理中的基础课题。
Word segmentation is a basic task of Chinese information processing.
切分歧义是影响汉语自动分词系统精度的一个重要因素。
Segmentation Ambiguity is an important factor influencing accuracy of Chinese auto-segmentation system.
交集型分词歧义是汉语自动分词中的主要歧义类型之一。
Overlapping ambiguity is a major type of ambiguity in Chinese word segmentation.
汉语自动分词面临的一个主要难题就是新词的自动识别。
A major problem of Chinese word segmentation facing is the New word Identification (NWI).
特别是对专有名词的处理是中文自动分词中的又一个难点。
Especially, it is very difficult to deal with special noun in Chinese automatic word segmentation.
本文主要研究了两个方面:自动分词方法和自动标引方法。
This paper's research focuses on the methods of automatic separation of terms and automatic indexing.
西方姓名译名的自动识别为汉语自动分词不可或缺的组成部分。
Transliterated person names identification is the necessary part of Chinese word segmentation.
使用自动分词知识可以进一步提高自动切分精度,满足高标准的需求。
Knowledge of Chinese words automatic segmentation can raise the precision of automatic segmentation, and it can satisfy high precision requirements.
汉语的自动分词,是计算机中文信息处理领域中一个基础而困难的课题。
Automatic word segmentation for the Chinese language is a fundamental and difficult problem in the field of computer Chinese language information processing.
中文搜索引擎的重点在于中文关键信息提取,其中的难点就是中文自动分词。
The core of Chinese Search Engine is the key content extracting, and the bottleneck is Chinese Word Automatic Segmentation.
论文的核心工作是设计并实现了一个基于多步处理策略的汉语自动分词系统。
The core work of the paper is designing and implementing a Chinese auto-segmentation system based on a multi-step processing strategy.
汉语自动分词分句的自动评判系统的研究已经成为一项亟待解决的紧迫课题。
Researching the automatic judge of Chinese word-split and sentence-split becomes an urgent task.
本文介绍了一个已研制成功的新闻语料自动分词系统—NEWS的结构和功能。
News, the structure and function of the automatically segmenting system, for news corpora, has been developed.
本文在分析目前自动分词各种方法的基础上,提出了一种新的神经网络分析方法。
The paper presents a new method of neural network based on analysing Current methods in automatic words segmentation.
自然语言理解采用基于两字词簇的自动分词技术,该方法简洁、速度快,易于实现。
Natural language processing used Automatic Segment based on Two-letters-word-family structure, which is succinct, high speed and easy to achieve.
词典是中文自动分词的基础,分词词典机制的优劣直接影响到中文分词的速度和效率。
As a basic component of Chinese word segmentation system, the dictionary mechanism influences the speed and the efficiency of segmentation significantly.
汉语自动分词中组合歧义是难点问题,难在两点:组合歧义字段的发现和歧义的消解。
One of challenges in Chinese Word Segmentation is the combinational ambiguity problem with two main obstacles: the detection of combinational ambiguities and ambiguity resolution.
将基于汉语自动分词的综合信息抽取技术应用于信息检索,具有实际应用意义和价值。
The extraction information technology based on Chinese Automatic segmentation with the information index system has practic...
分词词典机制是影响自动分词的重要因素,而查找速度是衡量一个词典好坏的重要标准。
The dictionary mechanism is an important factor affecting automatic word segmentation systems and the finding speed is an important criterion to determine the performance of a dictionary.
组合型歧义切分字段一直是汉语自动分词的难点,难点在于消歧依赖其上下文语境信息。
Combinational ambiguity is a challenging issue in Chinese word segmentation in that its disambiguation depends on the contextual information.
通过全面的比较研究,本文认为自动分词方法可以分为五个类型,自动标引的方法分为七种。
Through fully comparative study on automatic separation of terms and automatic indexing, this paper presents five methods of automatic separation of terms and seven ways of automatic indexing.
首先是建立法律语料库,主要是法律书面语的语料库,并对其进行机器自动分词和词性标注。
It is mainly a corpus base of the legal written language to set up legal corpus base at first, and mark automatic participle of the machine and morphological feature to it.
首先是建立法律语料库,主要是法律书面语的语料库,并对其进行机器自动分词和词性标注。
It is mainly a corpus base of the legal written language to set up legal corpus base at first, and mark automatic participle of the machine and morphological feature to it.
应用推荐