Multiple paragraphs, bullet points, clear segmentation and bold headers are all easier on the reader than a chunky 400-word piece of text.
相比一篇排的密密麻麻的400字的段落,多重段落,强调重点,清晰的段落分割点,重点突出标题能更加有助于读者看懂。
As a basic component of a Chinese word segmentation system, the dictionary mechanism significantly influences the speed and efficiency of segmentation.
词典是中文自动分词的基础,分词词典机制的优劣直接影响到中文分词的速度和效率。
This paper focuses on word segmentation techniques for Chinese text classification.
本文对中文文本分类的分词技术进行了着重讨论。
Based on the concept of the formal word, the author investigates the non-grammatical factors that determine segmentation units, including semantic and phonetic factors.
本文提出了“形式词”概念,并在形式词的基础上,进一步研究了确定切词单位的非语法因素,包括语义因素、语音因素。
To address the poor performance of Chinese word segmentation on email text, an improved maximum matching approach is presented.
针对邮件文本分词效果较差的特点,提出采用一种改进的最大匹配法来进行中文分词的方法。
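The sentence above names an improved maximum matching approach without describing the improvement. As a point of reference only, here is a minimal sketch of the plain forward maximum matching baseline. The function name, the `max_len` parameter, and the toy lexicon are illustrative assumptions, not taken from the cited work.

```python
def forward_max_match(text, lexicon, max_len=4):
    """Greedy forward maximum matching (baseline sketch, not the improved method)."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted so the scan cannot stall.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy lexicon, for illustration only.
toy_lexicon = {"中文", "分词", "邮件", "文本"}
print(forward_max_match("中文邮件文本分词", toy_lexicon))
```

With this toy lexicon the input is split into 中文 / 邮件 / 文本 / 分词. Real systems differ mainly in the lexicon and in how unknown words are handled.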
This thesis presents the goal of a general-purpose word segmentation system and discusses its principles and design schemes.
本文提出了通用分词接口的目标,论述了它的原理和设计方案。
Index module: first, the design approach for Chinese word segmentation is discussed and a segmentation algorithm is chosen.
索引模块中:首先,讨论了中文分词的设计思想,选择了分词的算法。
One of the challenges in Chinese word segmentation is combinational ambiguity, which poses two main obstacles: detecting combinational ambiguities and resolving them.
汉语自动分词中组合歧义是难点问题,难在两点:组合歧义字段的发现和歧义的消解。
The experimental results show that this model not only effectively shortens the execution time of the word segmentation program but also improves grid resource utilization.
实验结果表明该模型不仅有效地缩短了分词处理程序的执行时间,而且能够提高网格资源的利用率。
The Interface for Database Query in Chinese (IDCQ) is designed and implemented. The system includes a regular word segmentation subsystem and an object semantic analysis subsystem.
设计和实现了汉语数据库自然语言查询接口系统(IDCQ),系统包括正则分词子系统和对象语义解析子系统;
Text classification helps users read and handle vast amounts of text selectively, so research on its preliminary step, word segmentation, is significant.
文本分类有助于用户有选择地阅读和处理海量文本,因此其预备工作分词系统的研究是很有意义的。
We also summarize the word structures and phonetic structures of Uighur, and propose some rules for Uighur word segmentation along with methods for implementing them.
本文对维文词的词法和语音法结构进行了归纳,提出了维语词切分的一些规律和实现方法。
Building on morphology and syntax, the method analyzes ambiguous strings in context and thereby improves the accuracy of word segmentation.
该方法建立在词法和句法基础上,从语境角度分析歧义字段,提高分词准确率。
Chinese word segmentation is the first step of subject extraction, and its quality directly affects the quality of text subject extraction.
而主题提取是以中文分词作为第一步,分词质量直接影响到文献主题提取的质量。
Automatic word segmentation for the Chinese language is a fundamental and difficult problem in the field of computer-based Chinese information processing.
汉语的自动分词,是计算机中文信息处理领域中一个基础而困难的课题。
This paper describes word segmentation for natural-language database queries written in restricted Chinese.
对数据库受限汉语自然语言查询语句进行分词处理。
Automatic word segmentation of modern Chinese text is the foundation of Chinese information processing, so a general-purpose application interface for word segmentation is important.
现代汉语文本自动分词是中文信息处理的重要基石,为此提供一个通用的分词接口是非常重要的。
The dictionary mechanism is an important factor affecting automatic word segmentation systems, and lookup speed is an important criterion for judging the performance of a dictionary.
分词词典机制是影响自动分词的重要因素,而查找速度是衡量一个词典好坏的重要标准。
Using the word segmentation results, we compute comprehensive feature weights and obtain the set of feature words.
对分词的结果再进行综合加权处理,最终得到文档的特征概念集。
Overlapping ambiguity is a major type of ambiguity in Chinese word segmentation.
交集型分词歧义是汉语自动分词中的主要歧义类型之一。
Therefore, word segmentation is a key sub-problem in Chinese information processing tasks such as machine translation, information retrieval, and text classification.
因此,分词在机器翻译、信息检索、文本分类等中文信息处理的各项任务中都发挥着基础性的重要作用。
The former includes Chinese word segmentation, part-of-speech tagging, pinyin tagging, named entity recognition, new word detection, syntactic parsing, word sense disambiguation, etc.
前者涉及到词法、句法、语义分析,包括汉语分词、词性标注、注音、命名实体识别、新词发现、句法分析、词义消歧等。
In this paper, Chinese word segmentation is introduced first, and then a bidirectional matching segmentation algorithm is designed, which effectively reduces the impact of ambiguous words on correct segmentation.
本文首先对中文文本分词进行了介绍,在常用分词算法的基础之上设计了一种双向匹配分词算法,有效的减少了歧义词对正确分词的影响。
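The sentence above does not say how the bidirectional matching algorithm chooses between its two passes. One common textbook heuristic, sketched here as an assumption rather than as the paper's method, runs forward and backward maximum matching and keeps the result with fewer tokens, then fewer single-character tokens, preferring the backward result on a full tie. All names and the toy lexicon are hypothetical.

```python
def max_match(text, lexicon, max_len=3, reverse=False):
    """Greedy maximum matching, scanning from the left or (reverse=True) the right."""
    tokens = []
    remaining = text
    while remaining:
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(remaining)), 0, -1):
            piece = remaining[-size:] if reverse else remaining[:size]
            # A single character is always accepted so the scan cannot stall.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                remaining = remaining[:-size] if reverse else remaining[size:]
                break
    return tokens[::-1] if reverse else tokens


def bidirectional_match(text, lexicon, max_len=3):
    """Pick between forward and backward results (assumed tie-breaking heuristic)."""
    forward = max_match(text, lexicon, max_len)
    backward = max_match(text, lexicon, max_len, reverse=True)
    if forward == backward:
        return forward

    def cost(tokens):
        # Fewer tokens first, then fewer single-character tokens.
        return (len(tokens), sum(len(t) == 1 for t in tokens))

    return forward if cost(forward) < cost(backward) else backward


# Hypothetical toy lexicon, for illustration only.
toy_lexicon = {"研究", "研究生", "生命", "的", "起源"}
print(bidirectional_match("研究生命的起源", toy_lexicon))
```

On this input the forward pass yields 研究生 / 命 / 的 / 起源 and the backward pass yields 研究 / 生命 / 的 / 起源. Both have four tokens, but the backward result has fewer single-character tokens, so it is kept.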
Therefore, the primary issue in Chinese information processing is separating a sentence into individual words; this is the Chinese word segmentation problem.
因此中文信息处理的首要问题,就是要将句子中一个个词给分离出来,这就是中文分词问题。
The core of a Chinese search engine is key content extraction, and the bottleneck is automatic Chinese word segmentation.
中文搜索引擎的重点在于中文关键信息提取,其中的难点就是中文自动分词。
This paper presents the mechanical matching, feature lexicon, constraint matrix, grammar analysis, and semantic understanding methods proposed for automatic Chinese word segmentation.
本文给出了为汉语自动分词而提出的机械匹配法、特征词库法、约束矩阵法、语法分析法和理解切分法。