Hierarchical categorization of Chinese documents is studied.
Cluster analysis is an important research topic in data mining and is widely used in fields such as information filtering, automatic document categorization, and bioinformatics.
An ontology is a description of concepts and the relations among them, so ontology-based document categorization is categorization at the level of knowledge and semantics.
The pipeline processes documents before they are indexed in the full-text index, and it provides automated summarization, document format conversion, and categorization services (see Figure 1).
This categorization makes it easier for others looking for the same information to find the document later.
Text categorization (TC) is a technique for automatically assigning a document to a predefined class.
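As a rough illustration of assigning a document to a predefined class, the sketch below trains a small multinomial Naive Bayes classifier with add-one smoothing. The choice of Naive Bayes, the class name `NaiveBayesClassifier`, and the toy training data are all assumptions made for this example; the sentence above does not name a particular algorithm.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesClassifier:
    """Toy multinomial Naive Bayes over pre-tokenized documents (illustrative only)."""

    def fit(self, docs, labels):
        # Count documents per class and term occurrences per class.
        self.class_counts = Counter(labels)
        self.term_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.term_counts[label].update(tokens)
        self.vocabulary = {t for tokens in docs for t in tokens}
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            n_terms = sum(self.term_counts[label].values())
            # Log prior plus log likelihood of each known term (add-one smoothing).
            score = math.log(n_docs / total_docs)
            for t in tokens:
                if t in self.vocabulary:
                    count = self.term_counts[label][t]
                    score += math.log((count + 1) / (n_terms + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical training data with two predefined classes.
train_docs = [
    ["goal", "match", "team"],
    ["team", "coach", "goal"],
    ["stock", "market", "price"],
    ["price", "bank", "stock"],
]
train_labels = ["sports", "sports", "finance", "finance"]

classifier = NaiveBayesClassifier().fit(train_docs, train_labels)
predicted = classifier.predict(["goal", "coach"])
```

Terms that never appeared in training are skipped at prediction time, which is one simple convention among several.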
To overcome the shortcomings of information gain as a feature reduction method in text categorization, this paper proposes a feature reduction method based on relative document frequency balanced information gain (RDFBIG).
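For context, the sketch below computes the standard information gain of a term and uses it to rank features. It is not the RDFBIG method, whose definition is not given here; the function names `entropy`, `information_gain`, and `select_features` and the toy corpus are assumptions for illustration. A term's information gain is the class entropy minus the expected class entropy after splitting documents by whether they contain the term.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(term, docs, labels):
    """IG(t) = H(C) - [P(t) H(C|t) + P(not t) H(C|not t)]."""
    with_term = [lab for doc, lab in zip(docs, labels) if term in doc]
    without_term = [lab for doc, lab in zip(docs, labels) if term not in doc]
    total = len(labels)
    conditional = ((len(with_term) / total) * entropy(with_term)
                   + (len(without_term) / total) * entropy(without_term))
    return entropy(labels) - conditional


def select_features(docs, labels, k):
    """Keep the k terms with the highest information gain (ties broken alphabetically)."""
    vocabulary = {t for doc in docs for t in doc}
    ranked = sorted(vocabulary,
                    key=lambda t: (-information_gain(t, docs, labels), t))
    return ranked[:k]


# Hypothetical corpus: each document is a set of terms.
docs = [
    {"goal", "match"},
    {"goal", "team"},
    {"stock", "market"},
    {"stock", "team"},
]
labels = ["sports", "sports", "finance", "finance"]

top_terms = select_features(docs, labels, k=2)
```

In this toy corpus "goal" and "stock" separate the classes perfectly, while "team" appears once in each class and therefore carries no information about the class.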
Most categorization algorithms represent a text or document using the vector space model.
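A minimal sketch of the vector space model follows: each document becomes a vector of term counts over a shared vocabulary, and two documents are compared by cosine similarity. Raw term frequency is an assumption made to keep the example short (weighting schemes such as TF-IDF are common in practice), and the helper names `tf_vector` and `cosine_similarity` are hypothetical.

```python
import math
from collections import Counter


def tf_vector(tokens, vocabulary):
    """Map a tokenized document to a term-frequency vector over the vocabulary."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical pre-tokenized documents.
docs = [
    ["text", "categorization", "model"],
    ["text", "categorization", "algorithm"],
    ["protein", "sequence", "alignment"],
]

# A sorted vocabulary gives every document the same vector layout.
vocabulary = sorted({term for doc in docs for term in doc})
vectors = [tf_vector(doc, vocabulary) for doc in docs]

similar = cosine_similarity(vectors[0], vectors[1])
unrelated = cosine_similarity(vectors[0], vectors[2])
```

The first two documents share two of their three terms and so score well above the third, which shares none.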
Whichever algorithm is chosen, it can to some extent compensate for the lack of semantic relations in current categorization systems and improve the accuracy of document classification.