【Key words】 Text categorization; Feature selection; Chinese word segmentation; Categorization algorithm; Information entropy
KNN categorization algorithm
Text categorization algorithm
Naive Bayesian categorization algorithm
The author focuses in particular on analyzing feature representation and feature extraction techniques, as well as text categorization algorithms.
Source: "Application of SVM in Text Categorization" (SVM在文本分类中的应用)
The KNN algorithm is a commonly used and fairly effective text categorization algorithm.
A centroid-based text categorization algorithm performs poorly when the documents of a class are widely dispersed or their distribution has more than one peak.
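The multi-peak weakness of a centroid-based classifier can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the 2-D points stand in for document feature vectors, and the names TRAIN, QUERY, centroid_classify and knn_classify are hypothetical, not taken from the cited paper. Class "A" has two separate clusters, so its centroid falls between them, close to class "B".

```python
import math
from collections import Counter

# Toy 2-D "document vectors" (assumed data). Class A has two peaks.
TRAIN = {
    "A": [(0, 0), (0, 1), (1, 0), (10, 0), (10, 1), (9, 0)],
    "B": [(4, 0), (4, 1), (4, -1), (3, 0), (5, 0)],
}
QUERY = (0.5, 0.5)  # lies inside the left-hand cluster of class A


def centroid(points):
    """Component-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def centroid_classify(train, query):
    """Assign the label whose class centroid is nearest to the query."""
    cents = {label: centroid(pts) for label, pts in train.items()}
    return min(cents, key=lambda label: math.dist(cents[label], query))


def knn_classify(train, query, k=3):
    """Assign the majority label among the k nearest training points."""
    scored = [
        (math.dist(p, query), label)
        for label, pts in train.items()
        for p in pts
    ]
    nearest = sorted(scored)[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


print(centroid_classify(TRAIN, QUERY))  # B (misclassified)
print(knn_classify(TRAIN, QUERY))       # A
```

In this toy set the centroid of class A is about (5, 0.33), which is farther from the query than the centroid of class B at (4, 0), so the centroid rule picks the wrong class, while the three nearest neighbours all belong to class A. Real text categorization would use high-dimensional term vectors and usually cosine similarity rather than Euclidean distance.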
To address this problem, this paper proposes an improved text categorization algorithm that outperforms the classical centroid-based categorization algorithm.