该算法依据训练文本集的特征词句子环境,获取识别文本主题类别的特征词集合。
Both of the algorithms based on the context of feature words in sentence of training texts can get a set of feature words that identify the category of a text.
针对训练文本集中往往存在多个主题类别的问题,提出一种基于聚类分析策略的文本偏好挖掘方法。
To solve the problem of multi-topic problem in training documents, an approach which is based on cluster analysis has been introduced.
自动文本分类技术就是对大量的自然语言文本按照一定的主题类别进行自动分类,它是自然语言处理的一个十分重要的问题。
Automatical text Categorization is categorizing natural language texts according to given topics, which is a very important problem in natural language processing field.
应用推荐