The parallel texts we use are semi-structured and were gathered from the Web automatically by a mining tool, PTMiner.
Text classification is an important branch of data mining; its task is to automatically process texts of unknown class and determine which class in a predefined set of classes each one belongs to.
Because the algorithm studied in this paper targets data mining on small texts, its time and space complexity are both low, so it is expected to become a practical and effective information retrieval technique.