项的权重计算(term weighting)普遍使用的 是 tfidf(term frequency/ inverse document frequency)函数[23],其定义如下: Tr tfidf (t k , d j ) =# (t k , d j ...
基于16个网页-相关网页
query term weighting 包括海量数据挖掘
term-weighting 特征权重分析
index term weighting 加权的标应词
term weighting system 词加权系统
Secondly,The term weighting which is of great importance in LSI is studied in detail, and a new weighting design based on non- linear function and location factor was proposed. The retrieval performance has been improved further.
其次对LSI的重要优化过程——权重计算进行了深入分析,提出了一种基于“非线性函数”和“位置因子”的新权重方案,并对其效果进行了对比验证。
参考来源 - 基于潜在语义索引的中文文本检索研究Moreover, the improved algorithms of term weighting and reduction of dimensionality also show their effectiveness.
实验证明,多层次的分类算法较以往的单层次的分类系统相比,在分类的精度和速度上都有明显的提高,改进的特征加权算法和维数缩减方法也有效地改善了分类器的性能。
参考来源 - 多层次中文文本分类技术的研究·2,447,543篇论文数据,部分数据来源于NoteExpress
This article aims to improve the algorithm of term weighting in automated text classification.
文章研究并改进了文本自动分类中的特征权重算法。
The algorithm imports title weight coefficient to improve the term weighting in conjuction with a new classification algorithm.
引入标题权重系数改进词语权重,并提出了一种新的分类算法。
Text representation approaches with term weighting schemes such as commonly used TF/IDF are widely used to extract indexing terms of documents.
文本表达是指将表达文献主题内容的词汇抽取出来的过程。 常用的向量空间表达法主要采用TF/IDF等权重法。
应用推荐