研究了一种基于核的最大散度差准则的文本特征抽取方法。
This paper studied a method of extracting the text features based on the kernel and scatter difference.
在此基础上,结合用户的个人兴趣,给出了文本特征抽取机制、文本推荐机制、文本与信息需求模型的匹配机制。
Also put forward are the approach for text feature extraction, the pattern of user annotations, and the mechanism for matching texts and profiles.
文本分析可以抽取出一组代表文档特征的关键词。
Text analysis can extract a set of keywords that characterize the document.
基于规则的主要思路是通过分类文本的特征、结构等信息,寻找到一些用于抽取的规则。
The main idea to rules-based model use text documents of the characteristics, structure and so on, to find some rules for extraction.
利用潜在语义分析进行特征抽取,消除多义词和同义词在文本表示时造成的偏差,并实现文本向量的降维。
Using latent semantic analysis to extract feature, the affect of synonymy and polysemy in text representation process is eliminated and the dimension of text vector is reduced.
抽取电子邮件和手机短信的多种文本特征,分别在TREC07P电子邮件语料和真实中文手机短信语料上进行了垃圾信息过滤实验。
Through multiple text features extraction from email and short message service (SMS) document, some spam filtering experiments are run on TREC07P email corpus and real Chinese SMS corpus separately.
中文文本的特征项抽取和表示是中文文本过滤基础。
Text feature extraction and representation is the fundamental operation for Chinese Text Filtering.
通过分析主客观文本之间存在的差别来抽取能够区别它们的一些特征。
By analysis of subjective and objective texts, we can select some effective features.
借助特征聚类进行特征抽取是信息检索领域进行文本特征降维的重要手段之一。
In the domain of information retrieval, using feature clustering to extract the features is one of the most important means in the reduction of text dimension.
在此提出一种基于类别核心词的概念映射方法,首先从文本中抽取类别核心词,借助《知网》将特征词映射到基于类别核心词的概念空间,然后在概念空间上完成文本分类工作。
The idea is to extract the core words of class first, then use HowNet to map key words space to concept space based on core words, finally finish the text classification pr.
首先采用模式聚合理论进行特征抽取,将对文本分类具有相似贡献的特征合并,映射为新的特征空间。
Firstly, using pattern aggregation theoretical models to extract features, merge the features which have the similar contributions to text classification, then a new mapping feature space is formed.
该部分通过抽取网页的特征项,形成文本向量,然后与中心向量进行相似度计算后,根据相似度的结果来对网页进行自动分类。
After calculate the text vector and main vector, the system will judge which kind the page is by the result of calculation.
文本探讨了描述地震剖面层状连续构造的一种特征抽取方法。
This paper describes an approach to feature extraction and description of continuous stratified structure from the seismogram for pattern recognition.
文本探讨了描述地震剖面层状连续构造的一种特征抽取方法。
This paper describes an approach to feature extraction and description of continuous stratified structure from the seismogram for pattern recognition.
应用推荐