The key problem in information extraction is how to generate accurate, general, and robust extraction rules.
信息抽取中的关键问题是如何编写健壮、准确和通用的抽取规则。
Extraction rules can extract form fields, text, attributes, headers, regular expressions, and hidden fields.
撷取规则可以从栏位、文字、属性、标头、规则运算式和隐藏栏位中撷取。
There are two parts in the system, the definition of the Extraction rules and the execution of the Extraction rules.
系统主要分为两个部分,抽取规则的定义阶段以及抽取规则的执行阶段。
Rather than relying on automatically recorded hidden field data, extraction rules can be manually added and customized as needed.
提取规则可以根据需要手动添加和自定义,而不是依赖于自动记录的隐藏字段数据。
Additionally, comments are valuable for making notes about validation and extraction rules that should be added to specific requests.
此外,对于加注有关应该在特定要求中加入的验证规则和撷取规则,注解也很有价值。
Through the domain ontology which is automatically generated by the extraction path, get the extraction rules of the information items.
通过信息项的抽取路径自动生成信息项的领域本体,通过信息项的领域本体解析出信息项的抽取规则。
With standard XSLT, we can exploit strong and flexible features of the language to construct simple, robust and general extraction rules.
基于标准的XSLT,可以利用它强大而且灵活的特性编写简单、健壮和通用的抽取规则。
Only when a new instance cannot be extracted does it need labeling. So it does not require an initial set of labeled pages to learn extraction rules.
只有当一个新的待抽取实例中的数据不能够被正确抽取时,系统再对其进行标注,因此算法无需初始的训练集合。
At last, this paper studied the optimization of extraction rules and compared several information location methods. The aim is to generate simple, robust and general extraction rules.
最后,本文还对提取规则的优化问题进行了研究,对几种信息定位方式进行了比较,目的是此基础上编写更为简单、健壮和通用的提取规则。
To use of rough set theory for data mining and the extraction rules of the knowledge, the most important point is that based on the attribute reduction and rule extraction algorithms of rough set.
利用粗糙集理论进行数据挖掘,抽取知识规则,最重要的一点就是基于粗糙集的属性约简和规则提取算法的研究。
The second method is pattern matching algorithm based on the patterns of dictionary definition, we form some extraction rules by hands, the system then automatic extract synonyms by pattern matching.
第二部分是利用词汇定义模式,对词汇的释义方式进行分析,归纳总结出在词典释义中同义词出现的模式,进而利用模式匹配方法获取同义词。
In order to extract simple and effective diagnostic rales from inconsistent diagnostic information, an extraction method of decision rules for fault diagnosis based on rough set theory is proposed.
为了在故障诊断信息不一致的情况下提取简单有效的诊断规则,提出了一种基于粗糙集理论的决策规则提取方法。
In addition, some redundant rules exist in the optimized population, considering the conciseness of the final rule set, so this paper presents a rule extraction method.
另外,在优化后的种群中存在一些冗余规则,考虑到规则集的简洁性,提出了一种规则提取方法。
Furthermore, a new algorithm for rule extraction based on decision matrices was presented. And much more concise decision rules could be got with this method.
同时,借助决策矩阵进行值约简,提出了一种新的规则提取算法,使最终得到的决策规则更加简洁。
After analyzing the three metrics of rules: support, confidence, coverage, this paper quantifies them and proposes an algorithm for rules extraction.
分析规则的三个度量标准:支持度、置信度、覆盖度,对其进行量化,并提出一规则提取算法。
Results showed that the equation optimization was succeeded and the extraction of EGCG complied to diffusion kinetic rules.
结果表明该方程拟合是成功的,EGCG提取符合传质扩散动力学规律。
Matrix computations for rules extraction are presented.
研究了规则提取的矩阵算法。
Finally, a heuristic algorithm for rules extraction of decision tree was designed.
以新的属性重要性为启发式信息设计决策树规则提取方法。
Missing data filling and rules extraction in incomplete decision table are two important data mining problems.
不完全信息系统中遗失数据的补充和规则的提取,一直是数据挖掘技术面临的重要问题。
The process of knowledge discovery in time series includes preprocessing of time series data, attributes reduction and rules extraction.
知识发现的过程包括时间序列数据预处理、属性约简和规则抽取三部分。
Based on manual definition of templates and rules, it aims at precise sentence extraction rather than wide recall.
基于模板和规则的人工定义,它旨在准确的句子提取,而非广泛地检索。
This paper focuses on the extraction and application of region-dependent rules for land-use data generalization.
本文主要探讨随地理区域不同而变化的区域依赖性数据综合规则的提取和应用。
This paper applies distributed data mining algorithm to IDS, takes some research on distributed pattern extraction which is based on distributed association rules algorithm.
论文把分布式数据挖掘算法运用于入侵检测系统,研究了基于分布式关联规则算法的分布式模式提取。
The method of rules extraction and calculating process with serial carry chain based on the arbitrary division are presented.
提出了基于任意分割的规则获取方法和相应的串行进位链计算流程。
The paper gives the method based on probabilistic techniques and rules for new word discovery via analyzing the current techniques of phrase extraction and combining the specialties of Chinese.
该文分析了已有短语抽取技术,并结合汉语特点,提出了基于概率统计技术和规则方法相结合的概念抽取方法。
Using rough set of the final value reduction algorithm for text classification rules extraction, thus gained the final text classification rules.
然后采用粗糙集的值约简算法来进行文本分类规则的抽取,从而得到最终的文本分类规则。
A binary neural network (BNN) applies to problems in Boolean space, Extraction of rules is a important research area of it.
二进神经网络是应用于布尔空间的神经网络,知识提取是它的一个重要研究领域。
A binary neural network (BNN) applies to problems in Boolean space, Extraction of rules is a important research area of it.
二进神经网络是应用于布尔空间的神经网络,知识提取是它的一个重要研究领域。
应用推荐