The main contributions of this paper include: 1. A method for extracting query interface schemas based on a web page tag-tree model is proposed, in which an object model represents the query interface, and a concrete implementation algorithm is given.
主要内容包括:1.提出了一种基于网页标记树模型的查询接口模式抽取方法,用对象模型表示查询接口,并给出具体实现算法。
Source: Deep Web数据集成研究及其在购书领域中的应用
To address the imprecise schema information of XML documents, an XML schema extraction method based on fuzzy decision trees is proposed.
针对XML文档的模式信息不精确的特点,提出了基于模糊决策树的XML模式抽取方法。
Source: XML数据库查询优化及相关技术研究
Vision-based page segmentation techniques are analyzed, and on this basis methods for extracting Deep Web query interface schemas and result page schemas are proposed.
分析基于视觉特征的网页分割技术,在此基础上提出Deep Web查询接口模式和结果页面模式抽取的方法。
Source: Deep Web模式获取技术研究与应用
模式抽取在半结构化数据研究领域中具有重要意义。
Schema extraction is of great importance in the field of semi-structured data research.
它有一个包括图形获取、预处理、模式抽取、搜索与解释、图形复合与输出等环节的工作过程。
Its workflow comprises stages such as graphics acquisition, preprocessing, pattern extraction, search and interpretation, and graphics composition and output.
数据挖掘旨在使用统计方法、人工智能和标准的数据库管理技术等等,从大型数据集中抽取模式。
Data mining seeks to extract patterns from large sets of data using, among other things, statistical methods, artificial intelligence, and standard database management techniques.