文中详细介绍了数据仓库的建模、数据抽取与转换、数据存储与管理、元数据管理以及可视化数据分析等技术。
The paper introduces in detail technologies such as data warehouse modeling, data extraction and transformation, data storage and management, metadata management, and visual data analysis.
并探讨了在元数据中利用模板自动抽取信息的一些原理及方法。
It also explores some principles and methods for automatically extracting information from metadata using templates.
结合本体技术,提出了一种新的从文档中抽取引文元数据信息的方法。
Drawing on ontology technology, this paper proposes a new method for extracting citation metadata from documents.
论坛的结构化数据抽取是对论坛中帖子的标题、作者、发表时间和内容文本块等论坛元数据的抽取,它是处理论坛数据的基础。
Structured data extraction from forums is the extraction of forum metadata such as post title, author, posting time, and content text block; it is the foundation for processing forum data.
实验结果表明该方法对论坛帖子的标题、作者、发表时间和内容文本块等元数据的抽取达到了较高的准确率。
Experimental results show that the method achieves high accuracy in extracting forum post metadata such as title, author, posting time, and content text block.
其次,根据自定义规则抽取出了特征句子中的三元组,表示成本体,同样分别存放在数据库的另外两个表中。
Second, triples are extracted from the feature sentences according to custom rules, represented as an ontology, and likewise stored separately in two other database tables.
Oracle9i增加了互联网查找,从丰富内容中抽取和索引元数据的强大工具,以及查找XML数据和编目结构化数据的能力。
Oracle9i adds Internet search, powerful tools for extracting and indexing metadata from rich content, and the ability to search XML data and catalog structured data.
在此基础上对数据仓库管理中的元数据管理、数据抽取与集成及如何提高数据仓库的性能等作了详细阐述。
On this basis, the paper discusses in detail several aspects of data warehouse management: metadata management, data extraction and integration, and how to improve data warehouse performance.