信息抽取是从自由文本语料库构建数据库,实现情报自动收集的有效途径之一。
Information extraction is a main approach for constructing database from free text corpus and for automatic collecting intelligence information.
算法不依赖特定的模板,因此可以适应论坛模板的周期性变化,自动抽取结构化数据。
It does not depend on specific template, thus is able to adapt to periodical changes of forum template and extract structured data automatically.
并探讨了在元数据中利用模板自动抽取信息的一些原理及方法。
This article also probes into some theories and methods of automatic extraction of information from metadata by using template mining.
本文利用现有的信息检索技术,对海量数据集上自动抽取关键词问题进行了研究,给出了一个基于特征组合的关键词自动抽取方法。
With the current technology of Information Retrieval, this paper proposes a method of automatic keyword extraction from massive data sets based on feature combination.
实验结果证明,该方法能不依赖科技文献网页的来源而自动地抽取相关信息,并能保证较高的数据抽取回召率和查准率。
Experimental result shows this method automatically extracts the information ignoring where Web sites the pages come from and has high accuracy in terms …
实验结果证明,该方法能不依赖科技文献网页的来源而自动地抽取相关信息,并能保证较高的数据抽取回召率和查准率。
Experimental result shows this method automatically extracts the information ignoring where Web sites the pages come from and has high accuracy in terms …
应用推荐