信息抽取是从自由文本语料库构建数据库,实现情报自动收集的有效途径之一。
Information extraction is a main approach for constructing database from free text corpus and for automatic collecting intelligence information.
算法不依赖特定的模板,因此可以适应论坛模板的周期性变化,自动抽取结构化数据。
It does not depend on specific template, thus is able to adapt to periodical changes of forum template and extract structured data automatically.
并探讨了在元数据中利用模板自动抽取信息的一些原理及方法。
This article also probes into some theories and methods of automatic extraction of information from metadata by using template mining.
应用推荐