自顶向下则着重于利用现成的页面信息,从中自动抽取出有意义的信息。
The top-down approach is focused on leveraging information in existing web pages, as-is, to derive meaning automatically.
信息抽取是从自由文本语料库构建数据库,实现情报自动收集的有效途径之一。
Information extraction is a main approach for constructing database from free text corpus and for automatic collecting intelligence information.
信息获取:搜索算法、信息抽取、自动应答、跨语言获取、多媒体获取。
Information retrieval: search algorithms, information extraction, question answering, cross-lingual retrieval and multimedia retrieval.
应用推荐