PD F文件网络信息抽取的重要资源。
PDF files are important resource of Internet information extraction.
共指消解是信息抽取中一个重要子任务。
The coreference resolution is an important subtask of information extraction.
提出了基于文档上下文查询信息抽取算法。
Presentation of an algorithm for extracting the query information based on the context of document.
信息抽取技术能够提供高质量的检索服务。
Information extraction technologies can provide high quality retrieval service.
识别句子中实体关系是信息抽取的重要技术。
Identifying entity relation of sentence is important technology of information extraction.
机构名识别在信息抽取中是一个重要研究内容。
Identification of Organization names is a very important content in information extraction.
信息抽取是一种实用性强的自然语言处理技术。
Information extraction is a practical natural language processing technology.
本文研究基于包装器模型的文本信息抽取算法。
This thesis mainly studies relative algorithms on text information extraction based on wrapper model.
实体关系抽取是信息抽取领域中的重要研究课题。
Entity Relation Extraction is an important research field in Information Extraction.
提出了一种剪枝信息熵增较大结点的信息抽取方法。
This paper presents a method of information extraction by pruning the nodes of which information entropy production reach a certain extent.
本文使用标准的XML技术来解决网页信息抽取问题。
We apply standard technologies of XML to web information extraction problem.
生物实体名识别对生物医学文献的信息抽取有重要的意义。
Identification of biomedical entities is one of important techniques to extract information from biomedical documents.
信息获取:搜索算法、信息抽取、自动应答、跨语言获取、多媒体获取。
Information retrieval: search algorithms, information extraction, question answering, cross-lingual retrieval and multimedia retrieval.
事件抽取是目前信息抽取研究领域的一个新的重要的研究课题。
Event Extraction is a new research point in the area of Information Extraction.
与信息检索不同,信息抽取直接从自然语言文本中抽取事实信息。
Unlike information Retrieval, information Extraction Systems extract factual information directly from natural language texts.
信息抽取中的关键问题是如何编写健壮、准确和通用的抽取规则。
The key problem in information extraction is how to generate accurate, general, and robust extraction rules.
文章分析了信息抽取的概念、主要分析了信息抽取的类型和功能。
This article analyses the types of information sampling and its function.
因此,这种用于汉语信息抽取的词汇领域本体模型是合理和有效的。
Therefore, this lexical domain ontology model used for Chinese information extraction is reasonable and effective.
目前出现了基于不同原理的多种信息抽取技术,它们具有不同的性能。
Now many information extraction techniques based on different principle have appeared and have different capabilities.
本文最主要的工作是构造了一个基于XML的PD F信息抽取系统。
The core work of this essay is to develop a system of PDF Information Extraction based on XML.
其次,本文采用中文信息抽取技术抽取非结构化数据包含的实体相关信息。
Secondly, in this paper, we will research the Chinese information extraction technology to extract the entities from the unstructured data.
信息抽取是从自由文本语料库构建数据库,实现情报自动收集的有效途径之一。
Information extraction is a main approach for constructing database from free text corpus and for automatic collecting intelligence information.
基于上述算法从二维平面信息抽取含有高度的三维曲面信息,重建三维曲面。
Picking up the height info of the three dimensional surface from two dimensional plane based on the arithmetics hereinbefore, and reconstructing the three dimensional surface.
将网页信息抽取知识分为若干层,由抽象到具体逐层描述信息识别模式知识。
The knowledge used in this method (called HPIE) is composed of a few kinds of pattern descriptions, from abstract to concrete for information recognition patterns.
针对训练数据来源的多样化,提出了基于多模板隐马尔可夫模型的文本信息抽取算法。
This paper proposes a new algorithm using hidden Markov model for information extraction based on multiple templates due to the variety of training data.
信息抽取研究旨在为人们提供更有力的信息获取工具,以应对信息爆炸带来的严重挑战。
The research on information Extraction aims at providing more powerful information access tools to help people overcome the problem of information overloading.
在信息抽取的研究领域,有两条主要的技术路线:基于规则的路线与基于统计模型的路线。
There are two main routes in the study area of information extraction: rules-based model and statistics-based model.
在上述基础上,重点对两种流行的文档格式html和PDF的信息抽取的实现进行了研究。
Based on these work above, this paper focuses on the realization of extracting information from HTML and PDF documents.
为了实现提高信息抽取过程中的准确率与覆盖率,在信息抽取检索系统中,引入了领域本体。
In order to enhance the rate of accuracy and coverage fraction in the information extraction process, it has introduced the domain main body in the information extraction retrieval system.
通过构造并填充通用标绘指令模板,实现标绘信息抽取并适配不同标绘软件以实现军事标绘。
Plotting commands were expressed with general plotting templates and its interpreters for different military plotting softwares were built.
应用推荐