今天让我们一起来看看流行的全文检索引擎——ApacheLucene与Lucene.Net。
Today, let's take a look at two popular full-text search engines: Apache Lucene and Lucene.Net.
Lucene是一个开源的信息检索搜索引擎,以它的全文本索引能力和搜索互联网能力而著名。
Lucene is an open source information-retrieval search engine, best known for its full-text indexing capabilities and its ability to search the Internet.
然而由于传统的搜索引擎基本上都是采用基于关键词匹配的全文检索技术,导致检索结果不全、无关信息过多。
However, because traditional search engines mostly rely on full-text retrieval based on keyword matching, their results are often incomplete and cluttered with irrelevant information.
为应对这个挑战,在搜索引擎系统中引入了分布式计算和倒排文档全文检索技术。
To meet this challenge, distributed computing and inverted-file full-text retrieval were introduced into the search engine system.
从本质上来说,搜索引擎是全文检索技术最主要的一个应用。
In essence, the search engine is the foremost application of full-text retrieval technology.
论文的主要工作是基于Lucene搜索引擎,设计并实现了一个中文全文信息检索原型系统。
The main task of the thesis is the design and implementation of a Chinese full-text information retrieval prototype system based on the Lucene search engine.
全文检索是现代信息检索技术的一个非常重要的分支,它是处理非结构化数据的强大工具,也是搜索引擎的核心技术之一。
Full-text retrieval is a very important branch of modern information retrieval. It is a powerful tool for handling unstructured data and one of the core technologies of search engines.
但与通用全文搜索引擎类似,全文检索的垂直搜索引擎存在着查全率较低、网络资源消耗过多等问题。
However, like general-purpose full-text search engines, vertical search engines built on full-text retrieval suffer from low recall and excessive consumption of network resources.
目前垂直搜索引擎采用与通用全文搜索引擎类似的全文检索系统结构,在专业相关度方面具有相当高的水平。
At present, vertical search engines use a full-text retrieval architecture similar to that of general-purpose full-text search engines, and they achieve a fairly high level of domain relevance.