本文分析了中文分词、文本预处理和压缩、搜索引擎的原理、工作流程、查询处理流程等技术。
This thesis emphatically analyzed the content about Chinese participle, text pretreatment and compress, the principle of search engine, workflow, search handle process etc.
搜索引擎的技术涉及到自然语言理解、中文分词、人工智能、机器学习等学科。
Search engine technology related to natural language understanding, Chinese word segmentation, artificial intelligence, machine learning and so on.
中文分词是校园网搜索引擎项目的一个核心技术。
Word segment was a core technology of campus search engine project.
迅雷资源搜索引擎索引器的实现,主要是如何建立中文分词和倒排表。
The implement of XunLei indexer, it mainly include how to create Chinese participle and reverse table.
其中主要的工作包括:1大规模中文信息处理是构建中文搜索引擎的基本环节,为了实现大规模中文信息处理,本文提出了一种改进的中文分词算法。
The major work includes:1 Propose an improved Chinese word segmentation algorithm for large-scale Chinese information processing, which is the basic phase of the building of Chinese search engine.
中文搜索引擎的重点在于中文关键信息提取,其中的难点就是中文自动分词。
The core of Chinese Search Engine is the key content extracting, and the bottleneck is Chinese Word Automatic Segmentation.
对于基于词的搜索引擎等中文处理系统,分词速度要求较高。
The speed of Chinese word segmentation is very important for many Chinese NLP systems, such as web search engines based on words.
中文分词是搜索引擎中比较重要的部分,本文分析了正向和逆向的最大匹配分词以及基于统计的分词方法。
The design and implementation of"Chinese Word Segmentation"is the most important part of the search engine technology.
中文分词是搜索引擎中比较重要的部分,本文分析了正向和逆向的最大匹配分词以及基于统计的分词方法。
The design and implementation of"Chinese Word Segmentation"is the most important part of the search engine technology.
应用推荐