Vertical search engines emerged to meet users' need for specialized search. A topic crawler fetches Web pages in a specific domain and saves them to a page repository, which the vertical search engine then uses.
垂直搜索引擎正是在用户对专业化需求的环境下产生的,它通过主题爬虫对Web上特定领域的网页进行抓取,保存成网页库,然后被垂直搜索所使用。
Source: 基于领域本体的主题爬虫研究及实现
Based on a study of topic crawlers that filter URLs using different strategies, a topic crawler based on a dynamic topic base is proposed.
通过对基于不同策略过滤url的主题爬虫的研究,提出了一种基于动态主题库的主题爬虫。
To adapt to the dynamic nature and completeness of topics, this article proposes a mixed-strategy topic crawler based on Web log analysis.
为适应主题的动态性和完整性,本文提出了一种基于网络日志分析的混合策略主题爬虫。
On this basis, a topic crawler system was designed and implemented; it uses topic-sensitive Hyperlink-Induced Topic Search (HITS) to compute the priority of Web pages.
在此基础上设计并实现了一个主题爬虫系统,该系统利用主题敏感HITS来计算网页优先级。