• 然后搜索引擎关键技术基础上,基于一个轻量级架构设计搜索引擎的三个主要模块网页爬虫索引器搜索器。

    Then, on basic of search engine's core technologies, based on a lightweight architecture, its three main modules were designed: crawler, indexer and searcher.

    youdao

  • 例如网站可能排名如果服务器停止服务网页爬虫如果已经改变了的网址,有很大一部分你网站的页面

    For example, a site may not rank well if your server stops serving pages to Googlebot, or if you've changed the URLs for a large portion of your site's pages.

    youdao

  • 爬虫程序不同Web站点时,建立数据库数据库中包括它所爬过站点网页所包含的链接、每一分析结果数据。

    As the program crawled the various Web sites, it would build a database of the sites and pages crawled, the links each page contained, the results of analysis on each pages, and so on.

    youdao

  • 每个搜索引擎都有自己爬行网页自动化程序叫做网络蜘蛛web spider)”“网络爬虫(web crawler)”。

    Each search engine has its own automated program called a "web spider" or "web crawler" that crawls the web.

    youdao

  • beacon称为网络爬虫(Webbug)”像素”,可以在网页运行软件

    Beacons, also known as "Web bugs" and "pixels," are small pieces of software that run on a Web page.

    youdao

  • SEO影响URL中的关键词有助于告诉爬虫网页哪些内容有关。

    SEO impact: Keywords in the URL help tell the spider what the page is about.

    youdao

  • 目前这些数据可以通过非均匀方(heterogeneous)式访问比如通过语义网页浏览器或者通过语义搜索引擎爬虫收录

    The data sets currently can be accessed in heterogeneous ways; for example, through a semantic web browser or by being crawled by a semantic search engine.

    youdao

  • metarobots标签是如何影响搜索引擎爬虫抓取索引显示网页的?

    How can the meta robots tag impact how search engines crawl, index and display content on a web page?

    youdao

  • 聚焦网络爬虫并不追求大覆盖,而将目标定为抓取某一特定主题内容相关网页面向主题的用户查询准备数据资源。

    The main goals of focused web crawler are to get more web pages which are correlative with a certain topic and prepare data for users querying.

    youdao

  • 然而目前主题爬虫采用两种基本抓取网页方式效率比较低下。

    However the current two ways of web crawling used by focus crawler are low efficiency.

    youdao

  • 在此基础上设计实现了一个主题爬虫系统,该系统利用主题敏感HITS计算网页优先级

    Then a topic crawler system was designed and implemented, employing topic sensitive Hyperlink-Induced Topic Search (HITS) to predict the priority of fetched Web pages.

    youdao

  • 传统聚焦爬虫抓取的目标特定主题内容相关网页有些应用中网络目录,更多的是用户提供主题相关网站

    Traditional focused crawler is targeting web pages that are relevant to some specific topics. But some applications, such as web directory, are providing users with relevant websites.

    youdao

  • 本文提出了一种维护WAP网站网络爬虫系统系统可以自动WAP网站,并对网页进行分析检查语法语义错误

    This paper provides a Maintaining WAP Site Crawler system. This system can automatically traverse the WAP site, parse every page in the site and check syntax and semantic faults.

    youdao

  • 网络爬虫一个可以因特网上自动提取网页系统搜索引擎从万维网上下载网页,是搜索引擎的重要组成

    Web crawler is a system which can automatically get web pages from Internet. It helps searching engine download web pages, so it is an important part of searching engine.

    youdao

  • 通过网络爬虫技术实现互联网网页内容进行提取提取的网页进行文本图像识别。

    Through the web crawler technology to realize the extracting of the content on the web page, and the recognizing of the text and image appeared on the web page.

    youdao

  • 即对爬虫程序网站内获取链接采用URL比较法进行先过滤,去掉不满足匹配条件网页

    Then the hyperlinks are filtered by the method of URL comparison and the ones which satisfy matching condition are left.

    youdao

  • 即对爬虫程序网站内获取链接采用URL比较法进行先过滤,去掉不满足匹配条件网页

    Then the hyperlinks are filtered by the method of URL comparison and the ones which satisfy matching condition are left.

    youdao

$firstVoiceSent
- 来自原声例句
小调查
请问您想要如何调整此模块?

感谢您的反馈,我们会尽快进行适当修改!
进来说说原因吧 确定
小调查
请问您想要如何调整此模块?

感谢您的反馈,我们会尽快进行适当修改!
进来说说原因吧 确定