Apache Hadoop supports three deployment modes.
Apache Hadoop支持三种部署模式。
Apache Hadoop is the nucleus of the ecosystem.
Apache Hadoop是生态系统的核心。
Apache Hadoop is a software framework (platform) that enables distributed processing of vast amounts of data.
Apache Hadoop是一个软件框架(平台),它可以分布式地操纵大量数据。
Before we dive into Apache Hadoop, we will give a brief introduction to the structure of the cloud computing system.
在讨论Apache Hadoop之前,我们先简要介绍一下云计算系统的结构。
It is built using Apache Hadoop for hourly index updates and Apache HBase to provide random access to item information.
它使用Apache Hadoop来支持每小时进行的索引更新,使用Apache HBase对随机存取信息提供支持。
Few details are available, except that Microsoft has promised to maintain compatibility with the Apache Hadoop codebase and to contribute back to the open source project.
目前还没有太多的细节,只知道Microsoft承诺会保持与Apache Hadoop的兼容性,并且将代码贡献给开源项目。
For example, IBM InfoSphere BigInsights analytics software starts with the open-source project Apache Hadoop, but substitutes its own file system and adds other proprietary technology.
例如,IBM InfoSphere BigInsights分析软件起源于开源项目Apache Hadoop,但使用了它自己的文件系统并添加了其他专门技术。
Whether they rely almost solely on the Apache Hadoop code, as Cloudera does, or not, as with EMC, vendors need to show potential customers that they can address real-world needs.
无论是像Cloudera那样主要依赖Apache Hadoop代码的方式还是像EMC这样不依赖Apache Hadoop代码的方式,厂商们都需要向潜在客户显示他们满足现实世界需求的能力。
Hadoop is an Apache Software Foundation project that consists of a set of tools for storing and processing large amounts of unstructured data.
Hadoop是一个Apache软件基金会项目,包含一系列用于存储和处理大量非结构化数据的工具集。
Now a top-level Apache project, Hadoop is supported and used by many companies such as IBM, Google, Yahoo!, and Facebook, and has become the de facto industry framework for large-scale data processing.
Hadoop现在是顶级Apache项目,IBM、Google、Yahoo!和Facebook等许多公司都支持和使用Hadoop,它已经成为大规模数据处理方面事实上的行业标准框架。
In the example, you used Hadoop to process Apache web server access logs.
在示例中,您使用Hadoop处理Apache web服务器访问日志。
Apache provides a great set of resources for streaming, including the Hadoop streaming documentation and the streaming wiki (which provides a good introduction to the various command-line options).
Apache为流提供了一套非常好的资源,包括Hadoop流文档和流wiki(为各种命令行选项提供了很好的介绍)。