Hadoop

Getting Started with Hadoop 2.0

August 8, 2014

Others

9 minutes

Apache™ Hadoop® is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. It is designed to scale up from a single server to thousands of machines, with a very high degree of fault tolerance. Rather than relying on high-end hardware, the resiliency of these clusters comes from the software’s ability to detect and handle failures at the application layer. Hadoop 1 popularized MapReduce programming for batch jobs and demonstrated the potential value of large scale, distributed processing.

Running Hadoop 1.1.2 on Ubuntu Linux (Single-Node Cluster)

June 14, 2013

Others

1661 words

In this tutorial I will describe the required steps for setting up a pseudo-distributed, single-node Hadoop cluster backed by the Hadoop Distributed File System, running on Ubuntu Linux. This tutorial has been tested with the following software versions: Ubuntu 13.04 Apache Hadoop 1.1.2 (Released on February 15th, 2013) Prerequisites Oracle Java 7 Hadoop requires a working Java 1.5+ (aka Java 5) installation. In this tutorial, I will describe the installation of Java 1.

Infinite Script

Hadoop

Getting Started with Hadoop 2.0

Running Hadoop 1.1.2 on Ubuntu Linux (Single-Node Cluster)

Search

Recent Projects

Recent Posts

Categories

Tags

Who is visiting?