Apache Hadoop Tutorial Notes

Introduction

This exercise introduce you to the basic concepts of MapReduce using the Apache- Hadoop (v0.20.2)Framework. We implement the popular WordCount program, which is considered the “Hello World” in MapReduce programming style, during the course of this exercise. First we’ll run it using the Hadoop standalone installation, then, we’ll proceed to run the WordCount in a cluster environment.


Goals

 

Prerequisites

 

Exercise 1: How to Write a Hadoop-WordCount

Exercise 2: Running WordCount in a standalone Hadoop

Exercise 3: Setting up an Apache Hadoop Cluster

Exercise 4: Running WordCount on Cluste