Apache Hadoop Tutorial Notes


This exercise introduce you to the basic concepts of MapReduce using the Apache- Hadoop (v0.20.2)Framework. We implement the popular WordCount program, which is considered the “Hello World” in MapReduce programming style, during the course of this exercise. First we’ll run it using the Hadoop standalone installation, then, we’ll proceed to run the WordCount in a cluster environment.





Exercise 1: How to Write a Hadoop-WordCount

Exercise 2: Running WordCount in a standalone Hadoop

Exercise 3: Setting up an Apache Hadoop Cluster

Exercise 4: Running WordCount on Cluste