MapReduce/Hadoop Tutorial - MapReduce Programming with Apache Hadoop and Pig

sponsored by Yahoo!

Date: November 30, 2010.
Length: 3 hours

The tutorial will be given by Dr. Milind Bhandarkar from Yahoo!.

Abstract

Apache Hadoop has become the platform of choice for developing large-scale data-intensive applications. Core Hadoop components include a reliable, fault tolerant distributed file system (HDFS) and MapReduce programming framework.

In this tutorial, we will discuss design philosophy and architecture of Hadoop, describe how to design and develop Hadoop applications and higher-level application frameworks to crunch several terabytes of data, using anywhere from four to 4,000 computers. We will discuss solutions to common problems encountered in maximizing Hadoop application performance. We will also describe several frameworks and utilities developed using Hadoop that increase programmer-productivity and application-performance, focusing on Apache Pig, a parallel data-flow language that uses Hadoop as execution engine.

Biography

Milind Bhandarkar has been contributing and working with Apache Hadoop since version 0.1.0. He started the Yahoo! Grid solutions team focused on training, consulting, and supporting hundreds of new migrants to Hadoop. He has been focused on parallel programming languages and paradigms for over 20 years. He worked at the Center for Development of Advanced Computing(C-DAC), Center for Simulation of Advanced Rockets, Siebel Systems, and Pathscale Inc. (acquired by QLogic) before settling at Yahoo! in 2005.


Presenter

Milind Bhandarkar has been working with Hadoop and Pig since version 0.1.0 for both. He started the Yahoo! Grid solutions team focused on training, consulting, and supporting hundreds of new migrants to Hadoop and Pig. He has been focused on parallel programming languages and paradigms for over 20 years. He worked at the Center for Development of Advanced Computing (C-DAC), Center for Simulation of Advanced Rockets, Siebel Systems, and Pathscale Inc. (acquired by QLogic) before settling at Yahoo! in 2005.









Organized by Indiana University
ClouldCom2010 Sponsors