HBase Tutorial

SALSA Group
PTI Indiana University
June 29th 2012

OverView

"HBase is an open source, non-relational, distributed database modeled after Google's BigTable and is written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed Filesystem), providing BigTable-like capabilities for Hadoop." (Wiki) This HBase tutorial shows examples of how to create and access HBase tables via HBase shell, and how to make HBase MapReduce programs. In addition, we provide HBase assignments as well.

Contents List

   1. HBase User Shell Guide
   2. HBase Hands-on-1 Loading CSV file to HBase table
   3. HBase Hands-on-2 Loading Clueweb file to HBase table
   4. HBase Assignmente-1: WordCount
   5. HBase Assignmente-2: Create Table
   6. HBase Presentation Slides