Advanced Hadoop using Pig, Hive, and HBase Training

Answers to Popular Questions:

 
Yes, this class can be tailored to meet your specific training needs.
Yes, we provide Cloud consulting services.
Yes, group discounts are provided.

Course Description

 
This course delves into data management in HDFS, advanced Pig, Hive, and Hbase. These advanced programming techniques will be beneficial to experienced Hadoop developers. Course Topics... Data Management in HDFS... Advanced Pig ... Advanced Hive ... Advanced HBase
Course Length: 3 Days
Course Tuition: $1190 (US)

Prerequisites

This course is intended for experienced software developers and architects who know the basics of Hadoop and looking for advanced programming techniques.

Course Outline

 
 
I. Data Management in HDFS
A. Various Data Formats (JSON / Avro / Parquet)
B. Compression Schemes
C. Data Masking
D. Labs
 
II. Advanced Pig
A. User-defined Functions
B. Introduction to Pig Libraries (ElephantBird / Data-Fu)
C. Loading Complex Structured Data using Pig
D. Pig Tuning
E. Labs
 
III. Advanced Hive
A. User-defined Functions
B. Compressed Tables
C. Hive Performance Tuning
D. Labs
IV. HBase
 
A. Advanced Schema Modeling
B. Compression
C. Bulk Data Ingest
D. Wide-table / Tall-table comparison
E. HBase and Pig
F. HBase and Hive
G. HBase Performance Tuning
H. Labs
 
V. Final Project
A. End-to-End Project includes use of Learned Technologies

Course Directory [training on all levels]

Upcoming Classes