Course Introduction

Hadoop is an Apache open source framework written in java that allows distributed processing of large datasets across clusters of computers using simple programming models. A Hadoop frame-worked application works in an environment that provides distributed storage and computation across clusters of computers. Hadoop is designed to scale up from single server to thousands of machines, each offering local computation and storage.

To learn hadoop you should have basic idea about java programming basics and linux would be added advantage.


Hadoop Course Content

  •       Hadoop Overview
  •       Platforms and Automation
  •        Infrastructure
  •        Architecture Consideration

Use case walkthrough

  • ETL
  • Log Analytics
  • Real Time Analytics

Hbase for Developers :

  • NoSQL Introduction
  • Hbase Introduction
  • Hbase Architecture
  • Hbase Schema Design
  • Hbase Java API – Exercises
  • Hbase Operations, cluster management

MapReduce for Developers

  • Introduction
  • Hadoop in the Enterprise
  • Architecture
  • Hadoop CLI
  • MapReduce Programming
  • MapReduce Formats

Hadoop File Formats

  • MapReduce Design Considerations
  • MapReduce Algorithms
  • MapReduce Features

MapReduce Testing

Hadoop Ecosystem

HBase Introduction

Hadoop Fundamentals and Architecture

  • Hadoop Ecosystems Overview


Hardware and Software requirements

Deploy Hadoop ecosystem services

Hadoop Overview

  • Apache Hive & Pig for Developers

Hive Introduction


Hive Architecture – Building Blocks




Leave A Message

There are no any courses offered by this institute...!