HADOOP ONLINE TRAINING
Training Program
By

KEEN IT

http://www.keentechnologies.com

Page 1
About Us
Keen IT Technologies Pvt Ltd. is one of the leading IT training Institutions, located
in Hyderabad with the objective of providing a Training services for various requirements in
IT industry. We deliver corporate trainings as per the student requirements colonize and
innovator of global eLearning solutions and providing technology enabled online training for
individuals and corporate educators. We have highly talented faculty in their respective
courses. We furnish with online training given us an edge on numerous Technologies.

Page 2
Introduction to Hadoop

Hadoop

is

a

complete,

open-source

ecosystem

for

capturing, organizing, storing, searching, sharing, analyzing, visualizing, and otherwise
processing disparate data sources (structured, semi-structured, and unstructured) in a cluster

of commodity computers. Hadoop's ability to store and analyze large data sets in parallel on a
large cluster of computers yields exceptional performance, while the use of commodity
hardware results in a remarkably low cost. In fact, Hadoop clusters often cost 50 to 100 times
less on a per-terabyte basis than today's typical data warehouse.

Page 3
Why Hadoop (and Why Now)
Organizations across all industries are confronting the same
challenge: data is arriving faster than existing data warehousing platforms are able to absorb
and analyze it. The migration to online channels, for example, is driving unprecedented
volumes of transaction and click stream data, which are, in turn, driving up the cost of data
warehouses, ETL processing, and analytics.

Page 4
Hadoop Course Out Line
Distributed computing

Parallel computing
Concurrency
Cloud Computing
Computing Past, Present and Future
Hadoop Streaming
Distributing Debug Scripts
Getting Started With Eclipse

Page 5
Hadoop Stack
CAP Theorem
Databases: Key Value, Document, Graph
Hive and Pig
HDFS
Lab 1: Hadoop Hands-on
Installing Hadoop Single Node cluster(CDH4)

Understanding Hadoop configuration files

Page 6
Map Reduce Introduction
Functional – Concept of Map and Reduce

Functional – Ordering, Concurrency, No Lock, Concurrency
Functional – Shuffling, Reducing, Key, Concurrency
Map Reduce Execution framework

Map Reduce Practitioners and Combiners
Map Reduce and role of distributed file system
Role of Key and Pairs
Hadoop Data Types

Page 7
Lab 2:
Map Reduce Exercises

Understanding Sample Map Reduce code
Executing Map Reduce code
HDFS Introduction

Architecture
File System
Data replication and Node
Name Node

Page 8
Lab 3: Hive Hands ON
Installation, Setup and Exercises
PIG
Rationale
Pig Latin
Input, Output and Relational Operators
User Defined Functions

Analyzing and designing using Pig Latin

Page 9
Lab 4: Pig Hands on
Installation and Setup
Executing Pig Latin scripts on File system
Executing Pig Latin scripts on HDFS
Writing custom User Defined Functions
Flume
What is Flume? And How it works ?

How it works ? And An example

Page 10
What is Oozie? And How it works?
Introduction to Zoo Keeper
Cluster Planning and Cloud Manager Set-up
Hadoop Multi node Cluster Setup
Installation and Configuration
Running Map Reduce Jobs on Multi Node cluster
Working with Large data sets
Steps involved in analyzing large data
Lab walk through
High Availability Fed ration, Yarn and Security

Page 11
If you require any further information please do not hesitate to contact us
please feel free to mail us for demo session or call @ 9989754807
contact: trainings@keentechnologies.com
website url: http://www.keentechnologies.com

Page 12
THANK YOU

Page 13

Hadoop Online Training | Online Hadoop Training certification in India

  • 1.
    HADOOP ONLINE TRAINING TrainingProgram By KEEN IT http://www.keentechnologies.com Page 1
  • 2.
    About Us Keen ITTechnologies Pvt Ltd. is one of the leading IT training Institutions, located in Hyderabad with the objective of providing a Training services for various requirements in IT industry. We deliver corporate trainings as per the student requirements colonize and innovator of global eLearning solutions and providing technology enabled online training for individuals and corporate educators. We have highly talented faculty in their respective courses. We furnish with online training given us an edge on numerous Technologies. Page 2
  • 3.
    Introduction to Hadoop Hadoop is a complete, open-source ecosystem for capturing,organizing, storing, searching, sharing, analyzing, visualizing, and otherwise processing disparate data sources (structured, semi-structured, and unstructured) in a cluster of commodity computers. Hadoop's ability to store and analyze large data sets in parallel on a large cluster of computers yields exceptional performance, while the use of commodity hardware results in a remarkably low cost. In fact, Hadoop clusters often cost 50 to 100 times less on a per-terabyte basis than today's typical data warehouse. Page 3
  • 4.
    Why Hadoop (andWhy Now) Organizations across all industries are confronting the same challenge: data is arriving faster than existing data warehousing platforms are able to absorb and analyze it. The migration to online channels, for example, is driving unprecedented volumes of transaction and click stream data, which are, in turn, driving up the cost of data warehouses, ETL processing, and analytics. Page 4
  • 5.
    Hadoop Course OutLine Distributed computing Parallel computing Concurrency Cloud Computing Computing Past, Present and Future Hadoop Streaming Distributing Debug Scripts Getting Started With Eclipse Page 5
  • 6.
    Hadoop Stack CAP Theorem Databases:Key Value, Document, Graph Hive and Pig HDFS Lab 1: Hadoop Hands-on Installing Hadoop Single Node cluster(CDH4) Understanding Hadoop configuration files Page 6
  • 7.
    Map Reduce Introduction Functional– Concept of Map and Reduce Functional – Ordering, Concurrency, No Lock, Concurrency Functional – Shuffling, Reducing, Key, Concurrency Map Reduce Execution framework Map Reduce Practitioners and Combiners Map Reduce and role of distributed file system Role of Key and Pairs Hadoop Data Types Page 7
  • 8.
    Lab 2: Map ReduceExercises Understanding Sample Map Reduce code Executing Map Reduce code HDFS Introduction Architecture File System Data replication and Node Name Node Page 8
  • 9.
    Lab 3: HiveHands ON Installation, Setup and Exercises PIG Rationale Pig Latin Input, Output and Relational Operators User Defined Functions Analyzing and designing using Pig Latin Page 9
  • 10.
    Lab 4: PigHands on Installation and Setup Executing Pig Latin scripts on File system Executing Pig Latin scripts on HDFS Writing custom User Defined Functions Flume What is Flume? And How it works ? How it works ? And An example Page 10
  • 11.
    What is Oozie?And How it works? Introduction to Zoo Keeper Cluster Planning and Cloud Manager Set-up Hadoop Multi node Cluster Setup Installation and Configuration Running Map Reduce Jobs on Multi Node cluster Working with Large data sets Steps involved in analyzing large data Lab walk through High Availability Fed ration, Yarn and Security Page 11
  • 12.
    If you requireany further information please do not hesitate to contact us please feel free to mail us for demo session or call @ 9989754807 contact: trainings@keentechnologies.com website url: http://www.keentechnologies.com Page 12
  • 13.