An Introduction to Apache Oozie

5,134 views

Published on

An Introduction to Apache Oozie, what is it and what is it used
for ? How is it used with Hadoop ?

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
5,134
On SlideShare
0
From Embeds
0
Number of Embeds
13
Actions
Shares
0
Downloads
210
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

An Introduction to Apache Oozie

  1. 1. Apache Oozie ● What is it ? ● Why use it ? ● Architecture ● Examples www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  2. 2. Oozie – What is it ? ● Work flow scheduler for Hadoop ● Manages Hadoop Jobs ● Integrated with many Hadoop apps i.e. Pig ● Scaleable ● Schedule jobs ● A work flow is a collection of actions i.e. – map/reduce, pig, hfs ● A work flow is – Arranged as a DAG ( direct acyclic graph ) – Graph stored as hPDL ( XML process definition ) www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  3. 3. Oozie – Why use it ? ● It is designed for Hadoop ● It is open source ● It is designed for big data ● It allows you to design task work flow ● It allows you to interact with jobs – Stop, start, suspend, resume, rerun www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  4. 4. Oozie – Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  5. 5. Oozie – Architecture ● Install Oozie on edge node / not on cluster ● Oozie has client – Launches jobs and talks to server ● Ozzie has server – Controls jobs – Launches jobs ● Pipelines – Chained workflows – Work flow output – Is input to next www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  6. 6. Oozie – Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  7. 7. Contact Us ● Feel free to contact us at – www.semtech-solutions.co.nz – info@semtech-solutions.co.nz ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems

×