Data Pipeline Management Framework on Oozie
Kun Lu
Overview
Architecture of Campaign Analytics
Issues in the old Campaign Analytics processes
Building a Pipeline Management Framework for a robust computing environment
Architecture of Campaign Analytics
What issues does the framework need to solve?
Provide a consistent and robust framework
Make adding a new analytics job easier
Coordinate complex workflows (serial and parallel processing)
Support a catch-up feature
Make debugging and tracing easier
What does Oozie provide?
Workflow Engine
Workflow definition
A DAG of control flow nodes and action nodes, connected by transitions
Workflow Nodes
Control flow nodes (start, end, decision, fork, join, kill node)
Action nodes (MapReduce, Pig, Java, script, etc.)
Parameterization of Workflow
Job Properties
EL functions (Basic EL, WF EL, Hadoop EL, HDFS EL)
Oozie Console
Oozie Client and API
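The DAG structure described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not Oozie's own workflow format: the node names (`aggregate_clicks`, `build_report`, and so on) are hypothetical, and the ordering uses a standard topological sort rather than anything the slides specify. It shows how a fork starts parallel branches, how a join waits for all of them, and why a cycle makes the definition invalid.

```python
from collections import deque

# Hypothetical workflow: node name -> list of nodes it transitions to.
# The node names are illustrative; they do not come from the slides.
WORKFLOW = {
    "start": ["fork"],
    "fork": ["aggregate_clicks", "aggregate_impressions"],  # parallel branches
    "aggregate_clicks": ["join"],
    "aggregate_impressions": ["join"],
    "join": ["build_report"],  # runs only after both branches finish
    "build_report": ["end"],
    "end": [],
}


def execution_order(workflow):
    """Return one valid execution order (Kahn's algorithm).

    Raises ValueError if the graph contains a cycle, i.e. is not a DAG.
    """
    # Count incoming transitions for every node.
    indegree = {node: 0 for node in workflow}
    for targets in workflow.values():
        for target in targets:
            indegree[target] += 1

    # Nodes with no pending predecessors are ready to run.
    ready = deque(node for node, degree in indegree.items() if degree == 0)
    order = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for target in workflow[node]:
            indegree[target] -= 1
            if indegree[target] == 0:
                ready.append(target)

    if len(order) != len(workflow):
        raise ValueError("workflow contains a cycle, so it is not a DAG")
    return order
```

Calling `execution_order(WORKFLOW)` places both aggregation nodes after `fork` and before `join`, which is the serial-plus-parallel coordination the framework requirements call for.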
Workflow Design Pattern
Campaign Analytics Pipeline Management Framework
The Campaign Analytics Pipeline Management Framework (PMF) is built on top of Oozie.
PMF defines the campaign analytics processing pipelines. Each pipeline includes a set of workflows.
PMF organizes, schedules, and coordinates the campaign analytics jobs. It also provides a built-in catch-up feature to make the pipelines robust.
The Oozie workflow engine executes the workflows and sends job status to the Oozie server.
Jobs are monitored and traced through the Oozie console.
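The slides do not describe how the catch-up feature is implemented. One plausible reading, sketched below purely as an assumption, is that after an outage the framework works out which scheduled runs were missed since the last successful run and replays them in order. The function name `missed_runs` and its parameters are hypothetical.

```python
from datetime import datetime, timedelta


def missed_runs(last_success, now, interval):
    """List the scheduled run times after last_success, up to and including now.

    Assumption: runs are scheduled at a fixed interval measured from the
    last successful run. The real PMF scheduling rule is not in the slides.
    """
    runs = []
    next_run = last_success + interval
    while next_run <= now:
        runs.append(next_run)
        next_run += interval
    return runs


# Example: an hourly pipeline that last succeeded at midnight and is
# checked again at 03:30 has three runs to replay.
pending = missed_runs(
    last_success=datetime(2014, 1, 1, 0, 0),
    now=datetime(2014, 1, 1, 3, 30),
    interval=timedelta(hours=1),
)
```

Replaying the missed runs oldest first keeps downstream daily or weekly workflows from reading incomplete hourly output, which is one way such a feature could make a pipeline robust.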
PMF & Oozie Execution Environment
PMF Servers
Own the pipeline definitions
Pass workflow tasks to Oozie through the Oozie client
Oozie Server
Executes workflow tasks
Manages task status
Hadoop Cluster
Workflow definition deployed in HDFS
M/R processes run on the cluster
Oozie Console
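The hand-off from a PMF server to Oozie can be sketched as follows, assuming the submission goes through the standard `oozie job ... -config ... -run` command line and a `job.properties` file. The helper names, host name, HDFS path, and the `runHour` parameter are hypothetical. Only `oozie.wf.application.path`, which points at the workflow definition deployed in HDFS, is a standard Oozie property. The sketch only builds the file content and the command; it does not contact a server.

```python
def render_job_properties(properties):
    """Render a dict as the key=value lines of a job.properties file."""
    lines = [f"{key}={value}" for key, value in sorted(properties.items())]
    return "\n".join(lines) + "\n"


def oozie_run_command(oozie_url, properties_path):
    """Build (but do not execute) the argv that submits and starts a job."""
    return ["oozie", "job", "-oozie", oozie_url, "-config", properties_path, "-run"]


# Hypothetical values for illustration only.
props = {
    "nameNode": "hdfs://namenode.example.com:8020",
    "oozie.wf.application.path": "${nameNode}/apps/pmf/hourly",
    "runHour": "2014-01-01T03:00Z",  # illustrative workflow parameter
}

properties_text = render_job_properties(props)
command = oozie_run_command("http://oozie.example.com:11000/oozie", "job.properties")
```

Keeping the per-run values in job properties means the same workflow definition in HDFS can be reused for every hour, day, or week the pipeline processes.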
Workflow Console
Current Workflows
PMF manages three pipelines (hourly, daily, and weekly)
Includes 12 workflows
Map/Reduce jobs run per month: ~100,000
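To put the monthly figure in perspective, the arithmetic below converts it to rough daily and hourly rates. The 30-day month is an assumption made for the estimate, and the input is the approximate count from the slide.

```python
JOBS_PER_MONTH = 100_000  # approximate figure from the slide
DAYS_PER_MONTH = 30       # assumption made for this estimate

jobs_per_day = JOBS_PER_MONTH / DAYS_PER_MONTH
jobs_per_hour = jobs_per_day / 24

# Roughly 3,333 jobs per day, or about 139 per hour on average.
print(round(jobs_per_day), round(jobs_per_hour))
```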
