ACHIEVING OPERATIONAL
READINESS ON HADOOP WITH
DRIVEN & CASCADING
Supreet Oberoi
VP Field Engineering, Concurrent Inc
ABOUT US
TRUSTED by over 10,000 companies as their big data app platform
BACKED by top Silicon Valley investors True Ventures, Rembrandt VP, Bain Capital
FOUNDED in 2008, with headquarters in San Francisco
WHY NOW?
As big data applications become the engine of your data management strategy, they must meet higher standards of quality, reliability, and manageability.
WHY DO WE NEED OPERATIONAL READINESS ON HADOOP?
• Perform routine tasks in an automated and predictable manner
• Give consistent answers to customer questions
• Meet new regulatory requirements without significant investment
• Optimize use of cluster
• Improve quality and service of your data platform
CASCADING - THE STANDARD FOR BIG DATA APP DEVELOPMENT
[Diagram labels: Cascading Apps, New Fabrics, Clojure, SQL, Ruby, Storm, Tez, System Integration, Mainframe, DB / DW, Data Stores, Hadoop, In-Memory]
• Write apps in programming language of choice
• Decouple application logic from integration
• Future-proof your app to run on many compute fabrics
Cascading is the proven platform for building and deploying big data
applications on Hadoop with 10,000+ production deployments
DRIVEN PROVIDES OPERATIONAL VISIBILITY TO YOUR HADOOP APPS
End-to-end operational telemetry metadata for big data applications
Accessible via Web browser, command-line interface (CLI), or simple search queries
Easy integrations through JMX and upcoming Driven SDK
[Diagram labels: WAR files, Web App Server, Telemetry metadata (SSL), Server, Hadoop clusters, Hadoop apps and infrastructure, Applications, YARN, Plugin, Web, CLI, JMX]
BEST PRACTICES FOR OPERATIONAL EXCELLENCE OF YOUR HADOOP APPLICATIONS USING DRIVEN
MANAGE BIG DATA APPS MORE EFFECTIVELY
#1 - MONITOR THE FLEET, NOT THE VEHICLE
By department: Marketing, Sales, Compliance
By team/cluster: Data science team, QA cluster, Production cluster
Compare performance, resource consumption, and other metrics across departments, teams, and any other segment you define
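As a rough illustration of comparing metrics across segments, the sketch below groups run records by department or by cluster and reports a run count and mean runtime for each group. The record fields, app names, and numbers are made up for this example and are not Driven's data model or API.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical run records: app names, field names, and runtimes are
# invented for illustration and do not come from Driven.
runs = [
    {"app": "campaign-scoring", "department": "Marketing", "cluster": "Production cluster", "runtime_s": 400},
    {"app": "audience-build", "department": "Marketing", "cluster": "QA cluster", "runtime_s": 600},
    {"app": "pipeline-forecast", "department": "Sales", "cluster": "Production cluster", "runtime_s": 300},
    {"app": "audit-export", "department": "Compliance", "cluster": "Production cluster", "runtime_s": 900},
]


def summarize(records, segment_key):
    """Group records by segment_key; return run count and mean runtime per group."""
    groups = defaultdict(list)
    for record in records:
        groups[record[segment_key]].append(record["runtime_s"])
    return {
        segment: {"runs": len(times), "mean_runtime_s": mean(times)}
        for segment, times in groups.items()
    }


by_department = summarize(runs, "department")
by_cluster = summarize(runs, "cluster")

for segment, stats in by_department.items():
    print(segment, stats)
```

The same function works for any other segment key added to the records, which is the point of looking at the fleet rather than a single app.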
MANAGE BIG DATA APPS MORE EFFECTIVELY
#2 - UNDERSTAND INTER-APP DEPENDENCIES
See how all apps consume
resources as they run
BUILD HIGHER QUALITY BIG DATA APPS
#3 - PROMOTE A CULTURE OF COLLABORATION
Enable multiple teams to
cooperate on what went wrong
and how to fix it
BUILD HIGHER QUALITY BIG DATA APPS
#4 - SHARE APPLICATION CONTEXT AROUND ERRORS
Quickly and easily identify execution errors without parsing log files
[Screenshot labels: Results, Error report]
#5 - TUNE THE APP BEFORE CHANGING YOUR INFRASTRUCTURE
Pinpoint bottlenecks and identify causes
Is the problem your code, your data, or your hardware?
[Chart legend: Execution, Waiting]
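One way to read an execution-versus-waiting breakdown is to compute, per step, the share of wall-clock time spent waiting. The sketch below does that for made-up step timings; the step names and numbers are hypothetical, and treating a high waiting share as a hint that the bottleneck is scheduling or resources rather than the step's own logic is an assumption of this example.

```python
# Hypothetical step timings in seconds: (step name, time executing, time waiting).
steps = [
    ("import", 120, 30),
    ("join", 300, 900),
    ("export", 60, 15),
]


def waiting_share(execution_s, waiting_s):
    """Return the fraction of total time spent waiting (0.0 when total is zero)."""
    total = execution_s + waiting_s
    if total == 0:
        return 0.0
    return waiting_s / total


shares = {name: waiting_share(execution, waiting) for name, execution, waiting in steps}

# The step with the largest waiting share is the first candidate to investigate.
worst_step = max(shares, key=shares.get)
print(worst_step, shares[worst_step])
```

A step that mostly waits points toward cluster contention or upstream dependencies; a step that mostly executes points toward the code or the data it processes.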
BUILD HIGHER QUALITY BIG DATA APPS
#6 - TRACE DATA FLOW AT AN OPERATIONAL LEVEL
Fully visualize your entire data pipeline
[Diagram labels: Sources, Operations (hash joins, Hive queries, MapReduce jobs, etc.), Results]
#7 - DEFINE AND TRACK SLA ACCOUNTABILITY
[Chart: From: Sept 29, 2014 4:00 pm To: Nov 21, 2014 3:00 am; legend: Completed successfully, Failed, Still executing; time range: 3d]
See historical performance, SLA compliance and more
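A minimal sketch of what tracking SLA compliance over historical runs could look like. It assumes, for this example only, that an SLA is a maximum duration in minutes, that failed runs count as misses, and that runs still executing are left out until they finish. The status names mirror the chart legend; the records and the 60-minute threshold are invented.

```python
# Hypothetical run history; statuses mirror the legend
# (completed successfully, failed, still executing).
history = [
    {"status": "completed", "duration_min": 42},
    {"status": "completed", "duration_min": 75},
    {"status": "failed", "duration_min": 10},
    {"status": "executing", "duration_min": None},
    {"status": "completed", "duration_min": 58},
]

SLA_MAX_MINUTES = 60  # assumed threshold, for illustration only


def sla_compliance(runs, max_minutes):
    """Fraction of finished runs that completed within max_minutes.

    Failed runs count as misses; runs still executing are ignored.
    Returns None when no run has finished yet.
    """
    finished = [r for r in runs if r["status"] in ("completed", "failed")]
    if not finished:
        return None
    met = [
        r for r in finished
        if r["status"] == "completed" and r["duration_min"] <= max_minutes
    ]
    return len(met) / len(finished)


compliance = sla_compliance(history, SLA_MAX_MINUTES)
print(f"SLA compliance: {compliance:.0%}")
```

Whether failures and in-flight runs count against the SLA is a policy choice; making it explicit in code keeps the reported number unambiguous.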
QUESTIONS?
THANK YOU

7 Best Practices for Achieving Operational Readiness on Hadoop with Driven and Cascading

Editor's Notes

  1. Need reliable, reusable tooling to quickly build and consistently deliver data products. Need the degrees of freedom to solve problems ranging from simple to complex with existing skill sets. Need the flexibility to easily adapt an application to meet business needs (latency, scale, SLA), without having to rewrite the application. Need operational visibility for the entire data application lifecycle.
  2. Spring Data is built on Cascading