SlideShare a Scribd company logo
1 © Hortonworks Inc. 2011–2018. All rights reserved.
Compute Based Sizing & Operation
Dashboard
Janet Li HP Inc.
Pranay Vyas Hortonworks
2 © Hortonworks Inc. 2011–2018. All rights reserved.
Agenda
• Sizing puzzle
• Solution
• Operation dashboard
• Q&A
3 © Hortonworks Inc. 2011–2018. All rights reserved.
Sizing puzzle
4 © Hortonworks Inc. 2011–2018. All rights reserved.
• HP operates a very complex HDP environment with key stakeholders and critical data
across a variety of business areas: finance, supply chain, sales, and customer support
• We load over 8,000 files per day, execute 1.5M lines of SQL via 6000 jobs running
against 637B rows of data comprised of over 5000 tables in 77 domains
• Defining our cluster size and monitoring job performance is essential for our success and
the satisfaction of our stakeholders across the different business and IT organizations
• With estimated to run 15000 jobs everyday, it is evident that traditional storage based
sizing will not yield the computation power needed
Background
Why do we need sizing strategy?
5 © Hortonworks Inc. 2011–2018. All rights reserved.
• Multiple applications with different target state architectures
• Data Lake with multiple layers – raw dataset, historical dataset, transformed dataset &
reporting dataset
• Hybrid environment, on premise as well as on cloud
• Application owners only knows the incoming data volumes, lack of data point for
computing requirement with the use case
• Need a method to project demand and growth with new program onboarding
• Yarn capacity management and scheduling constraints as most jobs are region based
and are triggered at same time
Challenge
What are the Challenges for sizing the cluster?
6 © Hortonworks Inc. 2011–2018. All rights reserved.
• Review all the applications and collect data volumes and its estimated growth
• Capture every applications current footprint and projected capacity requirements based
on functional / technical requirements
• Understand where each application stand in terms of meeting their required SLA and
what is blocking concern
• Looked at peak CPU utilization, allocated and peak RAM usage, Storage Capacity,
Network IOPS Bandwidth & Throughput and Average Response Time and Transaction
Throughput
• Developed a framework that could be leveraged for on going capacity management and
deployment – with appropriate modifications to address different workload patterns
Approach
How to approach compute sizing?
7 © Hortonworks Inc. 2011–2018. All rights reserved.
Solution
8 © Hortonworks Inc. 2011–2018. All rights reserved.
• The ELT process is standard
• Loading layer
• History layer
• Transformation layer
• Reporting layer
• To extrapolate the sizing, we need to capture job metrics for initial release run from
loading layer to reporting layer.
• Estimate the computation requirements for queries and tune Hive and Tez properties
Collect Data
Method – Extrapolate based on initial run
9 © Hortonworks Inc. 2011–2018. All rights reserved.
• Categorize the sample runs based on
• Major job > 30 min
• Large job < 30 min
• Small job < 15 min
• Tiny job < 1 min
• Categorize the initial run based on # of containers
• Major footprint > 400 containers
• Large footprint < 400 containers
• Small footprint < 100 containers
• Tiny footprint < 20 containers
Collect Data
Method – Extrapolate based on initial run
10 © Hortonworks Inc. 2011–2018. All rights reserved.
• Total map tasks and reduce tasks needed for initial run
• Average, max & min time taken for map task and reduce task
• Extrapolate the # of hql queries needed at every release based on existing system and
initial release runs
Collect Data
Method – Extrapolate based on initial run
11 © Hortonworks Inc. 2011–2018. All rights reserved.
• Yarn memory: 6TB
• Tez container size: 4GB
• Map/ Reducer ratio – 70%:30%
• Average map time: 0.34min
• Average reducer task time: 0.5min
Collect Data
Current cluster estimates
12 © Hortonworks Inc. 2011–2018. All rights reserved.
Map Tasks: 4553788 tasks with each running for 0.34min
Reduce Tasks: 1327104 tasks with each running for 0.5min
Estimate # of map task and reduce task that can run
Current cluster estimates
13 © Hortonworks Inc. 2011–2018. All rights reserved.
• Based on the initial run get map task at each job classification
Estimate # of map task and reduce task needed
Current cluster estimates
Category
Expected
Count
Initial Run
Count
Major Job 47 2
Large Job 43 5
Small Job 1061 20
Tiny Job 10345 20
14 © Hortonworks Inc. 2011–2018. All rights reserved.
• With 6TB of Yarn memory
Map Tasks: 4553788 tasks with each running for 0.34min
Reduce Tasks: 1327104 tasks with each running for 0.5min
• What is needed to run the transformations 3 times
Map Tasks: 6733140 *3 = 20199420 tasks
Reduce Tasks: 92426 * 3 = 277278 tasks
• With 6TB of yarn memory it is evident that we may not be able to run the required
amount of map and reduce tasks.
Estimate difference
Current cluster estimates
15 © Hortonworks Inc. 2011–2018. All rights reserved.
How we got the data?
16 © Hortonworks Inc. 2011–2018. All rights reserved.
Operation Dashboard
17 © Hortonworks Inc. 2011–2018. All rights reserved.
• Provides critical input to platform sizing
• Provides job execution details and highlights differences in run
• Easy to track incremental records written to each table and unusual low/high writes
• Help understanding peak loads or dip on capacity for new job scheduling
• Know historical patterns on how the table is being loaded and identify problems with
the load
Overview
18 © Hortonworks Inc. 2011–2018. All rights reserved.
• Tez jobs writes to timeline server its application log and job metrics
• Job execution detail are stored HDFS directory “/ats/done” in complex JSON format
• A custom parser is written that extracts metrics details from the above JSON files
• The parser extracts application information, dag details, counters and detailed vertices
information.
How is data captured
19 © Hortonworks Inc. 2011–2018. All rights reserved.
What is captured
20 © Hortonworks Inc. 2011–2018. All rights reserved.
Categorize jobs
• Job category helps in understanding where most time and resources are being
spent by each tenant
21 © Hortonworks Inc. 2011–2018. All rights reserved.
Scheduling
Resource Usage Patterns
• Find peak load time and dips where cluster remains unutilized to schedule
more jobs
• Can I run more parallel jobs? Max parallelization vs Resource utilized and Max
resource utilization vs parallelization Can
schedule
more jobs
22 © Hortonworks Inc. 2011–2018. All rights reserved.
Write Patterns
Historical Comparison
• Historical pattern for records written, # of tasks, Duration, # of vertex, Average
task duration, CPU and IO operations for a table
23 © Hortonworks Inc. 2011–2018. All rights reserved.
Summary
24 © Hortonworks Inc. 2011–2018. All rights reserved.
Deliverable Description Artifact
Template for Sizing Methodology
• Foundational approach to application infrastructure assessment.
• Although every application, is unique, we find that most fall into one or a
combination of different patterns (e.g. Data Warehouse, Analytics, Transaction
Processing).
• Subsequent documents extend this approach their specific pattern type and
approach to infrastructure assessment.
Preliminary Application Assessment
Calculator
• Calculator used to provide a heat-map of potential problem areas.
HPI Infrastructure Assessment
• Provides a detailed overview of the Infrastructure Assessment covering a summary
of the issues, the current state, issues and challenges and high-level
recommendations
Application Assessment
Methodology
• Leveraging the template, this document captures the 4 phases of the infrastructure
assessment: Discovery, Analysis, Recommendations, and Socialize.
• This document becomes the basis of the written Infrastructure Assessment.
Application Sizing Calculator
• This calculator utilizes the data gathered in the discovery phase as input to help
assess if an application is sized correctly
Charge back dashboard
• It is a build in charge back tool using Smart Sense Activity Explorer
• Charge back based on real time usage metrics
Capacity Management Framework / Capability
25 © Hortonworks Inc. 2011–2018. All rights reserved.
• The source can be found at
https://github.com/pranayVyas/ats_extract
Tested HDP versions
• HDP 2.6
• Updates will be made to support HDP 3.0
• You can reach out to us if you would like to parse additional metrics info
Free the source
26 © Hortonworks Inc. 2011–2018. All rights reserved.
Questions?
27 © Hortonworks Inc. 2011–2018. All rights reserved.
Thank you

More Related Content

What's hot

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks
 
Overcoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus onOvercoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus on
DataWorks Summit
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
DataWorks Summit
 
Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...
DataWorks Summit
 
BI on Big Data with instant response times at Verizon
BI on Big Data with instant response times at VerizonBI on Big Data with instant response times at Verizon
BI on Big Data with instant response times at Verizon
DataWorks Summit
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
DataWorks Summit/Hadoop Summit
 
Leveraging advanced technologies to support critical applications in a secure...
Leveraging advanced technologies to support critical applications in a secure...Leveraging advanced technologies to support critical applications in a secure...
Leveraging advanced technologies to support critical applications in a secure...
DataWorks Summit
 
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
DataWorks Summit
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
DataWorks Summit
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
DataWorks Summit
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache Arrow
DataWorks Summit
 
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
DataWorks Summit
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
DataWorks Summit/Hadoop Summit
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
DataWorks Summit
 
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
DataWorks Summit
 
Deep learning 101
Deep learning 101Deep learning 101
Deep learning 101
DataWorks Summit
 
Keys for Success from Streams to Queries
Keys for Success from Streams to QueriesKeys for Success from Streams to Queries
Keys for Success from Streams to Queries
DataWorks Summit/Hadoop Summit
 
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark Summit
 
Multi-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASMulti-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLAS
DataWorks Summit
 

What's hot (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
Overcoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus onOvercoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus on
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
 
Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...Balancing data democratization with comprehensive information governance: bui...
Balancing data democratization with comprehensive information governance: bui...
 
BI on Big Data with instant response times at Verizon
BI on Big Data with instant response times at VerizonBI on Big Data with instant response times at Verizon
BI on Big Data with instant response times at Verizon
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Leveraging advanced technologies to support critical applications in a secure...
Leveraging advanced technologies to support critical applications in a secure...Leveraging advanced technologies to support critical applications in a secure...
Leveraging advanced technologies to support critical applications in a secure...
 
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
Acquisition of Seismic, Hydroacoustic, and Infrasonic Data with Apache NiFi a...
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache Arrow
 
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
How to use flash drives with Apache Hadoop 3.x: Real world use cases and proo...
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
 
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
 
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive...
 
Deep learning 101
Deep learning 101Deep learning 101
Deep learning 101
 
Keys for Success from Streams to Queries
Keys for Success from Streams to QueriesKeys for Success from Streams to Queries
Keys for Success from Streams to Queries
 
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun Murthy
 
Multi-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASMulti-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLAS
 

Similar to Compute-based sizing and system dashboard

Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019
alanfgates
 
Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?
DataWorks Summit
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
DataWorks Summit
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
DataWorks Summit
 
Resume quaish abuzer
Resume quaish abuzerResume quaish abuzer
Resume quaish abuzer
quaish abuzer
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Hortonworks
 
GouriShankar_Informatica
GouriShankar_InformaticaGouriShankar_Informatica
GouriShankar_InformaticaGouri Shankar M
 
BUILDING A DATA WAREHOUSE
BUILDING A DATA WAREHOUSEBUILDING A DATA WAREHOUSE
BUILDING A DATA WAREHOUSENeha Kapoor
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
DataWorks Summit
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
 
Informatica_Power_Centre_9x
Informatica_Power_Centre_9xInformatica_Power_Centre_9x
Informatica_Power_Centre_9xRamesh Togari
 
Interactive query using hadoop
Interactive query using hadoopInteractive query using hadoop
Interactive query using hadoop
Arvind Radhakrishnen
 
Ramesh_Informatica_Power_Centre
Ramesh_Informatica_Power_CentreRamesh_Informatica_Power_Centre
Ramesh_Informatica_Power_CentreRamesh Togari
 
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
Curing the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging ManagerCuring the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging Manager
DataWorks Summit
 

Similar to Compute-based sizing and system dashboard (20)

Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019Hive Performance Dataworks Summit Melbourne February 2019
Hive Performance Dataworks Summit Melbourne February 2019
 
Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?Fast SQL on Hadoop, Really?
Fast SQL on Hadoop, Really?
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Resume quaish abuzer
Resume quaish abuzerResume quaish abuzer
Resume quaish abuzer
 
Puneet Verma CV
Puneet Verma CVPuneet Verma CV
Puneet Verma CV
 
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
Optimizing your Modern Data Architecture - with Attunity, RCG Global Services...
 
GouriShankar_Informatica
GouriShankar_InformaticaGouriShankar_Informatica
GouriShankar_Informatica
 
BUILDING A DATA WAREHOUSE
BUILDING A DATA WAREHOUSEBUILDING A DATA WAREHOUSE
BUILDING A DATA WAREHOUSE
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
Shaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M ResumeShaik Niyas Ahamed M Resume
Shaik Niyas Ahamed M Resume
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
data structure
data structuredata structure
data structure
 
Informatica_Power_Centre_9x
Informatica_Power_Centre_9xInformatica_Power_Centre_9x
Informatica_Power_Centre_9x
 
Sakthivel_Ganapathy_Resume
Sakthivel_Ganapathy_ResumeSakthivel_Ganapathy_Resume
Sakthivel_Ganapathy_Resume
 
Interactive query using hadoop
Interactive query using hadoopInteractive query using hadoop
Interactive query using hadoop
 
Ramesh_Informatica_Power_Centre
Ramesh_Informatica_Power_CentreRamesh_Informatica_Power_Centre
Ramesh_Informatica_Power_Centre
 
Subrat K Panigrahi Resume
Subrat K Panigrahi ResumeSubrat K Panigrahi Resume
Subrat K Panigrahi Resume
 
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
 
Curing the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging ManagerCuring the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging Manager
 

More from DataWorks Summit

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 

Recently uploaded (20)

"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 

Compute-based sizing and system dashboard

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved. Compute Based Sizing & Operation Dashboard Janet Li HP Inc. Pranay Vyas Hortonworks
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved. Agenda • Sizing puzzle • Solution • Operation dashboard • Q&A
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved. Sizing puzzle
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved. • HP operates a very complex HDP environment with key stakeholders and critical data across a variety of business areas: finance, supply chain, sales, and customer support • We load over 8,000 files per day, execute 1.5M lines of SQL via 6000 jobs running against 637B rows of data comprised of over 5000 tables in 77 domains • Defining our cluster size and monitoring job performance is essential for our success and the satisfaction of our stakeholders across the different business and IT organizations • With estimated to run 15000 jobs everyday, it is evident that traditional storage based sizing will not yield the computation power needed Background Why do we need sizing strategy?
  • 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved. • Multiple applications with different target state architectures • Data Lake with multiple layers – raw dataset, historical dataset, transformed dataset & reporting dataset • Hybrid environment, on premise as well as on cloud • Application owners only knows the incoming data volumes, lack of data point for computing requirement with the use case • Need a method to project demand and growth with new program onboarding • Yarn capacity management and scheduling constraints as most jobs are region based and are triggered at same time Challenge What are the Challenges for sizing the cluster?
  • 6. 6 © Hortonworks Inc. 2011–2018. All rights reserved. • Review all the applications and collect data volumes and its estimated growth • Capture every applications current footprint and projected capacity requirements based on functional / technical requirements • Understand where each application stand in terms of meeting their required SLA and what is blocking concern • Looked at peak CPU utilization, allocated and peak RAM usage, Storage Capacity, Network IOPS Bandwidth & Throughput and Average Response Time and Transaction Throughput • Developed a framework that could be leveraged for on going capacity management and deployment – with appropriate modifications to address different workload patterns Approach How to approach compute sizing?
  • 7. 7 © Hortonworks Inc. 2011–2018. All rights reserved. Solution
  • 8. 8 © Hortonworks Inc. 2011–2018. All rights reserved. • The ELT process is standard • Loading layer • History layer • Transformation layer • Reporting layer • To extrapolate the sizing, we need to capture job metrics for initial release run from loading layer to reporting layer. • Estimate the computation requirements for queries and tune Hive and Tez properties Collect Data Method – Extrapolate based on initial run
  • 9. 9 © Hortonworks Inc. 2011–2018. All rights reserved. • Categorize the sample runs based on • Major job > 30 min • Large job < 30 min • Small job < 15 min • Tiny job < 1 min • Categorize the initial run based on # of containers • Major footprint > 400 containers • Large footprint < 400 containers • Small footprint < 100 containers • Tiny footprint < 20 containers Collect Data Method – Extrapolate based on initial run
  • 10. 10 © Hortonworks Inc. 2011–2018. All rights reserved. • Total map tasks and reduce tasks needed for initial run • Average, max & min time taken for map task and reduce task • Extrapolate the # of hql queries needed at every release based on existing system and initial release runs Collect Data Method – Extrapolate based on initial run
  • 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved. • Yarn memory: 6TB • Tez container size: 4GB • Map/ Reducer ratio – 70%:30% • Average map time: 0.34min • Average reducer task time: 0.5min Collect Data Current cluster estimates
  • 12. 12 © Hortonworks Inc. 2011–2018. All rights reserved. Map Tasks: 4553788 tasks with each running for 0.34min Reduce Tasks: 1327104 tasks with each running for 0.5min Estimate # of map task and reduce task that can run Current cluster estimates
  • 13. 13 © Hortonworks Inc. 2011–2018. All rights reserved. • Based on the initial run get map task at each job classification Estimate # of map task and reduce task needed Current cluster estimates Category Expected Count Initial Run Count Major Job 47 2 Large Job 43 5 Small Job 1061 20 Tiny Job 10345 20
  • 14. 14 © Hortonworks Inc. 2011–2018. All rights reserved. • With 6TB of Yarn memory Map Tasks: 4553788 tasks with each running for 0.34min Reduce Tasks: 1327104 tasks with each running for 0.5min • What is needed to run the transformations 3 times Map Tasks: 6733140 *3 = 20199420 tasks Reduce Tasks: 92426 * 3 = 277278 tasks • With 6TB of yarn memory it is evident that we may not be able to run the required amount of map and reduce tasks. Estimate difference Current cluster estimates
  • 15. 15 © Hortonworks Inc. 2011–2018. All rights reserved. How we got the data?
  • 16. 16 © Hortonworks Inc. 2011–2018. All rights reserved. Operation Dashboard
  • 17. 17 © Hortonworks Inc. 2011–2018. All rights reserved. • Provides critical input to platform sizing • Provides job execution details and highlights differences in run • Easy to track incremental records written to each table and unusual low/high writes • Help understanding peak loads or dip on capacity for new job scheduling • Know historical patterns on how the table is being loaded and identify problems with the load Overview
  • 18. 18 © Hortonworks Inc. 2011–2018. All rights reserved. • Tez jobs writes to timeline server its application log and job metrics • Job execution detail are stored HDFS directory “/ats/done” in complex JSON format • A custom parser is written that extracts metrics details from the above JSON files • The parser extracts application information, dag details, counters and detailed vertices information. How is data captured
  • 19. 19 © Hortonworks Inc. 2011–2018. All rights reserved. What is captured
  • 20. 20 © Hortonworks Inc. 2011–2018. All rights reserved. Categorize jobs • Job category helps in understanding where most time and resources are being spent by each tenant
  • 21. 21 © Hortonworks Inc. 2011–2018. All rights reserved. Scheduling Resource Usage Patterns • Find peak load time and dips where cluster remains unutilized to schedule more jobs • Can I run more parallel jobs? Max parallelization vs Resource utilized and Max resource utilization vs parallelization Can schedule more jobs
  • 22. 22 © Hortonworks Inc. 2011–2018. All rights reserved. Write Patterns Historical Comparison • Historical pattern for records written, # of tasks, Duration, # of vertex, Average task duration, CPU and IO operations for a table
  • 23. 23 © Hortonworks Inc. 2011–2018. All rights reserved. Summary
  • 24. 24 © Hortonworks Inc. 2011–2018. All rights reserved. Deliverable Description Artifact Template for Sizing Methodology • Foundational approach to application infrastructure assessment. • Although every application, is unique, we find that most fall into one or a combination of different patterns (e.g. Data Warehouse, Analytics, Transaction Processing). • Subsequent documents extend this approach their specific pattern type and approach to infrastructure assessment. Preliminary Application Assessment Calculator • Calculator used to provide a heat-map of potential problem areas. HPI Infrastructure Assessment • Provides a detailed overview of the Infrastructure Assessment covering a summary of the issues, the current state, issues and challenges and high-level recommendations Application Assessment Methodology • Leveraging the template, this document captures the 4 phases of the infrastructure assessment: Discovery, Analysis, Recommendations, and Socialize. • This document becomes the basis of the written Infrastructure Assessment. Application Sizing Calculator • This calculator utilizes the data gathered in the discovery phase as input to help assess if an application is sized correctly Charge back dashboard • It is a build in charge back tool using Smart Sense Activity Explorer • Charge back based on real time usage metrics Capacity Management Framework / Capability
  • 25. 25 © Hortonworks Inc. 2011–2018. All rights reserved. • The source can be found at https://github.com/pranayVyas/ats_extract Tested HDP versions • HDP 2.6 • Updates will be made to support HDP 3.0 • You can reach out to us if you would like to parse additional metrics info Free the source
  • 26. 26 © Hortonworks Inc. 2011–2018. All rights reserved. Questions?
  • 27. 27 © Hortonworks Inc. 2011–2018. All rights reserved. Thank you

Editor's Notes

  1. TALK TRACK Hortonworks Powers the Future of Data: data-in-motion, data-at-rest, and Modern Data Applications. [NEXT SLIDE]