3. Optimize The Jobs!
● Internal Downsizer tool quantifies job waste
● Application framework limitations cap how much tuning alone can recover
● Even an optimally tuned container still leaves underutilized resources
[Figure: container utilization over time, showing underutilized resources within the allocation]
4. What about Static Overcommit?
● Configure YARN to advertise more memory than the node physically provides (example below)
● Tried with some success
● Performs very poorly once the node is fully utilized
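A minimal sketch (not from the talk) of what static overcommit amounts to, using the standard yarn.nodemanager.resource.memory-mb setting; the node sizes are illustrative only:

import org.apache.hadoop.yarn.conf.YarnConfiguration;

public class StaticOvercommitSketch {
  public static void main(String[] args) {
    YarnConfiguration conf = new YarnConfiguration();
    // Node physically has 96 GB of RAM, but advertise 128 GB to the scheduler.
    // This helps while containers under-use their memory, and fails badly
    // once the node is genuinely full.
    conf.setInt("yarn.nodemanager.resource.memory-mb", 128 * 1024);
    System.out.println("Advertised memory (MB): "
        + conf.getInt("yarn.nodemanager.resource.memory-mb", -1));
  }
}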
5. Overcommit Prototype Design Goals
● No changes to applications
● Minimize changes to YARN protocols
● Minimize changes to scheduler internals
● Overcommit on memory only
● Conservative growth
● Rapid correction (sketch below)
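A hypothetical sketch of the asymmetry behind the last two goals: memory grows in small, bounded increments while the node is below a low water mark, and snaps back toward the physical size as soon as utilization crosses a high water mark. The class and method names are invented for illustration; the real prototype lives in the ResourceManager scheduler and NodeManager monitor described on the following slides.

public class OvercommitPolicySketch {
  static final double LOW_WATER_MARK = 0.6;   // may grow only below this utilization
  static final double HIGH_WATER_MARK = 0.8;  // never grow above this utilization
  static final long INCREMENT_MB = 16_384;    // bounded growth step per period
  static final double MAX_FACTOR = 1.5;       // absolute overcommit ceiling

  /** Returns the node size (MB) to advertise for the next period. */
  static long adjust(long physicalMb, long advertisedMb, double utilization) {
    if (utilization > HIGH_WATER_MARK) {
      // Rapid correction: drop straight back to the physical size
      // (the NodeManager additionally preempts, see the later slides).
      return physicalMb;
    }
    if (utilization < LOW_WATER_MARK) {
      // Conservative growth: at most one small increment, never past the ceiling.
      long ceiling = (long) (physicalMb * MAX_FACTOR);
      return Math.min(ceiling, advertisedMb + INCREMENT_MB);
    }
    return advertisedMb; // between the marks: hold steady
  }

  public static void main(String[] args) {
    System.out.println(adjust(96 * 1024L, 96 * 1024L, 0.4)); // grows by 16 GB
  }
}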
8. ResourceManager Overcommit Tunables
Parameter | Description | Value
memory.max-factor | Maximum amount a node will be overcommitted | 1.5
memory.low-water-mark | Maximum overcommit below this node utilization | 0.6
memory.high-water-mark | No overcommit above this node utilization | 0.8
memory.increment-mb | Maximum increment above node allocation | 16384
increment-period-ms | Delay between overcommit increments if node container state does not change | 0
Parameters use the yarn.resourcemanager.scheduler.overcommit. prefix (example below)
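As a sketch, the fully qualified property names these rows imply, set through the ordinary Hadoop Configuration API with the values from the table:

import org.apache.hadoop.conf.Configuration;

public class RmOvercommitConfigSketch {
  public static void main(String[] args) {
    Configuration conf = new Configuration();
    final String prefix = "yarn.resourcemanager.scheduler.overcommit.";
    conf.setFloat(prefix + "memory.max-factor", 1.5f);       // overcommit ceiling
    conf.setFloat(prefix + "memory.low-water-mark", 0.6f);   // max overcommit below this utilization
    conf.setFloat(prefix + "memory.high-water-mark", 0.8f);  // no overcommit above this utilization
    conf.setInt(prefix + "memory.increment-mb", 16384);      // max increment above node allocation
    conf.setLong(prefix + "increment-period-ms", 0L);        // delay between increments
    System.out.println(conf.get(prefix + "memory.max-factor"));
  }
}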
9. NodeManager Self-Preservation Preemption
[Figure: node utilization gauge from 0% to 100% with high and low water marks]
● Utilization above high mark triggers preemption
● Preempts enough to reach low mark utilization
● Does not preempt below the node's original (non-overcommitted) size
● Containers preempted in group order (sketch below)
○ Tasks from preemptable queue
○ ApplicationMasters from preemptable queue
○ Tasks from non-preemptable queue
○ ApplicationMasters from non-preemptable queue
● Youngest containers preempted first within a group
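A sketch of the ordering rules above, assuming a simple container record with a preemptable-queue flag, an ApplicationMaster flag, and a start time; none of these names come from the actual NodeManager code:

import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical sketch of the self-preservation preemption order. */
public class PreemptionOrderSketch {
  record RunningContainer(boolean preemptableQueue, boolean isAppMaster,
                          long startTimeMs) {}

  /** Lower group number is preempted first. */
  static int group(RunningContainer c) {
    if (c.preemptableQueue() && !c.isAppMaster()) return 0; // tasks, preemptable queue
    if (c.preemptableQueue())                     return 1; // AMs, preemptable queue
    if (!c.isAppMaster())                         return 2; // tasks, non-preemptable queue
    return 3;                                               // AMs, non-preemptable queue
  }

  /** Within a group, the youngest (most recently started) container goes first. */
  static final Comparator<RunningContainer> PREEMPTION_ORDER =
      Comparator.comparingInt(PreemptionOrderSketch::group)
                .thenComparing(Comparator.comparingLong(RunningContainer::startTimeMs).reversed());

  public static void main(String[] args) {
    List<RunningContainer> running = new ArrayList<>(List.of(
        new RunningContainer(false, false, 1_000),    // old task, non-preemptable queue
        new RunningContainer(true,  false, 5_000),    // young task, preemptable queue
        new RunningContainer(true,  true,  2_000)));  // AM, preemptable queue
    running.sort(PREEMPTION_ORDER);                   // preempt from the front until low mark reached
    running.forEach(System.out::println);
  }
}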
10. NodeManager Overcommit Tunables
Parameter | Description | Value
memory.high-water-mark | Preemption when above this utilization | 0.95
memory.low-water-mark | Target utilization after preemption | 0.92
Parameters use the yarn.nodemanager.resource-monitor.overcommit. prefix (example below)
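The corresponding NodeManager-side settings as a sketch, again using the Hadoop Configuration API and the values from the table:

import org.apache.hadoop.conf.Configuration;

public class NmOvercommitConfigSketch {
  public static void main(String[] args) {
    Configuration conf = new Configuration();
    final String prefix = "yarn.nodemanager.resource-monitor.overcommit.";
    conf.setFloat(prefix + "memory.high-water-mark", 0.95f); // start preempting above this
    conf.setFloat(prefix + "memory.low-water-mark", 0.92f);  // preempt down to this target
    System.out.println(conf.get(prefix + "memory.high-water-mark"));
  }
}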
13. Lessons Learned
● Significant overcommit achievable on real workloads
● Far less preemption than expected
● Container reservations can drive overcommit growth
● Coordinated reducers can be a problem
● Cluster capacity totals fluctuate as overcommit grows and shrinks, which can be confusing at first
14. Future Work
● YARN-5202 (upstream JIRA for this overcommit work)
● Only grows the cluster as a whole, not individual queues
● Some nodes can overcommit while others sit relatively idle
● CPU overcommit
● Predict growth based on past behavior
● Relinquish nodes during quiet periods
● Integration with YARN-1011
15. YARN-1011
● Explicit GUARANTEED vs. OPPORTUNISTIC container distinction (sketch below)
● Promotion of containers once resources are available
● SLA guarantees along with best-effort load
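A sketch of how the GUARANTEED vs. OPPORTUNISTIC distinction looks from an application's side, using the ExecutionType records that already exist in YARN; the exact request and promotion flow under YARN-1011 may differ:

import org.apache.hadoop.yarn.api.records.ExecutionType;
import org.apache.hadoop.yarn.api.records.ExecutionTypeRequest;
import org.apache.hadoop.yarn.api.records.Priority;
import org.apache.hadoop.yarn.api.records.Resource;
import org.apache.hadoop.yarn.client.api.AMRMClient.ContainerRequest;

public class OpportunisticRequestSketch {
  public static void main(String[] args) {
    // A 4 GB / 1 vcore container that may run as best-effort (OPPORTUNISTIC)
    // load and be promoted to GUARANTEED once real capacity frees up.
    Resource capability = Resource.newInstance(4096, 1);
    ExecutionTypeRequest execType =
        ExecutionTypeRequest.newInstance(ExecutionType.OPPORTUNISTIC, false); // false: do not strictly enforce the type
    ContainerRequest request = new ContainerRequest(
        capability, null, null, Priority.newInstance(1), true, null, execType);
    System.out.println(request);
  }
}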
16. Acknowledgements
● Nathan Roberts for co-developing overcommit POC
● Inigo Goiri for nodemanager utilization collection and reporting
● Giovanni Matteo Fumarola for nodemanager AM container detection
● YARN-1011 contributors for helping to shape the long-term solution