SlideShare a Scribd company logo
1 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop YARN:
State of the Union
Sanjay Radia
Founder, Chief Architect Hortonworks
Apache Hadoop PMC
2 © Hortonworks Inc. 2011–2018. All rights reserved
About the Speaker
• Sanjay Radia
• Chief Architect, Founder, Hortonworks
• Apache Hadoop PMC and Committer
• Part of the original Hadoop team at Yahoo! since 2007
• Chief Architect of Hadoop Core at Yahoo!
• Prior
• Data center automation, virtualization, Java, HA, OSs, File Systems
• Startup, Sun Microsystems, INRIA…
• Ph.D., University of Waterloo
Page 2Architecting the Future of Big Data
3 © Hortonworks Inc. 2011–2018. All rights reserved
• Introduction
• Past
• State of Union for YARN
Agenda
4 © Hortonworks Inc. 2011–2018. All rights reserved
A Brief Timeline from Past Year: GA Releases
2.8.0 2.9.0 3.0.0 3.1.0
• GPU/FPGA
• YARN Native
Service
• Placement
Constraints
• YARN Federation
• Opportunistic
Container
(Backported from 3.0)
• New YARN UI
• Timeline V2
• Global Scheduling
• Multiple Resource types
• New YARN UI
• Timeline service V2
Ever involving requirements (computation intensive, larger, services)
• Application Priority
• Reservations
• Node labels
improvements
22 March ’17 17 Nov ’17 13 Dec ‘17 02 Aug ‘182.8.4 2.9.1 3.0.3 3.1.1
8 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop 3.0/3.1
9 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
10 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
11 © Hortonworks Inc. 2011–2018. All rights reserved
• Many customers with clusters made up of large number of nodes
• Oath (Yahoo!), Twitter, LinkedIn, Microsoft, Alibaba etc.
• Previous largest clusters: 6K-8K
• Now: 50K nodes under YARN in Microsoft [1]
• Roadmap: To 100K and beyond
[1] https://azure.microsoft.com/en-us/blog/how-microsoft-drives-exabyte-analytics-on-the-world-s-largest-yarn-cluster/
Looking at the Scale!
12 © Hortonworks Inc. 2011–2018. All rights reserved
• Enables applications to scale to 100k of thousands of nodes
• Federation divides a large (10-100k nodes) cluster into smaller units called sub-clusters
• Federation negotiates with sub-clusters RM’s and provide resources to the application
• Applications can schedule tasks on any node
YARN Federation
13 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
14 © Hortonworks Inc. 2011–2018. All rights reserved
• YARN-5139
• Problems
• Current design of one-node-at-a-time allocation cycle can lead to suboptimal decisions.
• Performance and also hard to do global placement policies
• Several coarse grained locks
• With this, we improved to
• Look at several nodes at a time
• Fine grained locks
• Multiple allocator threads
• YARN scheduler can allocate 3k+ containers per second ≈ 10 mil allocations / hour!
• 10X throughput gains
• Much better placement decisions
Moving Towards Global & Fast Scheduling
16 © Hortonworks Inc. 2011–2018. All rights reserved
Better Placement Strategies (YARN-6592)
• Past
• Supported constraints in form of Node Locality
• Now YARN can support a lot more use cases
• Co-locate the allocations of a job on the same rack (affinity)
• Spread allocations across machines (anti-affinity) to minimize resource interference
• Allow up to a specific number of allocations in a node group (cardinality)
17 © Hortonworks Inc. 2011–2018. All rights reserved
Better Placement Strategies (YARN-6592)
Affinity Anti-affinity
Region
Server
Region
Server
18 © Hortonworks Inc. 2011–2018. All rights reserved
Absolute Resources Configuration in CS – YARN-5881
• The cloud model! “Give me X resources, not X%”
• Gives ability to configure Queue resources as below
<memory=24GB, vcores=20, yarn.io/gpu=2>
• Enables admins to assign different quotas of different resource-types
• No more “Single percentage value limitation for all resource-types”
root
(memory=60G, vcores=40,
gpu=10)
sales
(memory=40G, vcores=20,
gpu=6)
marketing
(memory=10G, vcores=10,
gpu=4)
engineering
(memory=10G, vcores=10,
gpu=0)
20 © Hortonworks Inc. 2011–2018. All rights reserved
Useful Scheduling Features from Recent 2.X Releases
• Application Priorities
• Queue Priorities
• Previously : Resources to least satisfied queue
• Queues can have priorities e.g. (production over dev)
21 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
22 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: UI (1)
23 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: UI (2)
24 © Hortonworks Inc. 2011–2018. All rights reserved
Timeline Service 2.0
• Application History
• Where did my container run
• Why is my app slow
• Why is failing
• What happened with my app
• Cluster History
• Which user had most utilization
• Largest App
• What happened in my cluster
• …
• Scalable & Robust Service
• Using HBase as backend for better scale
• More robust storage fault tolerance
• Migration and compatibility with v.1.5
25 © Hortonworks Inc. 2011–2018. All rights reserved
Usability: Queue & Logs
API based queue management
Decentralized
(YARN-5734)
Improved logs
management
(YARN-4904)
Live application logs
26 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
27 © Hortonworks Inc. 2011–2018. All rights reserved
• Better Packaging model
• Lightweight mechanism for packaging and resource isolation
• Popularized and made accessible by Docker
• Native integration ++ in YARN
• Support for “Container Runtimes” in LCE: YARN-3611
• Native process runtime
• Docker runtime
• Many security/usability improves added to 3.0.x / 3.1.x
Containers
28 © Hortonworks Inc. 2011–2018. All rights reserved
Containers
• Run both with and without Docker on the same cluster
• Choose at run-time!
29 © Hortonworks Inc. 2011–2018. All rights reserved
• Apache Spark applications have a complex set of required software dependencies
• Docker on YARN helps to solve Spark package isolation issues with
• PySpark - Python versions, packages
• R - packages
Spark on Docker in YARN
31 © Hortonworks Inc. 2011–2018. All rights reserved
Hadoop Apps
YARN on YARN… Towards a Private Cloud …YCloud
YARN
MR Tez Spark
Tensor
Flow
YARN
MR Tez Spark
Spark
32 © Hortonworks Inc. 2011–2018. All rights reserved
YCloud: YARN Based Container Cloud We Use for Testing
ü Testing Hadoop on Hadoop!
33 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
34 © Hortonworks Inc. 2011–2018. All rights reserved
• YARN supported only Memory
and CPU
• Now
• A generalized vector for all resources
• Admin could add arbitrary resource types!
Resource Profiles and Custom Resource Types
• Ease of resource requesting model
using profiles for apps
Profile Memory CPU GPU
Small 2 GB 4 Cores 0 Cores
Medium 4 GB 8 Cores 0 Cores
Large 16 GB 16 Cores 4 CoresMemory
CPU
GPU
FPGA
Node Manager
35 © Hortonworks Inc. 2011–2018. All rights reserved
• Why?
• No need to setup separate clusters
• Leverage shared compute!
• Why need isolation?
• Multiple processes use the single GPU will be:
• Serialized.
• Cause OOM easily.
• GPU isolation on YARN:
• Granularity is for per-GPU device.
• Use Cgroups / Docker to enforce isolation.
GPU Support on YARN
Tensorflow 1.2
Nginx AppUbuntu 14:04
Nginx AppHost OS
GPU Base Lib v1
Volume Mount
CUDA Library 5.0
36 © Hortonworks Inc. 2011–2018. All rights reserved
• FPGA isolation on YARN: .
• Granularity is for per-FPGA device.
• Use Cgroups to enforce the isolation.
• Currently, only Intel OpenCL SDK for FPGA is supported.
• But implementation is extensible to other FPGA SDK.
FPGA on YARN
37 © Hortonworks Inc. 2011–2018. All rights reserved
Key Themes
Scale
Platform Themes Workload Themes
Scheduling Usability Containers Resources Services
38 © Hortonworks Inc. 2011–2018. All rights reserved
Services Support in YARN
• In the past we have focused on Jobs and have an advanced fine grained scheduler
• A native YARN services framework (YARN-4692)
• Manage services and their lifecycle
• Scale up and down, restart etc
• Support for Upgrades (YARN-4726
• “Do an upgrade of my HBase app with minimal impact to end-users
• Apache Slider retired from Incubator – lessons and key code carried over to YARN
• Simplified discovery of services via DNS mechanisms: YARN-4757
• regionserver-0.hbase-app-3.hadoop.yarn.site
39 © Hortonworks Inc. 2011–2018. All rights reserved
• Applications need simple APIs
• Need to be deployable “easily”
• Simple REST API layer fronting YARN
• YARN-4793 Simplified API layer
• Spawn services & Manage them
Simplified APIs for Service Definitions
41 © Hortonworks Inc. 2011–2018. All rights reserved
LLAP on YARN
• Apache Hive LLAP is a key long running application
• Used for query processing
• Designed to run on a shared multi-tenant YARN cluster
42 © Hortonworks Inc. 2011–2018. All rights reserved
Application Timeout – YARN-3813
• Controlling running time of workloads in YARN
• Define lifetime for an application anytime for YARN to manage.
• “Give me resources for this app/service but kill it after 15 days”
Reservation-based Scheduling: If You’re Late Don’t Blame Us! - Carlo, et al. 2015
43 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Hadoop 3.2 and Beyond
44 © Hortonworks Inc. 2011–2018. All rights reserved
• Every user says “Give me 16GB for my task”, even though it’s only needed at peak
• Each node has some allocated but unutilized capacity. Use such capacity to run opportunistic tasks
• Preempt such tasks when needed
Container Overcommit (YARN-1011)
Un-allocated
Allocated
Utilized
Allocated-
but-unutilized
Node1
Guaranteed
Containers
Node1
Opportunistic
Containers
Preemption
When necessary
45 © Hortonworks Inc. 2011–2018. All rights reserved
• “Take me to a node with JDK 10”
• Node Partition vs. Node Attribute
• Partition:
• One partition for one node
• ACL
• Shares between queues
• Preemption enforced.
• Attribute:
• For container placement
• No ACL/Shares on attributes
• First-come-first-serve
Node Attributes (YARN-3409) Partition - 1 (15 nodes)
Queue-A (40%)
Queue-B (60%)
Partition - 2 (15 nodes)
A (25%)
B (25%)
C (25%)
D (25%)
Node 1
os.type=ubuntu
os.version=14.10
glibc.version=2.20
JDK.version=8u20
Node 2
os.type=RHEL
os.version=5.1
GPU.type=x86_64
JDK.version=7u20
Node 16
os.type=windows
os.version=7
JDK.version=8u20
Node 17
os.type=SUSE
os.version=12
GPU.type=i686
JDK.version=7u20
48 © Hortonworks Inc. 2011–2018. All rights reserved
Questions?

More Related Content

What's hot

What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4
DataWorks Summit
 
Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0
DataWorks Summit
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
DataWorks Summit
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
DataWorks Summit
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
DataWorks Summit
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
DataWorks Summit
 
What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4
DataWorks Summit
 
Navigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT DevelopmentNavigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT Development
DataWorks Summit
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
DataWorks Summit
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
DataWorks Summit
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?
DataWorks Summit
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
DataWorks Summit
 
Curing the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging ManagerCuring the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging Manager
DataWorks Summit
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
DataWorks Summit
 
Deep learning 101
Deep learning 101Deep learning 101
Deep learning 101
DataWorks Summit
 
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentHow to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
DataWorks Summit
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
DataWorks Summit
 
Hadoop: The Unintended Benefits
Hadoop: The Unintended BenefitsHadoop: The Unintended Benefits
Hadoop: The Unintended Benefits
DataWorks Summit
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 

What's hot (20)

What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4What s new in spark 2.3 and spark 2.4
What s new in spark 2.3 and spark 2.4
 
Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
 
Dataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFiDataflow Management From Edge to Core with Apache NiFi
Dataflow Management From Edge to Core with Apache NiFi
 
Solving Cybersecurity at Scale
Solving Cybersecurity at ScaleSolving Cybersecurity at Scale
Solving Cybersecurity at Scale
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4
 
Navigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT DevelopmentNavigating Idiosyncrasies of IoT Development
Navigating Idiosyncrasies of IoT Development
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?Fast SQL on Hadoop, really?
Fast SQL on Hadoop, really?
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
 
Curing the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging ManagerCuring the Kafka Blindness – Streams Messaging Manager
Curing the Kafka Blindness – Streams Messaging Manager
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 
Deep learning 101
Deep learning 101Deep learning 101
Deep learning 101
 
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentHow to Ingest 16 Billion Records Per Day into your Hadoop Environment
How to Ingest 16 Billion Records Per Day into your Hadoop Environment
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Hadoop: The Unintended Benefits
Hadoop: The Unintended BenefitsHadoop: The Unintended Benefits
Hadoop: The Unintended Benefits
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFiConnecting the Drops with Apache NiFi & Apache MiNiFi
Connecting the Drops with Apache NiFi & Apache MiNiFi
 

Similar to Apache Hadoop YARN: State of the Union

Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
DataWorks Summit
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
Sunil Govindan
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
DataWorks Summit
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Wangda Tan
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
DataWorks Summit
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
DataWorks Summit
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
DataWorks Summit/Hadoop Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
DataWorks Summit
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Vinod Kumar Vavilapalli
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
DataWorks Summit
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Billie Rinaldi
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
DataWorks Summit
 
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Hakka Labs
 
Running Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache HadoopRunning Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache Hadoop
hitesh1892
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
DataWorks Summit
 
Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practices
DataWorks Summit
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFT
DataWorks Summit
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 

Similar to Apache Hadoop YARN: State of the Union (20)

Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Apache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration storyApache Hadoop 3 updates with migration story
Apache Hadoop 3 updates with migration story
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
YARN - Past, Present, & Future
YARN - Past, Present, & FutureYARN - Past, Present, & Future
YARN - Past, Present, & Future
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and FutureHadoop Summit San Jose 2015: YARN - Past, Present and Future
Hadoop Summit San Jose 2015: YARN - Past, Present and Future
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
Developing Applications with Hadoop 2.0 and YARN by Abhijit Lele
 
Running Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache HadoopRunning Non-MapReduce Big Data Applications on Apache Hadoop
Running Non-MapReduce Big Data Applications on Apache Hadoop
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Apache Hadoop YARN: best practices
Apache Hadoop YARN: best practicesApache Hadoop YARN: best practices
Apache Hadoop YARN: best practices
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFT
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 

More from DataWorks Summit

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 

Recently uploaded (20)

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 

Apache Hadoop YARN: State of the Union

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop YARN: State of the Union Sanjay Radia Founder, Chief Architect Hortonworks Apache Hadoop PMC
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved About the Speaker • Sanjay Radia • Chief Architect, Founder, Hortonworks • Apache Hadoop PMC and Committer • Part of the original Hadoop team at Yahoo! since 2007 • Chief Architect of Hadoop Core at Yahoo! • Prior • Data center automation, virtualization, Java, HA, OSs, File Systems • Startup, Sun Microsystems, INRIA… • Ph.D., University of Waterloo Page 2Architecting the Future of Big Data
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved • Introduction • Past • State of Union for YARN Agenda
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved A Brief Timeline from Past Year: GA Releases 2.8.0 2.9.0 3.0.0 3.1.0 • GPU/FPGA • YARN Native Service • Placement Constraints • YARN Federation • Opportunistic Container (Backported from 3.0) • New YARN UI • Timeline V2 • Global Scheduling • Multiple Resource types • New YARN UI • Timeline service V2 Ever involving requirements (computation intensive, larger, services) • Application Priority • Reservations • Node labels improvements 22 March ’17 17 Nov ’17 13 Dec ‘17 02 Aug ‘182.8.4 2.9.1 3.0.3 3.1.1
  • 5. 8 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop 3.0/3.1
  • 6. 9 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 7. 10 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 8. 11 © Hortonworks Inc. 2011–2018. All rights reserved • Many customers with clusters made up of large number of nodes • Oath (Yahoo!), Twitter, LinkedIn, Microsoft, Alibaba etc. • Previous largest clusters: 6K-8K • Now: 50K nodes under YARN in Microsoft [1] • Roadmap: To 100K and beyond [1] https://azure.microsoft.com/en-us/blog/how-microsoft-drives-exabyte-analytics-on-the-world-s-largest-yarn-cluster/ Looking at the Scale!
  • 9. 12 © Hortonworks Inc. 2011–2018. All rights reserved • Enables applications to scale to 100k of thousands of nodes • Federation divides a large (10-100k nodes) cluster into smaller units called sub-clusters • Federation negotiates with sub-clusters RM’s and provide resources to the application • Applications can schedule tasks on any node YARN Federation
  • 10. 13 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 11. 14 © Hortonworks Inc. 2011–2018. All rights reserved • YARN-5139 • Problems • Current design of one-node-at-a-time allocation cycle can lead to suboptimal decisions. • Performance and also hard to do global placement policies • Several coarse grained locks • With this, we improved to • Look at several nodes at a time • Fine grained locks • Multiple allocator threads • YARN scheduler can allocate 3k+ containers per second ≈ 10 mil allocations / hour! • 10X throughput gains • Much better placement decisions Moving Towards Global & Fast Scheduling
  • 12. 16 © Hortonworks Inc. 2011–2018. All rights reserved Better Placement Strategies (YARN-6592) • Past • Supported constraints in form of Node Locality • Now YARN can support a lot more use cases • Co-locate the allocations of a job on the same rack (affinity) • Spread allocations across machines (anti-affinity) to minimize resource interference • Allow up to a specific number of allocations in a node group (cardinality)
  • 13. 17 © Hortonworks Inc. 2011–2018. All rights reserved Better Placement Strategies (YARN-6592) Affinity Anti-affinity Region Server Region Server
  • 14. 18 © Hortonworks Inc. 2011–2018. All rights reserved Absolute Resources Configuration in CS – YARN-5881 • The cloud model! “Give me X resources, not X%” • Gives ability to configure Queue resources as below <memory=24GB, vcores=20, yarn.io/gpu=2> • Enables admins to assign different quotas of different resource-types • No more “Single percentage value limitation for all resource-types” root (memory=60G, vcores=40, gpu=10) sales (memory=40G, vcores=20, gpu=6) marketing (memory=10G, vcores=10, gpu=4) engineering (memory=10G, vcores=10, gpu=0)
  • 15. 20 © Hortonworks Inc. 2011–2018. All rights reserved Useful Scheduling Features from Recent 2.X Releases • Application Priorities • Queue Priorities • Previously : Resources to least satisfied queue • Queues can have priorities e.g. (production over dev)
  • 16. 21 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 17. 22 © Hortonworks Inc. 2011–2018. All rights reserved Usability: UI (1)
  • 18. 23 © Hortonworks Inc. 2011–2018. All rights reserved Usability: UI (2)
  • 19. 24 © Hortonworks Inc. 2011–2018. All rights reserved Timeline Service 2.0 • Application History • Where did my container run • Why is my app slow • Why is failing • What happened with my app • Cluster History • Which user had most utilization • Largest App • What happened in my cluster • … • Scalable & Robust Service • Using HBase as backend for better scale • More robust storage fault tolerance • Migration and compatibility with v.1.5
  • 20. 25 © Hortonworks Inc. 2011–2018. All rights reserved Usability: Queue & Logs API based queue management Decentralized (YARN-5734) Improved logs management (YARN-4904) Live application logs
  • 21. 26 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 22. 27 © Hortonworks Inc. 2011–2018. All rights reserved • Better Packaging model • Lightweight mechanism for packaging and resource isolation • Popularized and made accessible by Docker • Native integration ++ in YARN • Support for “Container Runtimes” in LCE: YARN-3611 • Native process runtime • Docker runtime • Many security/usability improves added to 3.0.x / 3.1.x Containers
  • 23. 28 © Hortonworks Inc. 2011–2018. All rights reserved Containers • Run both with and without Docker on the same cluster • Choose at run-time!
  • 24. 29 © Hortonworks Inc. 2011–2018. All rights reserved • Apache Spark applications have a complex set of required software dependencies • Docker on YARN helps to solve Spark package isolation issues with • PySpark - Python versions, packages • R - packages Spark on Docker in YARN
  • 25. 31 © Hortonworks Inc. 2011–2018. All rights reserved Hadoop Apps YARN on YARN… Towards a Private Cloud …YCloud YARN MR Tez Spark Tensor Flow YARN MR Tez Spark Spark
  • 26. 32 © Hortonworks Inc. 2011–2018. All rights reserved YCloud: YARN Based Container Cloud We Use for Testing ü Testing Hadoop on Hadoop!
  • 27. 33 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 28. 34 © Hortonworks Inc. 2011–2018. All rights reserved • YARN supported only Memory and CPU • Now • A generalized vector for all resources • Admin could add arbitrary resource types! Resource Profiles and Custom Resource Types • Ease of resource requesting model using profiles for apps Profile Memory CPU GPU Small 2 GB 4 Cores 0 Cores Medium 4 GB 8 Cores 0 Cores Large 16 GB 16 Cores 4 CoresMemory CPU GPU FPGA Node Manager
  • 29. 35 © Hortonworks Inc. 2011–2018. All rights reserved • Why? • No need to setup separate clusters • Leverage shared compute! • Why need isolation? • Multiple processes use the single GPU will be: • Serialized. • Cause OOM easily. • GPU isolation on YARN: • Granularity is for per-GPU device. • Use Cgroups / Docker to enforce isolation. GPU Support on YARN Tensorflow 1.2 Nginx AppUbuntu 14:04 Nginx AppHost OS GPU Base Lib v1 Volume Mount CUDA Library 5.0
  • 30. 36 © Hortonworks Inc. 2011–2018. All rights reserved • FPGA isolation on YARN: . • Granularity is for per-FPGA device. • Use Cgroups to enforce the isolation. • Currently, only Intel OpenCL SDK for FPGA is supported. • But implementation is extensible to other FPGA SDK. FPGA on YARN
  • 31. 37 © Hortonworks Inc. 2011–2018. All rights reserved Key Themes Scale Platform Themes Workload Themes Scheduling Usability Containers Resources Services
  • 32. 38 © Hortonworks Inc. 2011–2018. All rights reserved Services Support in YARN • In the past we have focused on Jobs and have an advanced fine grained scheduler • A native YARN services framework (YARN-4692) • Manage services and their lifecycle • Scale up and down, restart etc • Support for Upgrades (YARN-4726 • “Do an upgrade of my HBase app with minimal impact to end-users • Apache Slider retired from Incubator – lessons and key code carried over to YARN • Simplified discovery of services via DNS mechanisms: YARN-4757 • regionserver-0.hbase-app-3.hadoop.yarn.site
  • 33. 39 © Hortonworks Inc. 2011–2018. All rights reserved • Applications need simple APIs • Need to be deployable “easily” • Simple REST API layer fronting YARN • YARN-4793 Simplified API layer • Spawn services & Manage them Simplified APIs for Service Definitions
  • 34. 41 © Hortonworks Inc. 2011–2018. All rights reserved LLAP on YARN • Apache Hive LLAP is a key long running application • Used for query processing • Designed to run on a shared multi-tenant YARN cluster
  • 35. 42 © Hortonworks Inc. 2011–2018. All rights reserved Application Timeout – YARN-3813 • Controlling running time of workloads in YARN • Define lifetime for an application anytime for YARN to manage. • “Give me resources for this app/service but kill it after 15 days” Reservation-based Scheduling: If You’re Late Don’t Blame Us! - Carlo, et al. 2015
  • 36. 43 © Hortonworks Inc. 2011–2018. All rights reserved Apache Hadoop 3.2 and Beyond
  • 37. 44 © Hortonworks Inc. 2011–2018. All rights reserved • Every user says “Give me 16GB for my task”, even though it’s only needed at peak • Each node has some allocated but unutilized capacity. Use such capacity to run opportunistic tasks • Preempt such tasks when needed Container Overcommit (YARN-1011) Un-allocated Allocated Utilized Allocated- but-unutilized Node1 Guaranteed Containers Node1 Opportunistic Containers Preemption When necessary
  • 38. 45 © Hortonworks Inc. 2011–2018. All rights reserved • “Take me to a node with JDK 10” • Node Partition vs. Node Attribute • Partition: • One partition for one node • ACL • Shares between queues • Preemption enforced. • Attribute: • For container placement • No ACL/Shares on attributes • First-come-first-serve Node Attributes (YARN-3409) Partition - 1 (15 nodes) Queue-A (40%) Queue-B (60%) Partition - 2 (15 nodes) A (25%) B (25%) C (25%) D (25%) Node 1 os.type=ubuntu os.version=14.10 glibc.version=2.20 JDK.version=8u20 Node 2 os.type=RHEL os.version=5.1 GPU.type=x86_64 JDK.version=7u20 Node 16 os.type=windows os.version=7 JDK.version=8u20 Node 17 os.type=SUSE os.version=12 GPU.type=i686 JDK.version=7u20
  • 39. 48 © Hortonworks Inc. 2011–2018. All rights reserved Questions?