SlideShare a Scribd company logo
1 of 27
© Hortonworks Inc. 2014
Apache Accumulo on YARN
with Apache Slider
Billie Rinaldi
Sr. Member of Technical Staff
Hortonworks, Inc.
June 12, 2014
Page 1
Apache, Accumulo, Slider, Ambari, Hadoop, Yarn, Apache Accumulo, Apache Slider, Apache Ambari,
and the Accumulo logo are trademarks of the Apache Software Foundation.
© Hortonworks Inc. 2014
Topics
•What is YARN?
•Why would you want to run Accumulo on YARN?
•What is Slider and why is it needed?
•How is Accumulo deployed & managed with Slider?
Page 2
© Hortonworks Inc. 2014
Getting more from Hadoop
HADOOP 1.0
HDFS
(redundant, reliable storage)
MapReduce
(cluster resource management
& data processing)
HDFS2
(redundant, reliable storage)
YARN
(cluster resource management)
MapReduce
(data processing)
App X
(data processing)
HADOOP 2.0
Failure handling and resource management
are no longer just for MapReduce
… and this separation enables much more flexibility
Page 3
App Y
(data processing)
Primarily Batch Batch, Interactive, Online, Streaming, …
© Hortonworks Inc. 2014
App on YARN Use Cases
•Small app clusters in a large YARN cluster
•Dynamic clusters
•Self-healing clusters
•Elastic clusters
•Transient clusters for workflows
•Custom versions & configurations
•More efficient utilization/sharing
Page 4
© Hortonworks Inc. 2014
YARN Structure
Page 5
HDFS
YARN Node Manager
HDFS
YARN Node Manager
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
• Servers run YARN Node Managers
• NM's heartbeat to Resource Manager
• RM schedules work over cluster
• RM allocates containers to apps
• NMs start containers
• NMs report container health
© Hortonworks Inc. 2014
Client Creates App Master
Page 6
HDFS
YARN Node Manager
HDFS
YARN Node Manager
Client
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
Application Master
© Hortonworks Inc. 2014
AM Asks for Containers
Page 7
HDFS
YARN Node Manager
HDFS
YARN Node Manager
Container
Container
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
Container
Application Master
© Hortonworks Inc. 2014
YARN Notifies AM of Failures
Page 8
HDFS
YARN Node Manager
HDFS
YARN Node Manager
ContainerContainer
Container
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
Application Master
© Hortonworks Inc. 2014
Issues to Consider
•Do I need to re-write parts of my application?
•How do I package my application for YARN?
•How do I configure my application?
•How do I debug my application?
•Can I still manage my application?
•Can I monitor my application?
•Can I manage inter-/intra-application
dependencies?
•How will the external clients communicate?
•What does it take to secure the application?
Page 9
© Hortonworks Inc. 2014
Apache Slider
Apache Slider is a project in incubation at the Apache Software
Foundation with the goal of making it possible and easy to deploy
existing applications onto a YARN cluster
• History
– HBase on YARN (HOYA)
– AccumuloProvider/HBaseProvider on YARN
– Agent Provider + App Packages for Accumulo/HBase/Storm/…
• Goals for long-lived applications
– Execute management operations (Start/Stop, Reconfigure, Scale
up/down, Rolling-restart, Decommission/Recommission, Upgrade)
– Detect and remedy failures
– Manage logs
– Monitor (Ganglia, JMX)
Page 10
© Hortonworks Inc. 2014
Components of Slider
Page 11
Slider
App
Package
Slider
CLI
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
Agent Comp. Inst.
HDFS
YARN Node Manager
Agent Comp. Inst.
App Master / Agent Provider
Registry
• AppMaster
• AgentProvider
• Agent
• Component Instance
• AppPackage
• CLI
• Registry
© Hortonworks Inc. 2014
Application by Slider
Page 12
Similar to any YARN application
1. CLI starts an instance of the AM
2. AM requests containers
3. Containers activate with an Agent
4. Agent gets application definition
5. Agent registers with AM
6. AM issues commands
7. Agent reports back status,
configuration, etc.
8. AM publishes endpoints,
configurations
Slider
App
Package
Slider
CLI
HDFS
YARN Resource Manager
“The RM”
HDFS
YARN Node Manager
Agent Comp. Inst.
HDFS
YARN Node Manager
Agent Comp. Inst.
Application
Registry
App Master/Agent Provider
AM commands
install, start, stop, status, …
CLI commands
create, freeze, thaw, flex, destroy
© Hortonworks Inc. 2014
Accumulo Slider App Package
Page 13
© Hortonworks Inc. 2014
Slider Metainfo
Page 14
<metainfo><services><service>
<name>ACCUMULO</name>
<version>1.5.1</version>
<exportGroups><exportGroup>
<name>QuickLinks</name>
<exports><export>
<name>org.apache.slider.monitor</name>
<value>http://${ACCUMULO_MONITOR_HOST}:${site.accumulo-site.monitor.port.client}
</export></exports></exportGroup></exportGroups>
<commandOrders><commandOrder>
<command>ACCUMULO_TSERVER-START</command>
<requires>ACCUMULO_MASTER-STARTED</requires>
</commandOrder></commandOrders>
<components><component>
<name>ACCUMULO_MASTER</name>
<category>MASTER</category>
<minInstanceCount>1</minInstanceCount>
<commandScript>
<script>scripts/accumulo_master.py</script>
</commandScript></component></components>
</service></services></metainfo>
Application Info
Commands have
dependencies
URIs can be
published
Component
information
Commands are
implemented as
scripts
© Hortonworks Inc. 2014
Slider App Resource Spec
Page 15
{
"schema": "http://example.org/specification/v2.0.0",
"metadata": {
},
"global": {
},
"components": {
"ACCUMULO_MASTER": {
"yarn.role.priority": "1",
"yarn.component.instances": "1"
},
"slider-appmaster": {
},
"ACCUMULO_TSERVER": {
"yarn.role.priority": "2",
"yarn.component.instances": "1"
},
"ACCUMULO_MONITOR": {
"yarn.role.priority": "3",
"yarn.component.instances": "1"
},
YARN resource
requirements
Unique priorities
"ACCUMULO_GC": {
"yarn.role.priority": "4",
"yarn.component.instances": "1"
},
"ACCUMULO_TRACER": {
"yarn.role.priority": "5",
"yarn.component.instances": "1"
}
}
}
© Hortonworks Inc. 2014
Slider AppConfig Spec
Page 16
{
"application.def": "/slider/accumulo_v151.zip",
"java_home": "/usr/jdk64/jdk1.7.0_45",
"site.global.app_log_dir": "${AGENT_LOG_ROOT}/app/log",
"site.global.app_pid_dir": "${AGENT_WORK_ROOT}/app/run",
"site.global.tserver_heapsize": "128m",
"site.global.hadoop_prefix": "/usr/lib/hadoop",
"site.global.zookeeper_home": "/usr/lib/zookeeper",
"site.global.accumulo_instance_name": "instancename",
"site.global.accumulo_root_password": "secret",
"site.accumulo-site.instance.dfs.dir": "/apps/accumulo/data",
"site.accumulo-site.master.port.client": "0",
"site.accumulo-site.trace.port.client": "0",
"site.accumulo-site.tserver.port.client": "0",
"site.accumulo-site.gc.port.client": "0",
"site.accumulo-site.monitor.port.log4j": "0",
"site.accumulo-site.monitor.port.client": "${ACCUMULO_MONITOR.ALLOCATED_PORT}",
"site.accumulo-site.instance.zookeeper.host": "${ZK_HOST}",
}
Configurations needed by
Slider
Named variables
Site variables for
application
Named variables
for cluster details
Allocate and
advertise
Variables for the
application scripts
(a representative sampling of various types of configuration parameters)
© Hortonworks Inc. 2014
Slider Install
• Set up Local Install
• Set up HDFS
Page 17
/slider/accumulo_v151.zip
/slider/agent
/slider/agent/conf
/slider/agent/conf/agent.ini
/slider/agent/slider-agent.tar.gz
Plus any additional directories needed by the app
mvn clean package –DskipTests (builds tarball)
Get slider-0.31.0-incubating-SNAPSHOT-all.tar.gz
from slider-assembly/target/
Untar tarball in desired directory
Edit conf/slider-client.xml:
yarn.application.classpath
slider.zookeeper.quorum
yarn.resourcemanager.address
yarn.resourcemanager.scheduler.address
fs.defaultFS
© Hortonworks Inc. 2014
Slider Execution
• Create an Accumulo instance
• Modify an existing instance
Page 18
bin/slider create name
--image hdfs://c6401.ambari.apache.org:8020/
slider/agent/slider-agent.tar.gz
--template appConfig.json
--resources resources.json
bin/slider freeze name
bin/slider thaw name
bin/slider destroy name
bin/slider flex name --component ACCUMULO_TSERVER 2
© Hortonworks Inc. 2014
Managing a YARN Application
Goal is to have Slider integrate with any application management
framework, e.g. Ambari
Apache Ambari is an open source framework for provisioning,
managing and monitoring Apache Hadoop clusters
• Ambari Views allows development of custom user interfaces
• Slider App View will deploy, monitor, manage YARN apps using Slider,
embedded in Ambari (currently, Tech Preview)
Page 19
Ambari
Server
Ambari
Web FE
View
UI
View
BE
Slider
CLI
HDFS
YARN Node Manager
HDFS
YARN Node Manager
HDFS
YARN Node Manager
© Hortonworks Inc. 2014
Page 20
© Hortonworks Inc. 2014
Page 21
© Hortonworks Inc. 2014
What’s Next in Slider
Page 22
•Lock-in Application Specification
•Integration with the YARN Registry
•Inter/Intra-Application Dependencies
•Robust failure handling
•Improved debugging
•Security
•More applications!
© Hortonworks Inc. 2014
YARN-896: Long-Lived Apps
•Container reconnect on AM restart – mostly
complete
•Token renewal on long-lived apps – patch available
•Containers: signaling, >1 process sequence
•AM/RM managed gang scheduling
•Anti-affinity hint in container requests
•ZK Service Registry
•Logging
Page 23
© Hortonworks Inc. 2014
Slider is Seeking Contributors
• Bring Your Favorite Applications to YARN
–Create packages, give feedback, create patches, …
• Useful Links
–Source: https://git-wip-us.apache.org/repos/asf/incubator-slider.git
–Website: http://slider.incubator.apache.org
–Mailing List: dev@slider.incubator.apache.org
–JIRA: https://issues.apache.org/jira/browse/SLIDER
• Current and Upcoming Releases
–Slider 0.30-incubating (May)
–Slider 0.40-incubating (planned)
Page 24
© Hortonworks Inc. 2014
Questions?
billie@hortonworks.com
dev@accumulo.apache.org
user@accumulo.apache.org
IRC #accumulo
Page 25
© Hortonworks Inc. 2014
AM Restart – leading edge
Page 26
NodeMap
model of YARN cluster
ComponentHistory
persistent history of
component placements
Specification
resources.json &c
Container Queues
requested, starting,
releasing
Component Map
container ID -> component
instance
Event History
application history
Persisted in HDFS Rebuilt Transient
ctx.setKeepContainersAcrossApplicationAttempts(true)
© Hortonworks Inc. 2014
Application Registry
Page 27
• A common problem (not specific to Slider)
s://issues.apache.org/jira/browse/YARN-913
• Current
– Apache Curator based
– Register URLs pointing to actual data
– AM doubles up as a webserver for published data
• Future
– Registry should be stand-alone
– Slider is a consumer as well as publisher
– Slider focuses on declarative solution for Applications to publish data
– Allows integration of Applications independent of how they are hosted

More Related Content

What's hot

Past, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariPast, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariArtem Ervits
 
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...Steven Davelaar
 
Moving complex enterprise ecommerce systems to the cloud
Moving complex enterprise ecommerce systems to the cloudMoving complex enterprise ecommerce systems to the cloud
Moving complex enterprise ecommerce systems to the cloudElastic Path
 
Oracle ADF Architecture TV - Design - Designing for Internationalization
Oracle ADF Architecture TV - Design - Designing for InternationalizationOracle ADF Architecture TV - Design - Designing for Internationalization
Oracle ADF Architecture TV - Design - Designing for InternationalizationChris Muir
 
Oracle ADF Architecture TV - Planning & Getting Started - Team, Skills and D...
Oracle ADF Architecture TV -  Planning & Getting Started - Team, Skills and D...Oracle ADF Architecture TV -  Planning & Getting Started - Team, Skills and D...
Oracle ADF Architecture TV - Planning & Getting Started - Team, Skills and D...Chris Muir
 
Oracle ADF Architecture TV - Deployment - Build Options
Oracle ADF Architecture TV - Deployment - Build OptionsOracle ADF Architecture TV - Deployment - Build Options
Oracle ADF Architecture TV - Deployment - Build OptionsChris Muir
 
Let's Talk Mobile
Let's Talk MobileLet's Talk Mobile
Let's Talk MobileChris Muir
 
Kellyn Pot'Vin-Gorman - Power awr warehouse2
Kellyn Pot'Vin-Gorman - Power awr warehouse2Kellyn Pot'Vin-Gorman - Power awr warehouse2
Kellyn Pot'Vin-Gorman - Power awr warehouse2gaougorg
 
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...Chris Muir
 
Oracle REST Data Services: Options for your Web Services
Oracle REST Data Services: Options for your Web ServicesOracle REST Data Services: Options for your Web Services
Oracle REST Data Services: Options for your Web ServicesJeff Smith
 
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks
 
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...jeckels
 
Oracle Cloud: Anything as a Service
Oracle Cloud: Anything as a ServiceOracle Cloud: Anything as a Service
Oracle Cloud: Anything as a ServiceBruno Borges
 
AEM 6.0 - Author UI Customization & Features
AEM 6.0 - Author UI Customization & FeaturesAEM 6.0 - Author UI Customization & Features
AEM 6.0 - Author UI Customization & FeaturesAbhinit Bhatnagar
 
MySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and ScalabilityMySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and ScalabilityShivji Kumar Jha
 

What's hot (15)

Past, Present and Future of Apache Ambari
Past, Present and Future of Apache AmbariPast, Present and Future of Apache Ambari
Past, Present and Future of Apache Ambari
 
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...
The Mobile Enterprise in Action: Managing Business Processes from Your Mobile...
 
Moving complex enterprise ecommerce systems to the cloud
Moving complex enterprise ecommerce systems to the cloudMoving complex enterprise ecommerce systems to the cloud
Moving complex enterprise ecommerce systems to the cloud
 
Oracle ADF Architecture TV - Design - Designing for Internationalization
Oracle ADF Architecture TV - Design - Designing for InternationalizationOracle ADF Architecture TV - Design - Designing for Internationalization
Oracle ADF Architecture TV - Design - Designing for Internationalization
 
Oracle ADF Architecture TV - Planning & Getting Started - Team, Skills and D...
Oracle ADF Architecture TV -  Planning & Getting Started - Team, Skills and D...Oracle ADF Architecture TV -  Planning & Getting Started - Team, Skills and D...
Oracle ADF Architecture TV - Planning & Getting Started - Team, Skills and D...
 
Oracle ADF Architecture TV - Deployment - Build Options
Oracle ADF Architecture TV - Deployment - Build OptionsOracle ADF Architecture TV - Deployment - Build Options
Oracle ADF Architecture TV - Deployment - Build Options
 
Let's Talk Mobile
Let's Talk MobileLet's Talk Mobile
Let's Talk Mobile
 
Kellyn Pot'Vin-Gorman - Power awr warehouse2
Kellyn Pot'Vin-Gorman - Power awr warehouse2Kellyn Pot'Vin-Gorman - Power awr warehouse2
Kellyn Pot'Vin-Gorman - Power awr warehouse2
 
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...
CRUX (CRUD meets UX) Case Study: Building a Modern Applications User Experien...
 
Oracle REST Data Services: Options for your Web Services
Oracle REST Data Services: Options for your Web ServicesOracle REST Data Services: Options for your Web Services
Oracle REST Data Services: Options for your Web Services
 
Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14Hortonworks Hadoop summit 2011 keynote - eric14
Hortonworks Hadoop summit 2011 keynote - eric14
 
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
 
Oracle Cloud: Anything as a Service
Oracle Cloud: Anything as a ServiceOracle Cloud: Anything as a Service
Oracle Cloud: Anything as a Service
 
AEM 6.0 - Author UI Customization & Features
AEM 6.0 - Author UI Customization & FeaturesAEM 6.0 - Author UI Customization & Features
AEM 6.0 - Author UI Customization & Features
 
MySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and ScalabilityMySQL Developer Day conference: MySQL Replication and Scalability
MySQL Developer Day conference: MySQL Replication and Scalability
 

Similar to Running Apache Accumulo on YARN with Apache Slider

Bring your Service to YARN
Bring your Service to YARNBring your Service to YARN
Bring your Service to YARNDataWorks Summit
 
YARN Ready - Integrating to YARN using Slider Webinar
YARN Ready - Integrating to YARN using Slider WebinarYARN Ready - Integrating to YARN using Slider Webinar
YARN Ready - Integrating to YARN using Slider WebinarHortonworks
 
Slider: Applications on YARN
Slider: Applications on YARNSlider: Applications on YARN
Slider: Applications on YARNSteve Loughran
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Hortonworks
 
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native ServicesAccumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native ServicesAccumulo Summit
 
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSDiscover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSHortonworks
 
Get most out of Spark on YARN
Get most out of Spark on YARNGet most out of Spark on YARN
Get most out of Spark on YARNDataWorks Summit
 
Overview of slider project
Overview of slider projectOverview of slider project
Overview of slider projectSteve Loughran
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramHortonworks
 
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNHortonworks
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Hortonworks
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Hortonworks
 
What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014InMobi Technology
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnhdhappy001
 
YARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute PlatformYARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute PlatformBikas Saha
 
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopApache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopHortonworks
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Mac Moore
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopPOSSCON
 

Similar to Running Apache Accumulo on YARN with Apache Slider (20)

October 2014 HUG : Apache Slider
October 2014 HUG : Apache SliderOctober 2014 HUG : Apache Slider
October 2014 HUG : Apache Slider
 
Bring your Service to YARN
Bring your Service to YARNBring your Service to YARN
Bring your Service to YARN
 
Apache Slider
Apache SliderApache Slider
Apache Slider
 
YARN Ready - Integrating to YARN using Slider Webinar
YARN Ready - Integrating to YARN using Slider WebinarYARN Ready - Integrating to YARN using Slider Webinar
YARN Ready - Integrating to YARN using Slider Webinar
 
Slider: Applications on YARN
Slider: Applications on YARNSlider: Applications on YARN
Slider: Applications on YARN
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014
 
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native ServicesAccumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
Accumulo Summit 2016: Apache Accumulo on Docker with YARN Native Services
 
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFSDiscover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
 
Get most out of Spark on YARN
Get most out of Spark on YARNGet most out of Spark on YARN
Get most out of Spark on YARN
 
Overview of slider project
Overview of slider projectOverview of slider project
Overview of slider project
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready Program
 
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARNApache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARN
 
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
Discover HDP 2.1: Using Apache Ambari to Manage Hadoop Clusters
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]
 
What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014
 
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarnBikas saha:the next generation of hadoop– hadoop 2 and yarn
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
 
YARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute PlatformYARN - Hadoop Next Generation Compute Platform
YARN - Hadoop Next Generation Compute Platform
 
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of HadoopApache Hadoop YARN: Understanding the Data Operating System of Hadoop
Apache Hadoop YARN: Understanding the Data Operating System of Hadoop
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in Hadoop
 

Recently uploaded

Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 

Recently uploaded (20)

Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 

Running Apache Accumulo on YARN with Apache Slider

  • 1. © Hortonworks Inc. 2014 Apache Accumulo on YARN with Apache Slider Billie Rinaldi Sr. Member of Technical Staff Hortonworks, Inc. June 12, 2014 Page 1 Apache, Accumulo, Slider, Ambari, Hadoop, Yarn, Apache Accumulo, Apache Slider, Apache Ambari, and the Accumulo logo are trademarks of the Apache Software Foundation.
  • 2. © Hortonworks Inc. 2014 Topics •What is YARN? •Why would you want to run Accumulo on YARN? •What is Slider and why is it needed? •How is Accumulo deployed & managed with Slider? Page 2
  • 3. © Hortonworks Inc. 2014 Getting more from Hadoop HADOOP 1.0 HDFS (redundant, reliable storage) MapReduce (cluster resource management & data processing) HDFS2 (redundant, reliable storage) YARN (cluster resource management) MapReduce (data processing) App X (data processing) HADOOP 2.0 Failure handling and resource management are no longer just for MapReduce … and this separation enables much more flexibility Page 3 App Y (data processing) Primarily Batch Batch, Interactive, Online, Streaming, …
  • 4. © Hortonworks Inc. 2014 App on YARN Use Cases •Small app clusters in a large YARN cluster •Dynamic clusters •Self-healing clusters •Elastic clusters •Transient clusters for workflows •Custom versions & configurations •More efficient utilization/sharing Page 4
  • 5. © Hortonworks Inc. 2014 YARN Structure Page 5 HDFS YARN Node Manager HDFS YARN Node Manager HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager • Servers run YARN Node Managers • NM's heartbeat to Resource Manager • RM schedules work over cluster • RM allocates containers to apps • NMs start containers • NMs report container health
  • 6. © Hortonworks Inc. 2014 Client Creates App Master Page 6 HDFS YARN Node Manager HDFS YARN Node Manager Client HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager Application Master
  • 7. © Hortonworks Inc. 2014 AM Asks for Containers Page 7 HDFS YARN Node Manager HDFS YARN Node Manager Container Container HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager Container Application Master
  • 8. © Hortonworks Inc. 2014 YARN Notifies AM of Failures Page 8 HDFS YARN Node Manager HDFS YARN Node Manager ContainerContainer Container HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager Application Master
  • 9. © Hortonworks Inc. 2014 Issues to Consider •Do I need to re-write parts of my application? •How do I package my application for YARN? •How do I configure my application? •How do I debug my application? •Can I still manage my application? •Can I monitor my application? •Can I manage inter-/intra-application dependencies? •How will the external clients communicate? •What does it take to secure the application? Page 9
  • 10. © Hortonworks Inc. 2014 Apache Slider Apache Slider is a project in incubation at the Apache Software Foundation with the goal of making it possible and easy to deploy existing applications onto a YARN cluster • History – HBase on YARN (HOYA) – AccumuloProvider/HBaseProvider on YARN – Agent Provider + App Packages for Accumulo/HBase/Storm/… • Goals for long-lived applications – Execute management operations (Start/Stop, Reconfigure, Scale up/down, Rolling-restart, Decommission/Recommission, Upgrade) – Detect and remedy failures – Manage logs – Monitor (Ganglia, JMX) Page 10
  • 11. © Hortonworks Inc. 2014 Components of Slider Page 11 Slider App Package Slider CLI HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager Agent Comp. Inst. HDFS YARN Node Manager Agent Comp. Inst. App Master / Agent Provider Registry • AppMaster • AgentProvider • Agent • Component Instance • AppPackage • CLI • Registry
  • 12. © Hortonworks Inc. 2014 Application by Slider Page 12 Similar to any YARN application 1. CLI starts an instance of the AM 2. AM requests containers 3. Containers activate with an Agent 4. Agent gets application definition 5. Agent registers with AM 6. AM issues commands 7. Agent reports back status, configuration, etc. 8. AM publishes endpoints, configurations Slider App Package Slider CLI HDFS YARN Resource Manager “The RM” HDFS YARN Node Manager Agent Comp. Inst. HDFS YARN Node Manager Agent Comp. Inst. Application Registry App Master/Agent Provider AM commands install, start, stop, status, … CLI commands create, freeze, thaw, flex, destroy
  • 13. © Hortonworks Inc. 2014 Accumulo Slider App Package Page 13
  • 14. © Hortonworks Inc. 2014 Slider Metainfo Page 14 <metainfo><services><service> <name>ACCUMULO</name> <version>1.5.1</version> <exportGroups><exportGroup> <name>QuickLinks</name> <exports><export> <name>org.apache.slider.monitor</name> <value>http://${ACCUMULO_MONITOR_HOST}:${site.accumulo-site.monitor.port.client} </export></exports></exportGroup></exportGroups> <commandOrders><commandOrder> <command>ACCUMULO_TSERVER-START</command> <requires>ACCUMULO_MASTER-STARTED</requires> </commandOrder></commandOrders> <components><component> <name>ACCUMULO_MASTER</name> <category>MASTER</category> <minInstanceCount>1</minInstanceCount> <commandScript> <script>scripts/accumulo_master.py</script> </commandScript></component></components> </service></services></metainfo> Application Info Commands have dependencies URIs can be published Component information Commands are implemented as scripts
  • 15. © Hortonworks Inc. 2014 Slider App Resource Spec Page 15 { "schema": "http://example.org/specification/v2.0.0", "metadata": { }, "global": { }, "components": { "ACCUMULO_MASTER": { "yarn.role.priority": "1", "yarn.component.instances": "1" }, "slider-appmaster": { }, "ACCUMULO_TSERVER": { "yarn.role.priority": "2", "yarn.component.instances": "1" }, "ACCUMULO_MONITOR": { "yarn.role.priority": "3", "yarn.component.instances": "1" }, YARN resource requirements Unique priorities "ACCUMULO_GC": { "yarn.role.priority": "4", "yarn.component.instances": "1" }, "ACCUMULO_TRACER": { "yarn.role.priority": "5", "yarn.component.instances": "1" } } }
  • 16. © Hortonworks Inc. 2014 Slider AppConfig Spec Page 16 { "application.def": "/slider/accumulo_v151.zip", "java_home": "/usr/jdk64/jdk1.7.0_45", "site.global.app_log_dir": "${AGENT_LOG_ROOT}/app/log", "site.global.app_pid_dir": "${AGENT_WORK_ROOT}/app/run", "site.global.tserver_heapsize": "128m", "site.global.hadoop_prefix": "/usr/lib/hadoop", "site.global.zookeeper_home": "/usr/lib/zookeeper", "site.global.accumulo_instance_name": "instancename", "site.global.accumulo_root_password": "secret", "site.accumulo-site.instance.dfs.dir": "/apps/accumulo/data", "site.accumulo-site.master.port.client": "0", "site.accumulo-site.trace.port.client": "0", "site.accumulo-site.tserver.port.client": "0", "site.accumulo-site.gc.port.client": "0", "site.accumulo-site.monitor.port.log4j": "0", "site.accumulo-site.monitor.port.client": "${ACCUMULO_MONITOR.ALLOCATED_PORT}", "site.accumulo-site.instance.zookeeper.host": "${ZK_HOST}", } Configurations needed by Slider Named variables Site variables for application Named variables for cluster details Allocate and advertise Variables for the application scripts (a representative sampling of various types of configuration parameters)
  • 17. © Hortonworks Inc. 2014 Slider Install • Set up Local Install • Set up HDFS Page 17 /slider/accumulo_v151.zip /slider/agent /slider/agent/conf /slider/agent/conf/agent.ini /slider/agent/slider-agent.tar.gz Plus any additional directories needed by the app mvn clean package –DskipTests (builds tarball) Get slider-0.31.0-incubating-SNAPSHOT-all.tar.gz from slider-assembly/target/ Untar tarball in desired directory Edit conf/slider-client.xml: yarn.application.classpath slider.zookeeper.quorum yarn.resourcemanager.address yarn.resourcemanager.scheduler.address fs.defaultFS
  • 18. © Hortonworks Inc. 2014 Slider Execution • Create an Accumulo instance • Modify an existing instance Page 18 bin/slider create name --image hdfs://c6401.ambari.apache.org:8020/ slider/agent/slider-agent.tar.gz --template appConfig.json --resources resources.json bin/slider freeze name bin/slider thaw name bin/slider destroy name bin/slider flex name --component ACCUMULO_TSERVER 2
  • 19. © Hortonworks Inc. 2014 Managing a YARN Application Goal is to have Slider integrate with any application management framework, e.g. Ambari Apache Ambari is an open source framework for provisioning, managing and monitoring Apache Hadoop clusters • Ambari Views allows development of custom user interfaces • Slider App View will deploy, monitor, manage YARN apps using Slider, embedded in Ambari (currently, Tech Preview) Page 19 Ambari Server Ambari Web FE View UI View BE Slider CLI HDFS YARN Node Manager HDFS YARN Node Manager HDFS YARN Node Manager
  • 20. © Hortonworks Inc. 2014 Page 20
  • 21. © Hortonworks Inc. 2014 Page 21
  • 22. © Hortonworks Inc. 2014 What’s Next in Slider Page 22 •Lock-in Application Specification •Integration with the YARN Registry •Inter/Intra-Application Dependencies •Robust failure handling •Improved debugging •Security •More applications!
  • 23. © Hortonworks Inc. 2014 YARN-896: Long-Lived Apps •Container reconnect on AM restart – mostly complete •Token renewal on long-lived apps – patch available •Containers: signaling, >1 process sequence •AM/RM managed gang scheduling •Anti-affinity hint in container requests •ZK Service Registry •Logging Page 23
  • 24. © Hortonworks Inc. 2014 Slider is Seeking Contributors • Bring Your Favorite Applications to YARN –Create packages, give feedback, create patches, … • Useful Links –Source: https://git-wip-us.apache.org/repos/asf/incubator-slider.git –Website: http://slider.incubator.apache.org –Mailing List: dev@slider.incubator.apache.org –JIRA: https://issues.apache.org/jira/browse/SLIDER • Current and Upcoming Releases –Slider 0.30-incubating (May) –Slider 0.40-incubating (planned) Page 24
  • 25. © Hortonworks Inc. 2014 Questions? billie@hortonworks.com dev@accumulo.apache.org user@accumulo.apache.org IRC #accumulo Page 25
  • 26. © Hortonworks Inc. 2014 AM Restart – leading edge Page 26 NodeMap model of YARN cluster ComponentHistory persistent history of component placements Specification resources.json &c Container Queues requested, starting, releasing Component Map container ID -> component instance Event History application history Persisted in HDFS Rebuilt Transient ctx.setKeepContainersAcrossApplicationAttempts(true)
  • 27. © Hortonworks Inc. 2014 Application Registry Page 27 • A common problem (not specific to Slider) s://issues.apache.org/jira/browse/YARN-913 • Current – Apache Curator based – Register URLs pointing to actual data – AM doubles up as a webserver for published data • Future – Registry should be stand-alone – Slider is a consumer as well as publisher – Slider focuses on declarative solution for Applications to publish data – Allows integration of Applications independent of how they are hosted

Editor's Notes

  1. Client talks to the Yarn ResourceManager, which gets a container from a NodeManager and starts the App Master (AM)
  2. AM requests containers for the other specified roles of the cluster (e.g. tserver, monitor) and launches those processes