SlideShare a Scribd company logo
1 of 10
Stacki Lab
Joe Kaiser
Director of Open Source Engineering
Open Source Stack Installer
Stacki is a very fast and ultra reliable Linux server provisioning tool … at scale.
With zero prerequisites for taking systems from bare metal to a ping and prompt.
Stuff it does
 Installs to bare metal or VMs that look like bare metal
 Kickstart based
 Parallel formatting of disk
 Parallel Sharing of RPMs
 CentOS/RHEL
 Networking
 Multiple subnets, vlaning, bonding.
 Storage
 Controller config
 Partitioning
Whatever you can do in Linux, you can do with Stacki only clustered
Stacki and Hortonworks Data Platform
Hortonworks – stacki-hdp-bridge pallet
 Add/enable/run
 Stacki creates ambari appliance
 Download software as pallets (isos)
 Add/enable HDP and Ambari
 Run gethdp script
 Or download (links in documentation)
 Assign a node to be Ambari deployment.
 Do partitions and preps backend nodes for Hadoop.
 Install all.
 Go to Ambari interface to deploy Hadoop.
 Current versions: (updated /export/HDP/hdp.cfg for new versions)
 distribution = 2.x
 os = centos7
 ambari = 2.4.2.0
 hdp = 2.5.3.0
Augment the “default” Box
1) Replace the “os” pallet with the “CentOS” and “CentOS-
Updates” pallets
2) Add the Hortonworks pallets: “HDP”, “HDP-UTILS”,
“Updates-ambari” or run /export/HDP/gethdp.py
3) Add a pallet to glue the two layers together: “stacki-
hdp-bridge”
Specify One Host as the “Ambari” appliance
“backend-0-0” will be automatically configured with Ambari
Install the node and then point your web browser at it
Reinstall All Backend Nodes
Wipe all hardware disk array configuration and rebuild all the LUNs:
◦ stack set host attr ambari backend attr=nukecontroller value=true
Remove all partitions then repartition and reformat the disks:
◦ stack set host attr ambari backend attr=nukedisks value=true
Instruct nodes to install on next PXE boot:
◦ stack set host boot ambari backend action=install
Try It
 Website
www.stacki.com
 Vagrant tire kick
https://github.com/rfkrocktk/vagrant-stacki
Source Code and docs
github.com/stackiq/stacki
github.com/StackIQ/stacki-hdp-bridge
Slack Channel (because everyone)
Google Groups
groups.google.com/forum/#!forum/stacki
Finis
Thanks

More Related Content

What's hot

Luci, ricci and the rac bc
Luci, ricci and the rac bcLuci, ricci and the rac bc
Luci, ricci and the rac bc
fauzg
 
OSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
OSMC 2019 | Ignite | Power your Datacenter by Toshaan BharvaniOSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
OSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
NETWAYS
 

What's hot (20)

Deploying Alluxio in the Cloud for Machine Learning
Deploying Alluxio in the Cloud for Machine LearningDeploying Alluxio in the Cloud for Machine Learning
Deploying Alluxio in the Cloud for Machine Learning
 
DevOpsDaysRiga 2017 Ignite: Toshaan Bharvani - POWER your DC
DevOpsDaysRiga 2017 Ignite: Toshaan Bharvani - POWER your DCDevOpsDaysRiga 2017 Ignite: Toshaan Bharvani - POWER your DC
DevOpsDaysRiga 2017 Ignite: Toshaan Bharvani - POWER your DC
 
Openstack CPI cloudfoundry
Openstack CPI cloudfoundryOpenstack CPI cloudfoundry
Openstack CPI cloudfoundry
 
OpenNebula Conf 2014 | Lightning talk: OpenNebula Puppet Module - Norman Mess...
OpenNebula Conf 2014 | Lightning talk: OpenNebula Puppet Module - Norman Mess...OpenNebula Conf 2014 | Lightning talk: OpenNebula Puppet Module - Norman Mess...
OpenNebula Conf 2014 | Lightning talk: OpenNebula Puppet Module - Norman Mess...
 
MySQL Head-to-Head
MySQL Head-to-HeadMySQL Head-to-Head
MySQL Head-to-Head
 
Stig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputerStig Telfer - OpenStack and the Software-Defined SuperComputer
Stig Telfer - OpenStack and the Software-Defined SuperComputer
 
DevOps Days Kyiv 2019 -- Power your PC // Toshaan Bharvani
DevOps Days Kyiv 2019 -- Power your PC // Toshaan BharvaniDevOps Days Kyiv 2019 -- Power your PC // Toshaan Bharvani
DevOps Days Kyiv 2019 -- Power your PC // Toshaan Bharvani
 
Ceph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance ArchiectureCeph Day KL - Ceph Tiering with High Performance Archiecture
Ceph Day KL - Ceph Tiering with High Performance Archiecture
 
Ceph Day KL - Ceph on ARM
Ceph Day KL - Ceph on ARM Ceph Day KL - Ceph on ARM
Ceph Day KL - Ceph on ARM
 
Luci, ricci and the rac bc
Luci, ricci and the rac bcLuci, ricci and the rac bc
Luci, ricci and the rac bc
 
OpenStack Manila 紹介
OpenStack Manila 紹介OpenStack Manila 紹介
OpenStack Manila 紹介
 
OSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
OSMC 2019 | Ignite | Power your Datacenter by Toshaan BharvaniOSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
OSMC 2019 | Ignite | Power your Datacenter by Toshaan Bharvani
 
Hpc to OpenStack: Our journey
Hpc to OpenStack: Our journeyHpc to OpenStack: Our journey
Hpc to OpenStack: Our journey
 
Cncf meetup kubespray
Cncf meetup kubesprayCncf meetup kubespray
Cncf meetup kubespray
 
BlackPearl introduction
BlackPearl introductionBlackPearl introduction
BlackPearl introduction
 
What is OpenStack Trove? Trove Day 2014
What is OpenStack Trove? Trove Day 2014What is OpenStack Trove? Trove Day 2014
What is OpenStack Trove? Trove Day 2014
 
ONIE LinuxCon 2015
ONIE LinuxCon 2015ONIE LinuxCon 2015
ONIE LinuxCon 2015
 
Ata Over Ethernet
Ata Over EthernetAta Over Ethernet
Ata Over Ethernet
 
Cloud foundry on kubernetes
Cloud foundry on kubernetesCloud foundry on kubernetes
Cloud foundry on kubernetes
 
Solr on Docker - the Good, the Bad and the Ugly
Solr on Docker - the Good, the Bad and the UglySolr on Docker - the Good, the Bad and the Ugly
Solr on Docker - the Good, the Bad and the Ugly
 

Similar to Building a Hadoop Cluster with Stacki

Similar to Building a Hadoop Cluster with Stacki (20)

Introduction to Stacki - World's fastest Linux server provisioning Tool
Introduction to Stacki - World's fastest Linux server provisioning ToolIntroduction to Stacki - World's fastest Linux server provisioning Tool
Introduction to Stacki - World's fastest Linux server provisioning Tool
 
StackiFest16: Building a Cluster with Stacki - Greg Bruno
StackiFest16: Building a Cluster with Stacki - Greg BrunoStackiFest16: Building a Cluster with Stacki - Greg Bruno
StackiFest16: Building a Cluster with Stacki - Greg Bruno
 
Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016Introduction to Stacki at Atlanta Meetup February 2016
Introduction to Stacki at Atlanta Meetup February 2016
 
SF Bay Area OpenStack Meetup Stacki Presentation
SF Bay Area OpenStack Meetup Stacki Presentation SF Bay Area OpenStack Meetup Stacki Presentation
SF Bay Area OpenStack Meetup Stacki Presentation
 
Provisioning Servers Made Easy
Provisioning Servers Made EasyProvisioning Servers Made Easy
Provisioning Servers Made Easy
 
Introducing OpenHPC Cross Platform Provisioning Assembly for Warewulf
Introducing OpenHPC Cross Platform Provisioning Assembly for WarewulfIntroducing OpenHPC Cross Platform Provisioning Assembly for Warewulf
Introducing OpenHPC Cross Platform Provisioning Assembly for Warewulf
 
Baylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStackBaylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStack
 
Open stack implementation
Open stack implementation Open stack implementation
Open stack implementation
 
Single node hadoop cluster installation
Single node hadoop cluster installation Single node hadoop cluster installation
Single node hadoop cluster installation
 
SCALE 2011 Deploying OpenStack with Chef
SCALE 2011 Deploying OpenStack with ChefSCALE 2011 Deploying OpenStack with Chef
SCALE 2011 Deploying OpenStack with Chef
 
Linux
LinuxLinux
Linux
 
Openstack in 10 mins
Openstack in 10 minsOpenstack in 10 mins
Openstack in 10 mins
 
Openstack
OpenstackOpenstack
Openstack
 
Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developer
 
NFD9 - Matt Peterson, Data Center Operations
NFD9 - Matt Peterson, Data Center OperationsNFD9 - Matt Peterson, Data Center Operations
NFD9 - Matt Peterson, Data Center Operations
 
Deployment of an HPC Cloud based on Intel hardware
Deployment of an HPC Cloud based on Intel hardwareDeployment of an HPC Cloud based on Intel hardware
Deployment of an HPC Cloud based on Intel hardware
 
JUDCon 2010 Boston : BoxGrinder
JUDCon 2010 Boston : BoxGrinderJUDCon 2010 Boston : BoxGrinder
JUDCon 2010 Boston : BoxGrinder
 
Stacki at the Seattle Scalability Meetup
Stacki at the Seattle Scalability MeetupStacki at the Seattle Scalability Meetup
Stacki at the Seattle Scalability Meetup
 
TXLF: Automated Deployment of OpenStack with Chef
TXLF: Automated Deployment of OpenStack with ChefTXLF: Automated Deployment of OpenStack with Chef
TXLF: Automated Deployment of OpenStack with Chef
 
Deploying datacenters with Puppet - PuppetCamp Europe 2010
Deploying datacenters with Puppet - PuppetCamp Europe 2010Deploying datacenters with Puppet - PuppetCamp Europe 2010
Deploying datacenters with Puppet - PuppetCamp Europe 2010
 

More from StackIQ

More from StackIQ (20)

How Teradata uses Stacki
How Teradata uses StackiHow Teradata uses Stacki
How Teradata uses Stacki
 
StackiFest 2017 Technology Update
StackiFest 2017 Technology UpdateStackiFest 2017 Technology Update
StackiFest 2017 Technology Update
 
StackiFest 2017 Welcome & Opening Address
StackiFest 2017 Welcome & Opening AddressStackiFest 2017 Welcome & Opening Address
StackiFest 2017 Welcome & Opening Address
 
Provisioning with Stacki at NIST
Provisioning with Stacki at NISTProvisioning with Stacki at NIST
Provisioning with Stacki at NIST
 
Public vs. Private Cloud Performance by Flex
Public vs. Private Cloud Performance by FlexPublic vs. Private Cloud Performance by Flex
Public vs. Private Cloud Performance by Flex
 
StackiFest16: How PayPal got a 300 Nodes up in 14 minutes - Greg Bruno
StackiFest16: How PayPal got a 300 Nodes up in 14 minutes - Greg BrunoStackiFest16: How PayPal got a 300 Nodes up in 14 minutes - Greg Bruno
StackiFest16: How PayPal got a 300 Nodes up in 14 minutes - Greg Bruno
 
StackiFest16: Automation for Event-Driven Infrastructure - Dave Boucha
StackiFest16: Automation for Event-Driven Infrastructure - Dave Boucha StackiFest16: Automation for Event-Driven Infrastructure - Dave Boucha
StackiFest16: Automation for Event-Driven Infrastructure - Dave Boucha
 
StackiFest16: What's Next in Stacki - Mason Katz
StackiFest16: What's Next in Stacki - Mason Katz StackiFest16: What's Next in Stacki - Mason Katz
StackiFest16: What's Next in Stacki - Mason Katz
 
StackiFest16: CoreOS/Ubuntu on Stacki
StackiFest16: CoreOS/Ubuntu on Stacki StackiFest16: CoreOS/Ubuntu on Stacki
StackiFest16: CoreOS/Ubuntu on Stacki
 
StackiFest16: Building a Cart
StackiFest16: Building a CartStackiFest16: Building a Cart
StackiFest16: Building a Cart
 
StackiFest16: Stacki 1600+ Server Journey - Dave Peterson, Salesforce
StackiFest16: Stacki 1600+ Server Journey - Dave Peterson, Salesforce StackiFest16: Stacki 1600+ Server Journey - Dave Peterson, Salesforce
StackiFest16: Stacki 1600+ Server Journey - Dave Peterson, Salesforce
 
StackiFest 16: Stacki Overview- Anoop Rajendra
StackiFest 16: Stacki Overview- Anoop Rajendra StackiFest 16: Stacki Overview- Anoop Rajendra
StackiFest 16: Stacki Overview- Anoop Rajendra
 
Salesforce at Stacki Atlanta Meetup February 2016
Salesforce at Stacki Atlanta Meetup February 2016Salesforce at Stacki Atlanta Meetup February 2016
Salesforce at Stacki Atlanta Meetup February 2016
 
Private Cloud vs. Public Cloud
Private Cloud vs. Public CloudPrivate Cloud vs. Public Cloud
Private Cloud vs. Public Cloud
 
Datacenter Word Search
Datacenter Word SearchDatacenter Word Search
Datacenter Word Search
 
Stacki: Remove Commands
Stacki: Remove CommandsStacki: Remove Commands
Stacki: Remove Commands
 
The Big Picture on Hadoop
The Big Picture on HadoopThe Big Picture on Hadoop
The Big Picture on Hadoop
 
Stacki Crossword Puzzle
Stacki Crossword PuzzleStacki Crossword Puzzle
Stacki Crossword Puzzle
 
Open Source Adoption in the Enterprise
Open Source Adoption in the EnterpriseOpen Source Adoption in the Enterprise
Open Source Adoption in the Enterprise
 
Stacki Saves Time
Stacki Saves TimeStacki Saves Time
Stacki Saves Time
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

Building a Hadoop Cluster with Stacki

  • 1. Stacki Lab Joe Kaiser Director of Open Source Engineering
  • 2. Open Source Stack Installer Stacki is a very fast and ultra reliable Linux server provisioning tool … at scale. With zero prerequisites for taking systems from bare metal to a ping and prompt.
  • 3. Stuff it does  Installs to bare metal or VMs that look like bare metal  Kickstart based  Parallel formatting of disk  Parallel Sharing of RPMs  CentOS/RHEL  Networking  Multiple subnets, vlaning, bonding.  Storage  Controller config  Partitioning Whatever you can do in Linux, you can do with Stacki only clustered
  • 4. Stacki and Hortonworks Data Platform
  • 5. Hortonworks – stacki-hdp-bridge pallet  Add/enable/run  Stacki creates ambari appliance  Download software as pallets (isos)  Add/enable HDP and Ambari  Run gethdp script  Or download (links in documentation)  Assign a node to be Ambari deployment.  Do partitions and preps backend nodes for Hadoop.  Install all.  Go to Ambari interface to deploy Hadoop.  Current versions: (updated /export/HDP/hdp.cfg for new versions)  distribution = 2.x  os = centos7  ambari = 2.4.2.0  hdp = 2.5.3.0
  • 6. Augment the “default” Box 1) Replace the “os” pallet with the “CentOS” and “CentOS- Updates” pallets 2) Add the Hortonworks pallets: “HDP”, “HDP-UTILS”, “Updates-ambari” or run /export/HDP/gethdp.py 3) Add a pallet to glue the two layers together: “stacki- hdp-bridge”
  • 7. Specify One Host as the “Ambari” appliance “backend-0-0” will be automatically configured with Ambari Install the node and then point your web browser at it
  • 8. Reinstall All Backend Nodes Wipe all hardware disk array configuration and rebuild all the LUNs: ◦ stack set host attr ambari backend attr=nukecontroller value=true Remove all partitions then repartition and reformat the disks: ◦ stack set host attr ambari backend attr=nukedisks value=true Instruct nodes to install on next PXE boot: ◦ stack set host boot ambari backend action=install
  • 9. Try It  Website www.stacki.com  Vagrant tire kick https://github.com/rfkrocktk/vagrant-stacki Source Code and docs github.com/stackiq/stacki github.com/StackIQ/stacki-hdp-bridge Slack Channel (because everyone) Google Groups groups.google.com/forum/#!forum/stacki

Editor's Notes

  1. Linux – Focused on RedHat-ish (Kickstart/Anaconda) Provisioning – Bare Metal (total stack control) Scale – solve 1000+ servers problem then scale down Ping and Prompt – Get machine up to known base OS fully configuration raid / disk / networking / ssh access on Nothing else … No agent left on the server