SlideShare a Scribd company logo
1 of 30
Download to read offline
Deploying Hadoop-Based Bigdata
                  Environments
     Click to edit Master subtitle style
 “[Tall] Tales From The Frontier”

Roman Shaposhnik
rvs@apache.org, Cloudera Inc.
$ whoami

   An open source software developer
       Linux kernel, C/C++ compilers, FFmpeg, Plan9
   A Hadoop and all around UNIX guy
   root@cloudera
       Member of the “Kitchen” team
   Apache Software Foundation Incubator PMC
       [Bigtop], Hadoop Development Tools, Celix, Helix
   VP of Apache Bigtop
                                                    2
ZooKeeper (coordination)

       HUE (web based UI)


Pig (DQL) Hive (SQL) Impala (SQL)

 HBase      YARN/MR1         Oozie

         HDFS (filesystem)


                                     3
ZooKeeper (coordination)

       HUE (web based UI)


Pig (DQL) Hive (SQL) Impala (SQL)

 HBase      YARN/MR1         Oozie

         HDFS (filesystem)


                                     4
It is a jungle out there
   Zookeeper         Sqoop       JDK/JRE
   Hadoop            Oozie       Kerberos
         HDFS        Whirr       Ganglia
         YARN        Mahout      Nagios
         MR1         Flume       JSVC
         HTTPFS      Giraph      Tomcat
   HBase             Hama        Utils
   Pig               Hue         Postgress
   Hive              Solr        HTTPD
   Impala            Crunch
                                                5
And the answer is:

         Puppet[forge]


                     6
One way of using Apache software

  $ wget http://apache.org/httpd.tar.gz
  $ tar xzvf httpd.tar.gz
  $ cd httpd
  $ ./configure ; make
  $ make install
  ERROR: can't write to /usr/local/bin
  $ sudo make install
                                          7
A different way

  $ sudo apt-get install httpd
  Would you like to also upgrade your conf?




                                              8
Is there apt-get install hadoop ?

   Hadoop is still in a very active development
   Hadoop is Java based
   Hadoop is a distributed application
   Hadoop is way more than HDFS + MR




                                              9
Project-by-project approach

   “Passively” maintained code
       Packaging, OS-level (init.d)
   Developer-centric view
       Edit-compile-debug cycle vs. deployment
       Lack of integration testing
   Differences in distributions/packaging:
       Where is this valid: /usr/libexec ?
   Combinatoric explosion of dependencies
                                                  10
Dependencies Inferno:

                            Hive 0.8.1


          HBase
       Hbase (0.92, 0.90)
                                               HBase
                                            HBase
                                  Hadoop (1.0, 0.22, 0.23)



             A million dollar question:
$ tar xzvf hive-0.8.1.tar.gz
$ ls hive-0.8.1/lib
                                                             11
Dependencies Inferno:

                            Hive 0.8.1


          HBase
       Hbase (0.92, 0.90)
                                               HBase
                                            HBase
                                  Hadoop (1.0, 0.22, 0.23)



             A million dollar question:
$ tar xzvf hive-0.8.1.tar.gz
$ ls hive-0.8.1/lib
hbase-0.89.jar log4j-1.2.15.jar log4j-1.2.16.jar
                                             12
Remember what Debian did to Linux?


 GNU Software             Linux kernel
                         Linux kernel




                                         13
Bigtop is trying to do it with Hadoop

Hadoop Ecosystem              Hadoop
                             Linux kernel
(Pig, Hive, Mahout)        (HDFS + MR)




CDH4 beta 1
                                            14
What's there in Bigtop

       Build/Packaging infrastructure
           RPM, DEB, (tarballs, homebrew/MacPorts)
           VirtualBox, VMWare and KVM VMs
           Fedora, OpenSUSE, Mageia, CentOS, Ubuntu
       Puppet deployment infrastrucutre
       Integration test infrastrucutre (iTest)
       Bigtop Jenkins:
           http://bigtop01.cloudera.org:8080
                                                      15
And the answer is:

      Puppet[Bigtop]


                     16
System software deployment

   Packages vs. Puppet code
       package/file/service
   What is packaging?
       dependency tracking
       build encapsulation
       java packaging
       file layout
       user creation
       service registration   17
Does it really work?

   Java packaging
       maven/ivy integration
   file layout
       side-by-side installations of the same package
   user creation
       LDAP/AD provisioning
   service registration
       start on install vs. start on reboot
                                                     18
Petascale distributed systems

       Scale
           Yahoo! ~5000 nodes
       Deployment orchestration
           Kerberos::Host_keytab <| title == "hdfs" |> ->
              Service["hadoop-hdfs-datanode"]
       Highly coordinated distributed system
           It ain't HTTPD/loadbalancer
           Rolling upgrades/asynchronous rollbacks
                                                             19
Back to tarballs and shell?

       What's better for Puppet: fpm or rpm?
       What is the role of Puppet?
           coordinating the entire system: lack of DSL
           converging an isolated node: will it ever work?
           a building block for an agent-based system
       One agent to rule them all?
           there's no spoon^H^H^H^H^H^ agent: Whirr
           MCollective
                                                          20
           Cloudera Manager, Ambari
Evolution, not perfection!
   Minimalistic, highly consistent packages
       /usr/lib/hadoop, /etc/hadoop/conf (alternative)
       fail gracefully: .... || : )
       Java packaging is not solved [yet]: symlinks
   Minimalistic Puppet code
       package/file/service
       masterless (most of the time)
       integration with Whirr
   BoxGrinder                                            21
The road ahead
   New kind of configuration management
       /etc/hadoop vs Zookeeper
   New kinds of system packaging
       Parcels (tarballs + metadata)
       HPS (Hadoop Packaging System)
   Orchestration: to puppet or not to puppet?
       Cloudera Manager
       Apache Ambari (incubating)
       Reactor 8: http://reactor8.com       22
Java Packaging
   Fate of Java
       OpenJDK
   OSGi
       Hadoop's view: MAPREDUCE-1700
        https://issues.apache.org/jira/browse/MAPREDUCE-1700
   Project Jigsaw
       Language tie-ins? Really?
   Linux vendors getting their act together
                                                               23
Integration testing
   Clean room provisioning
       Those ain't unit tests – they trash the system
   Cluster topology and cluster state discovery
       How can puppet help us?
   Cluster state manipulation
       Test-driven orchestration
       Chaos Monkey
   How to be successful in OS co-opetition
       Make everything pluggable (and subvert ;-))      24
Anatomy of iTest

   Versioned, JVM-based test/data artifacts
   Dependency between test artifacts
   Matching stack of integration tests
   Implementation
       Maven artifacts, pom files
       JUnit test-execution entry point
       Groovy for scripting

                                               25
Who's the target audience

       End users
           YOU!
       ASF Projects/Bigdata developers
           from Avro to Zookeeper
       Bigdata solutions vendors
           Cloudera, EMC, Hortonworks, Karmasphere
       DevOPs
           Ebay, Yahoo, Facebook, LinkedIn
                                                      26
Who's on-board?
   Cloudera
       CDH4 is 100% based on Bigtop (hadoop v2)
       Available @cloudera.com
   Canonical
       Ubuntu Server: Hadoop and Bigdata blueprint
        https://blueprints.launchpad.net/ubuntu/+spec/servercloud-p-hdp-hadoop

   TrendMicro
   Hortonworks (partially)
   EMC, EBay (early stages of prototyping)                                27
What's happening?
   A special release: Bigtop 0.3.0-incubating
       Hadoop 1.0.1
   Last stable release: Bigtop 0.5.0
       Hadoop 2.0.2-alpha
   Next stable release: Bigtop 0.6.0
       End of Mar 2013 release
       Hadoop 2.0.3-beta
       Major focus on developers
                                                 28
What Bigtop needs from you?

       More of you!
           Meetup: “Silicon Valley Hands-on Programming”
            http://www.meetup.com/HandsOnProgrammingEvents/
       More infrastructure for build/test
           EC2, Supercell, EMC magic cluster, CloudStack
       More integration tests
           Convince your bosses to commit to Bigtop
       Validate upstream release using Bigtop
                                                              29
Contact
§
    Bigtop home @Apache:
    •
        http://incubator.apache.org/bigtop/
§
    Hangout places:
    •
        {dev,user}@bigtop.apache.org
    •
        #bigtop on Freenode
§
    Roman Shaposhnik
    •
        rvs@apache.org, rvs@cloudera.com



                                 30

More Related Content

What's hot

Big Data Step-by-Step: Infrastructure 1/3: Local VM
Big Data Step-by-Step: Infrastructure 1/3: Local VMBig Data Step-by-Step: Infrastructure 1/3: Local VM
Big Data Step-by-Step: Infrastructure 1/3: Local VMJeffrey Breen
 
GlusterFS Update and OpenStack Integration
GlusterFS Update and OpenStack IntegrationGlusterFS Update and OpenStack Integration
GlusterFS Update and OpenStack IntegrationEtsuji Nakai
 
2 Linux Container and Docker
2 Linux Container and Docker2 Linux Container and Docker
2 Linux Container and DockerFabio Fumarola
 
Dealing with kubesprawl tetris style !
Dealing with kubesprawl   tetris style !Dealing with kubesprawl   tetris style !
Dealing with kubesprawl tetris style !Taco Scargo
 
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...Patrick Chanezon
 
Cloud Foundry Introduction - Canada - October 2012
Cloud Foundry Introduction - Canada - October 2012Cloud Foundry Introduction - Canada - October 2012
Cloud Foundry Introduction - Canada - October 2012Patrick Chanezon
 
開放運算&GPU技術研究班
開放運算&GPU技術研究班開放運算&GPU技術研究班
開放運算&GPU技術研究班Paul Chao
 
R hive tutorial supplement 2 - Installing Hive
R hive tutorial supplement 2 - Installing HiveR hive tutorial supplement 2 - Installing Hive
R hive tutorial supplement 2 - Installing HiveAiden Seonghak Hong
 
Docker - container and lightweight virtualization
Docker - container and lightweight virtualization Docker - container and lightweight virtualization
Docker - container and lightweight virtualization Sim Janghoon
 
Docker Online Meetup #3: Docker in Production
Docker Online Meetup #3: Docker in ProductionDocker Online Meetup #3: Docker in Production
Docker Online Meetup #3: Docker in ProductionDocker, Inc.
 
Drupalcamp es 2013 drupal with lxc docker and vagrant
Drupalcamp es 2013  drupal with lxc docker and vagrant Drupalcamp es 2013  drupal with lxc docker and vagrant
Drupalcamp es 2013 drupal with lxc docker and vagrant Ricardo Amaro
 
R hive tutorial supplement 1 - Installing Hadoop
R hive tutorial supplement 1 - Installing HadoopR hive tutorial supplement 1 - Installing Hadoop
R hive tutorial supplement 1 - Installing HadoopAiden Seonghak Hong
 
Linux containers and docker
Linux containers and dockerLinux containers and docker
Linux containers and dockerFabio Fumarola
 
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseApache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseShiva Rama Krishna Dasharathi
 
Configuring Your First Hadoop Cluster On EC2
Configuring Your First Hadoop Cluster On EC2Configuring Your First Hadoop Cluster On EC2
Configuring Your First Hadoop Cluster On EC2benjaminwootton
 

What's hot (18)

Big Data Step-by-Step: Infrastructure 1/3: Local VM
Big Data Step-by-Step: Infrastructure 1/3: Local VMBig Data Step-by-Step: Infrastructure 1/3: Local VM
Big Data Step-by-Step: Infrastructure 1/3: Local VM
 
GlusterFS Update and OpenStack Integration
GlusterFS Update and OpenStack IntegrationGlusterFS Update and OpenStack Integration
GlusterFS Update and OpenStack Integration
 
2 Linux Container and Docker
2 Linux Container and Docker2 Linux Container and Docker
2 Linux Container and Docker
 
Dealing with kubesprawl tetris style !
Dealing with kubesprawl   tetris style !Dealing with kubesprawl   tetris style !
Dealing with kubesprawl tetris style !
 
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
Docker San Francisco Meetup April 2015 - The Docker Orchestration Ecosystem o...
 
Cloud Foundry Introduction - Canada - October 2012
Cloud Foundry Introduction - Canada - October 2012Cloud Foundry Introduction - Canada - October 2012
Cloud Foundry Introduction - Canada - October 2012
 
開放運算&GPU技術研究班
開放運算&GPU技術研究班開放運算&GPU技術研究班
開放運算&GPU技術研究班
 
R hive tutorial supplement 2 - Installing Hive
R hive tutorial supplement 2 - Installing HiveR hive tutorial supplement 2 - Installing Hive
R hive tutorial supplement 2 - Installing Hive
 
Docker - container and lightweight virtualization
Docker - container and lightweight virtualization Docker - container and lightweight virtualization
Docker - container and lightweight virtualization
 
Dbdeployer
DbdeployerDbdeployer
Dbdeployer
 
Docker Online Meetup #3: Docker in Production
Docker Online Meetup #3: Docker in ProductionDocker Online Meetup #3: Docker in Production
Docker Online Meetup #3: Docker in Production
 
Drupalcamp es 2013 drupal with lxc docker and vagrant
Drupalcamp es 2013  drupal with lxc docker and vagrant Drupalcamp es 2013  drupal with lxc docker and vagrant
Drupalcamp es 2013 drupal with lxc docker and vagrant
 
R hive tutorial supplement 1 - Installing Hadoop
R hive tutorial supplement 1 - Installing HadoopR hive tutorial supplement 1 - Installing Hadoop
R hive tutorial supplement 1 - Installing Hadoop
 
Linux containers and docker
Linux containers and dockerLinux containers and docker
Linux containers and docker
 
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseApache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exercise
 
Configuring Your First Hadoop Cluster On EC2
Configuring Your First Hadoop Cluster On EC2Configuring Your First Hadoop Cluster On EC2
Configuring Your First Hadoop Cluster On EC2
 
Lab docker
Lab dockerLab docker
Lab docker
 
CvmFS Workshop
CvmFS WorkshopCvmFS Workshop
CvmFS Workshop
 

Similar to Deploying Hadoop-based Bigdata Environments

App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)outstanding59
 
Inside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldInside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldRichard McDougall
 
App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)outstanding59
 
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache ApexMaking sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache ApexApache Apex
 
Hw09 Clouderas Distribution For Hadoop
Hw09   Clouderas Distribution For HadoopHw09   Clouderas Distribution For Hadoop
Hw09 Clouderas Distribution For HadoopCloudera, Inc.
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesNicola Ferraro
 
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosHadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosLester Martin
 
Hadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsHadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsLynn Langit
 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopDataWorks Summit
 
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop1012014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop101Adam Muise
 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-trainingGeohedrick
 
Hadoop demo ppt
Hadoop demo pptHadoop demo ppt
Hadoop demo pptPhil Young
 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereGanesh Raju
 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online trainingsrikanthhadoop
 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and dockerFabio Fumarola
 
A DevOps guide to Kubernetes
A DevOps guide to KubernetesA DevOps guide to Kubernetes
A DevOps guide to KubernetesPaul Czarkowski
 

Similar to Deploying Hadoop-based Bigdata Environments (20)

App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)
 
Inside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldInside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworld
 
App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)
 
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache ApexMaking sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
 
Hw09 Clouderas Distribution For Hadoop
Hw09   Clouderas Distribution For HadoopHw09   Clouderas Distribution For Hadoop
Hw09 Clouderas Distribution For Hadoop
 
Hadoop description
Hadoop descriptionHadoop description
Hadoop description
 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with Kubernetes
 
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosHadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
 
Unit 5
Unit  5Unit  5
Unit 5
 
Hadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsHadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing Hadoop
 
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop1012014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-training
 
Hadoop demo ppt
Hadoop demo pptHadoop demo ppt
Hadoop demo ppt
 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online training
 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and docker
 
Docker-v3.pdf
Docker-v3.pdfDocker-v3.pdf
Docker-v3.pdf
 
Lecture 2 part 2
Lecture 2 part 2Lecture 2 part 2
Lecture 2 part 2
 
A DevOps guide to Kubernetes
A DevOps guide to KubernetesA DevOps guide to Kubernetes
A DevOps guide to Kubernetes
 

More from buildacloud

The Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep VittalThe Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep Vittalbuildacloud
 
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh BoddapatiPolicy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapatibuildacloud
 
L4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef LaribiL4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef Laribibuildacloud
 
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David NalleyJenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David Nalleybuildacloud
 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew KirchIntro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirchbuildacloud
 
Guaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike TutkowskiGuaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike Tutkowskibuildacloud
 
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex HenevaldCloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex Henevaldbuildacloud
 
Introduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David NalleyIntroduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David Nalleybuildacloud
 
Managing infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike CohenManaging infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike Cohenbuildacloud
 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew KirchIntro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirchbuildacloud
 
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike TurnlundMonitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlundbuildacloud
 
Rest api design by george reese
Rest api design by george reeseRest api design by george reese
Rest api design by george reesebuildacloud
 
Enterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevensEnterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevensbuildacloud
 
State of the cloud by reuven cohen
State of the cloud by reuven cohenState of the cloud by reuven cohen
State of the cloud by reuven cohenbuildacloud
 
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell PavlicekSecuring Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicekbuildacloud
 
DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack buildacloud
 
Cloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper ContrailCloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper Contrailbuildacloud
 
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...buildacloud
 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski buildacloud
 
CloudStack University by Sebastien Goasguen
CloudStack University by Sebastien GoasguenCloudStack University by Sebastien Goasguen
CloudStack University by Sebastien Goasguenbuildacloud
 

More from buildacloud (20)

The Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep VittalThe Future of SDN in CloudStack by Chiradeep Vittal
The Future of SDN in CloudStack by Chiradeep Vittal
 
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh BoddapatiPolicy Based SDN Solution for DC and Branch Office by Suresh Boddapati
Policy Based SDN Solution for DC and Branch Office by Suresh Boddapati
 
L4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef LaribiL4-L7 services for SDN and NVF by Youcef Laribi
L4-L7 services for SDN and NVF by Youcef Laribi
 
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David NalleyJenkins, jclouds, CloudStack, and CentOS by David Nalley
Jenkins, jclouds, CloudStack, and CentOS by David Nalley
 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew KirchIntro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
 
Guaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike TutkowskiGuaranteeing Storage Performance by Mike Tutkowski
Guaranteeing Storage Performance by Mike Tutkowski
 
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex HenevaldCloud Application Blueprints with Apache Brooklyn by Alex Henevald
Cloud Application Blueprints with Apache Brooklyn by Alex Henevald
 
Introduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David NalleyIntroduction to Apache CloudStack by David Nalley
Introduction to Apache CloudStack by David Nalley
 
Managing infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike CohenManaging infrastructure with Application Policy by Mike Cohen
Managing infrastructure with Application Policy by Mike Cohen
 
Intro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew KirchIntro to Zenoss by Andrew Kirch
Intro to Zenoss by Andrew Kirch
 
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike TurnlundMonitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
Monitoring CloudStack in context with Converged Infrastructure by Mike Turnlund
 
Rest api design by george reese
Rest api design by george reeseRest api design by george reese
Rest api design by george reese
 
Enterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevensEnterprise grade firewall and ssl termination to ac by will stevens
Enterprise grade firewall and ssl termination to ac by will stevens
 
State of the cloud by reuven cohen
State of the cloud by reuven cohenState of the cloud by reuven cohen
State of the cloud by reuven cohen
 
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell PavlicekSecuring Your Cloud With the Xen Hypervisor by Russell Pavlicek
Securing Your Cloud With the Xen Hypervisor by Russell Pavlicek
 
DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack DevCloud - Setup and Demo on Apache CloudStack
DevCloud - Setup and Demo on Apache CloudStack
 
Cloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper ContrailCloud Network Virtualization with Juniper Contrail
Cloud Network Virtualization with Juniper Contrail
 
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
Ian rae panel cloud stack & cloud storage where are we at, and where do we ne...
 
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
Troubleshooting Strategies for CloudStack Installations by Kirk Kosinski
 
CloudStack University by Sebastien Goasguen
CloudStack University by Sebastien GoasguenCloudStack University by Sebastien Goasguen
CloudStack University by Sebastien Goasguen
 

Recently uploaded

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 

Recently uploaded (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 

Deploying Hadoop-based Bigdata Environments

  • 1. Deploying Hadoop-Based Bigdata Environments Click to edit Master subtitle style “[Tall] Tales From The Frontier” Roman Shaposhnik rvs@apache.org, Cloudera Inc.
  • 2. $ whoami  An open source software developer  Linux kernel, C/C++ compilers, FFmpeg, Plan9  A Hadoop and all around UNIX guy  root@cloudera  Member of the “Kitchen” team  Apache Software Foundation Incubator PMC  [Bigtop], Hadoop Development Tools, Celix, Helix  VP of Apache Bigtop 2
  • 3. ZooKeeper (coordination) HUE (web based UI) Pig (DQL) Hive (SQL) Impala (SQL) HBase YARN/MR1 Oozie HDFS (filesystem) 3
  • 4. ZooKeeper (coordination) HUE (web based UI) Pig (DQL) Hive (SQL) Impala (SQL) HBase YARN/MR1 Oozie HDFS (filesystem) 4
  • 5. It is a jungle out there  Zookeeper  Sqoop  JDK/JRE  Hadoop  Oozie  Kerberos  HDFS  Whirr  Ganglia  YARN  Mahout  Nagios  MR1  Flume  JSVC  HTTPFS  Giraph  Tomcat  HBase  Hama  Utils  Pig  Hue  Postgress  Hive  Solr  HTTPD  Impala  Crunch 5
  • 6. And the answer is: Puppet[forge] 6
  • 7. One way of using Apache software $ wget http://apache.org/httpd.tar.gz $ tar xzvf httpd.tar.gz $ cd httpd $ ./configure ; make $ make install ERROR: can't write to /usr/local/bin $ sudo make install 7
  • 8. A different way $ sudo apt-get install httpd Would you like to also upgrade your conf? 8
  • 9. Is there apt-get install hadoop ?  Hadoop is still in a very active development  Hadoop is Java based  Hadoop is a distributed application  Hadoop is way more than HDFS + MR 9
  • 10. Project-by-project approach  “Passively” maintained code  Packaging, OS-level (init.d)  Developer-centric view  Edit-compile-debug cycle vs. deployment  Lack of integration testing  Differences in distributions/packaging:  Where is this valid: /usr/libexec ?  Combinatoric explosion of dependencies 10
  • 11. Dependencies Inferno: Hive 0.8.1 HBase Hbase (0.92, 0.90) HBase HBase Hadoop (1.0, 0.22, 0.23) A million dollar question: $ tar xzvf hive-0.8.1.tar.gz $ ls hive-0.8.1/lib 11
  • 12. Dependencies Inferno: Hive 0.8.1 HBase Hbase (0.92, 0.90) HBase HBase Hadoop (1.0, 0.22, 0.23) A million dollar question: $ tar xzvf hive-0.8.1.tar.gz $ ls hive-0.8.1/lib hbase-0.89.jar log4j-1.2.15.jar log4j-1.2.16.jar 12
  • 13. Remember what Debian did to Linux? GNU Software Linux kernel Linux kernel 13
  • 14. Bigtop is trying to do it with Hadoop Hadoop Ecosystem Hadoop Linux kernel (Pig, Hive, Mahout) (HDFS + MR) CDH4 beta 1 14
  • 15. What's there in Bigtop  Build/Packaging infrastructure  RPM, DEB, (tarballs, homebrew/MacPorts)  VirtualBox, VMWare and KVM VMs  Fedora, OpenSUSE, Mageia, CentOS, Ubuntu  Puppet deployment infrastrucutre  Integration test infrastrucutre (iTest)  Bigtop Jenkins:  http://bigtop01.cloudera.org:8080 15
  • 16. And the answer is: Puppet[Bigtop] 16
  • 17. System software deployment  Packages vs. Puppet code  package/file/service  What is packaging?  dependency tracking  build encapsulation  java packaging  file layout  user creation  service registration 17
  • 18. Does it really work?  Java packaging  maven/ivy integration  file layout  side-by-side installations of the same package  user creation  LDAP/AD provisioning  service registration  start on install vs. start on reboot 18
  • 19. Petascale distributed systems  Scale  Yahoo! ~5000 nodes  Deployment orchestration  Kerberos::Host_keytab <| title == "hdfs" |> -> Service["hadoop-hdfs-datanode"]  Highly coordinated distributed system  It ain't HTTPD/loadbalancer  Rolling upgrades/asynchronous rollbacks 19
  • 20. Back to tarballs and shell?  What's better for Puppet: fpm or rpm?  What is the role of Puppet?  coordinating the entire system: lack of DSL  converging an isolated node: will it ever work?  a building block for an agent-based system  One agent to rule them all?  there's no spoon^H^H^H^H^H^ agent: Whirr  MCollective 20  Cloudera Manager, Ambari
  • 21. Evolution, not perfection!  Minimalistic, highly consistent packages  /usr/lib/hadoop, /etc/hadoop/conf (alternative)  fail gracefully: .... || : )  Java packaging is not solved [yet]: symlinks  Minimalistic Puppet code  package/file/service  masterless (most of the time)  integration with Whirr  BoxGrinder 21
  • 22. The road ahead  New kind of configuration management  /etc/hadoop vs Zookeeper  New kinds of system packaging  Parcels (tarballs + metadata)  HPS (Hadoop Packaging System)  Orchestration: to puppet or not to puppet?  Cloudera Manager  Apache Ambari (incubating)  Reactor 8: http://reactor8.com 22
  • 23. Java Packaging  Fate of Java  OpenJDK  OSGi  Hadoop's view: MAPREDUCE-1700 https://issues.apache.org/jira/browse/MAPREDUCE-1700  Project Jigsaw  Language tie-ins? Really?  Linux vendors getting their act together 23
  • 24. Integration testing  Clean room provisioning  Those ain't unit tests – they trash the system  Cluster topology and cluster state discovery  How can puppet help us?  Cluster state manipulation  Test-driven orchestration  Chaos Monkey  How to be successful in OS co-opetition  Make everything pluggable (and subvert ;-)) 24
  • 25. Anatomy of iTest  Versioned, JVM-based test/data artifacts  Dependency between test artifacts  Matching stack of integration tests  Implementation  Maven artifacts, pom files  JUnit test-execution entry point  Groovy for scripting 25
  • 26. Who's the target audience  End users  YOU!  ASF Projects/Bigdata developers  from Avro to Zookeeper  Bigdata solutions vendors  Cloudera, EMC, Hortonworks, Karmasphere  DevOPs  Ebay, Yahoo, Facebook, LinkedIn 26
  • 27. Who's on-board?  Cloudera  CDH4 is 100% based on Bigtop (hadoop v2)  Available @cloudera.com  Canonical  Ubuntu Server: Hadoop and Bigdata blueprint https://blueprints.launchpad.net/ubuntu/+spec/servercloud-p-hdp-hadoop  TrendMicro  Hortonworks (partially)  EMC, EBay (early stages of prototyping) 27
  • 28. What's happening?  A special release: Bigtop 0.3.0-incubating  Hadoop 1.0.1  Last stable release: Bigtop 0.5.0  Hadoop 2.0.2-alpha  Next stable release: Bigtop 0.6.0  End of Mar 2013 release  Hadoop 2.0.3-beta  Major focus on developers 28
  • 29. What Bigtop needs from you?  More of you!  Meetup: “Silicon Valley Hands-on Programming” http://www.meetup.com/HandsOnProgrammingEvents/  More infrastructure for build/test  EC2, Supercell, EMC magic cluster, CloudStack  More integration tests  Convince your bosses to commit to Bigtop  Validate upstream release using Bigtop 29
  • 30. Contact § Bigtop home @Apache: • http://incubator.apache.org/bigtop/ § Hangout places: • {dev,user}@bigtop.apache.org • #bigtop on Freenode § Roman Shaposhnik • rvs@apache.org, rvs@cloudera.com 30