SlideShare a Scribd company logo
1 of 30
Download to read offline
Deploying Hadoop-Based Bigdata
                  Environments
     Click to edit Master subtitle style
 β€œ[Tall] Tales From The Frontier”

Roman Shaposhnik
rvs@apache.org, Cloudera Inc.
$ whoami

ξ€Š   An open source software developer
    ξ€Š   Linux kernel, C/C++ compilers, FFmpeg, Plan9
ξ€Š   A Hadoop and all around UNIX guy
ξ€Š   root@cloudera
    ξ€Š   Member of the β€œKitchen” team
ξ€Š   Apache Software Foundation Incubator PMC
    ξ€Š   [Bigtop], Hadoop Development Tools, Celix, Helix
ξ€Š   VP of Apache Bigtop
                                                    2
ZooKeeper (coordination)

       HUE (web based UI)


Pig (DQL) Hive (SQL) Impala (SQL)

 HBase      YARN/MR1         Oozie

         HDFS (filesystem)


                                     3
ZooKeeper (coordination)

       HUE (web based UI)


Pig (DQL) Hive (SQL) Impala (SQL)

 HBase      YARN/MR1         Oozie

         HDFS (filesystem)


                                     4
It is a jungle out there
ξ€Š   Zookeeper      ξ€Š   Sqoop    ξ€Š   JDK/JRE
ξ€Š   Hadoop         ξ€Š   Oozie    ξ€Š   Kerberos
     ξ€Š    HDFS     ξ€Š   Whirr    ξ€Š   Ganglia
     ξ€Š    YARN     ξ€Š   Mahout   ξ€Š   Nagios
     ξ€Š    MR1      ξ€Š   Flume    ξ€Š   JSVC
     ξ€Š    HTTPFS   ξ€Š   Giraph   ξ€Š   Tomcat
ξ€Š   HBase          ξ€Š   Hama     ξ€Š   Utils
ξ€Š   Pig            ξ€Š   Hue      ξ€Š   Postgress
ξ€Š   Hive           ξ€Š   Solr     ξ€Š   HTTPD
ξ€Š   Impala         ξ€Š   Crunch
                                                5
And the answer is:

         Puppet[forge]


                     6
One way of using Apache software

  $ wget http://apache.org/httpd.tar.gz
  $ tar xzvf httpd.tar.gz
  $ cd httpd
  $ ./configure ; make
  $ make install
  ERROR: can't write to /usr/local/bin
  $ sudo make install
                                          7
A different way

  $ sudo apt-get install httpd
  Would you like to also upgrade your conf?




                                              8
Is there apt-get install hadoop ?

ξ€Š   Hadoop is still in a very active development
ξ€Š   Hadoop is Java based
ξ€Š   Hadoop is a distributed application
ξ€Š   Hadoop is way more than HDFS + MR




                                              9
Project-by-project approach

ξ€Š   β€œPassively” maintained code
    ξ€Š   Packaging, OS-level (init.d)
ξ€Š   Developer-centric view
    ξ€Š   Edit-compile-debug cycle vs. deployment
    ξ€Š   Lack of integration testing
ξ€Š   Differences in distributions/packaging:
    ξ€Š   Where is this valid: /usr/libexec ?
ξ€Š   Combinatoric explosion of dependencies
                                                  10
Dependencies Inferno:

                            Hive 0.8.1


          HBase
       Hbase (0.92, 0.90)
                                               HBase
                                            HBase
                                  Hadoop (1.0, 0.22, 0.23)



             A million dollar question:
$ tar xzvf hive-0.8.1.tar.gz
$ ls hive-0.8.1/lib
                                                             11
Dependencies Inferno:

                            Hive 0.8.1


          HBase
       Hbase (0.92, 0.90)
                                               HBase
                                            HBase
                                  Hadoop (1.0, 0.22, 0.23)



             A million dollar question:
$ tar xzvf hive-0.8.1.tar.gz
$ ls hive-0.8.1/lib
hbase-0.89.jar log4j-1.2.15.jar log4j-1.2.16.jar
                                             12
Remember what Debian did to Linux?


 GNU Software             Linux kernel
                         Linux kernel




                                         13
Bigtop is trying to do it with Hadoop

Hadoop Ecosystem              Hadoop
                             Linux kernel
(Pig, Hive, Mahout)        (HDFS + MR)




CDH4 beta 1
                                            14
What's there in Bigtop

    ξ€Š   Build/Packaging infrastructure
        ξ€Š   RPM, DEB, (tarballs, homebrew/MacPorts)
        ξ€Š   VirtualBox, VMWare and KVM VMs
        ξ€Š   Fedora, OpenSUSE, Mageia, CentOS, Ubuntu
ξ€Š       Puppet deployment infrastrucutre
ξ€Š       Integration test infrastrucutre (iTest)
ξ€Š       Bigtop Jenkins:
        ξ€Š   http://bigtop01.cloudera.org:8080
                                                      15
And the answer is:

      Puppet[Bigtop]


                     16
System software deployment

ξ€Š   Packages vs. Puppet code
    ξ€Š   package/file/service
ξ€Š   What is packaging?
    ξ€Š   dependency tracking
    ξ€Š   build encapsulation
    ξ€Š   java packaging
    ξ€Š   file layout
    ξ€Š   user creation
    ξ€Š   service registration   17
Does it really work?

ξ€Š   Java packaging
    ξ€Š   maven/ivy integration
ξ€Š   file layout
    ξ€Š   side-by-side installations of the same package
ξ€Š   user creation
    ξ€Š   LDAP/AD provisioning
ξ€Š   service registration
    ξ€Š   start on install vs. start on reboot
                                                     18
Petascale distributed systems

    ξ€Š   Scale
        ξ€Š   Yahoo! ~5000 nodes
    ξ€Š   Deployment orchestration
        ξ€Š   Kerberos::Host_keytab <| title == "hdfs" |> ->
              Service["hadoop-hdfs-datanode"]
ξ€Š       Highly coordinated distributed system
        ξ€Š   It ain't HTTPD/loadbalancer
        ξ€Š   Rolling upgrades/asynchronous rollbacks
                                                             19
Back to tarballs and shell?

    ξ€Š   What's better for Puppet: fpm or rpm?
    ξ€Š   What is the role of Puppet?
        ξ€Š   coordinating the entire system: lack of DSL
        ξ€Š   converging an isolated node: will it ever work?
        ξ€Š   a building block for an agent-based system
ξ€Š       One agent to rule them all?
        ξ€Š   there's no spoon^H^H^H^H^H^ agent: Whirr
        ξ€Š   MCollective
                                                          20
        ξ€Š   Cloudera Manager, Ambari
Evolution, not perfection!
ξ€Š   Minimalistic, highly consistent packages
    ξ€Š   /usr/lib/hadoop, /etc/hadoop/conf (alternative)
    ξ€Š   fail gracefully: .... || : )
    ξ€Š   Java packaging is not solved [yet]: symlinks
ξ€Š   Minimalistic Puppet code
    ξ€Š   package/file/service
    ξ€Š   masterless (most of the time)
    ξ€Š   integration with Whirr
ξ€Š   BoxGrinder                                            21
The road ahead
ξ€Š   New kind of configuration management
    ξ€Š   /etc/hadoop vs Zookeeper
ξ€Š   New kinds of system packaging
    ξ€Š   Parcels (tarballs + metadata)
    ξ€Š   HPS (Hadoop Packaging System)
ξ€Š   Orchestration: to puppet or not to puppet?
    ξ€Š   Cloudera Manager
    ξ€Š   Apache Ambari (incubating)
    ξ€Š   Reactor 8: http://reactor8.com       22
Java Packaging
ξ€Š   Fate of Java
    ξ€Š   OpenJDK
ξ€Š   OSGi
    ξ€Š   Hadoop's view: MAPREDUCE-1700
        https://issues.apache.org/jira/browse/MAPREDUCE-1700
ξ€Š   Project Jigsaw
    ξ€Š   Language tie-ins? Really?
ξ€Š   Linux vendors getting their act together
                                                               23
Integration testing
ξ€Š   Clean room provisioning
    ξ€Š   Those ain't unit tests – they trash the system
ξ€Š   Cluster topology and cluster state discovery
    ξ€Š   How can puppet help us?
ξ€Š   Cluster state manipulation
    ξ€Š   Test-driven orchestration
    ξ€Š   Chaos Monkey
ξ€Š   How to be successful in OS co-opetition
    ξ€Š   Make everything pluggable (and subvert ;-))      24
Anatomy of iTest

ξ€Š   Versioned, JVM-based test/data artifacts
ξ€Š   Dependency between test artifacts
ξ€Š   Matching stack of integration tests
ξ€Š   Implementation
    ξ€Š   Maven artifacts, pom files
    ξ€Š   JUnit test-execution entry point
    ξ€Š   Groovy for scripting

                                               25
Who's the target audience

    ξ€Š   End users
        ξ€Š   YOU!
    ξ€Š   ASF Projects/Bigdata developers
        ξ€Š   from Avro to Zookeeper
    ξ€Š   Bigdata solutions vendors
        ξ€Š   Cloudera, EMC, Hortonworks, Karmasphere
ξ€Š       DevOPs
        ξ€Š   Ebay, Yahoo, Facebook, LinkedIn
                                                      26
Who's on-board?
ξ€Š   Cloudera
    ξ€Š   CDH4 is 100% based on Bigtop (hadoop v2)
    ξ€Š   Available @cloudera.com
ξ€Š   Canonical
    ξ€Š   Ubuntu Server: Hadoop and Bigdata blueprint
        https://blueprints.launchpad.net/ubuntu/+spec/servercloud-p-hdp-hadoop

ξ€Š   TrendMicro
ξ€Š   Hortonworks (partially)
ξ€Š   EMC, EBay (early stages of prototyping)                                27
What's happening?
ξ€Š   A special release: Bigtop 0.3.0-incubating
    ξ€Š   Hadoop 1.0.1
ξ€Š   Last stable release: Bigtop 0.5.0
    ξ€Š   Hadoop 2.0.2-alpha
ξ€Š   Next stable release: Bigtop 0.6.0
    ξ€Š   End of Mar 2013 release
    ξ€Š   Hadoop 2.0.3-beta
    ξ€Š   Major focus on developers
                                                 28
What Bigtop needs from you?

    ξ€Š   More of you!
        ξ€Š   Meetup: β€œSilicon Valley Hands-on Programming”
            http://www.meetup.com/HandsOnProgrammingEvents/
    ξ€Š   More infrastructure for build/test
        ξ€Š   EC2, Supercell, EMC magic cluster, CloudStack
    ξ€Š   More integration tests
        ξ€Š   Convince your bosses to commit to Bigtop
ξ€Š       Validate upstream release using Bigtop
                                                              29
Contact
Β§
    Bigtop home @Apache:
    β€’
        http://incubator.apache.org/bigtop/
Β§
    Hangout places:
    β€’
        {dev,user}@bigtop.apache.org
    β€’
        #bigtop on Freenode
Β§
    Roman Shaposhnik
    β€’
        rvs@apache.org, rvs@cloudera.com



                                 30

More Related Content

What's hot

Practical introduction to dev ops with chef
Practical introduction to dev ops with chefPractical introduction to dev ops with chef
Practical introduction to dev ops with chefLeanDog
Β 
Chef Fundamentals Training Series Module 1: Overview of Chef
Chef Fundamentals Training Series Module 1: Overview of ChefChef Fundamentals Training Series Module 1: Overview of Chef
Chef Fundamentals Training Series Module 1: Overview of ChefChef Software, Inc.
Β 
Chef for OpenStack: Grizzly Roadmap
Chef for OpenStack: Grizzly RoadmapChef for OpenStack: Grizzly Roadmap
Chef for OpenStack: Grizzly RoadmapMatt Ray
Β 
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6Community Cookbooks & further resources - Fundamentals Webinar Series Part 6
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6Chef
Β 
Automated Deployment and Configuration Engines. Ansible
Automated Deployment and Configuration Engines. AnsibleAutomated Deployment and Configuration Engines. Ansible
Automated Deployment and Configuration Engines. AnsibleAlberto Molina Coballes
Β 
Boston/NYC Chef for OpenStack Hack Days
Boston/NYC Chef for OpenStack Hack DaysBoston/NYC Chef for OpenStack Hack Days
Boston/NYC Chef for OpenStack Hack DaysMatt Ray
Β 
Chef for OpenStack December 2012
Chef for OpenStack December 2012Chef for OpenStack December 2012
Chef for OpenStack December 2012Matt Ray
Β 
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...Chef Software, Inc.
Β 
Ninja, Choose Your Weapon!
Ninja, Choose Your Weapon!Ninja, Choose Your Weapon!
Ninja, Choose Your Weapon!Anton Weiss
Β 
Chef Fundamentals Training Series Module 2: Workstation Setup
Chef Fundamentals Training Series Module 2: Workstation SetupChef Fundamentals Training Series Module 2: Workstation Setup
Chef Fundamentals Training Series Module 2: Workstation SetupChef Software, Inc.
Β 
Package Management and Chef - ChefConf 2015
Package Management and Chef - ChefConf 2015Package Management and Chef - ChefConf 2015
Package Management and Chef - ChefConf 2015Chef
Β 
Scaling Development Environments with Docker
Scaling Development Environments with DockerScaling Development Environments with Docker
Scaling Development Environments with DockerDocker, Inc.
Β 
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2Node setup, resource, and recipes - Fundamentals Webinar Series Part 2
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2Chef
Β 
Environments - Fundamentals Webinar Series Week 5
Environments - Fundamentals Webinar Series Week 5Environments - Fundamentals Webinar Series Week 5
Environments - Fundamentals Webinar Series Week 5Chef
Β 
Opscode Webinar: Managing Your VMware Infrastructure with Chef
Opscode Webinar: Managing Your VMware Infrastructure with ChefOpscode Webinar: Managing Your VMware Infrastructure with Chef
Opscode Webinar: Managing Your VMware Infrastructure with ChefChef Software, Inc.
Β 
The unintended benefits of Chef
The unintended benefits of ChefThe unintended benefits of Chef
The unintended benefits of ChefChef Software, Inc.
Β 
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...Chef Software, Inc.
Β 
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web Scale
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web ScaleSaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web Scale
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web ScaleSaltStack
Β 
Building a PaaS using Chef
Building a PaaS using ChefBuilding a PaaS using Chef
Building a PaaS using ChefShaun Domingo
Β 

What's hot (20)

Practical introduction to dev ops with chef
Practical introduction to dev ops with chefPractical introduction to dev ops with chef
Practical introduction to dev ops with chef
Β 
Chef Fundamentals Training Series Module 1: Overview of Chef
Chef Fundamentals Training Series Module 1: Overview of ChefChef Fundamentals Training Series Module 1: Overview of Chef
Chef Fundamentals Training Series Module 1: Overview of Chef
Β 
Chef: Smart infrastructure automation
Chef: Smart infrastructure automationChef: Smart infrastructure automation
Chef: Smart infrastructure automation
Β 
Chef for OpenStack: Grizzly Roadmap
Chef for OpenStack: Grizzly RoadmapChef for OpenStack: Grizzly Roadmap
Chef for OpenStack: Grizzly Roadmap
Β 
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6Community Cookbooks & further resources - Fundamentals Webinar Series Part 6
Community Cookbooks & further resources - Fundamentals Webinar Series Part 6
Β 
Automated Deployment and Configuration Engines. Ansible
Automated Deployment and Configuration Engines. AnsibleAutomated Deployment and Configuration Engines. Ansible
Automated Deployment and Configuration Engines. Ansible
Β 
Boston/NYC Chef for OpenStack Hack Days
Boston/NYC Chef for OpenStack Hack DaysBoston/NYC Chef for OpenStack Hack Days
Boston/NYC Chef for OpenStack Hack Days
Β 
Chef for OpenStack December 2012
Chef for OpenStack December 2012Chef for OpenStack December 2012
Chef for OpenStack December 2012
Β 
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...
Chef Fundamentals Training Series Module 4: The Chef Client Run and Expanding...
Β 
Ninja, Choose Your Weapon!
Ninja, Choose Your Weapon!Ninja, Choose Your Weapon!
Ninja, Choose Your Weapon!
Β 
Chef Fundamentals Training Series Module 2: Workstation Setup
Chef Fundamentals Training Series Module 2: Workstation SetupChef Fundamentals Training Series Module 2: Workstation Setup
Chef Fundamentals Training Series Module 2: Workstation Setup
Β 
Package Management and Chef - ChefConf 2015
Package Management and Chef - ChefConf 2015Package Management and Chef - ChefConf 2015
Package Management and Chef - ChefConf 2015
Β 
Scaling Development Environments with Docker
Scaling Development Environments with DockerScaling Development Environments with Docker
Scaling Development Environments with Docker
Β 
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2Node setup, resource, and recipes - Fundamentals Webinar Series Part 2
Node setup, resource, and recipes - Fundamentals Webinar Series Part 2
Β 
Environments - Fundamentals Webinar Series Week 5
Environments - Fundamentals Webinar Series Week 5Environments - Fundamentals Webinar Series Week 5
Environments - Fundamentals Webinar Series Week 5
Β 
Opscode Webinar: Managing Your VMware Infrastructure with Chef
Opscode Webinar: Managing Your VMware Infrastructure with ChefOpscode Webinar: Managing Your VMware Infrastructure with Chef
Opscode Webinar: Managing Your VMware Infrastructure with Chef
Β 
The unintended benefits of Chef
The unintended benefits of ChefThe unintended benefits of Chef
The unintended benefits of Chef
Β 
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...
Chef Fundamentals Training Series Module 3: Setting up Nodes and Cookbook Aut...
Β 
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web Scale
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web ScaleSaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web Scale
SaltConf14 - Craig Sebenik, LinkedIn - SaltStack at Web Scale
Β 
Building a PaaS using Chef
Building a PaaS using ChefBuilding a PaaS using Chef
Building a PaaS using Chef
Β 

Viewers also liked

Whirr devdown
Whirr devdownWhirr devdown
Whirr devdownPuppet
Β 
Cascading - A Java Developer’s Companion to the Hadoop World
Cascading - A Java Developer’s Companion to the Hadoop WorldCascading - A Java Developer’s Companion to the Hadoop World
Cascading - A Java Developer’s Companion to the Hadoop WorldCascading
Β 
Hadoop Scheduling - a 7 year perspective
Hadoop Scheduling - a 7 year perspectiveHadoop Scheduling - a 7 year perspective
Hadoop Scheduling - a 7 year perspectiveJoydeep Sen Sarma
Β 
Introduction to Puppet Enterprise 2016.4
Introduction to Puppet Enterprise 2016.4Introduction to Puppet Enterprise 2016.4
Introduction to Puppet Enterprise 2016.4Puppet
Β 
Using Vagrant, Puppet, Testing & Hadoop
Using Vagrant, Puppet, Testing & HadoopUsing Vagrant, Puppet, Testing & Hadoop
Using Vagrant, Puppet, Testing & HadoopPuppet
Β 
Demystifying TLS
Demystifying TLSDemystifying TLS
Demystifying TLSPuppet
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet EnterprisePuppet
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet EnterprisePuppet
Β 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXBMC Software
Β 
Intro To Cascading
Intro To CascadingIntro To Cascading
Intro To CascadingNate Murray
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet EnterprisePuppet
Β 
Adopting Kubernetes with Puppet
Adopting Kubernetes with PuppetAdopting Kubernetes with Puppet
Adopting Kubernetes with PuppetPuppet
Β 

Viewers also liked (14)

Whirr devdown
Whirr devdownWhirr devdown
Whirr devdown
Β 
Cascading - A Java Developer’s Companion to the Hadoop World
Cascading - A Java Developer’s Companion to the Hadoop WorldCascading - A Java Developer’s Companion to the Hadoop World
Cascading - A Java Developer’s Companion to the Hadoop World
Β 
Hadoop Scheduling - a 7 year perspective
Hadoop Scheduling - a 7 year perspectiveHadoop Scheduling - a 7 year perspective
Hadoop Scheduling - a 7 year perspective
Β 
Hadoop scheduler
Hadoop schedulerHadoop scheduler
Hadoop scheduler
Β 
Introduction to Puppet Enterprise 2016.4
Introduction to Puppet Enterprise 2016.4Introduction to Puppet Enterprise 2016.4
Introduction to Puppet Enterprise 2016.4
Β 
Streamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache AmbariStreamline Hadoop DevOps with Apache Ambari
Streamline Hadoop DevOps with Apache Ambari
Β 
Using Vagrant, Puppet, Testing & Hadoop
Using Vagrant, Puppet, Testing & HadoopUsing Vagrant, Puppet, Testing & Hadoop
Using Vagrant, Puppet, Testing & Hadoop
Β 
Demystifying TLS
Demystifying TLSDemystifying TLS
Demystifying TLS
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet Enterprise
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet Enterprise
Β 
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAXHow Big Data and Hadoop Integrated into BMC ControlM at CARFAX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
Β 
Intro To Cascading
Intro To CascadingIntro To Cascading
Intro To Cascading
Β 
Introduction to Puppet Enterprise
Introduction to Puppet EnterpriseIntroduction to Puppet Enterprise
Introduction to Puppet Enterprise
Β 
Adopting Kubernetes with Puppet
Adopting Kubernetes with PuppetAdopting Kubernetes with Puppet
Adopting Kubernetes with Puppet
Β 

Similar to Deploying Hadoop-Based Bigdata Environments

App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)outstanding59
Β 
Inside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldInside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldRichard McDougall
Β 
App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)outstanding59
Β 
Hw09 Clouderas Distribution For Hadoop
Hw09   Clouderas Distribution For HadoopHw09   Clouderas Distribution For Hadoop
Hw09 Clouderas Distribution For HadoopCloudera, Inc.
Β 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesNicola Ferraro
Β 
2012 09-08-josug-jeff
2012 09-08-josug-jeff2012 09-08-josug-jeff
2012 09-08-josug-jeffZheng (Jeff) Xu
Β 
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosHadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosLester Martin
Β 
Hadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsHadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsLynn Langit
Β 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopDataWorks Summit
Β 
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop1012014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop101Adam Muise
Β 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-trainingGeohedrick
Β 
Hadoop demo ppt
Hadoop demo pptHadoop demo ppt
Hadoop demo pptPhil Young
Β 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereGanesh Raju
Β 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online trainingsrikanthhadoop
Β 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and dockerFabio Fumarola
Β 
Docker-v3.pdf
Docker-v3.pdfDocker-v3.pdf
Docker-v3.pdfBruno Cornec
Β 
2 Linux Container and Docker
2 Linux Container and Docker2 Linux Container and Docker
2 Linux Container and DockerFabio Fumarola
Β 

Similar to Deploying Hadoop-Based Bigdata Environments (20)

App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)
Β 
Inside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworldInside the Hadoop Machine @ VMworld
Inside the Hadoop Machine @ VMworld
Β 
App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)App Cap2956v2 121001194956 Phpapp01 (1)
App Cap2956v2 121001194956 Phpapp01 (1)
Β 
Hw09 Clouderas Distribution For Hadoop
Hw09   Clouderas Distribution For HadoopHw09   Clouderas Distribution For Hadoop
Hw09 Clouderas Distribution For Hadoop
Β 
Hadoop description
Hadoop descriptionHadoop description
Hadoop description
Β 
Extending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with KubernetesExtending DevOps to Big Data Applications with Kubernetes
Extending DevOps to Big Data Applications with Kubernetes
Β 
2012 09-08-josug-jeff
2012 09-08-josug-jeff2012 09-08-josug-jeff
2012 09-08-josug-jeff
Β 
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive DemosHadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Hadoop Demystified + MapReduce (Java and C#), Pig, and Hive Demos
Β 
Unit 5
Unit  5Unit  5
Unit 5
Β 
Hadoop MapReduce Fundamentals
Hadoop MapReduce FundamentalsHadoop MapReduce Fundamentals
Hadoop MapReduce Fundamentals
Β 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing Hadoop
Β 
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop1012014 feb 24_big_datacongress_hadoopsession1_hadoop101
2014 feb 24_big_datacongress_hadoopsession1_hadoop101
Β 
Best hadoop-online-training
Best hadoop-online-trainingBest hadoop-online-training
Best hadoop-online-training
Β 
Hadoop demo ppt
Hadoop demo pptHadoop demo ppt
Hadoop demo ppt
Β 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Β 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online training
Β 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and docker
Β 
Docker-v3.pdf
Docker-v3.pdfDocker-v3.pdf
Docker-v3.pdf
Β 
2 Linux Container and Docker
2 Linux Container and Docker2 Linux Container and Docker
2 Linux Container and Docker
Β 
Lecture 2 part 2
Lecture 2 part 2Lecture 2 part 2
Lecture 2 part 2
Β 

More from Puppet

Puppet camp2021 testing modules and controlrepo
Puppet camp2021 testing modules and controlrepoPuppet camp2021 testing modules and controlrepo
Puppet camp2021 testing modules and controlrepoPuppet
Β 
Puppetcamp r10kyaml
Puppetcamp r10kyamlPuppetcamp r10kyaml
Puppetcamp r10kyamlPuppet
Β 
2021 04-15 operational verification (with notes)
2021 04-15 operational verification (with notes)2021 04-15 operational verification (with notes)
2021 04-15 operational verification (with notes)Puppet
Β 
Puppet camp vscode
Puppet camp vscodePuppet camp vscode
Puppet camp vscodePuppet
Β 
Modules of the twenties
Modules of the twentiesModules of the twenties
Modules of the twentiesPuppet
Β 
Applying Roles and Profiles method to compliance code
Applying Roles and Profiles method to compliance codeApplying Roles and Profiles method to compliance code
Applying Roles and Profiles method to compliance codePuppet
Β 
KGI compliance as-code approach
KGI compliance as-code approachKGI compliance as-code approach
KGI compliance as-code approachPuppet
Β 
Enforce compliance policy with model-driven automation
Enforce compliance policy with model-driven automationEnforce compliance policy with model-driven automation
Enforce compliance policy with model-driven automationPuppet
Β 
Keynote: Puppet camp compliance
Keynote: Puppet camp complianceKeynote: Puppet camp compliance
Keynote: Puppet camp compliancePuppet
Β 
Automating it management with Puppet + ServiceNow
Automating it management with Puppet + ServiceNowAutomating it management with Puppet + ServiceNow
Automating it management with Puppet + ServiceNowPuppet
Β 
Puppet: The best way to harden Windows
Puppet: The best way to harden WindowsPuppet: The best way to harden Windows
Puppet: The best way to harden WindowsPuppet
Β 
Simplified Patch Management with Puppet - Oct. 2020
Simplified Patch Management with Puppet - Oct. 2020Simplified Patch Management with Puppet - Oct. 2020
Simplified Patch Management with Puppet - Oct. 2020Puppet
Β 
Accelerating azure adoption with puppet
Accelerating azure adoption with puppetAccelerating azure adoption with puppet
Accelerating azure adoption with puppetPuppet
Β 
Puppet catalog Diff; Raphael Pinson
Puppet catalog Diff; Raphael PinsonPuppet catalog Diff; Raphael Pinson
Puppet catalog Diff; Raphael PinsonPuppet
Β 
ServiceNow and Puppet- better together, Kevin Reeuwijk
ServiceNow and Puppet- better together, Kevin ReeuwijkServiceNow and Puppet- better together, Kevin Reeuwijk
ServiceNow and Puppet- better together, Kevin ReeuwijkPuppet
Β 
Take control of your dev ops dumping ground
Take control of your  dev ops dumping groundTake control of your  dev ops dumping ground
Take control of your dev ops dumping groundPuppet
Β 
100% Puppet Cloud Deployment of Legacy Software
100% Puppet Cloud Deployment of Legacy Software100% Puppet Cloud Deployment of Legacy Software
100% Puppet Cloud Deployment of Legacy SoftwarePuppet
Β 
Puppet User Group
Puppet User GroupPuppet User Group
Puppet User GroupPuppet
Β 
Continuous Compliance and DevSecOps
Continuous Compliance and DevSecOpsContinuous Compliance and DevSecOps
Continuous Compliance and DevSecOpsPuppet
Β 
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick Maludy
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick MaludyThe Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick Maludy
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick MaludyPuppet
Β 

More from Puppet (20)

Puppet camp2021 testing modules and controlrepo
Puppet camp2021 testing modules and controlrepoPuppet camp2021 testing modules and controlrepo
Puppet camp2021 testing modules and controlrepo
Β 
Puppetcamp r10kyaml
Puppetcamp r10kyamlPuppetcamp r10kyaml
Puppetcamp r10kyaml
Β 
2021 04-15 operational verification (with notes)
2021 04-15 operational verification (with notes)2021 04-15 operational verification (with notes)
2021 04-15 operational verification (with notes)
Β 
Puppet camp vscode
Puppet camp vscodePuppet camp vscode
Puppet camp vscode
Β 
Modules of the twenties
Modules of the twentiesModules of the twenties
Modules of the twenties
Β 
Applying Roles and Profiles method to compliance code
Applying Roles and Profiles method to compliance codeApplying Roles and Profiles method to compliance code
Applying Roles and Profiles method to compliance code
Β 
KGI compliance as-code approach
KGI compliance as-code approachKGI compliance as-code approach
KGI compliance as-code approach
Β 
Enforce compliance policy with model-driven automation
Enforce compliance policy with model-driven automationEnforce compliance policy with model-driven automation
Enforce compliance policy with model-driven automation
Β 
Keynote: Puppet camp compliance
Keynote: Puppet camp complianceKeynote: Puppet camp compliance
Keynote: Puppet camp compliance
Β 
Automating it management with Puppet + ServiceNow
Automating it management with Puppet + ServiceNowAutomating it management with Puppet + ServiceNow
Automating it management with Puppet + ServiceNow
Β 
Puppet: The best way to harden Windows
Puppet: The best way to harden WindowsPuppet: The best way to harden Windows
Puppet: The best way to harden Windows
Β 
Simplified Patch Management with Puppet - Oct. 2020
Simplified Patch Management with Puppet - Oct. 2020Simplified Patch Management with Puppet - Oct. 2020
Simplified Patch Management with Puppet - Oct. 2020
Β 
Accelerating azure adoption with puppet
Accelerating azure adoption with puppetAccelerating azure adoption with puppet
Accelerating azure adoption with puppet
Β 
Puppet catalog Diff; Raphael Pinson
Puppet catalog Diff; Raphael PinsonPuppet catalog Diff; Raphael Pinson
Puppet catalog Diff; Raphael Pinson
Β 
ServiceNow and Puppet- better together, Kevin Reeuwijk
ServiceNow and Puppet- better together, Kevin ReeuwijkServiceNow and Puppet- better together, Kevin Reeuwijk
ServiceNow and Puppet- better together, Kevin Reeuwijk
Β 
Take control of your dev ops dumping ground
Take control of your  dev ops dumping groundTake control of your  dev ops dumping ground
Take control of your dev ops dumping ground
Β 
100% Puppet Cloud Deployment of Legacy Software
100% Puppet Cloud Deployment of Legacy Software100% Puppet Cloud Deployment of Legacy Software
100% Puppet Cloud Deployment of Legacy Software
Β 
Puppet User Group
Puppet User GroupPuppet User Group
Puppet User Group
Β 
Continuous Compliance and DevSecOps
Continuous Compliance and DevSecOpsContinuous Compliance and DevSecOps
Continuous Compliance and DevSecOps
Β 
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick Maludy
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick MaludyThe Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick Maludy
The Dynamic Duo of Puppet and Vault tame SSL Certificates, Nick Maludy
Β 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
Β 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
Β 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
Β 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
Β 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
Β 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
Β 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
Β 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
Β 
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | DelhiFULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhisoniya singh
Β 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
Β 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
Β 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
Β 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
Β 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
Β 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
Β 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
Β 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
Β 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
Β 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
Β 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
Β 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
Β 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
Β 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
Β 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
Β 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
Β 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Β 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Β 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Β 
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | DelhiFULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY πŸ” 8264348440 πŸ” Call Girls in Diplomatic Enclave | Delhi
Β 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Β 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
Β 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
Β 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Β 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Β 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Β 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
Β 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Β 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
Β 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Β 

Deploying Hadoop-Based Bigdata Environments

  • 1. Deploying Hadoop-Based Bigdata Environments Click to edit Master subtitle style β€œ[Tall] Tales From The Frontier” Roman Shaposhnik rvs@apache.org, Cloudera Inc.
  • 2. $ whoami ξ€Š An open source software developer ξ€Š Linux kernel, C/C++ compilers, FFmpeg, Plan9 ξ€Š A Hadoop and all around UNIX guy ξ€Š root@cloudera ξ€Š Member of the β€œKitchen” team ξ€Š Apache Software Foundation Incubator PMC ξ€Š [Bigtop], Hadoop Development Tools, Celix, Helix ξ€Š VP of Apache Bigtop 2
  • 3. ZooKeeper (coordination) HUE (web based UI) Pig (DQL) Hive (SQL) Impala (SQL) HBase YARN/MR1 Oozie HDFS (filesystem) 3
  • 4. ZooKeeper (coordination) HUE (web based UI) Pig (DQL) Hive (SQL) Impala (SQL) HBase YARN/MR1 Oozie HDFS (filesystem) 4
  • 5. It is a jungle out there ξ€Š Zookeeper ξ€Š Sqoop ξ€Š JDK/JRE ξ€Š Hadoop ξ€Š Oozie ξ€Š Kerberos ξ€Š HDFS ξ€Š Whirr ξ€Š Ganglia ξ€Š YARN ξ€Š Mahout ξ€Š Nagios ξ€Š MR1 ξ€Š Flume ξ€Š JSVC ξ€Š HTTPFS ξ€Š Giraph ξ€Š Tomcat ξ€Š HBase ξ€Š Hama ξ€Š Utils ξ€Š Pig ξ€Š Hue ξ€Š Postgress ξ€Š Hive ξ€Š Solr ξ€Š HTTPD ξ€Š Impala ξ€Š Crunch 5
  • 6. And the answer is: Puppet[forge] 6
  • 7. One way of using Apache software $ wget http://apache.org/httpd.tar.gz $ tar xzvf httpd.tar.gz $ cd httpd $ ./configure ; make $ make install ERROR: can't write to /usr/local/bin $ sudo make install 7
  • 8. A different way $ sudo apt-get install httpd Would you like to also upgrade your conf? 8
  • 9. Is there apt-get install hadoop ? ξ€Š Hadoop is still in a very active development ξ€Š Hadoop is Java based ξ€Š Hadoop is a distributed application ξ€Š Hadoop is way more than HDFS + MR 9
  • 10. Project-by-project approach ξ€Š β€œPassively” maintained code ξ€Š Packaging, OS-level (init.d) ξ€Š Developer-centric view ξ€Š Edit-compile-debug cycle vs. deployment ξ€Š Lack of integration testing ξ€Š Differences in distributions/packaging: ξ€Š Where is this valid: /usr/libexec ? ξ€Š Combinatoric explosion of dependencies 10
  • 11. Dependencies Inferno: Hive 0.8.1 HBase Hbase (0.92, 0.90) HBase HBase Hadoop (1.0, 0.22, 0.23) A million dollar question: $ tar xzvf hive-0.8.1.tar.gz $ ls hive-0.8.1/lib 11
  • 12. Dependencies Inferno: Hive 0.8.1 HBase Hbase (0.92, 0.90) HBase HBase Hadoop (1.0, 0.22, 0.23) A million dollar question: $ tar xzvf hive-0.8.1.tar.gz $ ls hive-0.8.1/lib hbase-0.89.jar log4j-1.2.15.jar log4j-1.2.16.jar 12
  • 13. Remember what Debian did to Linux? GNU Software Linux kernel Linux kernel 13
  • 14. Bigtop is trying to do it with Hadoop Hadoop Ecosystem Hadoop Linux kernel (Pig, Hive, Mahout) (HDFS + MR) CDH4 beta 1 14
  • 15. What's there in Bigtop ξ€Š Build/Packaging infrastructure ξ€Š RPM, DEB, (tarballs, homebrew/MacPorts) ξ€Š VirtualBox, VMWare and KVM VMs ξ€Š Fedora, OpenSUSE, Mageia, CentOS, Ubuntu ξ€Š Puppet deployment infrastrucutre ξ€Š Integration test infrastrucutre (iTest) ξ€Š Bigtop Jenkins: ξ€Š http://bigtop01.cloudera.org:8080 15
  • 16. And the answer is: Puppet[Bigtop] 16
  • 17. System software deployment ξ€Š Packages vs. Puppet code ξ€Š package/file/service ξ€Š What is packaging? ξ€Š dependency tracking ξ€Š build encapsulation ξ€Š java packaging ξ€Š file layout ξ€Š user creation ξ€Š service registration 17
  • 18. Does it really work? ξ€Š Java packaging ξ€Š maven/ivy integration ξ€Š file layout ξ€Š side-by-side installations of the same package ξ€Š user creation ξ€Š LDAP/AD provisioning ξ€Š service registration ξ€Š start on install vs. start on reboot 18
  • 19. Petascale distributed systems ξ€Š Scale ξ€Š Yahoo! ~5000 nodes ξ€Š Deployment orchestration ξ€Š Kerberos::Host_keytab <| title == "hdfs" |> -> Service["hadoop-hdfs-datanode"] ξ€Š Highly coordinated distributed system ξ€Š It ain't HTTPD/loadbalancer ξ€Š Rolling upgrades/asynchronous rollbacks 19
  • 20. Back to tarballs and shell? ξ€Š What's better for Puppet: fpm or rpm? ξ€Š What is the role of Puppet? ξ€Š coordinating the entire system: lack of DSL ξ€Š converging an isolated node: will it ever work? ξ€Š a building block for an agent-based system ξ€Š One agent to rule them all? ξ€Š there's no spoon^H^H^H^H^H^ agent: Whirr ξ€Š MCollective 20 ξ€Š Cloudera Manager, Ambari
  • 21. Evolution, not perfection! ξ€Š Minimalistic, highly consistent packages ξ€Š /usr/lib/hadoop, /etc/hadoop/conf (alternative) ξ€Š fail gracefully: .... || : ) ξ€Š Java packaging is not solved [yet]: symlinks ξ€Š Minimalistic Puppet code ξ€Š package/file/service ξ€Š masterless (most of the time) ξ€Š integration with Whirr ξ€Š BoxGrinder 21
  • 22. The road ahead ξ€Š New kind of configuration management ξ€Š /etc/hadoop vs Zookeeper ξ€Š New kinds of system packaging ξ€Š Parcels (tarballs + metadata) ξ€Š HPS (Hadoop Packaging System) ξ€Š Orchestration: to puppet or not to puppet? ξ€Š Cloudera Manager ξ€Š Apache Ambari (incubating) ξ€Š Reactor 8: http://reactor8.com 22
  • 23. Java Packaging ξ€Š Fate of Java ξ€Š OpenJDK ξ€Š OSGi ξ€Š Hadoop's view: MAPREDUCE-1700 https://issues.apache.org/jira/browse/MAPREDUCE-1700 ξ€Š Project Jigsaw ξ€Š Language tie-ins? Really? ξ€Š Linux vendors getting their act together 23
  • 24. Integration testing ξ€Š Clean room provisioning ξ€Š Those ain't unit tests – they trash the system ξ€Š Cluster topology and cluster state discovery ξ€Š How can puppet help us? ξ€Š Cluster state manipulation ξ€Š Test-driven orchestration ξ€Š Chaos Monkey ξ€Š How to be successful in OS co-opetition ξ€Š Make everything pluggable (and subvert ;-)) 24
  • 25. Anatomy of iTest ξ€Š Versioned, JVM-based test/data artifacts ξ€Š Dependency between test artifacts ξ€Š Matching stack of integration tests ξ€Š Implementation ξ€Š Maven artifacts, pom files ξ€Š JUnit test-execution entry point ξ€Š Groovy for scripting 25
  • 26. Who's the target audience ξ€Š End users ξ€Š YOU! ξ€Š ASF Projects/Bigdata developers ξ€Š from Avro to Zookeeper ξ€Š Bigdata solutions vendors ξ€Š Cloudera, EMC, Hortonworks, Karmasphere ξ€Š DevOPs ξ€Š Ebay, Yahoo, Facebook, LinkedIn 26
  • 27. Who's on-board? ξ€Š Cloudera ξ€Š CDH4 is 100% based on Bigtop (hadoop v2) ξ€Š Available @cloudera.com ξ€Š Canonical ξ€Š Ubuntu Server: Hadoop and Bigdata blueprint https://blueprints.launchpad.net/ubuntu/+spec/servercloud-p-hdp-hadoop ξ€Š TrendMicro ξ€Š Hortonworks (partially) ξ€Š EMC, EBay (early stages of prototyping) 27
  • 28. What's happening? ξ€Š A special release: Bigtop 0.3.0-incubating ξ€Š Hadoop 1.0.1 ξ€Š Last stable release: Bigtop 0.5.0 ξ€Š Hadoop 2.0.2-alpha ξ€Š Next stable release: Bigtop 0.6.0 ξ€Š End of Mar 2013 release ξ€Š Hadoop 2.0.3-beta ξ€Š Major focus on developers 28
  • 29. What Bigtop needs from you? ξ€Š More of you! ξ€Š Meetup: β€œSilicon Valley Hands-on Programming” http://www.meetup.com/HandsOnProgrammingEvents/ ξ€Š More infrastructure for build/test ξ€Š EC2, Supercell, EMC magic cluster, CloudStack ξ€Š More integration tests ξ€Š Convince your bosses to commit to Bigtop ξ€Š Validate upstream release using Bigtop 29
  • 30. Contact Β§ Bigtop home @Apache: β€’ http://incubator.apache.org/bigtop/ Β§ Hangout places: β€’ {dev,user}@bigtop.apache.org β€’ #bigtop on Freenode Β§ Roman Shaposhnik β€’ rvs@apache.org, rvs@cloudera.com 30