
Farming Hadoop in the Cloud

The document discusses running Hadoop clusters in the cloud and the challenges that presents. It introduces CloudFarmer, a tool that allows defining roles for VMs and dynamically allocating VMs to roles. This allows building agile Hadoop clusters in the cloud that can adapt as needs change without static configurations. CloudFarmer provides a web UI to manage roles and hosts.
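The idea of defining roles and assigning VMs to them at run time can be sketched in a few lines of Python. This is only an illustration of the concept, not CloudFarmer's actual API: the `Role` and `Farm` classes, the method names, and the `max_hosts` limit are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Role:
    """A named role (e.g. 'worker') with a cap on how many VMs may hold it."""
    name: str
    max_hosts: int
    hosts: set = field(default_factory=set)


class Farm:
    """Hypothetical in-memory role registry; not CloudFarmer's real API."""

    def __init__(self):
        self.roles = {}

    def define_role(self, name, max_hosts):
        """Create (or redefine) a role with an upper bound on its hosts."""
        self.roles[name] = Role(name, max_hosts)

    def allocate(self, role_name, hostname):
        """Assign a host to a role; return False if the role is already full."""
        role = self.roles[role_name]
        if hostname in role.hosts:
            return True
        if len(role.hosts) >= role.max_hosts:
            return False
        role.hosts.add(hostname)
        return True

    def release(self, role_name, hostname):
        """Free a role slot, e.g. when a VM is destroyed."""
        self.roles[role_name].hosts.discard(hostname)

    def lookup(self, role_name):
        """Return the hosts currently holding a role, for clients to discover."""
        return sorted(self.roles[role_name].hosts)
```

A VM would ask for a role when it boots and other machines would call `lookup` to find it, so no hostname has to be written into a static configuration file.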

Technology

Committer: Ant, Hadoop-common, HDFS, MapReduce

CLOUD ELIMINATES

Buying hardware based on predicted load

2+ week lead time on new hardware, storage

Static machine names, addresses and capabilities

Someone in the datacentre who cares about you

APPLICATIONS MUST BE AGILE

Directory, database or CM service to configure

Use dynamic DNS services; don’t cache IP addresses

Don’t expect HDD content to last on a single disk

Restart VMs on any app failure

Nothing is static. Nothing lasts.
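The advice not to cache IP addresses can be illustrated with a small client-side sketch in Python. It assumes the service is reachable over TCP under a DNS name; the function names, the retry count, and the injectable `resolver` parameter are illustrative choices, not something the slides prescribe.

```python
import socket


def resolve(hostname, port):
    """Look the host up on every call instead of caching an IP address."""
    infos = socket.getaddrinfo(hostname, port, type=socket.SOCK_STREAM)
    # Each entry is (family, type, proto, canonname, sockaddr).
    return [info[4] for info in infos]


def connect_with_retry(hostname, port, attempts=3, timeout=5.0, resolver=resolve):
    """Re-resolve the name before each attempt: a restarted VM may have a new address."""
    last_error = None
    for _ in range(attempts):
        try:
            for sockaddr in resolver(hostname, port):
                try:
                    # sockaddr[:2] is (host, port) for both IPv4 and IPv6 results.
                    return socket.create_connection(sockaddr[:2], timeout=timeout)
                except OSError as err:
                    last_error = err
        except OSError as err:  # socket.gaierror is a subclass of OSError
            last_error = err
    raise ConnectionError(f"could not reach {hostname}:{port}") from last_error
```

The sketch retries immediately; a real client would probably wait between attempts. Note also that some runtimes and local resolvers cache lookups themselves, so the cache lifetime there may need tuning as well.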

CLASSIC TEAM ROLES

Business Development Architecture Operations Development

BEFORE

Business Development Architecture Operations Development

Design, Code, Test, Staging, Live

RESPONSIBILITIES

Work

Architects design the application

Developers code and test on local machines

More from Steve Loughran