The Elephant in the Room Jim O’Neil Developer Evangelist, Microsoft email@example.com @jimoneil
DIYRoll your own Hadoop cluster…. welcome to DevOps “Isotope” Pallet
AppliancesOracle Big Data Appliance – 18 server / 12 core each / 40Gb Infiniband – Partnering with Cloudera on the distributionGreenplum HD Data Computing Appliance – 18 nodes, 12 core each – Straight up Apache HadoopNetApp Open Solution for Hadoop – Storage arrays only (E2660 and FAS2040) – Partnership with Cloudera
The Elephant in the CloudJim O’NeilDeveloper Evangelist, Microsoftjim.firstname.lastname@example.org @jimoneil
Cloud: a Notional Definition Private Cloud Deployment Models Hybrid Cloud Community Cloud Public Cloud Essential Characteristics Infrastructure as a Service Platform as a Service Broad network Rapid Elasticity accessSoftware as Resource Pooling a Service On-demand self-service Measured service
Hadoop in the CloudGoogle App Engine appengine-mapreduce API (not really Hadoop)Amazon Web Services 66 Public AMIs (including Cloudera) Elastic Map ReduceWindows Azure Hadoop on AzureIBM SmartCloud Infosphere BigInsights
Google App Engine MapreducePipeline ClassExperimental!Mapreduce is an experimental, innovative, and rapidlychanging new feature for App Engine.Unfortunately, being on the bleeding edge means thatwe may make backwards-incompatible changes toMapreduce. We will inform the community when thisfeature is no longer experimental.