BIGDATA APPS
ON OPENSTACK
thanks:Uri Cohen


V.P. PM @ GigaSpaces
     @uri1803
  github.com/uric
about:me


V.P. R&D @ GigaSpaces
    @g_korland
github.com/gkorland
“Big data and cloud technology
go hand-in-hand. Big data needs
clusters of servers for processing,
which clouds can readily provide.”

                  Edd Dumbill, The O’Reilly Radar
Why OpenStack?

Everyone (well, almost…) is in
HP, Rackspace, Dell, IBM, Red Hat
Public / Private cloud interop
Cloud bursting
Portability
The 3 V’s of Big Data
Big Data Is Not Just About
        Elephants
Distributed File System
Map/Reduce
Data Mining
Data Access
Discovery & Orchestration
Real Time Event Processing
HDFS, in Reality
Map/Reduce, in Reality
Data Access, in Reality
Mongo, in Reality
Storm
Storm, in Reality
And What about Other Parts
      of Your App??
And This is What Your Big Data
 App Will Eventually Look Like
Managing All of That In the Cloud
Start VMs
Install and configure components
Monitor
Repair
(Auto) Scale
Burst…
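
The monitor / repair / (auto) scale steps above can be sketched roughly as follows. This is a toy in-memory model only: `Node`, `Cluster`, and the load thresholds are hypothetical names and values chosen for illustration, and no real OpenStack or Cloudify API is called.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    name: str
    healthy: bool = True
    load: float = 0.0  # fraction of capacity in use, 0.0 to 1.0


@dataclass
class Cluster:
    nodes: List[Node] = field(default_factory=list)
    counter: int = 0

    def start_vm(self) -> Node:
        """Stand-in for 'start VM + install/configure': adds an in-memory node."""
        self.counter += 1
        node = Node(name=f"node-{self.counter}")
        self.nodes.append(node)
        return node

    def repair(self) -> int:
        """Replace every unhealthy node with a fresh one; return how many were replaced."""
        failed = [n for n in self.nodes if not n.healthy]
        for n in failed:
            self.nodes.remove(n)
            self.start_vm()
        return len(failed)

    def autoscale(self, high: float = 0.8, low: float = 0.2, min_nodes: int = 1) -> str:
        """Add a node when average load is high, remove one when it is low."""
        if not self.nodes:
            self.start_vm()
            return "scale-out"
        avg = sum(n.load for n in self.nodes) / len(self.nodes)
        if avg > high:
            self.start_vm()
            return "scale-out"
        if avg < low and len(self.nodes) > min_nodes:
            self.nodes.pop()
            return "scale-in"
        return "steady"
```

In a real deployment the `start_vm` step would go through the cloud's compute API, and the load figures would come from a monitoring agent rather than being set by hand.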
Managing Big Data Apps ≠ Managing Big Data
Some Existing (incomplete)
        Solutions
Introducing Cloudify
the Open PaaS Stack (Pun Intended)
When PaaS Met DevOps…
Your App, Your Recipe
How It Works
[Diagram labels: Application Recipe; Invokes IaaS API; Compute Services; Bootstrap Java & Cloudify Agent; Install, Configure, Start, Monitor & Ensure H/A; …mmands; …e Allocation]
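
To make the recipe idea concrete, here is a minimal sketch of lifecycle phases run in the order the diagram names them (install, configure, start, monitor). It is not Cloudify's recipe syntax: `run_recipe`, `LIFECYCLE_ORDER`, and the example hooks are hypothetical and only illustrate the concept of an application described by its lifecycle steps.

```python
from typing import Callable, Dict, List

# Phase order taken from the diagram labels; the names themselves are illustrative.
LIFECYCLE_ORDER = ["install", "configure", "start", "monitor"]


def run_recipe(recipe: Dict[str, Callable[[], None]]) -> List[str]:
    """Run the lifecycle hooks a recipe defines, in a fixed order.

    Phases the recipe does not define are skipped.
    """
    executed = []
    for phase in LIFECYCLE_ORDER:
        hook = recipe.get(phase)
        if hook is None:
            continue
        hook()
        executed.append(phase)
    return executed


# Hypothetical recipe: hooks are declared in arbitrary order, and there is no configure step.
log: List[str] = []
example_recipe = {
    "start": lambda: log.append("starting service"),
    "install": lambda: log.append("installing packages"),
    "monitor": lambda: log.append("checking liveness"),
}
phases = run_recipe(example_recipe)
```

The point of the fixed order is that the recipe author only declares what each phase does; the agent on the machine decides when each phase runs.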
Demo
Getting Started…
Get Involved!
     Community site
     http://www.cloudifysource.org
     OSS project repo:
     https://github.com/CloudifySource/cloudify
     Jira:
     https://cloudifysource.atlassian.net
     Travis-CI:
     http://travis-ci.org/#!/CloudifySource/cloudify
Q&A
Thank You!

 Get it today, for free:
www.cloudifysource.org

Open stack bigdata NY cloudcamp

Editor's Notes

  • #26 Primary and backup dependency. Moving data around for scaling. Availability: moving data without losing it. Moving processes to the data rather than the other way around. Managing the app as a whole.