Thank you
@natishalom
About GigaSpaces




                   3
The Reality of Big Data..

       2.7 ZB
   Global Digital Data




0.5 Petabytes
                               43%                   think that data

                               analytics could be improved in their
   Two years tweets          organization if data analytics was part of

                              cloud services
        66%
Plan to use Big Data/Cloud


                                                                       5
Large ISV Case Study
• Application
  – Call Center surveillance
• Background
  – Previously – voice data
• Goal for a new system
  – Monitor data & voice
  – Multiple data sources
  – Advanced correlations
Ever Growing Data

Deeper Correlation

Tight Performance
A Classic Case for..
A Typical Big Data System…
Business
Cost
        Impact
Big Data
in the Cloud
Big Data in the Cloud- 3 Reasons
                • Skills
                   – Do you really need/want this all in-
                     house?
                • Huge amounts of external data.
Holger Kisker      – Does it make sense to move and
                     manage all this data behind your
                     firewall?
                • Focus on the value of your data
                   – Instead of big data management.
• Auto start VMs
• Install and configure
  app components
• Monitor
• Repair
• (Auto) Scale
• Burst…
Running Bare-Metal for
high I/O workloads, Public
cloud for sporadic
workloads ..
• Consistent
  Management
• Automation Through
  the Entire Stack
http://code.zynga.com/2012/02/the-evolution-of-zcloud/
http://code.mixpanel.com/2010/11/08/amazon-vs-rackspace/
Realization:
  What You
 Really Care
  about Is
    App
 Portability
Consistent Management
   Recipes consistent description for running any app:
                       What middleware services to run
                       Dependencies between services
                       How to install services
                       Where application and service binaries are
                       When to spawn or terminate instances
                       How to monitor each of the services.




      ® Copyright 2012 GigaSpaces Ltd. All Rights
                                                                     27
                      Reserved
Choosing the Right Cloud for the Job
      compute {
        template "SMALL_LINUX"
      }




SMALL_LINUX : template{                                                SMALL_LINUX : template
  imageId "1234"                                                         imageId "us-east-1/ami-76f0061f“
  machineMemoryMB 3200                                                   remoteDirectory "/home/ec2-user/gs-files“
  hardwareId "103"                                                       machineMemoryMB 1600
  remoteDirectory "/root/gs-files"                                       hardwareId "m1.small"
  localDirectory "upload"                                                locationId "us-east-1"
  keyFile "gigaPGHP.pem"                                                 localDirectory "upload"
  options ([                                                             keyFile "myKeyFile.pem"
    "openstack.securityGroup" : "default",
    "openstack.keyPair" : "gigaPGHP"                                     options ([
      ])                                                                       "securityGroups" : ["default"]as
      privileged true                                                  String[],
}                                                                              "keyPair" : "myKeyFile"
                                                                             ])
                                                                             overrides (["jclouds.ec2.ami-query":"",
                                                                             "jclouds.ec2.cc-ami-query":""])
                                                                             privileged true
                                                                       }


                                        ® Copyright 2012 GigaSpaces Ltd. All Rights
                                                                                                                 29
                                                        Reserved
Automation across the stack
             1   Upload your recipe.

             2   Cloudify creates VM’s & installs agents

             3   Agents install and manage your app

             4   Cloudify automate the scaling
Demo Time – Storm on Demand..




                                32
RightScale
Amazon Elastic Map Reduce
Large ISV Case Study
• Application
  – Call Center surveillance system
• Background
  – Previously – voice data
• Goal for a new system
  Monitor data & voice
  Multiple data sources
  Advanced correlations              Mission
                                      Accomplished
Additional Benefits
     • True Cloud Economics

     • One product -> any
       Customer Environment



     • Increased Agility
Try a simple Big Data Demo Yourself




launch.cloudifysource.org/d
http://www.cloudifysource.org
http://github.com/CloudifySource

Big data (reversim)

  • 1.
  • 2.
  • 3.
  • 5.
    The Reality ofBig Data.. 2.7 ZB Global Digital Data 0.5 Petabytes 43% think that data analytics could be improved in their Two years tweets organization if data analytics was part of cloud services 66% Plan to use Big Data/Cloud 5
  • 6.
    Large ISV CaseStudy • Application – Call Center surveillance • Background – Previously – voice data • Goal for a new system – Monitor data & voice – Multiple data sources – Advanced correlations
  • 7.
    Ever Growing Data DeeperCorrelation Tight Performance
  • 8.
  • 9.
    A Typical BigData System…
  • 10.
  • 11.
  • 12.
    Big Data inthe Cloud- 3 Reasons • Skills – Do you really need/want this all in- house? • Huge amounts of external data. Holger Kisker – Does it make sense to move and manage all this data behind your firewall? • Focus on the value of your data – Instead of big data management.
  • 13.
    • Auto startVMs • Install and configure app components • Monitor • Repair • (Auto) Scale • Burst…
  • 14.
    Running Bare-Metal for highI/O workloads, Public cloud for sporadic workloads ..
  • 15.
    • Consistent Management • Automation Through the Entire Stack
  • 19.
  • 20.
  • 25.
    Realization: WhatYou Really Care about Is App Portability
  • 27.
    Consistent Management Recipes consistent description for running any app:  What middleware services to run  Dependencies between services  How to install services  Where application and service binaries are  When to spawn or terminate instances  How to monitor each of the services. ® Copyright 2012 GigaSpaces Ltd. All Rights 27 Reserved
  • 29.
    Choosing the RightCloud for the Job compute { template "SMALL_LINUX" } SMALL_LINUX : template{ SMALL_LINUX : template imageId "1234" imageId "us-east-1/ami-76f0061f“ machineMemoryMB 3200 remoteDirectory "/home/ec2-user/gs-files“ hardwareId "103" machineMemoryMB 1600 remoteDirectory "/root/gs-files" hardwareId "m1.small" localDirectory "upload" locationId "us-east-1" keyFile "gigaPGHP.pem" localDirectory "upload" options ([ keyFile "myKeyFile.pem" "openstack.securityGroup" : "default", "openstack.keyPair" : "gigaPGHP" options ([ ]) "securityGroups" : ["default"]as privileged true String[], } "keyPair" : "myKeyFile" ]) overrides (["jclouds.ec2.ami-query":"", "jclouds.ec2.cc-ami-query":""]) privileged true } ® Copyright 2012 GigaSpaces Ltd. All Rights 29 Reserved
  • 30.
    Automation across thestack 1 Upload your recipe. 2 Cloudify creates VM’s & installs agents 3 Agents install and manage your app 4 Cloudify automate the scaling
  • 32.
    Demo Time –Storm on Demand.. 32
  • 34.
  • 35.
  • 36.
    Large ISV CaseStudy • Application – Call Center surveillance system • Background – Previously – voice data • Goal for a new system Monitor data & voice Multiple data sources Advanced correlations Mission Accomplished
  • 37.
    Additional Benefits • True Cloud Economics • One product -> any Customer Environment • Increased Agility
  • 38.
    Try a simpleBig Data Demo Yourself launch.cloudifysource.org/d
  • 39.

Editor's Notes

  • #5 Reasons why people would be concerned of moving data into the cloud- I suppose one thing you could mention "against" having data in the cloud is the fear of losing control of your data (high cost of transfer, lock in etc)Talk about private/public cloud
  • #6 GigaSpaces Big Data Survey:http://www.gigaspaces.com/sites/default/files/product/BigDataSurvey_Report.pdfForbes on Big Data & Cloud http://www.forbes.com/sites/forrester/2012/08/15/big-data-meets-cloud/38% of all companies from our survey are planning a BI SaaS project before the end of 2013. Many of those respondents (27%) plan to complement their existing BI solutions and a smaller number (11%) actually plan to fully replace their existing BI with a cloud solutionForrester:This year we will hit a volume of 2.7 zettabytes of global digital data ~20% of all tweets include a link that needs to be opened to understand its context.[ii] All tweets from the past two years take 0.5 petabytes to store; it simply doesn’t make sense for every company interested in social media to start storing the same big data in-house.http://www.globaltelecomsbusiness.com/article/3133566/Big-data-becomes-priority-as-executives-tackle-complexity-of-business-analytics.htmlCompanies are most interested in getting access to data in real time (54%), accessing data from multiple devices (51%) and accessing data from remote/flexible locations (44%). Yet, getting access to data in real time emerges as the biggest challenge for companies (52%) along with speed of data delivery (50%); 
• 43% think that data analytics could be improved in their organisation if data analytics was part of cloud services delivered with third-party expertise. 
   
  • #13 Big data requires a spectrum of advanced technologies, skills, and investments. Do you really need/want this all in-house?Big data includes huge amounts of external data. Does it make sense to move and manage all this data behind your firewall?Big data needs a lot of data services. Focus on the value of your differentiated data analysis instead of big data management.http://www.forbes.com/sites/forrester/2012/08/15/big-data-meets-cloud/
  • #16 Consistent Management: Making the deployment, installation, scaling, fail-over looks the same through the entire stack
  • #31 Any app - All clouds