SlideShare a Scribd company logo
1 of 13
March 13, 2011
From Zero to Big Data Answers in Less Than
an Hour
Daniel Templeton | Cloudera Manager, Partner Program Adoption
Richard Guth | Karmasphere Chief Marketing Officer
The ‘Big Data’ Phenomenon

       Big Data Drivers:                     More Content                                                                      More Devices

        The proliferation of data capture
         and creation technologies
        Increased “interconnectedness”
         drives consumption (creating
         more data)                             More                                                                           New & Better
                                             Consumption                                                                        Information
        Inexpensive storage makes it
         possible to keep more, longer
        Innovative software and analysis
         tools turn data into information




                                                                                          Every gigabyte of stored content can generate
                                  Big Data encompasses not                                 a petabyte or more of transient data*
                                  only the content itself, but
                                  how it’s consumed.                                      The information about you is much greater
                                                                                           than the information you create
*Source: IDC 2011




     2
                                                  ©2011 Cloudera, Inc. All Rights Reserved.
What is Apache Hadoop?

                                                                                   CORE HADOOP COMPONENTS
   Apache Hadoop is a platform for
   data storage and processing that is…                                     Hadoop
                                                                        Distributed File
        Scalable                                                       System (HDFS)                         MapReduce
        Fault tolerant
        Open source                                                      File Sharing & Data
                                                                           Protection Across
                                                                                                            Distributed Computing
                                                                                                           Across Physical Servers
                                                                           Physical Servers




 Has the Flexibility to Store                      Excels at                                                Scales
 and Mine Any Type of Data                  Processing Complex Data                                      Economically
 Ask questions across structured and       Scale-out architecture divides                      Can be deployed on commodity
  unstructured data that were previously     workloads across multiple nodes                      hardware
  impossible to ask or solve                Flexible file system eliminates ETL                 Open source platform guards
 Not bound by a single schema               bottlenecks                                          against vendor lock




   3
                                               ©2011 Cloudera, Inc. All Rights Reserved.
Who Is Cloudera?

The trusted leader in            We make Hadoop                         Unrivaled knowledge               Strong executive
  Apache Hadoop.                 enterprise-easy.                         and experience.                 team with proven
                                                                                                              abilities.
 Package the #1                A distribution of Apache                Founders, committers and
  distribution of Apache         Hadoop that is                           contributors to Apache
                                                                                                       Mike Olson        Amr Awadallah
  Hadoop in commercial and       tested, certified and                    Hadoop and related           CEO               VP, Engineering
  non-commercial                 supported                                projects                     Kirk Dunn         Mary Rorabaugh
  environments                                                                                         COO               VP, Finance
                                A suite of management                   A wealth of experience in    Jeff              Charles
 Roadmap control or             software for Hadoop                      the design and delivery of   Hammerbacher      Zedlewski
                                                                                                       Chief Scientist   VP, Products
  influence over all Apache      administrators                           enterprise software
                                                                                                       Doug Cutting      Omer Trajman
  Hadoop-related projects                                                                              Chief Architect   VP, Customer
                                Training and certification                                                              Solutions
 Top contributor to the         programs
  Apache ecosystem overall
                                Comprehensive support
 Tens of thousands of nodes     and consulting services
  under management




    4
                                                 ©2011 Cloudera, Inc. All Rights Reserved.
CDH Overview

         The #1 commercial and non-commercial
         Apache Hadoop distribution.

    Complete, Integrated Hadoop Stack                                                                                     CDH Components
                                                                                                        Apache Hadoop – reliable, scalable distributed computing
     File System Mount                    UI Framework                       SDK
                      FUSE-DFS                                 HUE                  HUE SDK
                                                                                                        Apache Hive – SQL-like language and metadata repository
                                                                                                        Apache Pig – High level language for expressing data analysis
                                                                                                         programs
           Workflow                         Scheduling                     Metadata
                 APACHE OOZIE                        APACHE OOZIE               APACHE HIVE             Apache HBase – Hadoop database for random, real-time read/write
                                                                                                         access
                                                                                                        Apache Zookeeper – Highly reliable distributed coordination service
                                    Languages / Compilers
          Data                                   APACHE PIG, APACHE HIVE       Fast                     Apache Flume* – Distributed service for collecting and aggregating
                                                                            Read/Write                   log and event data
      Integration
                                                                             Access
                                                                                                        Apache Whirr* – Library for running Hadoop in the cloud
      APACHE FLUME,
      APACHE SQOOP                                                           APACHE HBASE               Apache Sqoop* – Integrating Hadoop with RDBMS
                                                                                                        Apache Oozie* – Server-based workflow engine for Hadoop Activities
                                           Coordination
                                                                           APACHE ZOOKEEPER             Fuse-DFS – Module within Hadoop for mounting HDFS as a traditional
                                                                                                         file system
                                                                                                        Hue – Browser-based desktop interface for interacting with Hadoop
* Currently undergoing Incubation at the Apache Software Foundation.




     5
                                                                              ©2011 Cloudera, Inc. All Rights Reserved.
Cloudera Enterprise
Cloudera Enterprise makes                                        CLOUDERA ENTERPRISE COMPONENTS
open source Hadoop enterprise-easy

 Simplify and Accelerate Hadoop Deployment
                                                                     Cloudera               Production-Level
                                                                     Manager                    Support
 Reduce Adoption Costs and Risks
 Lower the Cost of Administration
                                                              End-to-End Management          Our Team of Experts On-
 Increase Transparency and Control Over Hadoop                Application for Apache         Call to Help You Meet
                                                                      Hadoop                        Your SLAs
 Leverage the Experience of Our Experts




            EFFECTIVENESS                                                           EFFICIENCY
                  Ensuring You                                                     Enabling You to
    Get Value From Your Hadoop Deployment                               Affordably Run Hadoop in Production




6
                                      ©2011 Cloudera, Inc. All Rights Reserved.
Big Data Intelligence Applications
                                           for Enterprise Data Professionals




                                                   www.karmasphere.com
7 © Karmasphere 2012 All rights reserved
About Karmasphere


  Company
  Pure-play, singularly focused on Big Data
  Intelligence and Analytics on Hadoop and
  NoSQL, in the cloud and on-premise.
  Engineering Expertise
  Hadoop, analytics, web
  analytics, business
  intelligence, visualizations, programming
  languages, compilers, architecture, mathe
  matics, database
  Management Experience
  Google, Yahoo, Ask, Ning, Omniture, BEA,
  Oracle, Sybase, Actuate, Apple, Zend, Intel
  , BMC, Spotfire
8 © Karmasphere 2012 All rights reserved
Karmasphere Mission



                                 Provide an EASY way to find
                           INSIGHTS in Big Data to transform business

                                           Upcoming Skills Shortage
           “By 2018, the United States alone could face a shortage of 140,000
           to 190,000 people with deep analytical skills as well as 1.5 million
           managers and analysts with the know-how to use the analysis of
           big data to make effective decisions”
           “Big Data: the next frontier for innovation, competition and productivity”
           McKinsey, May 2011



9 © Karmasphere 2012 All rights reserved
Karmasphere: Big Data Mining and Analytics ON Hadoop




10 © Karmasphere 2012 All rights reserved
From Zero to Answers in 60 Minutes
                                                                         DEMO


   Our Process                              Marketing Analyst for Retail Chain
   • Access any cloud or on-                  1 Connect to the preconfigured
     premise Cloudera CDH                       Cloudera CDH cluster
   • Assemble and organize                    2 Access our structured point of sale
     unstructured and                           transactions data and bring up
     structured data in                         transactional data for lunch meals
     Hadoop
                                              3 Correlate results with unstructured
   • Analyze the data using
                                                social media data to get some insight
     familiar SQL
                                                on our buyers and buying behavior
                                              4 Infer from these results on
                                                underperforming stores and come up
                                                with an action plan to increase sales
                                                for these stores
11 © Karmasphere 2012 All rights reserved
www.karmasphere.com
12 © Karmasphere 2012 All rights reserved
From Zero to Big Data Answers in Less Than an Hour

       The webinar recording will be made available shortly at:
       • https://www1.gotomeeting.com/register/890391584

       Contact Information:



       • info@cloudera.com
       • 1 (888) 789-1488




       • info@karmasphere.com
       • 1 (650) 292-6100




13 © Karmasphere 2012 All rights reserved
                                            ©2011 Cloudera, Inc. All   13

More Related Content

What's hot

hadoop 101 aug 21 2012 tohug
 hadoop 101 aug 21 2012 tohug hadoop 101 aug 21 2012 tohug
hadoop 101 aug 21 2012 tohugAdam Muise
 
Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Jan Robin
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Cloudera, Inc.
 
Big Data launch Singapore Patrick Buddenbaum
Big Data launch Singapore Patrick BuddenbaumBig Data launch Singapore Patrick Buddenbaum
Big Data launch Singapore Patrick BuddenbaumIntelAPAC
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsHortonworks
 
Couchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeCouchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeDipti Borkar
 
Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_dataxband
 
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisHadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisOW2
 
Analytics on Hadoop
Analytics on HadoopAnalytics on Hadoop
Analytics on HadoopEMC
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You? EMC
 
Novell File Management Suite for Microsoft Active Directory Environments
Novell File Management Suite for Microsoft Active Directory EnvironmentsNovell File Management Suite for Microsoft Active Directory Environments
Novell File Management Suite for Microsoft Active Directory EnvironmentsNovell
 
Control the Chaos with Novell File Reporter
Control the Chaos with Novell File ReporterControl the Chaos with Novell File Reporter
Control the Chaos with Novell File ReporterNovell
 
The 25 Most Promising Open Source Projects
The 25 Most Promising Open Source ProjectsThe 25 Most Promising Open Source Projects
The 25 Most Promising Open Source Projectsaf83
 
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIntel IT Center
 
Pivotal: Virtualize Big Data to Make the Elephant Dance
Pivotal: Virtualize Big Data to Make the Elephant DancePivotal: Virtualize Big Data to Make the Elephant Dance
Pivotal: Virtualize Big Data to Make the Elephant DanceEMC
 
Paris live eddiesatterly_022013
Paris live eddiesatterly_022013Paris live eddiesatterly_022013
Paris live eddiesatterly_022013jenny_splunk
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks
 

What's hot (20)

hadoop 101 aug 21 2012 tohug
 hadoop 101 aug 21 2012 tohug hadoop 101 aug 21 2012 tohug
hadoop 101 aug 21 2012 tohug
 
Monika_Raghuvanshi
Monika_RaghuvanshiMonika_Raghuvanshi
Monika_Raghuvanshi
 
Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI
 
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
Realizing the Promise of Big Data with Hadoop - Cloudera Summer Webinar Serie...
 
Big Data launch Singapore Patrick Buddenbaum
Big Data launch Singapore Patrick BuddenbaumBig Data launch Singapore Patrick Buddenbaum
Big Data launch Singapore Patrick Buddenbaum
 
Introduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for WindowsIntroduction to Hortonworks Data Platform for Windows
Introduction to Hortonworks Data Platform for Windows
 
Couchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = ThreeCouchbase Server and IBM BigInsights: One + One = Three
Couchbase Server and IBM BigInsights: One + One = Three
 
Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_data
 
A Mayo Clinic Big Data Implementation
A Mayo Clinic Big Data ImplementationA Mayo Clinic Big Data Implementation
A Mayo Clinic Big Data Implementation
 
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, ParisHadoop's Role in the Big Data Architecture, OW2con'12, Paris
Hadoop's Role in the Big Data Architecture, OW2con'12, Paris
 
Analytics on Hadoop
Analytics on HadoopAnalytics on Hadoop
Analytics on Hadoop
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
 
Novell File Management Suite for Microsoft Active Directory Environments
Novell File Management Suite for Microsoft Active Directory EnvironmentsNovell File Management Suite for Microsoft Active Directory Environments
Novell File Management Suite for Microsoft Active Directory Environments
 
gfs-sosp2003
gfs-sosp2003gfs-sosp2003
gfs-sosp2003
 
Control the Chaos with Novell File Reporter
Control the Chaos with Novell File ReporterControl the Chaos with Novell File Reporter
Control the Chaos with Novell File Reporter
 
The 25 Most Promising Open Source Projects
The 25 Most Promising Open Source ProjectsThe 25 Most Promising Open Source Projects
The 25 Most Promising Open Source Projects
 
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of ThingsIT @ Intel: Preparing the Future Enterprise with the Internet of Things
IT @ Intel: Preparing the Future Enterprise with the Internet of Things
 
Pivotal: Virtualize Big Data to Make the Elephant Dance
Pivotal: Virtualize Big Data to Make the Elephant DancePivotal: Virtualize Big Data to Make the Elephant Dance
Pivotal: Virtualize Big Data to Make the Elephant Dance
 
Paris live eddiesatterly_022013
Paris live eddiesatterly_022013Paris live eddiesatterly_022013
Paris live eddiesatterly_022013
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2
 

Viewers also liked

My PC
My PCMy PC
My PCmeelo
 
Information technology
Information technologyInformation technology
Information technologyroyaljwalaa
 
BC Bela Cintra (Corretor Leite 99354 8288)
BC Bela Cintra (Corretor Leite 99354 8288)BC Bela Cintra (Corretor Leite 99354 8288)
BC Bela Cintra (Corretor Leite 99354 8288)Leite Corretor
 
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...Fatma ÇINAR
 
3006 0809a P1 Jyang
3006 0809a P1 Jyang3006 0809a P1 Jyang
3006 0809a P1 Jyangdraa
 
Existing campaings research - Poverty
Existing campaings research - PovertyExisting campaings research - Poverty
Existing campaings research - PovertyJamie Kessel
 
Da Design Arte Jardins
Da Design Arte JardinsDa Design Arte Jardins
Da Design Arte JardinsLeite Corretor
 
Biblioteca 2.0: claves para una biblioteca participativa
Biblioteca 2.0: claves para una biblioteca participativaBiblioteca 2.0: claves para una biblioteca participativa
Biblioteca 2.0: claves para una biblioteca participativanatalia.arroyo
 
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)Apresentação You Now Campo Belo (Corretor Leite 99354 8288)
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)Leite Corretor
 
Cartas de patrocinio
Cartas de patrocinioCartas de patrocinio
Cartas de patrocinioJavi Bilbao
 
Filing Documents
Filing DocumentsFiling Documents
Filing Documentsbeatriz0889
 
Totara Startup Training
Totara Startup TrainingTotara Startup Training
Totara Startup TrainingJames Gale
 

Viewers also liked (14)

My PC
My PCMy PC
My PC
 
Information technology
Information technologyInformation technology
Information technology
 
BC Bela Cintra (Corretor Leite 99354 8288)
BC Bela Cintra (Corretor Leite 99354 8288)BC Bela Cintra (Corretor Leite 99354 8288)
BC Bela Cintra (Corretor Leite 99354 8288)
 
Microgravidade
MicrogravidadeMicrogravidade
Microgravidade
 
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...
NUTS AND SECTORAL LOANS DEFAULT CHART OF TURKEY: Graphical Data-Mining Analys...
 
3006 0809a P1 Jyang
3006 0809a P1 Jyang3006 0809a P1 Jyang
3006 0809a P1 Jyang
 
W
WW
W
 
Existing campaings research - Poverty
Existing campaings research - PovertyExisting campaings research - Poverty
Existing campaings research - Poverty
 
Da Design Arte Jardins
Da Design Arte JardinsDa Design Arte Jardins
Da Design Arte Jardins
 
Biblioteca 2.0: claves para una biblioteca participativa
Biblioteca 2.0: claves para una biblioteca participativaBiblioteca 2.0: claves para una biblioteca participativa
Biblioteca 2.0: claves para una biblioteca participativa
 
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)Apresentação You Now Campo Belo (Corretor Leite 99354 8288)
Apresentação You Now Campo Belo (Corretor Leite 99354 8288)
 
Cartas de patrocinio
Cartas de patrocinioCartas de patrocinio
Cartas de patrocinio
 
Filing Documents
Filing DocumentsFiling Documents
Filing Documents
 
Totara Startup Training
Totara Startup TrainingTotara Startup Training
Totara Startup Training
 

Similar to Webinar | From Zero to Big Data Answers in Less Than an Hour – Live Demo Slides

Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopCloudera, Inc.
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...Cloudera, Inc.
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Cloudera, Inc.
 
Cloudera Manager Webinar | Cloudera Enterprise 3.7
Cloudera Manager Webinar | Cloudera Enterprise 3.7Cloudera Manager Webinar | Cloudera Enterprise 3.7
Cloudera Manager Webinar | Cloudera Enterprise 3.7Cloudera, Inc.
 
Amr Awadallah, unSEXY Presentation
Amr Awadallah, unSEXY PresentationAmr Awadallah, unSEXY Presentation
Amr Awadallah, unSEXY Presentation500 Startups
 
Data Science Day New York: The Platform for Big Data
Data Science Day New York: The Platform for Big DataData Science Day New York: The Platform for Big Data
Data Science Day New York: The Platform for Big DataCloudera, Inc.
 
CCD-410 Cloudera Study Material
CCD-410 Cloudera Study MaterialCCD-410 Cloudera Study Material
CCD-410 Cloudera Study MaterialRoxycodone Online
 
The power of hadoop in cloud computing
The power of hadoop in cloud computingThe power of hadoop in cloud computing
The power of hadoop in cloud computingJoey Echeverria
 
Hortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks
 
Hadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersHadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersMrigendra Sharma
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Jonathan Seidman
 
Hadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceHadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceNeev Technologies
 
Ventana Research Presents: Best Practices with Hadoop - Real World Data
Ventana Research Presents:  Best Practices with Hadoop - Real World DataVentana Research Presents:  Best Practices with Hadoop - Real World Data
Ventana Research Presents: Best Practices with Hadoop - Real World DataCloudera, Inc.
 
Apache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondApache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondDataWorks Summit
 
Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondTeradata Aster
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopHortonworks
 

Similar to Webinar | From Zero to Big Data Answers in Less Than an Hour – Live Demo Slides (20)

Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
 
Cloudera Manager Webinar | Cloudera Enterprise 3.7
Cloudera Manager Webinar | Cloudera Enterprise 3.7Cloudera Manager Webinar | Cloudera Enterprise 3.7
Cloudera Manager Webinar | Cloudera Enterprise 3.7
 
Amr Awadallah, unSEXY Presentation
Amr Awadallah, unSEXY PresentationAmr Awadallah, unSEXY Presentation
Amr Awadallah, unSEXY Presentation
 
Data Science Day New York: The Platform for Big Data
Data Science Day New York: The Platform for Big DataData Science Day New York: The Platform for Big Data
Data Science Day New York: The Platform for Big Data
 
Hadoop
HadoopHadoop
Hadoop
 
CCD-410 Cloudera Study Material
CCD-410 Cloudera Study MaterialCCD-410 Cloudera Study Material
CCD-410 Cloudera Study Material
 
The power of hadoop in cloud computing
The power of hadoop in cloud computingThe power of hadoop in cloud computing
The power of hadoop in cloud computing
 
Hortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinar
 
Hadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, ProvidersHadoop Platforms - Introduction, Importance, Providers
Hadoop Platforms - Introduction, Importance, Providers
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013
 
Hadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceHadoop Ecosystem at a Glance
Hadoop Ecosystem at a Glance
 
Ventana Research Presents: Best Practices with Hadoop - Real World Data
Ventana Research Presents:  Best Practices with Hadoop - Real World DataVentana Research Presents:  Best Practices with Hadoop - Real World Data
Ventana Research Presents: Best Practices with Hadoop - Real World Data
 
Apache Hadoop Now Next and Beyond
Apache Hadoop Now Next and BeyondApache Hadoop Now Next and Beyond
Apache Hadoop Now Next and Beyond
 
Hadoop - Now, Next and Beyond
Hadoop - Now, Next and BeyondHadoop - Now, Next and Beyond
Hadoop - Now, Next and Beyond
 
Azure Big data
Azure Big data Azure Big data
Azure Big data
 
Hadoop in a Nutshell
Hadoop in a NutshellHadoop in a Nutshell
Hadoop in a Nutshell
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache Hadoop
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 

Recently uploaded (20)

AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 

Webinar | From Zero to Big Data Answers in Less Than an Hour – Live Demo Slides

  • 1. March 13, 2011 From Zero to Big Data Answers in Less Than an Hour Daniel Templeton | Cloudera Manager, Partner Program Adoption Richard Guth | Karmasphere Chief Marketing Officer
  • 2. The ‘Big Data’ Phenomenon Big Data Drivers: More Content More Devices  The proliferation of data capture and creation technologies  Increased “interconnectedness” drives consumption (creating more data) More New & Better Consumption Information  Inexpensive storage makes it possible to keep more, longer  Innovative software and analysis tools turn data into information  Every gigabyte of stored content can generate Big Data encompasses not a petabyte or more of transient data* only the content itself, but how it’s consumed.  The information about you is much greater than the information you create *Source: IDC 2011 2 ©2011 Cloudera, Inc. All Rights Reserved.
  • 3. What is Apache Hadoop? CORE HADOOP COMPONENTS Apache Hadoop is a platform for data storage and processing that is… Hadoop Distributed File  Scalable System (HDFS) MapReduce  Fault tolerant  Open source File Sharing & Data Protection Across Distributed Computing Across Physical Servers Physical Servers Has the Flexibility to Store Excels at Scales and Mine Any Type of Data Processing Complex Data Economically  Ask questions across structured and  Scale-out architecture divides  Can be deployed on commodity unstructured data that were previously workloads across multiple nodes hardware impossible to ask or solve  Flexible file system eliminates ETL  Open source platform guards  Not bound by a single schema bottlenecks against vendor lock 3 ©2011 Cloudera, Inc. All Rights Reserved.
  • 4. Who Is Cloudera? The trusted leader in We make Hadoop Unrivaled knowledge Strong executive Apache Hadoop. enterprise-easy. and experience. team with proven abilities.  Package the #1  A distribution of Apache  Founders, committers and distribution of Apache Hadoop that is contributors to Apache Mike Olson Amr Awadallah Hadoop in commercial and tested, certified and Hadoop and related CEO VP, Engineering non-commercial supported projects Kirk Dunn Mary Rorabaugh environments COO VP, Finance  A suite of management  A wealth of experience in Jeff Charles  Roadmap control or software for Hadoop the design and delivery of Hammerbacher Zedlewski Chief Scientist VP, Products influence over all Apache administrators enterprise software Doug Cutting Omer Trajman Hadoop-related projects Chief Architect VP, Customer  Training and certification Solutions  Top contributor to the programs Apache ecosystem overall  Comprehensive support  Tens of thousands of nodes and consulting services under management 4 ©2011 Cloudera, Inc. All Rights Reserved.
  • 5. CDH Overview The #1 commercial and non-commercial Apache Hadoop distribution. Complete, Integrated Hadoop Stack CDH Components  Apache Hadoop – reliable, scalable distributed computing File System Mount UI Framework SDK FUSE-DFS HUE HUE SDK  Apache Hive – SQL-like language and metadata repository  Apache Pig – High level language for expressing data analysis programs Workflow Scheduling Metadata APACHE OOZIE APACHE OOZIE APACHE HIVE  Apache HBase – Hadoop database for random, real-time read/write access  Apache Zookeeper – Highly reliable distributed coordination service Languages / Compilers Data APACHE PIG, APACHE HIVE Fast  Apache Flume* – Distributed service for collecting and aggregating Read/Write log and event data Integration Access  Apache Whirr* – Library for running Hadoop in the cloud APACHE FLUME, APACHE SQOOP APACHE HBASE  Apache Sqoop* – Integrating Hadoop with RDBMS  Apache Oozie* – Server-based workflow engine for Hadoop Activities Coordination APACHE ZOOKEEPER  Fuse-DFS – Module within Hadoop for mounting HDFS as a traditional file system  Hue – Browser-based desktop interface for interacting with Hadoop * Currently undergoing Incubation at the Apache Software Foundation. 5 ©2011 Cloudera, Inc. All Rights Reserved.
  • 6. Cloudera Enterprise Cloudera Enterprise makes CLOUDERA ENTERPRISE COMPONENTS open source Hadoop enterprise-easy  Simplify and Accelerate Hadoop Deployment Cloudera Production-Level Manager Support  Reduce Adoption Costs and Risks  Lower the Cost of Administration End-to-End Management Our Team of Experts On-  Increase Transparency and Control Over Hadoop Application for Apache Call to Help You Meet Hadoop Your SLAs  Leverage the Experience of Our Experts EFFECTIVENESS EFFICIENCY Ensuring You Enabling You to Get Value From Your Hadoop Deployment Affordably Run Hadoop in Production 6 ©2011 Cloudera, Inc. All Rights Reserved.
  • 7. Big Data Intelligence Applications for Enterprise Data Professionals www.karmasphere.com 7 © Karmasphere 2012 All rights reserved
  • 8. About Karmasphere Company Pure-play, singularly focused on Big Data Intelligence and Analytics on Hadoop and NoSQL, in the cloud and on-premise. Engineering Expertise Hadoop, analytics, web analytics, business intelligence, visualizations, programming languages, compilers, architecture, mathe matics, database Management Experience Google, Yahoo, Ask, Ning, Omniture, BEA, Oracle, Sybase, Actuate, Apple, Zend, Intel , BMC, Spotfire 8 © Karmasphere 2012 All rights reserved
  • 9. Karmasphere Mission Provide an EASY way to find INSIGHTS in Big Data to transform business Upcoming Skills Shortage “By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions” “Big Data: the next frontier for innovation, competition and productivity” McKinsey, May 2011 9 © Karmasphere 2012 All rights reserved
  • 10. Karmasphere: Big Data Mining and Analytics ON Hadoop 10 © Karmasphere 2012 All rights reserved
  • 11. From Zero to Answers in 60 Minutes DEMO Our Process Marketing Analyst for Retail Chain • Access any cloud or on- 1 Connect to the preconfigured premise Cloudera CDH Cloudera CDH cluster • Assemble and organize 2 Access our structured point of sale unstructured and transactions data and bring up structured data in transactional data for lunch meals Hadoop 3 Correlate results with unstructured • Analyze the data using social media data to get some insight familiar SQL on our buyers and buying behavior 4 Infer from these results on underperforming stores and come up with an action plan to increase sales for these stores 11 © Karmasphere 2012 All rights reserved
  • 12. www.karmasphere.com 12 © Karmasphere 2012 All rights reserved
  • 13. From Zero to Big Data Answers in Less Than an Hour The webinar recording will be made available shortly at: • https://www1.gotomeeting.com/register/890391584 Contact Information: • info@cloudera.com • 1 (888) 789-1488 • info@karmasphere.com • 1 (650) 292-6100 13 © Karmasphere 2012 All rights reserved ©2011 Cloudera, Inc. All 13