SlideShare a Scribd company logo
1 of 43
Download to read offline
Pivotal: Hadoop for
Powerful Processing of
Unstructured Data for
Valuable Insights
SK Krishnamurthy

skrishnamurthy@gopivotal.com

© Copyright 2013 EMC Corporation. All rights reserved.

1
Traditional Enterprise Analytics Process

© Copyright 2013 EMC Corporation. All rights reserved.

2
The Fundamental Paradigm Shift
 Internet age and exploding data growth
 Enterprises leverage new data sources to identify
emerging trends and opportunities
 Traditional database tools not able to cope

© Copyright 2013 EMC Corporation. All rights reserved.

3
Enter Hadoop
 Flexible
 Scalable
 Inexpensive

Platform for Big

 Fault-tolerant

Data

 Rapidly Adopted

© Copyright 2013 EMC Corporation. All rights reserved.

4
Evolution of Process with Hadoop

© Copyright 2013 EMC Corporation. All rights reserved.

5
HDFS Economics Have Changed the Game
Big Data Platform Price/TB

$80,000

Big Data RDBMS pricing will
ultimately converge with
Hadoop pricing

$60,000
The price per TB of Big Data RDMBS has
been consistently eroding over time.

$40,000

Hadoop pricing has increased slightly over
time as vendors have injected value added
services into the ecosystem.

$20,000

$-

2008

2009

2010
Big Data DB

© Copyright 2013 EMC Corporation. All rights reserved.

2011

2012

2013

Hadoop

6
Where We’re Going

© Copyright 2013 EMC Corporation. All rights reserved.
© Copyright 2013 Pivotal. All rights reserved.

7
Big Data Platform
Pivotal Data Platform
Stream
Ingestion
Streaming Services

Data Staging
Platform
Data Mgmt. Services

Operational
Intelligence

Run-Time
Applications

In-Memory DB

Analytical Query

In-Memory
Objects

HDFS
Enterprise Data
Warehouse
RDBMS

© Copyright 2013 EMC Corporation. All rights reserved.

Continues to
serve as system
of record

Traditional
BI/Reporting
Data Visualization

Compliance
and financial
reporting

8
Flexible Deployment Model
Portable
Elastic
HW Abstracted
Manageable
“Consumer” grade

deploy

Private Cloud

© Copyright 2013 EMC Corporation. All rights reserved.

On Premise

Public Cloud

9
PIVOTAL HD
The world’s most powerful
Hadoop distribution

© Copyright 2013 EMC Corporation. All rights reserved.

10
Pivotal HD
 World’s first true SQL processing for enterprise-ready
Hadoop
 100% Apache Hadoop-based platform
 Virtualization and cloud ready with VMWare and Isilon
 Scale tested in 1000 node Pivotal Analytics Workbench
 Available as a software-only or appliance-based solution
 Backed by EMC’s global, 24x7 support infrastructure

© Copyright 2013 EMC Corporation. All rights reserved.

11
Pivotal Hadoop Distributions
GPHD

Pivotal HD

Apache Hadoop 1.x

Apache Hadoop 2.x

100% Open Source Compatible

© Copyright 2013 EMC Corporation. All rights reserved.

12
Pivotal HD Components
• HDFS – The Hadoop Distributed File
System acts as the storage layer for
Hadoop

• Pig – High-level procedural language
for data pipeline/data flow processing in
Hadoop

• MapReduce – Parallel processing
framework used for data computation in
Hadoop

• HBase – NoSQL, key-value data store
on top of HDFS

• Hive – Structured, data warehouse
implementation for data in HDFS that
provides a SQL-like interface to Hadoop

© Copyright 2013 EMC Corporation. All rights reserved.

• Mahout – Library of scalable machinelearning Algorithms
• Spring Hadoop – Integrates the
Spring framework into Hadoop

13
Pivotal HD Value-Added Components
GPHD Includes…
• Installation and Configuration Manager (ICM)
– cluster installation, upgrade, and expansion tools.
• GP Command Center – visual interface for cluster
health, system metrics, and job monitoring.
• Hadoop Virtualization Extension (HVE) –
enhances Hadoop to support virtual node
awareness and enables greater cluster elasticity.
• GP Data Loader – parallel loading infrastructure
that supports “line speed” data loading into HDFS.

Pivotal HD Adds the Following to GPHD…
• Advanced Database Services (HAWQ)– highperformance, “True SQL” query interface running
within the Hadoop cluster.
• Extensions Framework (GPXF) – support for HAWQ
interfaces on external data providers (HBase, Avro,
etc.).
• Advanced Analytics Functions (MADLib) – ability
to access parallelized machine-learning and datamining functions at scale.

• Isilon Integration – extensively tested at scale
with guidelines for compute-heavy, storage-heavy,
and balanced configurations.

© Copyright 2013 EMC Corporation. All rights reserved.

14
Pivotal Core Components & Versions
GPHD 1.2 Core Distribution

Pivotal HD Enterprise

Component

Version

Component

Version

Hadoop

1.0.3

Hadoop

2.0.2

HBase

0.92.1

HBase

0.94.2

Hive

0.8.1

Hive

0.9.1

Mahout

0.6

Mahout

0.8.0

Pig

0.9.2

Pig

0.10.0

Zookeeper

3.3.5

Zookeeper

3.4.3

Flume

1.2.0

Flume

1.2.0

Sqoop

1.4.1

Sqoop

1.4.1

Spring Hadoop

© Copyright 2013 EMC Corporation. All rights reserved.

Spring Hadoop

15
Pivotal HD Architecture

Resource
Management
& Workflow

Pig, Hive,
Mahout

HBase

Map Reduce

Yarn

HDFS

Zookeeper
Sqoop

Flume

Apache

© Copyright 2013 EMC Corporation. All rights reserved.

16
Pivotal HD Architecture
Pivotal HD
Enterprise
Resource
Management
& Workflow

Pig, Hive,
Mahout

HBase

Map Reduce
Hadoop Virtualization (HVE)

Yarn

HDFS

Zookeeper
Sqoop

Apache

© Copyright 2013 EMC Corporation. All rights reserved.

Data Loader

Deploy,
Configure,
Monitor,
Manage

Command
Flume

Center

Pivotal HD Enterprise

17
Pivotal HD Architecture
HAWQ– Advanced
Database Services
ANSI SQL + Analytics

Pivotal HD
Enterprise
Resource
Management
& Workflow

Xtension
Framework
HBase

Query
Optimizer

Dynamic Pipelining

Pig, Hive,
Mahout
Map Reduce

Hadoop Virtualization (HVE)

Yarn

HDFS

Zookeeper
Sqoop

Apache

© Copyright 2013 EMC Corporation. All rights reserved.

Catalog
Services

Command
Center
Flume

Data Loader

Pivotal HD Enterprise

Deploy,
Configure,
Monitor,
Manage

HAWQ

18
DataLoader
Streams

DataLoader

Pull
Push

Web GUI and CLI

Connectors

Flume
Files

Data Source
Registration

Job
Management

Data
Destination
Registration

Copy
Strategy
Optimization

Data
Processing

Data Copy

HDFS

HDFS
NFS
HTTP
FTP
Local

© Copyright 2013 EMC Corporation. All rights reserved.

REST APIs

.
.

19
Command Center

Simple and complete cluster management
 Install and configure Hadoop
components and services
 Centralized interface for Pivotal
HD cluster monitoring,
diagnostics, and management
 Live and historical Hadoop
system metrics analysis

© Copyright 2013 EMC Corporation. All rights reserved.

Deploy

Configure

Analyze

Monitor

Manage

20
Command Center – Monitor, Manage,
and Analyze
 Host, application, and job level
monitoring across the entire
Pivotal HD cluster performance
 Visualize and analyze live and
historical Hadoop cluster
information through Command
Center Dashboard
 Quick diagnostics of functional
or performance issue

© Copyright 2013 EMC Corporation. All rights reserved.

21
Hadoop Virtualization Extensions (HVE)
• HVE enables Hadoop to support more effective virtual deployments
• This creates the opportunity to provision and scale the compute and storage
processes independently resulting in:
• Much better resource utilization
• Improved resource allocation and consumption
• Support Multi-Tenancy

© Copyright 2013 EMC Corporation. All rights reserved.

22
HAWQ
© Copyright 2013 EMC Corporation. All rights reserved.
© Copyright 2013 Pivotal. All rights reserved.

23
23
HAWQ: The Crown Jewels of Greenplum
 SQL compliant
 World-class query optimizer
 Interactive query
 Horizontal scalability
 Robust data management
 Common Hadoop formats
 Deep analytics
© Copyright 2013 EMC Corporation. All rights reserved.

24
HAWQ

High-Performance Query Processing

 Interactive and true ANSI SQL support
 Multi-petabyte horizontal scalability
 Cost-based parallel query optimizer
 Programmable analytics

© Copyright 2013 EMC Corporation. All rights reserved.

25
HAWQ

Enterprise-Class Database Services & Management

 Scatter-gather data loading
 Row and column storage
 Workload management
 Multi-level partitioning
 3rd-party tool & open client interfaces

© Copyright 2013 EMC Corporation. All rights reserved.

26
HAWQ

Pre-integrated Deep Analytics

 Performance via fully parallelized implementation
 Consistent, user friendly SQL interfaces
 Ease of data preparation
 Pre-integrated MADLib support
– Linear Regression
– Logistic Regression
– Multinomial Logisitic
Regression

© Copyright 2013 EMC Corporation. All rights reserved.

– K-Means
– Association Rules
– PLDA - useful for topic
modeling
27
GPDB – Components
GPDB

Resource Management

Query Engine

Catalog Service

Planner

Optimizer

Executor

Transaction
Manager

© Copyright 2013 EMC Corporation. All rights reserved.

GPXF

Local File System

28
HAWQ – Components

Resource
Management

GPSQL
Query Engine
Planner

Optimizer

Executor

Catalog Service

Transaction
Manager

GPXF

HDFS

© Copyright 2013 EMC Corporation. All rights reserved.

29
How HAWQ Works
Clients

SELECT beer, price
FROM Bars b, Sells s
WHERE b.name = s.bar
AND b.city = ‘San Francisco’

HAWQ Master Host
Query Parser

JDBC/ODBC
SQL Console

Query Optimizer
HDFS Namenode

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HDFS Datanode

HDFS Datanode

HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

...

30
How HAWQ Works
Clients

Optimization
Context
Parse Tree

HAWQ Master Host

Metadata

Query Parser

JDBC/ODBC
SQL Console

Query Optimizer
HDFS Namenode

Cost Model
Resources

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HDFS Datanode

HDFS Datanode

HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

...

31
How HAWQ Works

Execution Plan

Clients

HAWQ Master Host
Query Parser

JDBC/ODBC
SQL Console

Query Optimizer
HDFS Namenode

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HDFS Datanode

HDFS Datanode

HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

...

32
How HAWQ Works
Clients

HAWQ Master Host
Query Parser

JDBC/ODBC
SQL Console

Query Optimizer
HDFS Namenode

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HDFS Datanode

HDFS Datanode

HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

...

33
How HAWQ Works
Clients

HAWQ Master Host
Query Parser

JDBC/ODBC

Query Optimizer

SQL Console

HAWQ Segment
Host
Query Executor

HDFS Namenode

HAWQ Segment
Host
Query Executor

D y n a m i c
HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

HAWQ Segment
Host
Query Executor

P i p e l i n i n g ™

HDFS Datanode

...

HDFS Datanode

34
How HAWQ Works
Clients

HAWQ Master Host
Query Parser

JDBC/ODBC
SQL Console

Query Optimizer
HDFS Namenode

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HAWQ Segment
Host
Query Executor

HDFS Datanode

HDFS Datanode

HDFS Datanode

© Copyright 2013 EMC Corporation. All rights reserved.

...

35
HAWQ Deployment
ODBC/JDBC Driver

Master
Servers & Name
Nodes

...

...

Query planning & dispatch

Dynamic
Pipelining

Segment
Servers & Data
Nodes

...

Query processing &
data storage

...
HDFS

External
Sources
Loading,
streaming, etc.

© Copyright 2013 EMC Corporation. All rights reserved.

36
Xtension Framework
 An advanced version of GPDB
external tables
 Enables combining HAWQ data
and Hadoop data in single query
Xtension Framework

HDFS

HBase

© Copyright 2013 EMC Corporation. All rights reserved.

Hive

 Supports connectors for HDFS,
Hbase and Hive
 Provides extensible framework
API to enable custom connector
development for other data
sources
37
HAWQ Benchmarks

User intelligence

4.2

198

47X

Sales analysis

8.7

161

19X

Click analysis

2.0

415

208X

Data exploration

2.7

1,285

476X

BI drill down

2.8

1,815

648X

© Copyright 2013 EMC Corporation. All rights reserved.

38
Pivotal Analytics Workbench (AWB)
Commitment to Accelerating Innovation &
Contributing to the Apache Community
• Multi-million dollar investment by Pivotal and partners
in a 1,000-node, 24-Petabyte cluster to facilitate
innovation and conduct regular integration/scale
testing of Apache Hadoop
• Full-time, dedicated integration onboarding projects
and validating each release of Apache Hadoop at-scale
• Contributing back our results and findings to the open
source community as well as incorporating them into
the continued development of Pivotal HD
© Copyright 2013 EMC Corporation. All rights reserved.

39
“Real” Hadoop Cluster

© Copyright 2013 EMC Corporation. All rights reserved.

40
Leveraging Full Power of the Family

© Copyright 2013 EMC Corporation. All rights reserved.

41
Pivotal Sessions at EMC World
Session

Presenter

Dates/Times

The Pivotal Platform: A Purpose-Built Platform for Big-DataDriven Applications

Josh Klahr

Tue 5:30 - 6:30, Palazzo E Wed
11:30 - 12:30, Delfino 4005

Pivotal: Data Scientists on the Front Line: Examples of
Data Science in Action

Noelle Sio

Tue 10:00 - 11:00, Lando 4205
Thu 8:30 - 9:30, Palazzo F

Pivotal: Operationalizing 1000-node Hadoop Cluster –
Analytics Workbench

Clinton Ooi
Bhavin Modi

Tue 11:30 - 12:30, Palazzo L Thu
10:00- 11:00 am, Delfino 4001A

Pivotal: for Powerful Processing of Unstructured Data For
Valuable Insights

SK
Krishnamurthy

Mon 4:00 - 5:00, Lando 4201 A
Tue 4:00 - 5:00, Palazzo M

Pivotal: Big & Fast data – merging real-time data and deep
analytics

Michael
Crutcher

Mon 1:00 - 2:00, Lando 4201 A
Wed 10:00 - 11:00, Palazzo M

Pivotal: Virtualize Big Data to Make The Elephant Dance

June Yang
Dan Baskette

Mon 11:30 - 12:30, Marcello
4401A Wed 4:00 - 5:00, Palazzo
E

Hadoop Design Patterns

Don Miner

Mon 2:30 - 3:30, Palazzo F Wed
8:30 - 9:30, Delfino 4005

© Copyright 2013 EMC Corporation. All rights reserved.

42
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Insights

More Related Content

What's hot

Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud? Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud? DataWorks Summit
 
The Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricThe Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricDataWorks Summit
 
Carpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP HavenCarpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP HavenDataWorks Summit
 
Enabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARNEnabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARNDataWorks Summit
 
Best Practices for Virtualizing Apache Hadoop
Best Practices for Virtualizing Apache HadoopBest Practices for Virtualizing Apache Hadoop
Best Practices for Virtualizing Apache HadoopHortonworks
 
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...DataWorks Summit
 
The TCO Calculator - Estimate the True Cost of Hadoop
The TCO Calculator - Estimate the True Cost of Hadoop The TCO Calculator - Estimate the True Cost of Hadoop
The TCO Calculator - Estimate the True Cost of Hadoop MapR Technologies
 
Cloudy with a Chance of Hadoop - Real World Considerations
Cloudy with a Chance of Hadoop - Real World ConsiderationsCloudy with a Chance of Hadoop - Real World Considerations
Cloudy with a Chance of Hadoop - Real World ConsiderationsDataWorks Summit/Hadoop Summit
 
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing DataWorks Summit
 
Is Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopIs Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopDataWorks Summit
 
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...DataWorks Summit
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersDataWorks Summit
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopCloudera, Inc.
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsDataWorks Summit
 
Greenplum Database Overview
Greenplum Database Overview Greenplum Database Overview
Greenplum Database Overview EMC
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Hortonworks
 

What's hot (20)

5. pivotal hd 2013
5. pivotal hd 20135. pivotal hd 2013
5. pivotal hd 2013
 
Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud? Where to Deploy Hadoop: Bare Metal or Cloud?
Where to Deploy Hadoop: Bare Metal or Cloud?
 
The Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data CentricThe Car of the Future - Autonomous, Connected, and Data Centric
The Car of the Future - Autonomous, Connected, and Data Centric
 
Carpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP HavenCarpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP Haven
 
Enabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARNEnabling Diverse Workload Scheduling in YARN
Enabling Diverse Workload Scheduling in YARN
 
Best Practices for Virtualizing Apache Hadoop
Best Practices for Virtualizing Apache HadoopBest Practices for Virtualizing Apache Hadoop
Best Practices for Virtualizing Apache Hadoop
 
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
 
The TCO Calculator - Estimate the True Cost of Hadoop
The TCO Calculator - Estimate the True Cost of Hadoop The TCO Calculator - Estimate the True Cost of Hadoop
The TCO Calculator - Estimate the True Cost of Hadoop
 
Empower Hive with Spark
Empower Hive with SparkEmpower Hive with Spark
Empower Hive with Spark
 
Cloudy with a Chance of Hadoop - Real World Considerations
Cloudy with a Chance of Hadoop - Real World ConsiderationsCloudy with a Chance of Hadoop - Real World Considerations
Cloudy with a Chance of Hadoop - Real World Considerations
 
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
 
Is Cloud a right Companion for Hadoop
Is Cloud a right Companion for HadoopIs Cloud a right Companion for Hadoop
Is Cloud a right Companion for Hadoop
 
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on Hadoop
 
Hadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the expertsHadoop in the cloud – The what, why and how from the experts
Hadoop in the cloud – The what, why and how from the experts
 
Hive Now Sparks
Hive Now SparksHive Now Sparks
Hive Now Sparks
 
Greenplum Database Overview
Greenplum Database Overview Greenplum Database Overview
Greenplum Database Overview
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014Developing YARN Applications - Integrating natively to YARN July 24 2014
Developing YARN Applications - Integrating natively to YARN July 24 2014
 

Viewers also liked

Recessions graphs etc
Recessions graphs etcRecessions graphs etc
Recessions graphs etcTravis Klein
 
Insaat kursu-kucukcekmece
Insaat kursu-kucukcekmeceInsaat kursu-kucukcekmece
Insaat kursu-kucukcekmecesersld54
 
Broadview Mortgage Long Beach - California Extra Credit Teacher Program
Broadview Mortgage Long Beach - California Extra Credit Teacher ProgramBroadview Mortgage Long Beach - California Extra Credit Teacher Program
Broadview Mortgage Long Beach - California Extra Credit Teacher ProgramBroadview Mortgage Long Beach
 
International trade
International tradeInternational trade
International tradeTravis Klein
 
Visibility & Security for the Virtualized Enterprise
Visibility & Security for the Virtualized EnterpriseVisibility & Security for the Virtualized Enterprise
Visibility & Security for the Virtualized EnterpriseEMC
 
2 shifts to demand
2   shifts to demand2   shifts to demand
2 shifts to demandTravis Klein
 
Analyst Report : How to Ride the Post-PC End User Computing Wave
Analyst Report : How to Ride the Post-PC End User Computing Wave Analyst Report : How to Ride the Post-PC End User Computing Wave
Analyst Report : How to Ride the Post-PC End User Computing Wave EMC
 
Netwerk Bewust Verbruiken, Sharing and Repairing
Netwerk Bewust Verbruiken, Sharing and RepairingNetwerk Bewust Verbruiken, Sharing and Repairing
Netwerk Bewust Verbruiken, Sharing and RepairingNetwerk Bewust Verbruiken
 
Insaat kursu-eskisehir
Insaat kursu-eskisehirInsaat kursu-eskisehir
Insaat kursu-eskisehirsersld54
 
Fri papal response to luther
Fri papal response to lutherFri papal response to luther
Fri papal response to lutherTravis Klein
 
Capitulo #1 de computación en la nube
Capitulo #1 de computación en la nubeCapitulo #1 de computación en la nube
Capitulo #1 de computación en la nubeMario Trochez Romero
 
Mobile Broadband For Everyone
Mobile Broadband For Everyone Mobile Broadband For Everyone
Mobile Broadband For Everyone Rene Summer
 
10 months with a 1-to-1 iPad deployment
10 months with a 1-to-1 iPad deployment10 months with a 1-to-1 iPad deployment
10 months with a 1-to-1 iPad deploymentTech with Intent
 
SBIC Report : Transforming Information Security: Future-Proofing Processes
SBIC Report : Transforming Information Security: Future-Proofing ProcessesSBIC Report : Transforming Information Security: Future-Proofing Processes
SBIC Report : Transforming Information Security: Future-Proofing ProcessesEMC
 
มลพิษทางอากาศ1
มลพิษทางอากาศ1มลพิษทางอากาศ1
มลพิษทางอากาศ1Kyjung Seekwang
 
Diaporama Organoleptic analysis
Diaporama Organoleptic analysisDiaporama Organoleptic analysis
Diaporama Organoleptic analysisjean-marc.bayle
 
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...Dr. Raju M. Mathew
 
Conduct monetary policy
Conduct monetary policyConduct monetary policy
Conduct monetary policyTravis Klein
 

Viewers also liked (20)

Recessions graphs etc
Recessions graphs etcRecessions graphs etc
Recessions graphs etc
 
Insaat kursu-kucukcekmece
Insaat kursu-kucukcekmeceInsaat kursu-kucukcekmece
Insaat kursu-kucukcekmece
 
Broadview Mortgage Long Beach - California Extra Credit Teacher Program
Broadview Mortgage Long Beach - California Extra Credit Teacher ProgramBroadview Mortgage Long Beach - California Extra Credit Teacher Program
Broadview Mortgage Long Beach - California Extra Credit Teacher Program
 
International trade
International tradeInternational trade
International trade
 
Visibility & Security for the Virtualized Enterprise
Visibility & Security for the Virtualized EnterpriseVisibility & Security for the Virtualized Enterprise
Visibility & Security for the Virtualized Enterprise
 
2 shifts to demand
2   shifts to demand2   shifts to demand
2 shifts to demand
 
Analyst Report : How to Ride the Post-PC End User Computing Wave
Analyst Report : How to Ride the Post-PC End User Computing Wave Analyst Report : How to Ride the Post-PC End User Computing Wave
Analyst Report : How to Ride the Post-PC End User Computing Wave
 
Netwerk Bewust Verbruiken, Sharing and Repairing
Netwerk Bewust Verbruiken, Sharing and RepairingNetwerk Bewust Verbruiken, Sharing and Repairing
Netwerk Bewust Verbruiken, Sharing and Repairing
 
Insaat kursu-eskisehir
Insaat kursu-eskisehirInsaat kursu-eskisehir
Insaat kursu-eskisehir
 
Fri papal response to luther
Fri papal response to lutherFri papal response to luther
Fri papal response to luther
 
Capitulo #1 de computación en la nube
Capitulo #1 de computación en la nubeCapitulo #1 de computación en la nube
Capitulo #1 de computación en la nube
 
Mobile Broadband For Everyone
Mobile Broadband For Everyone Mobile Broadband For Everyone
Mobile Broadband For Everyone
 
10 months with a 1-to-1 iPad deployment
10 months with a 1-to-1 iPad deployment10 months with a 1-to-1 iPad deployment
10 months with a 1-to-1 iPad deployment
 
SBIC Report : Transforming Information Security: Future-Proofing Processes
SBIC Report : Transforming Information Security: Future-Proofing ProcessesSBIC Report : Transforming Information Security: Future-Proofing Processes
SBIC Report : Transforming Information Security: Future-Proofing Processes
 
Wed thurs reform
Wed thurs reformWed thurs reform
Wed thurs reform
 
มลพิษทางอากาศ1
มลพิษทางอากาศ1มลพิษทางอากาศ1
มลพิษทางอากาศ1
 
Presentation1
Presentation1Presentation1
Presentation1
 
Diaporama Organoleptic analysis
Diaporama Organoleptic analysisDiaporama Organoleptic analysis
Diaporama Organoleptic analysis
 
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...
TECHNIQUES TO KEEP MEN AND WOMEN YOUNG, SEXY AND HAPPY FOR PEACE AND PROSPERI...
 
Conduct monetary policy
Conduct monetary policyConduct monetary policy
Conduct monetary policy
 

Similar to Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Insights

Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14John Sing
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeNicolas Morales
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseCloudera, Inc.
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecasesudhakara st
 
2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverview2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverviewjdijcks
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHitendra Kumar
 
Talend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewTalend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewRajan Kanitkar
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoopmarkgrover
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleHarald Erb
 
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformEMC
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanJim Kaskade
 
Get started with hadoop hive hive ql languages
Get started with hadoop hive hive ql languagesGet started with hadoop hive hive ql languages
Get started with hadoop hive hive ql languagesJanBask Training
 
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
 How to use Hadoop for operational and transactional purposes by RODRIGO MERI... How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...Big Data Spain
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
How pig and hadoop fit in data processing architecture
How pig and hadoop fit in data processing architectureHow pig and hadoop fit in data processing architecture
How pig and hadoop fit in data processing architectureKovid Academy
 
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...Amazon Web Services
 
Pivotal HD and Spring for Apache Hadoop
Pivotal HD and Spring for Apache HadoopPivotal HD and Spring for Apache Hadoop
Pivotal HD and Spring for Apache Hadoopmarklpollack
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionAppfluent Technology
 

Similar to Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Insights (20)

Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14
 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor Landscape
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data Warehouse
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecase
 
2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverview2013 05 Oracle big_dataapplianceoverview
2013 05 Oracle big_dataapplianceoverview
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log Processing
 
Hadoop in a Nutshell
Hadoop in a NutshellHadoop in a Nutshell
Hadoop in a Nutshell
 
Talend Big Data Capabilities Overview
Talend Big Data Capabilities OverviewTalend Big Data Capabilities Overview
Talend Big Data Capabilities Overview
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoop
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by Example
 
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platformPivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
Pivotal deep dive_on_pivotal_hd_world_class_hdfs_platform
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps Ironfan
 
Hadoop in action
Hadoop in actionHadoop in action
Hadoop in action
 
Get started with hadoop hive hive ql languages
Get started with hadoop hive hive ql languagesGet started with hadoop hive hive ql languages
Get started with hadoop hive hive ql languages
 
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
 How to use Hadoop for operational and transactional purposes by RODRIGO MERI... How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
How to use Hadoop for operational and transactional purposes by RODRIGO MERI...
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
How pig and hadoop fit in data processing architecture
How pig and hadoop fit in data processing architectureHow pig and hadoop fit in data processing architecture
How pig and hadoop fit in data processing architecture
 
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
 
Pivotal HD and Spring for Apache Hadoop
Pivotal HD and Spring for Apache HadoopPivotal HD and Spring for Apache Hadoop
Pivotal HD and Spring for Apache Hadoop
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 

More from EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

More from EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Insights

  • 1. Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Insights SK Krishnamurthy skrishnamurthy@gopivotal.com © Copyright 2013 EMC Corporation. All rights reserved. 1
  • 2. Traditional Enterprise Analytics Process © Copyright 2013 EMC Corporation. All rights reserved. 2
  • 3. The Fundamental Paradigm Shift  Internet age and exploding data growth  Enterprises leverage new data sources to identify emerging trends and opportunities  Traditional database tools not able to cope © Copyright 2013 EMC Corporation. All rights reserved. 3
  • 4. Enter Hadoop  Flexible  Scalable  Inexpensive Platform for Big  Fault-tolerant Data  Rapidly Adopted © Copyright 2013 EMC Corporation. All rights reserved. 4
  • 5. Evolution of Process with Hadoop © Copyright 2013 EMC Corporation. All rights reserved. 5
  • 6. HDFS Economics Have Changed the Game Big Data Platform Price/TB $80,000 Big Data RDBMS pricing will ultimately converge with Hadoop pricing $60,000 The price per TB of Big Data RDMBS has been consistently eroding over time. $40,000 Hadoop pricing has increased slightly over time as vendors have injected value added services into the ecosystem. $20,000 $- 2008 2009 2010 Big Data DB © Copyright 2013 EMC Corporation. All rights reserved. 2011 2012 2013 Hadoop 6
  • 7. Where We’re Going © Copyright 2013 EMC Corporation. All rights reserved. © Copyright 2013 Pivotal. All rights reserved. 7
  • 8. Big Data Platform Pivotal Data Platform Stream Ingestion Streaming Services Data Staging Platform Data Mgmt. Services Operational Intelligence Run-Time Applications In-Memory DB Analytical Query In-Memory Objects HDFS Enterprise Data Warehouse RDBMS © Copyright 2013 EMC Corporation. All rights reserved. Continues to serve as system of record Traditional BI/Reporting Data Visualization Compliance and financial reporting 8
  • 9. Flexible Deployment Model Portable Elastic HW Abstracted Manageable “Consumer” grade deploy Private Cloud © Copyright 2013 EMC Corporation. All rights reserved. On Premise Public Cloud 9
  • 10. PIVOTAL HD The world’s most powerful Hadoop distribution © Copyright 2013 EMC Corporation. All rights reserved. 10
  • 11. Pivotal HD  World’s first true SQL processing for enterprise-ready Hadoop  100% Apache Hadoop-based platform  Virtualization and cloud ready with VMWare and Isilon  Scale tested in 1000 node Pivotal Analytics Workbench  Available as a software-only or appliance-based solution  Backed by EMC’s global, 24x7 support infrastructure © Copyright 2013 EMC Corporation. All rights reserved. 11
  • 12. Pivotal Hadoop Distributions GPHD Pivotal HD Apache Hadoop 1.x Apache Hadoop 2.x 100% Open Source Compatible © Copyright 2013 EMC Corporation. All rights reserved. 12
  • 13. Pivotal HD Components • HDFS – The Hadoop Distributed File System acts as the storage layer for Hadoop • Pig – High-level procedural language for data pipeline/data flow processing in Hadoop • MapReduce – Parallel processing framework used for data computation in Hadoop • HBase – NoSQL, key-value data store on top of HDFS • Hive – Structured, data warehouse implementation for data in HDFS that provides a SQL-like interface to Hadoop © Copyright 2013 EMC Corporation. All rights reserved. • Mahout – Library of scalable machinelearning Algorithms • Spring Hadoop – Integrates the Spring framework into Hadoop 13
  • 14. Pivotal HD Value-Added Components GPHD Includes… • Installation and Configuration Manager (ICM) – cluster installation, upgrade, and expansion tools. • GP Command Center – visual interface for cluster health, system metrics, and job monitoring. • Hadoop Virtualization Extension (HVE) – enhances Hadoop to support virtual node awareness and enables greater cluster elasticity. • GP Data Loader – parallel loading infrastructure that supports “line speed” data loading into HDFS. Pivotal HD Adds the Following to GPHD… • Advanced Database Services (HAWQ)– highperformance, “True SQL” query interface running within the Hadoop cluster. • Extensions Framework (GPXF) – support for HAWQ interfaces on external data providers (HBase, Avro, etc.). • Advanced Analytics Functions (MADLib) – ability to access parallelized machine-learning and datamining functions at scale. • Isilon Integration – extensively tested at scale with guidelines for compute-heavy, storage-heavy, and balanced configurations. © Copyright 2013 EMC Corporation. All rights reserved. 14
  • 15. Pivotal Core Components & Versions GPHD 1.2 Core Distribution Pivotal HD Enterprise Component Version Component Version Hadoop 1.0.3 Hadoop 2.0.2 HBase 0.92.1 HBase 0.94.2 Hive 0.8.1 Hive 0.9.1 Mahout 0.6 Mahout 0.8.0 Pig 0.9.2 Pig 0.10.0 Zookeeper 3.3.5 Zookeeper 3.4.3 Flume 1.2.0 Flume 1.2.0 Sqoop 1.4.1 Sqoop 1.4.1 Spring Hadoop © Copyright 2013 EMC Corporation. All rights reserved. Spring Hadoop 15
  • 16. Pivotal HD Architecture Resource Management & Workflow Pig, Hive, Mahout HBase Map Reduce Yarn HDFS Zookeeper Sqoop Flume Apache © Copyright 2013 EMC Corporation. All rights reserved. 16
  • 17. Pivotal HD Architecture Pivotal HD Enterprise Resource Management & Workflow Pig, Hive, Mahout HBase Map Reduce Hadoop Virtualization (HVE) Yarn HDFS Zookeeper Sqoop Apache © Copyright 2013 EMC Corporation. All rights reserved. Data Loader Deploy, Configure, Monitor, Manage Command Flume Center Pivotal HD Enterprise 17
  • 18. Pivotal HD Architecture HAWQ– Advanced Database Services ANSI SQL + Analytics Pivotal HD Enterprise Resource Management & Workflow Xtension Framework HBase Query Optimizer Dynamic Pipelining Pig, Hive, Mahout Map Reduce Hadoop Virtualization (HVE) Yarn HDFS Zookeeper Sqoop Apache © Copyright 2013 EMC Corporation. All rights reserved. Catalog Services Command Center Flume Data Loader Pivotal HD Enterprise Deploy, Configure, Monitor, Manage HAWQ 18
  • 19. DataLoader Streams DataLoader Pull Push Web GUI and CLI Connectors Flume Files Data Source Registration Job Management Data Destination Registration Copy Strategy Optimization Data Processing Data Copy HDFS HDFS NFS HTTP FTP Local © Copyright 2013 EMC Corporation. All rights reserved. REST APIs . . 19
  • 20. Command Center Simple and complete cluster management  Install and configure Hadoop components and services  Centralized interface for Pivotal HD cluster monitoring, diagnostics, and management  Live and historical Hadoop system metrics analysis © Copyright 2013 EMC Corporation. All rights reserved. Deploy Configure Analyze Monitor Manage 20
  • 21. Command Center – Monitor, Manage, and Analyze  Host, application, and job level monitoring across the entire Pivotal HD cluster performance  Visualize and analyze live and historical Hadoop cluster information through Command Center Dashboard  Quick diagnostics of functional or performance issue © Copyright 2013 EMC Corporation. All rights reserved. 21
  • 22. Hadoop Virtualization Extensions (HVE) • HVE enables Hadoop to support more effective virtual deployments • This creates the opportunity to provision and scale the compute and storage processes independently resulting in: • Much better resource utilization • Improved resource allocation and consumption • Support Multi-Tenancy © Copyright 2013 EMC Corporation. All rights reserved. 22
  • 23. HAWQ © Copyright 2013 EMC Corporation. All rights reserved. © Copyright 2013 Pivotal. All rights reserved. 23 23
  • 24. HAWQ: The Crown Jewels of Greenplum  SQL compliant  World-class query optimizer  Interactive query  Horizontal scalability  Robust data management  Common Hadoop formats  Deep analytics © Copyright 2013 EMC Corporation. All rights reserved. 24
  • 25. HAWQ High-Performance Query Processing  Interactive and true ANSI SQL support  Multi-petabyte horizontal scalability  Cost-based parallel query optimizer  Programmable analytics © Copyright 2013 EMC Corporation. All rights reserved. 25
  • 26. HAWQ Enterprise-Class Database Services & Management  Scatter-gather data loading  Row and column storage  Workload management  Multi-level partitioning  3rd-party tool & open client interfaces © Copyright 2013 EMC Corporation. All rights reserved. 26
  • 27. HAWQ Pre-integrated Deep Analytics  Performance via fully parallelized implementation  Consistent, user friendly SQL interfaces  Ease of data preparation  Pre-integrated MADLib support – Linear Regression – Logistic Regression – Multinomial Logisitic Regression © Copyright 2013 EMC Corporation. All rights reserved. – K-Means – Association Rules – PLDA - useful for topic modeling 27
  • 28. GPDB – Components GPDB Resource Management Query Engine Catalog Service Planner Optimizer Executor Transaction Manager © Copyright 2013 EMC Corporation. All rights reserved. GPXF Local File System 28
  • 29. HAWQ – Components Resource Management GPSQL Query Engine Planner Optimizer Executor Catalog Service Transaction Manager GPXF HDFS © Copyright 2013 EMC Corporation. All rights reserved. 29
  • 30. How HAWQ Works Clients SELECT beer, price FROM Bars b, Sells s WHERE b.name = s.bar AND b.city = ‘San Francisco’ HAWQ Master Host Query Parser JDBC/ODBC SQL Console Query Optimizer HDFS Namenode HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HDFS Datanode HDFS Datanode HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. ... 30
  • 31. How HAWQ Works Clients Optimization Context Parse Tree HAWQ Master Host Metadata Query Parser JDBC/ODBC SQL Console Query Optimizer HDFS Namenode Cost Model Resources HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HDFS Datanode HDFS Datanode HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. ... 31
  • 32. How HAWQ Works Execution Plan Clients HAWQ Master Host Query Parser JDBC/ODBC SQL Console Query Optimizer HDFS Namenode HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HDFS Datanode HDFS Datanode HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. ... 32
  • 33. How HAWQ Works Clients HAWQ Master Host Query Parser JDBC/ODBC SQL Console Query Optimizer HDFS Namenode HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HDFS Datanode HDFS Datanode HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. ... 33
  • 34. How HAWQ Works Clients HAWQ Master Host Query Parser JDBC/ODBC Query Optimizer SQL Console HAWQ Segment Host Query Executor HDFS Namenode HAWQ Segment Host Query Executor D y n a m i c HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. HAWQ Segment Host Query Executor P i p e l i n i n g ™ HDFS Datanode ... HDFS Datanode 34
  • 35. How HAWQ Works Clients HAWQ Master Host Query Parser JDBC/ODBC SQL Console Query Optimizer HDFS Namenode HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HAWQ Segment Host Query Executor HDFS Datanode HDFS Datanode HDFS Datanode © Copyright 2013 EMC Corporation. All rights reserved. ... 35
  • 36. HAWQ Deployment ODBC/JDBC Driver Master Servers & Name Nodes ... ... Query planning & dispatch Dynamic Pipelining Segment Servers & Data Nodes ... Query processing & data storage ... HDFS External Sources Loading, streaming, etc. © Copyright 2013 EMC Corporation. All rights reserved. 36
  • 37. Xtension Framework  An advanced version of GPDB external tables  Enables combining HAWQ data and Hadoop data in single query Xtension Framework HDFS HBase © Copyright 2013 EMC Corporation. All rights reserved. Hive  Supports connectors for HDFS, Hbase and Hive  Provides extensible framework API to enable custom connector development for other data sources 37
  • 38. HAWQ Benchmarks User intelligence 4.2 198 47X Sales analysis 8.7 161 19X Click analysis 2.0 415 208X Data exploration 2.7 1,285 476X BI drill down 2.8 1,815 648X © Copyright 2013 EMC Corporation. All rights reserved. 38
  • 39. Pivotal Analytics Workbench (AWB) Commitment to Accelerating Innovation & Contributing to the Apache Community • Multi-million dollar investment by Pivotal and partners in a 1,000-node, 24-Petabyte cluster to facilitate innovation and conduct regular integration/scale testing of Apache Hadoop • Full-time, dedicated integration onboarding projects and validating each release of Apache Hadoop at-scale • Contributing back our results and findings to the open source community as well as incorporating them into the continued development of Pivotal HD © Copyright 2013 EMC Corporation. All rights reserved. 39
  • 40. “Real” Hadoop Cluster © Copyright 2013 EMC Corporation. All rights reserved. 40
  • 41. Leveraging Full Power of the Family © Copyright 2013 EMC Corporation. All rights reserved. 41
  • 42. Pivotal Sessions at EMC World Session Presenter Dates/Times The Pivotal Platform: A Purpose-Built Platform for Big-DataDriven Applications Josh Klahr Tue 5:30 - 6:30, Palazzo E Wed 11:30 - 12:30, Delfino 4005 Pivotal: Data Scientists on the Front Line: Examples of Data Science in Action Noelle Sio Tue 10:00 - 11:00, Lando 4205 Thu 8:30 - 9:30, Palazzo F Pivotal: Operationalizing 1000-node Hadoop Cluster – Analytics Workbench Clinton Ooi Bhavin Modi Tue 11:30 - 12:30, Palazzo L Thu 10:00- 11:00 am, Delfino 4001A Pivotal: for Powerful Processing of Unstructured Data For Valuable Insights SK Krishnamurthy Mon 4:00 - 5:00, Lando 4201 A Tue 4:00 - 5:00, Palazzo M Pivotal: Big & Fast data – merging real-time data and deep analytics Michael Crutcher Mon 1:00 - 2:00, Lando 4201 A Wed 10:00 - 11:00, Palazzo M Pivotal: Virtualize Big Data to Make The Elephant Dance June Yang Dan Baskette Mon 11:30 - 12:30, Marcello 4401A Wed 4:00 - 5:00, Palazzo E Hadoop Design Patterns Don Miner Mon 2:30 - 3:30, Palazzo F Wed 8:30 - 9:30, Delfino 4005 © Copyright 2013 EMC Corporation. All rights reserved. 42