® © 2014 MapR Technologies 1 
Frans Thamura / Meruvian / frans@meruvian.com 
March 2014
MapR Overview 
BIG DATA: Hadoop
BEST PRODUCT: Top Ranked
BUSINESS IMPACT: Production Success
3 Trends 
Forcing a revolution in enterprise architecture
TREND 1 Industry Leaders Compete and Win with Data 
More Data Beats Better Algorithms 
Collecting interaction data from ecommerce, social media, offline, and call centers enables a “customer 360 view” and consumer intimacy
Competitive Advantage is Decided by 0.5% 
Consumer financial services: 1% improvement in fraud detection means hundreds of millions of dollars 
Advertising and retail: 0.5% improvement in lift means millions of dollars increase in profitability 
TREND 2 Big Data is Overwhelming Traditional Systems
ENTERPRISE USERS
OPERATIONAL SYSTEMS: PRODUCTION REQUIREMENTS
• Mission-critical reliability
• Transaction guarantees
• Deep security
• Real-time performance
• Backup and recovery
ANALYTICAL SYSTEMS: PRODUCTION REQUIREMENTS
• Interactive SQL
• Rich analytics
• Workload management
• Data governance
• Backup and recovery
Enterprise Data Architecture
OUTSIDE SOURCES
TREND 3 Hadoop: The Disruptive Technology at the Core of Big Data 
[Charts: GOOGLE TRENDS (interest over time, 2004 to 2014); JOB TRENDS FROM INDEED.COM]
And 3 Realities
REALITY 1 Hadoop Relieves the Pressure from Enterprise Systems
ENTERPRISE USERS
OPERATIONAL SYSTEMS
ANALYTICAL SYSTEMS
• Data staging
• Archive
• Data transformation
• Data exploration
• Streaming, interactions
Keys for Production Success
1 Reliability and DR
2 Interoperability
3 High performance
4 Supports operations and analytics
REALITY 2 What Would Google Do?
Google’s operational data store (BigTable) has enabled multiple revolutions within the company:
(1) The transition from batch to real-time
2004 Web index is batch (GFS/MapReduce)
2010 Web index is real-time (BigTable)
(2) The explosion in operational applications
Timeline: 2003 GFS, 2004 MapReduce, 2006 BigTable
REALITY 3 Architecture Matters for Success
NEW APPLICATIONS, SLAs, TRUSTED INFORMATION, LOWER TCO
FOUNDATION
• Open standards for integration
• Data protection & security
• High performance
• Multi-tenancy
• Operational & Analytical Workloads
MapR: Architecture Matters
104M CARD MEMBERS
Fortune 100 Financial Services Company
100B AD AUCTIONS per day
Advertising Automation Cloud: Sellers Cloud, Buyers Cloud
45M SHOPPERS analyzed each month
Fortune 100 Retailer
20M SONGS
1.3B PEOPLE
Largest Biometric Database in the World
Common Use Cases: Taking Advantage of Hadoop 
ENTERPRISE DATA HUB
• Multi-structured data staging & archive
• ETL / DW optimization
• Mainframe optimization
• Data exploration
MARKETING OPTIMIZATION
• Recommendation engines & targeting
• Customer 360
• Click-stream analysis
• Social media analysis
• Ad optimization
RISK & SECURITY OPTIMIZATION
• Network security monitoring
• Security information & event management
• Fraudulent behavioral analysis
OPERATIONAL INTELLIGENCE
• Supply chain & logistics
• System log analysis
• Manufacturing quality assurance
• Preventative maintenance
• Smart meter analysis
MapR is the Hadoop Technology Leader 
BIG DATA 
HADOOP
The Power of the Open Source Community 
Provisioning & coordination
Savannah*
Workflow & Data Governance
Data Integration & Access
Hue
HttpFS
Flume Knox* Falcon*
MapR-FS MapR-DB
Management
APACHE HADOOP AND OSS ECOSYSTEM
Streaming
Storm*
NoSQL & Search
Solr
MapR Data Platform
Security
SQL
Drill
Shark
Impala
YARN
Batch
Spark
Cascading
Pig
Spark Streaming
HBase
Juju
ML, Graph
GraphX
MLLib
Mahout
MapReduce v1 & v2
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS
Tez*
Accumulo*
Hive
Sqoop Sentry* Oozie ZooKeeper
* Certification/support planned for 2014
Enterprise-grade Interoperability Performance Multi-tenancy Security Operational 
MapR Distribution for Hadoop 
• Standard file access
• Standard database access
• Pluggable services
• Broad developer support
• Enterprise security authorization
• Wire-level authentication
• Data governance
• Ability to support predictive analytics, real-time database operations, and support high arrival rate data
• Ability to logically divide a cluster to support different use cases, job types, user groups, and administrators
• 2X to 7X higher performance
• Consistent, low latency
• High availability
• Data protection
• Disaster recovery
• Ability to support predictive analytics, real-time database operations, and support high arrival rate data
– Integrated in-Hadoop database
– Consistent low latency
– Instant recovery for database operations
– No compactions
– Elimination of read/write amplification
– Zero administration
• Enterprise security authorization
• Wire-level authentication
• Data governance
– Kerberos support
– Native key-based authentication
– Enterprise directory integration LDAP/NIS/AD
– Linux PAM
– Role-based access control with Boolean expressions
– Intel AES-NI high performance encryption
• Ability to logically divide a cluster to support different use cases, job types, user groups, and administrators
– Data placement control
– Job placement control
– Logical volumes
– Ability to leverage enterprise access control to isolate and secure data access
– Enforce SLAs, provide job isolation
• High availability
• Data protection
• Disaster recovery
– Instant stateful failover
– 99.999% Availability
– Consistent snapshots
– Point-in-time recovery
– Self-healing
– WAN replication
– RTO with mirroring
– Job Tracker HA
– System resource protection
– Job isolation and user quotas
• Standard file access
• Standard database access
• Pluggable services
• Broad developer support
– NFS support
– POSIX
– Random read/write
– Concurrent read/write
– JDBC/ODBC
– Nagios/Ganglia integration
– REST API
• 2X to 7X higher performance
• Consistent, low latency
– No-Namenode distributed architecture
– Database performance with no compactions or defragmentation
– Automated compression
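To put the 99.999% availability figure from the list above in concrete terms, the downtime budget it implies can be worked out directly. This is plain arithmetic, not anything MapR-specific; the function name and the 365-day year are assumptions made for illustration.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assumption: a 365-day year


def downtime_minutes_per_year(availability: float) -> float:
    """Return the yearly downtime budget, in minutes, for a given availability."""
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    return (1.0 - availability) * MINUTES_PER_YEAR


if __name__ == "__main__":
    for label, value in [("99.9%", 0.999), ("99.99%", 0.9999), ("99.999%", 0.99999)]:
        minutes = downtime_minutes_per_year(value)
        print(f"{label}: {minutes:.2f} minutes of downtime per year")
```

Under these assumptions, 99.999% leaves a little over five minutes of downtime per year.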
MapR: Best Solution for Customer Success 
Top Ranked
Exponential Growth
500+ Customers
Premier Investors
>2x annual bookings
90% software licenses
80% of accounts expand 3X
< 1% lifetime churn
> $1B in incremental revenue generated by 1 customer
MapR: The Top Ranked Current Offering
Forrester Wave™: Big Data Hadoop Solutions, Q1‘14
“The score speaks for itself. MapR has added some unique innovations to its Hadoop distribution, including support for Network File System (NFS), running arbitrary code in the cluster, performance enhancements for HBase, as well as high-availability and disaster recovery features.”
[Chart: Forrester Wave. Axes: Current offerings (Weak to Strong) and Strategy (Weak to Strong). Bands: Risky Bets, Contenders, Strong Performers, Leaders. Also shown: Market presence.]
The Forrester Wave is copyrighted by Forrester Research, Inc. Forrester and Forrester Wave are trademarks of Forrester Research, Inc. The Forrester Wave is a graphical representation of Forrester's call on a market and is plotted using a detailed spreadsheet with exposed scores, weightings, and comments. Forrester does not endorse any vendor, product, or service depicted in the Forrester Wave. Information is based on best available resources. Opinions reflect judgment at the time and are subject to change.
High Availability & Data Protection
Business Continuity
High Availability
Data Protection
Disaster Recovery
What are your requirements?
What do you have for your enterprise storage, databases and data warehouses?
High Availability (HA) Everywhere 
No NameNode architecture
• Distributed metadata can self-heal
• No practical limit on # of files
MapReduce/YARN HA
• Jobs are not impacted by failures
• Meet your data processing SLAs
NFS HA
• High throughput and resilience for NFS-based data ingestion, import/export and multi-client access
Instant recovery
• Files and tables are accessible within seconds of a node failure or cluster restart
Rolling upgrades
• Upgrade the software with no downtime
HA is built in
• No special configuration to enable HA
• All MapR customers operate with HA
Apache Hadoop NameNode High Availability 
[Diagram: three HDFS layouts over DataNodes holding files A B C D E F. A single Primary NameNode; HDFS HA with a Primary NameNode and a Standby NameNode; HDFS Federation with separate NameNodes for A B, C D, and E F.]
HDFS-based Distributions
Single point of failure
Only one active NameNode
Limited to 50-200 million files
Metadata must fit in memory
Double the block reports
Multiple single points of failure w/o HA
Needs 20 NameNodes for 1 Billion files
Performance bottleneck
No-NameNode Architecture 
[Diagram: files A B C D E F spread across DataNodes, with no NameNode]
Up to 1T files (> 5000x advantage) 
Significantly less hardware & OpEx 
Higher performance 
No special config to enable HA 
Automatic failover & re-replication 
Metadata is persisted to disk
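The file-count figures quoted on the two NameNode slides can be cross-checked with a few lines of arithmetic. The sketch below assumes that the "20 NameNodes for 1 Billion files" figure uses the low end of the 50-200 million range and that the "> 5000x" figure compares 1 trillion files against the high end; both readings are inferences from the numbers, and the function names are illustrative only.

```python
import math


def namenodes_needed(total_files: int, files_per_namenode: int) -> int:
    """NameNodes required if each one can track at most files_per_namenode files."""
    return math.ceil(total_files / files_per_namenode)


def scale_advantage(max_files: int, baseline_files: int) -> float:
    """How many times more files one system holds than a baseline."""
    return max_files / baseline_files


if __name__ == "__main__":
    # 1 billion files at 50 million files per NameNode (low end of the range)
    print(namenodes_needed(1_000_000_000, 50_000_000))
    # 1 trillion files versus 200 million files (high end of the range)
    print(scale_advantage(1_000_000_000_000, 200_000_000))
```

Read this way, the slide's numbers are internally consistent: 20 NameNodes at the low end of the range, and a factor of 5000 at the high end.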
Data Protection: Replication and Snapshots 
Replication
• Protect from hardware failures
• File chunks, table regions and metadata are automatically replicated (3x by default)
• At least one replica on a different rack
Snapshots
• Protect from user and application errors
• Point-in-time recovery
• Redirect on write
• No performance or scale impact
• Read files and tables directly from snapshot
[Diagram: chunks C1 to C7 replicated across nodes; Active Volume and Snapshot over blocks A, B, C, D, D₁]
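The "redirect on write" idea behind the snapshots above can be illustrated with a toy model. This is a minimal sketch of the general technique, not MapR's implementation: the `Volume` class, its methods, and the in-memory block store are all hypothetical, and the toy never reclaims old blocks.

```python
from typing import Dict, List, Optional


class Volume:
    """Toy volume: logical blocks map to immutable physical blocks."""

    def __init__(self) -> None:
        self._store: Dict[int, bytes] = {}          # physical block id -> data
        self._next_id = 0
        self._active: List[int] = []                # logical index -> physical block id
        self._snapshots: Dict[str, List[int]] = {}  # snapshot name -> saved block map

    def _allocate(self, data: bytes) -> int:
        block_id = self._next_id
        self._next_id += 1
        self._store[block_id] = data
        return block_id

    def append(self, data: bytes) -> int:
        """Add a new logical block and return its index."""
        self._active.append(self._allocate(data))
        return len(self._active) - 1

    def write(self, index: int, data: bytes) -> None:
        """Redirect on write: store the data in a fresh physical block and
        repoint the active map. The old block stays in place for snapshots."""
        self._active[index] = self._allocate(data)

    def snapshot(self, name: str) -> None:
        """Record the current block map; no block data is copied."""
        self._snapshots[name] = list(self._active)

    def read(self, index: int, snapshot: Optional[str] = None) -> bytes:
        """Read from the active volume, or directly from a named snapshot."""
        block_map = self._active if snapshot is None else self._snapshots[snapshot]
        return self._store[block_map[index]]


if __name__ == "__main__":
    volume = Volume()
    for payload in (b"A", b"B", b"C", b"D"):
        volume.append(payload)
    volume.snapshot("before-change")
    volume.write(3, b"D1")
    print(volume.read(3))                   # active volume sees the new block
    print(volume.read(3, "before-change"))  # snapshot still sees the old block
```

Because a snapshot copies only the block map, taking one costs the same regardless of how much data the volume holds, which is the intuition behind the "no performance or scale impact" bullet.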
Disaster Recovery: Mirroring 
• Flexible
– Choose the volumes/directories to mirror
– You don’t need to mirror the entire cluster
– Active/active
• Fast
– No performance impact
– Block-level (8KB) deltas
– Automatic compression
• Safe
– Point-in-time consistency
– End-to-end checksums
• Easy
– Graceful handling of network issues
– No third-party software
– Takes less than two minutes to configure!
[Diagram: Production to Research mirroring over a WAN between Datacenter 1 and Datacenter 2; Production to EC2 mirroring over a WAN]
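The "block-level (8KB) deltas" bullet can be made concrete with a small sketch: only the 8 KB blocks that differ between two versions of a file need to cross the WAN. This illustrates the general idea only and says nothing about how MapR tracks changes internally; the function names are hypothetical, and SHA-256 digests stand in for whatever change detection the product uses.

```python
import hashlib
from typing import Dict, List

BLOCK_SIZE = 8 * 1024  # 8 KB, the delta granularity quoted on the slide


def block_digests(data: bytes, block_size: int = BLOCK_SIZE) -> List[bytes]:
    """Digest of every fixed-size block in data."""
    return [
        hashlib.sha256(data[offset:offset + block_size]).digest()
        for offset in range(0, len(data), block_size)
    ]


def changed_blocks(old: bytes, new: bytes, block_size: int = BLOCK_SIZE) -> Dict[int, bytes]:
    """Return {block index: block data} for blocks of new that differ from old."""
    old_digests = block_digests(old, block_size)
    delta: Dict[int, bytes] = {}
    for index, offset in enumerate(range(0, len(new), block_size)):
        block = new[offset:offset + block_size]
        is_new_block = index >= len(old_digests)
        if is_new_block or hashlib.sha256(block).digest() != old_digests[index]:
            delta[index] = block
    return delta


def apply_delta(old: bytes, delta: Dict[int, bytes], new_length: int,
                block_size: int = BLOCK_SIZE) -> bytes:
    """Rebuild the new version on the mirror side from the old copy plus the delta."""
    blocks = [old[offset:offset + block_size] for offset in range(0, len(old), block_size)]
    for index in sorted(delta):
        if index < len(blocks):
            blocks[index] = delta[index]
        else:
            blocks.append(delta[index])
    return b"".join(blocks)[:new_length]


if __name__ == "__main__":
    source_v1 = bytes(4 * BLOCK_SIZE)   # four zero-filled blocks
    source_v2 = bytearray(source_v1)
    source_v2[BLOCK_SIZE + 10] = 0xFF   # touch a single byte in block 1
    delta = changed_blocks(source_v1, bytes(source_v2))
    print(sorted(delta))                # indexes of the blocks that must be shipped
    print(apply_delta(source_v1, delta, len(source_v2)) == bytes(source_v2))
```

In this toy, a one-byte change to a 32 KB file ships one 8 KB block instead of the whole file.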
Interoperability
Seamless Integration with Direct Access NFS 
• MapR is POSIX compliant
– Random reads/writes
– Simultaneous reading and writing to a file
– Compression is automatic and transparent
• Industry-standard NFS interface (in addition to HDFS API)
– Stream data into the cluster
– Leverage thousands of tools and applications
– Easier to use non-Java programming languages
– No need for most proprietary Hadoop connectors
When Hadoop Looks Like a NAS…
• Data ingestion is easy
– Popular online gaming company changed data ingestion from a complex Flume cluster to a 17-line Python script
• Database bulk import/export with standard vendor tools
– Large telco saved $30M on EDW costs (5 years) by leveraging MapR to pre-process and store raw data prior to loading into EDW
• 1000s of applications/tools
– Large credit card company uses MapR volumes as the user home directories on the Hadoop gateway servers
[Diagram: logs from application servers written to the cluster]
$ find . | grep log
$ cp
$ vi results.csv
$ scp
$ tail -f part-00000
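When the cluster is mounted over NFS, an ingestion script of the kind mentioned above needs nothing beyond ordinary file operations. The sketch below is not the gaming company's script, which the slides do not show; the function name, the `*.log` pattern, and the directories are assumptions, and the demo uses temporary directories in place of a real application log directory and a real NFS mount point.

```python
import shutil
import tempfile
from pathlib import Path
from typing import List


def ingest_logs(source_dir: str, cluster_dir: str, pattern: str = "*.log") -> List[Path]:
    """Copy matching files into a directory that is assumed to be an NFS mount
    of the cluster. Only standard file-system calls are used."""
    destination = Path(cluster_dir)
    destination.mkdir(parents=True, exist_ok=True)
    copied: List[Path] = []
    for log_file in sorted(Path(source_dir).glob(pattern)):
        target = destination / log_file.name
        shutil.copy2(log_file, target)  # plain file copy, metadata preserved
        copied.append(target)
    return copied


if __name__ == "__main__":
    # Temporary directories stand in for the log directory and the NFS mount.
    with tempfile.TemporaryDirectory() as app_logs, tempfile.TemporaryDirectory() as mount:
        Path(app_logs, "web-01.log").write_text("GET /index.html 200\n")
        for path in ingest_logs(app_logs, mount):
            print(path.name)
```

The design point is that no Hadoop client library appears anywhere in the script: anything that can write to a file system can feed the cluster.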
Multi-Tenancy & Security
Volumes
100K volumes are OK, create as many as needed
Volumes dramatically simplify management:
• Replication factor
• Scheduled mirroring
• Scheduled snapshots
• Data placement control
• User access and tracking
• Administrative permissions
/projects
  /tahoe
  /yosemite
/user
  /msmith
  /bjohnson
Multi-tenancy 
Isolation 
• Tasks sandboxed so they don’t impact other tasks or system daemons 
• System resources protected from runaway jobs 
• Volume-based data placement 
• Label-based job scheduling 
Quotas 
• Storage quotas by volume/user/group 
• CPU and memory quotas by queue/user/group 
Security and delegation 
• Wire-level authentication and encryption (Kerberos not required) 
• Fine-grained administration permissions including volume-level delegation 
• Authenticate users to AD, LDAP and Kerberos via Linux PAM 
Reporting 
• Detailed reporting on resource usage (75+ different metrics) 
• All reports are available via UI, CLI and REST API
MapR Integrates Security into Hadoop
Making Security Easy 
> 99% of consumers accessing online banks use strong wire-level authentication
< 5% of organizations deploying Hadoop enable strong wire-level authentication
Hadoop Security 
Authorization to ensure the right access to files and databases
Authentication for users and user-created job requests
Encryption to ensure user credentials and data are always secure
Integration with existing security infrastructure
… Along With Fine-Grained Access Control 
Full POSIX permissions on files and directories 
ACLs on tables, column families and columns 
ACLs on MapReduce jobs and queues 
Administration ACLs on cluster and volumes 
Access control expressions for easy, role-based control 
Integration with Existing Security Infrastructure 
SSO with existing Kerberos infrastructure (optional) 
Linux PAM integration enables third-party user directories 
MapR supports wire-level authentication with and without Kerberos
[Diagram labels: HADOOP CLUSTER; CLIENT (NO KERBEROS), USERNAME/PASSWORD (HTTPS); CLIENT (KERBEROS-ENABLED), KERBEROS SERVICE TICKET; KERBEROS KDC; USER DIRECTORY (AD, LDAP, NIS, …), CHECK USERNAME/PASSWORD; Existing Security Infrastructure]
Cluster-wide Security
All operations on Hadoop are secured natively including:
User operations such as file reads and writes, database manipulations, MapReduce job submissions
Intra-cluster node-node interactions including remote procedure calls
Inter-cluster operations such as mirroring
Native Security Authentication
Ease of Deployment
Hadoop initiates and maintains secure key communication* throughout the cluster without requiring external validation
Users authenticate themselves through a simple and secure login-password mechanism
All cluster nodes authenticate and interact with each other through secure keys
*MapR Leverages Standard Cryptography: NSA Suite B Cryptography (AES-256 and SHA-384)
Performance Leader
World-Record Performance
NEW MINUTESORT WORLD RECORD: 1.65 TB IN 1 MINUTE, 298 NODES
PREVIOUS RECORD: 1.6 TB with 2200 nodes
MapR: With a Fraction of the Hardware
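The "fraction of the hardware" claim is easier to judge as data sorted per node. The sketch below only divides the figures quoted on the slide; it assumes decimal units (1 TB = 1000 GB) and that both records cover the same one-minute window, and the function name is illustrative.

```python
def gb_sorted_per_node(terabytes_sorted: float, nodes: int) -> float:
    """Data sorted per node in one minute, in GB (assuming 1 TB = 1000 GB)."""
    return terabytes_sorted * 1000 / nodes


if __name__ == "__main__":
    new_record = gb_sorted_per_node(1.65, 298)        # figures from the slide
    previous_record = gb_sorted_per_node(1.6, 2200)   # figures from the slide
    print(f"New record:      {new_record:.2f} GB per node per minute")
    print(f"Previous record: {previous_record:.2f} GB per node per minute")
    print(f"Per-node ratio:  {new_record / previous_record:.1f}x")
```

On these figures each node in the newer run sorted roughly 7.6 times as much data as a node in the previous record, though the two clusters may have used different hardware, so this is not a like-for-like software comparison.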
Comparative Study of Hadoop Distributions 
Read and Write Throughput Benchmarks
[Chart: DFSIO Read Throughput and DFSIO Write Throughput, in MB per second, for IDH, CDH, HDP, and MapR. Extracted bar values, not attributable to individual bars: 475, 465, 212, 59, 262, 69, 276, 64.]
Source: Flux7 Labs Study, October 2013
MapR-DB: The Best In-Hadoop Database 
MapR-DB
▪ NoSQL Wide-column Store
▪ Apache HBase API
▪ Integrated with Hadoop
[Diagram: Other Distros stack: HBase, JVM, HDFS, JVM, ext3/ext4, Disks. MapR stack: Tables/Files, Disks.]
MapR Enterprise Database Edition (M7)
The most scalable, enterprise-grade, NoSQL database that supports online applications and analytics
Consistent, Low Latency 
[Chart: M7 Read Latency compared with Others Read Latency]
Operations + Analytics = Real-time, Personalized Services 
Real-time Operational Applications: Online transactions, Fraud detection, Personalized offers
Analytics: Clickstream analysis, Fraud investigation tool
MapR Distribution for Hadoop: Fraud model, Recommendations table
Users: Fraud investigator, Interactive marketer
Ensuring Your Success
Committed to our Customers’ Success 
Educational Services: Instructor-led courses & Web-based training for Hadoop cluster administration, HBase & MapReduce programming and more
Professional Services: Core Hadoop Services, Data Engineering, Advanced Analytics, M7/HBase Practice
Customer Support: Hadoop engineering experts provide 24x7x365 global coverage
Data Engineering
Data Science
HQ
WORLDWIDE PRESENCE & CUSTOMER SUPPORT
Key MapR Advantage Partners 
Business Services
INFRASTRUCTURE & CLOUD
ANALYTICS & BUSINESS INTELLIGENCE
APPLICATIONS & OS
CONSULTANTS & INTEGRATORS
DATA WAREHOUSE & INTEGRATION
Opportunity to Revolutionize Enterprise Data Architecture 
From Redundant Processing Silos and Data Science Experiments… 
The Production Enterprise Data Hub 
… to Consolidated Operational and Analytical Workloads
Summary 
BIG DATA: Hadoop
BEST PRODUCT: Top Ranked
BUSINESS IMPACT: Production Success
Q&A
Engage with us! 
@mapr maprtech 
YOURNAME@mapr.com 
MapR 
maprtech 
mapr-technologies
Extra slides
Packages Supported by various distributions 
Red: lacking
Blue: leading
Category | Package | MapR 4.0.1 (Sep 2014) | Cloudera 5.1.2 (Aug 2014) | Hortonworks 2.1.5 (Aug 2014) | Apache Versions (Sep 12th, 2014)
Core Hadoop | Hadoop Core, YARN | 2.4.1 | 2.3.0 | 2.4.0 | 2.5.1
Batch | Map Reduce | MRv1 and MRv2 | MRv1 or MRv2 | MRv2 | MRv2
Batch | Hive | 0.12, 0.13 | 0.12 | 0.13 | 0.13
Batch | Tez | 0.4 (Dev Preview Only) | X | 0.4 | 0.5
Batch | Pig | 0.12 | 0.12 | 0.12 | 0.12
Batch | Cascading | 2.1.6 | X | X | 2.5
Batch | Spark | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
Interactive SQL | Impala | 1.2.3 | 1.4 | X | 1.4
Interactive SQL | Drill | 0.5 | X | X | 0.5
Interactive SQL | SparkSQL | 1.0.2 | X | 1.0.1 (Tech Preview only) | 1.1
NoSQL and Search | HBase/NoSQL | 0.94.2, 0.98.4, MapR-DB | 0.98 | 0.98, Accumulo 1.5.1 | HBase 0.98
NoSQL and Search | Phoenix | X | X | 4.0.0 | 4.1.0
NoSQL and Search | AsyncHBase | 1.5 | X | X | 1.5
NoSQL and Search | Search | LW (Solr) 2.6.1, 2.7 | Cloudera Search 1.5 | X | NA
Machine Learning and Graph | Mahout | 0.9 | 0.9 | 0.9 | 0.9
Machine Learning and Graph | MLLib/MLBase | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
Machine Learning and Graph | GraphX | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
Streaming/Messaging | Spark Streaming | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
Streaming/Messaging | Storm | 0.9, 0.9.2 (Certified) | X | 0.9.1 | 0.9.2
Streaming/Messaging | Kafka | X | X | 0.8.1.1 (Tech Preview) | 0.8.1.1
Data Integration | Sqoop, Sqoop2 | 1.4.4, 1.99.3 | 1.4.4, 1.99.3 | 1.4.4 | 1.4.5
Data Integration | Flume | 1.5.0 | 1.5.0 | 1.4.0 | 1.5.0
Data Integration | Knox | X | X | 0.4 | 0.4
Coordination | Oozie | 4.0.1 | 4.0.0 | 4.0.0 | 4.0.1
Coordination | Zookeeper | 3.4.5 | 3.4.5 | 3.4.5 | 3.4.5
GUI, Configuration, Monitoring | Management | MCS | CM | Ambari | Ambari
GUI, Configuration, Monitoring | Hue | 3.5 | 3.6 | 2.5.1 | 3.6
http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH-Version-and-Packaging-Information/cdhvd_cdh_package_tarball.html?scroll=topic_3_unique_8
http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.5/bk_releasenotes_hdp_2.1/content/ch_relnotes-hdp-2.1.5-product.html
The Cloud Leaders Pick MapR
Google chose MapR to provide Hadoop on Google Compute Engine
Amazon EMR is the largest Hadoop provider in revenue and # of clusters

More Related Content

What's hot

Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
MapR Technologies
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Technologies
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
MapR Technologies
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
MapR Technologies
 
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor DataState of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Mathieu Dumoulin
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR Technologies
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
Ian Downard
 
Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
MapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
MapR Technologies
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
MapR Technologies
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
MapR Technologies
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
MapR Technologies
 
Keys for Success from Streams to Queries
Keys for Success from Streams to QueriesKeys for Success from Streams to Queries
Keys for Success from Streams to Queries
DataWorks Summit/Hadoop Summit
 
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data PlatformMapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR 5.2: Getting More Value from the MapR Converged Data Platform
MapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DB
MapR Technologies
 

What's hot (20)

Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor DataState of the Art Robot Predictive Maintenance with Real-time Sensor Data
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
Spark and MapR Streams: A Motivating Example
Meruvian - Introduction to MapR

  • 1. © 2014 MapR Technologies. Frans Thamura / Meruvian / frans@meruvian.com, March 2014.
  • 2. MapR Overview: Big Data, Best Product, Business Impact. Hadoop; Top Ranked; Production Success.
  • 3. 3 Trends: Forcing a revolution in enterprise architecture.
  • 4. TREND 1: Industry Leaders Compete and Win with Data. More Data Beats Better Algorithms: collecting interaction data from ecommerce, social media, offline, and call centers enables a “customer 360 view” and consumer intimacy. Competitive Advantage is Decided by 0.5%: in consumer financial services, a 1% improvement in fraud detection means hundreds of millions of dollars; in advertising and retail, a 0.5% improvement in lift means millions of dollars of increased profitability.
  • 5. TREND 2: Big Data is Overwhelming Traditional Systems. Enterprise Data Architecture (diagram): enterprise users, outside sources, operational systems, analytical systems. Production requirements for operational systems: mission-critical reliability; transaction guarantees; deep security; real-time performance; backup and recovery. Production requirements for analytical systems: interactive SQL; rich analytics; workload management; data governance; backup and recovery.
  • 6. TREND 3: Hadoop: The Disruptive Technology at the Core of Big Data. [Charts: Google Trends, interest over time, 2004 to 2014; job trends from Indeed.com.]
  • 7. And 3 Realities.
  • 8. REALITY 1: Hadoop Relieves the Pressure from Enterprise Systems (operational systems, analytical systems, enterprise users): data staging; archive; data transformation; data exploration; streaming, interactions. Keys for Production Success: (1) Reliability and DR; (2) Interoperability; (3) High performance; (4) Supports operations and analytics.
  • 9. REALITY 2: What Would Google Do? Google’s operational data store (BigTable) has enabled multiple revolutions within the company: (1) the transition from batch to real-time (2004: web index is batch, on GFS/MapReduce; 2010: web index is real-time, on BigTable); (2) the explosion in operational applications. Timeline: 2003 GFS; 2004 MapReduce; 2006 BigTable.
  • 10. REALITY 3: Architecture Matters for Success. Foundation.
  • 11. REALITY 3: Architecture Matters for Success. New applications; SLAs; trusted information; lower TCO. Foundation: open standards for integration; data protection & security; high performance; multi-tenancy; operational & analytical workloads.
  • 12. MapR: Architecture Matters.
  • 13. Fortune 100 Financial Services Company: 104M card members.
  • 14. Advertising Automation Cloud, Sellers Cloud, Buyers Cloud: 100B ad auctions per day.
  • 15. Fortune 100 Retailer: 45M shoppers analyzed each month.
  • 16. 20M songs.
  • 17. Largest Biometric Database in the World: 1.3B people.
  • 18. Common Use Cases: Taking Advantage of Hadoop. Enterprise data hub: multi-structured data staging & archive; ETL / DW optimization; mainframe optimization; data exploration. Marketing optimization: recommendation engines & targeting; customer 360; click-stream analysis; social media analysis; ad optimization. Risk & security optimization: network security monitoring; security information & event management; fraudulent behavioral analysis. Operational intelligence: supply chain & logistics; system log analysis; manufacturing quality assurance; preventative maintenance; smart meter analysis.
  • 19. MapR is the Hadoop Technology Leader. Big Data; Hadoop.
  • 20. The Power of the Open Source Community. [Diagram: Apache Hadoop and OSS ecosystem on the MapR Data Platform (MapR-FS, MapR-DB), with Management. Execution engines: Batch; Streaming; NoSQL & Search; ML, Graph; SQL. Data governance and operations: Provisioning & coordination; Workflow & Data Governance; Data Integration & Access; Security. Components shown: Spark, Cascading, Pig, MapReduce v1 & v2, YARN, Tez*, Storm*, Spark Streaming, HBase, Solr, Accumulo*, GraphX, MLlib, Mahout, Drill, Shark, Impala, Hive, Hue, HttpFS, Flume, Sqoop, Knox*, Sentry*, Falcon*, Oozie, Savannah*, Juju, ZooKeeper.] * Certification/support planned for 2014.
  • 21. MapR Distribution for Hadoop. [Diagram: Apache Hadoop and OSS ecosystem on the MapR Data Platform, as on slide 20.] Pillars: Enterprise-grade; Interoperability; Performance; Multi-tenancy; Security; Operational. Capabilities: standard file access, standard database access, pluggable services, broad developer support. Enterprise security authorization, wire-level authentication, data governance. Ability to support predictive analytics, real-time database operations, and high arrival rate data. Ability to logically divide a cluster to support different use cases, job types, user groups, and administrators. 2X to 7X higher performance; consistent, low latency. High availability, data protection, disaster recovery.
  • 22. MapR Distribution for Hadoop (same ecosystem diagram as slide 21, here also listing Whirr and marking Drill with *; * = certification/support planned for 2014)
      • Operational: ability to support predictive analytics, real-time database operations, and high arrival rate data
          – Integrated in-Hadoop database
          – Consistent low latency
          – Instant recovery for database operations
          – No compactions
          – Elimination of read/write amplification
          – Zero administration
      • Security: enterprise security authorization; wire-level authentication; data governance
          – Kerberos support
          – Native key-based authentication
          – Enterprise directory integration (LDAP/NIS/AD)
          – Linux PAM
          – Role-based access control with Boolean expressions
          – Intel AES-NI high-performance encryption
      • Multi-tenancy: ability to logically divide a cluster to support different use cases, job types, user groups, and administrators
          – Data placement control
          – Job placement control
          – Logical volumes
          – Ability to leverage enterprise access control to isolate and secure data access
          – Enforce SLAs, provide job isolation
      • Enterprise-grade: high availability; data protection; disaster recovery
          – Instant stateful failover
          – 99.999% availability
          – Consistent snapshots
          – Point-in-time recovery
          – Self-healing
          – WAN replication
          – RTO with mirroring
          – Job Tracker HA
          – System resource protection
          – Job isolation and user quotas
      • Interoperability: standard file access; standard database access; pluggable services; broad developer support
          – NFS support
          – POSIX
          – Random read/write
          – Concurrent read/write
          – JDBC/ODBC
          – Nagios/Ganglia integration
          – REST API
      • Performance: 2X to 7X higher performance; consistent, low latency
          – No-NameNode distributed architecture
          – Database performance with no compactions or defragmentation
          – Automated compression
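The "99.999% availability" figure on this slide can be turned into a yearly downtime budget with simple arithmetic. The following is a minimal sketch; the function name is illustrative and a 365-day year is assumed.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assumes a non-leap year


def max_downtime_minutes_per_year(availability: float) -> float:
    """Downtime budget implied by an availability fraction such as 0.99999."""
    return (1.0 - availability) * MINUTES_PER_YEAR


five_nines_budget = max_downtime_minutes_per_year(0.99999)
```

Under these assumptions, five nines leaves roughly 5.26 minutes of downtime per year.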
  • 23. MapR: Best Solution for Customer Success
      • Top ranked; exponential growth; 500+ customers; premier investors
      • >2x annual bookings
      • 90% software licenses
      • 80% of accounts expand 3X
      • < 1% lifetime churn
      • > $1B in incremental revenue generated by 1 customer
  • 24. MapR: The Top Ranked Current Offering
      Forrester Wave™: Big Data Hadoop Solutions, Q1 '14
      “The score speaks for itself. MapR has added some unique innovations to its Hadoop distribution, including support for Network File System (NFS), running arbitrary code in the cluster, performance enhancements for HBase, as well as high-availability and disaster recovery features.”
      [Chart: Forrester Wave. Axes: Current offering (weak to strong) and Strategy (weak to strong). Bands: Risky Bets, Contenders, Strong Performers, Leaders. Legend: Market presence.]
      The Forrester Wave is copyrighted by Forrester Research, Inc. Forrester and Forrester Wave are trademarks of Forrester Research, Inc. The Forrester Wave is a graphical representation of Forrester's call on a market and is plotted using a detailed spreadsheet with exposed scores, weightings, and comments. Forrester does not endorse any vendor, product, or service depicted in the Forrester Wave. Information is based on best available resources. Opinions reflect judgment at the time and are subject to change.
  • 26. High Availability & Data Protection
  • 27. Business Continuity: High Availability, Data Protection, Disaster Recovery. What are your requirements? What do you have for your enterprise storage, databases and data warehouses?
  • 28. High Availability (HA) Everywhere
      • No NameNode architecture
          – Distributed metadata can self-heal
          – No practical limit on # of files
      • MapReduce/YARN HA
          – Jobs are not impacted by failures
          – Meet your data processing SLAs
      • NFS HA
          – High throughput and resilience for NFS-based data ingestion, import/export and multi-client access
      • Instant recovery
          – Files and tables are accessible within seconds of a node failure or cluster restart
      • Rolling upgrades
          – Upgrade the software with no downtime
      • HA is built in
          – No special configuration to enable HA
          – All MapR customers operate with HA
  • 29. Apache Hadoop NameNode High Availability
      [Diagram: HDFS HA (primary and standby NameNode in front of DataNodes) and HDFS Federation (several NameNodes, each holding part of the metadata A–F, in front of DataNodes).]
      HDFS-based distributions:
      • Single point of failure
      • Only one active NameNode
      • Limited to 50-200 million files
      • Metadata must fit in memory
      • Double the block reports
      • Multiple single points of failure w/o HA
      • Needs 20 NameNodes for 1 billion files
      • Performance bottleneck
  • 30. No-NameNode Architecture
      [Diagram: metadata A–F distributed across the data nodes, with no separate NameNode.]
      • Up to 1T files (> 5000x advantage)
      • Significantly less hardware & OpEx
      • Higher performance
      • No special config to enable HA
      • Automatic failover & re-replication
      • Metadata is persisted to disk
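The scale claims on slides 29 and 30 can be cross-checked with a little arithmetic. This is a minimal sketch that uses only the figures quoted on the slides (they are not independently verified here); the names are illustrative.

```python
# Figures quoted on the slides.
NAMENODE_FILE_LIMIT_LOW = 50_000_000      # "Limited to 50-200 million files"
NAMENODE_FILE_LIMIT_HIGH = 200_000_000
MAPR_FILE_LIMIT = 1_000_000_000_000       # "Up to 1T files"


def namenodes_needed(total_files: int, files_per_namenode: int) -> int:
    """Federated NameNodes needed to hold total_files, rounding up."""
    return -(-total_files // files_per_namenode)


def scale_advantage(mapr_limit: int, namenode_limit: int) -> float:
    """How many times more files the larger limit allows."""
    return mapr_limit / namenode_limit


federation_size = namenodes_needed(1_000_000_000, NAMENODE_FILE_LIMIT_LOW)
advantage = scale_advantage(MAPR_FILE_LIMIT, NAMENODE_FILE_LIMIT_HIGH)
```

With the lower per-NameNode limit, one billion files needs 20 NameNodes, and 1T files against the upper limit of 200 million gives the 5000x ratio, which matches the numbers on the slides.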
  • 31. Data Protection: Replication and Snapshots
      • Replication
          – Protect from hardware failures
          – File chunks, table regions and metadata are automatically replicated (3x by default)
          – At least one replica on a different rack
      • Snapshots
          – Protect from user and application errors
          – Point-in-time recovery
          – Redirect on write
          – No performance or scale impact
          – Read files and tables directly from snapshot
      [Diagram: containers C1–C7, each replicated across three nodes; an active volume and its snapshot sharing blocks A, B, C and D, with the rewritten block stored as D₁.]
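The "redirect on write" bullet can be illustrated with a toy model. The sketch below is not MapR's implementation; it only shows the general idea that a snapshot freezes the block map while new writes go to fresh blocks. The class and method names are hypothetical.

```python
from typing import Dict, Optional


class ToyVolume:
    """Toy volume: maps logical block names to immutable stored blocks."""

    def __init__(self) -> None:
        self._store: Dict[int, bytes] = {}               # block id -> data, never overwritten
        self._active: Dict[str, int] = {}                # logical name -> current block id
        self._snapshots: Dict[str, Dict[str, int]] = {}  # snapshot name -> frozen block map
        self._next_id = 0

    def write(self, name: str, data: bytes) -> None:
        # Redirect on write: new data always lands in a fresh block,
        # so a snapshot that still points at the old block is unaffected.
        block_id = self._next_id
        self._next_id += 1
        self._store[block_id] = data
        self._active[name] = block_id

    def snapshot(self, snap_name: str) -> None:
        # Only the block map is copied, not the data blocks.
        self._snapshots[snap_name] = dict(self._active)

    def read(self, name: str, snap_name: Optional[str] = None) -> bytes:
        block_map = self._active if snap_name is None else self._snapshots[snap_name]
        return self._store[block_map[name]]


volume = ToyVolume()
for block in ("A", "B", "C", "D"):
    volume.write(block, block.encode())
volume.snapshot("snap1")
volume.write("D", b"D1")  # the active volume now sees D1; snap1 still sees D
```

Reading block D from the active volume returns the new data, while reading it through `snap1` returns the original, mirroring the D and D₁ blocks in the slide's diagram.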
  • 32. Disaster Recovery: Mirroring
      • Flexible
          – Choose the volumes/directories to mirror
          – You don’t need to mirror the entire cluster
          – Active/active
      • Fast
          – No performance impact
          – Block-level (8KB) deltas
          – Automatic compression
      • Safe
          – Point-in-time consistency
          – End-to-end checksums
      • Easy
          – Graceful handling of network issues
          – No third-party software
          – Takes less than two minutes to configure!
      [Diagrams: Production (Datacenter 1) mirrored over the WAN to Research (Datacenter 2); Production mirrored over the WAN to EC2.]
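The slide does not describe how the block-level deltas are computed. As a rough sketch of the general idea only, the following compares two versions of a byte string in 8 KB blocks and reports which blocks would need to be shipped. The use of SHA-256 digests and the function names are assumptions made for illustration.

```python
import hashlib
from typing import List

BLOCK_SIZE = 8 * 1024  # 8 KB, the delta granularity quoted on the slide


def block_digests(data: bytes) -> List[bytes]:
    """Digest of each BLOCK_SIZE-sized block of data (last block may be shorter)."""
    return [hashlib.sha256(data[i:i + BLOCK_SIZE]).digest()
            for i in range(0, len(data), BLOCK_SIZE)]


def changed_blocks(old: bytes, new: bytes) -> List[int]:
    """Indices of blocks in `new` that differ from `old` or did not exist in it.

    Truncation (blocks present only in `old`) is not reported by this sketch.
    """
    old_digests = block_digests(old)
    new_digests = block_digests(new)
    return [i for i, digest in enumerate(new_digests)
            if i >= len(old_digests) or old_digests[i] != digest]
```

For a three-block source where a single byte in the second block changes, only block index 1 is reported, so only 8 KB rather than the whole file would have to cross the WAN.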
  • 33. Interoperability
  • 34. Seamless Integration with Direct Access NFS
      • MapR is POSIX compliant
          – Random reads/writes
          – Simultaneous reading and writing to a file
          – Compression is automatic and transparent
      • Industry-standard NFS interface (in addition to HDFS API)
          – Stream data into the cluster
          – Leverage thousands of tools and applications
          – Easier to use non-Java programming languages
          – No need for most proprietary Hadoop connectors
  • 35. When Hadoop Looks Like a NAS…
      • Data ingestion is easy
          – Popular online gaming company changed data ingestion from a complex Flume cluster to a 17-line Python script
      • Database bulk import/export with standard vendor tools
          – Large telco saved $30M on EDW costs (5 years) by leveraging MapR to pre-process and store raw data prior to loading into EDW
      • 1000s of applications/tools
          – Large credit card company uses MapR volumes as the user home directories on the Hadoop gateway servers
      [Diagram: logs flowing from application servers into the cluster.]
      Shell commands shown on the slide:
          $ find . | grep log
          $ cp
          $ vi results.csv
          $ scp
          $ tail -f part-00000
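The 17-line script mentioned on this slide is not reproduced in the deck. As a hedged sketch of what file-based ingestion over an NFS mount might look like, the following copies log files into a directory with ordinary file operations. The function name and the paths in the usage comment are hypothetical.

```python
import shutil
from pathlib import Path
from typing import List


def ingest_logs(source_dir: str, cluster_dir: str, pattern: str = "*.log") -> List[str]:
    """Copy files matching pattern from source_dir into cluster_dir.

    cluster_dir is assumed to be a directory on an NFS mount of the cluster,
    so a plain file copy is all that is needed. Returns the names copied.
    """
    src, dst = Path(source_dir), Path(cluster_dir)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for path in sorted(src.glob(pattern)):
        if path.is_file():
            shutil.copy2(path, dst / path.name)
            copied.append(path.name)
    return copied


# Hypothetical usage, assuming the cluster is NFS-mounted under /mapr:
# ingest_logs("/var/log/myapp", "/mapr/my.cluster.com/projects/logs")
```

Because the target is just a mounted directory, the same script works unchanged against a local folder, which makes it easy to test before pointing it at a cluster.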
  • 36. Multi-Tenancy & Security
  • 37. Volumes
      100K volumes are OK; create as many as needed. Volumes dramatically simplify management:
      • Replication factor
      • Scheduled mirroring
      • Scheduled snapshots
      • Data placement control
      • User access and tracking
      • Administrative permissions
      Example volume layout: /projects/tahoe, /projects/yosemite, /user/msmith, /user/bjohnson
  • 38. Multi-tenancy
      • Isolation
          – Tasks sandboxed so they don’t impact other tasks or system daemons
          – System resources protected from runaway jobs
          – Volume-based data placement
          – Label-based job scheduling
      • Quotas
          – Storage quotas by volume/user/group
          – CPU and memory quotas by queue/user/group
      • Security and delegation
          – Wire-level authentication and encryption (Kerberos not required)
          – Fine-grained administration permissions including volume-level delegation
          – Authenticate users to AD, LDAP and Kerberos via Linux PAM
      • Reporting
          – Detailed reporting on resource usage (75+ different metrics)
          – All reports are available via UI, CLI and REST API
  • 39. MapR Integrates Security into Hadoop
  • 40. Making Security Easy
      • > 99% of consumers accessing online banks use strong wire-level authentication
      • < 5% of organizations deploying Hadoop enable strong wire-level authentication
  • 41. Hadoop Security
      • Authorization to ensure the right access to files and databases
      • Authentication for users and user-created job requests
      • Encryption to ensure user credentials and data are always secure
      • Integration with existing security infrastructure
  • 42. … Along With Fine-Grained Access Control
      • Full POSIX permissions on files and directories
      • ACLs on tables, column families and columns
      • ACLs on MapReduce jobs and queues
      • Administration ACLs on cluster and volumes
      • Access control expressions for easy, role-based control
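The deck does not show what an access control expression looks like. The sketch below illustrates how a Boolean expression over users, groups and roles could be evaluated. The syntax (`u:name`, `g:name`, `r:name` combined with `!`, `&`, `|` and parentheses) and the precedence (`!` before `&` before `|`) are assumptions chosen for illustration, not a statement of the product's grammar.

```python
import re
from typing import List, Optional, Set

TOKEN_RE = re.compile(r"\s*([()&|!]|[ugr]:[\w.\-]+)")


def tokenize(expr: str) -> List[str]:
    """Split an expression into operators, parentheses and u:/g:/r: terms."""
    tokens, pos = [], 0
    expr = expr.strip()
    while pos < len(expr):
        match = TOKEN_RE.match(expr, pos)
        if not match:
            raise ValueError(f"unexpected input at position {pos}: {expr[pos:]!r}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def evaluate(expr: str, user: str, groups: Set[str], roles: Set[str]) -> bool:
    """Evaluate the expression for one user with the given groups and roles."""
    tokens = tokenize(expr)
    pos = 0

    def peek() -> Optional[str]:
        return tokens[pos] if pos < len(tokens) else None

    def advance() -> Optional[str]:
        nonlocal pos
        tok = peek()
        pos += 1
        return tok

    def parse_or() -> bool:
        value = parse_and()
        while peek() == "|":
            advance()
            rhs = parse_and()   # parse first so short-circuiting never skips tokens
            value = value or rhs
        return value

    def parse_and() -> bool:
        value = parse_not()
        while peek() == "&":
            advance()
            rhs = parse_not()
            value = value and rhs
        return value

    def parse_not() -> bool:
        if peek() == "!":
            advance()
            return not parse_not()
        return parse_atom()

    def parse_atom() -> bool:
        tok = advance()
        if tok is None:
            raise ValueError("unexpected end of expression")
        if tok == "(":
            value = parse_or()
            if advance() != ")":
                raise ValueError("missing closing parenthesis")
            return value
        if ":" not in tok:
            raise ValueError(f"unexpected token {tok!r}")
        kind, name = tok.split(":", 1)
        if kind == "u":
            return name == user
        if kind == "g":
            return name in groups
        return name in roles   # kind == "r"

    result = parse_or()
    if pos != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return result
```

For example, `evaluate("u:alice | (g:analysts & !r:contractor)", "bob", {"analysts"}, set())` grants access to an analyst who is not a contractor, and denies it once the `contractor` role is added.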
  • 43. Integration with Existing Security Infrastructure
      • SSO with existing Kerberos infrastructure (optional)
      • Linux PAM integration enables third-party user directories
      • MapR supports wire-level authentication with and without Kerberos
      [Diagram: a client without Kerberos sends username/password over HTTPS to the Hadoop cluster; a Kerberos-enabled client obtains a Kerberos service ticket from the Kerberos KDC; username/password checks go to the user directory (AD, LDAP, NIS, …) in the existing security infrastructure.]
  • 44. Cluster-wide Security
      All operations on Hadoop are secured natively, including:
      • User operations such as file reads and writes, database manipulations, MapReduce job submissions
      • Intra-cluster node-node interactions including remote procedure calls
      • Inter-cluster operations such as mirroring
      Native Security, Authentication, Ease of Deployment:
      • Hadoop initiates and maintains secure key communication* throughout the cluster without requiring external validation
      • Users authenticate themselves through a simple and secure login-password mechanism
      • All cluster nodes authenticate and interact with each other through secure keys
      * MapR leverages standard cryptography: NSA Suite B Cryptography (AES-256 and SHA-384)
  • 45. Performance Leader
  • 46. World-Record Performance
      • New MinuteSort world record: 1.65 TB in 1 minute on 298 nodes
      • Previous record: 1.6 TB with 2200 nodes
      • MapR: with a fraction of the hardware
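The "fraction of the hardware" claim can be made concrete by dividing each record by its node count. This is a small worked calculation that uses only the figures on the slide; decimal units (1 TB = 1000 GB) are assumed.

```python
def gb_per_node(total_tb: float, nodes: int) -> float:
    """Data sorted per node in the one-minute window, in GB (1 TB = 1000 GB)."""
    return total_tb * 1000 / nodes


new_record = gb_per_node(1.65, 298)        # figures from the slide
previous_record = gb_per_node(1.6, 2200)
per_node_ratio = new_record / previous_record
```

This works out to about 5.54 GB per node per minute for the new record against about 0.73 GB for the previous one, or roughly 7.6 times more data sorted per node.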
  • 47. Comparative Study of Hadoop Distributions: Read and Write Throughput Benchmarks
      [Charts: DFSIO read throughput and DFSIO write throughput, in MB per second, for IDH, CDH, HDP and MapR. Values shown on the charts: 475, 465; 212, 59; 262, 69; 276, 64.]
      Source: Flux7 Labs Study, October 2013
  • 48. MapR-DB: The Best In-Hadoop Database
      MapR Enterprise Database Edition (M7): the most scalable, enterprise-grade NoSQL database that supports online applications and analytics
      • NoSQL wide-column store
      • Apache HBase API
      • Integrated with Hadoop
      [Diagram: storage stack in other distros (HBase, JVM, HDFS, JVM, ext3/ext4, disks) compared with MapR (tables/files, disks).]
  • 49. Consistent, Low Latency
      [Chart: M7 read latency compared with others' read latency.]
  • 50. Operations + Analytics = Real-time, Personalized Services
      [Diagram: MapR Distribution for Hadoop serving real-time operational applications and analytics. Labels: online transactions, fraud detection, fraud model, personalized offers, recommendations table, clickstream analysis, fraud investigation tool, fraud investigator, interactive marketer.]
  • 51. Ensuring Your Success
  • 53. Committed to our Customers’ Success
      • Customer Support: Hadoop engineering experts provide 24x7x365 global coverage
      • Educational Services: instructor-led courses and Web-based training for Hadoop cluster administration, HBase and MapReduce programming, and more
      • Professional Services: Core Hadoop Services; Data Engineering; Advanced Analytics; M7/HBase Practice; Data Science
  • 54. Worldwide Presence & Customer Support
      [Map: office locations, with HQ marked.]
  • 55. Key MapR Advantage Partners
      Partner categories: Business Services; Infrastructure & Cloud; Analytics & Business Intelligence; Applications & OS; Consultants & Integrators; Data Warehouse & Integration
  • 56. Opportunity to Revolutionize Enterprise Data Architecture: From Redundant Processing Silos and Data Science Experiments…
  • 57. The Production Enterprise Data Hub: … to Consolidated Operational and Analytical Workloads
  • 58. Summary
      • Big Data: Hadoop
      • Best Product: Top Ranked
      • Business Impact: Production Success
  • 59. Q&A
      Engage with us! @mapr; maprtech; MapR; mapr-technologies; YOURNAME@mapr.com
  • 60. Extra slides
  • 61. Packages Supported by Various Distributions
      Color key on the slide: red = lacking, blue = leading. X = not included.
      Package | MapR 4.0.1 (Sep 2014) | Cloudera 5.1.2 (Aug 2014) | Hortonworks 2.1.5 (Aug 2014) | Apache versions (Sep 12th, 2014)
      Core Hadoop
          Hadoop Core, YARN | 2.4.1 | 2.3.0 | 2.4.0 | 2.5.1
      Batch
          MapReduce | MRv1 and MRv2 | MRv1 or MRv2 | MRv2 | MRv2
          Hive | 0.12, 0.13 | 0.12 | 0.13 | 0.13
          Tez | 0.4 (Dev Preview only) | X | 0.4 | 0.5
          Pig | 0.12 | 0.12 | 0.12 | 0.12
          Cascading | 2.1.6 | X | X | 2.5
          Spark | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
      Interactive SQL
          Impala | 1.2.3 | 1.4 | X | 1.4
          Drill | 0.5 | X | X | 0.5
          SparkSQL | 1.0.2 | X | 1.0.1 (Tech Preview only) | 1.1
      NoSQL and Search
          HBase/NoSQL | 0.94.2, 0.98.4, MapR-DB | 0.98 | 0.98, Accumulo 1.5.1 | HBase 0.98
          Phoenix | X | X | 4.0.0 | 4.1.0
          AsyncHBase | 1.5 | X | X | 1.5
          Search | LW (Solr) 2.6.1, 2.7 | Cloudera Search 1.5 | X | NA
      Machine Learning and Graph
          Mahout | 0.9 | 0.9 | 0.9 | 0.9
          MLLib/MLBase | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
          GraphX | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
      Streaming/Messaging
          Spark Streaming | 0.9.2, 1.0.2 | 1.0.0 | 1.0.1 (Tech Preview only) | 1.1
          Storm | 0.9, 0.9.2 (Certified) | X | 0.9.1 | 0.9.2
          Kafka | X | X | 0.8.1.1 (Tech Preview) | 0.8.1.1
      Data Integration
          Sqoop, Sqoop2 | 1.4.4, 1.99.3 | 1.4.4, 1.99.3 | 1.4.4 | 1.4.5
          Flume | 1.5.0 | 1.5.0 | 1.4.0 | 1.5.0
          Knox | X | X | 0.4 | 0.4
      Coordination
          Oozie | 4.0.1 | 4.0.0 | 4.0.0 | 4.0.1
          Zookeeper | 3.4.5 | 3.4.5 | 3.4.5 | 3.4.5
      GUI, Configuration, Monitoring
          Management | MCS | CM | Ambari | Ambari
          Hue | 3.5 | 3.6 | 2.5.1 | 3.6
      Sources:
          http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH5/latest/CDH-Version-and-Packaging-Information/cdhvd_cdh_package_tarball.html?scroll=topic_3_unique_8
          http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.5/bk_releasenotes_hdp_2.1/content/ch_relnotes-hdp-2.1.5-product.html
  • 62. Business Continuity: High Availability, Data Protection, Disaster Recovery. What are your requirements? What do you have for your enterprise storage, databases and data warehouses?
  • 63. The Cloud Leaders Pick MapR
      • Google chose MapR to provide Hadoop on Google Compute Engine
      • Amazon EMR is the largest Hadoop provider in revenue and # of clusters