SlideShare a Scribd company logo
1 of 37
Download to read offline
© 2015 MapR Technologies 1© 2015 MapR Technologies
© 2015 MapR Technologies 2
• The most common use cases for Hadoop
• The top considerations before "going live" with Hadoop
• Product Demo – multiple workloads in the Data Lake
Topics
© 2015 MapR Technologies 3
State of Big Data Adoption
Source: Gartner. Sept. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in 2014
© 2015 MapR Technologies 4© 2015 MapR Technologies
Top Hadoop Use Cases
© 2015 MapR Technologies 5
Speeding The Journey To Value
Operational
Batch
Create Data Capital
Big data novice Mature
Empower BI users
Operational
Applications
Mine
Logs
Recommendation
Engine
Data
Hub
Ad
Targeting360
View
Anomaly
detection
Fraud
preventionGet fast value
© 2015 MapR Technologies 6
The As-it-happens Business
Speeding The Journey To Value
Operational
Batch
Create Data Capital
Big data novice Mature
Empower BI users
Operational
Applications
Mine
Logs
Recommendation
Engine
Data
Hub
Ad
Targeting360
View
Anomaly
detection
Fraud
preventionGet fast value
© 2015 MapR Technologies 7
ENTERPRISE
DATA HUB
MARKETING
OPTIMIZATION
RISK & SECURITY
OPTIMIZATION
OPERATIONAL
INTELLIGENCE
• Multi-structured
data staging & archive
• ETL / DW optimization
• Mainframe
optimization
• Data exploration
• Recommendation
engines & targeting
• Customer 360
• Click-stream analysis
• Social media analysis
• Ad optimization
• Network security
monitoring
• Security information &
event management
• Fraudulent behavioral
analysis
• Supply chain & logistics
• System log analysis
• Manufacturing quality
assurance
• Preventative
maintenance
• Smart meter analysis
Common Use Cases: Taking Advantage of Hadoop
© 2015 MapR Technologies 8
Hadoop Use Cases by Industry
HEALTHCARE &
LIFE SCIENCES
GOVERNMENT
ADVERTISING, MEDIA
& ENTERTAINMENT
• Improved ad targeting, analysis,
forecasting and optimization
• Personalized recommendations
• Superior analytics capability
• Enhanced game player engagement
FINANCIAL SERVICES
• Fraud Detection
• Customer Segmentation Analysis
• Customer Sentiment Analysis
• Risk Aggregation
• Counterparty Risk Analytics
• New Products and Services for
Consumer Card Holders
• Credit Risk Assessment
• 360-Degree Customer Service
• Cybersecurity, Intelligence
• Crime Prediction and Prevention
• Defense, National Security
• Pharmaceutical Drug Evaluation
• Scientific Research
• Weather Forecasting
• Fraud Detection
• Emergency Communications/Response
• Traffic Optimization
TELECOMMANUFACTURING OIL & GAS RETAIL
• Personalized Treatment Planning
• Assisted Diagnosis
• Fraud Detection
• Monitor Patient Vital Signs
• Assembly Line Quality Assurance
• Preventive Maintenance
• Supply Chain and Logistics
• Monitoring Product Quality through
Telemetry Data
• Real-time Parts Flow Monitoring
• Product Configuration Planning
• Market Pricing and Planning
• Oil Exploration and Discovery
• New oil prospect identification
• Seismic trace identification
• Oil Production
• Equipment Maintenance
• Reservoir Engineering
• Safety and Environment
• Security
• Up-Sell/Cross-Sell Recommendations
• Social Media Analysis
• Dynamic Pricing Across Multiple
Channels
• Fraud Detection
• Clickstream Analysis
• Loyalty Program Benefits
• 360° Customer View
• Operational Intelligence
• Customer Churn Analysis
• Fraud Detection
• Clickstream Analysis
• Recommendations
• Product Development
• Network Management/Optimization
© 2015 MapR Technologies 9
900B
WORLDWIDE
BILLS
$
DATA STORED
10Years100M+
CARDS
45s
TERASORT
1.65TB
MINUTESORT
Offer Serving,
Credit Risk & Fraud
<
Largest deployment
in financial services
1700+
SAVED FOR
CARDHOLDERS
$100M
MapR Hadoop nodes
FINSERVICES
GOAL:
© 2015 MapR Technologies 10
Operations + Analytics = Real-time, Personalized Services
Fraud model
Recommendations
table
MapR Distribution including Hadoop
Fraud
investigator
Interactive
marketer
Online
transactions
Fraud
detection
Personalized
offers
Clickstream
analysis
Fraud
investigation tool
Real-time Operational Applications
Analytics
Customer
Support
© 2015 MapR Technologies 11
Hadoop + Data Warehouse Architecture
Improve data services to customers without increasing enterprise architecture costs
• Provide cloud, security, managed services, data center, & comms
• Report on customer usage, profiles, billing, and sales metrics
• Improve service: Measure service quality and repair metrics
• Reduce customer churn – identify and address IP network hotspots
• Cost of ETL & DW storage for growing IP and clickstream data; >3 months
• Reliability & cost of Hadoop alternatives limited ETL & storage offload
• MapR for data staging, ETL, and storage at 1/10th the cost
• MapR provided smallest datacenter footprint with best DR solution
• Enterprise-grade: NFS file management, consistent snapshots & mirroring
• Data warehouse for mission-critical reporting and analysis
OBJECTIVES
CHALLENGES
SOLUTION
Hadoop + Data Warehouse = New, Deeper Insights for the Business
• Increased scale to handle network IP and clickstream data
• Freed up processing on DW to maintain reporting SLA’s to business
• Unlocked new insights into network usage and customer preferences
Business
Impact
FORTUNE 500
TELCO
© 2015 MapR Technologies 12
MapR Optimized Data Architecture
Sources
RELATIONAL,
SAAS,
MAINFRAME
DOCUMENTS,
EMAILS
LOG FILES,
CLICKSTREAMS
SENSORS
BLOGS,
TWEETS,
LINK DATA
DATA WAREHOUSE
Data Movement
Data Access
Analytics
Search
Schema-less
data exploration
BI, reporting
Ad-hoc integrated
analytics
Data Transformation, Enrichment
and Integration
MAPR DISTRIBUTION FOR HADOOP
Streaming
(Spark Streaming,
Storm)
NoSQL ODBMS
(HBase, Accumulo, …)
MapR Data Platform
MapR-DB
MAPR DISTRIBUTION FOR HADOOP
Batch/Search
(MR, Spark, Hive, Pig)
MapR-FS
Operational Apps
Recommendations
Fraud Detection
Logistics
Optimized Data Architecture Machine Learning
Interactive
(Impala, Drill)
© 2015 MapR Technologies 13
 Bullet-proof data vault that meets SEC and FINRA requirements
 46x cost savings over legacy system
 Efficiency of MapR cluster that can store the Elasticsearch index for real-time search
Security Log Analysis & Enterprise Data Vault
F100 bank accelerates log analytics to meet investigation and compliance mandates
• Meet compliance requirements to minimize lawsuits and fines
• Complete IT audits more quickly
• Prior system (flat files on Unix) was difficult to maintain for operations team
• HA and data protection issues in HDFS put critical data at risk
• File volume (300K files/day) was straining system
• Seamless Hadoop file movement & management: MapR NFS
• MapReduce enables archival of data for historical search and analysis
• Data is indexed into Elasticsearch from MapR for real-time search
• Customizable user interface and dashboard: Kibana (ELK stack)
OBJECTIVES
CHALLENGES
SOLUTION
Business
Impact
LARGE FINANCIAL
SERVICES INSTITUTION
© 2015 MapR Technologies 14© 2015 MapR Technologies
Planning for Production Success with Hadoop
© 2015 MapR Technologies 15
Key Questions for
Big Data Planning
Source: Gartner. Jan 2015. Answering Big Data's 10 Biggest Planning and Implementation Questions
© 2015 MapR Technologies 16
Big Data is Overwhelming Traditional Systems
• Mission-critical reliability
• Transaction guarantees
• Deep security
• Real-time performance
• Backup and recovery
• Interactive SQL
• Rich analytics
• Workload management
• Data governance
• Backup and recovery
Enterprise
Data
Architecture
TREND
ENTERPRISE
USERS
OPERATIONAL
SYSTEMS
ANALYTICAL
SYSTEMS
PRODUCTION
REQUIREMENTS
PRODUCTION
REQUIREMENTS
OUTSIDE SOURCES
© 2015 MapR Technologies 17
OPERATIONAL
SYSTEMS
ANALYTICAL
SYSTEMS
ENTERPRISE
USERS
REALITY
• Data staging
• Archive
• Data transformation
• Data exploration
• Streaming,
interactions
Hadoop Relieves the Pressure from Enterprise Systems
2 Interoperability
1 Business continuity
4 Multi-tenacy
3 High performance
Keys for Production Success
© 2015 MapR Technologies 18
Key Reasons for Selecting the MapR Distribution including Hadoop
Respondents who have had prior experience with another Hadoop distribution*
* Apache Hadoop, Cloudera or Hortonworks
© 2015 MapR Technologies 19
Business Continuity
High
Availability
Data
Protection
Disaster
Recovery
What are your requirements?
What do you have for your enterprise storage,
databases and data warehouses?
© 2015 MapR Technologies 20
Seamless Integration with Direct Access NFS
• POSIX compliant
– Random reads/writes
– Simultaneous reading and writing to a file
– Compression is automatic and transparent
• Industry-standard NFS interface (in
addition to HDFS API)
– Stream data into the cluster
– Leverage thousands of tools and
applications
– Easier to use non-Java programming
languages
– No need for most proprietary Hadoop
connectors
• Compression/parallel access/security
from edge nodes to MapR cluster
© 2015 MapR Technologies 21
Narrow Foundations – Big and Fast are Separate
HDFS
Map/
Reduce
HBase
Spark /
Storm
Hive
RDBMS NAS
Sequential File
Processing
OLAP
Data
Mining
WEB SERVICES
Big Data is
heavy and
expensive
to move
© 2015 MapR Technologies 22
Unify Big & Fast on One Platform
HDFS
Map
Reduce
HBase
Spark /
Storm
Hive
RDBMS NAS
Sequential File
Processing
OLAP
Data
Mining
WEB SERVICES
NEXT GENERATION DISTRIBUTION
HADOOP API’S NFS
© 2015 MapR Technologies 23© 2015 MapR Technologies
What Makes MapR Different
© 2015 MapR Technologies 24
MapR: Best Solution for Customer Success
Premier
Investors
High Growth
2X Growth In Direct Customers
90% Subscription Licenses
Software Margins
140% Dollar-based Net Expansion
700+
Customers
2X Growth In Annual
Subscriptions ( ACV)
Best Product
Apache Open Source
© 2015 MapR Technologies 25
The Power of the Open Source Community
APACHE HADOOP AND OSS ECOSYSTEM
Security
YARN
Spark
Streaming
Storm
StreamingNoSQL &
Search
Juju
Provisioning
&
Coordination
Sahara
ML, Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governance
Pig
Cascading
Spark
Batch
MapReduce
v1 & v2
Tez
HBase
Solr
Hive
Impala
Spark SQL
Drill
SQL
Sentry Oozie ZooKeeperSqoop
Flume
Data
Integration
& Access
HttpFS
Hue
Data PlatformMapR-FS MapR-DB
Management
© 2015 MapR Technologies 26
The MapR Distribution including Apache Hadoop
APACHE HADOOP AND OSS ECOSYSTEM
Security
YARN
Spark
Streaming
Storm
StreamingNoSQL &
Search
Juju
Provisioning
&
Coordination
Sahara
ML, Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governance
Pig
Cascading
Spark
Batch
MapReduce
v1 & v2
Tez
HBase
Solr
Hive
Impala
Spark SQL
Drill
SQL
Sentry Oozie ZooKeeperSqoop
Flume
Data
Integration
& Access
HttpFS
Hue
Data PlatformMapR-FS MapR-DB
Management
Data HubEnterprise Grade Operational
© 2015 MapR Technologies 27
MapR Distribution including Hadoop
Theme Requirements Features Product
Enterprise Grade
• Uptime service levels
• Site to site DR
• Backup/recovery
• Security
• High velocity data ingress
• HW/SW HA
• Mirroring
• Snapshots
• Authorization, Kerberos
• 2X-5X performance
MapR
Enterprise Edition
Data Hub
• Hadoop
• Traditional applications
• Data of record
• Batch and interactive
• HDFS
• POSIX
• Strong consistency
• MapReduce and SQL
MapR
Enterprise Edition
Operational
• Real time
• NoSQL
• Operational analytics
• HBase
• Update in place
• Concurrent read/write
MapR
Enterprise Database Edition
MapR Patent Pending – “Table Format for Map Reduce”
“Map Reduce Ready Distributed File System”
Enterprise Grade
Operational
Data Hub
© 2015 MapR Technologies 28
Achievements: Triple Crown Of Analyst Ranking
© 2015 MapR Technologies 29
Apache Hadoop NameNode High Availability
NameNode
A B C D E F
HDFS-based Distributions
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
Primary NameNode
A B C D E F
Standby NameNode
A B C D E F
NameNode
A B
NameNode
C D
NameNode
E F
NameNode
A B
NameNode
C D
NameNode
E F
HDFS HA
HDFS
Federation
Single point of failure
Limited to 50-200 million files
Performance bottleneck
Metadata must fit in memory
Only one active NameNode
Limited to 50-200 million files
Performance bottleneck
Metadata must fit in memory
Double the block reports
Multiple single points
of failure w/o HA
Needs 20 NameNodes
for 1 Billion files
Performance bottleneck
Metadata must fit in memory
Double the block reports
© 2015 MapR Technologies 30
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
No-NameNode Architecture
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
DataNode
NameNode
A B C D E FAAA BBBB CCC DDD EEE FFF
Up to 1T files (> 5000x advantage)
Significantly less hardware & OpEx
Higher performance
No special config to enable HA
Automatic failover & re-replication
Metadata is persisted to disk
© 2015 MapR Technologies 31
© 2015 MapR Technologies 33
MapR: Fast and Dependable with Lowest TCO
Cost comparison for a 500 TB cluster vs HDFS-based distro’s
TCO: mapr.com/tco
© 2015 MapR Technologies 34© 2015 MapR Technologies
Product Demo: Multi-tenancy
© 2015 MapR Technologies 35
Committed to our Customers’ Success
Educational Services Professional Services Customer Support
Core
Hadoop
Services
Data
Engineering
Advanced
Analytics
M7/HBase
Practice
Hadoop engineering
experts provide
24x7x365
global coverage
Instructor-led courses &
Free On-Demand
training for Hadoop cluster
administration, HBase &
MapReduce programming
and more
Data
Engineering
Data
Science
© 2015 MapR Technologies 36
WORLDWIDE
PRESENCE &
CUSTOMER
SUPPORT
HQ
© 2015 MapR Technologies 37
Key MapR Advantage Partners
Business Services
INFRASTRUCTURE
& CLOUD
ANALYTICS &
BUSINESS INTELLIGENCE
APPLICATIONS
& OS
CONSULTANTS
& INTEGRATORS
DATA WAREHOUSE
& INTEGRATION
© 2015 MapR Technologies 38
Q&A
@mapr maprtech
info@mapr.com
Engage with us!
MapR
maprtech
mapr-technologies
GET STARTED NOW! mapr.com/sandbox

More Related Content

What's hot

Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersDataWorks Summit
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial marketsHortonworks
 
6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWSRackspace
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?Rackspace
 
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...DataWorks Summit
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industryParviz Iskhakov
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7Tony Pearson
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...GetInData
 
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...Impetus Technologies
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014
 
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdasBig data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdasProf Dr Mehmed ERDAS
 
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Databricks
 
ParStream - Big Data for Business Users
ParStream - Big Data for Business UsersParStream - Big Data for Business Users
ParStream - Big Data for Business UsersParStream Inc.
 
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service ProvidersMonetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service ProvidersCubic Corporation
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use CasesInSemble
 
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data AnalyticsTop 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data AnalyticsHortonworks
 

What's hot (20)

Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
Real time trade surveillance in financial markets
Real time trade surveillance in financial marketsReal time trade surveillance in financial markets
Real time trade surveillance in financial markets
 
6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?
 
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
 
Strategyzing big data in telco industry
Strategyzing big data in telco industryStrategyzing big data in telco industry
Strategyzing big data in telco industry
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7
 
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
 
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
 
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdasBig data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
 
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
 
Extreme Analytics @ eBay
Extreme Analytics @ eBayExtreme Analytics @ eBay
Extreme Analytics @ eBay
 
ParStream - Big Data for Business Users
ParStream - Big Data for Business UsersParStream - Big Data for Business Users
ParStream - Big Data for Business Users
 
Haven 2 0
Haven 2 0 Haven 2 0
Haven 2 0
 
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service ProvidersMonetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data AnalyticsTop 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
 
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 20135 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
 

Viewers also liked

Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionCodemotion
 
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in ProductionTugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in ProductionCodemotion
 
Hadoop and Manufacturing
Hadoop and ManufacturingHadoop and Manufacturing
Hadoop and ManufacturingCloudera, Inc.
 
Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)MapR Technologies
 
Digital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open SourceDigital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open Sourcesrisatish ambati
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoopMohit Tare
 
Troubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed DebuggingTroubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed DebuggingGreat Wide Open
 
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterAdvanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterEdureka!
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Mathieu Dumoulin
 
Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016Christoph Wurm
 
Architectural considerations for Hadoop Applications
Architectural considerations for Hadoop ApplicationsArchitectural considerations for Hadoop Applications
Architectural considerations for Hadoop Applicationshadooparchbook
 
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민NAVER D2
 
Which data should you move to Hadoop?
Which data should you move to Hadoop?Which data should you move to Hadoop?
Which data should you move to Hadoop?Attunity
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Zaloni
 
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch IntegrationMapR-DB Elasticsearch Integration
MapR-DB Elasticsearch IntegrationMapR Technologies
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceMapR Technologies
 

Viewers also liked (20)

Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in ProductionReal World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
 
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in ProductionTugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
 
Hadoop and Manufacturing
Hadoop and ManufacturingHadoop and Manufacturing
Hadoop and Manufacturing
 
Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)
 
Digital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open SourceDigital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open Source
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
Big Data Journey
Big Data JourneyBig Data Journey
Big Data Journey
 
Troubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed DebuggingTroubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed Debugging
 
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterAdvanced Security In Hadoop Cluster
Advanced Security In Hadoop Cluster
 
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
 
Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016
 
Architectural considerations for Hadoop Applications
Architectural considerations for Hadoop ApplicationsArchitectural considerations for Hadoop Applications
Architectural considerations for Hadoop Applications
 
Hadoop fault-tolerance
Hadoop fault-toleranceHadoop fault-tolerance
Hadoop fault-tolerance
 
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
 
Which data should you move to Hadoop?
Which data should you move to Hadoop?Which data should you move to Hadoop?
Which data should you move to Hadoop?
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
 
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch IntegrationMapR-DB Elasticsearch Integration
MapR-DB Elasticsearch Integration
 
Wayne Eckerson: Secrets of Analytical Leaders
Wayne Eckerson: Secrets of Analytical LeadersWayne Eckerson: Secrets of Analytical Leaders
Wayne Eckerson: Secrets of Analytical Leaders
 
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in FinanceHandling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
 

Similar to Key Considerations for Putting Hadoop in Production SlideShare

Powering the "As it Happens" Business
Powering the "As it Happens" BusinessPowering the "As it Happens" Business
Powering the "As it Happens" BusinessMapR Technologies
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with HadoopPrecisely
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014MapR Technologies
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentMapR Technologies
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTKiththi Perera
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardKiththi Perera
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiFelicia Haggarty
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...MapR Technologies
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapRThe World Bank
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...ervogler
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concretoHP Enterprise Italia
 
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate RisksLearn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate RisksMapR Technologies
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR Technologies
 

Similar to Key Considerations for Putting Hadoop in Production SlideShare (20)

Powering the "As it Happens" Business
Powering the "As it Happens" BusinessPowering the "As it Happens" Business
Powering the "As it Happens" Business
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with Hadoop
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environment
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
Meruvian - Introduction to MapR
Meruvian - Introduction to MapRMeruvian - Introduction to MapR
Meruvian - Introduction to MapR
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
 
Hadoop In The Real World
Hadoop In The Real WorldHadoop In The Real World
Hadoop In The Real World
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concreto
 
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate RisksLearn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 

More from MapR Technologies

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscapeMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsMapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageMapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformMapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...MapR Technologies
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data AnalyticsMapR Technologies
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsMapR Technologies
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR Technologies
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLMapR Technologies
 

More from MapR Technologies (20)

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
 

Key Considerations for Putting Hadoop in Production SlideShare

  • 1. © 2015 MapR Technologies 1© 2015 MapR Technologies
  • 2. © 2015 MapR Technologies 2 • The most common use cases for Hadoop • The top considerations before "going live" with Hadoop • Product Demo – multiple workloads in the Data Lake Topics
  • 3. © 2015 MapR Technologies 3 State of Big Data Adoption Source: Gartner. Sept. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in 2014
  • 4. © 2015 MapR Technologies 4© 2015 MapR Technologies Top Hadoop Use Cases
  • 5. © 2015 MapR Technologies 5 Speeding The Journey To Value Operational Batch Create Data Capital Big data novice Mature Empower BI users Operational Applications Mine Logs Recommendation Engine Data Hub Ad Targeting360 View Anomaly detection Fraud preventionGet fast value
  • 6. © 2015 MapR Technologies 6 The As-it-happens Business Speeding The Journey To Value Operational Batch Create Data Capital Big data novice Mature Empower BI users Operational Applications Mine Logs Recommendation Engine Data Hub Ad Targeting360 View Anomaly detection Fraud preventionGet fast value
  • 7. © 2015 MapR Technologies 7 ENTERPRISE DATA HUB MARKETING OPTIMIZATION RISK & SECURITY OPTIMIZATION OPERATIONAL INTELLIGENCE • Multi-structured data staging & archive • ETL / DW optimization • Mainframe optimization • Data exploration • Recommendation engines & targeting • Customer 360 • Click-stream analysis • Social media analysis • Ad optimization • Network security monitoring • Security information & event management • Fraudulent behavioral analysis • Supply chain & logistics • System log analysis • Manufacturing quality assurance • Preventative maintenance • Smart meter analysis Common Use Cases: Taking Advantage of Hadoop
  • 8. © 2015 MapR Technologies 8 Hadoop Use Cases by Industry HEALTHCARE & LIFE SCIENCES GOVERNMENT ADVERTISING, MEDIA & ENTERTAINMENT • Improved ad targeting, analysis, forecasting and optimization • Personalized recommendations • Superior analytics capability • Enhanced game player engagement FINANCIAL SERVICES • Fraud Detection • Customer Segmentation Analysis • Customer Sentiment Analysis • Risk Aggregation • Counterparty Risk Analytics • New Products and Services for Consumer Card Holders • Credit Risk Assessment • 360-Degree Customer Service • Cybersecurity, Intelligence • Crime Prediction and Prevention • Defense, National Security • Pharmaceutical Drug Evaluation • Scientific Research • Weather Forecasting • Fraud Detection • Emergency Communications/Response • Traffic Optimization TELECOMMANUFACTURING OIL & GAS RETAIL • Personalized Treatment Planning • Assisted Diagnosis • Fraud Detection • Monitor Patient Vital Signs • Assembly Line Quality Assurance • Preventive Maintenance • Supply Chain and Logistics • Monitoring Product Quality through Telemetry Data • Real-time Parts Flow Monitoring • Product Configuration Planning • Market Pricing and Planning • Oil Exploration and Discovery • New oil prospect identification • Seismic trace identification • Oil Production • Equipment Maintenance • Reservoir Engineering • Safety and Environment • Security • Up-Sell/Cross-Sell Recommendations • Social Media Analysis • Dynamic Pricing Across Multiple Channels • Fraud Detection • Clickstream Analysis • Loyalty Program Benefits • 360° Customer View • Operational Intelligence • Customer Churn Analysis • Fraud Detection • Clickstream Analysis • Recommendations • Product Development • Network Management/Optimization
  • 9. © 2015 MapR Technologies 9 900B WORLDWIDE BILLS $ DATA STORED 10Years100M+ CARDS 45s TERASORT 1.65TB MINUTESORT Offer Serving, Credit Risk & Fraud < Largest deployment in financial services 1700+ SAVED FOR CARDHOLDERS $100M MapR Hadoop nodes FINSERVICES GOAL:
  • 10. © 2015 MapR Technologies 10 Operations + Analytics = Real-time, Personalized Services Fraud model Recommendations table MapR Distribution including Hadoop Fraud investigator Interactive marketer Online transactions Fraud detection Personalized offers Clickstream analysis Fraud investigation tool Real-time Operational Applications Analytics Customer Support
  • 11. © 2015 MapR Technologies 11 Hadoop + Data Warehouse Architecture Improve data services to customers without increasing enterprise architecture costs • Provide cloud, security, managed services, data center, & comms • Report on customer usage, profiles, billing, and sales metrics • Improve service: Measure service quality and repair metrics • Reduce customer churn – identify and address IP network hotspots • Cost of ETL & DW storage for growing IP and clickstream data; >3 months • Reliability & cost of Hadoop alternatives limited ETL & storage offload • MapR for data staging, ETL, and storage at 1/10th the cost • MapR provided smallest datacenter footprint with best DR solution • Enterprise-grade: NFS file management, consistent snapshots & mirroring • Data warehouse for mission-critical reporting and analysis OBJECTIVES CHALLENGES SOLUTION Hadoop + Data Warehouse = New, Deeper Insights for the Business • Increased scale to handle network IP and clickstream data • Freed up processing on DW to maintain reporting SLA’s to business • Unlocked new insights into network usage and customer preferences Business Impact FORTUNE 500 TELCO
  • 12. © 2015 MapR Technologies 12 MapR Optimized Data Architecture Sources RELATIONAL, SAAS, MAINFRAME DOCUMENTS, EMAILS LOG FILES, CLICKSTREAMS SENSORS BLOGS, TWEETS, LINK DATA DATA WAREHOUSE Data Movement Data Access Analytics Search Schema-less data exploration BI, reporting Ad-hoc integrated analytics Data Transformation, Enrichment and Integration MAPR DISTRIBUTION FOR HADOOP Streaming (Spark Streaming, Storm) NoSQL ODBMS (HBase, Accumulo, …) MapR Data Platform MapR-DB MAPR DISTRIBUTION FOR HADOOP Batch/Search (MR, Spark, Hive, Pig) MapR-FS Operational Apps Recommendations Fraud Detection Logistics Optimized Data Architecture Machine Learning Interactive (Impala, Drill)
  • 13. © 2015 MapR Technologies 13  Bullet-proof data vault that meets SEC and FINRA requirements  46x cost savings over legacy system  Efficiency of MapR cluster that can store the Elasticsearch index for real-time search Security Log Analysis & Enterprise Data Vault F100 bank accelerates log analytics to meet investigation and compliance mandates • Meet compliance requirements to minimize lawsuits and fines • Complete IT audits more quickly • Prior system (flat files on Unix) was difficult to maintain for operations team • HA and data protection issues in HDFS put critical data at risk • File volume (300K files/day) was straining system • Seamless Hadoop file movement & management: MapR NFS • MapReduce enables archival of data for historical search and analysis • Data is indexed into Elasticsearch from MapR for real-time search • Customizable user interface and dashboard: Kibana (ELK stack) OBJECTIVES CHALLENGES SOLUTION Business Impact LARGE FINANCIAL SERVICES INSTITUTION
  • 14. © 2015 MapR Technologies 14© 2015 MapR Technologies Planning for Production Success with Hadoop
  • 15. © 2015 MapR Technologies 15 Key Questions for Big Data Planning Source: Gartner. Jan 2015. Answering Big Data's 10 Biggest Planning and Implementation Questions
  • 16. © 2015 MapR Technologies 16 Big Data is Overwhelming Traditional Systems • Mission-critical reliability • Transaction guarantees • Deep security • Real-time performance • Backup and recovery • Interactive SQL • Rich analytics • Workload management • Data governance • Backup and recovery Enterprise Data Architecture TREND ENTERPRISE USERS OPERATIONAL SYSTEMS ANALYTICAL SYSTEMS PRODUCTION REQUIREMENTS PRODUCTION REQUIREMENTS OUTSIDE SOURCES
  • 17. © 2015 MapR Technologies 17 OPERATIONAL SYSTEMS ANALYTICAL SYSTEMS ENTERPRISE USERS REALITY • Data staging • Archive • Data transformation • Data exploration • Streaming, interactions Hadoop Relieves the Pressure from Enterprise Systems 2 Interoperability 1 Business continuity 4 Multi-tenacy 3 High performance Keys for Production Success
  • 18. © 2015 MapR Technologies 18 Key Reasons for Selecting the MapR Distribution including Hadoop Respondents who have had prior experience with another Hadoop distribution* * Apache Hadoop, Cloudera or Hortonworks
  • 19. © 2015 MapR Technologies 19 Business Continuity High Availability Data Protection Disaster Recovery What are your requirements? What do you have for your enterprise storage, databases and data warehouses?
  • 20. © 2015 MapR Technologies 20 Seamless Integration with Direct Access NFS • POSIX compliant – Random reads/writes – Simultaneous reading and writing to a file – Compression is automatic and transparent • Industry-standard NFS interface (in addition to HDFS API) – Stream data into the cluster – Leverage thousands of tools and applications – Easier to use non-Java programming languages – No need for most proprietary Hadoop connectors • Compression/parallel access/security from edge nodes to MapR cluster
  • 21. © 2015 MapR Technologies 21 Narrow Foundations – Big and Fast are Separate HDFS Map/ Reduce HBase Spark / Storm Hive RDBMS NAS Sequential File Processing OLAP Data Mining WEB SERVICES Big Data is heavy and expensive to move
  • 22. © 2015 MapR Technologies 22 Unify Big & Fast on One Platform HDFS Map Reduce HBase Spark / Storm Hive RDBMS NAS Sequential File Processing OLAP Data Mining WEB SERVICES NEXT GENERATION DISTRIBUTION HADOOP API’S NFS
  • 23. © 2015 MapR Technologies 23© 2015 MapR Technologies What Makes MapR Different
  • 24. © 2015 MapR Technologies 24 MapR: Best Solution for Customer Success Premier Investors High Growth 2X Growth In Direct Customers 90% Subscription Licenses Software Margins 140% Dollar-based Net Expansion 700+ Customers 2X Growth In Annual Subscriptions ( ACV) Best Product Apache Open Source
  • 25. © 2015 MapR Technologies 25 The Power of the Open Source Community APACHE HADOOP AND OSS ECOSYSTEM Security YARN Spark Streaming Storm StreamingNoSQL & Search Juju Provisioning & Coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Cascading Spark Batch MapReduce v1 & v2 Tez HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue Data PlatformMapR-FS MapR-DB Management
  • 26. © 2015 MapR Technologies 26 The MapR Distribution including Apache Hadoop APACHE HADOOP AND OSS ECOSYSTEM Security YARN Spark Streaming Storm StreamingNoSQL & Search Juju Provisioning & Coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Cascading Spark Batch MapReduce v1 & v2 Tez HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue Data PlatformMapR-FS MapR-DB Management Data HubEnterprise Grade Operational
  • 27. © 2015 MapR Technologies 27 MapR Distribution including Hadoop Theme Requirements Features Product Enterprise Grade • Uptime service levels • Site to site DR • Backup/recovery • Security • High velocity data ingress • HW/SW HA • Mirroring • Snapshots • Authorization, Kerberos • 2X-5X performance MapR Enterprise Edition Data Hub • Hadoop • Traditional applications • Data of record • Batch and interactive • HDFS • POSIX • Strong consistency • MapReduce and SQL MapR Enterprise Edition Operational • Real time • NoSQL • Operational analytics • HBase • Update in place • Concurrent read/write MapR Enterprise Database Edition MapR Patent Pending – “Table Format for Map Reduce” “Map Reduce Ready Distributed File System” Enterprise Grade Operational Data Hub
  • 28. © 2015 MapR Technologies 28 Achievements: Triple Crown Of Analyst Ranking
  • 29. © 2015 MapR Technologies 29 Apache Hadoop NameNode High Availability NameNode A B C D E F HDFS-based Distributions DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode Primary NameNode A B C D E F Standby NameNode A B C D E F NameNode A B NameNode C D NameNode E F NameNode A B NameNode C D NameNode E F HDFS HA HDFS Federation Single point of failure Limited to 50-200 million files Performance bottleneck Metadata must fit in memory Only one active NameNode Limited to 50-200 million files Performance bottleneck Metadata must fit in memory Double the block reports Multiple single points of failure w/o HA Needs 20 NameNodes for 1 Billion files Performance bottleneck Metadata must fit in memory Double the block reports
  • 30. © 2015 MapR Technologies 30 DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode No-NameNode Architecture DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode NameNode A B C D E FAAA BBBB CCC DDD EEE FFF Up to 1T files (> 5000x advantage) Significantly less hardware & OpEx Higher performance No special config to enable HA Automatic failover & re-replication Metadata is persisted to disk
  • 31. © 2015 MapR Technologies 31
  • 32. © 2015 MapR Technologies 33 MapR: Fast and Dependable with Lowest TCO Cost comparison for a 500 TB cluster vs HDFS-based distro’s TCO: mapr.com/tco
  • 33. © 2015 MapR Technologies 34© 2015 MapR Technologies Product Demo: Multi-tenancy
  • 34. © 2015 MapR Technologies 35 Committed to our Customers’ Success Educational Services Professional Services Customer Support Core Hadoop Services Data Engineering Advanced Analytics M7/HBase Practice Hadoop engineering experts provide 24x7x365 global coverage Instructor-led courses & Free On-Demand training for Hadoop cluster administration, HBase & MapReduce programming and more Data Engineering Data Science
  • 35. © 2015 MapR Technologies 36 WORLDWIDE PRESENCE & CUSTOMER SUPPORT HQ
  • 36. © 2015 MapR Technologies 37 Key MapR Advantage Partners Business Services INFRASTRUCTURE & CLOUD ANALYTICS & BUSINESS INTELLIGENCE APPLICATIONS & OS CONSULTANTS & INTEGRATORS DATA WAREHOUSE & INTEGRATION
  • 37. © 2015 MapR Technologies 38 Q&A @mapr maprtech info@mapr.com Engage with us! MapR maprtech mapr-technologies GET STARTED NOW! mapr.com/sandbox