Submit Search
Upload
Key Considerations for Putting Hadoop in Production SlideShare
•
1 like
•
1,179 views
MapR Technologies
Follow
Ensuring Production Success for Apache Hadoop: Product Demonstration of Multi-tenancy
Read less
Read more
Report
Share
Report
Share
1 of 37
Download now
Download to read offline
Recommended
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Publicis Sapient Engineering
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
Datameer
Capturing big value in big data
Capturing big value in big data
BSP Media Group
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Cloudera, Inc.
Bigdata based fraud detection
Bigdata based fraud detection
Mk Kim
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Hortonworks
Overview of analytics and big data in practice
Overview of analytics and big data in practice
Vivek Murugesan
IBM Software Day 2013. Smarter analytics and big data. building the next gene...
IBM Software Day 2013. Smarter analytics and big data. building the next gene...
IBM (Middle East and Africa)
Recommended
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Publicis Sapient Engineering
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
Datameer
Capturing big value in big data
Capturing big value in big data
BSP Media Group
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Cloudera, Inc.
Bigdata based fraud detection
Bigdata based fraud detection
Mk Kim
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Hortonworks
Overview of analytics and big data in practice
Overview of analytics and big data in practice
Vivek Murugesan
IBM Software Day 2013. Smarter analytics and big data. building the next gene...
IBM Software Day 2013. Smarter analytics and big data. building the next gene...
IBM (Middle East and Africa)
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
Hortonworks
6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS
Rackspace
How Startups can leverage big data?
How Startups can leverage big data?
Rackspace
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
DataWorks Summit
Strategyzing big data in telco industry
Strategyzing big data in telco industry
Parviz Iskhakov
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
Capgemini
S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7
Tony Pearson
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
GetInData
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Impetus Technologies
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Prof Dr Mehmed ERDAS
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Databricks
Extreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
ParStream - Big Data for Business Users
ParStream - Big Data for Business Users
ParStream Inc.
Haven 2 0
Haven 2 0
Data Science Warsaw
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Cubic Corporation
Big Data Use Cases
Big Data Use Cases
InSemble
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
Hortonworks
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
Infochimps, a CSC Big Data Business
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
More Related Content
What's hot
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
DataWorks Summit
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
Hortonworks
6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS
Rackspace
How Startups can leverage big data?
How Startups can leverage big data?
Rackspace
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
DataWorks Summit
Strategyzing big data in telco industry
Strategyzing big data in telco industry
Parviz Iskhakov
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
Capgemini
S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7
Tony Pearson
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
GetInData
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Impetus Technologies
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Prof Dr Mehmed ERDAS
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Databricks
Extreme Analytics @ eBay
Extreme Analytics @ eBay
DataWorks Summit/Hadoop Summit
ParStream - Big Data for Business Users
ParStream - Big Data for Business Users
ParStream Inc.
Haven 2 0
Haven 2 0
Data Science Warsaw
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Cubic Corporation
Big Data Use Cases
Big Data Use Cases
InSemble
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
Hortonworks
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
Infochimps, a CSC Big Data Business
What's hot
(20)
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
Real time trade surveillance in financial markets
Real time trade surveillance in financial markets
6 Commonly Asked Questions from Customers Building on AWS
6 Commonly Asked Questions from Customers Building on AWS
How Startups can leverage big data?
How Startups can leverage big data?
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Large Scale Graph Processing & Machine Learning Algorithms for Payment Fraud ...
Strategyzing big data in telco industry
Strategyzing big data in telco industry
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
S ba0881 big-data-use-cases-pearson-edge2015-v7
S ba0881 big-data-use-cases-pearson-edge2015-v7
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Understanding Big Data Analytics - solutions for growing businesses - Rafał M...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Bank Struggles Along the Way for the Holy Grail of Personalization: Customer 360
Extreme Analytics @ eBay
Extreme Analytics @ eBay
ParStream - Big Data for Business Users
ParStream - Big Data for Business Users
Haven 2 0
Haven 2 0
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Big Data Use Cases
Big Data Use Cases
Top 5 Strategies for Retail Data Analytics
Top 5 Strategies for Retail Data Analytics
5 Big Data Use Cases for 2013
5 Big Data Use Cases for 2013
Viewers also liked
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Codemotion
Hadoop and Manufacturing
Hadoop and Manufacturing
Cloudera, Inc.
Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)
MapR Technologies
Digital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open Source
srisatish ambati
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
Edgar Alejandro Villegas
Big data and hadoop
Big data and hadoop
Mohit Tare
Big Data Journey
Big Data Journey
Tugdual Grall
Troubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed Debugging
Great Wide Open
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop Cluster
Edureka!
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Mathieu Dumoulin
Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016
Christoph Wurm
Architectural considerations for Hadoop Applications
Architectural considerations for Hadoop Applications
hadooparchbook
Hadoop fault-tolerance
Hadoop fault-tolerance
Ravindra Bandara
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
NAVER D2
Which data should you move to Hadoop?
Which data should you move to Hadoop?
Attunity
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
Zaloni
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch Integration
MapR Technologies
Wayne Eckerson: Secrets of Analytical Leaders
Wayne Eckerson: Secrets of Analytical Leaders
Pivotal Analytics (Cetas Analytics)
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
MapR Technologies
Viewers also liked
(20)
Real World Use Cases: Hadoop and NoSQL in Production
Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Tugdual Grall - Real World Use Cases: Hadoop and NoSQL in Production
Hadoop and Manufacturing
Hadoop and Manufacturing
Map r hadoop-security-mar2014 (2)
Map r hadoop-security-mar2014 (2)
Digital Transformation with AI and Data - H2O.ai and Open Source
Digital Transformation with AI and Data - H2O.ai and Open Source
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
Big data and hadoop
Big data and hadoop
Big Data Journey
Big Data Journey
Troubleshooting Hadoop: Distributed Debugging
Troubleshooting Hadoop: Distributed Debugging
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop Cluster
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Real-World Machine Learning - Leverage the Features of MapR Converged Data Pl...
Why Elastic? @ 50th Vinitaly 2016
Why Elastic? @ 50th Vinitaly 2016
Architectural considerations for Hadoop Applications
Architectural considerations for Hadoop Applications
Hadoop fault-tolerance
Hadoop fault-tolerance
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
Which data should you move to Hadoop?
Which data should you move to Hadoop?
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
MapR-DB Elasticsearch Integration
MapR-DB Elasticsearch Integration
Wayne Eckerson: Secrets of Analytical Leaders
Wayne Eckerson: Secrets of Analytical Leaders
Handling the Extremes: Scaling and Streaming in Finance
Handling the Extremes: Scaling and Streaming in Finance
Similar to Key Considerations for Putting Hadoop in Production SlideShare
Powering the "As it Happens" Business
Powering the "As it Happens" Business
MapR Technologies
How Experian increased insights with Hadoop
How Experian increased insights with Hadoop
Precisely
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
MapR Technologies
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environment
MapR Technologies
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Kiththi Perera
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
Kiththi Perera
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
Nicolas Morales
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Felicia Haggarty
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
MapR Technologies
Meruvian - Introduction to MapR
Meruvian - Introduction to MapR
The World Bank
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
ervogler
Hadoop In The Real World
Hadoop In The Real World
MapR Technologies
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
Big Data and Analytics
Big Data and Analytics
Cameron. A. Bradbury
Big Data and Analytics
Big Data and Analytics
Cameron. A. Bradbury
Come fare business con i big data in concreto
Come fare business con i big data in concreto
HP Enterprise Italia
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
MapR Technologies
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR Technologies
Similar to Key Considerations for Putting Hadoop in Production SlideShare
(20)
Powering the "As it Happens" Business
Powering the "As it Happens" Business
How Experian increased insights with Hadoop
How Experian increased insights with Hadoop
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environment
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Meruvian - Introduction to MapR
Meruvian - Introduction to MapR
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Hadoop In The Real World
Hadoop In The Real World
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
Big Data and Analytics
Big Data and Analytics
Big Data and Analytics
Big Data and Analytics
Come fare business con i big data in concreto
Come fare business con i big data in concreto
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
Learn How Financial Services Organizations Can Use Big Data to Mitigate Risks
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
More from MapR Technologies
Converging your data landscape
Converging your data landscape
MapR Technologies
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
MapR Technologies
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
MapR Technologies
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
MapR Technologies
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
MapR Technologies
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
MapR Technologies
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
MapR Technologies
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
MapR Technologies
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
MapR Technologies
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
MapR Technologies
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
MapR Technologies
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
MapR Technologies
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
MapR Technologies
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
MapR Technologies
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Technologies
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
MapR Technologies
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
MapR Technologies
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
MapR Technologies
MapR and Cisco Make IT Better
MapR and Cisco Make IT Better
MapR Technologies
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
MapR Technologies
More from MapR Technologies
(20)
Converging your data landscape
Converging your data landscape
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
MapR and Cisco Make IT Better
MapR and Cisco Make IT Better
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
Key Considerations for Putting Hadoop in Production SlideShare
1.
© 2015 MapR
Technologies 1© 2015 MapR Technologies
2.
© 2015 MapR
Technologies 2 • The most common use cases for Hadoop • The top considerations before "going live" with Hadoop • Product Demo – multiple workloads in the Data Lake Topics
3.
© 2015 MapR
Technologies 3 State of Big Data Adoption Source: Gartner. Sept. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in 2014
4.
© 2015 MapR
Technologies 4© 2015 MapR Technologies Top Hadoop Use Cases
5.
© 2015 MapR
Technologies 5 Speeding The Journey To Value Operational Batch Create Data Capital Big data novice Mature Empower BI users Operational Applications Mine Logs Recommendation Engine Data Hub Ad Targeting360 View Anomaly detection Fraud preventionGet fast value
6.
© 2015 MapR
Technologies 6 The As-it-happens Business Speeding The Journey To Value Operational Batch Create Data Capital Big data novice Mature Empower BI users Operational Applications Mine Logs Recommendation Engine Data Hub Ad Targeting360 View Anomaly detection Fraud preventionGet fast value
7.
© 2015 MapR
Technologies 7 ENTERPRISE DATA HUB MARKETING OPTIMIZATION RISK & SECURITY OPTIMIZATION OPERATIONAL INTELLIGENCE • Multi-structured data staging & archive • ETL / DW optimization • Mainframe optimization • Data exploration • Recommendation engines & targeting • Customer 360 • Click-stream analysis • Social media analysis • Ad optimization • Network security monitoring • Security information & event management • Fraudulent behavioral analysis • Supply chain & logistics • System log analysis • Manufacturing quality assurance • Preventative maintenance • Smart meter analysis Common Use Cases: Taking Advantage of Hadoop
8.
© 2015 MapR
Technologies 8 Hadoop Use Cases by Industry HEALTHCARE & LIFE SCIENCES GOVERNMENT ADVERTISING, MEDIA & ENTERTAINMENT • Improved ad targeting, analysis, forecasting and optimization • Personalized recommendations • Superior analytics capability • Enhanced game player engagement FINANCIAL SERVICES • Fraud Detection • Customer Segmentation Analysis • Customer Sentiment Analysis • Risk Aggregation • Counterparty Risk Analytics • New Products and Services for Consumer Card Holders • Credit Risk Assessment • 360-Degree Customer Service • Cybersecurity, Intelligence • Crime Prediction and Prevention • Defense, National Security • Pharmaceutical Drug Evaluation • Scientific Research • Weather Forecasting • Fraud Detection • Emergency Communications/Response • Traffic Optimization TELECOMMANUFACTURING OIL & GAS RETAIL • Personalized Treatment Planning • Assisted Diagnosis • Fraud Detection • Monitor Patient Vital Signs • Assembly Line Quality Assurance • Preventive Maintenance • Supply Chain and Logistics • Monitoring Product Quality through Telemetry Data • Real-time Parts Flow Monitoring • Product Configuration Planning • Market Pricing and Planning • Oil Exploration and Discovery • New oil prospect identification • Seismic trace identification • Oil Production • Equipment Maintenance • Reservoir Engineering • Safety and Environment • Security • Up-Sell/Cross-Sell Recommendations • Social Media Analysis • Dynamic Pricing Across Multiple Channels • Fraud Detection • Clickstream Analysis • Loyalty Program Benefits • 360° Customer View • Operational Intelligence • Customer Churn Analysis • Fraud Detection • Clickstream Analysis • Recommendations • Product Development • Network Management/Optimization
9.
© 2015 MapR
Technologies 9 900B WORLDWIDE BILLS $ DATA STORED 10Years100M+ CARDS 45s TERASORT 1.65TB MINUTESORT Offer Serving, Credit Risk & Fraud < Largest deployment in financial services 1700+ SAVED FOR CARDHOLDERS $100M MapR Hadoop nodes FINSERVICES GOAL:
10.
© 2015 MapR
Technologies 10 Operations + Analytics = Real-time, Personalized Services Fraud model Recommendations table MapR Distribution including Hadoop Fraud investigator Interactive marketer Online transactions Fraud detection Personalized offers Clickstream analysis Fraud investigation tool Real-time Operational Applications Analytics Customer Support
11.
© 2015 MapR
Technologies 11 Hadoop + Data Warehouse Architecture Improve data services to customers without increasing enterprise architecture costs • Provide cloud, security, managed services, data center, & comms • Report on customer usage, profiles, billing, and sales metrics • Improve service: Measure service quality and repair metrics • Reduce customer churn – identify and address IP network hotspots • Cost of ETL & DW storage for growing IP and clickstream data; >3 months • Reliability & cost of Hadoop alternatives limited ETL & storage offload • MapR for data staging, ETL, and storage at 1/10th the cost • MapR provided smallest datacenter footprint with best DR solution • Enterprise-grade: NFS file management, consistent snapshots & mirroring • Data warehouse for mission-critical reporting and analysis OBJECTIVES CHALLENGES SOLUTION Hadoop + Data Warehouse = New, Deeper Insights for the Business • Increased scale to handle network IP and clickstream data • Freed up processing on DW to maintain reporting SLA’s to business • Unlocked new insights into network usage and customer preferences Business Impact FORTUNE 500 TELCO
12.
© 2015 MapR
Technologies 12 MapR Optimized Data Architecture Sources RELATIONAL, SAAS, MAINFRAME DOCUMENTS, EMAILS LOG FILES, CLICKSTREAMS SENSORS BLOGS, TWEETS, LINK DATA DATA WAREHOUSE Data Movement Data Access Analytics Search Schema-less data exploration BI, reporting Ad-hoc integrated analytics Data Transformation, Enrichment and Integration MAPR DISTRIBUTION FOR HADOOP Streaming (Spark Streaming, Storm) NoSQL ODBMS (HBase, Accumulo, …) MapR Data Platform MapR-DB MAPR DISTRIBUTION FOR HADOOP Batch/Search (MR, Spark, Hive, Pig) MapR-FS Operational Apps Recommendations Fraud Detection Logistics Optimized Data Architecture Machine Learning Interactive (Impala, Drill)
13.
© 2015 MapR
Technologies 13 Bullet-proof data vault that meets SEC and FINRA requirements 46x cost savings over legacy system Efficiency of MapR cluster that can store the Elasticsearch index for real-time search Security Log Analysis & Enterprise Data Vault F100 bank accelerates log analytics to meet investigation and compliance mandates • Meet compliance requirements to minimize lawsuits and fines • Complete IT audits more quickly • Prior system (flat files on Unix) was difficult to maintain for operations team • HA and data protection issues in HDFS put critical data at risk • File volume (300K files/day) was straining system • Seamless Hadoop file movement & management: MapR NFS • MapReduce enables archival of data for historical search and analysis • Data is indexed into Elasticsearch from MapR for real-time search • Customizable user interface and dashboard: Kibana (ELK stack) OBJECTIVES CHALLENGES SOLUTION Business Impact LARGE FINANCIAL SERVICES INSTITUTION
14.
© 2015 MapR
Technologies 14© 2015 MapR Technologies Planning for Production Success with Hadoop
15.
© 2015 MapR
Technologies 15 Key Questions for Big Data Planning Source: Gartner. Jan 2015. Answering Big Data's 10 Biggest Planning and Implementation Questions
16.
© 2015 MapR
Technologies 16 Big Data is Overwhelming Traditional Systems • Mission-critical reliability • Transaction guarantees • Deep security • Real-time performance • Backup and recovery • Interactive SQL • Rich analytics • Workload management • Data governance • Backup and recovery Enterprise Data Architecture TREND ENTERPRISE USERS OPERATIONAL SYSTEMS ANALYTICAL SYSTEMS PRODUCTION REQUIREMENTS PRODUCTION REQUIREMENTS OUTSIDE SOURCES
17.
© 2015 MapR
Technologies 17 OPERATIONAL SYSTEMS ANALYTICAL SYSTEMS ENTERPRISE USERS REALITY • Data staging • Archive • Data transformation • Data exploration • Streaming, interactions Hadoop Relieves the Pressure from Enterprise Systems 2 Interoperability 1 Business continuity 4 Multi-tenacy 3 High performance Keys for Production Success
18.
© 2015 MapR
Technologies 18 Key Reasons for Selecting the MapR Distribution including Hadoop Respondents who have had prior experience with another Hadoop distribution* * Apache Hadoop, Cloudera or Hortonworks
19.
© 2015 MapR
Technologies 19 Business Continuity High Availability Data Protection Disaster Recovery What are your requirements? What do you have for your enterprise storage, databases and data warehouses?
20.
© 2015 MapR
Technologies 20 Seamless Integration with Direct Access NFS • POSIX compliant – Random reads/writes – Simultaneous reading and writing to a file – Compression is automatic and transparent • Industry-standard NFS interface (in addition to HDFS API) – Stream data into the cluster – Leverage thousands of tools and applications – Easier to use non-Java programming languages – No need for most proprietary Hadoop connectors • Compression/parallel access/security from edge nodes to MapR cluster
21.
© 2015 MapR
Technologies 21 Narrow Foundations – Big and Fast are Separate HDFS Map/ Reduce HBase Spark / Storm Hive RDBMS NAS Sequential File Processing OLAP Data Mining WEB SERVICES Big Data is heavy and expensive to move
22.
© 2015 MapR
Technologies 22 Unify Big & Fast on One Platform HDFS Map Reduce HBase Spark / Storm Hive RDBMS NAS Sequential File Processing OLAP Data Mining WEB SERVICES NEXT GENERATION DISTRIBUTION HADOOP API’S NFS
23.
© 2015 MapR
Technologies 23© 2015 MapR Technologies What Makes MapR Different
24.
© 2015 MapR
Technologies 24 MapR: Best Solution for Customer Success Premier Investors High Growth 2X Growth In Direct Customers 90% Subscription Licenses Software Margins 140% Dollar-based Net Expansion 700+ Customers 2X Growth In Annual Subscriptions ( ACV) Best Product Apache Open Source
25.
© 2015 MapR
Technologies 25 The Power of the Open Source Community APACHE HADOOP AND OSS ECOSYSTEM Security YARN Spark Streaming Storm StreamingNoSQL & Search Juju Provisioning & Coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Cascading Spark Batch MapReduce v1 & v2 Tez HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue Data PlatformMapR-FS MapR-DB Management
26.
© 2015 MapR
Technologies 26 The MapR Distribution including Apache Hadoop APACHE HADOOP AND OSS ECOSYSTEM Security YARN Spark Streaming Storm StreamingNoSQL & Search Juju Provisioning & Coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Cascading Spark Batch MapReduce v1 & v2 Tez HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue Data PlatformMapR-FS MapR-DB Management Data HubEnterprise Grade Operational
27.
© 2015 MapR
Technologies 27 MapR Distribution including Hadoop Theme Requirements Features Product Enterprise Grade • Uptime service levels • Site to site DR • Backup/recovery • Security • High velocity data ingress • HW/SW HA • Mirroring • Snapshots • Authorization, Kerberos • 2X-5X performance MapR Enterprise Edition Data Hub • Hadoop • Traditional applications • Data of record • Batch and interactive • HDFS • POSIX • Strong consistency • MapReduce and SQL MapR Enterprise Edition Operational • Real time • NoSQL • Operational analytics • HBase • Update in place • Concurrent read/write MapR Enterprise Database Edition MapR Patent Pending – “Table Format for Map Reduce” “Map Reduce Ready Distributed File System” Enterprise Grade Operational Data Hub
28.
© 2015 MapR
Technologies 28 Achievements: Triple Crown Of Analyst Ranking
29.
© 2015 MapR
Technologies 29 Apache Hadoop NameNode High Availability NameNode A B C D E F HDFS-based Distributions DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode Primary NameNode A B C D E F Standby NameNode A B C D E F NameNode A B NameNode C D NameNode E F NameNode A B NameNode C D NameNode E F HDFS HA HDFS Federation Single point of failure Limited to 50-200 million files Performance bottleneck Metadata must fit in memory Only one active NameNode Limited to 50-200 million files Performance bottleneck Metadata must fit in memory Double the block reports Multiple single points of failure w/o HA Needs 20 NameNodes for 1 Billion files Performance bottleneck Metadata must fit in memory Double the block reports
30.
© 2015 MapR
Technologies 30 DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode No-NameNode Architecture DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode DataNode NameNode A B C D E FAAA BBBB CCC DDD EEE FFF Up to 1T files (> 5000x advantage) Significantly less hardware & OpEx Higher performance No special config to enable HA Automatic failover & re-replication Metadata is persisted to disk
31.
© 2015 MapR
Technologies 31
32.
© 2015 MapR
Technologies 33 MapR: Fast and Dependable with Lowest TCO Cost comparison for a 500 TB cluster vs HDFS-based distro’s TCO: mapr.com/tco
33.
© 2015 MapR
Technologies 34© 2015 MapR Technologies Product Demo: Multi-tenancy
34.
© 2015 MapR
Technologies 35 Committed to our Customers’ Success Educational Services Professional Services Customer Support Core Hadoop Services Data Engineering Advanced Analytics M7/HBase Practice Hadoop engineering experts provide 24x7x365 global coverage Instructor-led courses & Free On-Demand training for Hadoop cluster administration, HBase & MapReduce programming and more Data Engineering Data Science
35.
© 2015 MapR
Technologies 36 WORLDWIDE PRESENCE & CUSTOMER SUPPORT HQ
36.
© 2015 MapR
Technologies 37 Key MapR Advantage Partners Business Services INFRASTRUCTURE & CLOUD ANALYTICS & BUSINESS INTELLIGENCE APPLICATIONS & OS CONSULTANTS & INTEGRATORS DATA WAREHOUSE & INTEGRATION
37.
© 2015 MapR
Technologies 38 Q&A @mapr maprtech info@mapr.com Engage with us! MapR maprtech mapr-technologies GET STARTED NOW! mapr.com/sandbox
Download now