
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)

Presentation at Spark Summit 2015

Published in: Data & Analytics

  1. Taking Your Spark To Production Scale. Anil Gadre, SVP Product Management, MapR Technologies. June 15, 2015. © 2015 MapR Technologies.
  2. The Journey To Production Scale: from trials and science projects to large mission-critical, operational deployments.
  3. Companies with Spark & MapR in Production: global telecom, healthcare, global financial services.
  4. Key Issues To Plan For: (1) Spark stack support? (2) Real-time? (3) Enterprise reliability & security? (4) Open-ended agility?
  5. Security Intelligence Operations: Global Managed Security Services delivered on Hadoop. Spark stream processing is used first to check for known threats. Data is next processed on Hadoop using MLlib and GraphX. Additional SQL querying is done via Spark SQL.
  6. Delivers Lightning Fast Analytics for Clients. Building the largest Hadoop cluster in Australia. Real-time analytics using Spark on MapR, reducing data loading time from hours to minutes. Leverages the multi-tenancy, high performance and reliability of MapR.
  7. Next-Gen Genomics. Develop a flexible platform to keep up with fast-changing research techniques. POSIX file access lets bioinformaticians use existing tools with open source tools (Spark). Graph manipulations can be done reliably and at scale using Spark.
  8. Real-Time Customer Analytics: MapR Data Lake stores both online and archive data; Spark on MapR reduced ETL processing; NFS moved data into the cluster seamlessly; 1/10th Total Cost of Ownership vs. the old way; new customer onboarding cut from months to weeks.
  9. Databricks & MapR Strategic Partnership (since April 2014). Support for the complete Spark stack. Engineering & roadmap collaboration. Back-end support.
  10. The Most Complete Spark Environment: Spark SQL (SQL), Spark Streaming (streaming), MLlib (machine learning), GraphX (graph computation). Foundation For Enterprise-Grade Spark.
  11. DB Operations; Real-Time and Actionable Analytics. Operations + Analytics on One Hadoop Platform with SQL Access. Mobile application server; Customer 360 dashboard; churn analysis; product/service optimization and personalization; real-time ad targeting; web application server; data exploration (SQL). User profiles and state; user interactions; real-time location data; web and mobile session state; comments/rankings.
  12. Spark + MapR = Ready For Production Success. World-record performance on disk. High Performance SLA-Driven Applications. High availability, data protection, disaster recovery. Reliability for Production. Strategic partnership with Databricks to ensure enterprise support for the entire stack. 24/7 Best-in-class Global Support. MapR-DB + Spark = real-time analytics. Operational Data Store.
  13. MapR Introduces 3 New Spark-Based Quick Start Solutions: Real-Time Security Log Analytics, Time Series Analytics, Genome Sequencing.
  14. Self-Service Data Exploration. Data Agility with Less IT Required. Single SQL Interface for Structured and Semi-Structured Data.
  15. Free On-Demand Training: www.mapr.com/training
  16. Get Your Tattoo In The MapR Booth! Show off your Kickstart My Heart skills and enter to win Xbox 360 & Guitar Hero.
  17. Top-Ranked NoSQL. Top-Ranked Hadoop Distribution. Top-Ranked SQL-on-Hadoop Solution.