SAS and Cloudera – Analytics at Scale
 

Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and the machine learning capabilities of SAS In-Memory Statistics for Hadoop.

SAS and Cloudera – Analytics at Scale: Presentation Transcript

  • 1 Welcome to the webinar!
    • All lines are muted
    • Q&A after the presentation
    • Ask questions at any time by typing them in the Chat panel on the left side of your screen
    • Recording of this webinar and slides will be available on-demand at cloudera.com
    • Join the conversation on Twitter: @cloudera @SASsoftware
    ©2014 Cloudera and SAS. All rights reserved.
  • 2 We will begin at 10:03am PST / 1:03pm EST
    1. You are automatically connected to the audio bridge
       - You will hear audio once the presentation begins
       - If needed, find dial-in information by clicking the Audio button at the top of your screen
    2. Turn up your computer’s speaker volume
       - Headphones are recommended
       - Your computer’s microphone is automatically set to mute
    3. Use the Chat tab on the left side of your screen to submit questions
       - We will answer questions at the end of the presentation
  • 3 Analytics at Scale and Speed
    Cloudera and SAS Online Webinar
    Wednesday, May 7, 2014, 10am PST / 1pm EST
    Mike Ames, SAS; Eli Collins, Cloudera; Scott Armstrong, Cloudera
  • 4 Agenda
    • An introduction to Cloudera's enterprise data hub
    • SAS and Cloudera technical integration
    • How SAS builds on the enterprise data hub
    • SAS® In-Memory solutions for Hadoop
    • Live Demo
    • Q&A
  • 5 Hadoop and Cloudera’s EDH: A New Approach to Data
  • 6 Expanding Data Requires A New Approach
    • Then: Bring Data to Compute
      Process-centric businesses use:
      • Structured data mainly
      • Internal data only
      • “Important” data only
    • Now: Bring Compute to Data
      Information-centric businesses use all data: multi-structured, internal & external data of all types
    [Diagram: Compute and Data nodes]
  • 7 The Old Way: Bringing Data to Compute
    Sources: ERP, CRM, RDBMS, Machines; Files, Images, Video, Logs, Clickstreams; External Data Sources
    Systems: Data Archives, EDWs, Marts, Search Servers, Document Stores, Storage
    • Complex Architecture
      • Many special-purpose systems
      • Moving data around
      • No complete views
    • Visibility
      • Leaving data behind
      • Risk and compliance
      • High cost of storage
    • Time to Data
      • Up-front modeling
      • Transforms slow
      • Transforms lose data
    • Cost of Analytics
      • Existing systems strained
      • No agility
      • BI backlog
  • 8 The New Way: Bringing Compute to Data
    Sources: ERP, CRM, RDBMS, Machines; Files, Images, Video, Logs, Clickstreams; External Data Sources
    Systems: EDWs, Marts, Storage, Search Servers, Documents, Archives
    1. Active archive
       • Full fidelity original data
       • Indefinite time, any source
       • Lowest cost storage
    2. Data management, transformations
       • One source of data for all analytics
       • Persisted state of transformed data
       • Significantly faster & cheaper
    3. Self-service exploratory BI
       • Simple search + BI tools
       • “Schema on read” agility
       • Reduce BI user backlog requests
    4. Multi-workload analytic platform
       • Bring applications to data
       • Combine different workloads on common data (e.g. SQL + Search)
       • True BI agility
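The “schema on read” idea on slide 8 can be sketched in a few lines. This is a minimal Python illustration, not SAS or Cloudera code: the event records, the field names, and the `read_with_schema` helper are all hypothetical. Raw records are stored once at full fidelity, and each consumer applies its own schema only when reading.

```python
import json

# Raw, full-fidelity records stored as-is (hypothetical sample data).
raw_events = [
    '{"user": "a", "ms": 120, "page": "/home"}',
    '{"user": "b", "ms": 340}',
    '{"user": "a", "ms": 95, "referrer": "search"}',
]

def read_with_schema(lines, schema):
    """Apply a schema at read time: pick and cast only the requested fields.

    Fields missing from a record come back as None; the stored data is untouched.
    """
    for line in lines:
        record = json.loads(line)
        yield {
            field: cast(record[field]) if field in record else None
            for field, cast in schema.items()
        }

# Two different "views" over the same stored data, with no up-front modeling.
latency_view = list(read_with_schema(raw_events, {"user": str, "ms": int}))
page_view = list(read_with_schema(raw_events, {"user": str, "page": str}))
```

Because the schema lives with the reader rather than the store, a new question only needs a new schema dictionary, not a reload of the data.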
  • 9 SAS & Cloudera: Big data analytics in Cloudera
    [Diagram labels] HDFS; SAS® Embedded Process; SAS® LASR™ Analytic Server; SAS® Event Stream Processing; SAS/ACCESS® to Hadoop™ & to Impala™; Real-Time & Streaming; Interactive; Batch & SQL; Visual Analytics; Visual Statistics; Visual Scenario Designer; In-Memory Statistics for Hadoop; Visual Data Builder; High-Performance Analytics
  • 10 SAS/ACCESS
    (1) SAS/ACCESS to Hadoop or Impala: push some of SAS’ processing to Hadoop
    [Diagram labels] SAS Server; HiveQL; SAS/ACCESS to Hadoop; SAS/ACCESS to Cloudera Impala
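Slide 10 describes pushing some processing down to the data store through SQL. The following is a conceptual Python sketch of that pushdown idea only. It uses an in-memory SQLite database as a stand-in for Hive or Impala, and the `sales` table and its columns are hypothetical; it does not show SAS/ACCESS syntax.

```python
import sqlite3

# SQLite stands in for the remote SQL engine (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10.0), ("east", 5.0), ("west", 7.5)],
)

# Without pushdown: every row travels to the client, which aggregates locally.
rows = conn.execute("SELECT region, amount FROM sales").fetchall()
client_totals = {}
for region, amount in rows:
    client_totals[region] = client_totals.get(region, 0.0) + amount

# With pushdown: the engine aggregates and only the summary rows come back.
pushed = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region"
).fetchall()
pushed_totals = dict(pushed)
```

Both paths produce the same totals, but the pushed-down query returns one row per region instead of one row per sale, which is the point of doing the work where the data lives.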
  • 11 SAS/Embedded Process
    (2) SAS/Embedded Process: push SAS processing to Cloudera with MapReduce
    [Diagram labels] SAS Server; SAS Data Step & DS2; SAS/Scoring Accelerator for Hadoop; SAS/Code Accelerator for Hadoop; SAS/Data Quality Accelerator for Hadoop
    Example DS2 program (N is a placeholder for the number of threads):
        proc ds2;
          /* thread program: roughly equivalent to a mapper */
          thread map_program;
            method run();
              set dbmslib.intab;
              /* program statements */
            end;
          endthread;
          run;

          /* data program: the wrapper that reads from the threads */
          data hdf.data_reduced;
            dcl thread map_program map_pgm;
            method run();
              set from map_pgm threads=N;
              /* reduce steps */
            end;
          enddata;
          run;
        quit;
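The DS2 program on slide 11 pairs a thread program (the mapper) with a data program that reads from the threads (the reduce step). As a rough analogy only, the same shape can be sketched in Python. The function names mirror the slide's identifiers, but the partitioned input and the mean calculation are hypothetical stand-ins for the elided "program statements" and "reduce steps".

```python
from concurrent.futures import ThreadPoolExecutor

def map_program(partition):
    """Per-partition work, analogous to the DS2 thread's run() method."""
    # Illustrative "program statements": a partial count and a partial sum.
    return (len(partition), sum(partition))

def data_reduced(partitions, threads):
    """Wrapper step, analogous to the DS2 data program reading from the threads."""
    with ThreadPoolExecutor(max_workers=threads) as pool:
        partials = list(pool.map(map_program, partitions))
    # Illustrative "reduce steps": combine the partial results into one mean.
    total_n = sum(n for n, _ in partials)
    total_sum = sum(s for _, s in partials)
    return total_sum / total_n if total_n else None

# Hypothetical input, already split into partitions as blocks on a cluster would be.
partitions = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0]]
mean = data_reduced(partitions, threads=3)
```

Each worker only ever sees its own partition, and the wrapper only ever sees small partial results, which is what lets this pattern scale out across nodes.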
  • 12 SAS/High-Performance Analytics
    (3) SAS/High-Performance Analytics: High-Performance Enabled SAS Procedures
    • SAS High-Performance Statistics
    • SAS High-Performance Data Mining
    • SAS High-Performance Text Mining
    • SAS High-Performance Econometrics
    • SAS High-Performance Forecasting
    • SAS High-Performance Optimization
    [Diagram labels] SAS Server; SAS HPA Procedures
  • 13 SAS LASR and Hadoop: In-Memory Solutions in Cloudera
    (4) In-Memory Analytics: process in memory, use Hadoop for storage persistence and commodity computing
    • Data sources: ERP, SCM, CRM, Images, Audio and Video, Machine Logs, Text, Web and Social
    • SAS Analytic Hadoop Environment: SAS® LASR Analytic Server with multiple SAS® In-Memory nodes
    • Solutions: Visual Analytics, Visual Statistics, Visual Scenario Designer, In-Memory Statistics, Visual Data Builder
    • Consumers: Web Clients, Applications
  • 14 Demo
  • 15 Summary
    • The combination of SAS analytics and Cloudera’s enterprise data hub (EDH) is a common recipe for Analytics at Scale.
    • SAS has baseline support for Cloudera with connectivity through Hive and Impala.
    • SAS also allows you to run In-Memory Analytics in a Cloudera cluster through multiple validated solutions:
      • Visual Analytics, Visual Statistics, Visual Scenario Designer, In-Memory Statistics for Hadoop & High-Performance Analytics
    • Strong SAS / Cloudera product integration with more to come!
  • 16 Questions?
    Use the Chat tab on the left side of your screen to submit questions
    Watch this webinar on-demand: www.Cloudera.com
    Alliances Contacts: Richard.O'Brien@SAS.com, Scott@Cloudera.com, or contact your account team
    Joint Solution Brief: http://bit.ly/SASClouderaSolution
    Download CDH (Free, Open Source): http://bit.ly/CDH-download
    Cloudera: http://bit.ly/ClouderaPartnerSAS
    SAS: http://bit.ly/SASPartnerCloudera
    Thank you for attending!
  • 18 Appendix