1
Welcome to the webinar!
• All lines are muted
• Q&A after the presentation
• Ask questions at any time by typing them in...
2
We will begin at 10:03am PST / 1:03pm EST
2
1. You are automatically connected to the audio bridge
- You will hear audio...
3
Analytics at Scale and Speed
Cloudera and SAS Online Webinar
Wednesday, May 7, 2014 - 10am PST/1pm PST
Mike Ames, SAS
El...
4
Agenda
• An introduction to Cloudera's enterprise data hub
• SAS and Cloudera technical integration
• How SAS builds on ...
5
Hadoop and Cloudera’s EDH:
A New Approach to Data
6
Expanding Data Requires A New Approach
6
Then
Bring Data to Compute
Now
Bring Compute to Data
Data
Information-centric
b...
7
The Old Way: Bringing Data to Compute
7
ERP, CRM, RDBMS, Machines Files, Images, Video, Logs, Clickstreams External Data...
8
EDWs
Marts Storage
Search
Servers
Documents
Archives
ERP, CRM, RDBMS, Machines Files, Images, Video, Logs, Clickstreams ...
9
SAS® Embedded
Process
SAS & Cloudera
Big data analytics in Cloudera
HDFS
SAS® LASR™ Analytic
Server
SAS® Event Stream
Pr...
10
SAS / Access
SAS/Access to Hadoop or Impala - Push some of SAS’ processing to Hadoop1
Hive QL
SAS
SERVER
SAS/Access to ...
11 ©2014 Cloudera and SAS. All rights reserved.
SAS
SERVER
SAS/Scoring Accelerator for Hadoop
SAS/Code Accelerator for Had...
12
SAS / High-Performance Analytics
SAS High-Performance Statistics
SAS High-Performance Data Mining
SAS High-Performance ...
13
SAS
®
LASR ANALYTIC
SERVER
SAS
®
IN-MEMORY
SAS
®
IN-MEMORY
SAS
®
IN-MEMORY
SAS
®
IN-MEMORY
SAS
®
IN-MEMORY
WEB CLIENTS ...
14
Demo
15
Summary
15
• The combination of SAS analytics and Cloudera’s enterprise
data hub (EDH) is a common recipe for Analytics...
16
Questions?
16
Use the Chat tab on the left-side of
your screen to submit question
Watch this webinar on-demand:
www.Clo...
17 ©2014 Cloudera and SAS. All rights reserved.
18
Appendix
Upcoming SlideShare
Loading in...5
×

SAS and Cloudera – Analytics at Scale

1,606

Published on

Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and SAS In-Memory Statistics for Hadoop, machine learning capabilities.

Published in: Software, Technology

SAS and Cloudera – Analytics at Scale

  1. 1. 1 Welcome to the webinar! • All lines are muted • Q&A after the presentation • Ask questions at any time by typing them in the Chat panel on the left side of your screen • Recording of this webinar and slides will be available on-demand at cloudera.com • Join the conversation on Twitter: @cloudera @SASsoftware ©2014 Cloudera and SAS. All rights reserved.
  2. 2. 2 We will begin at 10:03am PST / 1:03pm EST 2 1. You are automatically connected to the audio bridge - You will hear audio once the presentation begins - If needed, find dial-in information by clicking the Audio button at the top of your screen 2. Turn up your computer’s speaker volume - Headphones are recommended - Your computer’s microphone is automatically set to mute 3. Use the Chat tab on the left-side of your screen to submit questions - We will answer questions at the end of the presentation ©2014 Cloudera and SAS. All rights reserved.
  3. 3. 3 Analytics at Scale and Speed Cloudera and SAS Online Webinar Wednesday, May 7, 2014 - 10am PST/1pm PST Mike Ames, SAS Eli Collins, Cloudera Scott Armstrong, Cloudera
  4. 4. 4 Agenda • An introduction to Cloudera's enterprise data hub • SAS and Cloudera technical integration • How SAS builds on the enterprise data hub • SAS® In-Memory solutions for Hadoop • Live Demo • Q&A ©2014 Cloudera and SAS. All rights reserved.
  5. 5. 5 Hadoop and Cloudera’s EDH: A New Approach to Data
  6. 6. 6 Expanding Data Requires A New Approach 6 Then Bring Data to Compute Now Bring Compute to Data Data Information-centric businesses use all Data: Multi-structured, Internal & external data of all types Comput e Comput e Comput e Process-centric businesses use: • Structured data mainly • Internal data only • “Important” data only Comput e Comput e Comput e Data Data Data Data ©2014 Cloudera and SAS. All rights reserved.
  7. 7. 7 The Old Way: Bringing Data to Compute 7 ERP, CRM, RDBMS, Machines Files, Images, Video, Logs, Clickstreams External Data Sources Data ArchivesEDWs Marts SearchServers Document Stores Storage Complex Architecture • Many special-purpose systems • Moving data around • No complete views Visibility • Leaving data behind • Risk and compliance • High cost of storage Time to Data • Up-front modeling • Transforms slow • Transforms lose data Cost of Analytics • Existing systems strained • No agility • BI backlog 4 1 2 3 ©2014 Cloudera and SAS. All rights reserved.
  8. 8. 8 EDWs Marts Storage Search Servers Documents Archives ERP, CRM, RDBMS, Machines Files, Images, Video, Logs, Clickstreams External Data Sources Multi-workload analytic platform • Bring applications to data • Combine different workloads on common data (i.e. SQL + Search) • True BI agility 4 1 2 1 34 The New Way: Bringing Compute to Data 8 Active archive • Full fidelity original data • Indefinite time, any source • Lowest cost storage 1 Data management, transformations • One source of data for all analytics • Persisted state of transformed data • Significantly faster & cheaper 2 Self-service exploratory BI • Simple search + BI tools • “Schema on read” agility • Reduce BI user backlog requests 3 ©2014 Cloudera and SAS. All rights reserved.
  9. 9. 9 SAS® Embedded Process SAS & Cloudera Big data analytics in Cloudera HDFS SAS® LASR™ Analytic Server SAS® Event Stream Processing SAS/ACCESS® to Hadoop™ & to Impala™ Real-Time & Streaming Interactive Batch & SQL Visual Analytics Visual Statistics Visual Scenario Designer In-Memory Statistics for Hadoop Visual Data BuilderVisual Scenario Designer High-Performance Analytics ©2014 Cloudera and SAS. All rights reserved.
  10. 10. 10 SAS / Access SAS/Access to Hadoop or Impala - Push some of SAS’ processing to Hadoop1 Hive QL SAS SERVER SAS/Access to Hadoop SAS/Access to Cloudera Impala ©2014 Cloudera and SAS. All rights reserved.
  11. 11. 11 ©2014 Cloudera and SAS. All rights reserved. SAS SERVER SAS/Scoring Accelerator for Hadoop SAS/Code Accelerator for Hadoop SAS/Data Quality Accelerator for Hadoop proc ds2 ; /* thread ~ eqiv to a mapper */ thread map_program; method run(); set dbmslib.intab; /* program statements */ end; endthread; run; /* program wrapper */ data hdf.data_reduced; dcl thread map_program map_pgm; method run(); set from map_pgm threads=N; /* reduce steps */ end; enddata; run; quit; SAS / Embedded Process SAS/Embedded Process - Push SAS processing to Cloudera with Map Reduce2 SAS Data Step & DS2
  12. 12. 12 SAS / High-Performance Analytics SAS High-Performance Statistics SAS High-Performance Data Mining SAS High-Performance Text Mining SAS High-Performance Econometrics SAS High-Performance Forecasting SAS High-Performance Optimization SAS/High-Performance Analytics – High-Performance Enabled SAS Procedures3 SAS SERVER SAS HPA Procedures ©2014 Cloudera and SAS. All rights reserved.
  13. 13. 13 SAS ® LASR ANALYTIC SERVER SAS ® IN-MEMORY SAS ® IN-MEMORY SAS ® IN-MEMORY SAS ® IN-MEMORY SAS ® IN-MEMORY WEB CLIENTS APPLICATIONS ERP SCM CRM Images Audio and Video Machine Logs Text fWeb and Social In-Memory Analytics – Process in Memory, use Hadoop for Storage persistence and commodity computing 4 SAS ANALYTIC HADOOP ENVIRONMENT Visual Analytics Visual Statistics Visual Scenario Designer In-Memory Statistics Visual Data Builder SAS LASR and Hadoop In-Memory Solutions in Cloudera ©2014 Cloudera and SAS. All rights reserved.
  14. 14. 14 Demo
  15. 15. 15 Summary 15 • The combination of SAS analytics and Cloudera’s enterprise data hub (EDH) is a common recipe for Analytics at Scale. • SAS has baseline support for Cloudera with connectivity through Hive and Impala. • SAS also allows you to run In-Memory Analytics in a Cloudera cluster through multiple validated solutions: • Visual Analytics, Visual Statistics, Visual Scenario Designer, In- Memory Statistics for Hadoop & High-Performance Analytics • Strong SAS / Cloudera product integration with more to come! ©2014 Cloudera and SAS. All rights reserved.
  16. 16. 16 Questions? 16 Use the Chat tab on the left-side of your screen to submit question Watch this webinar on-demand: www.Cloudera.com Alliances Contacts: Richard.O'Brien@SAS.com Scott@Cloudera.com Or contact your account team Thank you for attending! Joint Solution Brief http://bit.ly/SASClouderaSolution Download CDH – Free Open Source http://bit.ly/CDH-download Cloudera http://bit.ly/ClouderaPartnerSAS SAS http://bit.ly/SASPartnerCloudera ©2014 Cloudera and SAS. All rights reserved.
  17. 17. 17 ©2014 Cloudera and SAS. All rights reserved.
  18. 18. 18 Appendix

×