SAS and Cloudera – Analytics at Scale

1,066
views

Learn about SAS and Cloudera technical integration, how SAS builds on the enterprise data hub, and the machine learning capabilities of SAS In-Memory Statistics for Hadoop.

Published in: Software, Technology


Transcript

  • 1. Welcome to the webinar! • All lines are muted • Q&A after the presentation • Ask questions at any time by typing them in the Chat panel on the left side of your screen • Recording of this webinar and slides will be available on-demand at cloudera.com • Join the conversation on Twitter: @cloudera @SASsoftware ©2014 Cloudera and SAS. All rights reserved.
  • 2. We will begin at 10:03am PST / 1:03pm EST • 1. You are automatically connected to the audio bridge - You will hear audio once the presentation begins - If needed, find dial-in information by clicking the Audio button at the top of your screen • 2. Turn up your computer’s speaker volume - Headphones are recommended - Your computer’s microphone is automatically set to mute • 3. Use the Chat tab on the left side of your screen to submit questions - We will answer questions at the end of the presentation
  • 3. Analytics at Scale and Speed • Cloudera and SAS Online Webinar • Wednesday, May 7, 2014 - 10am PST / 1pm EST • Mike Ames, SAS • Eli Collins, Cloudera • Scott Armstrong, Cloudera
  • 4. Agenda • An introduction to Cloudera's enterprise data hub • SAS and Cloudera technical integration • How SAS builds on the enterprise data hub • SAS® In-Memory solutions for Hadoop • Live Demo • Q&A
  • 5. Hadoop and Cloudera’s EDH: A New Approach to Data
  • 6. Expanding Data Requires a New Approach • Then: Bring Data to Compute. Process-centric businesses use structured data mainly, internal data only, and “important” data only • Now: Bring Compute to Data. Information-centric businesses use all data: multi-structured, internal and external data of all types
  • 7. The Old Way: Bringing Data to Compute • ERP, CRM, RDBMS, Machines • Files, Images, Video, Logs, Clickstreams • External Data Sources • Data Archives, EDWs, Marts, Search Servers, Document Stores, Storage • Complex Architecture: many special-purpose systems; moving data around; no complete views • Visibility: leaving data behind; risk and compliance; high cost of storage • Time to Data: up-front modeling; transforms are slow; transforms lose data • Cost of Analytics: existing systems strained; no agility; BI backlog
  • 8. The New Way: Bringing Compute to Data • ERP, CRM, RDBMS, Machines • Files, Images, Video, Logs, Clickstreams • External Data Sources • EDWs, Marts, Storage, Search Servers, Documents, Archives • (1) Active archive: full-fidelity original data; indefinite time, any source; lowest-cost storage • (2) Data management, transformations: one source of data for all analytics; persisted state of transformed data; significantly faster and cheaper • (3) Self-service exploratory BI: simple search + BI tools; “schema on read” agility; reduce BI user backlog requests • (4) Multi-workload analytic platform: bring applications to data; combine different workloads on common data (e.g. SQL + Search); true BI agility
  • 9. SAS & Cloudera: Big data analytics in Cloudera • HDFS • SAS® Embedded Process • SAS® LASR™ Analytic Server • SAS® Event Stream Processing • SAS/ACCESS® to Hadoop™ & to Impala™ • Real-Time & Streaming, Interactive, Batch & SQL • Visual Analytics, Visual Statistics, Visual Scenario Designer, In-Memory Statistics for Hadoop, Visual Data Builder, High-Performance Analytics
  • 10. SAS/ACCESS • SAS/ACCESS to Hadoop or Impala: push some of SAS’ processing to Hadoop (1) • Diagram labels: SAS Server, Hive QL, SAS/ACCESS to Hadoop, SAS/ACCESS to Cloudera Impala
  • 11. SAS / Embedded Process • SAS/Embedded Process: push SAS processing to Cloudera with MapReduce (2) • SAS Data Step & DS2 • SAS Server: SAS/Scoring Accelerator for Hadoop, SAS/Code Accelerator for Hadoop, SAS/Data Quality Accelerator for Hadoop • Code shown on the slide:
        proc ds2;
        /* thread: roughly equivalent to a mapper */
        thread map_program;
          method run();
            set dbmslib.intab;
            /* program statements */
          end;
        endthread;
        run;
        /* program wrapper */
        data hdf.data_reduced;
          dcl thread map_program map_pgm;
          method run();
            set from map_pgm threads=N;
            /* reduce steps */
          end;
        enddata;
        run;
        quit;
  • 12. SAS / High-Performance Analytics • SAS/High-Performance Analytics: high-performance-enabled SAS procedures (3) • SAS Server: SAS HPA Procedures • SAS High-Performance Statistics, SAS High-Performance Data Mining, SAS High-Performance Text Mining, SAS High-Performance Econometrics, SAS High-Performance Forecasting, SAS High-Performance Optimization
  • 13. SAS LASR and Hadoop: In-Memory Solutions in Cloudera • In-Memory Analytics: process in memory, use Hadoop for storage persistence and commodity computing (4) • ERP, SCM, CRM, Images, Audio and Video, Machine Logs, Text, Web and Social • SAS Analytic Hadoop Environment: SAS® LASR Analytic Server, SAS® In-Memory • Web Clients, Applications • Visual Analytics, Visual Statistics, Visual Scenario Designer, In-Memory Statistics, Visual Data Builder
  • 14. Demo
  • 15. Summary • The combination of SAS analytics and Cloudera’s enterprise data hub (EDH) is a common recipe for Analytics at Scale. • SAS has baseline support for Cloudera with connectivity through Hive and Impala. • SAS also allows you to run In-Memory Analytics in a Cloudera cluster through multiple validated solutions: Visual Analytics, Visual Statistics, Visual Scenario Designer, In-Memory Statistics for Hadoop & High-Performance Analytics • Strong SAS / Cloudera product integration with more to come!
  • 16. Questions? • Use the Chat tab on the left side of your screen to submit questions • Watch this webinar on-demand: www.Cloudera.com • Alliances Contacts: Richard.O'Brien@SAS.com, Scott@Cloudera.com, or contact your account team • Joint Solution Brief: http://bit.ly/SASClouderaSolution • Download CDH (Free Open Source): http://bit.ly/CDH-download • Cloudera: http://bit.ly/ClouderaPartnerSAS • SAS: http://bit.ly/SASPartnerCloudera • Thank you for attending!
  • 18. Appendix
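
A note on slide 11: the DS2 program there pairs a thread program, which the slide describes as roughly equivalent to a mapper, with a data program that reads from the threads and applies the reduce steps. The sketch below shows the same fan-out-then-combine shape in Python. It is an analogy only: it is not DS2, and it says nothing about how the SAS Embedded Process actually schedules work on a cluster. The names `map_program` and `run_data_program`, the even-number filter, and the sample partitions are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def map_program(partition):
    """Per-partition work, playing the role of the DS2 thread's run() method.

    Hypothetical logic: keep even values and double them.
    """
    return [row * 2 for row in partition if row % 2 == 0]


def run_data_program(partitions, threads=2):
    """Plays the role of the DS2 data program.

    Fans map_program out over the partitions (compare `threads=N` on the
    slide), then combines the per-thread output in a single reduce step.
    """
    with ThreadPoolExecutor(max_workers=threads) as pool:
        mapped = pool.map(map_program, partitions)
        # Reduce step: a simple sum over everything the threads produced.
        return sum(sum(chunk) for chunk in mapped)


# Hypothetical input: two partitions of one table.
partitions = [[1, 2, 3, 4], [5, 6, 7, 8]]
total = run_data_program(partitions, threads=2)
print(total)
```

In this sketch the per-row work lives entirely in `map_program`, so changing the number of threads changes only the degree of parallelism, not the result.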