SlideShare a Scribd company logo
1 of 11
Download to read offline
SAS-Hadoop
Foundation
23/03/2017
Table of Contents
 Taming to get the SAS Hadoop Environment Set up
 5 ways SAS gets to the data inside Hadoop
 SAS-HADOOP Talk Talk?
 Connecting SAS 9.4 (Windows) Cloudera CDH5.8 VM
 Submitting HDFS Commands from SAS
 Submitting pig commands from SAS
 Q& A
Taming to get the SAS
Hadoop Environment Set up
 Storing structured and unstructured data inside Hadoop
- A new industry Normal
 SAS Hadoop are now friends together
- Is Your Team ready to embrace the new Environments?
What First Question your analytical mind triggers?
• How many ways SAS can get to the data
inside Hadoop?
• How SAS & Hadoop can listen to each
other?
• What configuration changes are required?
5 ways SAS gets data inside Hadoop
- Depends on several SAS technology products
 BASE SAS
Base SAS can access hdfs files and can perform read-write operations only on plain Text and SAS Scalable
Performance Data Engine (spde) files.
 SAS Scalable Performance Data Server (SPD Server)
SPD server when connected to Hadoop, can directly read-write partitioned SPD server files To & Fro hdfs.
 SAS Access Hadoop Interface
Provides capability to interact with Hive tables. Can read-write data to Hive directly Using SAS SQL pass through
facility and SAS libname statements.
 SAS LASER Analytics Server
It is an in-memory analytics engine that can process data directly to hdfs using the SASHDAT file format -a highly
optimized, fastest and most efficient way of processing the data
 SAS In-Database Products
SAS In-Database Code Accelerator for Hadoop enables to run data and thread programs ( DS2 programming)
in map-reduce framework. The In-Database products offers several speedy methods for data preparation based
on DS2 thread programming on multicore symmetric multiprocessing (SMP) and massively parallel processing
(MPP) machines.
SAS-Hadoop Talk Talk?
 Connect and Configure SAS and Hadoop requires:
 Hadoop Jar Files
 Hadoop Configuration Properties
 Define the new SAS Environment Variables
HIVE Tables
HDFS
Directory
SAS_HADOOP_JAR_PATH
SAS_HADOOP_CONFIG_PATH
SAS_HADOOP_RESTFUL
(Optional- Enable WebHDFS)
New Environment
Variables
SAS Access to Hadoop
Interface
Connecting SAS(9.4 Windows)
Cloudera CDH VM 5.8
A Step-by-Step Guide
 Install the following
- SAS 9.4 windows
- Download and install VM Player and CDH 5.8
https://www.cloudera.com/downloads/quickstart_vms/5-8.html
 Import the CDH 5.8 into VM player
 Go to VM Settings and
- Allocate 16GB RAM and 2 cores
- Ensure NAT Adopter is Enabled
- Create a shared Folder Location that is accessible by SAS
 Validate Hadoop is up and running
 Note the IP address of VM machine
 In Windows, add VM IP address and hostname to host file
continued. . . . .
Connecting SAS(9.4 Windows)
Cloudera CDH VM 5.8
A Step-by-Step Guide
 Locate your latest hadoop jars and configuration files
- download hadoop tracer python script to get the jars and config files
ftp://ftp.sas.com/techsup/download/blind/access/hadooptracer.zip -
unzip the hadoop tracer python script in the shared folder location
- run the python script as
python ./hadooptracer.py --filterby=latest
It will pulls all the hadoop jars and config files under the directory /tmp/jars and /tmp/sitesxml
 Copy the jars and sitesxml to your shared location accessible to SAS
 Set the New SAS Environment Variables to point to Jar path and Config path
Submitting HDFS Commands
from SAS
Submitting pig Commands
from SAS
Thank You !!

More Related Content

What's hot

Hadoop in three use cases
Hadoop in three use casesHadoop in three use cases
Hadoop in three use cases
Joey Echeverria
 

What's hot (20)

Cloudera
ClouderaCloudera
Cloudera
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop distributions - ecosystem
Hadoop distributions - ecosystemHadoop distributions - ecosystem
Hadoop distributions - ecosystem
 
Ravi Namboori Hadoop & HDFS Architecture
Ravi Namboori Hadoop & HDFS ArchitectureRavi Namboori Hadoop & HDFS Architecture
Ravi Namboori Hadoop & HDFS Architecture
 
What is HDFS | Hadoop Distributed File System | Edureka
What is HDFS | Hadoop Distributed File System | EdurekaWhat is HDFS | Hadoop Distributed File System | Edureka
What is HDFS | Hadoop Distributed File System | Edureka
 
2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation
 
Session 03 - Hadoop Installation and Basic Commands
Session 03 - Hadoop Installation and Basic CommandsSession 03 - Hadoop Installation and Basic Commands
Session 03 - Hadoop Installation and Basic Commands
 
Hadoop hdfs
Hadoop hdfsHadoop hdfs
Hadoop hdfs
 
SQL Server 2012 and Big Data
SQL Server 2012 and Big DataSQL Server 2012 and Big Data
SQL Server 2012 and Big Data
 
Session 01 - Into to Hadoop
Session 01 - Into to HadoopSession 01 - Into to Hadoop
Session 01 - Into to Hadoop
 
Hadoop description
Hadoop descriptionHadoop description
Hadoop description
 
HADOOP TECHNOLOGY ppt
HADOOP  TECHNOLOGY pptHADOOP  TECHNOLOGY ppt
HADOOP TECHNOLOGY ppt
 
Hadoop HDFS
Hadoop HDFSHadoop HDFS
Hadoop HDFS
 
H base
H baseH base
H base
 
HDFS
HDFSHDFS
HDFS
 
Introduction to HDFS and MapReduce
Introduction to HDFS and MapReduceIntroduction to HDFS and MapReduce
Introduction to HDFS and MapReduce
 
ImpalaToGo and Tachyon integration
ImpalaToGo and Tachyon integrationImpalaToGo and Tachyon integration
ImpalaToGo and Tachyon integration
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop in three use cases
Hadoop in three use casesHadoop in three use cases
Hadoop in three use cases
 
Hadoop vs Spark | Which One to Choose? | Hadoop Training | Spark Training | E...
Hadoop vs Spark | Which One to Choose? | Hadoop Training | Spark Training | E...Hadoop vs Spark | Which One to Choose? | Hadoop Training | Spark Training | E...
Hadoop vs Spark | Which One to Choose? | Hadoop Training | Spark Training | E...
 

Viewers also liked

Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
inside-BigData.com
 
ちらし オストメイトなびサポーター
ちらし オストメイトなびサポーターちらし オストメイトなびサポーター
ちらし オストメイトなびサポーター
Tsubasa Kambe
 

Viewers also liked (16)

Ppt hadoop
Ppt hadoopPpt hadoop
Ppt hadoop
 
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC TechnologiesAccelerating Hadoop, Spark, and Memcached with HPC Technologies
Accelerating Hadoop, Spark, and Memcached with HPC Technologies
 
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
 
Hadoop or Spark: is it an either-or proposition? By Slim Baltagi
Hadoop or Spark: is it an either-or proposition? By Slim BaltagiHadoop or Spark: is it an either-or proposition? By Slim Baltagi
Hadoop or Spark: is it an either-or proposition? By Slim Baltagi
 
Power of OpenStack & Hadoop
Power of OpenStack & HadoopPower of OpenStack & Hadoop
Power of OpenStack & Hadoop
 
Hadoop basics
Hadoop basicsHadoop basics
Hadoop basics
 
ちらし オストメイトなびサポーター
ちらし オストメイトなびサポーターちらし オストメイトなびサポーター
ちらし オストメイトなびサポーター
 
Catalytic leadership - TriAgile - final
Catalytic leadership  - TriAgile - finalCatalytic leadership  - TriAgile - final
Catalytic leadership - TriAgile - final
 
ISA Toronto Chapter Presentation-March 2017
ISA Toronto  Chapter Presentation-March 2017ISA Toronto  Chapter Presentation-March 2017
ISA Toronto Chapter Presentation-March 2017
 
Letter to Senator Gardner Regarding Joint Resolution 34 - 20170330
Letter to Senator Gardner Regarding Joint Resolution 34 - 20170330Letter to Senator Gardner Regarding Joint Resolution 34 - 20170330
Letter to Senator Gardner Regarding Joint Resolution 34 - 20170330
 
Workshop convite
Workshop conviteWorkshop convite
Workshop convite
 
Samena trends february 2017
Samena trends february 2017Samena trends february 2017
Samena trends february 2017
 
NIH Support of Health Research in California
NIH Support of Health Research in CaliforniaNIH Support of Health Research in California
NIH Support of Health Research in California
 
Successful Small Business Energy Efficiency Program Practices
Successful Small Business Energy Efficiency Program PracticesSuccessful Small Business Energy Efficiency Program Practices
Successful Small Business Energy Efficiency Program Practices
 
Microservice's in detailed
Microservice's in detailedMicroservice's in detailed
Microservice's in detailed
 
シェーダー伝道師 第二回
シェーダー伝道師 第二回シェーダー伝道師 第二回
シェーダー伝道師 第二回
 

Similar to SAS-Hadoop Foundation

Hadoop online training by certified trainer
Hadoop online training by certified trainerHadoop online training by certified trainer
Hadoop online training by certified trainer
sriram0233
 
Best Hadoop and Amazon Online Training
Best Hadoop and Amazon Online TrainingBest Hadoop and Amazon Online Training
Best Hadoop and Amazon Online Training
Samatha Kamuni
 
Hadoop and aws map reducecourse
Hadoop and aws map reducecourseHadoop and aws map reducecourse
Hadoop and aws map reducecourse
Samatha Kamuni
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
orsenit
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
orsenit
 
HDFS presented by VIJAY
HDFS presented by VIJAYHDFS presented by VIJAY
HDFS presented by VIJAY
thevijayps
 

Similar to SAS-Hadoop Foundation (20)

Hadoop online training by certified trainer
Hadoop online training by certified trainerHadoop online training by certified trainer
Hadoop online training by certified trainer
 
Best Hadoop and Amazon Online Training
Best Hadoop and Amazon Online TrainingBest Hadoop and Amazon Online Training
Best Hadoop and Amazon Online Training
 
Hadoop and aws map reducecourse
Hadoop and aws map reducecourseHadoop and aws map reducecourse
Hadoop and aws map reducecourse
 
Lecture 2 Hadoop.pptx
Lecture 2 Hadoop.pptxLecture 2 Hadoop.pptx
Lecture 2 Hadoop.pptx
 
Hadoop in action
Hadoop in actionHadoop in action
Hadoop in action
 
Hadoop online training
Hadoop online trainingHadoop online training
Hadoop online training
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
 
Haoop ppt
Haoop pptHaoop ppt
Haoop ppt
 
Design and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryDesign and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on Raspberry
 
Hadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log ProcessingHadoop a Natural Choice for Data Intensive Log Processing
Hadoop a Natural Choice for Data Intensive Log Processing
 
Hadoop vs Apache Spark
Hadoop vs Apache SparkHadoop vs Apache Spark
Hadoop vs Apache Spark
 
Hadoop Distributed File System
Hadoop Distributed File SystemHadoop Distributed File System
Hadoop Distributed File System
 
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1
 
Overview of big data & hadoop version 1 - Tony Nguyen
Overview of big data & hadoop   version 1 - Tony NguyenOverview of big data & hadoop   version 1 - Tony Nguyen
Overview of big data & hadoop version 1 - Tony Nguyen
 
Hadoop_arunam_ppt
Hadoop_arunam_pptHadoop_arunam_ppt
Hadoop_arunam_ppt
 
Hadoop map reduce
Hadoop map reduceHadoop map reduce
Hadoop map reduce
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 
BIG DATA: Apache Hadoop
BIG DATA: Apache HadoopBIG DATA: Apache Hadoop
BIG DATA: Apache Hadoop
 
HDFS presented by VIJAY
HDFS presented by VIJAYHDFS presented by VIJAY
HDFS presented by VIJAY
 
Hadoop Tutorial for Beginners
Hadoop Tutorial for BeginnersHadoop Tutorial for Beginners
Hadoop Tutorial for Beginners
 

Recently uploaded

Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
LuisMiguelPaz5
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
23050636
 
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
varanasisatyanvesh
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
mikehavy0
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
pwgnohujw
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
saurabvyas476
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
zifhagzkk
 

Recently uploaded (20)

Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...Simplify hybrid data integration at an enterprise scale. Integrate all your d...
Simplify hybrid data integration at an enterprise scale. Integrate all your d...
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
 
👉 Tirunelveli Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Gir...
👉 Tirunelveli Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Gir...👉 Tirunelveli Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Gir...
👉 Tirunelveli Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Gir...
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptx
 

SAS-Hadoop Foundation

  • 2. Table of Contents  Taming to get the SAS Hadoop Environment Set up  5 ways SAS gets to the data inside Hadoop  SAS-HADOOP Talk Talk?  Connecting SAS 9.4 (Windows) Cloudera CDH5.8 VM  Submitting HDFS Commands from SAS  Submitting pig commands from SAS  Q& A
  • 3. Taming to get the SAS Hadoop Environment Set up  Storing structured and unstructured data inside Hadoop - A new industry Normal  SAS Hadoop are now friends together - Is Your Team ready to embrace the new Environments? What First Question your analytical mind triggers? • How many ways SAS can get to the data inside Hadoop? • How SAS & Hadoop can listen to each other? • What configuration changes are required?
  • 4. 5 ways SAS gets data inside Hadoop - Depends on several SAS technology products  BASE SAS Base SAS can access hdfs files and can perform read-write operations only on plain Text and SAS Scalable Performance Data Engine (spde) files.  SAS Scalable Performance Data Server (SPD Server) SPD server when connected to Hadoop, can directly read-write partitioned SPD server files To & Fro hdfs.  SAS Access Hadoop Interface Provides capability to interact with Hive tables. Can read-write data to Hive directly Using SAS SQL pass through facility and SAS libname statements.  SAS LASER Analytics Server It is an in-memory analytics engine that can process data directly to hdfs using the SASHDAT file format -a highly optimized, fastest and most efficient way of processing the data  SAS In-Database Products SAS In-Database Code Accelerator for Hadoop enables to run data and thread programs ( DS2 programming) in map-reduce framework. The In-Database products offers several speedy methods for data preparation based on DS2 thread programming on multicore symmetric multiprocessing (SMP) and massively parallel processing (MPP) machines.
  • 5. SAS-Hadoop Talk Talk?  Connect and Configure SAS and Hadoop requires:  Hadoop Jar Files  Hadoop Configuration Properties  Define the new SAS Environment Variables HIVE Tables HDFS Directory SAS_HADOOP_JAR_PATH SAS_HADOOP_CONFIG_PATH SAS_HADOOP_RESTFUL (Optional- Enable WebHDFS) New Environment Variables SAS Access to Hadoop Interface
  • 6. Connecting SAS(9.4 Windows) Cloudera CDH VM 5.8 A Step-by-Step Guide  Install the following - SAS 9.4 windows - Download and install VM Player and CDH 5.8 https://www.cloudera.com/downloads/quickstart_vms/5-8.html  Import the CDH 5.8 into VM player  Go to VM Settings and - Allocate 16GB RAM and 2 cores - Ensure NAT Adopter is Enabled - Create a shared Folder Location that is accessible by SAS  Validate Hadoop is up and running  Note the IP address of VM machine  In Windows, add VM IP address and hostname to host file continued. . . . .
  • 7. Connecting SAS(9.4 Windows) Cloudera CDH VM 5.8 A Step-by-Step Guide  Locate your latest hadoop jars and configuration files - download hadoop tracer python script to get the jars and config files ftp://ftp.sas.com/techsup/download/blind/access/hadooptracer.zip - unzip the hadoop tracer python script in the shared folder location - run the python script as python ./hadooptracer.py --filterby=latest It will pulls all the hadoop jars and config files under the directory /tmp/jars and /tmp/sitesxml  Copy the jars and sitesxml to your shared location accessible to SAS  Set the New SAS Environment Variables to point to Jar path and Config path
  • 10.