SlideShare a Scribd company logo
1 of 16
Download to read offline
Institute for Cyber Security
© Maanak Gupta World-Leading Research with Real-World Impact! 1
Multi-Layer Authorization Framework for a
Representative Hadoop Ecosystem Deployment
Maanak Gupta, Farhan Patwa, James Benson, and Ravi Sandhu
Institute for Cyber Security and Department of Computer Science
University of Texas at San Antonio
22nd ACM Symposium on Access Control Models and Technologies (SACMAT)
June 21-23, 2017, Indianapolis, IN, USA
© Maanak Gupta World-Leading Research with Real-World Impact! 2
Agenda
➢ Introduction and Motivation
➢ Multi-layer Access Control
➢ Hadoop Ecosystem Authorization Architecture
➢ Access Control Mechanisms and Policy
Configuration Points
➢ Conclusion
➢ IDC 2025 :
❖ global “datasphere” – 163 zettabytes
❖ 10x than 2016
➢ Opportunities: 21st century gold for data miners
➢ Big Data require “Big Systems”
Security:
➢ Secure Storage
➢ Privacy Concerns (e.g. HIPPA)
➢ Fine granular access requirements
Big Data and Big Challenges
© Maanak Gupta World-Leading Research with Real-World Impact! 3
➢ Hadoop: resilient, cost efficient
distributed storage (HDFS) and processing
framework (MapReduce or YARN)
➢ Ecosystem = Hadoop core +
Open-Source Projects
➢ Hadoop Data Lake
➢ Security Concerns
Hadoop Ecosystem
© Maanak Gupta World-Leading Research with Real-World Impact! 4
© Maanak Gupta World-Leading Research with Real-World Impact! 5
Multi-Layer Access Control
Services
Data and Service
Objects Cluster Resources
and ApplicationsHDFS NameNode,
YARN ResourceManager
Apache Hive
HDFS Files,
Hive Tables
Kafka Topics YARN Queues,
Cluster Nodes
© Maanak Gupta World-Leading Research with Real-World Impact! 6
Hadoop Ecosystem
Authorization Architecture
© Maanak Gupta World-Leading Research with Real-World Impact! 7
Hadoop and Data Services Access
Hadoop Daemons Access Configuration
WebHDFS Access via Apache Knox
© Maanak Gupta World-Leading Research with Real-World Impact! 8
Data Objects Access
Hive and HDFS Access Configurations
© Maanak Gupta World-Leading Research with Real-World Impact! 9
Data Objects Access
Authorization Options and End User Impersonation
© Maanak Gupta World-Leading Research with Real-World Impact! 10
Data Objects Access
Tag Based Policy Configuration
© Maanak Gupta World-Leading Research with Real-World Impact! 11
Data Objects Access
Data Masking and Column Filtering
© Maanak Gupta World-Leading Research with Real-World Impact! 12
Context Enricher and Policy Conditions
Geo Location Based Policies
© Maanak Gupta World-Leading Research with Real-World Impact! 13
Context Enricher and Policy Conditions
Data Combination Prohibition
© Maanak Gupta World-Leading Research with Real-World Impact! 14
Cluster Resource and Application Access
YARN Queue Access Control Configuration
root
default newQueue
root
accessrights
© Maanak Gupta World-Leading Research with Real-World Impact! 15
Cluster Resource and Application Access
Queue and Job Level Access Control
Cluster Nodes Configuration
© Maanak Gupta World-Leading Research with Real-World Impact! 16
Conclusion
Secure Hadoop Ecosystem
‘Defense in Depth’
Hadoop And
Services
Data and Service
Objects
Cluster Resource
and Application

More Related Content

What's hot

A hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationA hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplication
Adz91 Digital Ads Pvt Ltd
 

What's hot (20)

The Exabyte Journey and DataBrew with CICD
The Exabyte Journey and DataBrew with CICDThe Exabyte Journey and DataBrew with CICD
The Exabyte Journey and DataBrew with CICD
 
Hasbe a hierarchical attribute based solution for flexible and scalable acces...
Hasbe a hierarchical attribute based solution for flexible and scalable acces...Hasbe a hierarchical attribute based solution for flexible and scalable acces...
Hasbe a hierarchical attribute based solution for flexible and scalable acces...
 
IRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication SystemIRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication System
 
Globus toolkit in grid
Globus toolkit in gridGlobus toolkit in grid
Globus toolkit in grid
 
Foundations for the Future of Science
Foundations for the Future of ScienceFoundations for the Future of Science
Foundations for the Future of Science
 
Article data-centric security key to cloud and digital business
Article   data-centric security key to cloud and digital businessArticle   data-centric security key to cloud and digital business
Article data-centric security key to cloud and digital business
 
Computing Outside The Box June 2009
Computing Outside The Box June 2009Computing Outside The Box June 2009
Computing Outside The Box June 2009
 
Cloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsCloud Services for Big Data Analytics
Cloud Services for Big Data Analytics
 
A Hybrid Cloud Approach for Secure Authorized De-Duplication
A Hybrid Cloud Approach for Secure Authorized De-DuplicationA Hybrid Cloud Approach for Secure Authorized De-Duplication
A Hybrid Cloud Approach for Secure Authorized De-Duplication
 
CYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURE
CYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURECYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURE
CYVERSE: TRANSFORMING LIFE SCIENCE RESEARCH VIA CYBERINFRASTRUCTURE
 
A hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationA hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplication
 
Review on Big Data Security in Hadoop
Review on Big Data Security in HadoopReview on Big Data Security in Hadoop
Review on Big Data Security in Hadoop
 
Computing Outside The Box September 2009
Computing Outside The Box September 2009Computing Outside The Box September 2009
Computing Outside The Box September 2009
 
grid mining
grid mininggrid mining
grid mining
 
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...Enabling data dynamic and indirect mutual trust for cloud computing storage s...
Enabling data dynamic and indirect mutual trust for cloud computing storage s...
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by Sunny
 
Hadoop
HadoopHadoop
Hadoop
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
 
A hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationA hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplication
 

Similar to Multi-layer Authorization Framework for Hadoop Ecosystem

Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
inside-BigData.com
 
Hadoop Administration pdf
Hadoop Administration pdfHadoop Administration pdf
Hadoop Administration pdf
Edureka!
 

Similar to Multi-layer Authorization Framework for Hadoop Ecosystem (20)

Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
Designing Convergent HPC and Big Data Software Stacks: An Overview of the HiB...
 
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduceIntroduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduce
 
What Is Hadoop | Hadoop Tutorial For Beginners | Edureka
What Is Hadoop | Hadoop Tutorial For Beginners | EdurekaWhat Is Hadoop | Hadoop Tutorial For Beginners | Edureka
What Is Hadoop | Hadoop Tutorial For Beginners | Edureka
 
Big data-hadoop-training-course-content-content
Big data-hadoop-training-course-content-contentBig data-hadoop-training-course-content-content
Big data-hadoop-training-course-content-content
 
Hadoop/MapReduce/HDFS
Hadoop/MapReduce/HDFSHadoop/MapReduce/HDFS
Hadoop/MapReduce/HDFS
 
IEEE_BigData2014-Lee.pdf
IEEE_BigData2014-Lee.pdfIEEE_BigData2014-Lee.pdf
IEEE_BigData2014-Lee.pdf
 
Introducing the hadoop ecosystem
Introducing the hadoop ecosystemIntroducing the hadoop ecosystem
Introducing the hadoop ecosystem
 
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
Kave Salamatian, Universite de Savoie and Eiko Yoneki, University of Cambridg...
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
 
HDFS
HDFSHDFS
HDFS
 
Hadoop.powerpoint.pptx
Hadoop.powerpoint.pptxHadoop.powerpoint.pptx
Hadoop.powerpoint.pptx
 
Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014Apache hadoop: POSH Meetup Palo Alto, CA April 2014
Apache hadoop: POSH Meetup Palo Alto, CA April 2014
 
myHadoop - Hadoop-on-Demand on Traditional HPC Resources
myHadoop - Hadoop-on-Demand on Traditional HPC ResourcesmyHadoop - Hadoop-on-Demand on Traditional HPC Resources
myHadoop - Hadoop-on-Demand on Traditional HPC Resources
 
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014 WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
WANdisco Non-Stop Hadoop: PHXDataConference Presentation Oct 2014
 
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksThe Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
 
Hadoop Administration pdf
Hadoop Administration pdfHadoop Administration pdf
Hadoop Administration pdf
 
Hadoop - Architectural road map for Hadoop Ecosystem
Hadoop -  Architectural road map for Hadoop EcosystemHadoop -  Architectural road map for Hadoop Ecosystem
Hadoop - Architectural road map for Hadoop Ecosystem
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Unit IV.pdf
Unit IV.pdfUnit IV.pdf
Unit IV.pdf
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 

Multi-layer Authorization Framework for Hadoop Ecosystem

  • 1. Institute for Cyber Security © Maanak Gupta World-Leading Research with Real-World Impact! 1 Multi-Layer Authorization Framework for a Representative Hadoop Ecosystem Deployment Maanak Gupta, Farhan Patwa, James Benson, and Ravi Sandhu Institute for Cyber Security and Department of Computer Science University of Texas at San Antonio 22nd ACM Symposium on Access Control Models and Technologies (SACMAT) June 21-23, 2017, Indianapolis, IN, USA
  • 2. © Maanak Gupta World-Leading Research with Real-World Impact! 2 Agenda ➢ Introduction and Motivation ➢ Multi-layer Access Control ➢ Hadoop Ecosystem Authorization Architecture ➢ Access Control Mechanisms and Policy Configuration Points ➢ Conclusion
  • 3. ➢ IDC 2025 : ❖ global “datasphere” – 163 zettabytes ❖ 10x than 2016 ➢ Opportunities: 21st century gold for data miners ➢ Big Data require “Big Systems” Security: ➢ Secure Storage ➢ Privacy Concerns (e.g. HIPPA) ➢ Fine granular access requirements Big Data and Big Challenges © Maanak Gupta World-Leading Research with Real-World Impact! 3
  • 4. ➢ Hadoop: resilient, cost efficient distributed storage (HDFS) and processing framework (MapReduce or YARN) ➢ Ecosystem = Hadoop core + Open-Source Projects ➢ Hadoop Data Lake ➢ Security Concerns Hadoop Ecosystem © Maanak Gupta World-Leading Research with Real-World Impact! 4
  • 5. © Maanak Gupta World-Leading Research with Real-World Impact! 5 Multi-Layer Access Control Services Data and Service Objects Cluster Resources and ApplicationsHDFS NameNode, YARN ResourceManager Apache Hive HDFS Files, Hive Tables Kafka Topics YARN Queues, Cluster Nodes
  • 6. © Maanak Gupta World-Leading Research with Real-World Impact! 6 Hadoop Ecosystem Authorization Architecture
  • 7. © Maanak Gupta World-Leading Research with Real-World Impact! 7 Hadoop and Data Services Access Hadoop Daemons Access Configuration WebHDFS Access via Apache Knox
  • 8. © Maanak Gupta World-Leading Research with Real-World Impact! 8 Data Objects Access Hive and HDFS Access Configurations
  • 9. © Maanak Gupta World-Leading Research with Real-World Impact! 9 Data Objects Access Authorization Options and End User Impersonation
  • 10. © Maanak Gupta World-Leading Research with Real-World Impact! 10 Data Objects Access Tag Based Policy Configuration
  • 11. © Maanak Gupta World-Leading Research with Real-World Impact! 11 Data Objects Access Data Masking and Column Filtering
  • 12. © Maanak Gupta World-Leading Research with Real-World Impact! 12 Context Enricher and Policy Conditions Geo Location Based Policies
  • 13. © Maanak Gupta World-Leading Research with Real-World Impact! 13 Context Enricher and Policy Conditions Data Combination Prohibition
  • 14. © Maanak Gupta World-Leading Research with Real-World Impact! 14 Cluster Resource and Application Access YARN Queue Access Control Configuration root default newQueue root accessrights
  • 15. © Maanak Gupta World-Leading Research with Real-World Impact! 15 Cluster Resource and Application Access Queue and Job Level Access Control Cluster Nodes Configuration
  • 16. © Maanak Gupta World-Leading Research with Real-World Impact! 16 Conclusion Secure Hadoop Ecosystem ‘Defense in Depth’ Hadoop And Services Data and Service Objects Cluster Resource and Application