Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level

•

0 likes•606 views

Data warehouses contain sensitive data that must be secured in two ways: by defining appropriate access rights to the users and by preventing potential data inferences. Inspired from development methods for information systems, the first way of securing a data warehouse has been treated in the literature during the early phases of the development cycle. However, despite the high risks of inferences, the second way is not sufficiently taken into account in the design phase; it is rather left to the administrator of the data warehouse. However, managing inferences during the exploitation phase may induce high maintenance costs and complex OLAP server administration. In this paper, we propose an approach that, starting from the conceptual model of the data sources, assists the designer of the data warehouse in indentifying multidimensional sensitive data and those that may be subject to inferences.

Technology

Securing Data Warehouses:
A Semi-automatic Approach for Inference
Prevention at the Design Level
Salah Triki
Hanene Ben-Abdallah (Mir@cl, University of Sfax)
Nouria Harbi, Omar Boussaid (ERIC, University of Lyon)
1

Outline
• Introduction
• Securing Data Warehouses
• An approach for assisting the design of
secure DW
• Conclusion

Introduction
• A data warehouse is a collection of data:
– integrated
– subject-oriented
– nonvolatile
– historized
– available for querying and analysis
• A DW can be deployed in various domains:
Commerce, Hospital ...

Introduction
• Data warehouses contain:
– Sensitive data
– Some personal/propriatary data
• Legal requirements:
– HIPPA
– GLBA
– Safe Harbor
– Sarbanes-Oxley
• Organizations must comply with these laws

Outline
6
• Introduction
• Securing Data Warehouses
• An approach for assisting the design of
secure DW
• Conclusion

Securing Data Warehouses
7
• The two levels of security :
– Design level
– Physical level

Securing Data Warehouses
• At the design level
Security constraint
Security constraint

Entrepôt de
données
• The types of
inferences :
– Precise
Inference
– Partial Inference
Query Not
Authorized
Data
Authorized
Data
• At the physical level
Securing Data Warehouses

• Prevention of inferences at the physical level
[Haibing and al. 2008, Cuzzocrea 2009, Zhang and al. 2011]
can induce :
– high administrative costs
– high maintenance.
• Prevention of inferences at the design level
[Steger and al. 2000, Blanco and al. 2010] :
– do not take into account the potential inferences
from the available data
– specific to a particular application domain.
Securing Data Warehouses

• Assumptions :
– The data sources’ class diagram is
available.
– The star schema is already designed.
– The star schema is mapped to the data
sources’ class diagram.
An approach for assisting the design
of secure DW

(1)
(2)
(3)
(4)
An approach for assisting the design
of secure DW
Security
Designer

• Inferences Graph : a set of nodes
connected by oriented arcs.
– The nodes represent the data :
●
Node colored in gray : sensitive data
●
Node colored in white : none sensitive data
– The arcs indicate the direction of inference :
●
Solid arc : precise inference
●
Dotted arc : partial inference
B C
A
Inferences graph construction

Types of inferences
• The automatic construction of the
inferences graph does not indicate the
type of inferences: partial or precise.
• The indication cannot be, unfortunately,
deducted automatically.
• The security designer must distinguish
partial inferences (drawn by dotted arcs).

Detection of new inferences
A
B C
D E
• Calculation of the transitive closure
Partial path Precise path

Enrichment of the star schema
A
B C
D E
Partial path Precise path
<<Partial Inference : D:A>>
<<Precise Inference : E:A>>
<<Sensitive Data >>

• Class diagram of the data sources
Example

• DW star schema
Example
Illness Critical
Illness

Example
Illness
Critical
Illness
Treatment Diagnostic Transfer

• Inferences graph transitive closure
Example

•Inference type specification
Example
<< Partial Inference : Date : Illness>>
<< Partial Inference : Time : Illness>>
<< Sensitive Data >>
<<Partial Inference : Transfer :Critical Illness>>

• An approach to produce a conceptual
multidimensional model annotated with
information for inference prevention:
– A graph of inferences based on the class
diagram of data sources.
– The class diagram allows us to identify the
elements to lead to precise/partial inferences.
• Studying how to transfer to the logical level
the annotations defined at the design level.
Conclusion

Similar to Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level

Secure Coding Practices for MiddlewareManuel Brugnoli

Overview of data programming: easing the bottleneck of supervised machine lea...datalab-vietnam

OWASPPen Testeronyguy

Anomaly detection (Unsupervised Learning) in Machine LearningKuppusamy P

Archive enabling tagging using progressive barcodesMarie Vans

lecture1.pptbayhehua

Secure and Privacy-Preserving Big-Data ProcessingShantanu Sharma

Azure Digital TwinsMarco Parenzan

Outlier analysis for Temporal DatasetsQuantUniversity

Cloud lastAnmitas1

Attaining data security in cloud computingGopinath Muthusamy

security Issues of cloud computingprachupanchal

Supporting Data-Rich Research on Many FrontsJohn Kunze

State of Florida Neo4j Graph Briefing - Cyber IAMNeo4j

REASSURE Robust and Efficient Approaches to Evaluating Side Channel and Fault...Agence du Numérique (AdN)

Computer Hardware | 3BCMDLMS

Computer Hardware - Lecture BCMDLearning

Building Your Application Security Data Hub - OWASP AppSecUSADenim Group

Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...Data Con LA

High-Volume Data Collection and Real Time Analytics Using Rediscacois

Similar to Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level (20)

Secure Coding Practices for Middleware

Overview of data programming: easing the bottleneck of supervised machine lea...

OWASP

Anomaly detection (Unsupervised Learning) in Machine Learning

Archive enabling tagging using progressive barcodes

lecture1.ppt

Secure and Privacy-Preserving Big-Data Processing

Azure Digital Twins

Outlier analysis for Temporal Datasets

Cloud last

Attaining data security in cloud computing

security Issues of cloud computing

Supporting Data-Rich Research on Many Fronts

State of Florida Neo4j Graph Briefing - Cyber IAM

REASSURE Robust and Efficient Approaches to Evaluating Side Channel and Fault...

Computer Hardware | 3B

Computer Hardware - Lecture B

Building Your Application Security Data Hub - OWASP AppSecUSA

Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...

High-Volume Data Collection and Real Time Analytics Using Redis

Recently uploaded

Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Artificial intelligence in the post-deep learning eraDeakin University

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada

Install Stable Diffusion in windows machinePadma Pradeep

Build your next Gen AI Breakthrough - April 2024Neo4j

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

How to convert PDF to text with Nanonetsnaman860154

Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Pigging Solutions in Pet Food ManufacturingPigging Solutions

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

Recently uploaded (20)

Connect Wave/ connectwave Pitch Deck Presentation

Breaking the Kubernetes Kill Chain: Host Path Mount

Artificial intelligence in the post-deep learning era

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024

Install Stable Diffusion in windows machine

Build your next Gen AI Breakthrough - April 2024

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

Streamlining Python Development: A Guide to a Modern Project Setup

Are Multi-Cloud and Serverless Good or Bad?

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Advanced Test Driven-Development @ php[tek] 2024

Pigging Solutions Piggable Sweeping Elbows

SQL Database Design For Developers at php[tek] 2024

How to convert PDF to text with Nanonets

Unleash Your Potential - Namagunga Girls Coding Club

08448380779 Call Girls In Friends Colony Women Seeking Men

Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Pigging Solutions in Pet Food Manufacturing

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level

1. Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level Salah Triki Hanene Ben-Abdallah (Mir@cl, University of Sfax) Nouria Harbi, Omar Boussaid (ERIC, University of Lyon) 1

2. Outline • Introduction • Securing Data Warehouses • An approach for assisting the design of secure DW • Conclusion

3. Outline • Introduction • Securing Data Warehouses • An approach for assisting the design of secure DW • Conclusion

4. Introduction • A data warehouse is a collection of data: – integrated – subject-oriented – nonvolatile – historized – available for querying and analysis • A DW can be deployed in various domains: Commerce, Hospital ...

5. Introduction • Data warehouses contain: – Sensitive data – Some personal/propriatary data • Legal requirements: – HIPPA – GLBA – Safe Harbor – Sarbanes-Oxley • Organizations must comply with these laws

6. Outline 6 • Introduction • Securing Data Warehouses • An approach for assisting the design of secure DW • Conclusion

7. Securing Data Warehouses 7 • The two levels of security : – Design level – Physical level

8. Securing Data Warehouses • At the design level Security constraint Security constraint

9. Entrepôt de données • The types of inferences : – Precise Inference – Partial Inference Query Not Authorized Data Authorized Data • At the physical level Securing Data Warehouses

10. • Prevention of inferences at the physical level [Haibing and al. 2008, Cuzzocrea 2009, Zhang and al. 2011] can induce : – high administrative costs – high maintenance. • Prevention of inferences at the design level [Steger and al. 2000, Blanco and al. 2010] : – do not take into account the potential inferences from the available data – specific to a particular application domain. Securing Data Warehouses

11. Outline • Introduction • Securing Data Warehouses • An approach for assisting the design of secure DW • Conclusion

12. • Assumptions : – The data sources’ class diagram is available. – The star schema is already designed. – The star schema is mapped to the data sources’ class diagram. An approach for assisting the design of secure DW

13. (1) (2) (3) (4) An approach for assisting the design of secure DW Security Designer

14. • Inferences Graph : a set of nodes connected by oriented arcs. – The nodes represent the data : ● Node colored in gray : sensitive data ● Node colored in white : none sensitive data – The arcs indicate the direction of inference : ● Solid arc : precise inference ● Dotted arc : partial inference B C A Inferences graph construction

15. Inference rules 1/3 C1 C1

16. Inference rules 2/3

17. Inference rules 3/3

18. Types of inferences • The automatic construction of the inferences graph does not indicate the type of inferences: partial or precise. • The indication cannot be, unfortunately, deducted automatically. • The security designer must distinguish partial inferences (drawn by dotted arcs).

19. Detection of new inferences A B C D E • Calculation of the transitive closure Partial path Precise path

20. Enrichment of the star schema A B C D E Partial path Precise path <<Partial Inference : D:A>> <<Precise Inference : E:A>> <<Sensitive Data >>

21. • Class diagram of the data sources Example

22. • DW star schema Example Illness Critical Illness

23. Example Illness Critical Illness Treatment Diagnostic Transfer

24. • Inferences graph Example

25. • Inferences graph transitive closure Example

26. •Inference type specification Example << Partial Inference : Date : Illness>> << Partial Inference : Time : Illness>> << Sensitive Data >> <<Partial Inference : Transfer :Critical Illness>>

27. Outline • Introduction • Securing Data Warehouses • An approach for assisting the design of secure DW • Conclusion

28. • An approach to produce a conceptual multidimensional model annotated with information for inference prevention: – A graph of inferences based on the class diagram of data sources. – The class diagram allows us to identify the elements to lead to precise/partial inferences. • Studying how to transfer to the logical level the annotations defined at the design level. Conclusion

Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level

Recommended

Recommended

More Related Content

Similar to Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level

Similar to Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level (20)

More from Salah Triki

More from Salah Triki (14)

Recently uploaded

Recently uploaded (20)

Securing Data Warehouses: A Semi-automatic Approach for Inference Prevention at the Design Level