Graph-based Metadata Management &
Data Governance
Dr. Jesús Barrasa - @BarrasaDV
Leverage the power of Graphs for GDPR
14 November 2017
GraphTalks - Frankfurt
WHAT IS A GRAPH?
A way of representing data
DATA DATA
Relational
Database
Good for:
Well-understood data structures
that don’t change too frequently
A way of representing data
Known problems involving discrete
parts of the data, or minimal
connectivity
Graph
Database
Relational
Database
Good for:
Well-understood data structures
that don’t change too frequently
Known problems involving discrete
parts of the data, or minimal
connectivity
A way of representing data
Good for:
Dynamic systems: where the data
topology is difficult to predict
Dynamic requirements: 

the evolve with the business
Problems where the relationships in
data contribute meaning & value
THE PROPERTY GRAPH
DATA MODEL
A Graph Is
ROAD
Incident
LIGHT
A Graph Is
LIGHT
HAS
AVAILABLE
HOTEL
ROOMS
AVAILABILITY
A Graph Is
ACCOUNT
ADDRESS
PERSON
PERSON
NAME
STREET
BANK
NAME
COMPANY
BANK
BAHAMAS
A Graph Is
KNOWS
KNOWS
KNOWS
WORKS_AT
WORKS_AT
WORKS_AT
COMPANY
STANFORD
STUDIED_AT
KNOWS
NEO
COLUMBIA
STUDIED_AT
STU
D
IED
_AT
STUDIED_AT
NAME:ANNE
A Graph
SINCE:2012
RELATIONSHIPS
NODE
PROPERTY
A Graph
NAME:ANNE
SINCE:2012
GRAPH THINKING
Look at this data…
Swap glasses…
… now look at it again, this time as a graph
Use of Graphs has created some of the most successful companies in the world
C
34,3%B
38,4%A
3,3%
D
3,8%
1,8%
1,8%
1,8%
1,8%
1,8%
E
8,1%
F
3,9%
Neo4j is the world’s first “off the shelf” graph database
designed to derive value from data relationships
13.4 million leaked files
A year ago…
Panama Papers: The size and the shape
WHAT DID WE LEARN FROM THE
PANAMA PAPERS?
Finance HR &
Recruiting
Manufacturing
& Logistics
Health CareTelco Retail
Today we see graph-projects in virtually every industry
Government
NEO4j USE CASES
Real Time Recommendations
Fraud Detection
Identity & Access Mgmnt
Dependency Modelling in Telecoms
Knowledge Graph
VALUE FROM GRAPHS
IN MDM, DATA GOVERNANCE AND
METADATA MANAGEMENT
Data Modelling and Definition
1
Data Lineage2
Consent Management 3
Entitlement 4
GRAPHS IN
METADATA
MANAGEMENT
#1 Data Modelling and
Definition
Party
CUST_SCHEMA
Party
First NameHAS_LOGICAL_ATT
CUST_SCH
EMA.ROLE
CUST_SCHEM
A_PARTY.FIRST_
NM
CUST_SCH
EMA.PARTY
SHEMA_HAS_TABLE
#1 Data Modelling and Definition
Party
Last Name
HAS_LOGICAL_ATT
SHEMA_HAS_TABLE
GENERATES
CONCEPT_FOR
CONCEPT_FOR
CONCEPT_FOR
HAS_NAME
CUSTOMER
NAMECUSTOMER
HAS_PHONE
TABLE_HAS_COL
CUST_SCHEM
A_PARTY.LAST_
NM
TABLE_HAS_COL
GENERATES
Enterprise Ontology
Application Logical
Model
Physical Schema
m
#2 Data Lineage
ETL_PROC
_1
SALES_SCHEMA
Normaliz
e_Date
SLS_SCHEM
A.PRODUCT
SLS_SCHEMA.S
ALES.DATE
SLS_SCHE
MA.SALES
SHEMA_HAS_TABLE
#2 Data Lineage
Channel_No
rmalization
SHEMA_HAS_TABLE
HAS_COL
SLS_SCHEMA.S
ALES.CHANNELHAS_COL
BusinessView
Integration MiddlewareOperational Systems
HAS_INPUT
HAS_INPUT
Time.time_key
Time.day_of_
week
Enterprise DWH
HAS_OUTPUT
HAS_OUTPUT
HAS_COL
HAS_COL
TechnicalView
Billing
System
EDWH
CDE:
Transaction
_Date
SENDS RECEIVES
CONCEPTUAL_ELEMENT_FOR
Star_Schema
Star_Sch
ema.Time
#3 Consent management
#3 Consent Management + MDM
2 Blackfriars
Bridge rd.
J.Barrasa
+44776611…
jesus@neo4j
.com
PHO
NE_FO
R
EMAIL_FOR
ADDRESS_FOR
jb@outlook.
com
EMAIL_FOR
{ contrib: ‘XYZ’,
permittedFor: [UC1,UC4],
consentUntil : 31-12-19 }
{ contrib: ‘internal’,
permittedFor: [UC3],
consentUntil : 31-12-20 }
{ contrib: ‘internal’,
permittedFor: [UC3],
consentUntil : 31-12-20 }
{ contrib: ‘internal’,
permittedFor: [UC3],
consentUntil : 31-12-20 }
EMAIL_FOR
{ contrib: ‘LMN’,
permittedFor: [UC2,UC6],
consentUntil : 31-12-20 }
#4 Entitlement
#4 Entitlement
User 1 User 3User 2
Exclusion
List G1
Resource 1
Group 1
Resource 2
Group 3
D
MEMBER
CRUD
MEMBER
R
M
EM
BER
MEMBER
CRUD
Recent Projects on GDPR & Governance
Big 4 Bank in London: Using Neo4j for Metadata
Lineage for GDPR
• GDPR: Protecting,
Categorising and Auditing
personal data.
• Lineage
• Changes tracking
• Impact analysis
APM +
ITOA
Graph
Database
APM +
ITOA
Data
Classification
Global Information Services Group: Using Neo4j for
Identity and Consent Management
Identity and consent management
• Sales want to…
“… know which people I can compliantly contact with a mailing
in vertical market x ?”
• Client wants to…
“… see the statement that allowed to contact me”
”… see a full statement of all the information you hold on me”
”… have my contact details suppressed from future mailings”
• Prod Man wants to…
“… assess the impact of removing data from supplier x”
“… test the effects of adjusting consent parameters”
DEMO
THANK YOU!

GraphTalks Frankfurt - Graph-based Metadata Management & Data Governance