SlideShare a Scribd company logo
Industry Models – Visualisation & Graph DB
May 2017
Michal Miklas
2
Reasons for interest in Graph DB: Exploration of new content delivery formats and tools for
content consumption
 Use of graph DB as store for all glossaries and organisation data stores
metadata and their relationships
 Common independent interface for all services, apps and users
 Flexible schema (does not need to be predefined), easily extensible
 Consumable – easy access to the data via common API/interface
 Visualisation – a business user friendly tool for data exploration, navigation,
search and analysis
 Easy integration of Industry models content (or derived from) with customer
collection of glossaries, vocabularies, ontologies and other models
 Not seen as replacement of the main formats & content
authoring/management tools, rather complementary as a “read only
platform” mainly beneficial for business users and analysts
 Any current work is and future work will stay flexible & compatible with main
software tools: “Titan/Janusgraph with Tinkerpop”, “IBM Graph”, Neo4j
3
Industry Model Common Components
Data Models
Vocabulary
Atomic
Warehouse
Model
Dimensional
Warehouse
Model
Business
Data Model
Business
Terms
Analytical
Requirements
Supportive
Content
Industry Models
Industry concepts in plain business language and with no
modeling. Business Terms are organized by Business
Categories. The mapping to the data models allows the
transformation of requirements into IT data structures.
Business Terms
High level groups of business information to express
business Measures along axes of analysis, which are
named Dimensions.
Analytical Requirements
Grouping of terms incorporating any terminology
originating from an internal or external source. It is used to
support data structures such as regulatory reports,
industry standards, business architecture standards,
vendor interfaces, or legacy source systems.
Supportive Content
Highly normalized conceptual data
model that is an enterprise-wide,
generic, and flexible data epresentation
of informational systems.
Business Data Model
A normalized design level data
model representing the
repository of atomic data used
for informational processing.
Atomic Model
A design level dimensional model
representing the repository of analytical
data. It contains star schemas
supporting the Analytical Requirements
Dimensional Model
4
Tools Currently Used & Supported
 Data Modelling tools:
• Infosphere Data Architect
• Erwin Data Modeler
 Data Governance tools:
• Infosphere Governance Catalog
(Business Glossaries, Models
and other metadata)
5
Graph Databases & Graph Data Visualisation tools
 Graph Databases
• Neo4j Community Edition (GPL v3 license)
• Neo4j Enterprise Edition (Evaluation & Commercial License)
• Titan (Apache License 2.0)
• Janusgraph (Apache 2 License & Creative Commons Attribution 4.0 International)
• IBM Graph managed service on Bluemix (build on Titan/Tinkeprop stack)
 Graph computing framework
• Tinkerpop (Apache License 2.0) embedded in Titan/Janusgraph but can be use as
standalone GraphDB too for demos and small projects using in-memory TinkerGraph
 Graph data visualisation
• Neo4j Browser (same licensing as above) - part of the Neo4j Graph DB
• Linkurious Enterprise (commercial license)
vis.js javascript visualisation library (Apache 2.0 and MIT)
6
Architecture options
7
Energy & Utilities - Outage - Neo4j
8
Energy & Utilities - Outage - Linkurious
9
Energy & Utilities - Outage - Vis.JS
10
Example of Graph DB schema model
11
Content Transformation
 IM artifact formats
• IGC export in XML
• Logical Models LDM files are XML files
 Transformation is split into two steps (fully working script prototype in Powershell)
This allows to bring any customer’s data into the mix in easy to understand format: collection of
CSV files in two folders – nodes and edges – each file represents different node/edge type with
any properties as CSV columns (each node type can have different set of properties/columns) –
the only mandatory fields are ID and name
1. XML (LDM & IGC) to CSV transform
2. CSV to GRAPHML transform
• also produces Graph DB schema model in JSON format (for IBM Graph on Bluemix)
• also produces groovy script for schema creation for Titan/Janusgraph
 GRAPHML graph data format – includes schema and all node and edge data
• a format importable to Titan/JanusGraph using the Gremlin/Tinkerpop console
• Tinkeprop/Gramlin can be also used to import to Neo4j
• Format recognized and supported also by IBM graph (although currently with limitation –
size of file cannot be over 10MB)
12
Meaning of Node types in Graph DB
 Logical model based types
• entity
• attribute
• package
• model
• diagram
 Physical model based types
• column
• Table
 Glossary based types
• term
• category
13
Meaning of Edge types in Graph DB
 Assigns:
• term assignation to asset: attribute/column/entity/table
 Belongs:
• describes parent object = ownership
• entity/table to package
• package to package
• package to model
• term to category
• category to category
• category to glossary
• diagram to package
 Describes:
• attribute describes entity
• column describes table
• term isOf another term
 Maps:
• attribute to attribute calculation
• attribute to attribute/entity population in the same model or AWM-
>DWM
• column to column/table population in the same schema or AWM-
>DWM
• note: covers both population and calculation dependencies
• design transformation dependency
• attribute to attribute/entity (from another model e.g. BDM->AWM or
BDM->DWM)
• table to entity
• column to attribute
 References:
• term to category: can be referenced by any number of categories (in
addition to owning category)
• entity/table to diagram: can be referenced by any number of diagrams
 Relates:
• entity to entity relationship in ER models
• table to table relationship in ER models
• term to term relatedTerm
 Subtype:
• entity is subtype of another entity generalization/inheritance in ER
models
• term isTypeOf another term
 Synonym:
• one term is synonym of another - in a directed graph there is a direction
of this edge suggesting master-child
14
Common attribute across all edge & node types
 Taxonomy:
• Logical Business Data Model
• Logical Atomic Warehouse Model
• Logical Dimensional Warehouse
Model
• Business Glossary
• Analytical Requirements
• Supportive Content
• Scopes
• Physical Dimensional Warehouse
Model
• Physical Business Data Model
• Physical Atomic Warehouse Model
 Taxonomy Type:
• logicalModel
• physicalModel
• glossary
 Industry:
• banking
• insurance
• healthcare
• Utilities
 Version

More Related Content

Similar to IM in Graph 2017-05.pdf

Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Bill Duncan
 
Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...
Bill Duncan
 
Developing Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse SiriusDeveloping Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse Sirius
Obeo
 
Dynamics ax 2012 development overview
Dynamics ax 2012 development overviewDynamics ax 2012 development overview
Dynamics ax 2012 development overviewAli Raza Zaidi
 
CS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQLCS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQL
Palani Kumar
 
DITA 1.3: What's New and Different
DITA 1.3: What's New and DifferentDITA 1.3: What's New and Different
DITA 1.3: What's New and Different
dclsocialmedia
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUGIF
 
MDE in Practice
MDE in PracticeMDE in Practice
MDE in Practice
Abdalmassih Yakeen
 
UNIT III CAD STANDARDS
UNIT III CAD STANDARDS UNIT III CAD STANDARDS
UNIT III CAD STANDARDS
ravis205084
 
IBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application DevelopmentIBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application Development
Andrew Coleman
 
the Modeling is a way of thinking about the
the Modeling is a way of thinking about thethe Modeling is a way of thinking about the
the Modeling is a way of thinking about the
saman zaker
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope Document
Ryan Casey
 
CAD Data Exchange format used in industry
CAD Data Exchange format used in industryCAD Data Exchange format used in industry
CAD Data Exchange format used in industry
rahulkatre9
 
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURINGUnit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
Mohanumar S
 
Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Software
 
Real-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSASReal-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSAS
Lynn Langit
 
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptxUnit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
dinesh babu
 
Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)
PascalDesmarets1
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
Srivatsan Ramanujam
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4jSina Khorami
 

Similar to IM in Graph 2017-05.pdf (20)

Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...
 
Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...
 
Developing Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse SiriusDeveloping Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse Sirius
 
Dynamics ax 2012 development overview
Dynamics ax 2012 development overviewDynamics ax 2012 development overview
Dynamics ax 2012 development overview
 
CS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQLCS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQL
 
DITA 1.3: What's New and Different
DITA 1.3: What's New and DifferentDITA 1.3: What's New and Different
DITA 1.3: What's New and Different
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutes
 
MDE in Practice
MDE in PracticeMDE in Practice
MDE in Practice
 
UNIT III CAD STANDARDS
UNIT III CAD STANDARDS UNIT III CAD STANDARDS
UNIT III CAD STANDARDS
 
IBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application DevelopmentIBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application Development
 
the Modeling is a way of thinking about the
the Modeling is a way of thinking about thethe Modeling is a way of thinking about the
the Modeling is a way of thinking about the
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope Document
 
CAD Data Exchange format used in industry
CAD Data Exchange format used in industryCAD Data Exchange format used in industry
CAD Data Exchange format used in industry
 
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURINGUnit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
 
Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com
 
Real-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSASReal-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSAS
 
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptxUnit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
 
Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4j
 

Recently uploaded

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

IM in Graph 2017-05.pdf

  • 1. Industry Models – Visualisation & Graph DB May 2017 Michal Miklas
  • 2. 2 Reasons for interest in Graph DB: Exploration of new content delivery formats and tools for content consumption  Use of graph DB as store for all glossaries and organisation data stores metadata and their relationships  Common independent interface for all services, apps and users  Flexible schema (does not need to be predefined), easily extensible  Consumable – easy access to the data via common API/interface  Visualisation – a business user friendly tool for data exploration, navigation, search and analysis  Easy integration of Industry models content (or derived from) with customer collection of glossaries, vocabularies, ontologies and other models  Not seen as replacement of the main formats & content authoring/management tools, rather complementary as a “read only platform” mainly beneficial for business users and analysts  Any current work is and future work will stay flexible & compatible with main software tools: “Titan/Janusgraph with Tinkerpop”, “IBM Graph”, Neo4j
  • 3. 3 Industry Model Common Components Data Models Vocabulary Atomic Warehouse Model Dimensional Warehouse Model Business Data Model Business Terms Analytical Requirements Supportive Content Industry Models Industry concepts in plain business language and with no modeling. Business Terms are organized by Business Categories. The mapping to the data models allows the transformation of requirements into IT data structures. Business Terms High level groups of business information to express business Measures along axes of analysis, which are named Dimensions. Analytical Requirements Grouping of terms incorporating any terminology originating from an internal or external source. It is used to support data structures such as regulatory reports, industry standards, business architecture standards, vendor interfaces, or legacy source systems. Supportive Content Highly normalized conceptual data model that is an enterprise-wide, generic, and flexible data epresentation of informational systems. Business Data Model A normalized design level data model representing the repository of atomic data used for informational processing. Atomic Model A design level dimensional model representing the repository of analytical data. It contains star schemas supporting the Analytical Requirements Dimensional Model
  • 4. 4 Tools Currently Used & Supported  Data Modelling tools: • Infosphere Data Architect • Erwin Data Modeler  Data Governance tools: • Infosphere Governance Catalog (Business Glossaries, Models and other metadata)
  • 5. 5 Graph Databases & Graph Data Visualisation tools  Graph Databases • Neo4j Community Edition (GPL v3 license) • Neo4j Enterprise Edition (Evaluation & Commercial License) • Titan (Apache License 2.0) • Janusgraph (Apache 2 License & Creative Commons Attribution 4.0 International) • IBM Graph managed service on Bluemix (build on Titan/Tinkeprop stack)  Graph computing framework • Tinkerpop (Apache License 2.0) embedded in Titan/Janusgraph but can be use as standalone GraphDB too for demos and small projects using in-memory TinkerGraph  Graph data visualisation • Neo4j Browser (same licensing as above) - part of the Neo4j Graph DB • Linkurious Enterprise (commercial license) vis.js javascript visualisation library (Apache 2.0 and MIT)
  • 7. 7 Energy & Utilities - Outage - Neo4j
  • 8. 8 Energy & Utilities - Outage - Linkurious
  • 9. 9 Energy & Utilities - Outage - Vis.JS
  • 10. 10 Example of Graph DB schema model
  • 11. 11 Content Transformation  IM artifact formats • IGC export in XML • Logical Models LDM files are XML files  Transformation is split into two steps (fully working script prototype in Powershell) This allows to bring any customer’s data into the mix in easy to understand format: collection of CSV files in two folders – nodes and edges – each file represents different node/edge type with any properties as CSV columns (each node type can have different set of properties/columns) – the only mandatory fields are ID and name 1. XML (LDM & IGC) to CSV transform 2. CSV to GRAPHML transform • also produces Graph DB schema model in JSON format (for IBM Graph on Bluemix) • also produces groovy script for schema creation for Titan/Janusgraph  GRAPHML graph data format – includes schema and all node and edge data • a format importable to Titan/JanusGraph using the Gremlin/Tinkerpop console • Tinkeprop/Gramlin can be also used to import to Neo4j • Format recognized and supported also by IBM graph (although currently with limitation – size of file cannot be over 10MB)
  • 12. 12 Meaning of Node types in Graph DB  Logical model based types • entity • attribute • package • model • diagram  Physical model based types • column • Table  Glossary based types • term • category
  • 13. 13 Meaning of Edge types in Graph DB  Assigns: • term assignation to asset: attribute/column/entity/table  Belongs: • describes parent object = ownership • entity/table to package • package to package • package to model • term to category • category to category • category to glossary • diagram to package  Describes: • attribute describes entity • column describes table • term isOf another term  Maps: • attribute to attribute calculation • attribute to attribute/entity population in the same model or AWM- >DWM • column to column/table population in the same schema or AWM- >DWM • note: covers both population and calculation dependencies • design transformation dependency • attribute to attribute/entity (from another model e.g. BDM->AWM or BDM->DWM) • table to entity • column to attribute  References: • term to category: can be referenced by any number of categories (in addition to owning category) • entity/table to diagram: can be referenced by any number of diagrams  Relates: • entity to entity relationship in ER models • table to table relationship in ER models • term to term relatedTerm  Subtype: • entity is subtype of another entity generalization/inheritance in ER models • term isTypeOf another term  Synonym: • one term is synonym of another - in a directed graph there is a direction of this edge suggesting master-child
  • 14. 14 Common attribute across all edge & node types  Taxonomy: • Logical Business Data Model • Logical Atomic Warehouse Model • Logical Dimensional Warehouse Model • Business Glossary • Analytical Requirements • Supportive Content • Scopes • Physical Dimensional Warehouse Model • Physical Business Data Model • Physical Atomic Warehouse Model  Taxonomy Type: • logicalModel • physicalModel • glossary  Industry: • banking • insurance • healthcare • Utilities  Version