SlideShare a Scribd company logo
1 of 14
Download to read offline
Industry Models – Visualisation & Graph DB
May 2017
Michal Miklas
2
Reasons for interest in Graph DB: Exploration of new content delivery formats and tools for
content consumption
 Use of graph DB as store for all glossaries and organisation data stores
metadata and their relationships
 Common independent interface for all services, apps and users
 Flexible schema (does not need to be predefined), easily extensible
 Consumable – easy access to the data via common API/interface
 Visualisation – a business user friendly tool for data exploration, navigation,
search and analysis
 Easy integration of Industry models content (or derived from) with customer
collection of glossaries, vocabularies, ontologies and other models
 Not seen as replacement of the main formats & content
authoring/management tools, rather complementary as a “read only
platform” mainly beneficial for business users and analysts
 Any current work is and future work will stay flexible & compatible with main
software tools: “Titan/Janusgraph with Tinkerpop”, “IBM Graph”, Neo4j
3
Industry Model Common Components
Data Models
Vocabulary
Atomic
Warehouse
Model
Dimensional
Warehouse
Model
Business
Data Model
Business
Terms
Analytical
Requirements
Supportive
Content
Industry Models
Industry concepts in plain business language and with no
modeling. Business Terms are organized by Business
Categories. The mapping to the data models allows the
transformation of requirements into IT data structures.
Business Terms
High level groups of business information to express
business Measures along axes of analysis, which are
named Dimensions.
Analytical Requirements
Grouping of terms incorporating any terminology
originating from an internal or external source. It is used to
support data structures such as regulatory reports,
industry standards, business architecture standards,
vendor interfaces, or legacy source systems.
Supportive Content
Highly normalized conceptual data
model that is an enterprise-wide,
generic, and flexible data epresentation
of informational systems.
Business Data Model
A normalized design level data
model representing the
repository of atomic data used
for informational processing.
Atomic Model
A design level dimensional model
representing the repository of analytical
data. It contains star schemas
supporting the Analytical Requirements
Dimensional Model
4
Tools Currently Used & Supported
 Data Modelling tools:
• Infosphere Data Architect
• Erwin Data Modeler
 Data Governance tools:
• Infosphere Governance Catalog
(Business Glossaries, Models
and other metadata)
5
Graph Databases & Graph Data Visualisation tools
 Graph Databases
• Neo4j Community Edition (GPL v3 license)
• Neo4j Enterprise Edition (Evaluation & Commercial License)
• Titan (Apache License 2.0)
• Janusgraph (Apache 2 License & Creative Commons Attribution 4.0 International)
• IBM Graph managed service on Bluemix (build on Titan/Tinkeprop stack)
 Graph computing framework
• Tinkerpop (Apache License 2.0) embedded in Titan/Janusgraph but can be use as
standalone GraphDB too for demos and small projects using in-memory TinkerGraph
 Graph data visualisation
• Neo4j Browser (same licensing as above) - part of the Neo4j Graph DB
• Linkurious Enterprise (commercial license)
vis.js javascript visualisation library (Apache 2.0 and MIT)
6
Architecture options
7
Energy & Utilities - Outage - Neo4j
8
Energy & Utilities - Outage - Linkurious
9
Energy & Utilities - Outage - Vis.JS
10
Example of Graph DB schema model
11
Content Transformation
 IM artifact formats
• IGC export in XML
• Logical Models LDM files are XML files
 Transformation is split into two steps (fully working script prototype in Powershell)
This allows to bring any customer’s data into the mix in easy to understand format: collection of
CSV files in two folders – nodes and edges – each file represents different node/edge type with
any properties as CSV columns (each node type can have different set of properties/columns) –
the only mandatory fields are ID and name
1. XML (LDM & IGC) to CSV transform
2. CSV to GRAPHML transform
• also produces Graph DB schema model in JSON format (for IBM Graph on Bluemix)
• also produces groovy script for schema creation for Titan/Janusgraph
 GRAPHML graph data format – includes schema and all node and edge data
• a format importable to Titan/JanusGraph using the Gremlin/Tinkerpop console
• Tinkeprop/Gramlin can be also used to import to Neo4j
• Format recognized and supported also by IBM graph (although currently with limitation –
size of file cannot be over 10MB)
12
Meaning of Node types in Graph DB
 Logical model based types
• entity
• attribute
• package
• model
• diagram
 Physical model based types
• column
• Table
 Glossary based types
• term
• category
13
Meaning of Edge types in Graph DB
 Assigns:
• term assignation to asset: attribute/column/entity/table
 Belongs:
• describes parent object = ownership
• entity/table to package
• package to package
• package to model
• term to category
• category to category
• category to glossary
• diagram to package
 Describes:
• attribute describes entity
• column describes table
• term isOf another term
 Maps:
• attribute to attribute calculation
• attribute to attribute/entity population in the same model or AWM-
>DWM
• column to column/table population in the same schema or AWM-
>DWM
• note: covers both population and calculation dependencies
• design transformation dependency
• attribute to attribute/entity (from another model e.g. BDM->AWM or
BDM->DWM)
• table to entity
• column to attribute
 References:
• term to category: can be referenced by any number of categories (in
addition to owning category)
• entity/table to diagram: can be referenced by any number of diagrams
 Relates:
• entity to entity relationship in ER models
• table to table relationship in ER models
• term to term relatedTerm
 Subtype:
• entity is subtype of another entity generalization/inheritance in ER
models
• term isTypeOf another term
 Synonym:
• one term is synonym of another - in a directed graph there is a direction
of this edge suggesting master-child
14
Common attribute across all edge & node types
 Taxonomy:
• Logical Business Data Model
• Logical Atomic Warehouse Model
• Logical Dimensional Warehouse
Model
• Business Glossary
• Analytical Requirements
• Supportive Content
• Scopes
• Physical Dimensional Warehouse
Model
• Physical Business Data Model
• Physical Atomic Warehouse Model
 Taxonomy Type:
• logicalModel
• physicalModel
• glossary
 Industry:
• banking
• insurance
• healthcare
• Utilities
 Version

More Related Content

Similar to IM in Graph 2017-05.pdf

Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Bill Duncan
 
Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...Bill Duncan
 
Developing Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse SiriusDeveloping Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse SiriusObeo
 
Dynamics ax 2012 development overview
Dynamics ax 2012 development overviewDynamics ax 2012 development overview
Dynamics ax 2012 development overviewAli Raza Zaidi
 
CS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQLCS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQLPalani Kumar
 
DITA 1.3: What's New and Different
DITA 1.3: What's New and DifferentDITA 1.3: What's New and Different
DITA 1.3: What's New and Differentdclsocialmedia
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUGIF
 
UNIT III CAD STANDARDS
UNIT III CAD STANDARDS UNIT III CAD STANDARDS
UNIT III CAD STANDARDS ravis205084
 
IBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application DevelopmentIBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application DevelopmentAndrew Coleman
 
the Modeling is a way of thinking about the
the Modeling is a way of thinking about thethe Modeling is a way of thinking about the
the Modeling is a way of thinking about thesaman zaker
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentRyan Casey
 
CAD Data Exchange format used in industry
CAD Data Exchange format used in industryCAD Data Exchange format used in industry
CAD Data Exchange format used in industryrahulkatre9
 
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURINGUnit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURINGMohanumar S
 
Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Software
 
Real-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSASReal-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSASLynn Langit
 
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptxUnit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptxdinesh babu
 
Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)PascalDesmarets1
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...Srivatsan Ramanujam
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4jSina Khorami
 

Similar to IM in Graph 2017-05.pdf (20)

Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...Automate document generation from sys ml models with rational rhapsody report...
Automate document generation from sys ml models with rational rhapsody report...
 
Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...Automate document generation from SysML models with Rational Rhapsody Reporte...
Automate document generation from SysML models with Rational Rhapsody Reporte...
 
Developing Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse SiriusDeveloping Modeling Tool for RM-ODP with Eclipse Sirius
Developing Modeling Tool for RM-ODP with Eclipse Sirius
 
Dynamics ax 2012 development overview
Dynamics ax 2012 development overviewDynamics ax 2012 development overview
Dynamics ax 2012 development overview
 
CS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQLCS8091_BDA_Unit_V_NoSQL
CS8091_BDA_Unit_V_NoSQL
 
DITA 1.3: What's New and Different
DITA 1.3: What's New and DifferentDITA 1.3: What's New and Different
DITA 1.3: What's New and Different
 
Ugif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutesUgif 10 2012 lycia2 introduction in 45 minutes
Ugif 10 2012 lycia2 introduction in 45 minutes
 
MDE in Practice
MDE in PracticeMDE in Practice
MDE in Practice
 
UNIT III CAD STANDARDS
UNIT III CAD STANDARDS UNIT III CAD STANDARDS
UNIT III CAD STANDARDS
 
IBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application DevelopmentIBM InterConnect 2015 - IIB Effective Application Development
IBM InterConnect 2015 - IIB Effective Application Development
 
the Modeling is a way of thinking about the
the Modeling is a way of thinking about thethe Modeling is a way of thinking about the
the Modeling is a way of thinking about the
 
SSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope DocumentSSAS RLS Prototype | Vision and Scope Document
SSAS RLS Prototype | Vision and Scope Document
 
CAD Data Exchange format used in industry
CAD Data Exchange format used in industryCAD Data Exchange format used in industry
CAD Data Exchange format used in industry
 
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURINGUnit 3-ME8691 & COMPUTER AIDED DESIGN AND    MANUFACTURING
Unit 3-ME8691 & COMPUTER AIDED DESIGN AND MANUFACTURING
 
Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com Serena Mainframe VUG In-Com
Serena Mainframe VUG In-Com
 
Real-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSASReal-world BISM in SQL Server 2012 SSAS
Real-world BISM in SQL Server 2012 SSAS
 
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptxUnit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
Unit 3-ASSEMBLY OF PARTS AND CAD STANDARDS.pptx
 
Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)Tutorial Expert How-To - Command Line Interface (CLI)
Tutorial Expert How-To - Command Line Interface (CLI)
 
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4j
 

Recently uploaded

Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 

Recently uploaded (20)

Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 

IM in Graph 2017-05.pdf

  • 1. Industry Models – Visualisation & Graph DB May 2017 Michal Miklas
  • 2. 2 Reasons for interest in Graph DB: Exploration of new content delivery formats and tools for content consumption  Use of graph DB as store for all glossaries and organisation data stores metadata and their relationships  Common independent interface for all services, apps and users  Flexible schema (does not need to be predefined), easily extensible  Consumable – easy access to the data via common API/interface  Visualisation – a business user friendly tool for data exploration, navigation, search and analysis  Easy integration of Industry models content (or derived from) with customer collection of glossaries, vocabularies, ontologies and other models  Not seen as replacement of the main formats & content authoring/management tools, rather complementary as a “read only platform” mainly beneficial for business users and analysts  Any current work is and future work will stay flexible & compatible with main software tools: “Titan/Janusgraph with Tinkerpop”, “IBM Graph”, Neo4j
  • 3. 3 Industry Model Common Components Data Models Vocabulary Atomic Warehouse Model Dimensional Warehouse Model Business Data Model Business Terms Analytical Requirements Supportive Content Industry Models Industry concepts in plain business language and with no modeling. Business Terms are organized by Business Categories. The mapping to the data models allows the transformation of requirements into IT data structures. Business Terms High level groups of business information to express business Measures along axes of analysis, which are named Dimensions. Analytical Requirements Grouping of terms incorporating any terminology originating from an internal or external source. It is used to support data structures such as regulatory reports, industry standards, business architecture standards, vendor interfaces, or legacy source systems. Supportive Content Highly normalized conceptual data model that is an enterprise-wide, generic, and flexible data epresentation of informational systems. Business Data Model A normalized design level data model representing the repository of atomic data used for informational processing. Atomic Model A design level dimensional model representing the repository of analytical data. It contains star schemas supporting the Analytical Requirements Dimensional Model
  • 4. 4 Tools Currently Used & Supported  Data Modelling tools: • Infosphere Data Architect • Erwin Data Modeler  Data Governance tools: • Infosphere Governance Catalog (Business Glossaries, Models and other metadata)
  • 5. 5 Graph Databases & Graph Data Visualisation tools  Graph Databases • Neo4j Community Edition (GPL v3 license) • Neo4j Enterprise Edition (Evaluation & Commercial License) • Titan (Apache License 2.0) • Janusgraph (Apache 2 License & Creative Commons Attribution 4.0 International) • IBM Graph managed service on Bluemix (build on Titan/Tinkeprop stack)  Graph computing framework • Tinkerpop (Apache License 2.0) embedded in Titan/Janusgraph but can be use as standalone GraphDB too for demos and small projects using in-memory TinkerGraph  Graph data visualisation • Neo4j Browser (same licensing as above) - part of the Neo4j Graph DB • Linkurious Enterprise (commercial license) vis.js javascript visualisation library (Apache 2.0 and MIT)
  • 7. 7 Energy & Utilities - Outage - Neo4j
  • 8. 8 Energy & Utilities - Outage - Linkurious
  • 9. 9 Energy & Utilities - Outage - Vis.JS
  • 10. 10 Example of Graph DB schema model
  • 11. 11 Content Transformation  IM artifact formats • IGC export in XML • Logical Models LDM files are XML files  Transformation is split into two steps (fully working script prototype in Powershell) This allows to bring any customer’s data into the mix in easy to understand format: collection of CSV files in two folders – nodes and edges – each file represents different node/edge type with any properties as CSV columns (each node type can have different set of properties/columns) – the only mandatory fields are ID and name 1. XML (LDM & IGC) to CSV transform 2. CSV to GRAPHML transform • also produces Graph DB schema model in JSON format (for IBM Graph on Bluemix) • also produces groovy script for schema creation for Titan/Janusgraph  GRAPHML graph data format – includes schema and all node and edge data • a format importable to Titan/JanusGraph using the Gremlin/Tinkerpop console • Tinkeprop/Gramlin can be also used to import to Neo4j • Format recognized and supported also by IBM graph (although currently with limitation – size of file cannot be over 10MB)
  • 12. 12 Meaning of Node types in Graph DB  Logical model based types • entity • attribute • package • model • diagram  Physical model based types • column • Table  Glossary based types • term • category
  • 13. 13 Meaning of Edge types in Graph DB  Assigns: • term assignation to asset: attribute/column/entity/table  Belongs: • describes parent object = ownership • entity/table to package • package to package • package to model • term to category • category to category • category to glossary • diagram to package  Describes: • attribute describes entity • column describes table • term isOf another term  Maps: • attribute to attribute calculation • attribute to attribute/entity population in the same model or AWM- >DWM • column to column/table population in the same schema or AWM- >DWM • note: covers both population and calculation dependencies • design transformation dependency • attribute to attribute/entity (from another model e.g. BDM->AWM or BDM->DWM) • table to entity • column to attribute  References: • term to category: can be referenced by any number of categories (in addition to owning category) • entity/table to diagram: can be referenced by any number of diagrams  Relates: • entity to entity relationship in ER models • table to table relationship in ER models • term to term relatedTerm  Subtype: • entity is subtype of another entity generalization/inheritance in ER models • term isTypeOf another term  Synonym: • one term is synonym of another - in a directed graph there is a direction of this edge suggesting master-child
  • 14. 14 Common attribute across all edge & node types  Taxonomy: • Logical Business Data Model • Logical Atomic Warehouse Model • Logical Dimensional Warehouse Model • Business Glossary • Analytical Requirements • Supportive Content • Scopes • Physical Dimensional Warehouse Model • Physical Business Data Model • Physical Atomic Warehouse Model  Taxonomy Type: • logicalModel • physicalModel • glossary  Industry: • banking • insurance • healthcare • Utilities  Version