IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences
Recent Updates on IBM System G
— GraphBIG and Temporal Data
Yinglong Xia
IBM T.J. Watson Research Center
Yorktown Heights, NY 10598
© 2014 International Business Machines Corporation
Using LDBC-SNB for GraphBIG
• GraphBIG = Graph Benchmark Suite from IBM System G and GaTech HPArch
• A wide selection of workloads for both CPU and GPU
• Workloads ranging from graph traversal to Gibbs sampling on Bayesian networks
• Illustrating the impact of processor architecture using hardware performance counters
• Fix input data and implementation
• Show performance profiling at processor architecture level
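The idea of fixing both the input data and the implementation can be sketched with a small traversal workload. This is only an illustration and not GraphBIG code: `make_fixed_graph` and `bfs_levels` are hypothetical names, and the seeded random graph stands in for a fixed benchmark dataset such as one generated by LDBC-SNB.

```python
from collections import deque
import random


def make_fixed_graph(num_vertices, num_edges, seed=42):
    """Build an undirected random graph; the fixed seed keeps the input identical across runs."""
    rng = random.Random(seed)
    adj = {v: [] for v in range(num_vertices)}
    for _ in range(num_edges):
        u = rng.randrange(num_vertices)
        v = rng.randrange(num_vertices)
        adj[u].append(v)
        adj[v].append(u)
    return adj


def bfs_levels(adj, source):
    """Breadth-first traversal; returns the hop distance of every vertex reachable from source."""
    level = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in level:
                level[v] = level[u] + 1
                queue.append(v)
    return level
```

Because the graph and the traversal code are both fixed, any difference observed when the same script runs on two machines can be attributed to the platform rather than to the workload. Hardware counters themselves would have to be read with an external profiler, which this sketch does not cover.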
Beyond the Benchmarking for Graph DBs
• Graph computing was barely considered in architecture design
• Increasing motivation due to popularity of graph analytics
• Measuring the impact of architecture requires fixed input data and a fixed analytic implementation
Demanding Graph
• Interactions of entities in many big data applications are naturally modeled by property graphs
• The evolution of graph structure and properties over time usually provides useful information, which needs to be maintained for query or analytics
• The graph analytics market is growing quickly, as are graph data size and complexity, yet near-real-time response is typically required
Xiaoyan Fu, Seok-Hee Hong, Nikola S. Nikolov, Xiaobin Shen, Ying Xin Wu and Kai Xu,
Visualization and Analysis of Email Networks, Proceedings of APVIS 2007, IEEE, pp.1-8, 2007.
Use Case: Forensic Analysis on Individual Status
• Recover the dynamics of individual status
• Evaluate status measures, anomalies, etc.
• Propagate known status measures
• Estimate labels for each person at each time stamp
• Aggregate the received measures
Chain Graph: A collection of graphs on contiguous time steps
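A chain graph and one round of propagation can be sketched as follows. This is a minimal illustration under stated assumptions, not the System G method: the snapshot representation, the function name `propagate_once`, and the use of a plain mean as the aggregation rule are all hypothetical choices.

```python
def propagate_once(snapshots, known):
    """One propagation round over a chain graph.

    snapshots: list indexed by time step; each entry maps a person to a set of neighbours.
    known:     dict mapping (person, t) to a known status measure.
    Returns a dict with the known measures plus an estimate for every unknown
    (person, t) that received at least one measure.
    """
    estimates = dict(known)
    for t, graph in enumerate(snapshots):
        for person, neighbours in graph.items():
            if (person, t) in known:
                continue
            # Measures received from neighbours within the same time step.
            received = [known[(n, t)] for n in neighbours if (n, t) in known]
            # Measures received from the same person at adjacent time steps.
            for dt in (-1, 1):
                if (person, t + dt) in known:
                    received.append(known[(person, t + dt)])
            if received:
                # Aggregate the received measures (assumed here: arithmetic mean).
                estimates[(person, t)] = sum(received) / len(received)
    return estimates
```

With two time steps, persons "a" and "b" connected in both, and known measures 1.0 for ("a", 0) and 0.0 for ("b", 1), each unknown entry receives one measure from its neighbour and one from its own adjacent time step, so both estimates come out as 0.5.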
Use Case: Bitemporal Data Exploration
• Support the valid dimension and the transaction dimension
• Audit trail of what you knew and when you knew it
• History of how history, from a business perspective, was stored in the database
http://bitemporalmodeling.com/temporal-data-blog/
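The two dimensions can be made concrete with a small sketch. It assumes half-open intervals and integer "day" timestamps, and the names `Version` and `as_of` are hypothetical; it is not taken from System G.

```python
from dataclasses import dataclass
import math

INF = math.inf


@dataclass
class Version:
    value: str
    valid_from: float      # valid dimension: when the fact holds in the business world
    valid_to: float
    tx_from: float         # transaction dimension: when the database held this record
    tx_to: float = INF


def as_of(versions, valid_time, tx_time):
    """Return the value valid at valid_time, as the database knew it at tx_time."""
    for v in versions:
        if v.valid_from <= valid_time < v.valid_to and v.tx_from <= tx_time < v.tx_to:
            return v.value
    return None


# On day 10 we record that the address has been "A" since day 1.
# On day 20 we learn it actually changed to "B" on day 5, so the old record is
# closed in transaction time and two corrected records are written.
history = [
    Version("A", valid_from=1, valid_to=INF, tx_from=10, tx_to=20),
    Version("A", valid_from=1, valid_to=5, tx_from=20),
    Version("B", valid_from=5, valid_to=INF, tx_from=20),
]
```

Asking about valid day 7 as of transaction day 15 returns "A" (what was believed then), while the same question as of transaction day 25 returns "B". Nothing is overwritten, which is what provides the audit trail.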
Graph Data Management
Sparksee
Neo4j
Titan
Organization of Graph Store
Organize Temporal Graph Data
Name                          Default Value
vertex_history                Disabled
num_vertex_property_bundles   0
edge_history                  Disabled
num_edge_property_bundles     0
…                             …
Vertex Record Table (vertex record accessed by VID * sizeof(VtxRec))
Field            Type
Flag             uint8
inEdge           uint64
inEdge Count     uint16
outEdge          uint64
outEdge Count    uint16
Property         uint64
Property Count   uint64
History          uint64
…
inEdge List
• Fields: Prev, Edge_list_buffer<EID,VID,LID>, …
• inEdgeCount * sizeof(<EID,VID,LID>) points to the buffer end
Edge Record Table (edge record accessed by EID * sizeof(EdgeRec))
• Fields: Flag, Property, Property Count, History, …
Vertex Property Table
• Fields: Prev, property_buffer, …
• PropertyCount points to the buffer end
Edge Property Table
• Fields: Prev, property_buffer, …
Local Configuration
Name      Default Value
min_VID   0
max_VID
min_EID   0
Max_EID
…         …
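The fixed-size record layout means a vertex can be located with plain offset arithmetic. The sketch below mirrors the field list of the Vertex Record Table using Python's `struct` module. The packed little-endian layout with no padding is an assumption (the actual store may align fields differently), and `VtxRec`, `write_vertex`, and `read_vertex` are hypothetical helper names.

```python
import struct
from collections import namedtuple

# Field order follows the Vertex Record Table: uint8, uint64, uint16, uint64,
# uint16, uint64, uint64, uint64. "<" requests little-endian with no padding (assumed).
VTX_FMT = struct.Struct("<BQHQHQQQ")

VtxRec = namedtuple(
    "VtxRec",
    "flag in_edge in_edge_count out_edge out_edge_count prop prop_count history",
)


def vertex_offset(vid):
    """Byte offset of a vertex record: VID * sizeof(VtxRec)."""
    return vid * VTX_FMT.size


def write_vertex(table, vid, rec):
    """Store a record in place inside a writable buffer such as a bytearray."""
    VTX_FMT.pack_into(table, vertex_offset(vid), *rec)


def read_vertex(table, vid):
    """Load the record of vertex vid directly from its computed offset."""
    return VtxRec(*VTX_FMT.unpack_from(table, vertex_offset(vid)))
```

Under these assumptions each record occupies 45 bytes, so a table for four vertices is `bytearray(4 * VTX_FMT.size)` and vertex 2 starts at byte 90. The same arithmetic applies to edge records with EID and the edge record size.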
Pointer Jumping in Temporal Graph Inference
• Converting a chain (temporal) graph into a tridiagonal system
• Forward Gaussian elimination by propagation
• Backward substitution to produce solutions
• A parallel solution to the Thomas algorithm
• Apply pointer jumping to the Thomas algorithm to solve the tridiagonal linear system in parallel
• Logarithmic speedup
• Propagate belief among vertices within and across time stamps
Speedup wrt Gaussian elimination: T^3 / log T
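The two building blocks named on this slide can be sketched separately. The first function is the standard sequential Thomas algorithm (forward elimination, then backward substitution). The second shows pointer jumping on a linked chain, with each round simulated sequentially. How the slides combine the two into one parallel solver is not shown in the deck, so this sketch does not attempt it; the function names are hypothetical.

```python
def thomas_solve(a, b, c, d):
    """Solve a tridiagonal system.

    a: sub-diagonal (a[0] unused), b: main diagonal,
    c: super-diagonal (c[-1] unused), d: right-hand side.
    """
    n = len(b)
    cp = [0.0] * n
    dp = [0.0] * n
    # Forward elimination: propagate along the chain from the first row.
    cp[0] = c[0] / b[0]
    dp[0] = d[0] / b[0]
    for i in range(1, n):
        denom = b[i] - a[i] * cp[i - 1]
        cp[i] = c[i] / denom if i < n - 1 else 0.0
        dp[i] = (d[i] - a[i] * dp[i - 1]) / denom
    # Backward substitution: recover the unknowns from the last row.
    x = [0.0] * n
    x[-1] = dp[-1]
    for i in range(n - 2, -1, -1):
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x


def pointer_jumping_ranks(succ):
    """Distance of every node to the tail of a chain, by pointer jumping.

    succ[i] is the successor of node i; the tail points to itself.
    Every round reads only the previous round's arrays, so on a parallel
    machine all nodes could update at once. Returns (ranks, rounds).
    """
    n = len(succ)
    nxt = list(succ)
    rank = [0 if nxt[i] == i else 1 for i in range(n)]
    rounds = 0
    while any(nxt[i] != nxt[nxt[i]] for i in range(n)):
        new_rank = [rank[i] + rank[nxt[i]] for i in range(n)]
        new_nxt = [nxt[nxt[i]] for i in range(n)]
        rank, nxt = new_rank, new_nxt
        rounds += 1
    return rank, rounds
```

For a chain of eight nodes the pointer-jumping loop finishes in three rounds, since every round doubles the distance each pointer spans. That doubling is the source of the logarithmic depth mentioned above.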
Comments and Questions?

Unleash Your Potential - Namagunga Girls Coding Club
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Recent Updates on IBM System G — GraphBIG and Temporal Data

  • 1. IBM Research – Industries & Solutions – Business Solutions & Mathematical Sciences Recent Updates on IBM System G — GraphBIG and Temporal Data Yinglong Xia IBM T.J. Watson Research Center Yorktown Heights, NY 10598
  • 2. © 2014 International Business Machines Corporation. IBM T.J. Watson Research Center
  • 3. Using LDBC-SNB for GraphBIG
      • GraphBIG = Graph Benchmark Suite from IBM System G and GaTech HPArch
      • A wide selection of workloads for both CPU and GPU
      • Workloads ranging from graph traversal to Gibbs sampling on Bayesian networks
      • Illustrating processor architecture impact using hardware performance counters
      • Fix input data and implementation
      • Show performance profiling at processor architecture level
  • 4. Beyond the Benchmarking for Graph DBs
      • Graph computing was barely considered in architecture design
      • Increasing motivation due to popularity of graph analytics
      • Impact of architecture requires fixed input data and analytic implementation
  • 6. Demanding Graph
      • Interactions of entities in many big data applications are naturally modeled by property graphs
      • Evolution of graph structure and properties over time usually provides useful information, which needs to be maintained for query or analytics
      • The graph analytics market is growing fast, as are graph data size and complexity, but near-real-time response is typically required
      • Source: Xiaoyan Fu, Seok-Hee Hong, Nikola S. Nikolov, Xiaobin Shen, Ying Xin Wu and Kai Xu, Visualization and Analysis of Email Networks, Proceedings of APVIS 2007, IEEE, pp. 1-8, 2007.
  • 7. Use Case: Forensic Analysis on Individual Status
      • Recover the dynamics of individual status
      • Evaluate status measures, anomalies, etc.
      • Propagate known status measures
      • Estimate labels for each person at each time stamp
      • Aggregate the received measures
      • Chain graph: a collection of graphs on contiguous time steps
  • 8. Use Case: Bitemporal Data Exploration
      • Support the valid dimension and the transaction dimension
      • Audit trail of what you knew and when you knew it
      • History of how history, from a business perspective, was stored in the database
      • Source: http://bitemporalmodeling.com/temporal-data-blog/
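The two time dimensions on slide 8 can be illustrated with a small sketch. It is not taken from IBM System G: the `Version` record, the `as_of` helper, the integer time stamps, and the half-open intervals are all assumptions made for illustration. Each stored version carries a valid interval (when the fact held in the real world) and a transaction interval (when the database believed it).

```python
from dataclasses import dataclass
from typing import List, Optional

INF = float("inf")  # open-ended interval bound (assumption)


@dataclass(frozen=True)
class Version:
    """One stored version of a property value (hypothetical layout)."""
    value: str
    valid_from: int   # valid dimension, inclusive
    valid_to: float   # valid dimension, exclusive
    tx_from: int      # transaction dimension, inclusive
    tx_to: float      # transaction dimension, exclusive


def as_of(versions: List[Version], valid_time: int, tx_time: int) -> Optional[str]:
    """Return what the database believed at tx_time about valid_time."""
    for v in versions:
        if v.valid_from <= valid_time < v.valid_to and v.tx_from <= tx_time < v.tx_to:
            return v.value
    return None


# At transaction time 10 the status "A" is recorded as valid from time 1 onward.
# At transaction time 20 a correction arrives: the status was "B" from time 5 onward.
# The superseded row is closed in the transaction dimension, never deleted.
history = [
    Version("A", 1, INF, 10, 20),   # original belief, superseded at tx 20
    Version("A", 1, 5, 20, INF),    # corrected belief for valid times 1..4
    Version("B", 5, INF, 20, INF),  # corrected belief for valid times 5 onward
]

belief_before_correction = as_of(history, valid_time=7, tx_time=15)
belief_after_correction = as_of(history, valid_time=7, tx_time=25)
```

Keeping the superseded row is what provides the audit trail: a query at an earlier transaction time reproduces the answer that would have been given then.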
  • 9. Graph Data Management: Sparksee, Neo4j, Titan
  • 10. Organization of Graph Store
  • 11. Organize Temporal Graph Data
      • Configuration (Name: Default Value)
          • vertex_history: Disabled
          • num_vertex_property_bundles: 0
          • edge_history: Disabled
          • num_edge_property_bundles: 0
          • …
      • Vertex Record Table, accessed by VID * sizeof(VtxRec); fields:
          • Flag (uint8)
          • inEdge (uint64)
          • inEdge Count (uint16)
          • outEdge (uint64)
          • outEdge Count (uint16)
          • Property (uint64)
          • Property Count (uint64)
          • History (uint64)
          • …
      • inEdge List: Prev, Edge_list_buffer<EID,VID,LID>, …; inEdgeCount * sizeof(<EID,VID,LID>) points to the buffer end
      • Edge Record Table, accessed by EID * sizeof(EdgeRec); fields: Flag, Property, Property Count, History, …
      • Vertex Property Table: Prev, property_buffer, …; PropertyCount points to the buffer end
      • Edge Property Table: Prev, property_buffer, …
      • Local Configuration (Name: Default Value)
          • min_VID: 0
          • max_VID
          • min_EID: 0
          • Max_EID
          • …
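The fixed-size record addressing on slide 11 (a vertex record located at VID * sizeof(VtxRec)) can be sketched as follows. This is only an illustration under stated assumptions: the packed little-endian layout, the field order, and the helper names `write_vertex` and `read_vertex` are hypothetical, and the slide elides further fields with "…", which are left out here.

```python
import struct

# Assumed packed, little-endian layout for the fields named on the slide:
# Flag (uint8), inEdge (uint64), inEdge Count (uint16), outEdge (uint64),
# outEdge Count (uint16), Property (uint64), Property Count (uint64),
# History (uint64). Any fields hidden behind the slide's ellipsis are omitted.
VTX_REC = struct.Struct("<BQHQHQQQ")


def write_vertex(table: bytearray, vid: int, fields: tuple) -> None:
    """Store a record in place at byte offset VID * sizeof(VtxRec)."""
    VTX_REC.pack_into(table, vid * VTX_REC.size, *fields)


def read_vertex(table: bytearray, vid: int) -> tuple:
    """Fetch a record with one offset computation, no search or index."""
    return VTX_REC.unpack_from(table, vid * VTX_REC.size)


# A table with room for four vertex records (VIDs 0..3).
table = bytearray(VTX_REC.size * 4)
record = (1, 100, 3, 200, 5, 300, 2, 0)
write_vertex(table, 2, record)
loaded = read_vertex(table, 2)
```

Because every record has the same size, lookup by VID is a constant-time offset computation; the variable-length parts (edge lists, properties, history) live in separate buffers that the record points into.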
  • 12. Pointer Jumping in Temporal Graph Inference
      • Converting a chain graph (temporal graph) into a tridiagonal system
          • Forward Gaussian elimination by propagation
          • Backward substitution to produce solutions
      • A parallel solution to the Thomas algorithm
          • Apply pointer jumping to the Thomas algorithm
          • Logarithmic speedup: a parallel solution to a tridiagonal linear system
      • Propagate belief among vertices within and across time stamps
      • Speedup w.r.t. Gaussian elimination: T^3 / log T
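The two building blocks named on slide 12 can be sketched separately. The first function is the standard serial Thomas algorithm (forward elimination, then backward substitution). The second shows pointer jumping on a plain chain, computing each node's distance to the tail in a logarithmic number of synchronous rounds. How the talk combines the two is not spelled out in the transcript, so that combination is not attempted here; function names and the test data are assumptions.

```python
from typing import List, Tuple


def thomas_solve(lower: List[float], diag: List[float],
                 upper: List[float], rhs: List[float]) -> List[float]:
    """Solve a tridiagonal system; lower[0] and upper[-1] are ignored.

    No pivoting is done, so this assumes a well-conditioned system
    (for example, a diagonally dominant one).
    """
    n = len(diag)
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    # Forward elimination: propagate from the first row to the last.
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom if i < n - 1 else 0.0
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    # Backward substitution: propagate from the last row to the first.
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def pointer_jump_ranks(succ: List[int]) -> Tuple[List[int], int]:
    """Distance of each node to the chain tail via pointer jumping.

    succ[i] is the successor of node i; the tail points to itself.
    Each loop iteration models one synchronous parallel round.
    """
    n = len(succ)
    nxt = list(succ)
    rank = [0 if nxt[i] == i else 1 for i in range(n)]
    rounds = 0
    while any(nxt[i] != nxt[nxt[i]] for i in range(n)):
        # Read the old arrays, write new ones: every node doubles its reach.
        rank = [rank[i] + rank[nxt[i]] for i in range(n)]
        nxt = [nxt[nxt[i]] for i in range(n)]
        rounds += 1
    return rank, rounds


# 4x4 system with 2 on the diagonal and -1 off the diagonal.
solution = thomas_solve([0.0, -1.0, -1.0, -1.0], [2.0, 2.0, 2.0, 2.0],
                        [-1.0, -1.0, -1.0, 0.0], [1.0, 0.0, 0.0, 1.0])

# Chain 0 -> 1 -> 2 -> 3 -> 4, with node 4 as the tail.
ranks, rounds = pointer_jump_ranks([1, 2, 3, 4, 4])
```

The serial solver walks the chain once in each direction, so its depth grows linearly with the number of time steps. Pointer jumping halves the remaining distance in every round, which is where the logarithmic depth of the parallel variant comes from.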
  • 13. Comments and Questions?