SlideShare a Scribd company logo
25 Settembre 2017
With a datascience perspective
Aview of graph data usage by Cerved
Stefano Gatti – Head of Innovation and data sources
Nunzio Pellegrino – Senior Data Scientist – Innovation team
Cerved and its graphs in a nutshell
3
Cerved, in a nutshell
The Italian data-driven company
CREDIT INFORMATION
Protection against credit risk
MARKETING SOLUTIONS
New business opportunities
CREDIT MANAGEMENT
Manage and collect performing and
non-performing loans.
	
Over 1000 a minute	
ü  Documents	
	
Over 40 million	
ü  Lines of code
	
Over 30,000	
ü  Customers
	
Over 50 different	
ü  Data sources
Over 10million a day	
ü  Api call
	
Over 1,900	
ü  People
	
377 million Eur (2016)	
ü  Revenue
4
Web	Data	
Open	Data	
Proprietary	data	
Official	data	
Chamber	of	
Commerce	
official	data	
C
o
m
p
l
e
x
i
t
y	
Our big data
5
Cerved, in a tech view
Data	
Algorithms	
Solu0ons	
Towards algorithmic economy …
6
Cerved Graph Story
2011-12 - we started from an IT problem: reengineering of beneficial owner algorithm
7
Cerved Graph Story
2014-15 - we went through a more algorithmic problem: corporate linkages algorithm
8
Cerved Graph Story
2015-16 - we go with a “full stack” solution
9
Cerved Graph thoughts
We strongly believe in …
The	power	of	linking	data		
The	power	of	analyzing	data	with	network	analysis	
The	power	of	visualizing	data	in	a	different	way	
To	understand	a	li5le	be5er	the		
increasing	complexity	of	modern	world	…	
also	from	an	economic	point	of	view
Why a Graph Database?
11
What is a Graph?
12
Key Concepts
Graph database
NoSQL database
Managing highly connected data
and complex queries
Flexible data model
13
Key Concepts
Graph database
Declarative or imperative
language
Horizontal Scaling
Graph native storage and
process
14
Where graphdb can be useful?
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
15
Maybe in the future…
“Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
16
Frame the Problem
Data Model
Simple1
Expressive2
3 Additive
17
RDBMS
RDBMS vs Graph Data Model
Graph
18
Store & Get Data
Native graph storage
Store Data Fast Write Performance
Easy Data Integration:
CSV
Jdbc
REST Api
19
Store & Get Data
Native graph storage
Store Data Fast Write Performance
Easy Data Integration:
Get Data
Native graph processing à Index free adjacency
CSV
Jdbc
REST Api
Cypher, Declarative Language
Driver:
Python
py2neo (unofficial)
R (unofficial)
Java
APOC
20
Explore Data
Transform implicit to explicitCypher (access points, pattern)
21
Explore Data
Transform implicit to explicitCypher (access points, pattern)
22
Explore Data
Transform implicit to explicitCypher (access points, pattern)
23
Prepare Data
Feature Creation with parallel Graph algorithms
Centralities
•  Page Rank
•  Betweenness Centrality
•  Closeness Centrality
Graph Partitioning
•  Label Propagation
•  Connected Components
•  Strongly Connected
Components
Path Finding
•  Minimum Weight
Spanning Tree
•  All Pairs- and Single
Source Shortest Path
24
Prepare Data
Feature Creation with parallel Graph algorithms
Centralities
•  Page Rank
•  Betweenness Centrality
•  Closeness Centrality
Graph Partitioning
•  Label Propagation
•  Connected Components
•  Strongly Connected
Components
Path Finding
•  Minimum Weight
Spanning Tree
•  All Pairs- and Single
Source Shortest Path
Graph Size (GB) nodes (M) rels (M)
PageRank
(s)
ConCom
(s)
LabelPropag
(s)
StrongConCom
(s)
Pokec 7.3 2 31 10 24 12 12
DBPedia 15 11 117 46 91 51 65
Graphs500-23 7.9 5 129 19 29 18 25
Twitter-2010 49 42 1468 349 353 405 339
soc-LifeJournal1 6.3 5 69 30 34 25 23
Friendster 62 66 1806 611 619 296 483
Performance
25
Present&Launch your solution
Real time
Recommendation
Fraud Detection
Social Network Analysis
Search & Link Analysis
Knowledge Graph Natural Language
Process
Nunzio Pellegrino
S e n i o r D a t a S c i e n t i s t –
Innovation Team
nunzio.pellegrino@cerved.com
Stefano Gatti
Head of Innovation & Data
Sources
stefano.gatti@cerved.com

More Related Content

What's hot

GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to ProductionGraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
Neo4j
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavator
ingeangevaare
 
Demystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and AkkaDemystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and Akka
Knoldus Inc.
 
Turning Data into Insights and Intelligence
Turning Data into Insights and IntelligenceTurning Data into Insights and Intelligence
Turning Data into Insights and Intelligence
Sabine Kurjo McNeill
 
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
Eric Horesnyi
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Neo4j
 
Integration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and OntologiesIntegration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and Ontologies
Roberto García
 
Foundations of Data Teams
Foundations of Data TeamsFoundations of Data Teams
Foundations of Data Teams
Databricks
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing Methods
Kate Shao
 
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, CienaAutograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Neo4j
 
Sitech
SitechSitech
Sitech
BigDataExpo
 
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022 Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn
 
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
The Statistical and Applied Mathematical Sciences Institute
 
Coordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functionsCoordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functions
Marcos Rebelo
 
Session 4.3 semantic annotation for enhancing collaborative ideation
Session 4.3   semantic annotation for enhancing collaborative ideationSession 4.3   semantic annotation for enhancing collaborative ideation
Session 4.3 semantic annotation for enhancing collaborative ideation
semanticsconference
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
Avkash Chauhan
 
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
OW2
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
Joachim Neubert
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics Meetup
Open Analytics
 
Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn
 

What's hot (20)

GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to ProductionGraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
GraphTour - The Workshop - Device Tracking in Practice: From Idea to Production
 
07 verheul texcavator
07 verheul texcavator07 verheul texcavator
07 verheul texcavator
 
Demystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and AkkaDemystifying Big Data with Scala and Akka
Demystifying Big Data with Scala and Akka
 
Turning Data into Insights and Intelligence
Turning Data into Insights and IntelligenceTurning Data into Insights and Intelligence
Turning Data into Insights and Intelligence
 
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
2018 12-10 apidays.io eric horesnyi streamdata.io event-driven ap is
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
 
Integration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and OntologiesIntegration and Exploration of Financial Data using Semantics and Ontologies
Integration and Exploration of Financial Data using Semantics and Ontologies
 
Foundations of Data Teams
Foundations of Data TeamsFoundations of Data Teams
Foundations of Data Teams
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing Methods
 
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, CienaAutograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
Autograph - Natural Signatures for Graph Modelling, Simon Brueckheimer, Ciena
 
Sitech
SitechSitech
Sitech
 
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022 Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
Tracxn - Geo Monthly Report - Austria Tech - Mar 2022
 
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
2019 GDRR: Blockchain Data Analytics - Cryptocurrency and blockchain analysis...
 
Coordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functionsCoordinating external data importer services using AWS step functions
Coordinating external data importer services using AWS step functions
 
Session 4.3 semantic annotation for enhancing collaborative ideation
Session 4.3   semantic annotation for enhancing collaborative ideationSession 4.3   semantic annotation for enhancing collaborative ideation
Session 4.3 semantic annotation for enhancing collaborative ideation
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
 
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
Amsterdam developing public code for every city and everyone, Boris Van Hoyte...
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
 
MarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics MeetupMarkLogic - Open Analytics Meetup
MarkLogic - Open Analytics Meetup
 
Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021Tracxn - Monthly Report - Austria Tech - Oct 2021
Tracxn - Monthly Report - Austria Tech - Oct 2021
 

Similar to Cerved Datascience Milan

A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by Cerved
Data Science Milan
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
MDS ap
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
Neo4j
 
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
Amazon Web Services
 
Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities
Juan Sequeda
 
Talend introduction v1
Talend introduction v1Talend introduction v1
Talend introduction v1
Softnix Technology
 
Refactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics ProductsRefactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics Products
Luke Han
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Shirshanka Das
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
Yael Garten
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Yael Garten
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Shirshanka Das
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter Cailliau
Redis Labs
 
GraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business GraphGraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business Graph
Neo4j
 
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologiesSC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
BigData_Europe
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4j
Neo4j
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
Cédric Fauvet
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
LoQutus
 
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceNodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Deepak Chandramouli
 
It takes a village (to raise a ML model)
It takes a village (to raise a ML model)It takes a village (to raise a ML model)
It takes a village (to raise a ML model)
Anselmo Rodrigues da Silva
 

Similar to Cerved Datascience Milan (20)

A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by Cerved
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
AWS Partner Webcast - Analyze Big Data for Consumer Applications with Looker ...
 
Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities Integrating Semantic Web in the Real World: A Journey between Two Cities
Integrating Semantic Web in the Real World: A Journey between Two Cities
 
Talend introduction v1
Talend introduction v1Talend introduction v1
Talend introduction v1
 
Refactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics ProductsRefactoring your EDW with Mobile Analytics Products
Refactoring your EDW with Mobile Analytics Products
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
 
Architecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystemArchitecting for change: LinkedIn's new data ecosystem
Architecting for change: LinkedIn's new data ecosystem
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
 
RedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter CailliauRedisGraph A Low Latency Graph DB: Pieter Cailliau
RedisGraph A Low Latency Graph DB: Pieter Cailliau
 
GraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business GraphGraphTalks Rome - The Italian Business Graph
GraphTalks Rome - The Italian Business Graph
 
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologiesSC4 Workshop 1: Simon Scerri: Existing tools and technologies
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4j
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
 
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J ConferenceNodes2020 | Graph of enterprise_metadata | NEO4J Conference
Nodes2020 | Graph of enterprise_metadata | NEO4J Conference
 
It takes a village (to raise a ML model)
It takes a village (to raise a ML model)It takes a village (to raise a ML model)
It takes a village (to raise a ML model)
 

Recently uploaded

一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 

Recently uploaded (20)

一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 

Cerved Datascience Milan

  • 1. 25 Settembre 2017 With a datascience perspective Aview of graph data usage by Cerved Stefano Gatti – Head of Innovation and data sources Nunzio Pellegrino – Senior Data Scientist – Innovation team
  • 2. Cerved and its graphs in a nutshell
  • 3. 3 Cerved, in a nutshell The Italian data-driven company CREDIT INFORMATION Protection against credit risk MARKETING SOLUTIONS New business opportunities CREDIT MANAGEMENT Manage and collect performing and non-performing loans. Over 1000 a minute ü  Documents Over 40 million ü  Lines of code Over 30,000 ü  Customers Over 50 different ü  Data sources Over 10million a day ü  Api call Over 1,900 ü  People 377 million Eur (2016) ü  Revenue
  • 5. 5 Cerved, in a tech view Data Algorithms Solu0ons Towards algorithmic economy …
  • 6. 6 Cerved Graph Story 2011-12 - we started from an IT problem: reengineering of beneficial owner algorithm
  • 7. 7 Cerved Graph Story 2014-15 - we went through a more algorithmic problem: corporate linkages algorithm
  • 8. 8 Cerved Graph Story 2015-16 - we go with a “full stack” solution
  • 9. 9 Cerved Graph thoughts We strongly believe in … The power of linking data The power of analyzing data with network analysis The power of visualizing data in a different way To understand a li5le be5er the increasing complexity of modern world … also from an economic point of view
  • 10. Why a Graph Database?
  • 11. 11 What is a Graph?
  • 12. 12 Key Concepts Graph database NoSQL database Managing highly connected data and complex queries Flexible data model
  • 13. 13 Key Concepts Graph database Declarative or imperative language Horizontal Scaling Graph native storage and process
  • 14. 14 Where graphdb can be useful? “Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
  • 15. 15 Maybe in the future… “Hands-On Machine Learning with Scikit-Learn and TensorFlow” by Aurélien Géron
  • 16. 16 Frame the Problem Data Model Simple1 Expressive2 3 Additive
  • 17. 17 RDBMS RDBMS vs Graph Data Model Graph
  • 18. 18 Store & Get Data Native graph storage Store Data Fast Write Performance Easy Data Integration: CSV Jdbc REST Api
  • 19. 19 Store & Get Data Native graph storage Store Data Fast Write Performance Easy Data Integration: Get Data Native graph processing à Index free adjacency CSV Jdbc REST Api Cypher, Declarative Language Driver: Python py2neo (unofficial) R (unofficial) Java APOC
  • 20. 20 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 21. 21 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 22. 22 Explore Data Transform implicit to explicitCypher (access points, pattern)
  • 23. 23 Prepare Data Feature Creation with parallel Graph algorithms Centralities •  Page Rank •  Betweenness Centrality •  Closeness Centrality Graph Partitioning •  Label Propagation •  Connected Components •  Strongly Connected Components Path Finding •  Minimum Weight Spanning Tree •  All Pairs- and Single Source Shortest Path
  • 24. 24 Prepare Data Feature Creation with parallel Graph algorithms Centralities •  Page Rank •  Betweenness Centrality •  Closeness Centrality Graph Partitioning •  Label Propagation •  Connected Components •  Strongly Connected Components Path Finding •  Minimum Weight Spanning Tree •  All Pairs- and Single Source Shortest Path Graph Size (GB) nodes (M) rels (M) PageRank (s) ConCom (s) LabelPropag (s) StrongConCom (s) Pokec 7.3 2 31 10 24 12 12 DBPedia 15 11 117 46 91 51 65 Graphs500-23 7.9 5 129 19 29 18 25 Twitter-2010 49 42 1468 349 353 405 339 soc-LifeJournal1 6.3 5 69 30 34 25 23 Friendster 62 66 1806 611 619 296 483 Performance
  • 25. 25 Present&Launch your solution Real time Recommendation Fraud Detection Social Network Analysis Search & Link Analysis Knowledge Graph Natural Language Process
  • 26. Nunzio Pellegrino S e n i o r D a t a S c i e n t i s t – Innovation Team nunzio.pellegrino@cerved.com Stefano Gatti Head of Innovation & Data Sources stefano.gatti@cerved.com