SlideShare a Scribd company logo
1 of 18
The (Swiss cheese) data conundrum:
Sourcing, curating & integrating data for impact
Introducing C.H.E.E.S.E.:
Company Hierarchy End-to-End Structure Extraction
Alicia Montoya, Head Research Commercialisation, Swiss Re Institute
Nariman Maddah, Senior Risk & Data Engineer, Swiss Re Institute
1
DSC Europe, 14th November, 2022
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude
Data is the lifeblood of the insurance industry. But getting quality data to the
right people at the right time in the right format remains a challenge.
Companies focus a lot on developing models, while data is
assumed to be accessible and complete. In reality, teams spend an
inordinate amount of time and money sourcing, curating,
integrating and processing data. Without that, the data
landscape looks more like a Swiss cheese, with different data gaps
across datasets.
Swiss Re Institute's Research Commercialization team combines
internal data (from across the group's business units and functions)
with a variety of 3rd party datasets to plug data gaps and develop
end-to-end risk views for impactful risk analytics and products.
Introducing: "Company Hierarchy End-to-End Structure Extraction
= C.H.E.E.S.E." developed to power business impact through
quality data.
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude
Value Creation Measure and track financial
impact
Creating new revenue and/or improving performance across the insurance data and
analytics value chain
3
Data Sources
Third-party data
purchased
In-house data form BUs
across organization
Public data and maps
Internal expertise
Data analytics
Machine learning
Model development
Commercial valuation
Testing and validation
Deployment
Pricing
Products & Solutions
Data
ingestion and
curation
Business management
Fee-based services
Costs
Profit
Brand value
Partnership
Data
acquisition
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude
Extended set of data seamlessly accessible to many BUs 0%
Proactive and efficient access across the firm 11%
Core set of data consistently accessible to build models 11%
Some data is accessible to start building PoCs 51%
Access to data is very difficult and a barrier to AI 11%
Not aware 16%
But … data maturity of insurers is not yet up to the game
Actively evolving data cleansing and consolidation, with automated tools 5%
Standard data cleansing and consolidation pipeline, with tools 16%
Begun to standardise data cleansing and consolidation across the firm 11%
Some cleansing and consolidation for specific use cases 43%
We don't know if our data is ready to use 24%
Source: The Five Dimension of Enterprise AI, Element AI, May 2020, Insurance respondents only
4
Are you able to
access all the
data you need
for AI?
Is accessible
data cleaned
and
consolidated
for the use
with AI?
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 5
Achieving accessible, clean, consolidated data to use in AI is tricky
Integration
Datasets require
extensive integration
efforts to adapt them
for a specific usage
and consolidate
them.
Sourcing
Getting access to
quality relevant data
requires time-
consuming
governance, legal
and purchasing
processes
Reconciliation
Larger organisations
present complex data
set ups including legacy
systems and multi-
cloud storage, making it
hard to access relevant
data.
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 5
Unique company identifiers
are the missing key to
match and consolidate datasets
6
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude
Business objective:
Data sourced from different providers
have different company identifiers
e.g., DUNS, ORBIS, LEI, CUSIP.
Reconciliation of these datasets is
crucial to unlock each dataset’s
potential.
In addition, Swiss Re has a variety of
corporate client data that need to be
integrated with external datasets to
fill data gaps for different business
objectives, for example:
• Improving data quality
• Improving address geocoding
• Building supply chain risk models
• Modelling credit risk
• Quantifying portfolio sustainability
7
Problem statement:
Company x as an entity does not have a unique identifier across datasets. For example,
Swiss Re Ltd. and Swiss Reinsurance Company Limited cannot be string-matched.
Furthermore, the same logical (legal) entity can appear multiple times in a single dataset
because of similarities in an entity’s name and address, or even because of data entry errors.
Typical approaches such as fuzzy matching using a bag of words are not effective solutions.
For example, Swiss Re Company Ltd. and Munich Re Company Ltd. match 3 out of 4 words,
which is more than Swiss Re Ltd. and Swiss Reinsurance Company Limited. We need to use
as many features as possible to calculate the similarity between entities.
Required
features
Data gap
Datasets
Objectives: Matching companies by address, location, name etc.
Reference: image adapted from Wikipedia, Shared under the Creative Commons Attribution-Share Alike 4.0 International license.
https://en.wikipedia.org/wiki/Swiss_cheese_model#/media/File:Swiss_cheese_model.svg, 4 July 2020 CC BY-SA 4.0 (edited)
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 8
Solution:
• Build a similarity function between entities, that uses as many features as possible (company name, address, industry
codes), thus outperforming simple lexical similarity.
• Once we have such a function, we can perform cross-dataset entity clustering, aiming to:
– Disambiguate: Matching of entities across datasets
– Detect duplicates: Curation of datasets
The suggested process:
1. Word stemming: e.g. {LTD., Limited, Ltd etc.} -> {limited}
2. Normalisation: Address normalisation -> address structuring -> geo-localisation
3. Feature engineering: Prepare features for the similarity function (e.g., term frequency–inverse document frequency)
4. Bucketing: Slice datasets geographically (e.g., per country) to contain computational cost and improve precision
5. Clustering: Match companies by their extracted features within each bucket using similarity / closeness measures.
The C.H.E.E.S.E. solution
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 9
C.H.E.E.S.E.: Demonstration of matching process
Normalisation Features
Stemming Bucketing Clustering
DATASET 1
Fantasy Island Operations Ltd Nottingham NG5 7EA 57 Front Street Nottingham, Nottinghamshire NG5 7EA GB
Fantasy Island Retail Ltd. NG5 7EA 57 Front Street Arnold Nottingham NG5 7EA
Mellors Group Fantasy Island Holdings Ltd. NG5 7EA 57 Front Street Arnold, Nottingham NG5 7EA GB
Fantasy Island Resort FL 32118 3205 S Atlantic Ave Daytona Beach Shores US
DATASET 2
Fantasy Island Operations Limited Ng5 7-ea 57 Front Street Nottingham
Fantasy Island Leisure Limited NG5 7EA 57 Front Street Arnold GB
Mellors Group Fantasy Island Limited Ng5 7-ea 57 Front St. Nottinghamshire GB
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 10
C.H.E.E.S.E.: Demonstration of matching process: word stemming
Normalisation Features
Stemming Bucketing Clustering
DATASET 1
Fantasy Island Operations ltd Nottingham NG5 7EA 57 Front Street Nottingham, Nottinghamshire NG5 7EA GB
Fantasy Island Retail ltd NG5 7EA 57 Front Street Arnold Nottingham NG5 7EA
Mellors Group Fantasy Island Holdings ltd NG5 7EA 57 Front Street Arnold, Nottingham NG5 7EA GB
Fantasy Island Resort FL 32118 3205 S Atlantic Ave Daytona Beach Shores US
DATASET 2
Fantasy Island Operations ltd Ng5 7-ea 57 Front Street Nottingham
Fantasy Island Leisure ltd NG5 7EA 57 Front Street Arnold GB
Mellors Group Fantasy Island ltd Ng5 7-ea 57 Front St. Nottinghamshire GB
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 11
C.H.E.E.S.E.: Demonstration of matching process: address normalisation
Normalisation Features Bucketing Clustering
DATASET 1
Fantasy Island Operations ltd Nottingham {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Fantasy Island Retail ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Mellors Group Fantasy Island Holdings ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Fantasy Island Resort {“postcode”: “FL32118 ”, “rd”: “S Atlantic Avenue”, “n”: 3205, “country”: “US”…}
DATASET 2
Fantasy Island Operations ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Fantasy Island Leisure ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Mellors Group Fantasy Island Holdings ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…}
Stemming
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 12
C.H.E.E.S.E.: Demonstration of matching process: feature extraction
Normalisation Features Bucketing Clustering
DATASET 1
(20,[1, 2, 3, 5, 10],[0.91, 0.8, 0.2,0.01,0.11]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
(20,[1, 2, 7, 5], [0.91, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
(20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
(20,[1, 2, 15], [0.91, 0.82, 0.23]) {“lat/long”: (29.160155, -80.9736214} {“iso”: “US”}
DATASET 2
(20,[1, 2, 3, 5], [0.91, 0.82, 0.2, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
(20,[1, 2, 7, 5], [0.91, 0.82, 0.54, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
(20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
Stemming
TFIDF Geo-localisation
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 13
C.H.E.E.S.E.: Demonstration of matching process: bucketing
Normalisation Features Bucketing Clustering
Stemming
Bucket 1
DS1 (20,[1, 2, 3, 5, 10],[0.91, 0.8, 0.2,0.01,0.11]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
DS1 (20,[1, 2, 7, 5], [0.91, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
DS1 (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
DS2 (20,[1, 2, 3, 5], [0.91, 0.82, 0.2, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
DS2 (20,[1, 2, 7, 5], [0.91, 0.82, 0.54, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
DS2 (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”}
Bucket 2
DS1 (20,[1, 2, 15], [0.91, 0.82, 0.23]) {“lat/long”: (29.160155, -80.9736214)} {“iso”: “US”}
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 14
C.H.E.E.S.E.: Demonstration of matching process – clustering
Normalisation Features Bucketing Clustering
Stemming
Bucket 1
DS1 Fantasy Island Operations ltd Nottingham Vector 1
DS1 Fantasy Island Retail ltd Vector 2
DS1 Mellors Group Fantasy Island Holdings ltd Vector 3
DS2 Fantasy Island Operations Limited Vector 4
DS2 Fantasy Island Leisure Limited Vector 5
DS2 Mellors Group Fantasy Island Holdings Vector 6
Vector 1
Vector 4
X1
X2
q
TFIDF cosine similarity Clustering (e.g., DBSCAN)
d = f(cosine, geo-distance, etc.)
d
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 15
The most common challenges in end-to-end
data analytics and machine learning are data
sourcing, reconciliation, and integration – not
data modelling.
Corporate companies aim to fill data gaps
(Swiss cheese model* example) by sourcing
data from different providers. However, a lack
of a common identifiers between datasets
prevent them from extracting the value that
each dataset can offer.
Using Company Hierarchy End-2-End Structure
Extraction (C.H.E.E.S.E.) as an example, we
demonstrated how a data science toolbox can
enable us to improve company matching.
Summary
* Swiss cheese model adapted from https://en.wikipedia.org/wiki/Swiss_cheese_model
Shared under the Creative Commons Attribution-Share Alike 4.0 Int license
Alpkäserei Flumserberg
Thank you!
Questions and comments
are welcome
16
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 17
N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 18
Legal notice
©2022 Swiss Re. All rights reserved. You may use this presentation for private or internal purposes but note
that any copyright or other proprietary notices must not be removed. You are not permitted to create any
modifications or derivative works of this presentation, or to use it for commercial or other public purposes,
without the prior written permission of Swiss Re.
The information and opinions contained in the presentation are provided as at the date of the presentation
and may change. Although the information used was taken from reliable sources, Swiss Re does not accept
any responsibility for its accuracy or comprehensiveness or its updating. All liability for the accuracy and
completeness of the information or for any damage or loss resulting from its use is expressly excluded.

More Related Content

Similar to [DSC Europe 22] The (Swiss cheese) data conundrum: Sourcing, curating and integrating data for impact - Alicia Montoya & Nariman Maddah

Keynote Presentation at GraphTalk Oslo 2023
Keynote Presentation at GraphTalk Oslo 2023Keynote Presentation at GraphTalk Oslo 2023
Keynote Presentation at GraphTalk Oslo 2023Neo4j
 
Why Data Virtualization Matters in Your Portfolio
Why Data Virtualization Matters in Your PortfolioWhy Data Virtualization Matters in Your Portfolio
Why Data Virtualization Matters in Your PortfolioDenodo
 
The 05 Best Backup Solution Providers to Watch in 2022.pdf
The 05 Best Backup Solution Providers to Watch in 2022.pdfThe 05 Best Backup Solution Providers to Watch in 2022.pdf
The 05 Best Backup Solution Providers to Watch in 2022.pdfInsightsSuccess4
 
How can Insurers Accelerate Digital Transformation with Data Virtualization (...
How can Insurers Accelerate Digital Transformation with Data Virtualization (...How can Insurers Accelerate Digital Transformation with Data Virtualization (...
How can Insurers Accelerate Digital Transformation with Data Virtualization (...Denodo
 
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...Jim Merrifield, IGP, CIP
 
The Knowledge Graph Explosion
The Knowledge Graph ExplosionThe Knowledge Graph Explosion
The Knowledge Graph ExplosionNeo4j
 
Novum insights client deck november 2016
Novum insights client deck november 2016Novum insights client deck november 2016
Novum insights client deck november 2016Bokyung Park
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Cambridge Semantics
 
MMV Webinar 3. Cybersecurity Perspectives. March 2018
MMV Webinar 3. Cybersecurity Perspectives. March 2018MMV Webinar 3. Cybersecurity Perspectives. March 2018
MMV Webinar 3. Cybersecurity Perspectives. March 2018Match-Maker Ventures
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Gdpr action plan - ISSA
Gdpr action plan - ISSAGdpr action plan - ISSA
Gdpr action plan - ISSAUlf Mattsson
 
western digital annual96
western digital  annual96western digital  annual96
western digital annual96finance37
 
EP Info Data Mgement 3-4 Feb 2015
EP Info  Data Mgement 3-4 Feb 2015EP Info  Data Mgement 3-4 Feb 2015
EP Info Data Mgement 3-4 Feb 2015Andy Moore
 
Discover the business value of Open Data by Majken Sander
Discover the business value of Open Data by Majken SanderDiscover the business value of Open Data by Majken Sander
Discover the business value of Open Data by Majken SanderMajken Sander
 
Presentation for tech job
Presentation for tech jobPresentation for tech job
Presentation for tech jobTechMeetups
 
Data insight presentation for tech job
Data insight presentation for tech jobData insight presentation for tech job
Data insight presentation for tech jobTechMeetups
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online caniceconsulting
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)Prof. Dr. Diego Kuonen
 
ISPAB Presentation - The Commerce Data Service
ISPAB Presentation - The Commerce Data ServiceISPAB Presentation - The Commerce Data Service
ISPAB Presentation - The Commerce Data ServiceTyrone Grandison
 

Similar to [DSC Europe 22] The (Swiss cheese) data conundrum: Sourcing, curating and integrating data for impact - Alicia Montoya & Nariman Maddah (20)

Keynote Presentation at GraphTalk Oslo 2023
Keynote Presentation at GraphTalk Oslo 2023Keynote Presentation at GraphTalk Oslo 2023
Keynote Presentation at GraphTalk Oslo 2023
 
Why Data Virtualization Matters in Your Portfolio
Why Data Virtualization Matters in Your PortfolioWhy Data Virtualization Matters in Your Portfolio
Why Data Virtualization Matters in Your Portfolio
 
The 05 Best Backup Solution Providers to Watch in 2022.pdf
The 05 Best Backup Solution Providers to Watch in 2022.pdfThe 05 Best Backup Solution Providers to Watch in 2022.pdf
The 05 Best Backup Solution Providers to Watch in 2022.pdf
 
How can Insurers Accelerate Digital Transformation with Data Virtualization (...
How can Insurers Accelerate Digital Transformation with Data Virtualization (...How can Insurers Accelerate Digital Transformation with Data Virtualization (...
How can Insurers Accelerate Digital Transformation with Data Virtualization (...
 
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...
Become an Information Governance Superhero in 2015 - Lunch Keynote ARMA Houst...
 
The Knowledge Graph Explosion
The Knowledge Graph ExplosionThe Knowledge Graph Explosion
The Knowledge Graph Explosion
 
Novum insights client deck november 2016
Novum insights client deck november 2016Novum insights client deck november 2016
Novum insights client deck november 2016
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
MMV Webinar 3. Cybersecurity Perspectives. March 2018
MMV Webinar 3. Cybersecurity Perspectives. March 2018MMV Webinar 3. Cybersecurity Perspectives. March 2018
MMV Webinar 3. Cybersecurity Perspectives. March 2018
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Gdpr action plan - ISSA
Gdpr action plan - ISSAGdpr action plan - ISSA
Gdpr action plan - ISSA
 
western digital annual96
western digital  annual96western digital  annual96
western digital annual96
 
EP Info Data Mgement 3-4 Feb 2015
EP Info  Data Mgement 3-4 Feb 2015EP Info  Data Mgement 3-4 Feb 2015
EP Info Data Mgement 3-4 Feb 2015
 
Discover the business value of Open Data by Majken Sander
Discover the business value of Open Data by Majken SanderDiscover the business value of Open Data by Majken Sander
Discover the business value of Open Data by Majken Sander
 
Presentation for tech job
Presentation for tech jobPresentation for tech job
Presentation for tech job
 
Data insight presentation for tech job
Data insight presentation for tech jobData insight presentation for tech job
Data insight presentation for tech job
 
Data science hypes and reality
Data science hypes and realityData science hypes and reality
Data science hypes and reality
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)
 
ISPAB Presentation - The Commerce Data Service
ISPAB Presentation - The Commerce Data ServiceISPAB Presentation - The Commerce Data Service
ISPAB Presentation - The Commerce Data Service
 

More from DataScienceConferenc1

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdfDataScienceConferenc1
 
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...DataScienceConferenc1
 
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdfDataScienceConferenc1
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdfDataScienceConferenc1
 
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdfDataScienceConferenc1
 
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptxDataScienceConferenc1
 
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdfDataScienceConferenc1
 
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...DataScienceConferenc1
 
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdfDataScienceConferenc1
 
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...DataScienceConferenc1
 
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...DataScienceConferenc1
 
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdfDataScienceConferenc1
 
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptxDataScienceConferenc1
 
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...DataScienceConferenc1
 
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptxDataScienceConferenc1
 
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...DataScienceConferenc1
 
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...DataScienceConferenc1
 
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptxDataScienceConferenc1
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptxDataScienceConferenc1
 
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdfDataScienceConferenc1
 

More from DataScienceConferenc1 (20)

[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
[DSC MENA 24] Mostafa_Essa_-_Ai_and_cloud.pdf
 
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
[DSC MENA 24] Yasser_El_Bendary - How NLP & LLMs model can excel in comprehen...
 
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
[DSC MENA 24] Medhat_Kandil - Empowering Egypt's AI & Biotechnology Scenes.pdf
 
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
[DSC MENA 24] Youssef_Kamal - Data governance and quality.pdf
 
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
[DSC MENA 24] Abdelrahman_Ghallab_-_Data_Product_mgmt.pdf
 
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
[DSC MENA 24] Asmaa_Eltaher_-_Innovation_Beyond_Brainstorming.pptx
 
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
[DSC MENA 24] Muhammad_Ezzat_-_Sustianable_Growth_Empowerment.pdf
 
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
[DSC MENA 24] Basma_Rady_-_Building_a_Data_Driven_Culture_in_Your_Organizatio...
 
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
[DSC MENA 24] Ahmed_Muselhy_-_Unveiling-the-Secrets-of-AI-in-Hiring.pdf
 
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
[DSC MENA 24] Ziad_Diab_-_Data-Driven_Disruption_-_The_Role_of_Data_Strategy_...
 
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
[DSC MENA 24] Mohammad_Essam_- Leveraging Scene Graphs for Generative AI and ...
 
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
[DSC MENA 24] Ahmed_Fahmy - Navigating the Future.pdf
 
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
[DSC MENA 24] Hany_Saad_Gheit_-_Azure_OpenAI_service.pptx
 
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
[DSC MENA 24] Nezar_El_Kady_-_From_Turing_to_Transformers__Navigating_the_AI_...
 
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
[DSC MENA 24] Amira_Abdelaziz_-_AI_in_Financial_Services.pptx
 
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
[DSC MENA 24] Omar_Ossama - My Journey from the Field of Oil & Gas, to the Ex...
 
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
[DSC MENA 24] Ramy_Agieb_-_Advancements_in_Artificial_Intelligence_for_Cybers...
 
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
[DSC MENA 24] Sohaila_Diab_-_Lets_Talk_Gen_AI_Presentation.pptx
 
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
[DSC MENA 24] Amal_Elgammal_-_QUALITOP_presentation.pptx
 
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
[DSC MENA 24] Abdelrahman_Sleem_-_AI_For_Marketing_DSC.pdf
 

Recently uploaded

The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxVivek487417
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.pptibrahimabdi22
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制vexqp
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxchadhar227
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制vexqp
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........EfruzAsilolu
 

Recently uploaded (20)

The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 

[DSC Europe 22] The (Swiss cheese) data conundrum: Sourcing, curating and integrating data for impact - Alicia Montoya & Nariman Maddah

  • 1. The (Swiss cheese) data conundrum: Sourcing, curating & integrating data for impact Introducing C.H.E.E.S.E.: Company Hierarchy End-to-End Structure Extraction Alicia Montoya, Head Research Commercialisation, Swiss Re Institute Nariman Maddah, Senior Risk & Data Engineer, Swiss Re Institute 1 DSC Europe, 14th November, 2022
  • 2. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude Data is the lifeblood of the insurance industry. But getting quality data to the right people at the right time in the right format remains a challenge. Companies focus a lot on developing models, while data is assumed to be accessible and complete. In reality, teams spend an inordinate amount of time and money sourcing, curating, integrating and processing data. Without that, the data landscape looks more like a Swiss cheese, with different data gaps across datasets. Swiss Re Institute's Research Commercialization team combines internal data (from across the group's business units and functions) with a variety of 3rd party datasets to plug data gaps and develop end-to-end risk views for impactful risk analytics and products. Introducing: "Company Hierarchy End-to-End Structure Extraction = C.H.E.E.S.E." developed to power business impact through quality data.
  • 3. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude Value Creation Measure and track financial impact Creating new revenue and/or improving performance across the insurance data and analytics value chain 3 Data Sources Third-party data purchased In-house data form BUs across organization Public data and maps Internal expertise Data analytics Machine learning Model development Commercial valuation Testing and validation Deployment Pricing Products & Solutions Data ingestion and curation Business management Fee-based services Costs Profit Brand value Partnership Data acquisition
  • 4. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude Extended set of data seamlessly accessible to many BUs 0% Proactive and efficient access across the firm 11% Core set of data consistently accessible to build models 11% Some data is accessible to start building PoCs 51% Access to data is very difficult and a barrier to AI 11% Not aware 16% But … data maturity of insurers is not yet up to the game Actively evolving data cleansing and consolidation, with automated tools 5% Standard data cleansing and consolidation pipeline, with tools 16% Begun to standardise data cleansing and consolidation across the firm 11% Some cleansing and consolidation for specific use cases 43% We don't know if our data is ready to use 24% Source: The Five Dimension of Enterprise AI, Element AI, May 2020, Insurance respondents only 4 Are you able to access all the data you need for AI? Is accessible data cleaned and consolidated for the use with AI?
  • 5. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 5 Achieving accessible, clean, consolidated data to use in AI is tricky Integration Datasets require extensive integration efforts to adapt them for a specific usage and consolidate them. Sourcing Getting access to quality relevant data requires time- consuming governance, legal and purchasing processes Reconciliation Larger organisations present complex data set ups including legacy systems and multi- cloud storage, making it hard to access relevant data. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 5
  • 6. Unique company identifiers are the missing key to match and consolidate datasets 6
  • 7. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude Business objective: Data sourced from different providers have different company identifiers e.g., DUNS, ORBIS, LEI, CUSIP. Reconciliation of these datasets is crucial to unlock each dataset’s potential. In addition, Swiss Re has a variety of corporate client data that need to be integrated with external datasets to fill data gaps for different business objectives, for example: • Improving data quality • Improving address geocoding • Building supply chain risk models • Modelling credit risk • Quantifying portfolio sustainability 7 Problem statement: Company x as an entity does not have a unique identifier across datasets. For example, Swiss Re Ltd. and Swiss Reinsurance Company Limited cannot be string-matched. Furthermore, the same logical (legal) entity can appear multiple times in a single dataset because of similarities in an entity’s name and address, or even because of data entry errors. Typical approaches such as fuzzy matching using a bag of words are not effective solutions. For example, Swiss Re Company Ltd. and Munich Re Company Ltd. match 3 out of 4 words, which is more than Swiss Re Ltd. and Swiss Reinsurance Company Limited. We need to use as many features as possible to calculate the similarity between entities. Required features Data gap Datasets Objectives: Matching companies by address, location, name etc. Reference: image adapted from Wikipedia, Shared under the Creative Commons Attribution-Share Alike 4.0 International license. https://en.wikipedia.org/wiki/Swiss_cheese_model#/media/File:Swiss_cheese_model.svg, 4 July 2020 CC BY-SA 4.0 (edited)
  • 8. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 8 Solution: • Build a similarity function between entities, that uses as many features as possible (company name, address, industry codes), thus outperforming simple lexical similarity. • Once we have such a function, we can perform cross-dataset entity clustering, aiming to: – Disambiguate: Matching of entities across datasets – Detect duplicates: Curation of datasets The suggested process: 1. Word stemming: e.g. {LTD., Limited, Ltd etc.} -> {limited} 2. Normalisation: Address normalisation -> address structuring -> geo-localisation 3. Feature engineering: Prepare features for the similarity function (e.g., term frequency–inverse document frequency) 4. Bucketing: Slice datasets geographically (e.g., per country) to contain computational cost and improve precision 5. Clustering: Match companies by their extracted features within each bucket using similarity / closeness measures. The C.H.E.E.S.E. solution
  • 9. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 9 C.H.E.E.S.E.: Demonstration of matching process Normalisation Features Stemming Bucketing Clustering DATASET 1 Fantasy Island Operations Ltd Nottingham NG5 7EA 57 Front Street Nottingham, Nottinghamshire NG5 7EA GB Fantasy Island Retail Ltd. NG5 7EA 57 Front Street Arnold Nottingham NG5 7EA Mellors Group Fantasy Island Holdings Ltd. NG5 7EA 57 Front Street Arnold, Nottingham NG5 7EA GB Fantasy Island Resort FL 32118 3205 S Atlantic Ave Daytona Beach Shores US DATASET 2 Fantasy Island Operations Limited Ng5 7-ea 57 Front Street Nottingham Fantasy Island Leisure Limited NG5 7EA 57 Front Street Arnold GB Mellors Group Fantasy Island Limited Ng5 7-ea 57 Front St. Nottinghamshire GB
  • 10. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 10 C.H.E.E.S.E.: Demonstration of matching process: word stemming Normalisation Features Stemming Bucketing Clustering DATASET 1 Fantasy Island Operations ltd Nottingham NG5 7EA 57 Front Street Nottingham, Nottinghamshire NG5 7EA GB Fantasy Island Retail ltd NG5 7EA 57 Front Street Arnold Nottingham NG5 7EA Mellors Group Fantasy Island Holdings ltd NG5 7EA 57 Front Street Arnold, Nottingham NG5 7EA GB Fantasy Island Resort FL 32118 3205 S Atlantic Ave Daytona Beach Shores US DATASET 2 Fantasy Island Operations ltd Ng5 7-ea 57 Front Street Nottingham Fantasy Island Leisure ltd NG5 7EA 57 Front Street Arnold GB Mellors Group Fantasy Island ltd Ng5 7-ea 57 Front St. Nottinghamshire GB
  • 11. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 11 C.H.E.E.S.E.: Demonstration of matching process: address normalisation Normalisation Features Bucketing Clustering DATASET 1 Fantasy Island Operations ltd Nottingham {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Fantasy Island Retail ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Mellors Group Fantasy Island Holdings ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Fantasy Island Resort {“postcode”: “FL32118 ”, “rd”: “S Atlantic Avenue”, “n”: 3205, “country”: “US”…} DATASET 2 Fantasy Island Operations ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Fantasy Island Leisure ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Mellors Group Fantasy Island Holdings ltd {“postcode”: “NG57EA”, “rd”: “Front Street”, “n”: 57, “country”: “GB”…} Stemming
  • 12. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 12 C.H.E.E.S.E.: Demonstration of matching process: feature extraction Normalisation Features Bucketing Clustering DATASET 1 (20,[1, 2, 3, 5, 10],[0.91, 0.8, 0.2,0.01,0.11]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} (20,[1, 2, 7, 5], [0.91, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} (20,[1, 2, 15], [0.91, 0.82, 0.23]) {“lat/long”: (29.160155, -80.9736214} {“iso”: “US”} DATASET 2 (20,[1, 2, 3, 5], [0.91, 0.82, 0.2, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} (20,[1, 2, 7, 5], [0.91, 0.82, 0.54, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} Stemming TFIDF Geo-localisation
  • 13. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 13 C.H.E.E.S.E.: Demonstration of matching process: bucketing Normalisation Features Bucketing Clustering Stemming Bucket 1 DS1 (20,[1, 2, 3, 5, 10],[0.91, 0.8, 0.2,0.01,0.11]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} DS1 (20,[1, 2, 7, 5], [0.91, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} DS1 (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} DS2 (20,[1, 2, 3, 5], [0.91, 0.82, 0.2, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} DS2 (20,[1, 2, 7, 5], [0.91, 0.82, 0.54, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} DS2 (20,[4, 5, 7, 20], [0.01, 0.82, 0.34, 0.01]) {“lat/long”: (53.002632, -1.1283949)} {“iso”: “GB”} Bucket 2 DS1 (20,[1, 2, 15], [0.91, 0.82, 0.23]) {“lat/long”: (29.160155, -80.9736214)} {“iso”: “US”}
  • 14. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 14 C.H.E.E.S.E.: Demonstration of matching process – clustering Normalisation Features Bucketing Clustering Stemming Bucket 1 DS1 Fantasy Island Operations ltd Nottingham Vector 1 DS1 Fantasy Island Retail ltd Vector 2 DS1 Mellors Group Fantasy Island Holdings ltd Vector 3 DS2 Fantasy Island Operations Limited Vector 4 DS2 Fantasy Island Leisure Limited Vector 5 DS2 Mellors Group Fantasy Island Holdings Vector 6 Vector 1 Vector 4 X1 X2 q TFIDF cosine similarity Clustering (e.g., DBSCAN) d = f(cosine, geo-distance, etc.) d
  • 15. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 15 The most common challenges in end-to-end data analytics and machine learning are data sourcing, reconciliation, and integration – not data modelling. Corporate companies aim to fill data gaps (Swiss cheese model* example) by sourcing data from different providers. However, a lack of a common identifiers between datasets prevent them from extracting the value that each dataset can offer. Using Company Hierarchy End-2-End Structure Extraction (C.H.E.E.S.E.) as an example, we demonstrated how a data science toolbox can enable us to improve company matching. Summary * Swiss cheese model adapted from https://en.wikipedia.org/wiki/Swiss_cheese_model Shared under the Creative Commons Attribution-Share Alike 4.0 Int license Alpkäserei Flumserberg
  • 16. Thank you! Questions and comments are welcome 16
  • 17. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 17
  • 18. N. Maddah, A. Montoya | November 2022 | Swiss Re Institude 18 Legal notice ©2022 Swiss Re. All rights reserved. You may use this presentation for private or internal purposes but note that any copyright or other proprietary notices must not be removed. You are not permitted to create any modifications or derivative works of this presentation, or to use it for commercial or other public purposes, without the prior written permission of Swiss Re. The information and opinions contained in the presentation are provided as at the date of the presentation and may change. Although the information used was taken from reliable sources, Swiss Re does not accept any responsibility for its accuracy or comprehensiveness or its updating. All liability for the accuracy and completeness of the information or for any damage or loss resulting from its use is expressly excluded.