SlideShare a Scribd company logo
1 of 24
Download to read offline
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Unleash the Power of Neo4j with GPT and Large
Language Models: Harmonizing Concepts from
Cancer Research Data
October 25, 2023
1
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Speakers
2
Mark A. Jensen, PhD is the Director of
Data Science in the Center for Technical
Operations Support at the Frederick
National Laboratory for Cancer Research
Robert Chang, Modeling & Analytics
Domain Leader at FI Consulting
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Cancer Research Data Commons (CRDC)
3
Cancer Data Aggregator
Aggregate by patient, sample, study, disease, tissue, etc.
Clinical Proteomics Imaging
Genomics Immuno-
oncology
Animal Models Cancer Biomarkers
Cancer Research
Data Commons
0100111
0
0100001
1
0100100
1
Cancer Data Hub
Enable submission to one
or multiple commons
In development:
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
CRDC is a Federation of Going Concerns
§ Each CRDC node has its own data systems, business processes, stakeholders,
and users
§ Each has its own purpose-built data model that enables data ingestion, query, and distribution.
§ Each has large, ongoing inflows and outflows of data today.
§ Some have large legacy datasets that contain inconsistencies and gaps
§ A top-down, prescriptive approach to standardization is not feasible.
§ Retrospective data cleanup is highly manual and very occasional.
§ CRDC nodes have their own sets of data submitters:
§ Larger submitters (such as well-funded consortia) have data wranglers and engineers to automate
data transformation and submission – once their study is “onboarded”
§ Smaller submitters, with more diverse data and fewer IT resources, are set to become a larger part of
the submitter base, because of new data sharing requirements for individual NIH grants.
4
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Common Idiosyncrasies in Submitted Data
5
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Example: "Species"
6
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Typical Study Onboarding Workflow
7
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
8
Solution Overview
§ Our solution uses Graph Technology (Neo4J) and Natural Language Processing (NLP)
techniques.
§ NLP is needed for the solution to go beyond simple string-based similarity, and Graph
enables the solution to be fast, efficient, and scalable.
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
9
Data Preparation – GPT for Synonym Generation
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
10
Data Preparation – GPT for Parsing Long Text
§ Generate keywords based on long text to facilitate querying.
§ GPT was used to generate keywords, other parsers only returned single words, GPT returned
phrases.
“We established a preclinical testing program that has created >300 genomically-characterized pediatric solid tumor
patient-derived xenograft (PDX) models between the pediatric oncology programs at Memorial Sloan Kettering Cancer
Center and University of California San Francisco. We propose to leverage this large portfolio of models across a
diversity of diseases, along with the deep expertise of the team, to establish an NCI Pediatric In Vivo Testing Program
(Ped-In Vivo-TP) Research Team focused on pediatric bone and soft tissue sarcomas, renal tumors, desmoplastic small
round cell tumor (DSRCT) and other rare pediatric solid tumors.”
Key words from GPT3.5 model: Preclinical testing program; Pediatric solid tumor; Patient-derived xenograft
(PDX); NCI Pediatric In Vivo Testing Program; Rare pediatric solid tumors
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
11
Vectorizing Text Data
Cosine
Similarity
BioBERT-
Base
OpenAI
Mean 0.9327 0.9269
Standard
Deviation 0.0364 0.0446
Embedding Models:
§ BioBERT-Base
§ OpenAI
Evaluation methods:
§ Visualization of Clusters
§ Dimensionality reduction by T-SNE
§ Cosine Similarity of top-5 similar node
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
12
Calculate Text Similarity
After generating dense vectors, we calculate
cosine similarity scores between each pair
of vectors to find matches.
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
§ Nodes from data's
structure:
§ Category
§ Header
§ Value
§ Synthetic nodes:
§ Synonyms of headers
§ Synonyms of values
§ Tokens
13
Adding Nodes to our Graph
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
§ Edges from data's
structure
§ Edges from synonyms
§ Synthetic edges
§ Fast Vector
Similarity Search
§ Cosine Similarity
14
Adding Edges to our Graph
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #1: Correcting Typos
15
Value on Dataset Correct Value on Dataset Cosine Similarity
subcarina subcarinal 96.504 %
irinitecan irinotecan 98.506 %
bronvhoscopy bronchoscopy 99.262 %
right paratrac right paratracheal 98.186 %
liposarcoma, well diferentiated liposarcoma, well differentiated 98.776 %
supraclav supraclavicular 99.574 %
pleiomorphic liposarcoma pleomorphic liposarcoma 99.917 %
cisplastin cisplatin 99.221 %
hospitialization hospitalization 98.369 %
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #1: Correcting Typos
Value “subcarina” from the ACRIN dataset may be a typo and could match value
“subcarinal” from the same dataset.
HEADER
VALUE
Anatomic Site - other
VALUE
16
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #1: Correcting Typos
Value “bronvhoscopy” from the ACRIN NSCLC dataset is a typo and could match value
“bronchoscopy” from the same dataset.
HEADER VALUE VALUE
procedures performed
for diagnostic workup
17
HEADER
sites of progression
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #2: Mapping to Standards
18
Value on Dataset Similar Value on Dataset Cosine Similarity
lower third of esophagus abdominal esophagus 99.994 %
abdominal fibromatosis desmoid fibromatosis 99.607 %
acute monoblastic leukemia acute monocytic leukemia 99.998 %
androblastoma sertoli leydig cell tumor 99.721 %
glial-neuronal neoplasm glioma 99.678 %
benign fibrocyst benign fibrosis 97.492 %
hemangioendothelioma malignant hemangiosarcoma 99.361 %
pinealoma pineocytoma 99.957 %
diagnosis dx 99.885 %
acute myelogenous leukemia acute myeloid leukemias 99.347 %
pemetrexed alimta 99.414 %
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #2: Mapping to Standards
Value “lower third of esophagus” from the CCDI dataset is similar to value “abdominal
esophagus” from the TCGA dataset.
HEADER (CCDI) VALUE HEADER (TCGA)
VALUE
19
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Use Case #2: Mapping to Standards
Value “acute monoblastic leukemia” from the CCDI dataset is similar to value
“acute monocytic leukemia” from the same dataset.
20
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
21
Neo4J Queries for Node Classification
Random Walk Procedure:
§ Utilizes paths created in the
graph by various methods
(cosine similarity, ontology,
SME) the random walk can
identify headers that are in
near proximity.
§ Run Neo4J Random
Walk procedure using Cypher
to generate random walk
paths.
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
22
Neo4J Queries for Node Classification cont.
Using the Neo4j Python Driver, execute the GDS (graph data science) procedure and count
header occurrences in a node's random walk.
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
23
GPT User Interface for Graph Management
©2023 FI Consulting. All rights reserved. ficonsulting.com
©2023 FI Consulting. All rights reserved. ficonsulting.com
Thank you!
https://www.slideshare.net/neo4j/government-graphsummit-and-then-there-were-15-standards

More Related Content

What's hot

Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsDenodo
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCarl Anderson
 
Data Management Strategies
Data Management StrategiesData Management Strategies
Data Management StrategiesMicheal Axelsen
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricCambridge Semantics
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)DATAVERSITY
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceDATAVERSITY
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
Neo4j y GenAI
Neo4j y GenAI Neo4j y GenAI
Neo4j y GenAI Neo4j
 
EA foundations (Views, Repository, Artifacts and Metamodel)
EA foundations (Views, Repository, Artifacts and Metamodel)EA foundations (Views, Repository, Artifacts and Metamodel)
EA foundations (Views, Repository, Artifacts and Metamodel)Mohamed Zakarya Abdelgawad
 
Power BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data SolutionsPower BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data SolutionsJames Serra
 
The Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsThe Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsNeo4j
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data worldCraig Milroy
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsData Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsDATAVERSITY
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 

What's hot (20)

Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural Components
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Data Management Strategies
Data Management StrategiesData Management Strategies
Data Management Strategies
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)
 
Data Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and GovernanceData Architecture - The Foundation for Enterprise Architecture and Governance
Data Architecture - The Foundation for Enterprise Architecture and Governance
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
Neo4j y GenAI
Neo4j y GenAI Neo4j y GenAI
Neo4j y GenAI
 
EA foundations (Views, Repository, Artifacts and Metamodel)
EA foundations (Views, Repository, Artifacts and Metamodel)EA foundations (Views, Repository, Artifacts and Metamodel)
EA foundations (Views, Repository, Artifacts and Metamodel)
 
Power BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data SolutionsPower BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data Solutions
 
The Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent ApplicationsThe Data Platform for Today’s Intelligent Applications
The Data Platform for Today’s Intelligent Applications
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Data strategy in a Big Data world
Data strategy in a Big Data worldData strategy in a Big Data world
Data strategy in a Big Data world
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsData Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and Roadmaps
 
Data Monetization Framework
Data Monetization FrameworkData Monetization Framework
Data Monetization Framework
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 

Similar to Unleash Power of Neo4j GPT Harmonize Cancer Data

Implications of the Fourth Paradigm
Implications of the Fourth ParadigmImplications of the Fourth Paradigm
Implications of the Fourth ParadigmPhilip Bourne
 
EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013Warren Kibbe
 
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021Clifford Thornton
 
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013Amazon Web Services
 
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...BCS Data Management Specialist Group
 
Advanced Medical Isotope Corp. Presentation
Advanced Medical Isotope Corp. PresentationAdvanced Medical Isotope Corp. Presentation
Advanced Medical Isotope Corp. PresentationRedChip Companies, Inc.
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingDenodo
 
CIS13: FCCX and IDESG: An Industry Perspectives
CIS13: FCCX and IDESG: An Industry PerspectivesCIS13: FCCX and IDESG: An Industry Perspectives
CIS13: FCCX and IDESG: An Industry PerspectivesCloudIDSummit
 
Enabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseEnabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseNeo4j
 
Capstone Project.pptx
Capstone Project.pptxCapstone Project.pptx
Capstone Project.pptxARESProject1
 
CRO Industry Overview
CRO Industry OverviewCRO Industry Overview
CRO Industry OverviewUsama Malik
 
Special Report: Getting the Optimal Return on X-ray Equipment
Special Report: Getting the Optimal Return on X-ray EquipmentSpecial Report: Getting the Optimal Return on X-ray Equipment
Special Report: Getting the Optimal Return on X-ray EquipmentCarestream
 
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...RobertGHunter
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science WorkshopWarren Kibbe
 

Similar to Unleash Power of Neo4j GPT Harmonize Cancer Data (20)

Trial io pcori doc v1
Trial io pcori doc v1Trial io pcori doc v1
Trial io pcori doc v1
 
Linked data in industry
Linked data in industryLinked data in industry
Linked data in industry
 
Implications of the Fourth Paradigm
Implications of the Fourth ParadigmImplications of the Fourth Paradigm
Implications of the Fourth Paradigm
 
EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013EBI Industry programme TCGA Warren KIbbe November 2013
EBI Industry programme TCGA Warren KIbbe November 2013
 
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021
Vascular devices llc_implanted_vessel_clearing_modules_&_system_02-21-2021
 
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
Life Technologies' Journey to the Cloud (ENT208) | AWS re:Invent 2013
 
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
 
Investor Present 04202017
Investor Present 04202017Investor Present 04202017
Investor Present 04202017
 
Investor Present 04202017
Investor Present  04202017Investor Present  04202017
Investor Present 04202017
 
Advanced Medical Isotope Corp. Presentation
Advanced Medical Isotope Corp. PresentationAdvanced Medical Isotope Corp. Presentation
Advanced Medical Isotope Corp. Presentation
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
CIS13: FCCX and IDESG: An Industry Perspectives
CIS13: FCCX and IDESG: An Industry PerspectivesCIS13: FCCX and IDESG: An Industry Perspectives
CIS13: FCCX and IDESG: An Industry Perspectives
 
Enabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseEnabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph Database
 
Capstone Project.pptx
Capstone Project.pptxCapstone Project.pptx
Capstone Project.pptx
 
CRO Industry Overview
CRO Industry OverviewCRO Industry Overview
CRO Industry Overview
 
Special Report: Getting the Optimal Return on X-ray Equipment
Special Report: Getting the Optimal Return on X-ray EquipmentSpecial Report: Getting the Optimal Return on X-ray Equipment
Special Report: Getting the Optimal Return on X-ray Equipment
 
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
Predictive in vitro & in silico Methods for Precision Medicine- Robert G. Hun...
 
MDC Connects: Make the Molecules that Matter
MDC Connects: Make the Molecules that MatterMDC Connects: Make the Molecules that Matter
MDC Connects: Make the Molecules that Matter
 
My blogs on collaboration
My blogs on collaborationMy blogs on collaboration
My blogs on collaboration
 
Data Commons & Data Science Workshop
Data Commons & Data Science WorkshopData Commons & Data Science Workshop
Data Commons & Data Science Workshop
 

More from Neo4j

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansNeo4j
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...Neo4j
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosNeo4j
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Neo4j
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Neo4j
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeNeo4j
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsNeo4j
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j
 

More from Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Recently uploaded

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 

Recently uploaded (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Unleash Power of Neo4j GPT Harmonize Cancer Data

  • 1. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Unleash the Power of Neo4j with GPT and Large Language Models: Harmonizing Concepts from Cancer Research Data October 25, 2023 1
  • 2. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Speakers 2 Mark A. Jensen, PhD is the Director of Data Science in the Center for Technical Operations Support at the Frederick National Laboratory for Cancer Research Robert Chang, Modeling & Analytics Domain Leader at FI Consulting
  • 3. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Cancer Research Data Commons (CRDC) 3 Cancer Data Aggregator Aggregate by patient, sample, study, disease, tissue, etc. Clinical Proteomics Imaging Genomics Immuno- oncology Animal Models Cancer Biomarkers Cancer Research Data Commons 0100111 0 0100001 1 0100100 1 Cancer Data Hub Enable submission to one or multiple commons In development:
  • 4. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com CRDC is a Federation of Going Concerns § Each CRDC node has its own data systems, business processes, stakeholders, and users § Each has its own purpose-built data model that enables data ingestion, query, and distribution. § Each has large, ongoing inflows and outflows of data today. § Some have large legacy datasets that contain inconsistencies and gaps § A top-down, prescriptive approach to standardization is not feasible. § Retrospective data cleanup is highly manual and very occasional. § CRDC nodes have their own sets of data submitters: § Larger submitters (such as well-funded consortia) have data wranglers and engineers to automate data transformation and submission – once their study is “onboarded” § Smaller submitters, with more diverse data and fewer IT resources, are set to become a larger part of the submitter base, because of new data sharing requirements for individual NIH grants. 4
  • 5. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Common Idiosyncrasies in Submitted Data 5
  • 6. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Example: "Species" 6
  • 7. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Typical Study Onboarding Workflow 7
  • 8. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 8 Solution Overview § Our solution uses Graph Technology (Neo4J) and Natural Language Processing (NLP) techniques. § NLP is needed for the solution to go beyond simple string-based similarity, and Graph enables the solution to be fast, efficient, and scalable.
  • 9. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 9 Data Preparation – GPT for Synonym Generation
  • 10. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 10 Data Preparation – GPT for Parsing Long Text § Generate keywords based on long text to facilitate querying. § GPT was used to generate keywords, other parsers only returned single words, GPT returned phrases. “We established a preclinical testing program that has created >300 genomically-characterized pediatric solid tumor patient-derived xenograft (PDX) models between the pediatric oncology programs at Memorial Sloan Kettering Cancer Center and University of California San Francisco. We propose to leverage this large portfolio of models across a diversity of diseases, along with the deep expertise of the team, to establish an NCI Pediatric In Vivo Testing Program (Ped-In Vivo-TP) Research Team focused on pediatric bone and soft tissue sarcomas, renal tumors, desmoplastic small round cell tumor (DSRCT) and other rare pediatric solid tumors.” Key words from GPT3.5 model: Preclinical testing program; Pediatric solid tumor; Patient-derived xenograft (PDX); NCI Pediatric In Vivo Testing Program; Rare pediatric solid tumors
  • 11. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 11 Vectorizing Text Data Cosine Similarity BioBERT- Base OpenAI Mean 0.9327 0.9269 Standard Deviation 0.0364 0.0446 Embedding Models: § BioBERT-Base § OpenAI Evaluation methods: § Visualization of Clusters § Dimensionality reduction by T-SNE § Cosine Similarity of top-5 similar node
  • 12. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 12 Calculate Text Similarity After generating dense vectors, we calculate cosine similarity scores between each pair of vectors to find matches.
  • 13. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com § Nodes from data's structure: § Category § Header § Value § Synthetic nodes: § Synonyms of headers § Synonyms of values § Tokens 13 Adding Nodes to our Graph
  • 14. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com § Edges from data's structure § Edges from synonyms § Synthetic edges § Fast Vector Similarity Search § Cosine Similarity 14 Adding Edges to our Graph
  • 15. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #1: Correcting Typos 15 Value on Dataset Correct Value on Dataset Cosine Similarity subcarina subcarinal 96.504 % irinitecan irinotecan 98.506 % bronvhoscopy bronchoscopy 99.262 % right paratrac right paratracheal 98.186 % liposarcoma, well diferentiated liposarcoma, well differentiated 98.776 % supraclav supraclavicular 99.574 % pleiomorphic liposarcoma pleomorphic liposarcoma 99.917 % cisplastin cisplatin 99.221 % hospitialization hospitalization 98.369 %
  • 16. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #1: Correcting Typos Value “subcarina” from the ACRIN dataset may be a typo and could match value “subcarinal” from the same dataset. HEADER VALUE Anatomic Site - other VALUE 16
  • 17. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #1: Correcting Typos Value “bronvhoscopy” from the ACRIN NSCLC dataset is a typo and could match value “bronchoscopy” from the same dataset. HEADER VALUE VALUE procedures performed for diagnostic workup 17 HEADER sites of progression
  • 18. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #2: Mapping to Standards 18 Value on Dataset Similar Value on Dataset Cosine Similarity lower third of esophagus abdominal esophagus 99.994 % abdominal fibromatosis desmoid fibromatosis 99.607 % acute monoblastic leukemia acute monocytic leukemia 99.998 % androblastoma sertoli leydig cell tumor 99.721 % glial-neuronal neoplasm glioma 99.678 % benign fibrocyst benign fibrosis 97.492 % hemangioendothelioma malignant hemangiosarcoma 99.361 % pinealoma pineocytoma 99.957 % diagnosis dx 99.885 % acute myelogenous leukemia acute myeloid leukemias 99.347 % pemetrexed alimta 99.414 %
  • 19. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #2: Mapping to Standards Value “lower third of esophagus” from the CCDI dataset is similar to value “abdominal esophagus” from the TCGA dataset. HEADER (CCDI) VALUE HEADER (TCGA) VALUE 19
  • 20. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Use Case #2: Mapping to Standards Value “acute monoblastic leukemia” from the CCDI dataset is similar to value “acute monocytic leukemia” from the same dataset. 20
  • 21. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 21 Neo4J Queries for Node Classification Random Walk Procedure: § Utilizes paths created in the graph by various methods (cosine similarity, ontology, SME) the random walk can identify headers that are in near proximity. § Run Neo4J Random Walk procedure using Cypher to generate random walk paths.
  • 22. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 22 Neo4J Queries for Node Classification cont. Using the Neo4j Python Driver, execute the GDS (graph data science) procedure and count header occurrences in a node's random walk.
  • 23. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com 23 GPT User Interface for Graph Management
  • 24. ©2023 FI Consulting. All rights reserved. ficonsulting.com ©2023 FI Consulting. All rights reserved. ficonsulting.com Thank you! https://www.slideshare.net/neo4j/government-graphsummit-and-then-there-were-15-standards