SlideShare a Scribd company logo
DEPARTMENT OF HEALTH AND HUMAN SERVICES • National Institutes of Health • National Cancer Institute
Frederick National Laboratory is a Federally Funded Research and Development Center operated by Leidos Biomedical Research, Inc., for the National Cancer Institute
Neo4j GovConnect:
The Bento Framework for Building Data Commons
Biomedical Informatics and Data Science (BIDS) Directorate
Frederick National Laboratory for Cancer Research (FNLCR)
Todd Pihl, PhD, PMP
Ming Ying, MS
Mark A. Jensen, PhD, PMP
Frederick National Laboratory for Cancer Research
• Why was Bento Framework built?
• What is the Bento Framework?
• How does it work?
• Where does Neo4j fit in?
• What’s next?
The NCI Cancer Research Data Commons Ecosystem
https://datascience.cancer.gov/data-commons
Frederick National Laboratory for Cancer Research
Frederick National Laboratory for Cancer Research
Integrated Canine Data Commons
• Not data type centric
Clinical Trials Data Commons
• Genomic data bias
Frederick National Laboratory for Cancer Research
o Bento can serve as a framework
for CRDC platforms
o Each platform can customize the
core framework
o Bento provides modular,
customizable components
o QA is baked into Bento
development process
o Bento can serve as a “foundry”
for CRDC
Bento’s Core
Frederick National Laboratory for Cancer Research
Bento Saves
Development Time
Integrated Canine Data Commons
Clinical Trial Data Commons
Bento Based Data Commons
9 MONTHS
4 MONTHS
~4 WEEKS
Frederick National Laboratory for Cancer Research
o Microservices in Containers:
- User Interface
- Backend
- Database
- Core Data Model
o Technology Stack:
Neo4J | Java/Python | GraphQL | ReactJS
- Neo4j drives both persistence and UI
o Loose Coupling of Components
Bento Architecture
For the Custodian
o Pull codebase
o Configure locally
o Deploy anywhere
o Multi-cloud support
o Documentation support
Frederick National Laboratory for Cancer Research
1
Bento Core
Data Model
Bento Core API
Bento Core UI
Framework
ü Bento Core Components work
out of the box.
ü Bento Custodian need only
configure the UI.
ü The core components provide a
minimal but working application.
2
Bento Core UI
Framework
Bento
Extended
Model
Bento Extended API
ü Bento Core Data Model is extended.
ü Bento Core API custom queries
updated to provide access to data in the
extended sections of the model.
ü UI continues to work with the API and
data model, without any additional
development.
ü Bento Core Data Model is extended.
ü Bento Core API custom queries
updated to provided access to data in
the extended sections of the model.
ü Core UI built out to display model
elements according to your
custodian’s specifications.
3
Bento Extended UI
Framework
Bento
Extended
Model
Bento Extended API
Bento - Degrees of Configuration
Model Description Format (MDF)
MDF is a compact, human-readable—but computable—format
for defining a property graph
• Nodes
• Node Properties
• Relationships
• Relationship Properties
• Relationship Attributes: direction, multiplicity, required/optional, …
• Property Definitions
• Property Attributes: allowable value types or sets, required/optional, …
Frederick National Laboratory for Cancer Research
https://github.com/CBIIT/bento-mdf
MDF Overview
Nodes
Edges
Property
Defs
Props referenced here … and defined here
Entity names are the keys
Nodes at the ends,
with direction
Other attributes
specified
Constrain the
data values
to defined types
Frederick National Laboratory for Cancer Research
Frederick National Laboratory for Cancer Research
Metamodel Database
https://github.com/CBIIT/bento-meta
CTDC Model
in MDB
Metamodel
"Schema"
Testimonial - ICDC Product Owner
Philip Musk 12:06
And let me tell you, with data needs driving many of
ICDC's requirements as they are, and have been thus far,
being able to both write the requirements, and make
the required model changes ahead of engineers doing
their thing, is really powerful. I don't have to explain
what model changes we need to make to someone else -
I can get the model changes done myself, and explain
what we need the engineers and the UI to do with those
changes.
Data pipeline
Frederick National Laboratory for Cancer Research
Case file
Data-loader
(Python) Neo4J
Case file
Case file
Study file
Case file
Study
COTC007B
Case
COTC007b-0505
Case
COTC007b-1605
Case
COTC007b-0610 Case
COTC007b-0408
Cohort
NSC 7067….
Cypher
Backend Backend Architecture
Neo4j
GraphQL
Plugin
GraphQL
GraphQL
Java App
Data
Data
Client
Bento Backend
Redis
GraphQL null
GraphQL/Data
Frederick National Laboratory for Cancer Research
Neo4j
GraphQL
Plugin
GraphQL
Java App
Data
Client
Bento Backend
Redis
GraphQL
Data
Frederick National Laboratory for Cancer Research
Backend Backend Architecture
Why Neo4j-GraphQL Plugin?
• Very easy to setup
• Very easy to use
• Only one file (the schema file) is needed to setup a GraphQL API
• Automatically generated common queries
• Supports writing custom queries with Cypher
• Very responsive customer support!
Frederick National Laboratory for Cancer Research
GraphQL Schema (Cypher Annotated)
Frederick National Laboratory for Cancer Research
Frederick National Laboratory for Cancer Research
Frederick National Laboratory for Cancer Research
0
0
Establish workflow for building
reusable Bento components
Retool existing sites to use Bento
Add authentication and authorization layer
Add enhanced data viz capabilities
Enhance performance
1
2
3
Bento Roadmap
4
5
1
2
3
4
5
Frederick National Laboratory for Cancer Research
The Bento Team
Bento Leadership
Braulio Cabral
John Otridge
Todd Pihl
Mark Jensen
DevOps & Backend
Ming Ying
Vincent Donkor
Michael Fleming
Austin Mueller
Bento Management
Ye Wu
Kailing Chen
Amit Mukherjee
Data
Mark Jensen
Philip Musk
Mark Benson
Kevin Burns
Claire Wolfe
Anjan Purkayastha
UI
Dimitri Darras
Hannah Stogsdill
Sri Kiran Chaparala
Ajay Doddapaneni
Ming Ying
Yizhen Chen
Karan Sheth
QA
Laxmi Lolla
Gayatri Radhakrishnan
Megan Lewis
Integrated Frederick National Lab and Essential Software Inc. Team
Thank you.
Frederick National Laboratory for Cancer Research
https://cbiit.github.io/bento-docs-sphinx/master/installation/bento-quick-start.html

More Related Content

What's hot

The Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdfThe Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdf
Neo4j
 

What's hot (20)

Road to NODES - Healthcare Analytics
Road to NODES - Healthcare AnalyticsRoad to NODES - Healthcare Analytics
Road to NODES - Healthcare Analytics
 
Fraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business AuthorityFraud Detection with Graphs at the Danish Business Authority
Fraud Detection with Graphs at the Danish Business Authority
 
JupyterHub - A "Thing Explainer" Overview
JupyterHub - A "Thing Explainer" OverviewJupyterHub - A "Thing Explainer" Overview
JupyterHub - A "Thing Explainer" Overview
 
Neo4j Graph Use Cases, Bruno Ungermann, Neo4j
Neo4j Graph Use Cases, Bruno Ungermann, Neo4jNeo4j Graph Use Cases, Bruno Ungermann, Neo4j
Neo4j Graph Use Cases, Bruno Ungermann, Neo4j
 
Bucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing ShuffleBucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
 
Demystifying Graph Neural Networks
Demystifying Graph Neural NetworksDemystifying Graph Neural Networks
Demystifying Graph Neural Networks
 
Intro to Cypher
Intro to CypherIntro to Cypher
Intro to Cypher
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
Building Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta LakeBuilding Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta Lake
 
The Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdfThe Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdf
 
Hadoop Query Performance Smackdown
Hadoop Query Performance SmackdownHadoop Query Performance Smackdown
Hadoop Query Performance Smackdown
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid Cloud
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks Streaming
 
Optimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j GraphOptimizing Your Supply Chain with the Neo4j Graph
Optimizing Your Supply Chain with the Neo4j Graph
 
Iceberg: A modern table format for big data (Strata NY 2018)
Iceberg: A modern table format for big data (Strata NY 2018)Iceberg: A modern table format for big data (Strata NY 2018)
Iceberg: A modern table format for big data (Strata NY 2018)
 
ntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technologyntroducing to the Power of Graph Technology
ntroducing to the Power of Graph Technology
 
Web analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.comWeb analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.com
 
GPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge GraphGPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge Graph
 
Accelerate Your ML Pipeline with AutoML and MLflow
Accelerate Your ML Pipeline with AutoML and MLflowAccelerate Your ML Pipeline with AutoML and MLflow
Accelerate Your ML Pipeline with AutoML and MLflow
 

Similar to 5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework

CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECAProject
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
OSTHUS
 
Record matching over query results
Record matching over query resultsRecord matching over query results
Record matching over query results
ambitlick
 
PILoT-v2
PILoT-v2PILoT-v2
PILoT-v2
Di Wu
 
SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17
Pieter Pauwels
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 

Similar to 5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework (20)

Government GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 StandardsGovernment GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 Standards
 
Simbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational ResearchSimbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational Research
 
Patent database a methodology of information retrieval from pdf
Patent database  a methodology of information retrieval from pdfPatent database  a methodology of information retrieval from pdf
Patent database a methodology of information retrieval from pdf
 
Shailendra Resume
Shailendra ResumeShailendra Resume
Shailendra Resume
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Framework Enabling End-Users to Maintain Web Applications (ICICWS2015)
Framework Enabling End-Users to Maintain Web Applications (ICICWS2015)Framework Enabling End-Users to Maintain Web Applications (ICICWS2015)
Framework Enabling End-Users to Maintain Web Applications (ICICWS2015)
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
SIES IoT spresentation
SIES IoT spresentationSIES IoT spresentation
SIES IoT spresentation
 
第1回バイオインフォマティクスデータ可視化セミナー@Riken
第1回バイオインフォマティクスデータ可視化セミナー@Riken第1回バイオインフォマティクスデータ可視化セミナー@Riken
第1回バイオインフォマティクスデータ可視化セミナー@Riken
 
Record matching over query results
Record matching over query resultsRecord matching over query results
Record matching over query results
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Alex Wade, Digital Library Interoperability
Alex Wade, Digital Library InteroperabilityAlex Wade, Digital Library Interoperability
Alex Wade, Digital Library Interoperability
 
PILoT-v2
PILoT-v2PILoT-v2
PILoT-v2
 
SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17SustainablePlaces_ifcOWL_applications_2015-09-17
SustainablePlaces_ifcOWL_applications_2015-09-17
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Applying linear regression and predictive analytics
Applying linear regression and predictive analyticsApplying linear regression and predictive analytics
Applying linear regression and predictive analytics
 
Infrastructure as Code for Network
Infrastructure as Code for NetworkInfrastructure as Code for Network
Infrastructure as Code for Network
 
Bots & spiders
Bots & spidersBots & spiders
Bots & spiders
 

More from Neo4j

More from Neo4j (20)

GraphAware - Transforming policing with graph-based intelligence analysis
GraphAware - Transforming policing with graph-based intelligence analysisGraphAware - Transforming policing with graph-based intelligence analysis
GraphAware - Transforming policing with graph-based intelligence analysis
 
GraphSummit Stockholm - Neo4j - Knowledge Graphs and Product Updates
GraphSummit Stockholm - Neo4j - Knowledge Graphs and Product UpdatesGraphSummit Stockholm - Neo4j - Knowledge Graphs and Product Updates
GraphSummit Stockholm - Neo4j - Knowledge Graphs and Product Updates
 
KLARNA - Language Models and Knowledge Graphs: A Systems Approach
KLARNA -  Language Models and Knowledge Graphs: A Systems ApproachKLARNA -  Language Models and Knowledge Graphs: A Systems Approach
KLARNA - Language Models and Knowledge Graphs: A Systems Approach
 
INGKA DIGITAL: Linked Metadata by Design
INGKA DIGITAL: Linked Metadata by DesignINGKA DIGITAL: Linked Metadata by Design
INGKA DIGITAL: Linked Metadata by Design
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptxBT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
BT & Neo4j _ How Knowledge Graphs help BT deliver Digital Transformation.pptx
 
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit MilanWorkshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4jGraphSummit Milan - Visione e roadmap del prodotto Neo4j
GraphSummit Milan - Visione e roadmap del prodotto Neo4j
 
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with GraphGraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
 
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
LARUS - Galileo.XAI e Gen-AI: la nuova prospettiva di LARUS per il futuro del...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
CERVED e Neo4j su una nuvola, migrazione ed evoluzione di un grafo mission cr...
 
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 

Recently uploaded

Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 

5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework

  • 1. DEPARTMENT OF HEALTH AND HUMAN SERVICES • National Institutes of Health • National Cancer Institute Frederick National Laboratory is a Federally Funded Research and Development Center operated by Leidos Biomedical Research, Inc., for the National Cancer Institute Neo4j GovConnect: The Bento Framework for Building Data Commons Biomedical Informatics and Data Science (BIDS) Directorate Frederick National Laboratory for Cancer Research (FNLCR) Todd Pihl, PhD, PMP Ming Ying, MS Mark A. Jensen, PhD, PMP
  • 2. Frederick National Laboratory for Cancer Research • Why was Bento Framework built? • What is the Bento Framework? • How does it work? • Where does Neo4j fit in? • What’s next?
  • 3. The NCI Cancer Research Data Commons Ecosystem https://datascience.cancer.gov/data-commons Frederick National Laboratory for Cancer Research
  • 4. Frederick National Laboratory for Cancer Research Integrated Canine Data Commons • Not data type centric Clinical Trials Data Commons • Genomic data bias
  • 5. Frederick National Laboratory for Cancer Research o Bento can serve as a framework for CRDC platforms o Each platform can customize the core framework o Bento provides modular, customizable components o QA is baked into Bento development process o Bento can serve as a “foundry” for CRDC Bento’s Core
  • 6. Frederick National Laboratory for Cancer Research Bento Saves Development Time Integrated Canine Data Commons Clinical Trial Data Commons Bento Based Data Commons 9 MONTHS 4 MONTHS ~4 WEEKS
  • 7. Frederick National Laboratory for Cancer Research o Microservices in Containers: - User Interface - Backend - Database - Core Data Model o Technology Stack: Neo4J | Java/Python | GraphQL | ReactJS - Neo4j drives both persistence and UI o Loose Coupling of Components Bento Architecture For the Custodian o Pull codebase o Configure locally o Deploy anywhere o Multi-cloud support o Documentation support
  • 8. Frederick National Laboratory for Cancer Research 1 Bento Core Data Model Bento Core API Bento Core UI Framework ü Bento Core Components work out of the box. ü Bento Custodian need only configure the UI. ü The core components provide a minimal but working application. 2 Bento Core UI Framework Bento Extended Model Bento Extended API ü Bento Core Data Model is extended. ü Bento Core API custom queries updated to provide access to data in the extended sections of the model. ü UI continues to work with the API and data model, without any additional development. ü Bento Core Data Model is extended. ü Bento Core API custom queries updated to provided access to data in the extended sections of the model. ü Core UI built out to display model elements according to your custodian’s specifications. 3 Bento Extended UI Framework Bento Extended Model Bento Extended API Bento - Degrees of Configuration
  • 9. Model Description Format (MDF) MDF is a compact, human-readable—but computable—format for defining a property graph • Nodes • Node Properties • Relationships • Relationship Properties • Relationship Attributes: direction, multiplicity, required/optional, … • Property Definitions • Property Attributes: allowable value types or sets, required/optional, … Frederick National Laboratory for Cancer Research https://github.com/CBIIT/bento-mdf
  • 10. MDF Overview Nodes Edges Property Defs Props referenced here … and defined here Entity names are the keys Nodes at the ends, with direction Other attributes specified Constrain the data values to defined types Frederick National Laboratory for Cancer Research
  • 11. Frederick National Laboratory for Cancer Research Metamodel Database https://github.com/CBIIT/bento-meta CTDC Model in MDB Metamodel "Schema"
  • 12. Testimonial - ICDC Product Owner Philip Musk 12:06 And let me tell you, with data needs driving many of ICDC's requirements as they are, and have been thus far, being able to both write the requirements, and make the required model changes ahead of engineers doing their thing, is really powerful. I don't have to explain what model changes we need to make to someone else - I can get the model changes done myself, and explain what we need the engineers and the UI to do with those changes.
  • 13. Data pipeline Frederick National Laboratory for Cancer Research Case file Data-loader (Python) Neo4J Case file Case file Study file Case file Study COTC007B Case COTC007b-0505 Case COTC007b-1605 Case COTC007b-0610 Case COTC007b-0408 Cohort NSC 7067…. Cypher
  • 14. Backend Backend Architecture Neo4j GraphQL Plugin GraphQL GraphQL Java App Data Data Client Bento Backend Redis GraphQL null GraphQL/Data Frederick National Laboratory for Cancer Research
  • 15. Neo4j GraphQL Plugin GraphQL Java App Data Client Bento Backend Redis GraphQL Data Frederick National Laboratory for Cancer Research Backend Backend Architecture
  • 16. Why Neo4j-GraphQL Plugin? • Very easy to setup • Very easy to use • Only one file (the schema file) is needed to setup a GraphQL API • Automatically generated common queries • Supports writing custom queries with Cypher • Very responsive customer support! Frederick National Laboratory for Cancer Research
  • 17. GraphQL Schema (Cypher Annotated) Frederick National Laboratory for Cancer Research
  • 18. Frederick National Laboratory for Cancer Research
  • 19. Frederick National Laboratory for Cancer Research
  • 20. 0 0 Establish workflow for building reusable Bento components Retool existing sites to use Bento Add authentication and authorization layer Add enhanced data viz capabilities Enhance performance 1 2 3 Bento Roadmap 4 5 1 2 3 4 5
  • 21. Frederick National Laboratory for Cancer Research The Bento Team Bento Leadership Braulio Cabral John Otridge Todd Pihl Mark Jensen DevOps & Backend Ming Ying Vincent Donkor Michael Fleming Austin Mueller Bento Management Ye Wu Kailing Chen Amit Mukherjee Data Mark Jensen Philip Musk Mark Benson Kevin Burns Claire Wolfe Anjan Purkayastha UI Dimitri Darras Hannah Stogsdill Sri Kiran Chaparala Ajay Doddapaneni Ming Ying Yizhen Chen Karan Sheth QA Laxmi Lolla Gayatri Radhakrishnan Megan Lewis Integrated Frederick National Lab and Essential Software Inc. Team
  • 22. Thank you. Frederick National Laboratory for Cancer Research https://cbiit.github.io/bento-docs-sphinx/master/installation/bento-quick-start.html