SlideShare a Scribd company logo
1 of 22
BEL.bio and BioDati Studio
Why BEL?
Chemists have the Chemical Reaction Language
Biologists have the Biological Expression Language (BEL)
Open standard for communication and knowledge-storage
Whiteboard and Computer friendly
Partial chemical synthesis pathway: https://www.synarchive.com/syn/128
Overall Goals for BEL.bio
Try to simplify use of BEL and BEL Content
Stronger BEL/Nanopub validation, better error messages
Easy addition of new BEL Language features
Convert to Python and Docker
Easier community engagement
Quick easy startup/deployment
Provide API and Namespaces hosting
Easier to use/deploy/maintain search/completion service
Greatly expand organisms supported (all EntrezGene/NCBITaxonomy)
Simplify addition/maintenance of namespaces/zero downtime updates!!!
Glossary
BEL Assertion – single string version of BEL or subject, relation, object
(SRO) version of BEL assertion (e.g. BEL triple)
BEL Nanopub – BEL triple, Evidence, Context, Citation, Metadata
Evidence – short text extraction or supporting information for BELTriple (Evidence
in BEL Script, Support in OpenBEL Nanopub format)
Annotations – OpenBEL Annotations are now called Annotations and were referred
to as Experimental Context in BELMgr
BEL Edge – BEL triples, primary and computed BEL canonicalized to
standard namespace IDs and potentially orthologized stored in the
EdgeStore (a graph database)
API – BEL.bio API – BEL language, nanopub, terminology (namespace,
orthology) services
AST – Abstract SyntaxTree of BEL Statement
Function: BEL function, e.g. p() or modifier function, e.g. var()
NSArg: Namespace argument, e.g. HGNC:AKT1
StrArg: String argument, e.g. pmod(Ph,T, 22), Ph,T and 22 are string arguments
Not supported by BEL.bio
KAMs
OpenBEL API/tooling
BELScripts (except for converting to BEL Nanopubs)
XBEL
OpenBEL namespace/equivalence files (limited conversion to
BEL.bioTerminology files)
BEL Parsing and Validation
bel_lang python module
Depends on BEL.bio API for terminology services (namespaces, equivalents,
orthology)
Parsing, validation, canonicalization, orthologization, compute
edges (eventually completion and migration)
Uses BEL Specification and EBNF file for parsing and semantic
validation
EBNF file used byTatsu module to create parser library to parse BELTriple
into dictionary AST of Function, NSArg, StrArg components,AST is
transformed to python AST class-based object (BEL Object  BO)
BEL Spec used to process BO for semantic validation
bo.parse('p(MGI:A1bg)').orthologize('TAX:9606').canonicalize().ast.to_string(fmt='medium')
p(EG:1)
Provides CLI installed with module
Supports Multiple BEL versions
Can deploy bel_lang with multiple BEL versions (only BEL 2.0.0
currently (using semantic versioning now for BEL)
One BEL Specification file per version, EBNF/parser generated from
BEL Spec
Drop in new BEL Spec, get new BELVersion functionality, easy
testing of proposed BEL language features
Future: create BEL migration signatures like the computed edge
signatures for migrating BEL
BioDati Studio
Terminology Services
BEL Terminology Resources
Simplify Namespaces
GOBP, GOCC, GOBPID, GOCCID -> GO
Context (Annotations) are now also Namespaces
Simplify generator scripts
Single script per resource: download and reformat into
terminology or orthology load file
Single download/cache directory (gzipped)
BEL Resource tools Github repo
https://github.com/belbio/bel_resources
Terminology Workflow
Term Completion Examples – BioDati Studio
Namespaces Count
EG 20,750,186
TAX 1,736,298
SP 557,012
AFFX 327,392
CHEBI 106,644
MGI 57,532
RGD 44,972
GO 44,922
HGNC 41,315
ZFIN 23,388
MESH 19,223
UBERON 13,232
DO 8,699
CL 2,194
EFO 937
Terminology Statistics
Summary
BioDati Services
BioDati Studio
Data Stores (Nanopubs, Networks, Edges)
Terminologies (collection, normalization, search, completions)
Consulting – BEL-related
Acknowledgements
NatalieCatlett, PatientsLikeMe
Anselmo DiFabio, BioDati
David Chen
Tony Bargnesi
Nick Bargnesi
Additional resources
http://bel.bio
http://biodati.com
http://medium.com/biodati
JSON philosophy: https://towardsdatascience.com/my-love-affair-
with-json-edaca39e8320
https://github.com/belbio
location: 1501 Main Street, Rahway, NJ 07065 | call: 732-764-8844 | online: biodati.com
Anselmo Di Fabio
adifabio@biodati.com
William Hayes
whayes@biodati.com

More Related Content

What's hot

web programming UNIT VIII python by Bhavsingh Maloth
web programming UNIT VIII python by Bhavsingh Malothweb programming UNIT VIII python by Bhavsingh Maloth
web programming UNIT VIII python by Bhavsingh MalothBhavsingh Maloth
 
Implementation - Sample Runs
Implementation - Sample RunsImplementation - Sample Runs
Implementation - Sample RunsAdwiteeya Agrawal
 
Introduction to Python Programing
Introduction to Python ProgramingIntroduction to Python Programing
Introduction to Python Programingsameer patil
 
Jython: Integrating Python and Java
Jython: Integrating Python and JavaJython: Integrating Python and Java
Jython: Integrating Python and JavaCharles Anderson
 

What's hot (9)

Python Tutorial Part 2
Python Tutorial Part 2Python Tutorial Part 2
Python Tutorial Part 2
 
web programming UNIT VIII python by Bhavsingh Maloth
web programming UNIT VIII python by Bhavsingh Malothweb programming UNIT VIII python by Bhavsingh Maloth
web programming UNIT VIII python by Bhavsingh Maloth
 
Implementation - Sample Runs
Implementation - Sample RunsImplementation - Sample Runs
Implementation - Sample Runs
 
Introduction to Python Programing
Introduction to Python ProgramingIntroduction to Python Programing
Introduction to Python Programing
 
Compilation
CompilationCompilation
Compilation
 
Introduction to Python Basics Programming
Introduction to Python Basics ProgrammingIntroduction to Python Basics Programming
Introduction to Python Basics Programming
 
PBS Tech Con 2011 API Workshop
PBS Tech Con 2011 API WorkshopPBS Tech Con 2011 API Workshop
PBS Tech Con 2011 API Workshop
 
Jython: Integrating Python and Java
Jython: Integrating Python and JavaJython: Integrating Python and Java
Jython: Integrating Python and Java
 
MPI - 1
MPI - 1MPI - 1
MPI - 1
 

Similar to BEL.bio Overview and BioDati Studio

plone.app.multilingual
plone.app.multilingual plone.app.multilingual
plone.app.multilingual Ramon Navarro
 
Corpus annotation for corpus linguistics (nov2009)
Corpus annotation for corpus linguistics (nov2009)Corpus annotation for corpus linguistics (nov2009)
Corpus annotation for corpus linguistics (nov2009)Jorge Baptista
 
Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Gong Cheng
 
AO: Annotation Ontology for science on the web
AO: Annotation Ontology for science on the webAO: Annotation Ontology for science on the web
AO: Annotation Ontology for science on the webPaolo Ciccarese
 
IMS LODE and the ASPECT project
IMS LODE and the ASPECT projectIMS LODE and the ASPECT project
IMS LODE and the ASPECT projectDavid Massart
 
Jazoon2010 - Edgar Silva - Open source SOA on Steroids
Jazoon2010 - Edgar Silva - Open source SOA on SteroidsJazoon2010 - Edgar Silva - Open source SOA on Steroids
Jazoon2010 - Edgar Silva - Open source SOA on SteroidsEdgar Silva
 
Whats new in Alchemy Catalyst 8.0
Whats new in Alchemy Catalyst 8.0Whats new in Alchemy Catalyst 8.0
Whats new in Alchemy Catalyst 8.0Shamusd
 
Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Alexandro Colorado
 
.NET 4 Demystified - Sandeep Joshi
.NET 4 Demystified - Sandeep Joshi.NET 4 Demystified - Sandeep Joshi
.NET 4 Demystified - Sandeep JoshiSpiffy
 
Language Server Protocol - Why the Hype?
Language Server Protocol - Why the Hype?Language Server Protocol - Why the Hype?
Language Server Protocol - Why the Hype?mikaelbarbero
 
Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009bosc
 
Network Protocol Testing Using Robot Framework
Network Protocol Testing Using Robot FrameworkNetwork Protocol Testing Using Robot Framework
Network Protocol Testing Using Robot FrameworkPayal Jain
 
Managing Change
Managing ChangeManaging Change
Managing ChangeMirko Jahn
 
Corpus Linguistics :Analytical Tools
Corpus Linguistics :Analytical ToolsCorpus Linguistics :Analytical Tools
Corpus Linguistics :Analytical ToolsJitendra Patil
 

Similar to BEL.bio Overview and BioDati Studio (20)

The aos cs workbench
The aos cs workbenchThe aos cs workbench
The aos cs workbench
 
The AOS/CS Workbench
The AOS/CS WorkbenchThe AOS/CS Workbench
The AOS/CS Workbench
 
plone.app.multilingual
plone.app.multilingual plone.app.multilingual
plone.app.multilingual
 
Corpus annotation for corpus linguistics (nov2009)
Corpus annotation for corpus linguistics (nov2009)Corpus annotation for corpus linguistics (nov2009)
Corpus annotation for corpus linguistics (nov2009)
 
Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007
 
CLTL Software and Web Services
CLTL Software and Web Services CLTL Software and Web Services
CLTL Software and Web Services
 
Robot framework
Robot frameworkRobot framework
Robot framework
 
AO: Annotation Ontology for science on the web
AO: Annotation Ontology for science on the webAO: Annotation Ontology for science on the web
AO: Annotation Ontology for science on the web
 
IMS LODE and the ASPECT project
IMS LODE and the ASPECT projectIMS LODE and the ASPECT project
IMS LODE and the ASPECT project
 
Jazoon2010 - Edgar Silva - Open source SOA on Steroids
Jazoon2010 - Edgar Silva - Open source SOA on SteroidsJazoon2010 - Edgar Silva - Open source SOA on Steroids
Jazoon2010 - Edgar Silva - Open source SOA on Steroids
 
Preliminary study on using vector quantization latent spaces for TTS/VC syste...
Preliminary study on using vector quantization latent spaces for TTS/VC syste...Preliminary study on using vector quantization latent spaces for TTS/VC syste...
Preliminary study on using vector quantization latent spaces for TTS/VC syste...
 
Whats new in Alchemy Catalyst 8.0
Whats new in Alchemy Catalyst 8.0Whats new in Alchemy Catalyst 8.0
Whats new in Alchemy Catalyst 8.0
 
Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.
 
.NET 4 Demystified - Sandeep Joshi
.NET 4 Demystified - Sandeep Joshi.NET 4 Demystified - Sandeep Joshi
.NET 4 Demystified - Sandeep Joshi
 
Managing domain ontologies within the AOS
Managing domain ontologies within the AOSManaging domain ontologies within the AOS
Managing domain ontologies within the AOS
 
Language Server Protocol - Why the Hype?
Language Server Protocol - Why the Hype?Language Server Protocol - Why the Hype?
Language Server Protocol - Why the Hype?
 
Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009
 
Network Protocol Testing Using Robot Framework
Network Protocol Testing Using Robot FrameworkNetwork Protocol Testing Using Robot Framework
Network Protocol Testing Using Robot Framework
 
Managing Change
Managing ChangeManaging Change
Managing Change
 
Corpus Linguistics :Analytical Tools
Corpus Linguistics :Analytical ToolsCorpus Linguistics :Analytical Tools
Corpus Linguistics :Analytical Tools
 

Recently uploaded

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 

BEL.bio Overview and BioDati Studio

  • 2. Why BEL? Chemists have the Chemical Reaction Language Biologists have the Biological Expression Language (BEL) Open standard for communication and knowledge-storage Whiteboard and Computer friendly Partial chemical synthesis pathway: https://www.synarchive.com/syn/128
  • 3. Overall Goals for BEL.bio Try to simplify use of BEL and BEL Content Stronger BEL/Nanopub validation, better error messages Easy addition of new BEL Language features Convert to Python and Docker Easier community engagement Quick easy startup/deployment Provide API and Namespaces hosting Easier to use/deploy/maintain search/completion service Greatly expand organisms supported (all EntrezGene/NCBITaxonomy) Simplify addition/maintenance of namespaces/zero downtime updates!!!
  • 4. Glossary BEL Assertion – single string version of BEL or subject, relation, object (SRO) version of BEL assertion (e.g. BEL triple) BEL Nanopub – BEL triple, Evidence, Context, Citation, Metadata Evidence – short text extraction or supporting information for BELTriple (Evidence in BEL Script, Support in OpenBEL Nanopub format) Annotations – OpenBEL Annotations are now called Annotations and were referred to as Experimental Context in BELMgr BEL Edge – BEL triples, primary and computed BEL canonicalized to standard namespace IDs and potentially orthologized stored in the EdgeStore (a graph database) API – BEL.bio API – BEL language, nanopub, terminology (namespace, orthology) services AST – Abstract SyntaxTree of BEL Statement Function: BEL function, e.g. p() or modifier function, e.g. var() NSArg: Namespace argument, e.g. HGNC:AKT1 StrArg: String argument, e.g. pmod(Ph,T, 22), Ph,T and 22 are string arguments
  • 5. Not supported by BEL.bio KAMs OpenBEL API/tooling BELScripts (except for converting to BEL Nanopubs) XBEL OpenBEL namespace/equivalence files (limited conversion to BEL.bioTerminology files)
  • 6. BEL Parsing and Validation bel_lang python module Depends on BEL.bio API for terminology services (namespaces, equivalents, orthology) Parsing, validation, canonicalization, orthologization, compute edges (eventually completion and migration) Uses BEL Specification and EBNF file for parsing and semantic validation EBNF file used byTatsu module to create parser library to parse BELTriple into dictionary AST of Function, NSArg, StrArg components,AST is transformed to python AST class-based object (BEL Object  BO) BEL Spec used to process BO for semantic validation bo.parse('p(MGI:A1bg)').orthologize('TAX:9606').canonicalize().ast.to_string(fmt='medium') p(EG:1) Provides CLI installed with module
  • 7. Supports Multiple BEL versions Can deploy bel_lang with multiple BEL versions (only BEL 2.0.0 currently (using semantic versioning now for BEL) One BEL Specification file per version, EBNF/parser generated from BEL Spec Drop in new BEL Spec, get new BELVersion functionality, easy testing of proposed BEL language features Future: create BEL migration signatures like the computed edge signatures for migrating BEL
  • 9.
  • 10.
  • 11.
  • 12.
  • 14. BEL Terminology Resources Simplify Namespaces GOBP, GOCC, GOBPID, GOCCID -> GO Context (Annotations) are now also Namespaces Simplify generator scripts Single script per resource: download and reformat into terminology or orthology load file Single download/cache directory (gzipped) BEL Resource tools Github repo https://github.com/belbio/bel_resources
  • 16. Term Completion Examples – BioDati Studio
  • 17. Namespaces Count EG 20,750,186 TAX 1,736,298 SP 557,012 AFFX 327,392 CHEBI 106,644 MGI 57,532 RGD 44,972 GO 44,922 HGNC 41,315 ZFIN 23,388 MESH 19,223 UBERON 13,232 DO 8,699 CL 2,194 EFO 937 Terminology Statistics
  • 19. BioDati Services BioDati Studio Data Stores (Nanopubs, Networks, Edges) Terminologies (collection, normalization, search, completions) Consulting – BEL-related
  • 20. Acknowledgements NatalieCatlett, PatientsLikeMe Anselmo DiFabio, BioDati David Chen Tony Bargnesi Nick Bargnesi
  • 21. Additional resources http://bel.bio http://biodati.com http://medium.com/biodati JSON philosophy: https://towardsdatascience.com/my-love-affair- with-json-edaca39e8320 https://github.com/belbio
  • 22. location: 1501 Main Street, Rahway, NJ 07065 | call: 732-764-8844 | online: biodati.com Anselmo Di Fabio adifabio@biodati.com William Hayes whayes@biodati.com

Editor's Notes

  1. History of BEL – developed over 10 years ago by Dexter Pratt at Genstruct (renamed to Selventa) and used for biomarker development as well as drug and toxicology mechanism analysis. BEL was was open-sourced about 5 years ago by Selventa by David de Graaf.
  2. Seeking funding for API and Namespaces hosting