SlideShare a Scribd company logo
1 of 27
Chunlei Wu, Ph.D.
cwu@scripps.edu
@chunleiwu
https://wulab.io
Associate Professor
Dept. of Integrative Structural and Computational Biology
The Scripps Research Institute
La Jolla, CA, USA
07/07/2018
Building a FAIR API Ecosystem for Biomedical Knowledge
http://biothings.io
Biomedical Data API
API – Application Programming Interface
API is a way to abstract the data-access layer.
Why bioinformaticians need APIs
It's about
Modularization
photo credits: http://www.edmentum.com/sites/edmentum.com/files/solutions/content/building_0.jpg
http://www.howcsharp.com/img/0/68/dont-repeat-yourself-dry-300x211.jpg
http://blog.capinc.com/wp-content/uploads/2013/02/Recycle_Logo_by_Har1-300x263.png
Reusability DRY principle
Biomedical APIs and FAIR matrix
APIs are not quite findable
APIs are naturally accessible
But enterprise-grade Biomedical APIs are still few
Often not interoperable across APIs
APIs serve reusable piece of data
But more can be made reusable in API development
?
?
MyGene.info API
http://mygene.info
Aggregated Gene annotations represented in JSON documents
{
“_id”: “1017”,
“Symbol”: “CDK2”,
“Ensembl”: “ENSG00000123374”,
“RefSeq”: [
“NM_001798”,
“NM_052827”
],
“Reporter”: {
“U95A”: [
“1792_g_at”,
“1833_at”
],
“U133A”:[
“211804_s_at”,
“2045252_at”,
“211803_at”
]
}
}
Enterprise-grade API via Simple interface
http://mygene.info/v3/gene/1017
http://mygene.info/v3/gene/1017?fields=symbol,name,pathway,uniprot
http://mygene.info/v3/query?q=CDK2
http://mygene.info/v3/query?q=name:kinase&species=human
http://mygene.info/v3/query?q=name:kinase AND _exists_:pathway
http://mygene.info/v3/query?q=pathway.kegg.name:wnt&fields=entrezgene,symbol,taxid,interpro
 Simple to use
 Always up-to-date (weekly updated)
 Comprehensive (23M genes from 22K species)
 High-performance and scalable
 High-availability
 Python, R, Javascript clients
 Developer-friendly (support CORS, gzip, https, msgpack, etc.)
A collection of “BioThings APIs”
Aggregates annotations for
93 million drugs/chemicals from 11 resources
I have a list of drug/chemical ids, want to get annotations
about them?
Drug/chemical annotation service:
GET /v1/drug/<drugid>
POST /v1/drug/ (batch mode)
I want to get matching drugs/chemicals with my query
term(s)
Drug/chemical query service:
GET /v1/query/?q= <query>
POST /v1/query/ (batch mode)
http://mygene.info http://myvariant.info http://mychem.info
~8 M requests
~10,000 unique IPs
every month
~1 M requests
1000 unique IPs
every month
Just launched!
Aggregates annotations for
22 million genes from 30 resources
I have a list of gene ids, want to get annotations about
them?
Gene annotation service:
GET /v3/gene/<geneid>
POST /v3/gene/ (batch mode)
I want to get matching genes with my query term(s)
Gene query service:
GET /v3/query/?q= <query>
POST /v3/query/ (batch mode)
Aggregates annotations for
874 million variants from 19 resources
I have a list of variant ids, want to get annotations about
them?
Variant annotation service:
GET /v1/variant/<hgvsid>
POST /v1/variant/ (batch mode)
I want to get matching variants with my query term(s)
Variant query service:
GET /v1/query/?q= <query>
POST /v1/query/ (batch mode)
Who is using BioThings API
Many users use our APIs in their daily analysis pipelines or simply caching annotations locally
MinePath.org
Gene Wiki
JBrowse
Allele
Registry
Variant
Curation
Interface
GNOMICS
Others can build their own APIs with
src monitor
scheduler
data merger
data indexer
URL pattern
JSONP
CORS
compression
JSON-LD
Tracking
unit tests
cluster setup
data deploy
cluster
scaling
load-balancing
Optional query
customization
Data Hub Web API Cloud
Deployment
data parsers
for individual
resources
MongoDB +
Elasticsearch
Python/Tornado
Amazon
AWS
http://biothings.io
BioThingsSDK
done by Users
abstracted in SDK
A collection of high-
performance APIs
An SDK for building
your own APIs
http://T.biothings.io
fast, up-to-date, simple-to-use
JSON data
aggregation
mechanism
High-
performance
query engine
Well-designed
REST API
pattern
JSON-LD
enabled
Linked Data
Data-updating scheduler
Python/R clients
…
Your data source
Your API
Abstraction of API building/deployment
Gene
Variant
Drug/Chemical
Taxonomy
Linked APIs via JSON-
LD & SmartAPI
JSON Object
{...}
Semantic Context
Linked API
Enhancing API interoperability
http://MyDisease.info
Disease
API-level data integration for translational research
Electronic
Health
Record
(EHS)
Drugs
Proteins
Pathways
Genes
Variants
MyVariant.info
ClinVar
CiVIC
…
MyGene.info
Ensembl
… Reactome
WikiPathways
…UniProt
…
MyChem.info
Clue.io
DrugBank
…
Pharos
Biolink
Wikidata
NDEx
…
Cross-API data interoperability
Input
Output
1. Compacted
Format
2. Compacted
Format
3. Nquads Format
Semantically-aligned API output
JSON-LD semantic context does not have
to be done by the API developers.
Anybody can contribute.
Semantic relationship represented in JSON-LD
{
"_id": "RZVAJINKPMORJF-UHFFFAOYSA-N",
"indication":[
{
concept_id: "Migraine",
concept_name: "37796009"
},
...
]
}
{
"@context": {
"indication": {
"@type": "@id",
"@id": "assoc:treats",
"@context": {
"concept_name": {
"@type": "@id",
"@id": "attr:label",
"@context": {
"@base": "http://biothings.io/explorer/vocab/terms/disease-name/"
}
},
"concept_id": {
"@type": "@id",
"@id": "attr:id",
"@context": {
"@base": "http://identifiers.org/snomedct/"
}
}
}
}
}
}
acetaminophen Migraine
treats
JSON object
JSON-LD context
OpenAPI specifications for API metadata
Tells how an API works
SmartAPI built on community standards
http://smart-api.info
Adds the semantic
context for the data
served from an API
Tells how an API works
SmartAPI as an API registry
http://smart-api.info
Hosted interactive documentation for your API
http://myvariant.smart-api.info
A Real-world Translational Questions
From NCATS Translator Hackathon in May 2018
Disease - Gene
Gene - Pathways
Pathways - GeneGene - Chemical
Symptom - Disease
BioThings Explorer
To explore the network of “SmartAPIs”
– discover APIs for specific questions
– Automatically trigger API calls to construct a subset of the
knowledge graph
http://biothings.io/explorer/
Find APIs can get me from pathways to genes:
Pathways Available APIs Genes
Find associated drug compounds to gene LCK:
LCK CHEML3707348
LCK
inhibits
Via DGIDB API
INCHIKEY:KKYYLKPGILUPOA-UHFFFAOYSA-N
UniProt:P06239
equals
Via MyGene API
targets
Via MyChem API
CHEMBL223873
equals Via MyChem API
Accessible via BioThings Explorer’s own API
http://biothings.io/explorer/api/v2/directinput2output
?input_prefix=hgnc.symbol
&output_prefix=chembl.compound
&input_value=LCK
&format=translator
View the complete API document:
http://biothings.io/explorer/api/
More about BioThings Explorer
Video Tutorial
https://youtu.be/cPUKRsaTlhg
Demos in Jupyter Notebook:
BioThings Explorer Demo
BioThings Explorer Metadata
http://biothings.io/explorer/
BioThings as a FAIR API Ecosystem
Accessible
Findable
Interoperable
Reusable
Acknowledgement
Scripps Research
Andrew Su (sulab.org)
Cyrus Afrasiabi
Sebastien Lelong
Jiwen (Kevin) Xin
Marco Cano Alvarado
Ginger Tsueng
Byung Ryul Jeon
Greg Taylor
Maastricht Univ.
Michel Dumontier
(dumontierlab.com)
Amrapali Zaveri
Kody Moodley
Trish Whetzel (T2 Labs)
Shima Dastgheib (NuMedii)
Ruben Verborgh (Ghent Univ.)
Paul Avillach (Harvard)
Gabor Korodi (Harvard)
Raymond Terryn (Univ. of Miami)
Kathleen Jagodnik (Mount Sinai)
Pedro Assis (Stanford)
Funding support from
NIH Data Commons
API interoperability working group
Univ. of Washington
Sean Mooney
Vikas R Pejaver

More Related Content

Similar to BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge

BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
BioThings API: Building a FAIR API Ecosystem for Biomedical KnowledgeBioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
BioThings API: Building a FAIR API Ecosystem for Biomedical KnowledgeChunlei Wu
 
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...Chunlei Wu
 
MyVariant.info: Variant Annotation as a Service
MyVariant.info: Variant Annotation as a ServiceMyVariant.info: Variant Annotation as a Service
MyVariant.info: Variant Annotation as a ServiceChunlei Wu
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioCatalogue
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...Araport
 
Chunlei wu heart_bd2k_201602_ebi
Chunlei wu heart_bd2k_201602_ebiChunlei wu heart_bd2k_201602_ebi
Chunlei wu heart_bd2k_201602_ebiChunlei Wu
 
Elsevier developer network - developer presentation
Elsevier   developer network - developer presentationElsevier   developer network - developer presentation
Elsevier developer network - developer presentationremko caprio
 
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...apidays
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchDavid Ruau
 
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...Syed Ahmad Chan Bukhari, PhD
 
2008 11 13 Hcls Call
2008 11 13 Hcls Call2008 11 13 Hcls Call
2008 11 13 Hcls CallJun Zhao
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...University of California, San Diego
 
SADI SWSIP '09 'cause you can't always GET what you want!
SADI SWSIP '09  'cause you can't always GET what you want!SADI SWSIP '09  'cause you can't always GET what you want!
SADI SWSIP '09 'cause you can't always GET what you want!Mark Wilkinson
 
Biomart Update
Biomart UpdateBiomart Update
Biomart Updatebosc
 
Plant ontology web services on Araport
Plant ontology web services on AraportPlant ontology web services on Araport
Plant ontology web services on AraportAraport
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Valery Tkachenko
 
API Evolution and Usability
API Evolution and UsabilityAPI Evolution and Usability
API Evolution and UsabilityThiago Colares
 
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...Amazon Web Services
 

Similar to BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge (20)

BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
BioThings API: Building a FAIR API Ecosystem for Biomedical KnowledgeBioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge
 
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
BioThings and SmartAPI: building an ecosystem of interoperable biological kno...
 
MyVariant.info: Variant Annotation as a Service
MyVariant.info: Variant Annotation as a ServiceMyVariant.info: Variant Annotation as a Service
MyVariant.info: Variant Annotation as a Service
 
BioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogueBioIT Europe 2010 - BioCatalogue
BioIT Europe 2010 - BioCatalogue
 
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
PMR metabolomics and transcriptomics database and its RESTful web APIs: A dat...
 
Chunlei wu heart_bd2k_201602_ebi
Chunlei wu heart_bd2k_201602_ebiChunlei wu heart_bd2k_201602_ebi
Chunlei wu heart_bd2k_201602_ebi
 
Elsevier developer network - developer presentation
Elsevier   developer network - developer presentationElsevier   developer network - developer presentation
Elsevier developer network - developer presentation
 
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...
apidays New York 2023 - Android Applications and APIs Hacking, Gabrielle Botb...
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
 
2008 11 13 Hcls Call
2008 11 13 Hcls Call2008 11 13 Hcls Call
2008 11 13 Hcls Call
 
Kino : Making Semantic Annotations Easier
Kino : Making Semantic Annotations EasierKino : Making Semantic Annotations Easier
Kino : Making Semantic Annotations Easier
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
 
SADI SWSIP '09 'cause you can't always GET what you want!
SADI SWSIP '09  'cause you can't always GET what you want!SADI SWSIP '09  'cause you can't always GET what you want!
SADI SWSIP '09 'cause you can't always GET what you want!
 
Biomart Update
Biomart UpdateBiomart Update
Biomart Update
 
Plant ontology web services on Araport
Plant ontology web services on AraportPlant ontology web services on Araport
Plant ontology web services on Araport
 
Biothings presentation
Biothings presentationBiothings presentation
Biothings presentation
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
 
API Evolution and Usability
API Evolution and UsabilityAPI Evolution and Usability
API Evolution and Usability
 
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
WuXi NextCODE Scales up Genomic Sequencing on AWS (ANT210-S) - AWS re:Invent ...
 

Recently uploaded

300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 

Recently uploaded (20)

300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 

BioThings API: Building a FAIR API Ecosystem for Biomedical Knowledge

  • 1. Chunlei Wu, Ph.D. cwu@scripps.edu @chunleiwu https://wulab.io Associate Professor Dept. of Integrative Structural and Computational Biology The Scripps Research Institute La Jolla, CA, USA 07/07/2018 Building a FAIR API Ecosystem for Biomedical Knowledge http://biothings.io
  • 2. Biomedical Data API API – Application Programming Interface API is a way to abstract the data-access layer.
  • 3. Why bioinformaticians need APIs It's about Modularization photo credits: http://www.edmentum.com/sites/edmentum.com/files/solutions/content/building_0.jpg http://www.howcsharp.com/img/0/68/dont-repeat-yourself-dry-300x211.jpg http://blog.capinc.com/wp-content/uploads/2013/02/Recycle_Logo_by_Har1-300x263.png Reusability DRY principle
  • 4. Biomedical APIs and FAIR matrix APIs are not quite findable APIs are naturally accessible But enterprise-grade Biomedical APIs are still few Often not interoperable across APIs APIs serve reusable piece of data But more can be made reusable in API development ? ?
  • 6. Aggregated Gene annotations represented in JSON documents { “_id”: “1017”, “Symbol”: “CDK2”, “Ensembl”: “ENSG00000123374”, “RefSeq”: [ “NM_001798”, “NM_052827” ], “Reporter”: { “U95A”: [ “1792_g_at”, “1833_at” ], “U133A”:[ “211804_s_at”, “2045252_at”, “211803_at” ] } }
  • 7. Enterprise-grade API via Simple interface http://mygene.info/v3/gene/1017 http://mygene.info/v3/gene/1017?fields=symbol,name,pathway,uniprot http://mygene.info/v3/query?q=CDK2 http://mygene.info/v3/query?q=name:kinase&species=human http://mygene.info/v3/query?q=name:kinase AND _exists_:pathway http://mygene.info/v3/query?q=pathway.kegg.name:wnt&fields=entrezgene,symbol,taxid,interpro  Simple to use  Always up-to-date (weekly updated)  Comprehensive (23M genes from 22K species)  High-performance and scalable  High-availability  Python, R, Javascript clients  Developer-friendly (support CORS, gzip, https, msgpack, etc.)
  • 8. A collection of “BioThings APIs” Aggregates annotations for 93 million drugs/chemicals from 11 resources I have a list of drug/chemical ids, want to get annotations about them? Drug/chemical annotation service: GET /v1/drug/<drugid> POST /v1/drug/ (batch mode) I want to get matching drugs/chemicals with my query term(s) Drug/chemical query service: GET /v1/query/?q= <query> POST /v1/query/ (batch mode) http://mygene.info http://myvariant.info http://mychem.info ~8 M requests ~10,000 unique IPs every month ~1 M requests 1000 unique IPs every month Just launched! Aggregates annotations for 22 million genes from 30 resources I have a list of gene ids, want to get annotations about them? Gene annotation service: GET /v3/gene/<geneid> POST /v3/gene/ (batch mode) I want to get matching genes with my query term(s) Gene query service: GET /v3/query/?q= <query> POST /v3/query/ (batch mode) Aggregates annotations for 874 million variants from 19 resources I have a list of variant ids, want to get annotations about them? Variant annotation service: GET /v1/variant/<hgvsid> POST /v1/variant/ (batch mode) I want to get matching variants with my query term(s) Variant query service: GET /v1/query/?q= <query> POST /v1/query/ (batch mode)
  • 9. Who is using BioThings API Many users use our APIs in their daily analysis pipelines or simply caching annotations locally MinePath.org Gene Wiki JBrowse Allele Registry Variant Curation Interface GNOMICS
  • 10. Others can build their own APIs with src monitor scheduler data merger data indexer URL pattern JSONP CORS compression JSON-LD Tracking unit tests cluster setup data deploy cluster scaling load-balancing Optional query customization Data Hub Web API Cloud Deployment data parsers for individual resources MongoDB + Elasticsearch Python/Tornado Amazon AWS http://biothings.io BioThingsSDK done by Users abstracted in SDK
  • 11. A collection of high- performance APIs An SDK for building your own APIs http://T.biothings.io fast, up-to-date, simple-to-use JSON data aggregation mechanism High- performance query engine Well-designed REST API pattern JSON-LD enabled Linked Data Data-updating scheduler Python/R clients … Your data source Your API Abstraction of API building/deployment Gene Variant Drug/Chemical Taxonomy Linked APIs via JSON- LD & SmartAPI JSON Object {...} Semantic Context Linked API Enhancing API interoperability http://MyDisease.info Disease
  • 12. API-level data integration for translational research Electronic Health Record (EHS) Drugs Proteins Pathways Genes Variants MyVariant.info ClinVar CiVIC … MyGene.info Ensembl … Reactome WikiPathways …UniProt … MyChem.info Clue.io DrugBank … Pharos Biolink Wikidata NDEx …
  • 14. Input Output 1. Compacted Format 2. Compacted Format 3. Nquads Format Semantically-aligned API output JSON-LD semantic context does not have to be done by the API developers. Anybody can contribute.
  • 15. Semantic relationship represented in JSON-LD { "_id": "RZVAJINKPMORJF-UHFFFAOYSA-N", "indication":[ { concept_id: "Migraine", concept_name: "37796009" }, ... ] } { "@context": { "indication": { "@type": "@id", "@id": "assoc:treats", "@context": { "concept_name": { "@type": "@id", "@id": "attr:label", "@context": { "@base": "http://biothings.io/explorer/vocab/terms/disease-name/" } }, "concept_id": { "@type": "@id", "@id": "attr:id", "@context": { "@base": "http://identifiers.org/snomedct/" } } } } } } acetaminophen Migraine treats JSON object JSON-LD context
  • 16. OpenAPI specifications for API metadata Tells how an API works
  • 17. SmartAPI built on community standards http://smart-api.info Adds the semantic context for the data served from an API Tells how an API works
  • 18. SmartAPI as an API registry http://smart-api.info
  • 19. Hosted interactive documentation for your API http://myvariant.smart-api.info
  • 20. A Real-world Translational Questions From NCATS Translator Hackathon in May 2018 Disease - Gene Gene - Pathways Pathways - GeneGene - Chemical Symptom - Disease
  • 21. BioThings Explorer To explore the network of “SmartAPIs” – discover APIs for specific questions – Automatically trigger API calls to construct a subset of the knowledge graph http://biothings.io/explorer/
  • 22. Find APIs can get me from pathways to genes: Pathways Available APIs Genes
  • 23. Find associated drug compounds to gene LCK: LCK CHEML3707348 LCK inhibits Via DGIDB API INCHIKEY:KKYYLKPGILUPOA-UHFFFAOYSA-N UniProt:P06239 equals Via MyGene API targets Via MyChem API CHEMBL223873 equals Via MyChem API
  • 24. Accessible via BioThings Explorer’s own API http://biothings.io/explorer/api/v2/directinput2output ?input_prefix=hgnc.symbol &output_prefix=chembl.compound &input_value=LCK &format=translator View the complete API document: http://biothings.io/explorer/api/
  • 25. More about BioThings Explorer Video Tutorial https://youtu.be/cPUKRsaTlhg Demos in Jupyter Notebook: BioThings Explorer Demo BioThings Explorer Metadata http://biothings.io/explorer/
  • 26. BioThings as a FAIR API Ecosystem Accessible Findable Interoperable Reusable
  • 27. Acknowledgement Scripps Research Andrew Su (sulab.org) Cyrus Afrasiabi Sebastien Lelong Jiwen (Kevin) Xin Marco Cano Alvarado Ginger Tsueng Byung Ryul Jeon Greg Taylor Maastricht Univ. Michel Dumontier (dumontierlab.com) Amrapali Zaveri Kody Moodley Trish Whetzel (T2 Labs) Shima Dastgheib (NuMedii) Ruben Verborgh (Ghent Univ.) Paul Avillach (Harvard) Gabor Korodi (Harvard) Raymond Terryn (Univ. of Miami) Kathleen Jagodnik (Mount Sinai) Pedro Assis (Stanford) Funding support from NIH Data Commons API interoperability working group Univ. of Washington Sean Mooney Vikas R Pejaver

Editor's Notes

  1. Up-to-date and high-performance and high-availability
  2. SmartAPI tries to solve interoperability problem with two established community standards: JSON-LD and OpenAPI specifications.