NBDC / DBCLS presents 
BioHackathon 2014 
Standardization and utilization of human genome information 
with Semantic Web technologies 
Toshiaki Katayama <ktym@dbcls.jp> 
http://jp.linkedin.com/in/toshiakikatayama 
Database Center for Life Science (DBCLS), 
Research Organization of Information and Systems (ROIS), Japan 
2014/11/9 @ Tohoku Medical Megabank, Sendai, Japan
Hackathon 
Excursion... 
Symposium
Mission of NBDC/DBCLS 
• Biomedical domain 
• So many databases, so many publications 
• Integration of life science databases 
• To accelerate data driven science 
• Standardization and interoperability 
• Semantic Web and Linked Open Data 
• Software development 
• ... 
• Right technology + collaborative community 
• BioHackathon = Bio + Hack + Marathon = effective innovation
http://2014.biohackathon.org 
BioHackathon 2014 - the 7th NBDC/DBCLS BioHackathon 
• BioHackathon 2008 in Tokyo 
• Towards integrated Web service in life science with Open Bio* libraries 
• http://hackathon.dbcls.jp 
• BioHackathon 2009 in Okinawa 
• Integration of Web services in bioinformatics applications 
• http://hackathon2.dbcls.jp 
• BioHackathon 2010 in Tokyo 
• Integration and interpretation of biological knowledge with the Semantic Web technologies 
• http://hackathon3.dbcls.jp 
• BioHackathon 2011 in Kyoto 
• Creation and utilization of Linked Data in life sciences 
• http://2011.biohackathon.org 
• BioHackathon 2012 in Toyama 
• Biomedical applications based on the Semantic Web technologies 
• http://2012.biohackathon.org 
• BioHackathon 2013 in Tokyo 
• Semantic interoperability and standardization of bioinformatics data and Web services 
• http://2013.biohackathon.org
BioHackathon publications 
BioHackathon 
thematic series
Linked Open Data 
• Use URIs as names for things 
• Use HTTP URIs 
• so that people can look up those names 
• When someone looks up a URI 
• provide useful information 
• using the standards (RDF*, SPARQL) 
• Include links to other URIs 
• so that they can discover more things
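The four principles can be illustrated with a minimal Python sketch. It assumes a hypothetical in-memory stand-in for the Web (`FAKE_WEB`) and invented `example.org` URIs; none of these names come from the slides. Looking up a URI returns statements, and following the links in those statements leads to further resources.

```python
from collections import deque

# Hypothetical stand-in for the Web: each HTTP URI maps to the
# (predicate, object) statements returned when that URI is looked up.
FAKE_WEB = {
    "http://example.org/gene/geneA": [
        ("rdfs:label", "geneA"),
        ("rdfs:seeAlso", "http://example.org/protein/P1"),
    ],
    "http://example.org/protein/P1": [
        ("rdfs:label", "protein P1"),
        ("rdfs:seeAlso", "http://example.org/taxon/T1"),
    ],
    "http://example.org/taxon/T1": [
        ("rdfs:label", "organism T1"),
    ],
}


def look_up(uri):
    """Return the statements published at a URI (empty list if unknown)."""
    return FAKE_WEB.get(uri, [])


def discover(start_uri):
    """Follow links breadth-first; return URIs in the order they are found."""
    seen = [start_uri]
    queue = deque([start_uri])
    while queue:
        uri = queue.popleft()
        for _predicate, obj in look_up(uri):
            # Only objects that are HTTP URIs count as links to follow.
            if obj.startswith("http://") and obj not in seen:
                seen.append(obj)
                queue.append(obj)
    return seen


found = discover("http://example.org/gene/geneA")
print(found)
```

Starting from a single gene URI, the traversal reaches the protein and the taxon only because each description includes links to other URIs.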
http://togogenome.org 
TogoGenome: RDF-based genome DB 
• Genome annotation / Protein annotation / Biomedical ontologies / URIs 
Accumulate annotations in RDF 
Genome: Regulatory region / Protein coding gene / rRNA gene 
<exon> rdfs:subClassOf obo:SO_0000147 . 
<gene> rdfs:subClassOf obo:SO_0000704 ; ← Sequence ontology types 
  faldo:location [ ... ] ; ← FALDO locations 
  rdfs:label "geneA" ; ← Label of annotations 
  rdfs:seeAlso <UniProt> . ← Link to external resources
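As a rough illustration of accumulating annotations in RDF, the following Python sketch builds the type, label, and cross-reference triples for one gene and prints them as a Turtle statement. The helper names (`gene_annotation`, `to_turtle`) and the `example.org` URIs are assumptions made for this example, the `@prefix` declarations a real parser needs are omitted, and the elided `faldo:location` value is left out.

```python
def gene_annotation(gene, label, see_also):
    """Return (subject, predicate, object) triples for one gene annotation."""
    return [
        (gene, "rdfs:subClassOf", "obo:SO_0000704"),  # Sequence Ontology: gene
        (gene, "rdfs:label", f'"{label}"'),           # label (no escaping done)
        (gene, "rdfs:seeAlso", see_also),             # link to external resource
    ]


def to_turtle(triples):
    """Group triples by subject and emit one Turtle statement per subject."""
    grouped = {}
    for subject, predicate, obj in triples:
        grouped.setdefault(subject, []).append(f"{predicate} {obj}")
    statements = []
    for subject, pairs in grouped.items():
        statements.append(f"{subject} " + " ;\n    ".join(pairs) + " .")
    return "\n".join(statements)


# Hypothetical URIs, used only for illustration.
turtle = to_turtle(
    gene_annotation(
        "<http://example.org/gene/geneA>",
        "geneA",
        "<http://example.org/protein/P1>",
    )
)
print(turtle)
```

In practice an RDF library would handle escaping and prefixes; the sketch only shows how the annotation layers map onto predicates.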
Annotation w/ in-house developed ontologies 
Accumulate annotations in RDF 
+ 
In-house developed ontologies 
• MEO (environment) 
• MPO (phenotype) 
• GMO (growth medium) 
• MCCV (culture collection) 
• PDO (infectious disease) 
↓ 
Stored in triple store 
↓ 
SPARQL query 
↓ 
TogoGenome / TogoStanza 
Genome sequences 
• NCBI: BioProject/RefSeq -- existing reference seqs 
• DDBJ: Annotation pipeline/GTPS -- newly sequenced 
Annotations 
• UniProt: Protein functions and links 
• Formats: GFF3, GTF, GVF, DAS, BED, ... 
• Tools: Cufflinks, BLAST, InterPro, ... 
Ontologies 
• NCBO: BioPortal, OBO (GO, SO, ...) 
• DBCLS: FALDO, MEO, MPO, GMO, MCCV, PDO ... 
• DDBJ: INSDC, Taxonomy, ... 
• Titech: PDO, ... 
• GOLD: Environmental metadata 
Samples and metadata 
• INSDC, NCBI: SRA, GEO 
• DBCLS: RefEx, Kusarinoko 
Bulk data: Literature, Images, ...
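The "stored in triple store, then SPARQL query" step can be sketched in plain Python as matching one triple pattern against an in-memory list of triples. This is a toy stand-in and not how TogoGenome or any real triple store is implemented; the `ex:` terms and the `match` helper are hypothetical and are not identifiers from MEO or the other ontologies listed.

```python
# Hypothetical triples; the "ex:" terms are invented for this example.
STORE = [
    ("ex:genomeA", "ex:isolatedFrom", "ex:hot_spring"),
    ("ex:genomeB", "ex:isolatedFrom", "ex:soil"),
    ("ex:genomeC", "ex:isolatedFrom", "ex:hot_spring"),
]


def match(triples, pattern):
    """Yield variable bindings for one triple pattern.

    Terms starting with '?' are variables, as in SPARQL; any other term
    must equal the corresponding value in the triple.
    """
    for triple in triples:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable repeated in the pattern must bind consistently.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            yield binding


# Roughly: SELECT ?genome WHERE { ?genome ex:isolatedFrom ex:hot_spring . }
hits = [
    b["?genome"]
    for b in match(STORE, ("?genome", "ex:isolatedFrom", "ex:hot_spring"))
]
print(hits)
```

A real SPARQL engine joins many such patterns and uses indexes; the sketch covers only the single-pattern case.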
http://togogenome.org 
TogoGenome faceted search & modular reports
Genome: Regulatory region / Protein coding gene / rRNA gene 
<exon> rdfs:subClassOf obo:SO_0000147 . 
<gene> rdfs:subClassOf obo:SO_0000704 ; ← Sequence ontology types 
  faldo:location [ ... ] ; ← FALDO locations 
  rdfs:label "geneA" ; ← Label of annotations 
  rdfs:seeAlso <UniProt> . ← Link to external resources 
INSDC/RefSeq/Ensembl RDF: 
<gene> rdf:type insdc:Gene ; 
  so:so_part_of <chromosome> . 
<mRNA> rdf:type insdc:Messenger_RNA ; 
  sio:is-transcribed-from <gene> ; 
  sio:has-ordered-part <p1>, <p2>, ... . 
<p1> sio:has-value "1"^^xsd:integer ; 
  sio:refers-to <exon1> . 
<p2> sio:has-value "2"^^xsd:integer ; 
  sio:refers-to <exon2> . 
<exon1> rdf:type insdc:Exon ; 
  faldo:location <region1> . 
<region1> rdf:type faldo:Region ; 
  faldo:begin <position1> ; 
  faldo:end <position2> . 
<position1> rdf:type faldo:ExactPosition, faldo:ForwardStrandPosition ; 
  faldo:position 12345 ; 
  faldo:reference <chromosome> .
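The FALDO location pattern above (a region with a begin and an end position, each typed by exactness and strand and tied to a reference sequence) can be generated mechanically. The Python sketch below shows one way to do so; the `faldo_region` helper, the `ex:` identifiers, the node naming scheme, and the end coordinate 12400 are assumptions for illustration only.

```python
def faldo_region(feature, chromosome, begin, end, forward=True):
    """Return triples locating a feature on a chromosome, FALDO style."""
    strand = (
        "faldo:ForwardStrandPosition" if forward
        else "faldo:ReverseStrandPosition"
    )
    region = f"{feature}_region"  # hypothetical naming scheme for the nodes
    triples = [
        (feature, "faldo:location", region),
        (region, "rdf:type", "faldo:Region"),
    ]
    for role, coordinate in (("begin", begin), ("end", end)):
        position = f"{region}_{role}"
        triples += [
            (region, f"faldo:{role}", position),
            (position, "rdf:type", "faldo:ExactPosition"),
            (position, "rdf:type", strand),
            (position, "faldo:position", coordinate),
            (position, "faldo:reference", chromosome),
        ]
    return triples


# 12345 matches the position on the slide; 12400 is an invented end point.
exon_triples = faldo_region("ex:exon1", "ex:chromosome", 12345, 12400)
for triple in exon_triples:
    print(triple)
```

Because each position carries its own reference and strand, the same pattern can describe features on any sequence without a separate coordinate convention per database.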
RDF summit May 17-20, 2014 
Standardization of RDF models for genomics 
INSDC 
Standardization of INSDC nucleotide annotations in RDF 
Ontology for locations of annotations 
Common URIs to be shared 
Common RDF model for genomes 
Transcriptomes and regulations 
Personal/Japanese genomes
http://genomicsandhealth.org/ 
Global Alliance for Genomics and Health 
The greatest need was a common framework of international standards 
designed to enable and oversee how genomic and clinical data are 
shared in an effective, responsible, and interpretable manner. 
to develop this common framework, enabling learning from data 
while protecting participant autonomy and privacy. 
Over 180 organizations worldwide (2013-14) 
+ Google (since 2014/2/28) 
To enable secure sharing of genomic and clinical data
18,487 
March 11, 2011
http://www9.nhk.or.jp/311shogen/map/
Tohoku Medical Megabank Organization 
• As part of the reconstruction plans, the Tohoku Medical Megabank 
Organization was founded to rebuild the community medical 
system and to develop a biobank that combines medical and 
genomic information to support health and welfare in 
the Tohoku area.
Questions? 
SELECT ?question 
WHERE { 
  ?question :bb|^:b{2} ?question . 
} 
[Diagram: graph with nodes ?question and ?x joined by :bb and :b edges]
Introduction to BioHackathon 2014

Introduction to BioHackathon 2014

  • 1. NBDC / DBCLS presents BioHackathon 2014 Standardization and utilization of human genome information with Semantic Web technologies Toshiaki Katayama <ktym@dbcls.jp> http://jp.linkedin.com/in/toshiakikatayama Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS), Japan 2014/11/9 @ Tohoku Medical Megabank, Sendai, Japan
  • 3. Mission of NBDC/DBCLS • Biomedical domain • So many databases, so many publications • Integration of life science databases • To accelerate data driven science • Standardization and interoperability • Semantic Web and Linked Open Data • Software development • : • Right technology + collaborative community • BioHackathon = Bio + Hack + Marathon = effective innovation
  • 4. http://2014.biohackathon.org BioHackathon 2014 - the 7th NBDC/DBCLS BioHackathon • BioHackathon 2008 in Tokyo • Towards integrated Web service in life science with Open Bio* libraries • http://hackathon.dbcls.jp • BioHackathon 2009 in Okinawa • Integration of Web services in bioinformatics applications • http://hackathon2.dbcls.jp • BioHackathon 2010 in Tokyo • Integration and interpretation of biological knowledge with the Semantic Web technologies • http://hackathon3.dbcls.jp • BioHackathon 2011 in Kyoto • Creation and utilization of Linked Data in life sciences • http://2011.biohackathon.org • BioHackathon 2012 in Toyama • Biomecial applications based on the Semantic Web technologies • http://2012.biohackathon.org • BioHackathon 2013 in Tokyo • Semantic interoperability and standardization of bioinformatics data and Web services • http://2013.biohackathon.org
  • 6. Linked Open Data • Use URIs as names for things • Use HTTP URIs • so that people can look up those names • When someone looks up a URI • provide useful information • using the standards (RDF*, SPARQL) • Include links to other URIs • so that they can discover more things
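The Linked Open Data principles on slide 6 can be illustrated with a small Python sketch. This is not code from the talk: the URI `http://example.org/gene/geneA` and the helper name `rdf_request` are hypothetical, and the request is only constructed, never sent. It shows one common way a client can look up an HTTP URI and ask for standards-based data, namely by naming an RDF media type in the `Accept` header.

```python
from urllib.request import Request


def rdf_request(uri, media_type="text/turtle"):
    """Build (but do not send) an HTTP GET request that asks for RDF.

    Hypothetical helper for illustration: the server behind `uri` is
    assumed to support content negotiation and to return Turtle.
    """
    return Request(uri, headers={"Accept": media_type})


# Hypothetical URI used as the "name for a thing".
req = rdf_request("http://example.org/gene/geneA")

print(req.get_method(), req.full_url)
print("Accept:", req.get_header("Accept"))
```

Passing `req` to `urllib.request.urlopen` would perform the actual lookup; whether useful RDF comes back depends entirely on the server.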
  • 7. http://togogenome.org TogoGenome: RDF-based genome DB • Genome annotation / Protein annotation / Biomedical ontologies / URIs • Accumulate annotations in RDF • Genome features: Regulatory region, Protein coding gene, rRNA gene • <exon> rdfs:subClassOf obo:SO_0000147 . • <gene> rdfs:subClassOf obo:SO_0000704 ; (← Sequence ontology types) faldo:location [ ... ] ; (← FALDO locations) rdfs:label "geneA" ; (← Label of annotations) rdfs:seeAlso <UniProt> . (← Link to external resources)
  • 8. Annotation w/ in-house developed ontologies • Accumulate annotations in RDF + In-house developed ontologies: MEO (environment), MPO (phenotype), GMO (growth medium), MCCV (culture collection), PDO (infectious disease) ↓ Stored in triple store ↓ SPARQL query ↓ TogoGenome / TogoStanza • Genome sequences: NCBI: BioProject/RefSeq -- existing reference seqs; DDBJ: Annotation pipeline/GTPS -- newly sequenced • Annotations: UniProt: Protein functions and links; Formats: GFF3, GTF, GVF, DAS, BED, ..; Tools: Cufflinks, BLAST, InterProt, ... • Ontologies: NCBO: BioPortal, OBO (GO, SO, ...); DBCLS: FALDO, MEO, MPO, GMO, MCCV, PDO ...; DDBJ: INSDC, Taxonomy, ...; Titech: PDO, ...; GOLD: Environmental metadata • Samples and metadata: INSDC, NCBI: SRA, GEO; DBCLS: RefEx, Kusarinoko • Bulk data: Literature, Images, ...
  • 9. http://togogenome.org TogoGenome faceted search & modular reports
  • 10. INSDC/RefSeq/Ensembl RDF: • <gene> rdf:type insdc:Gene ; so:so_part_of <chromosome> . • <mRNA> rdf:type insdc:Messenger_RNA ; sio:is-transcribed-from <gene> ; sio:has-ordered-part <p1>, <p2>, ... . • <p1> sio:has-value "1"^^xsd:integer ; sio:refers-to <exon1> . • <p2> sio:has-value "2"^^xsd:integer ; sio:refers-to <exon2> . • <exon1> rdf:type insdc:Exon ; faldo:location <region1> . • <region1> rdf:type faldo:Region ; faldo:begin <position1> ; faldo:end <position2> . • <position1> rdf:type faldo:ExactPosition, faldo:ForwardStrandPosition ; faldo:position 12345 ; faldo:reference <chromosome> . • Overview as on slide 7: Genome features: Regulatory region, Protein coding gene, rRNA gene • <exon> rdfs:subClassOf obo:SO_0000147 . • <gene> rdfs:subClassOf obo:SO_0000704 ; (← Sequence ontology types) faldo:location [ ... ] ; (← FALDO locations) rdfs:label "geneA" ; (← Label of annotations) rdfs:seeAlso <UniProt> . (← Link to external resources)
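To make the FALDO location model on slide 10 concrete, here is a minimal Python sketch that walks from a feature to its coordinates. It is an illustration under stated assumptions, not the TogoGenome implementation: the triples are held as plain tuples rather than in a triple store, the helper names `objects` and `locate` are hypothetical, and the slide does not spell out `<position2>`, so its coordinate (12400) is a made-up value.

```python
# Triples transcribed from slide 10 as (subject, predicate, object) tuples.
# ASSUMPTION: <position2> is not defined on the slide; the value 12400 and
# its types are invented here purely so the example can run.
TRIPLES = [
    ("exon1", "rdf:type", "insdc:Exon"),
    ("exon1", "faldo:location", "region1"),
    ("region1", "rdf:type", "faldo:Region"),
    ("region1", "faldo:begin", "position1"),
    ("region1", "faldo:end", "position2"),
    ("position1", "rdf:type", "faldo:ExactPosition"),
    ("position1", "rdf:type", "faldo:ForwardStrandPosition"),
    ("position1", "faldo:position", 12345),
    ("position1", "faldo:reference", "chromosome"),
    ("position2", "rdf:type", "faldo:ExactPosition"),
    ("position2", "rdf:type", "faldo:ForwardStrandPosition"),
    ("position2", "faldo:position", 12400),
    ("position2", "faldo:reference", "chromosome"),
]


def objects(subject, predicate):
    """Return all objects of triples matching (subject, predicate, ?o)."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def locate(feature):
    """Follow feature -> region -> begin/end positions and collect coordinates."""
    region = objects(feature, "faldo:location")[0]
    begin = objects(region, "faldo:begin")[0]
    end = objects(region, "faldo:end")[0]

    # The strand is encoded as an rdf:type on the position node.
    types = objects(begin, "rdf:type")
    if "faldo:ForwardStrandPosition" in types:
        strand = "+"
    elif "faldo:ReverseStrandPosition" in types:
        strand = "-"
    else:
        strand = "."

    return {
        "reference": objects(begin, "faldo:reference")[0],
        "begin": objects(begin, "faldo:position")[0],
        "end": objects(end, "faldo:position")[0],
        "strand": strand,
    }


print(locate("exon1"))
```

The point of the indirection (feature, then region, then position) is that each position carries its own reference sequence and strand, which is what the slide's `<position1>` node shows.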
  • 11. RDF summit May 17-20, 2014 • Standardization of RDF models for genomics • INSDC • Standardization of INSDC nucleotide annotations in RDF • Ontology for locations of annotations • Common URIs to be shared • Common RDF model for genomes • Transcriptomes and regulations • Personal/Japanese genomes
  • 12. http://genomicsandhealth.org/ Global Alliance for Genomics and Health • To enable secure sharing of genomic and clinical data • The greatest need was a common framework of international standards designed to enable and oversee how genomic and clinical data are shared in an effective, responsible, and interpretable manner. • to develop this common framework, enabling learning from data while protecting participant autonomy and privacy. • Over 180 organizations worldwide (2013-14) + Google (since 2014/2/28)
  • 15. Tohoku Medical Megabank Organization • As one of the reconstruction plans, Tohoku Medical Megabank Organization was founded for rebuilding the community medical system by developing a biobank that combines medical and genome information for supporting health and welfare in the Tohoku area.
  • 16. Questions? SELECT ?question WHERE { ?question :bb|^:b{2} ?question . } [Diagram: nodes ?question and ?x; edges :bb, :b, :b]
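
The closing query on slide 16 uses a SPARQL property path: `:bb` is an alternative (`|`) to the inverse (`^`) of two consecutive `:b` steps, with the same variable at both ends. The Python sketch below evaluates that pattern over a tiny hand-made graph. It is only an illustration: the node names and triples are invented, the helper names `step` and `matches` are hypothetical, and the `{2}` repetition form appears not to be part of the final SPARQL 1.1 Recommendation (it was in working drafts, and some engines accept it), so it is expanded here to two explicit `:b` hops.

```python
# Hypothetical example graph as (subject, predicate, object) tuples.
TRIPLES = {
    ("q1", ":bb", "q1"),  # q1 loops to itself via :bb
    ("q2", ":b", "x"),    # q2 -> x -> q2: a loop of two :b hops
    ("x", ":b", "q2"),
    ("q3", ":b", "q4"),   # q3 -> q4 with no way back: no match
}


def step(node, predicate):
    """Nodes reachable from `node` by one `predicate` edge."""
    return {o for s, p, o in TRIPLES if s == node and p == predicate}


def matches(node):
    """True if `node :bb|^:b{2} node` holds, with {2} read as two :b hops.

    Because subject and object are the same node, the inverse two-hop
    path is equivalent to a forward two-hop loop back to `node`.
    """
    via_bb = node in step(node, ":bb")
    via_two_b = any(node in step(mid, ":b") for mid in step(node, ":b"))
    return via_bb or via_two_b


nodes = {s for s, _, _ in TRIPLES} | {o for _, _, o in TRIPLES}
questions = sorted(n for n in nodes if matches(n))
print(questions)  # ['q1', 'q2', 'x']
```

Note that `x` also matches, since the two-hop `:b` loop passes through it as well.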