SlideShare a Scribd company logo
1 of 46
Download to read offline
Bio2RDF cloud of
Virtuoso SPARQL endpoints


 Life Science
Raw Data Now


François Belleau, Marc-Alexandre Nolin,
    Peter Ansell, Michel Dumontier

          30th April 2009
W3C-HCLS F2F Meeting, Cambridge, MA
Agenda

    Why we did Bio2RDF ?
●



    How we did it ?
●



    What is know about hexokinase ?
●



    Where we are going ?
●
The problem

According to NAR 2009 Database
collection 1170 public databases
exists.

How can they be integrated to behave
like a global coherent resource ?
Public map of 1744 namespaces according to
  BioMoby, NAR, SRS, GO, NCBI, UniProt
Bio2RDF vision in 2007



 Johanne Luciano vision for
knowledge integration in 2005




 W3C vision of semantic web
          in 2006
Bio2RDF Mouse and Human Atlas map
      in 2008 65 millions triples
Bio2RDF actual contribution
                  to the Linked Data cloud




    Linked data cloud
         in 2007




                                                     Linked data cloud
                                                      in March 2009

http://linkeddata.org/
http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
Bio2RDF cloud map of
2,3 billions triples in 2009
Why do it ?
Not to replace HTML or XML by an other new
format, RDF and OWL, but to answer science
question by submiting SPARQL query over
the global knowledge base accessible through
the Internet to the Life Science SPARQL
endpoints cloud.
Solution


Bio2RDF approach to the data integration
problem in bioinformatics :
Apply the semantic web approach based
on RDF, OWL and SPARQL technologies.
How we did it ?
Bio2RDF architecture
Our design principles



http://www.w3.org/DesignIssues/LinkedData




                    http://bio2rdf.wiki.sourceforge.net/Banff%20Manifesto
YeastHub design in 2005

       Conversion of Dataset to RDF
   ●


       Use of Sesame Triplestore
   ●


       SeRQL query interface
   ●




http://www.ncbi.nlm.nih.gov/pubmed/15961502
Bio2RDF at ISMB 2005
                 the begining



Thanks to Kei Cheung,
Johanne Luciano, Eric
Neumann and
Christopher Baker they
draw the lines.
Bio2RDF realtime rdfiser in 2007
Actual Architecture




              Offline rdfising process
            ●

            ● Virtuoso SPARQL endpoints

               network
            ● Namespace resolution

              through DNS subdomain
Main REST services
    Describe a ressource by a dereferencable URI
●


      http://bio2rdf.org/ns:id
    ●


    Global services over federated endpoints
●


        http://bio2rdf.org/links/ns:id
    ●


        http://bio2rdf.org/search/searchedTerm
    ●


    Targeted services to a specific endpoint
●



        http://bio2rdf.org/linksns/ns2/ns1:id
    ●


        http://bio2rdf.org/searchns/ns/searchedTerm
    ●


    other services are available.
●
Describe service implementation
    http://bio2rdf.org/ns:id
●



    Corresponding SPARQL query :
●


        CONSTRUCT {
    ●

          ?s ?p ?o .
        }
        WHERE {
          ?s ?p ?o .
          FILTER(?s = <http://bio2rdf.org/ns:id>).
        }
    Submited at this URL
●


        http://ns.bio2rdf.org/sparql?query=...
    ●


            Based of DNS subdomain resolution service
        –
Bio2RDF JSP server software
http://sourceforge.net/projects/bio2rdf/
Peter Ansell is writing the Bio2RDF
            JSP server
    The software transform Bio2RDF URIs to SPARQL
●

    queries in real time.
    Its aim is to access normalised RDF information
●

    located in multiple endpoints using the concept of
    Public Namespaces and Private Record Identifiers and
    distributed SPARQL queries which are matched to the
    content in each endpoint.
    Each of the following databases have normalisation
●

    rules which normalise them back to bio2rdf.org
    URI's :Dbpedia, Drugbank, LinkedCT, HCLS
    KB/Neurocommons, Diseasome, Dailymed, Bioguid
    DOI
Bio2RDF.war package future
    Provide more pipes to perform integrated actions without
●

    having to put HTTP SPARQL requests into a workflow
    system when a URI resolution can perform the query in a
    distributed and normalised manner more efficiently
    Bring together the current distributed efforts to provide a
●

    complete HTML redirection registry so that a large
    percentage of Bio2RDF namespaces can be redirected
    with http://bio2rdf.org/html/namespace:identifier
    Form ontologies describing the query type, provider, rdf
●

    normalisation rule, namespace paradigm
    Integrate http://rdf.myexperiment.org/sparql and similar
●

    workflow RDF endpoints so that scientific workflows can
    be linked to their data cleanly
Bio2RDF.owl




http://quebec.bio2rdf.org/download/bio2rdf-2008.owl
Michel Dumontier will design
Bio2RDF.owl ontology next version
What is known about hexokinase ?
Submit your query...
    To the web search engine
●


    To existing public web site offering data
●

    integration services;
    Using Bio2RDF SPARQL endpoints
●


        Submitting a SPARQL query;
    ●


        Using facet browser interface from Virtuoso 6.0
    ●

        server;
        Dereferencing Bio2RDF search URI;
    ●


        Using a Taverna workflow composed of SPARQL
    ●

        queries to obtain federated results from KEGG,
        Entrez Gene and GO;
The usual unsemantic way
Existing integrated search services


                     EBI/EB-eye
 NCBI/Entrez




KEGG/DBGET           GoPubmed
By submitting a SPARQL query
   http://atlas.bio2rdf.org/sparql
What is know about « hexokinase »
                with semantic ?
select ?t1 ?p2 count(*)
where {
    ?s1 ?p1 ?o1 .
    FILTER( bif:contains(?o1, quot;hexokinasequot;)) .
    ?s1 a ?t1 .
    ?s1 ?p2 ?o2 .
}
ORDER BY ?t1 ?p2
Use Virtuoso 6.0 facet browser
    http://lod.openlinksw.com/
Dereferencing search URL
http://bio2rdf.org/search/hexokinase
How can we submit a complex
query over the network of SPARQL
            endpoints ?
By building a mashup with Taverna
1) Write your complex SPARQL query as if a
  global graph would be available
2) Identify the needed namespaces and split the
  query to fetch each data source separetly
3) Build a mashup using a Taverna workflow that
  instanciate a local triplestore
4) Execute your complex query locally on the
  mashup
The SPARQL query needed
 (dont try this home, do it on the web !)
Get the list of genes
    from KEGG pathways of a specified taxon
    Clear graph
●



    Get KEGG pathways list for a
●

    specific taxon
    For each pathway get genes
●

    list and import instances
    Count the number of genes
●

    found




                                   http://www.myexperiment.org/workflows/747
Insert into local triplestore
       GeneID genes and KEGG pathways
    Get the list of genes
●



    Get the list of pathways
●



    Insert into local triplestore
●

    each corresponding graph




                                    http://www.myexperiment.org/workflows/748
Insert into local triplestore
             the needed GO annotations
    Get the GO annotations for
●

    each gene
Finally, the neeeded query merging
KEGG, Entrez Gene and GO together
Bio2RDF resources
Bio2RDF's mirrors
http://quebec.bio2rdf.org/
  http://qut.bio2rdf.org/
Bio2RDF SPARQL endpoints
http://www.freebase.com/view/user/bio2rdf/public/sparql
Life Science Raw Data Now
http://quebec.bio2rdf.org/download
Visit our Wiki rdfiser cookbook
http://bio2rdf.wiki.sourceforge.net/
Bio2RDF news




 http://bio2rdf.blogspot.com/
                                http://www.slideshare.net/search/slideshow?q=bio2rdf




                                        http://scholar.google.com/scholar?q=bio2rdf
http://groups.google.ca/group/bio2rdf
Our 2009 objectives
    Get approval from data provider to distribute
●

    RDF dump and publish SPARQL endpoints
    (UniProt, BioCyc, Pathway Commons, Bind are
    in);
    Start using Virtuoso 6 cluster;
●


    Design more services accessible with REST
●

    protocol via our JSP package;
    Recruit mirror server;
●


    Develop new rdfiser program in a community
●

    effort;
Thanks
Jean Morissette, Nicole Tourigny

    The Bio2RDF community
●


    Centre de recherche du CHUL
●


    Université Laval
●


    Dumontier Lab
●


    QUT eResearch Center
●


    Openlink Virtuoso
●

More Related Content

What's hot

Jsf 110530152515-phpapp01
Jsf 110530152515-phpapp01Jsf 110530152515-phpapp01
Jsf 110530152515-phpapp01Eric Bourdet
 
Ch02 撰寫與設定 Servlet
Ch02 撰寫與設定 ServletCh02 撰寫與設定 Servlet
Ch02 撰寫與設定 ServletJustin Lin
 
Orion Context Broker 1.15.0
Orion Context Broker 1.15.0Orion Context Broker 1.15.0
Orion Context Broker 1.15.0Fermin Galan
 
Ch04 會話管理
Ch04 會話管理Ch04 會話管理
Ch04 會話管理Justin Lin
 
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js + Expres...
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js +  Expres...Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js +  Expres...
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js + Expres...Edureka!
 
Defending against Java Deserialization Vulnerabilities
 Defending against Java Deserialization Vulnerabilities Defending against Java Deserialization Vulnerabilities
Defending against Java Deserialization VulnerabilitiesLuca Carettoni
 
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.NAVER D2
 
Use Node.js to create a REST API
Use Node.js to create a REST APIUse Node.js to create a REST API
Use Node.js to create a REST APIFabien Vauchelles
 
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안Opennaru, inc.
 
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式Justin Lin
 
Spring security oauth2
Spring security oauth2Spring security oauth2
Spring security oauth2axykim00
 
خدمات الويب (Web Services) و كيف تنشئها
 خدمات الويب (Web Services) و كيف تنشئها  خدمات الويب (Web Services) و كيف تنشئها
خدمات الويب (Web Services) و كيف تنشئها lunarhalo
 
NGINX Back to Basics Part 3: Security (Japanese Version)
NGINX Back to Basics Part 3: Security (Japanese Version)NGINX Back to Basics Part 3: Security (Japanese Version)
NGINX Back to Basics Part 3: Security (Japanese Version)NGINX, Inc.
 
Puppeteer can automate that! - Frontmania
Puppeteer can automate that! - FrontmaniaPuppeteer can automate that! - Frontmania
Puppeteer can automate that! - FrontmaniaÖnder Ceylan
 

What's hot (20)

Jsf 110530152515-phpapp01
Jsf 110530152515-phpapp01Jsf 110530152515-phpapp01
Jsf 110530152515-phpapp01
 
Ch02 撰寫與設定 Servlet
Ch02 撰寫與設定 ServletCh02 撰寫與設定 Servlet
Ch02 撰寫與設定 Servlet
 
Orion Context Broker 1.15.0
Orion Context Broker 1.15.0Orion Context Broker 1.15.0
Orion Context Broker 1.15.0
 
Nextjs13.pptx
Nextjs13.pptxNextjs13.pptx
Nextjs13.pptx
 
Ch04 會話管理
Ch04 會話管理Ch04 會話管理
Ch04 會話管理
 
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js + Expres...
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js +  Expres...Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js +  Expres...
Node.js Express Tutorial | Node.js Tutorial For Beginners | Node.js + Expres...
 
React Server Side Rendering with Next.js
React Server Side Rendering with Next.jsReact Server Side Rendering with Next.js
React Server Side Rendering with Next.js
 
NodeJS for Beginner
NodeJS for BeginnerNodeJS for Beginner
NodeJS for Beginner
 
Defending against Java Deserialization Vulnerabilities
 Defending against Java Deserialization Vulnerabilities Defending against Java Deserialization Vulnerabilities
Defending against Java Deserialization Vulnerabilities
 
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.
[135] 오픈소스 데이터베이스, 은행 서비스에 첫발을 내밀다.
 
Use Node.js to create a REST API
Use Node.js to create a REST APIUse Node.js to create a REST API
Use Node.js to create a REST API
 
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안
05. 마이크로서비스 아키텍처 환경에서의 SSO 구축방안
 
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式
Servlet & JSP 教學手冊第二版 - 第 1 章:簡介Web應用程式
 
Doküman Yönetim Sistemi / M-Files
Doküman Yönetim Sistemi / M-FilesDoküman Yönetim Sistemi / M-Files
Doküman Yönetim Sistemi / M-Files
 
Spring security oauth2
Spring security oauth2Spring security oauth2
Spring security oauth2
 
خدمات الويب (Web Services) و كيف تنشئها
 خدمات الويب (Web Services) و كيف تنشئها  خدمات الويب (Web Services) و كيف تنشئها
خدمات الويب (Web Services) و كيف تنشئها
 
Nodejs presentation
Nodejs presentationNodejs presentation
Nodejs presentation
 
NGINX Back to Basics Part 3: Security (Japanese Version)
NGINX Back to Basics Part 3: Security (Japanese Version)NGINX Back to Basics Part 3: Security (Japanese Version)
NGINX Back to Basics Part 3: Security (Japanese Version)
 
NEXT.JS
NEXT.JSNEXT.JS
NEXT.JS
 
Puppeteer can automate that! - Frontmania
Puppeteer can automate that! - FrontmaniaPuppeteer can automate that! - Frontmania
Puppeteer can automate that! - Frontmania
 

Viewers also liked

Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge SystemBio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge SystemFrançois Belleau
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisCOST action BM1006
 
Linux for bioinformatics
Linux for bioinformaticsLinux for bioinformatics
Linux for bioinformaticscursoNGS
 
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en Dénia
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en DéniaXVIII FIRA DEL JOGUET- Feria del Juguete antiguo en Dénia
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en DéniaJuan Fernández Moro
 
Aquality Forum 2016 - Lateral Thinking
Aquality Forum 2016 - Lateral ThinkingAquality Forum 2016 - Lateral Thinking
Aquality Forum 2016 - Lateral ThinkingMarco Pellegrini
 
Rekrutterings dag - MHI Vestas Offshore Wind
Rekrutterings dag - MHI Vestas Offshore WindRekrutterings dag - MHI Vestas Offshore Wind
Rekrutterings dag - MHI Vestas Offshore WindThomas Grænge
 
España: Impacto y recuperación del clima laboral tras una reestructuración
España: Impacto y recuperación del clima laboral tras una reestructuración  España: Impacto y recuperación del clima laboral tras una reestructuración
España: Impacto y recuperación del clima laboral tras una reestructuración LLYC
 
Análisis del libro didáctico
Análisis del libro didácticoAnálisis del libro didáctico
Análisis del libro didácticoRafaela Sá
 
Internship Development Portfolio-DCCG
Internship Development Portfolio-DCCGInternship Development Portfolio-DCCG
Internship Development Portfolio-DCCGGladys Sanchez
 
Ejemplo de un encerado diagnóstico en un pfu
Ejemplo de un encerado diagnóstico en un pfuEjemplo de un encerado diagnóstico en un pfu
Ejemplo de un encerado diagnóstico en un pfuSimone Vasquez
 
Top 8 digital marketing conference of 2016
Top 8 digital marketing conference of 2016Top 8 digital marketing conference of 2016
Top 8 digital marketing conference of 2016SZI Technologies
 
Tina's_Professional_Portfolio
Tina's_Professional_PortfolioTina's_Professional_Portfolio
Tina's_Professional_PortfolioTina Hamilton
 
Wip 43560 project-king william 2016
Wip 43560 project-king william 2016Wip 43560 project-king william 2016
Wip 43560 project-king william 2016Mark Klingman
 
Acreditacion de la Educacion Superior
Acreditacion de la Educacion SuperiorAcreditacion de la Educacion Superior
Acreditacion de la Educacion SuperiorMilton Guillin
 
Informatica juridica principios rectores. juan melean
Informatica juridica principios rectores. juan meleanInformatica juridica principios rectores. juan melean
Informatica juridica principios rectores. juan meleanjm11540042
 
Semantic Web from the 2013 Perspective
Semantic Web from the 2013 PerspectiveSemantic Web from the 2013 Perspective
Semantic Web from the 2013 PerspectiveAdrian Paschke
 
Estudio comparativo sobre la adaptación de cofias metálicas
Estudio comparativo sobre la adaptación de cofias metálicasEstudio comparativo sobre la adaptación de cofias metálicas
Estudio comparativo sobre la adaptación de cofias metálicasALVAROUAC
 

Viewers also liked (20)

Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge SystemBio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
Bio2RDF: Towards A Mashup To Build Bioinformatics Knowledge System
 
Knowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysisKnowledge management for integrative omics data analysis
Knowledge management for integrative omics data analysis
 
Linux for bioinformatics
Linux for bioinformaticsLinux for bioinformatics
Linux for bioinformatics
 
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en Dénia
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en DéniaXVIII FIRA DEL JOGUET- Feria del Juguete antiguo en Dénia
XVIII FIRA DEL JOGUET- Feria del Juguete antiguo en Dénia
 
Radio on the Move Trend Update
Radio on the Move Trend UpdateRadio on the Move Trend Update
Radio on the Move Trend Update
 
Aquality Forum 2016 - Lateral Thinking
Aquality Forum 2016 - Lateral ThinkingAquality Forum 2016 - Lateral Thinking
Aquality Forum 2016 - Lateral Thinking
 
Rekrutterings dag - MHI Vestas Offshore Wind
Rekrutterings dag - MHI Vestas Offshore WindRekrutterings dag - MHI Vestas Offshore Wind
Rekrutterings dag - MHI Vestas Offshore Wind
 
España: Impacto y recuperación del clima laboral tras una reestructuración
España: Impacto y recuperación del clima laboral tras una reestructuración  España: Impacto y recuperación del clima laboral tras una reestructuración
España: Impacto y recuperación del clima laboral tras una reestructuración
 
Vacuna contra el papilomavirus humano
Vacuna contra el papilomavirus humanoVacuna contra el papilomavirus humano
Vacuna contra el papilomavirus humano
 
Análisis del libro didáctico
Análisis del libro didácticoAnálisis del libro didáctico
Análisis del libro didáctico
 
Ok t area envio iii
Ok t area envio iiiOk t area envio iii
Ok t area envio iii
 
Internship Development Portfolio-DCCG
Internship Development Portfolio-DCCGInternship Development Portfolio-DCCG
Internship Development Portfolio-DCCG
 
Ejemplo de un encerado diagnóstico en un pfu
Ejemplo de un encerado diagnóstico en un pfuEjemplo de un encerado diagnóstico en un pfu
Ejemplo de un encerado diagnóstico en un pfu
 
Top 8 digital marketing conference of 2016
Top 8 digital marketing conference of 2016Top 8 digital marketing conference of 2016
Top 8 digital marketing conference of 2016
 
Tina's_Professional_Portfolio
Tina's_Professional_PortfolioTina's_Professional_Portfolio
Tina's_Professional_Portfolio
 
Wip 43560 project-king william 2016
Wip 43560 project-king william 2016Wip 43560 project-king william 2016
Wip 43560 project-king william 2016
 
Acreditacion de la Educacion Superior
Acreditacion de la Educacion SuperiorAcreditacion de la Educacion Superior
Acreditacion de la Educacion Superior
 
Informatica juridica principios rectores. juan melean
Informatica juridica principios rectores. juan meleanInformatica juridica principios rectores. juan melean
Informatica juridica principios rectores. juan melean
 
Semantic Web from the 2013 Perspective
Semantic Web from the 2013 PerspectiveSemantic Web from the 2013 Perspective
Semantic Web from the 2013 Perspective
 
Estudio comparativo sobre la adaptación de cofias metálicas
Estudio comparativo sobre la adaptación de cofias metálicasEstudio comparativo sobre la adaptación de cofias metálicas
Estudio comparativo sobre la adaptación de cofias metálicas
 

Similar to Bio2RDF cloud of Virtuoso SPARQL endpoints

Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013François Belleau
 
Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012François Belleau
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelabCAMELIA BOBAN
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionArabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionJasonRafeMiller
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And VisualizationIvan Ermilov
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Informationballoon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference InformationKai Schlegel
 
Introduction to BioHackathon 2014
Introduction to BioHackathon 2014Introduction to BioHackathon 2014
Introduction to BioHackathon 2014Toshiaki Katayama
 
Presentation at the EMBL-EBI Industry RDF meeting
Presentation at the EMBL-EBI  Industry RDF meetingPresentation at the EMBL-EBI  Industry RDF meeting
Presentation at the EMBL-EBI Industry RDF meetingJohannes Keizer
 
BioPAX Models and Pathways
BioPAX Models and PathwaysBioPAX Models and Pathways
BioPAX Models and PathwaysMichel Dumontier
 
Publishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPublishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPeterWinstanley1
 
W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2nolmar01
 

Similar to Bio2RDF cloud of Virtuoso SPARQL endpoints (20)

Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013
 
Bio2RDF@BH2010
Bio2RDF@BH2010Bio2RDF@BH2010
Bio2RDF@BH2010
 
Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012
 
2013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r22013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r2
 
GDG Meets U event - Big data & Wikidata - no lies codelab
GDG Meets U event - Big data & Wikidata -  no lies codelabGDG Meets U event - Big data & Wikidata -  no lies codelab
GDG Meets U event - Big data & Wikidata - no lies codelab
 
SADI CSHALS 2013
SADI CSHALS 2013SADI CSHALS 2013
SADI CSHALS 2013
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionArabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Informationballoon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
 
Introduction to BioHackathon 2014
Introduction to BioHackathon 2014Introduction to BioHackathon 2014
Introduction to BioHackathon 2014
 
Publishing Linked Data from RDB
Publishing Linked Data from RDBPublishing Linked Data from RDB
Publishing Linked Data from RDB
 
Bio2RDF should we do it
Bio2RDF should we do itBio2RDF should we do it
Bio2RDF should we do it
 
AGROVOC, AGRIS and the CIARD RING, using RDF vocabularies and technologies f...
AGROVOC, AGRIS and the CIARD RING,  using RDF vocabularies and technologies f...AGROVOC, AGRIS and the CIARD RING,  using RDF vocabularies and technologies f...
AGROVOC, AGRIS and the CIARD RING, using RDF vocabularies and technologies f...
 
Presentation at the EMBL-EBI Industry RDF meeting
Presentation at the EMBL-EBI  Industry RDF meetingPresentation at the EMBL-EBI  Industry RDF meeting
Presentation at the EMBL-EBI Industry RDF meeting
 
BioPAX Models and Pathways
BioPAX Models and PathwaysBioPAX Models and Pathways
BioPAX Models and Pathways
 
2016-07-06-openphacts-docker
2016-07-06-openphacts-docker2016-07-06-openphacts-docker
2016-07-06-openphacts-docker
 
Publishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDFPublishing "5 star" data: the case for RDF
Publishing "5 star" data: the case for RDF
 
W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2W4 4 marc-alexandre-nolin-v2
W4 4 marc-alexandre-nolin-v2
 

More from François Belleau

Pitch Reactome2json_ld @ swat4hcls 2020
Pitch Reactome2json_ld @ swat4hcls 2020Pitch Reactome2json_ld @ swat4hcls 2020
Pitch Reactome2json_ld @ swat4hcls 2020François Belleau
 
Pitch Qliic coopérathon 2017
Pitch Qliic coopérathon 2017Pitch Qliic coopérathon 2017
Pitch Qliic coopérathon 2017François Belleau
 
2015-11-17 Présentation SEAO et ES
2015-11-17 Présentation SEAO et ES2015-11-17 Présentation SEAO et ES
2015-11-17 Présentation SEAO et ESFrançois Belleau
 
BD2K hackathon - Bio2RDF submission
BD2K hackathon - Bio2RDF submissionBD2K hackathon - Bio2RDF submission
BD2K hackathon - Bio2RDF submissionFrançois Belleau
 
Découvrir le web sémantique en 15 minutes (Decideo 2014)
Découvrir le web sémantique en 15 minutes (Decideo 2014)Découvrir le web sémantique en 15 minutes (Decideo 2014)
Découvrir le web sémantique en 15 minutes (Decideo 2014)François Belleau
 
Bio2RDF poster for Biocurator 2014 conference
Bio2RDF poster for Biocurator 2014 conferenceBio2RDF poster for Biocurator 2014 conference
Bio2RDF poster for Biocurator 2014 conferenceFrançois Belleau
 
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDF
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDFAcfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDF
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDFFrançois Belleau
 
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...François Belleau
 
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and Mouse
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and MouseBio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and Mouse
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and MouseFrançois Belleau
 

More from François Belleau (15)

Bio2RDF @ DILS 2008
Bio2RDF @ DILS 2008Bio2RDF @ DILS 2008
Bio2RDF @ DILS 2008
 
Pitch Reactome2json_ld @ swat4hcls 2020
Pitch Reactome2json_ld @ swat4hcls 2020Pitch Reactome2json_ld @ swat4hcls 2020
Pitch Reactome2json_ld @ swat4hcls 2020
 
Show de boucane pour ELK
Show de boucane pour ELKShow de boucane pour ELK
Show de boucane pour ELK
 
Pitch Qliic coopérathon 2017
Pitch Qliic coopérathon 2017Pitch Qliic coopérathon 2017
Pitch Qliic coopérathon 2017
 
2015-11-17 Présentation SEAO et ES
2015-11-17 Présentation SEAO et ES2015-11-17 Présentation SEAO et ES
2015-11-17 Présentation SEAO et ES
 
Linuq 20160130
Linuq 20160130Linuq 20160130
Linuq 20160130
 
textOdossier
textOdossiertextOdossier
textOdossier
 
BD2K hackathon - Bio2RDF submission
BD2K hackathon - Bio2RDF submissionBD2K hackathon - Bio2RDF submission
BD2K hackathon - Bio2RDF submission
 
Découvrir le web sémantique en 15 minutes (Decideo 2014)
Découvrir le web sémantique en 15 minutes (Decideo 2014)Découvrir le web sémantique en 15 minutes (Decideo 2014)
Découvrir le web sémantique en 15 minutes (Decideo 2014)
 
Bio2RDF poster for Biocurator 2014 conference
Bio2RDF poster for Biocurator 2014 conferenceBio2RDF poster for Biocurator 2014 conference
Bio2RDF poster for Biocurator 2014 conference
 
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDF
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDFAcfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDF
Acfas 2013 - Comment publier sur le web sémantique : la méthode de Bio2RDF
 
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
Producing, Publishing and Consuming Linked Data Three lessons from the Bio2RD...
 
Bio2RDF-ISMB2008
Bio2RDF-ISMB2008Bio2RDF-ISMB2008
Bio2RDF-ISMB2008
 
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and Mouse
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and MouseBio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and Mouse
Bio2RDF : A Semantic Web Atlas of post genomic knowledge about Human and Mouse
 
Bio2RDF/Virtuoso
Bio2RDF/VirtuosoBio2RDF/Virtuoso
Bio2RDF/Virtuoso
 

Recently uploaded

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 

Recently uploaded (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 

Bio2RDF cloud of Virtuoso SPARQL endpoints

  • 1. Bio2RDF cloud of Virtuoso SPARQL endpoints Life Science Raw Data Now François Belleau, Marc-Alexandre Nolin, Peter Ansell, Michel Dumontier 30th April 2009 W3C-HCLS F2F Meeting, Cambridge, MA
  • 2. Agenda Why we did Bio2RDF ? ● How we did it ? ● What is know about hexokinase ? ● Where we are going ? ●
  • 3. The problem According to NAR 2009 Database collection 1170 public databases exists. How can they be integrated to behave like a global coherent resource ?
  • 4. Public map of 1744 namespaces according to BioMoby, NAR, SRS, GO, NCBI, UniProt
  • 5. Bio2RDF vision in 2007 Johanne Luciano vision for knowledge integration in 2005 W3C vision of semantic web in 2006
  • 6. Bio2RDF Mouse and Human Atlas map in 2008 65 millions triples
  • 7. Bio2RDF actual contribution to the Linked Data cloud Linked data cloud in 2007 Linked data cloud in March 2009 http://linkeddata.org/ http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics
  • 8. Bio2RDF cloud map of 2,3 billions triples in 2009
  • 9. Why do it ? Not to replace HTML or XML by an other new format, RDF and OWL, but to answer science question by submiting SPARQL query over the global knowledge base accessible through the Internet to the Life Science SPARQL endpoints cloud.
  • 10. Solution Bio2RDF approach to the data integration problem in bioinformatics : Apply the semantic web approach based on RDF, OWL and SPARQL technologies.
  • 11. How we did it ? Bio2RDF architecture
  • 12. Our design principles http://www.w3.org/DesignIssues/LinkedData http://bio2rdf.wiki.sourceforge.net/Banff%20Manifesto
  • 13. YeastHub design in 2005 Conversion of Dataset to RDF ● Use of Sesame Triplestore ● SeRQL query interface ● http://www.ncbi.nlm.nih.gov/pubmed/15961502
  • 14. Bio2RDF at ISMB 2005 the begining Thanks to Kei Cheung, Johanne Luciano, Eric Neumann and Christopher Baker they draw the lines.
  • 16. Actual Architecture Offline rdfising process ● ● Virtuoso SPARQL endpoints network ● Namespace resolution through DNS subdomain
  • 17. Main REST services Describe a ressource by a dereferencable URI ● http://bio2rdf.org/ns:id ● Global services over federated endpoints ● http://bio2rdf.org/links/ns:id ● http://bio2rdf.org/search/searchedTerm ● Targeted services to a specific endpoint ● http://bio2rdf.org/linksns/ns2/ns1:id ● http://bio2rdf.org/searchns/ns/searchedTerm ● other services are available. ●
  • 18. Describe service implementation http://bio2rdf.org/ns:id ● Corresponding SPARQL query : ● CONSTRUCT { ● ?s ?p ?o . } WHERE { ?s ?p ?o . FILTER(?s = <http://bio2rdf.org/ns:id>). } Submited at this URL ● http://ns.bio2rdf.org/sparql?query=... ● Based of DNS subdomain resolution service –
  • 19. Bio2RDF JSP server software http://sourceforge.net/projects/bio2rdf/
  • 20. Peter Ansell is writing the Bio2RDF JSP server The software transform Bio2RDF URIs to SPARQL ● queries in real time. Its aim is to access normalised RDF information ● located in multiple endpoints using the concept of Public Namespaces and Private Record Identifiers and distributed SPARQL queries which are matched to the content in each endpoint. Each of the following databases have normalisation ● rules which normalise them back to bio2rdf.org URI's :Dbpedia, Drugbank, LinkedCT, HCLS KB/Neurocommons, Diseasome, Dailymed, Bioguid DOI
  • 21. Bio2RDF.war package future Provide more pipes to perform integrated actions without ● having to put HTTP SPARQL requests into a workflow system when a URI resolution can perform the query in a distributed and normalised manner more efficiently Bring together the current distributed efforts to provide a ● complete HTML redirection registry so that a large percentage of Bio2RDF namespaces can be redirected with http://bio2rdf.org/html/namespace:identifier Form ontologies describing the query type, provider, rdf ● normalisation rule, namespace paradigm Integrate http://rdf.myexperiment.org/sparql and similar ● workflow RDF endpoints so that scientific workflows can be linked to their data cleanly
  • 23. Michel Dumontier will design Bio2RDF.owl ontology next version
  • 24. What is known about hexokinase ?
  • 25. Submit your query... To the web search engine ● To existing public web site offering data ● integration services; Using Bio2RDF SPARQL endpoints ● Submitting a SPARQL query; ● Using facet browser interface from Virtuoso 6.0 ● server; Dereferencing Bio2RDF search URI; ● Using a Taverna workflow composed of SPARQL ● queries to obtain federated results from KEGG, Entrez Gene and GO;
  • 27. Existing integrated search services EBI/EB-eye NCBI/Entrez KEGG/DBGET GoPubmed
  • 28. By submitting a SPARQL query http://atlas.bio2rdf.org/sparql
  • 29. What is know about « hexokinase » with semantic ? select ?t1 ?p2 count(*) where { ?s1 ?p1 ?o1 . FILTER( bif:contains(?o1, quot;hexokinasequot;)) . ?s1 a ?t1 . ?s1 ?p2 ?o2 . } ORDER BY ?t1 ?p2
  • 30. Use Virtuoso 6.0 facet browser http://lod.openlinksw.com/
  • 32. How can we submit a complex query over the network of SPARQL endpoints ?
  • 33. By building a mashup with Taverna 1) Write your complex SPARQL query as if a global graph would be available 2) Identify the needed namespaces and split the query to fetch each data source separetly 3) Build a mashup using a Taverna workflow that instanciate a local triplestore 4) Execute your complex query locally on the mashup
  • 34. The SPARQL query needed (dont try this home, do it on the web !)
  • 35. Get the list of genes from KEGG pathways of a specified taxon Clear graph ● Get KEGG pathways list for a ● specific taxon For each pathway get genes ● list and import instances Count the number of genes ● found http://www.myexperiment.org/workflows/747
  • 36. Insert into local triplestore GeneID genes and KEGG pathways Get the list of genes ● Get the list of pathways ● Insert into local triplestore ● each corresponding graph http://www.myexperiment.org/workflows/748
  • 37. Insert into local triplestore the needed GO annotations Get the GO annotations for ● each gene
  • 38. Finally, the neeeded query merging KEGG, Entrez Gene and GO together
  • 42. Life Science Raw Data Now http://quebec.bio2rdf.org/download
  • 43. Visit our Wiki rdfiser cookbook http://bio2rdf.wiki.sourceforge.net/
  • 44. Bio2RDF news http://bio2rdf.blogspot.com/ http://www.slideshare.net/search/slideshow?q=bio2rdf http://scholar.google.com/scholar?q=bio2rdf http://groups.google.ca/group/bio2rdf
  • 45. Our 2009 objectives Get approval from data provider to distribute ● RDF dump and publish SPARQL endpoints (UniProt, BioCyc, Pathway Commons, Bind are in); Start using Virtuoso 6 cluster; ● Design more services accessible with REST ● protocol via our JSP package; Recruit mirror server; ● Develop new rdfiser program in a community ● effort;
  • 46. Thanks Jean Morissette, Nicole Tourigny The Bio2RDF community ● Centre de recherche du CHUL ● Université Laval ● Dumontier Lab ● QUT eResearch Center ● Openlink Virtuoso ●