SlideShare a Scribd company logo
Bio2RDF: Convert, Provide And
          Reuse.

     Marc-Alexandre Nolin1, Jacques Corbeil1,
      Luc Lamontagne1, Michel Dumontier1,2

              1
                Laval University, Canada
             2
               Carleton University, Canada

                  manolin@gmail.com
Presentation Plan
        • Bio2RDF
             – Methodology
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work



2010-10-13             Bio2RDF: Convert, Provide And Reuse.   2
Bio2RDF
        • Bio2RDF uses open-source Semantic Web
          technologies to provide interlinked life science
          data to support biological knowledge discovery.
             •   Over 40 databases converted
             •   Over 30 billion triples
             •   Global mirroring
             –   Quebec City, Quebec, Canada
             –   Ottawa, Ontario, Canada
             –   Guelph, Ontario, Canada
             –   Brisbane, Australia
        – part of LOD mashup at http://lod.openlinksw.com

2010-10-13                 Bio2RDF: Convert, Provide And Reuse.   3
2010 Linked Open Data Cloud




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”



2010-10-13                              Bio2RDF: Convert, Provide And Reuse.                                   4
Bio2RDF is the major contributor
                 to the Life Sciences LOD




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

2010-10-13                              Bio2RDF: Convert, Provide And Reuse.                                   5
What is Linked Open Data?
        • Linked open data is
             •   data that is free to use
             •   machine understandable (uses RDF/OWL)
             •   can be looked up using web protocols
             •   has meaningful relations between data items
                 (generated from supplied cross-references, or
                 text-based mappings)




2010-10-13                 Bio2RDF: Convert, Provide And Reuse.   6
LOD Methodology
        Applying Tim Berners-Lee 4 rules
             1. Use URIs as names for things
             2. Use HTTP URIs so that people can look up those
                names
             3. When someone looks up a URI, provide useful
                information, using the standards (RDF*, SPARQL)
             4. Include links to other URIs, so that they can
                discover more things

             –   http://www.w3.org/DesignIssues/LinkedData


2010-10-13               Bio2RDF: Convert, Provide And Reuse.     7
Bio2RDF Methodology
        We extend LOD rules by:
             1. Consistently naming all resources
                http://bio2rdf.org/namespace:identifier
             2. Resolving Bio2RDF URIs to a set of statements
                about the requested resource




2010-10-13                Bio2RDF: Convert, Provide And Reuse.   8
Data providers have been
                linking data for years
      •      Links are done from
             one HTML page to
             another
      •      This works for human
             consumption, but
             doesn’t scale with huge
             amounts of data




2010-10-13                Bio2RDF: Convert, Provide And Reuse.   9
However, most linked open data is
 created and provided without the
 help of the original data provider
Why be part of the linked data
                       cloud?
        •     Enable queries that span over more than 1
              database.
               –   Example: filtering a PubMed search by a
                   microarray level of expression filter
        •     Reduce the size of a database by only
              referencing data instead of including it in
              a database record (e.g. citations)



2010-10-13             Bio2RDF: Convert, Provide And Reuse.   11
But something is missing !




             derived from Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-
                  cloud.net/”


2010-10-13                              Bio2RDF: Convert, Provide And Reuse.                                    12
Presentation Plan
        •    Bio2RDF
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work




2010-10-13            Bio2RDF: Convert, Provide And Reuse.   13
Convert




        • Take a step back and think about what your data
          represents.
        • Forget about the way it is currently represented in
          your relational database
2010-10-13             Bio2RDF: Convert, Provide And Reuse.     14
What is RDFizing?
        • RDFizing converts legacy data into one or
          more RDF statements or triples:
                     <Subject> <Predicate> <Object>
        • Triples have correspondence for a standard
          database
             – Subject  row identifier
             – Predicate  column name
             – Object  value

2010-10-13              Bio2RDF: Convert, Provide And Reuse.   15
Convert
        • Converting to RDF just for the sake of providing
          another format will not add much to your offering
          if the result is your old relational database format
          … in RDF
        • Get rid of relational database artifacts while
             rdfizing
        1. Uses simple and stable identifiers to name
           resources
        2. Create types for the entities that your data refers to
           with in your data and specify the nature of the
           relations that hold between them

2010-10-13              Bio2RDF: Convert, Provide And Reuse.        16
Presentation Plan
        •    Bio2RDF
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work




2010-10-13             Bio2RDF: Convert, Provide And Reuse.   17
Provide – RDF Documents
        • URIs can be created with a REST-like look
        • Example of stable URIs
               http://purl.uniprot.org/uniprot/P19367
                  http://bio2rdf.org/uniprot:P19367
        • Documents containing statements should be
          accessible using web technology (HTTP protocol)
        • Provide data dumps
        • But in the LOD world, we want to query online
          databases!
2010-10-13            Bio2RDF: Convert, Provide And Reuse.   18
Provide - SPARQL
        • SPARQL is the query language for RDF/OWL
          that uses web technology (HTTP)
        • SPARQL endpoints make it possible to query
          databases using SPARQL
        • Distributed SPARQL will carve up the query and
          determine which endpoints need to be queried




2010-10-13            Bio2RDF: Convert, Provide And Reuse.   19
Provide
        • Publish the scheme you will use for your URIs so
          that other providers may use it

        • Provide access to documents with resolvable URIs
          (can be looked up using a web browser)
               http://geneprovider.com/gene:identifier
        • Now other data providers can use this identifer
          instead of copying the data into their own!

2010-10-13             Bio2RDF: Convert, Provide And Reuse.   20
Presentation Plan
        •    Bio2RDF
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work




2010-10-13             Bio2RDF: Convert, Provide And Reuse.   21
Reuse
        • RDF version of your documents without
          resolvable external links is just another file
          format
        • One of the most problematic issues is that
          the RDF generated by some providers are
          only inward looking -> they don’t reuse
          published URIs (Polite URI)


2010-10-13            Bio2RDF: Convert, Provide And Reuse.   22
Reuse
        • Example with Uniprot RDF
        • Uniprot is one of the first data providers to offers
          stable and resolvable URI for its documents.
          However, we can’t use directly the RDF they
          provide. Look at this extract of Human HK1 in RDF
        <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367">
            <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" />
            <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/>
        </rdf:Description>

        • The problem is that http://purl.uniprot.org/refseq/NP_277035.2
          resolves to the NCBI HTML page of NP_277035.2
        • Since NCBI does not provide RDF, it’s a dead end


2010-10-13                      Bio2RDF: Convert, Provide And Reuse.                    23
Reuse
        • Uniprot did the same things we have done at
          Bio2RDF. They create a URI in their namespace.
          From that URI which they control, they decided to
          redirect to the original HTML document of the
          specified ID

        • The difference with Bio2RDF is that we also
          resolve the other URI to an RDF document


2010-10-13            Bio2RDF: Convert, Provide And Reuse.    24
Reuse
        • Relational database artifacts : copying of data
          from one provider to another
        • Uniprot Citation entry IN a protein document
        <rdf:Description rdf:about="http://purl.uniprot.org/citations/10686099">
        <rdf:type rdf:resource="http://purl.uniprot.org/core/Journal_Citation" />
        <title>Crystal structures of mutant monomeric hexokinase I reveal multiple
            ...</title>
        <author>Aleshin A.E.</author>
        <author>Kirby C.</author>
        <skos:exactMatch rdf:resource="http://purl.uniprot.org/medline/20223513" />
        <skos:exactMatch rdf:resource="http://purl.uniprot.org/pubmed/10686099" />
        </>

        • In a Linked Data world, you only need to have a
          link to the PubMed URI at NCBI. The up to date
          information is there.


2010-10-13                      Bio2RDF: Convert, Provide And Reuse.                  25
With all this, how is now the LOD for life
               sciences by original data providers ?




2010-10-13              Bio2RDF: Convert, Provide And Reuse.   26
What is still missing !




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”




2010-10-13                              Bio2RDF: Convert, Provide And Reuse.                                   27
Some handshake




2010-10-13    Bio2RDF: Convert, Provide And Reuse.   28
The new version of LOD for
             life sciences by original providers




2010-10-13          Bio2RDF: Convert, Provide And Reuse.   29
Something is missing !




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”




2010-10-13                              Bio2RDF: Convert, Provide And Reuse.                                   30
Bottom line
        • That is not quite what we had at the beginning
        • The original network made by third party provider still
          exist. Use it like if it was another provider.
        • What about documents without an RDF version?
             – From NAR, there is 1230 databases. Too much for Bio2RDF or
               any other to convert entirely.
             – Use third party providers, like Bio2RDF, NeuroCommons,
               LinkedLifeData, Shared Names, etc.
             – Add these in addition to original data URIs
             – Some databases, for various reason, might never create RDF
               version of their data. So third party will continue to have their use
               by providing their data in RDF

2010-10-13                    Bio2RDF: Convert, Provide And Reuse.                     31
By applying those conversion, publication and
         reuse rules to your data, we will witness the
         birth of a more stable network of linked data
          … and the death (in the very long term) of
                           Bio2RDF.

                Let's kill Bio2RDF together !!



2010-10-13           Bio2RDF: Convert, Provide And Reuse.   32
Presentation Plan
        •    Bio2RDF
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work




2010-10-13             Bio2RDF: Convert, Provide And Reuse.   33
Future Work
        • Data processing workflow
        • New facet-based user interface to browse
          and formulate sophisticated queries
        • Full text indexing for autocompletion
          support
        • Exploring knowledge discovery
          possibilities in the linked data network

2010-10-13           Bio2RDF: Convert, Provide And Reuse.   34
Thanks
        • Bio2RDF community
             – Centre de recherche du CHUL
             – Dumontier Lab members
             – QUT eResearch Center
        • Triplestore provided by Openlink Virtuoso
        • François Belleau



2010-10-13             Bio2RDF: Convert, Provide And Reuse.   35
Acknowledgment
        • Marc-Alexandre Nolin funding provided by
          CANARIE via the C-BRASS project
        • Servers in Quebec City are provided by
          Jacques Corbeil of Laval University




2010-10-13          Bio2RDF: Convert, Provide And Reuse.   36
Contact Information
        •    Mailing list : bio2rdf@googlegroups.com
        •    URL : http://bio2rdf.org
        •    Wiki : http://sourceforge.net/apps/mediawiki/bio2rdf
        •    Blog : http://bio2rdf.blogspot.com




2010-10-13                 Bio2RDF: Convert, Provide And Reuse.     37

More Related Content

What's hot

MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
3 Round Stones
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersHerbert Van de Sompel
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
James Powell
 
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
National Information Standards Organization (NISO)
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparqlDhavalkumar Thakker
 
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Karen R
 
Gap Analysis
Gap AnalysisGap Analysis
Gap Analysis
PRELIDA Project
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
EUCLID project
 
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
eby
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
ASIS&T
 
2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer
Johannes Keizer
 
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
Dag Endresen
 
Building Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesBuilding Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesSusan Matveyeva
 
The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)
Myungjin Lee
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
Jisc
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksAndrea Nuzzolese
 
Why open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and modelsWhy open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and models
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)
Myungjin Lee
 
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Center for Life Science
 

What's hot (19)

MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
 
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparql
 
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
 
Gap Analysis
Gap AnalysisGap Analysis
Gap Analysis
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
 
2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer
 
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
 
Building Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesBuilding Collections in IRs from External Data Sources
Building Collections in IRs from External Data Sources
 
The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia links
 
Why open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and modelsWhy open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and models
 
The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)
 
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
 

Viewers also liked

AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2
MarcioAranha
 
AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2
MarcioAranha
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
nolmar01
 
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario RipperAcorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
MarcioAranha
 
Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3
nolmar01
 
SOC Business Opportunity Flipchart
SOC Business Opportunity FlipchartSOC Business Opportunity Flipchart
SOC Business Opportunity Flipchart
JulieCollins
 
AcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2AuditoriumAcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2Auditorium
MarcioAranha
 
AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1
MarcioAranha
 
AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1
MarcioAranha
 
AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1
MarcioAranha
 
Springpractice
SpringpracticeSpringpractice
Springpractice
azuerseccion
 
Insurance Lead Services Presentation
Insurance Lead Services PresentationInsurance Lead Services Presentation
Insurance Lead Services Presentation
DEG Consulting
 
Jeopardy Usa
Jeopardy UsaJeopardy Usa
Jeopardy Usa
azuerseccion
 
Fate of Nanoparticles
Fate of NanoparticlesFate of Nanoparticles
Fate of Nanoparticlesslacerda
 
Animal Reproduction
Animal ReproductionAnimal Reproduction
Animal Reproduction
azuerseccion
 

Viewers also liked (15)

AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2
 
AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
 
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario RipperAcorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
 
Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3
 
SOC Business Opportunity Flipchart
SOC Business Opportunity FlipchartSOC Business Opportunity Flipchart
SOC Business Opportunity Flipchart
 
AcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2AuditoriumAcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2Auditorium
 
AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1
 
AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1
 
AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1
 
Springpractice
SpringpracticeSpringpractice
Springpractice
 
Insurance Lead Services Presentation
Insurance Lead Services PresentationInsurance Lead Services Presentation
Insurance Lead Services Presentation
 
Jeopardy Usa
Jeopardy UsaJeopardy Usa
Jeopardy Usa
 
Fate of Nanoparticles
Fate of NanoparticlesFate of Nanoparticles
Fate of Nanoparticles
 
Animal Reproduction
Animal ReproductionAnimal Reproduction
Animal Reproduction
 

Similar to W4 4 marc-alexandre-nolin-v2

Charper.lawdi.20120601
Charper.lawdi.20120601Charper.lawdi.20120601
Charper.lawdi.20120601
charper
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Anita de Waard
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationseswcsummerschool
 
Wed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationsWed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservations
eswcsummerschool
 
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
CONUL Conference
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
Anja Jentzsch
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
Ivan Herman
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2
Martin Hepp
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
guestecacad2
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
National Information Standards Organization (NISO)
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
ASIS&T
 
OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012
nw13
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
National Information Standards Organization (NISO)
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxLIS EPI Meeting
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
EUCLID project
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Creating Knowledge out of Interlinked Data
 

Similar to W4 4 marc-alexandre-nolin-v2 (20)

Charper.lawdi.20120601
Charper.lawdi.20120601Charper.lawdi.20120601
Charper.lawdi.20120601
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
 
Wed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationsWed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservations
 
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
 
OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 
Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009Bio2RDF @ W3C HCLS2009
Bio2RDF @ W3C HCLS2009
 
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
 
Converting GHO to RDF
Converting GHO to RDFConverting GHO to RDF
Converting GHO to RDF
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-redux
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application Scenarios
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 

Recently uploaded

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
chanes7
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
gb193092
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 

Recently uploaded (20)

The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 

W4 4 marc-alexandre-nolin-v2

  • 1. Bio2RDF: Convert, Provide And Reuse. Marc-Alexandre Nolin1, Jacques Corbeil1, Luc Lamontagne1, Michel Dumontier1,2 1 Laval University, Canada 2 Carleton University, Canada manolin@gmail.com
  • 2. Presentation Plan • Bio2RDF – Methodology • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 2
  • 3. Bio2RDF • Bio2RDF uses open-source Semantic Web technologies to provide interlinked life science data to support biological knowledge discovery. • Over 40 databases converted • Over 30 billion triples • Global mirroring – Quebec City, Quebec, Canada – Ottawa, Ontario, Canada – Guelph, Ontario, Canada – Brisbane, Australia – part of LOD mashup at http://lod.openlinksw.com 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 3
  • 4. 2010 Linked Open Data Cloud “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 4
  • 5. Bio2RDF is the major contributor to the Life Sciences LOD “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 5
  • 6. What is Linked Open Data? • Linked open data is • data that is free to use • machine understandable (uses RDF/OWL) • can be looked up using web protocols • has meaningful relations between data items (generated from supplied cross-references, or text-based mappings) 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 6
  • 7. LOD Methodology Applying Tim Berners-Lee 4 rules 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs, so that they can discover more things – http://www.w3.org/DesignIssues/LinkedData 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 7
  • 8. Bio2RDF Methodology We extend LOD rules by: 1. Consistently naming all resources http://bio2rdf.org/namespace:identifier 2. Resolving Bio2RDF URIs to a set of statements about the requested resource 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 8
  • 9. Data providers have been linking data for years • Links are done from one HTML page to another • This works for human consumption, but doesn’t scale with huge amounts of data 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 9
  • 10. However, most linked open data is created and provided without the help of the original data provider
  • 11. Why be part of the linked data cloud? • Enable queries that span over more than 1 database. – Example: filtering a PubMed search by a microarray level of expression filter • Reduce the size of a database by only referencing data instead of including it in a database record (e.g. citations) 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 11
  • 12. But something is missing ! derived from Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod- cloud.net/” 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 12
  • 13. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 13
  • 14. Convert • Take a step back and think about what your data represents. • Forget about the way it is currently represented in your relational database 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 14
  • 15. What is RDFizing? • RDFizing converts legacy data into one or more RDF statements or triples: <Subject> <Predicate> <Object> • Triples have correspondence for a standard database – Subject  row identifier – Predicate  column name – Object  value 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 15
  • 16. Convert • Converting to RDF just for the sake of providing another format will not add much to your offering if the result is your old relational database format … in RDF • Get rid of relational database artifacts while rdfizing 1. Uses simple and stable identifiers to name resources 2. Create types for the entities that your data refers to with in your data and specify the nature of the relations that hold between them 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 16
  • 17. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 17
  • 18. Provide – RDF Documents • URIs can be created with a REST-like look • Example of stable URIs http://purl.uniprot.org/uniprot/P19367 http://bio2rdf.org/uniprot:P19367 • Documents containing statements should be accessible using web technology (HTTP protocol) • Provide data dumps • But in the LOD world, we want to query online databases! 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 18
  • 19. Provide - SPARQL • SPARQL is the query language for RDF/OWL that uses web technology (HTTP) • SPARQL endpoints make it possible to query databases using SPARQL • Distributed SPARQL will carve up the query and determine which endpoints need to be queried 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 19
  • 20. Provide • Publish the scheme you will use for your URIs so that other providers may use it • Provide access to documents with resolvable URIs (can be looked up using a web browser) http://geneprovider.com/gene:identifier • Now other data providers can use this identifer instead of copying the data into their own! 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 20
  • 21. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 21
  • 22. Reuse • RDF version of your documents without resolvable external links is just another file format • One of the most problematic issues is that the RDF generated by some providers are only inward looking -> they don’t reuse published URIs (Polite URI) 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 22
  • 23. Reuse • Example with Uniprot RDF • Uniprot is one of the first data providers to offers stable and resolvable URI for its documents. However, we can’t use directly the RDF they provide. Look at this extract of Human HK1 in RDF <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367"> <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" /> <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/> </rdf:Description> • The problem is that http://purl.uniprot.org/refseq/NP_277035.2 resolves to the NCBI HTML page of NP_277035.2 • Since NCBI does not provide RDF, it’s a dead end 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 23
  • 24. Reuse • Uniprot did the same things we have done at Bio2RDF. They create a URI in their namespace. From that URI which they control, they decided to redirect to the original HTML document of the specified ID • The difference with Bio2RDF is that we also resolve the other URI to an RDF document 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 24
  • 25. Reuse • Relational database artifacts : copying of data from one provider to another • Uniprot Citation entry IN a protein document <rdf:Description rdf:about="http://purl.uniprot.org/citations/10686099"> <rdf:type rdf:resource="http://purl.uniprot.org/core/Journal_Citation" /> <title>Crystal structures of mutant monomeric hexokinase I reveal multiple ...</title> <author>Aleshin A.E.</author> <author>Kirby C.</author> <skos:exactMatch rdf:resource="http://purl.uniprot.org/medline/20223513" /> <skos:exactMatch rdf:resource="http://purl.uniprot.org/pubmed/10686099" /> </> • In a Linked Data world, you only need to have a link to the PubMed URI at NCBI. The up to date information is there. 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 25
  • 26. With all this, how is now the LOD for life sciences by original data providers ? 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 26
  • 27. What is still missing ! “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 27
  • 28. Some handshake 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 28
  • 29. The new version of LOD for life sciences by original providers 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 29
  • 30. Something is missing ! “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 30
  • 31. Bottom line • That is not quite what we had at the beginning • The original network made by third party provider still exist. Use it like if it was another provider. • What about documents without an RDF version? – From NAR, there is 1230 databases. Too much for Bio2RDF or any other to convert entirely. – Use third party providers, like Bio2RDF, NeuroCommons, LinkedLifeData, Shared Names, etc. – Add these in addition to original data URIs – Some databases, for various reason, might never create RDF version of their data. So third party will continue to have their use by providing their data in RDF 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 31
  • 32. By applying those conversion, publication and reuse rules to your data, we will witness the birth of a more stable network of linked data … and the death (in the very long term) of Bio2RDF. Let's kill Bio2RDF together !! 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 32
  • 33. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 33
  • 34. Future Work • Data processing workflow • New facet-based user interface to browse and formulate sophisticated queries • Full text indexing for autocompletion support • Exploring knowledge discovery possibilities in the linked data network 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 34
  • 35. Thanks • Bio2RDF community – Centre de recherche du CHUL – Dumontier Lab members – QUT eResearch Center • Triplestore provided by Openlink Virtuoso • François Belleau 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 35
  • 36. Acknowledgment • Marc-Alexandre Nolin funding provided by CANARIE via the C-BRASS project • Servers in Quebec City are provided by Jacques Corbeil of Laval University 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 36
  • 37. Contact Information • Mailing list : bio2rdf@googlegroups.com • URL : http://bio2rdf.org • Wiki : http://sourceforge.net/apps/mediawiki/bio2rdf • Blog : http://bio2rdf.blogspot.com 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 37