SlideShare a Scribd company logo
1 of 42
The Semantic Web and Linked
        Open Data



               Pete DeVries
             TaxonConcept.org
      http://www.taxonconcept.org/
       Department of Entomology
     University of Wisconsin - Madison
What is the Semantic Web and how
          does it Work?
                   Lets Look at the Traditional Way
                                   Taxon Table




                                 Location Table



This data structure is really only interpretable within the context of this specific database
Data Islands




The result are database islands that contain a lot of redundant data which is independently curated.

                         Each effort benefits little from the other efforts.
Data Sets often Overlap




                               Text




What they don’t have is a common set of field names or ID’s
Each Data set has is own “Vocabulary”

 Different Fields
 Different Names for the Same Fields
 Same Names for Different Fields
 Different ways of Interpreting those Fields

These nuances in meaning are often only understood by the
          designers of each individual data set.

 Consider how differently people interpret the meaning of
            what seem to be the same terms
Where the Semantic Web Helps
               Tim Berners-Lee’s 4 Rules

1. Use URIs* as names for things
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information.
4. Include links to other URIs. so that they can discover more things.


           *URI = Uniform Resource Identifier
       http://www.w3.org/DesignIssues/LinkedData.html
Use URIs as Names for Things?


Instead of “Door County” use
http://sws.geonames.org/5250768/
For Humans this URI Dereferences to a
    Human Interpretable Web Page



                  Text
                    Text
For Machines this Dereferences a
   Machine Interpretable File




             As N-Triples
Why Would Anyone Think this Made Sense?




 Now, each of these different databases are using an ID with a shared meaning.

         A meaning that can be determined by dereferencing the URI.

        All the data sets that use this vocabulary are now connectable.

 All the data sets that are linked to this URI are now also linked to each other.
Life Sciences Example

Example: Two databases with county records

One uses “La Crosse County,” the other lists “La Crosse” for La
Crosse County, Wisconsin

You want to link and merge those records so that it is clear that you
mean a particular species was observed in a particular county
Normalize the Meaning between Data Sources
Use this shared vocabulary to integrate these two data sources

Use that shared vocabulary to find and link to other relevant data
As More Data Sets Adopt these Principles




The individual datasets are no longer islands, but are one interconnected knowledge base
Other Benefits

 Reduced duplication of effort and a better separation of concerns
   It would be more efficient for me to simply link to a bibliographic
 reference URI on a site that specializes in that then to create my own
                        bibliographic database.

Similarly, it would be more efficient for the bibliographic database to link
to a URI in a nomenclatural database than curates that aspect separately.

When represented as URI’s in a Semantic Web database or “Triple Store”,
 information can be encoded more efficiently ~32 bytes per statement

     Enabling usable knowledge bases that scale to billions of “facts”
Example: The Linked Open Data Cloud




          Over 55 billion triples and rising
What is Linked Open Data?




1. data representation using open standards
2. use of hyperlinks to make it work on the global web
Wikipedia Images linked to my Species Concepts




     TaxonConcept <=> Dbpedia <=> WikiCommons Images
           Virtuoso OpenSource and Microsoft Pivot
             (some images are too large to display)
How do I Mark up my Data?
    Your data set can continue to exist in its current relational
database form, but you need to expose it to the semantic web in a
                          different form

The goal is to make structured data accessible and discoverable via
                            hyperlinks.
    It also includes the use of hyperlinks to denote properties/
             predicates that have well defined semantics.
These semantics are what ontologies and vocabularies deliver with
       more fidelity that what's available in a typical RDMS.
    Thus, the Semantic Web isn't a destination - it the effect of
publishing data in line with a set of principles as outlined in TimBL's
                                meme.
Knowledge as Triples
Statements are represented in a triple structure

        Subject ➜ Predicate ➜ Object

•   An English text version of a triple might look like

•   Ochlerotatus triseriatus expected in La Crosse County, WI
Machine Processable Version
Ochlerotatus triseriatus is expected in La Crosse County, WI

       Now represented as the following triple*


          http://lod.taxonconcept.org/ses/iuCXz#Species

    http://lod.taxonconcept.org/ontology/txn.owl#isExpectedIn

                http://sws.geonames.org/5258961/



        *Not Meant for Human Consumption
Expressing RDF

 RDF = Resource Description Framework

 Ways to Express RDF (Serialization Formats)

                   RDF/XML
      http://www.w3.org/TR/REC-rdf-syntax/
              Notation 3 (N3)
http://www.w3.org/DesignIssues/Notation3.html

                    Subsets of N3
         Turtle (Terse RDF Triple Language)
                      N-Triples
The Same Triple in Different Formats
                                    RDF/XML (.rdf)




                                        N3 (.n3)




                                       Turtle (.ttl)




                      You might find one of these forms easier to create.
     There are various tools that will allow you to convert between one form and another.
If you need RDF/XML, but can create N3; author in N3 then convert those files to RDF/XML.
How do I tell the Semantic Web
      about my Data?

           PingtheSemanticWeb
      http://pingthesemanticweb.com/
            Semantic Sitemaps
http://sw.deri.org/2007/07/sitemapextension/
PingtheSemanticWeb.com
  Enter the URL for your RDF documents
Semantic SiteMaps




          http://site.example.com/sitemap.xml
        http://site.example.com/sitemap.xml.gz
Refer to the sitemap.xml file in your sites robots.txt file
How can I Find other Potentially Useful
             Data Sets?
CKAN Comprehensive Knowledge Archive Network
               http://ckan.net/
Ask the LOD Cloud




Enter in term or name like “Quercus alba”, to see what entities contain that term or name
LOD Cloud Query Result
How can I set up my own Knowledge Base?
           Virtuoso Open-Source Edition
             http://virtuoso.openlinksw.com/
How can I Query a Knowledge Base?
                            SPARQL
                http://en.wikipedia.org/wiki/SPARQL
              http://www.w3.org/TR/rdf-sparql-query/
                   Query using the Web Interface
           Query using your own script or web application

                              Example




  “Describe those occurrences of the species concept Boloria selene”
iSPARQL Query Example Web Interface
iSPARQL Query Result
More Elaborate SPARQL Query




Query for those mammals that are “expected in” Wisconsin.

* use optional keyword for those attributes that may not exist
* the query includes those attributes that should be returned
The result set will be feed through Microsoft Pivot for Browsing
Result View




Live Query of the LOD Cloud Data Set
Efforts to Align Vocabularies

http://labs.mondeca.com/dataset/lov/index.html
Early EoL LOD
Early EoL LOD
Early EoL LOD




 Knowledge Base View
What does the Future hold for the
Semantic Web and Linked Open Data

Improvements in the quantity and quality of LOD data sets.
           Improved Alignment of Vocabularies
        Improvements in SPARQL and Quadstores
Human and Machine Interpretable Views Merged in RDFa
          Better Visualization and Analysis Tools
Other Resources
             Linked Open Data http://linkeddata.org/
             W3C.org http://esw.w3.org/Main_Page
public-lod email list http://lists.w3.org/Archives/Public/public-lod/
        TaxonConcept.org http://www.taxonconcept.org/
 TaxonConcept.org Examples http://bit.ly/bundles/pjdlinkeddata/



                     SlideShare Talks
         Evolution Towards Web 3.0: The Semantic Web
http://www.slideshare.net/LeeFeigenbaum/evolution-towards-web-30-the-
                             semantic-web
Recommendations
Try using and experimenting with existing vocabularies before creating
your own.
Although these technologies allow you to run queries that you might not
have anticipated, thinking about use cases etc. will provide some guidance
on the best way to markup your data.
Start with simple models and representations and add complexity as you
gain experience.
You may not want or be able to expose all your data to the LOD Cloud,
but exposing the metadata in commonly used vocabularies will make your
data more “findable”
 Some vocabularies* are still under development and discussion, but in
 many cases you can modify your SQL to RDF export to accommodate
 changes.
* For instance, it is not clear to me what is the “best” vocabulary for
representing publications.
Acknowledgments
           Kingsley Idehen
   http://www.openlinksw.com/blog/~kidehen/


David “Paddy” Patterson mbl.edu

     Anne Thessen mbl.edu

  Dmitry Mozzherin mbl.edu

         Han Wang rpi.edu

      Patrick Leary eol.org

More Related Content

What's hot

Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015Cason Snow
 
The Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLThe Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLMyungjin Lee
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale Bernadette Hyland-Wood
 
ORE and SWAP: Composition and Complexity
ORE and SWAP: Composition and ComplexityORE and SWAP: Composition and Complexity
ORE and SWAP: Composition and ComplexityEduserv Foundation
 
Introduction to Linked Data
Introduction to Linked DataIntroduction to Linked Data
Introduction to Linked DataJuan Sequeda
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic  Web and Linked DataAn introduction to Semantic  Web and Linked Data
An introduction to Semantic Web and Linked DataGabriela Agustini
 
Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic WebPeter Mika
 
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...eswcsummerschool
 
Get on the Linked Data Web!
Get on the Linked Data Web!Get on the Linked Data Web!
Get on the Linked Data Web!Armin Haller
 
Year of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyYear of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyPeter Mika
 
Semantic Search Summer School2009
Semantic Search Summer School2009Semantic Search Summer School2009
Semantic Search Summer School2009Peter Mika
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataMatthew Rowe
 
From the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upFrom the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upDavide Palmisano
 
Research Data Sharing: A Basic Framework
Research Data Sharing: A Basic FrameworkResearch Data Sharing: A Basic Framework
Research Data Sharing: A Basic FrameworkPaul Groth
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011Peter Mika
 
Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Jane Stevenson
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Serverwebhostingguy
 
A Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsA Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsArmin Haller
 

What's hot (20)

Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015
 
The Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQLThe Semantic Web #10 - SPARQL
The Semantic Web #10 - SPARQL
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
ORE and SWAP: Composition and Complexity
ORE and SWAP: Composition and ComplexityORE and SWAP: Composition and Complexity
ORE and SWAP: Composition and Complexity
 
Introduction to Linked Data
Introduction to Linked DataIntroduction to Linked Data
Introduction to Linked Data
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic  Web and Linked DataAn introduction to Semantic  Web and Linked Data
An introduction to Semantic Web and Linked Data
 
Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic Web
 
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
ESWC SS 2013 - Tuesday Tutorial 1 Maribel Acosta and Barry Norton: Providing ...
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Get on the Linked Data Web!
Get on the Linked Data Web!Get on the Linked Data Web!
Get on the Linked Data Web!
 
Year of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyYear of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkey
 
Semantic Search Summer School2009
Semantic Search Summer School2009Semantic Search Summer School2009
Semantic Search Summer School2009
 
Semantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic DataSemantic Technologies: Representing Semantic Data
Semantic Technologies: Representing Semantic Data
 
From the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upFrom the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking up
 
Research Data Sharing: A Basic Framework
Research Data Sharing: A Basic FrameworkResearch Data Sharing: A Basic Framework
Research Data Sharing: A Basic Framework
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011
 
Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
 
A Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsA Semantic Data Model for Web Applications
A Semantic Data Model for Web Applications
 
Linking up your data
Linking up your dataLinking up your data
Linking up your data
 

Similar to The Semantic Web and Linked Open Data: An Introduction

Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Museums Computer Group
 
Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Jenel Farrell
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialLeeFeigenbaum
 
Semantic web assignment1
Semantic web assignment1Semantic web assignment1
Semantic web assignment1BarryK88
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Linked dataresearch
Linked dataresearchLinked dataresearch
Linked dataresearchTope Omitola
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)John Dougherty
 
Building a semantic website
Building a semantic websiteBuilding a semantic website
Building a semantic websiteCJ Jenkins
 
SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsRinke Hoekstra
 
Linked Data Technology and Status
Linked Data Technology and StatusLinked Data Technology and Status
Linked Data Technology and StatusMyungjin Lee
 
Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Morgan Briles
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Webebiquity
 
PoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataPoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataAndreas Blumauer
 
DM110 - Week 10 - Semantic Web / Web 3.0
DM110 - Week 10 - Semantic Web / Web 3.0DM110 - Week 10 - Semantic Web / Web 3.0
DM110 - Week 10 - Semantic Web / Web 3.0John Breslin
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 

Similar to The Semantic Web and Linked Open Data: An Introduction (20)

Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?
 
Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
 
Semantic web assignment1
Semantic web assignment1Semantic web assignment1
Semantic web assignment1
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Linked Data
Linked DataLinked Data
Linked Data
 
Linked dataresearch
Linked dataresearchLinked dataresearch
Linked dataresearch
 
Semantic web
Semantic web Semantic web
Semantic web
 
Semantic Web (Web 3.0)
Semantic Web (Web 3.0)Semantic Web (Web 3.0)
Semantic Web (Web 3.0)
 
Building a semantic website
Building a semantic websiteBuilding a semantic website
Building a semantic website
 
SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
 
Linked Data Technology and Status
Linked Data Technology and StatusLinked Data Technology and Status
Linked Data Technology and Status
 
Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web Linked data 101: Getting Caught in the Semantic Web
Linked data 101: Getting Caught in the Semantic Web
 
Finding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic WebFinding knowledge, data and answers on the Semantic Web
Finding knowledge, data and answers on the Semantic Web
 
PoolParty SKOS and Linked Data
PoolParty SKOS and Linked DataPoolParty SKOS and Linked Data
PoolParty SKOS and Linked Data
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
DM110 - Week 10 - Semantic Web / Web 3.0
DM110 - Week 10 - Semantic Web / Web 3.0DM110 - Week 10 - Semantic Web / Web 3.0
DM110 - Week 10 - Semantic Web / Web 3.0
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 

Recently uploaded

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxLigayaBacuel1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 

Recently uploaded (20)

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 

The Semantic Web and Linked Open Data: An Introduction

  • 1. The Semantic Web and Linked Open Data Pete DeVries TaxonConcept.org http://www.taxonconcept.org/ Department of Entomology University of Wisconsin - Madison
  • 2. What is the Semantic Web and how does it Work? Lets Look at the Traditional Way Taxon Table Location Table This data structure is really only interpretable within the context of this specific database
  • 3. Data Islands The result are database islands that contain a lot of redundant data which is independently curated. Each effort benefits little from the other efforts.
  • 4. Data Sets often Overlap Text What they don’t have is a common set of field names or ID’s
  • 5. Each Data set has is own “Vocabulary” Different Fields Different Names for the Same Fields Same Names for Different Fields Different ways of Interpreting those Fields These nuances in meaning are often only understood by the designers of each individual data set. Consider how differently people interpret the meaning of what seem to be the same terms
  • 6. Where the Semantic Web Helps Tim Berners-Lee’s 4 Rules 1. Use URIs* as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs. so that they can discover more things. *URI = Uniform Resource Identifier http://www.w3.org/DesignIssues/LinkedData.html
  • 7. Use URIs as Names for Things? Instead of “Door County” use http://sws.geonames.org/5250768/
  • 8. For Humans this URI Dereferences to a Human Interpretable Web Page Text Text
  • 9. For Machines this Dereferences a Machine Interpretable File As N-Triples
  • 10. Why Would Anyone Think this Made Sense? Now, each of these different databases are using an ID with a shared meaning. A meaning that can be determined by dereferencing the URI. All the data sets that use this vocabulary are now connectable. All the data sets that are linked to this URI are now also linked to each other.
  • 11. Life Sciences Example Example: Two databases with county records One uses “La Crosse County,” the other lists “La Crosse” for La Crosse County, Wisconsin You want to link and merge those records so that it is clear that you mean a particular species was observed in a particular county
  • 12. Normalize the Meaning between Data Sources Use this shared vocabulary to integrate these two data sources Use that shared vocabulary to find and link to other relevant data
  • 13. As More Data Sets Adopt these Principles The individual datasets are no longer islands, but are one interconnected knowledge base
  • 14. Other Benefits Reduced duplication of effort and a better separation of concerns It would be more efficient for me to simply link to a bibliographic reference URI on a site that specializes in that then to create my own bibliographic database. Similarly, it would be more efficient for the bibliographic database to link to a URI in a nomenclatural database than curates that aspect separately. When represented as URI’s in a Semantic Web database or “Triple Store”, information can be encoded more efficiently ~32 bytes per statement Enabling usable knowledge bases that scale to billions of “facts”
  • 15. Example: The Linked Open Data Cloud Over 55 billion triples and rising
  • 16. What is Linked Open Data? 1. data representation using open standards 2. use of hyperlinks to make it work on the global web
  • 17. Wikipedia Images linked to my Species Concepts TaxonConcept <=> Dbpedia <=> WikiCommons Images Virtuoso OpenSource and Microsoft Pivot (some images are too large to display)
  • 18. How do I Mark up my Data? Your data set can continue to exist in its current relational database form, but you need to expose it to the semantic web in a different form The goal is to make structured data accessible and discoverable via hyperlinks. It also includes the use of hyperlinks to denote properties/ predicates that have well defined semantics. These semantics are what ontologies and vocabularies deliver with more fidelity that what's available in a typical RDMS. Thus, the Semantic Web isn't a destination - it the effect of publishing data in line with a set of principles as outlined in TimBL's meme.
  • 19. Knowledge as Triples Statements are represented in a triple structure Subject ➜ Predicate ➜ Object • An English text version of a triple might look like • Ochlerotatus triseriatus expected in La Crosse County, WI
  • 20. Machine Processable Version Ochlerotatus triseriatus is expected in La Crosse County, WI Now represented as the following triple* http://lod.taxonconcept.org/ses/iuCXz#Species http://lod.taxonconcept.org/ontology/txn.owl#isExpectedIn http://sws.geonames.org/5258961/ *Not Meant for Human Consumption
  • 21. Expressing RDF RDF = Resource Description Framework Ways to Express RDF (Serialization Formats) RDF/XML http://www.w3.org/TR/REC-rdf-syntax/ Notation 3 (N3) http://www.w3.org/DesignIssues/Notation3.html Subsets of N3 Turtle (Terse RDF Triple Language) N-Triples
  • 22. The Same Triple in Different Formats RDF/XML (.rdf) N3 (.n3) Turtle (.ttl) You might find one of these forms easier to create. There are various tools that will allow you to convert between one form and another. If you need RDF/XML, but can create N3; author in N3 then convert those files to RDF/XML.
  • 23. How do I tell the Semantic Web about my Data? PingtheSemanticWeb http://pingthesemanticweb.com/ Semantic Sitemaps http://sw.deri.org/2007/07/sitemapextension/
  • 24. PingtheSemanticWeb.com Enter the URL for your RDF documents
  • 25. Semantic SiteMaps http://site.example.com/sitemap.xml http://site.example.com/sitemap.xml.gz Refer to the sitemap.xml file in your sites robots.txt file
  • 26. How can I Find other Potentially Useful Data Sets? CKAN Comprehensive Knowledge Archive Network http://ckan.net/
  • 27. Ask the LOD Cloud Enter in term or name like “Quercus alba”, to see what entities contain that term or name
  • 28. LOD Cloud Query Result
  • 29. How can I set up my own Knowledge Base? Virtuoso Open-Source Edition http://virtuoso.openlinksw.com/
  • 30. How can I Query a Knowledge Base? SPARQL http://en.wikipedia.org/wiki/SPARQL http://www.w3.org/TR/rdf-sparql-query/ Query using the Web Interface Query using your own script or web application Example “Describe those occurrences of the species concept Boloria selene”
  • 31. iSPARQL Query Example Web Interface
  • 33. More Elaborate SPARQL Query Query for those mammals that are “expected in” Wisconsin. * use optional keyword for those attributes that may not exist * the query includes those attributes that should be returned The result set will be feed through Microsoft Pivot for Browsing
  • 34. Result View Live Query of the LOD Cloud Data Set
  • 35. Efforts to Align Vocabularies http://labs.mondeca.com/dataset/lov/index.html
  • 38. Early EoL LOD Knowledge Base View
  • 39. What does the Future hold for the Semantic Web and Linked Open Data Improvements in the quantity and quality of LOD data sets. Improved Alignment of Vocabularies Improvements in SPARQL and Quadstores Human and Machine Interpretable Views Merged in RDFa Better Visualization and Analysis Tools
  • 40. Other Resources Linked Open Data http://linkeddata.org/ W3C.org http://esw.w3.org/Main_Page public-lod email list http://lists.w3.org/Archives/Public/public-lod/ TaxonConcept.org http://www.taxonconcept.org/ TaxonConcept.org Examples http://bit.ly/bundles/pjdlinkeddata/ SlideShare Talks Evolution Towards Web 3.0: The Semantic Web http://www.slideshare.net/LeeFeigenbaum/evolution-towards-web-30-the- semantic-web
  • 41. Recommendations Try using and experimenting with existing vocabularies before creating your own. Although these technologies allow you to run queries that you might not have anticipated, thinking about use cases etc. will provide some guidance on the best way to markup your data. Start with simple models and representations and add complexity as you gain experience. You may not want or be able to expose all your data to the LOD Cloud, but exposing the metadata in commonly used vocabularies will make your data more “findable” Some vocabularies* are still under development and discussion, but in many cases you can modify your SQL to RDF export to accommodate changes. * For instance, it is not clear to me what is the “best” vocabulary for representing publications.
  • 42. Acknowledgments Kingsley Idehen http://www.openlinksw.com/blog/~kidehen/ David “Paddy” Patterson mbl.edu Anne Thessen mbl.edu Dmitry Mozzherin mbl.edu Han Wang rpi.edu Patrick Leary eol.org

Editor's Notes

  1. Today I am going to give you a brief overview of the semantic web and how it can be useful for life sciences data.\n
  2. Here is a traditional table in a spreadsheet. It is a list of the various species and includes ID field that is used to connect it to another table for locations.\nIn this we have a representation for taxa and a representation for location that are often specific to this and only this database.\n\nOther similar databases that might be useful will have different names for the fields, and different names within the fields for what is often the same entity. \n\nWhat you have is a data island that knows nothing else about potentially related data and shares nothing about itself to other data sets.\n
  3. The result of this structure are large islands of data which are difficult to integrate. \n\nEach of these gains little value from other data sets and are of little value to other datasets.\n
  4. Different sets often overlap\n
  5. \n
  6. Use URIs as names for things\nUse HTTP URIs so that people can look up those names.\nWhen someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)\nInclude links to other URIs. so that they can discover more things.\n\n
  7. \n
  8. \n
  9. \n
  10. \n
  11. Lets look at an example of how these unique id&amp;#x2019;s can be used to reduce ambiguity and allow easier integration of disparate data sets.\nHere we have two databases of collection records, one for Wisconsin Insects the other for Wisconsin Plants.\nOne of the databases use &amp;#x201C;county&amp;#x201D; for the county field, others use &amp;#x201C;cnty&amp;#x201D;. In addition, one database lists &amp;#x201C;La Crosse County&amp;#x201D; while another lists &amp;#x201C;La Crosse&amp;#x201D;. \nWhat you want to do is link and merge those records so that it is clear that you mean that a particular species was observed in a particular county.\n
  12. If both these data sources use the geonames vocabulary, then it is easy to integrate the data for both insects and plants.\nIt is also possible to look for other uses of this identifier to find related data about this particular county.\n\n
  13. \n
  14. \n
  15. Linked Data is data that is linked together following the principles laid out by Tim Berners-Lee.\nLinked Open Data is Linked Data that is open and accessible.\nThere are ways to query this knowledge base, but you can also create your own subset for your own knowledge base.\n
  16. Linked Data is data that is linked together following the principles laid out by Tim Berners-Lee.\nLinked Open Data is Linked Data that is open and accessible.\nThere are ways to query this knowledge base, but you can also create your own subset for your own knowledge base.\n
  17. Since all these data sets are connected you can do some interesting things.\nMy data set is linked to Wikipedia through Dbpedia I can easily pull in all the images for my species that are in Wikipedia.\nThe RDF icons are for images that are too large to be displayed.\n
  18. Your data set can continue to exist in its current relational database form, but you need to expose it to the semantic web in a different form.\n\nThe goal is to make structured data accessible and discoverable via hyperlinks. \n\nIt also includes the use of hyperlinks to denote properties/predicates that have well defined semantics. \n\nThese semantics are what ontologies and vocabularies deliver with more fidelity that what&apos;s available in a typical RDMS. \n\nThus, the Semantic Web isn&apos;t a destination - it the effect of publishing data in line with a set of principles as outlined in TimBL&apos;s meme.\n
  19. The semantic web represents statements as triples.\nTriples consist of a subject predicate and object.\nAn english language version of a triple might look something like this. \n&amp;#x201C;Ochlerotatus triseriatus occurrence in La Crosse County, WI\n
  20. I can now use these unique identifiers to make machine processable statements about these entities.\nThe statement &amp;#x201C;Ochlerotatus triseriatus is expected in La Crosse County, WI&amp;#x201D;\nCan now be represented as the following triple\n&amp;#xA0;&lt;http://lod.taxonconcept.org/ses/iuCXz#Species&gt;\n&amp;#xA0;&lt;http://lod.taxonconcept.org/ontology/txn.owl#isExpectedIn&gt;\n&amp;#xA0;&lt;http://sws.geonames.org/5258961/&gt; .\n&amp;#xA0;It is important to recognize that these statements are part of the database, but they are not there for humans to process, they are there so that it is clear to this system and others what we actually mean.\n
  21. \n
  22. Here are the different ways of representing that original triple.\n
  23. \n
  24. \n
  25. \n
  26. CKAN serves as a registry of data sets. It does not represent all linked data sets but it is the information source that is used to generate the LOD Cloud Diagram\n
  27. \n
  28. \n
  29. \n
  30. Triple or Quadstores have their own SQL-like query language called SPARQL\n
  31. Virtuoso has a human accessible iSPARQL interface\n
  32. \n
  33. \n
  34. Faceted view allows you to display the returned results in different ways.\n
  35. &amp;#x201C;The LOV dataset contains the description of RDFS vocabularies or OWL ontologies defined for and used by datasets in the Linked Data Cloud. Whenever available each vocabulary includes references to the datasets using it, in particular those listed in CKAN.&amp;#x201D;\n\nIt documents what vocabularies different data sets are using.\n\n
  36. Here are some examples of the early EoL LOD data. We have a taxon, the Giant Panda which links to several things most notably various types of data objects.\n\n
  37. Here is one such data object. It is an Image Object that links to the original and various other representations of the image. \n\n
  38. This shows a scientific name string, which is linked to three different data sets that include taxa with that same scientific name. If the name is a homonym it is possible to see that there are different kinds of concepts with that same name.\n\n
  39. Improvements in the quantity and quality of LOD data sets.\nImproved Alignment of Vocabularies\nImprovements in SPARQL and Quadstores\nHuman and Machine Interpretable Views Merged in RDFa\nBetter Visualization and Analysis Tools\n
  40. \n
  41. \n
  42. \n