SlideShare a Scribd company logo
The Semantic Web and
why Wikipedia should bother
          Jakob Voß




                       Wikimania 2007
            Taipei, Taiwan, 2007-08-03
Agenda

(1) The Semantic Web
(2) Wikipedia’s contribution
(3) Examples and problems
(4) Possible solutions
The Semantic Web

    Everything can be linked via its URI
●



    Every data in triples with typed links
●




                      Image taken from: Semantic Wikipedia (2006)
The Semantic Web

    Ontologies define
●


    common structures and rules
    More data is generated by aggregation
●


    and reasoning on distributed data from
    several sources
    Software agents understand your
●


    commands, aggregate, reason, decide
    and act independently (at least in theory)
Wikipedia’s contribution

    Largest source of freely available
●


    non-specialized data
    Templates and categories
●


    contain structured data
        Persondata
    –

        DBpedia.org
    –

        Geodata
    –

        ...
    –

    Semantic MediaWiki
●


    adds typed links and attributes
Aggregating and Reasoning
Aggregating and Reasoning

Which polish authors are currently
   most published in Germany?
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●


        List of published books by book vendors
    –
        or by the German National Library
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●


        National Library catalouge contains author
    –
        and uniquely identifies author by PND-ID
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●



    Polish authors
●


        German Wikipedia contains PND => article
    –

        Article linked via Interwiki => more articles
    –

        Biographical articles contain place of birth
    –

        Place of birth linked to country via category
    –
Aggregating and Reasoning

subject       predicate       object
Publication   published-in   Germany
Publication   has-author      Person
Person          born-in         Town
Town            place-in      Poland
Where is Poland?
Where is Poland?


          Somewhere here
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY




                               Or did you mean
                               Poland, Kiribati?
Poland around 1619




Polish-Lithuanian Commonwealth
Poland 1772...1793..1795
Poland 1945–
Where is Poland?

    Reality is complex, confusing, and fuzzy
●



    What’s the »default« Poland?
●



    Humans can look up context in Wikipedia
●



    Semantic Web only consists of statements
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●



    Barack Obama       2009-01-20 –
●
Example #2

    Presidents of the United States
    Bill Clinton        1993-01-20 – 2001-01-20
●



    George W. Bush      2001-01-20 – 2009-01-20
●



    Barack Obama        2009-01-20 – 2013-01-20
●



    A. Schwarzenegger   2013-01-20 –
●
Presidents of the United States

    George W. Bush       2001-01-20 – 2002-06-29
●



    Dick Cheney               07:09 – 09:24 a.m.
●



    George W. Bush       2002-06-29 – 2007-07-21
●



    Dick Cheney               07:14 – 09:21 a.m.
●



    George W. Bush      2007–07-21 –
●




                  Twice president of the US
                     (see 25th amendment)
Presidents of the United States

    The devil is in the details ;-)
●



    Automatic reasoning will
●


    give you inconvenient results
Example #3

 Finally a clear division




                        女性
男性
                        XX
XY
So let’s formalize...

owl:disjointWith
”Classes may be stated to be disjoint from
 each other. For example, Man and Woman
 can be stated to be disjoint classes. [...] a
 reasoner can deduce that if A is an
 instance of Man, then A is not an instance
 of Woman.“
OWL Web Ontology Language Guide
 http://www.w3.org/TR/owl-guide/
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female

                Gender identity
    Gender with which a person identifies
●


    independent from biological sex.
A clear division?

    Reality is far more complicated
●



    Many kinds of exceptions
●
Problems

    Clear divisions discriminate
●



    Discussion and context gets lost
●



    Example #4
●


     IF  your name = X
     AND X on a list of suspected terrorists
    THEN you have a problem
Not our problem?

    Ẁikipedia is already used as
●


    source by millions of people
    People can think, judge and ask,
●


    computers cannot
    We create definitions that will be used in
●


    thousands of applications
    Statistics lie
●


    Aggragation/resoning even lies better
Possible Solutions

    More of all (data, aggregation, reasoning)
●



    Less of all
●



    Statements about statements
●



    Fuzzy logic
●



    Data provenance / data lineage
●



    Allow exceptions
●



    Teach people to be careful
●



    Do not expect or believe simple answers
●



    It’s just dirty data
●
Summary

    Semantic Web is great
●



    Reality is based on exceptions
●



    Simplification is useful but dangerous
●



    Data POV != NPOV
●



    We also bear responsability for
●


    stupid use of Wikipedia data
    Never stop analyzing and thinking
●


    instead of relying on computers
More to read

    Shadbolt, Berners-Lee, and Hall: The Semantic Web
●


    Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101.
    May/June 2006.
    http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf

    Völkel, Krötzsch, Vrandecic, Haller, and Studer:
●


    Semantic Wikipedia. Proceedings of the WWW2006.
    http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id=

    Doctorow: Metacrap: Putting the torch to seven straw-
●


    men of the meta-utopia. August 2001.
    http://www.well.com/~doctorow/metacrap.htm

    Geoffrey and Star: Sorting Things Out: Classification
●


    and Its Consequences. MIT Press, 1999.

More Related Content

Similar to Jakob Voss Wikipedia2007

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009Tom Moritz
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classification
Dan Brickley
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
STIinnsbruck
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Cartegraph
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay Structure
Veronica Withers
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysis
Luke Czarnecki
 
Essay Writing Examples Uk
Essay Writing Examples UkEssay Writing Examples Uk
Essay Writing Examples Uk
Vanessa Henderson
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic Engagement
Andrew Lih
 
Coincidences
CoincidencesCoincidences
Coincidences
Robb Muirhead
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for Semantics
John Breslin
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Dominik Lukes
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0
Carsten Ullrich
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities research
Enrico Daga
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.ppt
SupreethaS8
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
STLab
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches Essay
Patty Buckley
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And UnderstandingAlexandre Linhares
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory Doctorow
George Grayson
 

Similar to Jakob Voss Wikipedia2007 (20)

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classification
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay Structure
 
Infooverload
InfooverloadInfooverload
Infooverload
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysis
 
Essay Writing Examples Uk
Essay Writing Examples UkEssay Writing Examples Uk
Essay Writing Examples Uk
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic Engagement
 
Coincidences
CoincidencesCoincidences
Coincidences
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for Semantics
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities research
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.ppt
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches Essay
 
Class 4
Class 4Class 4
Class 4
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory Doctorow
 

More from Bertalan Mesko, MD

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.com
Bertalan Mesko, MD
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma
Bertalan Mesko, MD
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual world
Bertalan Mesko, MD
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 Era
Bertalan Mesko, MD
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired Patients
Bertalan Mesko, MD
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0
Bertalan Mesko, MD
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student blogger
Bertalan Mesko, MD
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcare
Bertalan Mesko, MD
 
Medicine 2.0
Medicine 2.0Medicine 2.0
Medicine 2.0
Bertalan Mesko, MD
 

More from Bertalan Mesko, MD (9)

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.com
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual world
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 Era
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired Patients
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student blogger
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcare
 
Medicine 2.0
Medicine 2.0Medicine 2.0
Medicine 2.0
 

Recently uploaded

Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
Globus
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 

Recently uploaded (20)

Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 

Jakob Voss Wikipedia2007

  • 1. The Semantic Web and why Wikipedia should bother Jakob Voß Wikimania 2007 Taipei, Taiwan, 2007-08-03
  • 2. Agenda (1) The Semantic Web (2) Wikipedia’s contribution (3) Examples and problems (4) Possible solutions
  • 3. The Semantic Web Everything can be linked via its URI ● Every data in triples with typed links ● Image taken from: Semantic Wikipedia (2006)
  • 4. The Semantic Web Ontologies define ● common structures and rules More data is generated by aggregation ● and reasoning on distributed data from several sources Software agents understand your ● commands, aggregate, reason, decide and act independently (at least in theory)
  • 5. Wikipedia’s contribution Largest source of freely available ● non-specialized data Templates and categories ● contain structured data Persondata – DBpedia.org – Geodata – ... – Semantic MediaWiki ● adds typed links and attributes
  • 7. Aggregating and Reasoning Which polish authors are currently most published in Germany?
  • 8. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● List of published books by book vendors – or by the German National Library
  • 9. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● National Library catalouge contains author – and uniquely identifies author by PND-ID
  • 10. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● Polish authors ● German Wikipedia contains PND => article – Article linked via Interwiki => more articles – Biographical articles contain place of birth – Place of birth linked to country via category –
  • 11. Aggregating and Reasoning subject predicate object Publication published-in Germany Publication has-author Person Person born-in Town Town place-in Poland
  • 13. Where is Poland? Somewhere here
  • 14. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY
  • 15. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY Or did you mean Poland, Kiribati?
  • 19. Where is Poland? Reality is complex, confusing, and fuzzy ● What’s the »default« Poland? ● Humans can look up context in Wikipedia ● Semantic Web only consists of statements ●
  • 20. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ●
  • 21. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ●
  • 22. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – ●
  • 23. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – 2013-01-20 ● A. Schwarzenegger 2013-01-20 – ●
  • 24. Presidents of the United States George W. Bush 2001-01-20 – 2002-06-29 ● Dick Cheney 07:09 – 09:24 a.m. ● George W. Bush 2002-06-29 – 2007-07-21 ● Dick Cheney 07:14 – 09:21 a.m. ● George W. Bush 2007–07-21 – ● Twice president of the US (see 25th amendment)
  • 25. Presidents of the United States The devil is in the details ;-) ● Automatic reasoning will ● give you inconvenient results
  • 26. Example #3 Finally a clear division 女性 男性 XX XY
  • 27. So let’s formalize... owl:disjointWith ”Classes may be stated to be disjoint from each other. For example, Man and Woman can be stated to be disjoint classes. [...] a reasoner can deduce that if A is an instance of Man, then A is not an instance of Woman.“ OWL Web Ontology Language Guide http://www.w3.org/TR/owl-guide/
  • 28. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ●
  • 29. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female
  • 30. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female Gender identity Gender with which a person identifies ● independent from biological sex.
  • 31. A clear division? Reality is far more complicated ● Many kinds of exceptions ●
  • 32. Problems Clear divisions discriminate ● Discussion and context gets lost ● Example #4 ● IF your name = X AND X on a list of suspected terrorists THEN you have a problem
  • 33. Not our problem? Ẁikipedia is already used as ● source by millions of people People can think, judge and ask, ● computers cannot We create definitions that will be used in ● thousands of applications Statistics lie ● Aggragation/resoning even lies better
  • 34. Possible Solutions More of all (data, aggregation, reasoning) ● Less of all ● Statements about statements ● Fuzzy logic ● Data provenance / data lineage ● Allow exceptions ● Teach people to be careful ● Do not expect or believe simple answers ● It’s just dirty data ●
  • 35. Summary Semantic Web is great ● Reality is based on exceptions ● Simplification is useful but dangerous ● Data POV != NPOV ● We also bear responsability for ● stupid use of Wikipedia data Never stop analyzing and thinking ● instead of relying on computers
  • 36. More to read Shadbolt, Berners-Lee, and Hall: The Semantic Web ● Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101. May/June 2006. http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf Völkel, Krötzsch, Vrandecic, Haller, and Studer: ● Semantic Wikipedia. Proceedings of the WWW2006. http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id= Doctorow: Metacrap: Putting the torch to seven straw- ● men of the meta-utopia. August 2001. http://www.well.com/~doctorow/metacrap.htm Geoffrey and Star: Sorting Things Out: Classification ● and Its Consequences. MIT Press, 1999.