• Save
Embrace The Chaos
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

Embrace The Chaos

on

  • 3,381 views

A controversial discussion of the utility of DBpedia as authority data with examples from a project at the Library of Congress. Part of an ExLibris-sponsored panel discussion at ALA Chicago 2009.

A controversial discussion of the utility of DBpedia as authority data with examples from a project at the Library of Congress. Part of an ExLibris-sponsored panel discussion at ALA Chicago 2009.

Statistics

Views

Total Views
3,381
Views on SlideShare
3,363
Embed Views
18

Actions

Likes
2
Downloads
0
Comments
0

5 Embeds 18

http://www.slideshare.net 9
http://www.plaxo.com 3
http://porzadek-rzeczy.bn.org.pl 3
http://www.linkedin.com 2
https://www.linkedin.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Embrace The Chaos Presentation Transcript

  • 1. Embrace the Chaos 7/11/2009 Ex Libris Authority Panel
  • 2. There are many ‘webs’ now
    • Hypertext web
      • Hyperlinked resources for humans
    • Semantic web
      • Resources for thinking machines
    • Linked Open Data web
      • Hyperlinked data resources for machines (thinking or not)
    7/11/2009 Ex Libris Authority Panel
  • 3. Just a bit about RDF
    • RDF ‘statements’ have a simple but formal grammar:
    • <subject> predicate object
    • <resource> property Value
    • <http://RDVocab.info/Elements> dc:title &quot;RDA Elements (Group 1)&quot;
    7/11/2009 Ex Libris Authority Panel
  • 4. RDF Vocabularies A predefined set of predicates 7/11/2009 Ex Libris Authority Panel
  • 5. SKOS
    • “ Simple Knowledge Organization System”
    “ Using SKOS, concepts can be identified using URIs, labeled with strings in one or more languages, assigned notations (codes), documented with various types of note, linked to other concepts and organized into hierarchies, aggregated into concept schemes , grouped into collections , and mapped to concepts in other schemes.” 7/11/2009 Ex Libris Authority Panel
  • 6. A trip down Linked Data lane Courtesy of The National Digital Newspaper Program (special thanks to Ed Summers, Library of Congress) 7/11/2009 Ex Libris Authority Panel
  • 7. Chronicling America
    • “… search and view newspaper pages from 1880-1922 and find information about American newspapers published between 1690-present.”
    7/11/2009 Ex Libris Authority Panel
  • 8. About this Newspaper: The Daytona Daily News
    • HTML http://chroniclingamerica.loc.gov/lccn/sn93063916/
    • RDF http://chroniclingamerica.loc.gov/lccn/sn93063916.rdf
    • MARC (HTML) http://chroniclingamerica.loc.gov/lccn/sn93063916/marc/
    • MARC (XML) http://chroniclingamerica.loc.gov/lccn/sn93063916/marc.xml
    • WorldCat (HTML only?) http://www.worldcat.org/oclc/1631353
    7/11/2009 Ex Libris Authority Panel
  • 9. Un-Linked Data
    • MARC21 has a naming convention for place names … 752 $a United States $b Florida $c Volusia $d Daytona Beach.
    • Wikipedia also has a naming convention for place names … http://en.wikipedia.org/wiki/Daytona_Beach,_Florida
    • Ed created a little script to use the 752 hierarchy to build a Wikipedia URL and see if it would resolve as a URI from DBpedia…
    7/11/2009 Ex Libris Authority Panel
  • 10. Linked Data
    • Dbpedia: <dcterms:coverage rdf:resource= http://dbpedia.org/resource/Daytona_Beach%2C_Florida />
    • Geonames: <dcterms:coverage rdf:resource= http://sws.geonames.org/4152872/ />
    7/11/2009 Ex Libris Authority Panel
  • 11. DBpedia
    • HTML http://dbpedia.org/ page /Daytona_Beach%2C_Florida
    • RDF http://dbpedia.org/ resource /Daytona_Beach%2C_Florida
    • The data source -- Wikipedia (HTML) http://en.wikipedia.org/wiki/Daytona_Beach,_Florida
    7/11/2009 Ex Libris Authority Panel
  • 12. DBpedia
    • “ The DBpedia knowledge base currently describes more than 2.6 million things, including at least…
    • 213,000 persons
    • 328,000 places
    • 57,000 music albums
    • 36,000 films
    • 20,000 companies.”
    7/11/2009 Ex Libris Authority Panel
  • 13. DBpedia
    • Dbpedia is “a community effort to extract structured information from Wikipedia and to make this information available on the Web.”
    7/11/2009 Ex Libris Authority Panel
  • 14. Dbpedia (a taste)
    • dbpedia-owl:areaCode 386
    • dbpedia-owl: areaLand 152.0 (58.68)
    • dbpedia-owl:areaTotal 168.2 (64.93)
    • dbpedia-owl:areaWater 16.2 (6.25)
    • dbpedia-owl:elevation 9 (4)
    • dbpedia-owl:establishedTitle Incorporated
    • dbpedia-owl:foundingDate 1876-07
    7/11/2009 Ex Libris Authority Panel
  • 15. Dbpedia (a taste)
    • What the real data looks like…
    • <dbpedia-owl: areaLand rdf: datatype =&quot;http://dbpedia.org/ontology/ squareKilometr e&quot; xmlns:dbpedia-owl=&quot;http://dbpedia.org/ontology/&quot;>
    • 152.0
    • </dbpedia-owl:areaLand>
    • <dbpedia-owl: areaLand rdf: datatype =&quot;http://dbpedia.org/ontology/ squareMile &quot; xmlns:dbpedia-owl=&quot;http://dbpedia.org/ontology/&quot;>
    • 58.68
    • </dbpedia-owl:areaLand>
    7/11/2009 Ex Libris Authority Panel
  • 16. Dbpedia (a taste)
    • dbpprop:abstract
    • Daytona Beach is a city in Volusia County, Florida, USA. According to 2006 U.S. Census Bureau estimates, the city has a population of 64,421. Daytona Beach ... »more» (en)
    • Daytona Beach er en by i Volusia County i Florida i USA. I 2004 blev indbyggertallet opgjort til 64. 422 personer. Daytona er kendt som ferieby, og er ... »more» (da)
    • Daytona Beach (playa de Daytona) es un municipio de Florida en los Estados Unidos de América. Su población es de 67. 000 habitantes en 2004 y su principal ... »more» (es)
    • Daytona Beach ist eine Stadt an der Ostküste Floridas direkt am Atlantik in Volusia County. (de)
    7/11/2009 Ex Libris Authority Panel
  • 17. Dbpedia (a taste)
    • dbpprop:abstract
    • Daytona Beach on kaupunki Yhdysvalloissa Floridan osavaltiossa. Vuonna 2000 kaupungissa oli 64 122 asukasta, ja siellä kävi lähes kahdeksan miljoonaa turistia. ... »more» (fi)
    • Daytona Beach est une ville dans l’État de Floride, États-Unis. Au recensement de 2000, elle comptait 64 112 habitants et s’étendait sur près de 152 km², ... »more» (fr)
    • Daytona Beach è un comune degli Stati Uniti d'America, nella Contea di Volusia, nello stato della Florida. La città è conosciuta per il suo motorsport, ... »more» (it)
    7/11/2009 Ex Libris Authority Panel
  • 18. Dbpedia (a taste)
    • dbpprop:abstract
    • デ イトナビーチ ( Daytona Beach) は、アメリカ合衆国フロリダ州ヴォルシア郡に位置する都市である。 2004 年現在、アメリカ合衆国統計局によって記録された人口見込みは 64,422 人で ある。 デイトナビーチの広大な砂浜は、レース場やテストコースに最適であり、 1959 年にデイトナ・インターナショナル・スピードウェイが建設され使用されるよ うになるまで、全米のレーサーたちのメッカであった。レース場の近くにデイトナビーチ国際空港やエンブリ・リドル航空大学がある。 ... » more» (ja)
    • Daytona Beach is een plaats (city) in de Amerikaanse staat Florida, en valt bestuurlijk gezien onder Volusia County. (nl)
    7/11/2009 Ex Libris Authority Panel
  • 19. Dbpedia (a taste)
    • dbpprop:abstract
    • Daytona Beach - miasto w USA, w hrabstwie Volusia, w stanie Floryda, nad Oceanem Atlantyckim. Około 64,4 tys. mieszkańców. Rocznie przyjeżdża tu 8 mln. turystów. (pl)
    • Daytona Beach é uma cidade localizada no Estado americano de Flórida, no Condado de Volusia. Daytona Beach é sede da NASCAR, a maior associação automobilística do país. A cidade também sedia o principal circuito das competições da Stock Car norte-americana, o Daytona International Speedway. (pt)
    7/11/2009 Ex Libris Authority Panel
  • 20. Dbpedia (a taste)
    • dbpprop:abstract
    • Дейтона-Бич — город в штате Флорида, США. ( ru)
    • Daytona Beach är en ort i Volusia County i Florida i USA. 2004, uppgisk befolkningen till 64 422 personer. Daytona är en känd semesterort. Daytona är även ... »more» (sv)
    • 代托纳比奇( Daytona Beach ) 是美国佛罗里达州沃卢西亚县的一座城市。 根据 2000 年美国人口普查,代托纳比奇共有 64,112 人,其中白人占 62.33% 、非裔美国人占 32.75% 、亚裔美国人占 1.73% 。 ( zh)
    7/11/2009 Ex Libris Authority Panel
  • 21. Dbpedia (a taste)
    • rdfs:label
    • Daytona Beach, Florida (en)
    • Daytona Beach (da)
    • David Silverman (es)
    • Daytona Beach (es)
    • Daytona Beach (de)
    • Daytona Beach (fi)
    • Daytona Beach (fr)
    • Daytona Beach (it)
    7/11/2009 Ex Libris Authority Panel
  • 22. Dbpedia (a taste)
    • rdfs:label
    • デイトナビーチ ( ja)
    • Daytona Beach (nl)
    • Daytona Beach (pl)
    • Daytona Beach (pt)
    • Дейтона-Бич ( ru)
    • Daytona Beach, Florida (sv)
    • 代托纳比奇 ( 佛罗里达州 ) ( zh)
    7/11/2009 Ex Libris Authority Panel
  • 23. Dbpedia (a taste)
    • is dbpprop:birthPlace of
    • dbpedia:Marshall_Teague
    • dbpedia:Bruce_McNorton
    • dbpedia:Christie_Wolf
    • dbpedia:Kenneth_C._Griffin
    • dbpedia:T._T._Toliver
    • dbpedia:Vince_Carter
    • dbpedia:George_McCloud
    • dbpedia:Hank_Mizell
    7/11/2009 Ex Libris Authority Panel
  • 24. Dbpedia (a taste)
    • is dbpprop:deathPlace of
    • dbpedia:Marshall_Teague
    • dbpedia:John_Morgan_Walden
    • dbpedia:Orville_E._Babcock
    • dbpedia:Luigi_Beccali
    • dbpedia:Paul_de_Bruyn
    • dbpedia:Charles_Martin_Hall
    • dbpedia:Bill_France%2C_Jr.
    • dbpedia:Halina_Konopacka
    7/11/2009 Ex Libris Authority Panel
  • 25. Dbpedia (a taste)
    • is dbpprop:headquarters of
    • dbpedia:Trans-Florida_Airlines
    • dbpedia:NASCAR
    • dbpedia:Grand_American_Road_Racing_Association
    7/11/2009 Ex Libris Authority Panel
  • 26. Dbpedia (a taste)
    • is dbpprop:location of
    • dbpedia:Amos_Kling_House
    • dbpedia:Bartholomew_J._Donnelly_House
    • dbpedia:Costa_Del_Mar
    • dbpedia:Jackie_Robinson_Ballpark
    • dbpedia:Mary_McLeod_Bethune_Home
    • dbpedia:Olds_Hall
    • dbpedia:Daytona_Beach_International_Airport
    • dbpedia:Embry-Riddle_Aeronautical_University%2C_Worldwide
    7/11/2009 Ex Libris Authority Panel
  • 27. Dbpedia (even more data)
    • owl:sameAs
    • Rdfabout: The 2000 U.S. Census http://www.rdfabout.com/rdf/usgov/geo/us/fl/counties/volusia_county/daytona_beach
    • GeoNames http://sws.geonames.org/4152872/
    • Freebase fbase:Daytona Beach, Florida
    7/11/2009 Ex Libris Authority Panel
  • 28. Some open questions
    • There are hundreds, if not thousands of people tracking down place names in Wikipedia and making sure they are normalized and geo-referenced.
    • Is this crowd-sourced, Wikipedia data ‘authoritative’?
    • Who is ‘David Silverman’ and why is he a Spanish label for Daytona Beach?
    7/11/2009 Ex Libris Authority Panel
  • 29. Some open questions
    • Chronicling America’s data for the Daytona Beach Daily News references Dbpedia but there’s no corresponding reference in Dbpedia, even though there’s a ‘place’ where it could be referenced.
    • How do we make sure that happens?
    • Where’s the library data anyway?
    7/11/2009 Ex Libris Authority Panel
  • 30. Some open questions
    • DBpedia uses it’s own vocabulary for many statements, chooses to use skos:subject instead of dc:subject, foaf:name instead of dc:title.
    • Was there a specific reason for this choice?
    • What’s the best way to make sure everyone can understand these choices?
    7/11/2009 Ex Libris Authority Panel
  • 31. Some open questions
    • Linking to Other People’s Data is great, but...
    • What happens if the linked data disappears?
    • What happens if it’s hijacked?
    • If I cache it locally to solve those problems, how does my system know to refresh the cache?
    • And other freshness-related questions…
    7/11/2009 Ex Libris Authority Panel
  • 32. Things I could be talking about…
    • If I had 2 more hours (at least) …
    • Flickr Commons
    • CommonTag
    • Google Fusion Tables
    • RDFa – indexed by Google & Yahoo
    • Dipity
    • Bibapp
    • Crowdsourcing
    • Mashups
    7/11/2009 Ex Libris Authority Panel
  • 33. Thanks!
    • Jon Phipps
    • Metadata Management Associates http://managemetadata.com [email_address] +1 (607) 216-4899
    • The Open Metadata Registry “A place to register and manage vocabularies” http://metadataregistry.org
    7/11/2009 Ex Libris Authority Panel