
Embrace The Chaos

A controversial discussion of the utility of DBpedia as authority data with examples from a project at the Library of Congress. Part of an ExLibris-sponsored panel discussion at ALA Chicago 2009.

Published in: Technology, Education

Transcript of "Embrace The Chaos"

  1. Embrace the Chaos
     7/11/2009 Ex Libris Authority Panel
  2. There are many ‘webs’ now
     • Hypertext web
         • Hyperlinked resources for humans
     • Semantic web
         • Resources for thinking machines
     • Linked Open Data web
         • Hyperlinked data resources for machines (thinking or not)
  3. Just a bit about RDF
     • RDF ‘statements’ have a simple but formal grammar:
     • <subject> predicate object
     • <resource> property Value
     • <http://RDVocab.info/Elements> dc:title "RDA Elements (Group 1)"
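The statement grammar on this slide can be sketched in a few lines of Python. This is only an illustration, not part of the original talk: the `Triple` type and `to_ntriples` helper are hypothetical names, the object is assumed to be a plain string literal, and `dc:title` is expanded using the standard Dublin Core elements namespace.

```python
from typing import NamedTuple

# Namespace that the "dc:" prefix conventionally abbreviates.
DC = "http://purl.org/dc/elements/1.1/"


class Triple(NamedTuple):
    """One RDF statement: <subject> predicate object."""
    subject: str    # URI of the resource being described
    predicate: str  # URI of the property
    obj: str        # the value (a plain string literal in this sketch)


def to_ntriples(t: Triple) -> str:
    """Serialize a statement with a literal object as one N-Triples line."""
    escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
    return f'<{t.subject}> <{t.predicate}> "{escaped}" .'


# The example statement from the slide.
statement = Triple(
    "http://RDVocab.info/Elements",
    DC + "title",
    "RDA Elements (Group 1)",
)
print(to_ntriples(statement))
```

Writing the predicate out as a full URI shows that a prefix such as `dc:` is just shorthand for a namespace; the statement itself is three parts and nothing more.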
  4. RDF Vocabularies
     A predefined set of predicates
  5. SKOS
     • “Simple Knowledge Organization System”
     “Using SKOS, concepts can be identified using URIs, labeled with strings in one or more languages, assigned notations (codes), documented with various types of note, linked to other concepts and organized into hierarchies, aggregated into concept schemes, grouped into collections, and mapped to concepts in other schemes.”
  6. A trip down Linked Data lane
     Courtesy of The National Digital Newspaper Program (special thanks to Ed Summers, Library of Congress)
  7. Chronicling America
     • “… search and view newspaper pages from 1880-1922 and find information about American newspapers published between 1690-present.”
  8. About this Newspaper: The Daytona Daily News
     • HTML: http://chroniclingamerica.loc.gov/lccn/sn93063916/
     • RDF: http://chroniclingamerica.loc.gov/lccn/sn93063916.rdf
     • MARC (HTML): http://chroniclingamerica.loc.gov/lccn/sn93063916/marc/
     • MARC (XML): http://chroniclingamerica.loc.gov/lccn/sn93063916/marc.xml
     • WorldCat (HTML only?): http://www.worldcat.org/oclc/1631353
  9. Un-Linked Data
     • MARC21 has a naming convention for place names … 752 $a United States $b Florida $c Volusia $d Daytona Beach.
     • Wikipedia also has a naming convention for place names … http://en.wikipedia.org/wiki/Daytona_Beach,_Florida
     • Ed created a little script to use the 752 hierarchy to build a Wikipedia URL and see if it would resolve as a URI from DBpedia…
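The slide does not show Ed's script, so the following is only a guess at its general shape, not his code. It assumes the Wikipedia title is formed as "city, state" from subfields $d and $b, and every function name here is hypothetical. The step of actually requesting the URI to see whether it resolves is left out so the sketch runs offline.

```python
from urllib.parse import quote


def parse_752(field: str) -> dict:
    """Split a '752 $a ... $b ...' string into {subfield code: value}."""
    subfields = {}
    for chunk in field.split("$")[1:]:          # skip the '752 ' tag itself
        code, _, value = chunk.partition(" ")
        subfields[code] = value.strip().rstrip(".")
    return subfields


def candidate_title(subfields: dict) -> str:
    """Assumed heuristic: Wikipedia names US cities 'City,_State'."""
    return f"{subfields['d']}, {subfields['b']}".replace(" ", "_")


def wikipedia_url(title: str) -> str:
    return "http://en.wikipedia.org/wiki/" + title


def dbpedia_uri(title: str) -> str:
    # The DBpedia URIs shown in this deck percent-encode the comma as %2C.
    return "http://dbpedia.org/resource/" + quote(title, safe="_")


field = "752 $a United States $b Florida $c Volusia $d Daytona Beach."
title = candidate_title(parse_752(field))
print(wikipedia_url(title))  # http://en.wikipedia.org/wiki/Daytona_Beach,_Florida
print(dbpedia_uri(title))    # http://dbpedia.org/resource/Daytona_Beach%2C_Florida
```

A real script would then dereference the candidate URI and keep the link only if DBpedia answers for it; places whose Wikipedia titles do not follow the "City,_State" pattern would need other rules.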
  10. Linked Data
     • DBpedia: <dcterms:coverage rdf:resource="http://dbpedia.org/resource/Daytona_Beach%2C_Florida" />
     • Geonames: <dcterms:coverage rdf:resource="http://sws.geonames.org/4152872/" />
  11. DBpedia
     • HTML: http://dbpedia.org/page/Daytona_Beach%2C_Florida
     • RDF: http://dbpedia.org/resource/Daytona_Beach%2C_Florida
     • The data source -- Wikipedia (HTML): http://en.wikipedia.org/wiki/Daytona_Beach,_Florida
  12. DBpedia
     • “The DBpedia knowledge base currently describes more than 2.6 million things, including at least…
     • 213,000 persons
     • 328,000 places
     • 57,000 music albums
     • 36,000 films
     • 20,000 companies.”
  13. DBpedia
     • DBpedia is “a community effort to extract structured information from Wikipedia and to make this information available on the Web.”
  14. DBpedia (a taste)
     • dbpedia-owl:areaCode 386
     • dbpedia-owl:areaLand 152.0 (58.68)
     • dbpedia-owl:areaTotal 168.2 (64.93)
     • dbpedia-owl:areaWater 16.2 (6.25)
     • dbpedia-owl:elevation 9 (4)
     • dbpedia-owl:establishedTitle Incorporated
     • dbpedia-owl:foundingDate 1876-07
  15. DBpedia (a taste)
     • What the real data looks like…
       <dbpedia-owl:areaLand rdf:datatype="http://dbpedia.org/ontology/squareKilometre" xmlns:dbpedia-owl="http://dbpedia.org/ontology/">
       152.0
       </dbpedia-owl:areaLand>
       <dbpedia-owl:areaLand rdf:datatype="http://dbpedia.org/ontology/squareMile" xmlns:dbpedia-owl="http://dbpedia.org/ontology/">
       58.68
       </dbpedia-owl:areaLand>
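To make the point about typed values concrete, here is a minimal sketch of how a client might read the two `areaLand` values and tell them apart by datatype. It uses only Python's standard XML parser rather than an RDF library. The wrapping `<root>` element and the `area_by_unit` name are assumptions added so the fragment parses on its own; the rest of the snippet is the data from the slide.

```python
import xml.etree.ElementTree as ET

# The two areaLand elements from the slide, wrapped in a made-up root element
# that declares the namespaces (an assumption; the real document differs).
SNIPPET = """<root xmlns:dbpedia-owl="http://dbpedia.org/ontology/"
      xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <dbpedia-owl:areaLand rdf:datatype="http://dbpedia.org/ontology/squareKilometre">152.0</dbpedia-owl:areaLand>
  <dbpedia-owl:areaLand rdf:datatype="http://dbpedia.org/ontology/squareMile">58.68</dbpedia-owl:areaLand>
</root>"""

# ElementTree spells qualified names as {namespace}localname.
RDF_DATATYPE = "{http://www.w3.org/1999/02/22-rdf-syntax-ns#}datatype"
AREA_LAND = "{http://dbpedia.org/ontology/}areaLand"


def area_by_unit(xml_text: str) -> dict:
    """Return {unit name: value} for every areaLand element found."""
    root = ET.fromstring(xml_text)
    areas = {}
    for element in root.iter(AREA_LAND):
        # The unit is the last path segment of the datatype URI.
        unit = element.get(RDF_DATATYPE).rsplit("/", 1)[-1]
        areas[unit] = float(element.text)
    return areas


print(area_by_unit(SNIPPET))
```

The same property appears twice with different datatypes, so a consumer that ignores `rdf:datatype` cannot tell square kilometres from square miles.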
  16. DBpedia (a taste)
     • dbpprop:abstract
     • Daytona Beach is a city in Volusia County, Florida, USA. According to 2006 U.S. Census Bureau estimates, the city has a population of 64,421. Daytona Beach ... »more» (en)
     • Daytona Beach er en by i Volusia County i Florida i USA. I 2004 blev indbyggertallet opgjort til 64.422 personer. Daytona er kendt som ferieby, og er ... »more» (da)
     • Daytona Beach (playa de Daytona) es un municipio de Florida en los Estados Unidos de América. Su población es de 67.000 habitantes en 2004 y su principal ... »more» (es)
     • Daytona Beach ist eine Stadt an der Ostküste Floridas direkt am Atlantik in Volusia County. (de)
  17. DBpedia (a taste)
     • dbpprop:abstract
     • Daytona Beach on kaupunki Yhdysvalloissa Floridan osavaltiossa. Vuonna 2000 kaupungissa oli 64 122 asukasta, ja siellä kävi lähes kahdeksan miljoonaa turistia. ... »more» (fi)
     • Daytona Beach est une ville dans l’État de Floride, États-Unis. Au recensement de 2000, elle comptait 64 112 habitants et s’étendait sur près de 152 km², ... »more» (fr)
     • Daytona Beach è un comune degli Stati Uniti d'America, nella Contea di Volusia, nello stato della Florida. La città è conosciuta per il suo motorsport, ... »more» (it)
  18. DBpedia (a taste)
     • dbpprop:abstract
     • デイトナビーチ (Daytona Beach) は、アメリカ合衆国フロリダ州ヴォルシア郡に位置する都市である。2004年現在、アメリカ合衆国統計局によって記録された人口見込みは64,422人である。デイトナビーチの広大な砂浜は、レース場やテストコースに最適であり、1959年にデイトナ・インターナショナル・スピードウェイが建設され使用されるようになるまで、全米のレーサーたちのメッカであった。レース場の近くにデイトナビーチ国際空港やエンブリ・リドル航空大学がある。 ... »more» (ja)
     • Daytona Beach is een plaats (city) in de Amerikaanse staat Florida, en valt bestuurlijk gezien onder Volusia County. (nl)
  19. DBpedia (a taste)
     • dbpprop:abstract
     • Daytona Beach - miasto w USA, w hrabstwie Volusia, w stanie Floryda, nad Oceanem Atlantyckim. Około 64,4 tys. mieszkańców. Rocznie przyjeżdża tu 8 mln. turystów. (pl)
     • Daytona Beach é uma cidade localizada no Estado americano de Flórida, no Condado de Volusia. Daytona Beach é sede da NASCAR, a maior associação automobilística do país. A cidade também sedia o principal circuito das competições da Stock Car norte-americana, o Daytona International Speedway. (pt)
  20. DBpedia (a taste)
     • dbpprop:abstract
     • Дейтона-Бич — город в штате Флорида, США. (ru)
     • Daytona Beach är en ort i Volusia County i Florida i USA. 2004, uppgisk befolkningen till 64 422 personer. Daytona är en känd semesterort. Daytona är även ... »more» (sv)
     • 代托纳比奇 (Daytona Beach) 是美国佛罗里达州沃卢西亚县的一座城市。根据2000年美国人口普查,代托纳比奇共有64,112人,其中白人占62.33%、非裔美国人占32.75%、亚裔美国人占1.73%。 (zh)
  21. DBpedia (a taste)
     • rdfs:label
     • Daytona Beach, Florida (en)
     • Daytona Beach (da)
     • David Silverman (es)
     • Daytona Beach (es)
     • Daytona Beach (de)
     • Daytona Beach (fi)
     • Daytona Beach (fr)
     • Daytona Beach (it)
  22. DBpedia (a taste)
     • rdfs:label
     • デイトナビーチ (ja)
     • Daytona Beach (nl)
     • Daytona Beach (pl)
     • Daytona Beach (pt)
     • Дейтона-Бич (ru)
     • Daytona Beach, Florida (sv)
     • 代托纳比奇 (佛罗里达州) (zh)
  23. DBpedia (a taste)
     • is dbpprop:birthPlace of
     • dbpedia:Marshall_Teague
     • dbpedia:Bruce_McNorton
     • dbpedia:Christie_Wolf
     • dbpedia:Kenneth_C._Griffin
     • dbpedia:T._T._Toliver
     • dbpedia:Vince_Carter
     • dbpedia:George_McCloud
     • dbpedia:Hank_Mizell
  24. DBpedia (a taste)
     • is dbpprop:deathPlace of
     • dbpedia:Marshall_Teague
     • dbpedia:John_Morgan_Walden
     • dbpedia:Orville_E._Babcock
     • dbpedia:Luigi_Beccali
     • dbpedia:Paul_de_Bruyn
     • dbpedia:Charles_Martin_Hall
     • dbpedia:Bill_France%2C_Jr.
     • dbpedia:Halina_Konopacka
  25. DBpedia (a taste)
     • is dbpprop:headquarters of
     • dbpedia:Trans-Florida_Airlines
     • dbpedia:NASCAR
     • dbpedia:Grand_American_Road_Racing_Association
  26. DBpedia (a taste)
     • is dbpprop:location of
     • dbpedia:Amos_Kling_House
     • dbpedia:Bartholomew_J._Donnelly_House
     • dbpedia:Costa_Del_Mar
     • dbpedia:Jackie_Robinson_Ballpark
     • dbpedia:Mary_McLeod_Bethune_Home
     • dbpedia:Olds_Hall
     • dbpedia:Daytona_Beach_International_Airport
     • dbpedia:Embry-Riddle_Aeronautical_University%2C_Worldwide
  27. DBpedia (even more data)
     • owl:sameAs
     • Rdfabout: The 2000 U.S. Census http://www.rdfabout.com/rdf/usgov/geo/us/fl/counties/volusia_county/daytona_beach
     • GeoNames http://sws.geonames.org/4152872/
     • Freebase fbase:Daytona Beach, Florida
  28. Some open questions
     • There are hundreds, if not thousands, of people tracking down place names in Wikipedia and making sure they are normalized and geo-referenced.
     • Is this crowd-sourced Wikipedia data ‘authoritative’?
     • Who is ‘David Silverman’ and why is he a Spanish label for Daytona Beach?
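One way a consumer of this data could notice the ‘David Silverman’ problem automatically is to flag any language that carries more than one distinct rdfs:label. The sketch below is an assumption about how such a check might look, not something shown in the talk. The label list is copied from the two label slides, and `conflicting_labels` is a hypothetical helper.

```python
from collections import defaultdict

# (label, language tag) pairs as listed on the rdfs:label slides.
LABELS = [
    ("Daytona Beach, Florida", "en"), ("Daytona Beach", "da"),
    ("David Silverman", "es"), ("Daytona Beach", "es"),
    ("Daytona Beach", "de"), ("Daytona Beach", "fi"),
    ("Daytona Beach", "fr"), ("Daytona Beach", "it"),
    ("デイトナビーチ", "ja"), ("Daytona Beach", "nl"),
    ("Daytona Beach", "pl"), ("Daytona Beach", "pt"),
    ("Дейтона-Бич", "ru"), ("Daytona Beach, Florida", "sv"),
    ("代托纳比奇 (佛罗里达州)", "zh"),
]


def conflicting_labels(labels):
    """Return {language: sorted labels} for languages with more than one label."""
    by_language = defaultdict(set)
    for text, language in labels:
        by_language[language].add(text)
    return {
        language: sorted(texts)
        for language, texts in by_language.items()
        if len(texts) > 1
    }


print(conflicting_labels(LABELS))  # {'es': ['David Silverman', 'Daytona Beach']}
```

A check like this only says that something looks odd. Deciding which label is wrong, and getting it fixed at the source, is still a human question.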
  29. Some open questions
     • Chronicling America’s data for the Daytona Beach Daily News references DBpedia, but there’s no corresponding reference in DBpedia, even though there’s a ‘place’ where it could be referenced.
     • How do we make sure that happens?
     • Where’s the library data anyway?
  30. Some open questions
     • DBpedia uses its own vocabulary for many statements, and chooses skos:subject instead of dc:subject and foaf:name instead of dc:title.
     • Was there a specific reason for this choice?
     • What’s the best way to make sure everyone can understand these choices?
  31. Some open questions
     • Linking to Other People’s Data is great, but...
     • What happens if the linked data disappears?
     • What happens if it’s hijacked?
     • If I cache it locally to solve those problems, how does my system know to refresh the cache?
     • And other freshness-related questions…
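The caching question can be made concrete with a small sketch. It assumes the simplest possible policy, which is not one proposed in the talk: treat a local copy as fresh for a fixed number of seconds, refetch after that, and fall back to the stale copy if the remote data has disappeared. `LinkedDataCache` and its `fetch` callable are hypothetical. A real system would more likely rely on HTTP validators such as `ETag` or `Last-Modified` than on a fixed age.

```python
import time
from typing import Callable, Dict, Tuple


class LinkedDataCache:
    """Keep local copies of remote descriptions, refreshed after max_age_seconds."""

    def __init__(self, fetch: Callable[[str], str], max_age_seconds: float,
                 clock: Callable[[], float] = time.time) -> None:
        self._fetch = fetch              # caller-supplied function: URI -> body
        self._max_age = max_age_seconds
        self._clock = clock              # injectable so the policy can be tested
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, uri: str) -> str:
        now = self._clock()
        entry = self._entries.get(uri)
        if entry is not None and now - entry[0] < self._max_age:
            return entry[1]              # still fresh: no network access
        try:
            body = self._fetch(uri)
        except Exception:
            if entry is not None:
                return entry[1]          # remote is gone: serve the stale copy
            raise                        # nothing cached, nothing to fall back on
        self._entries[uri] = (now, body)
        return body
```

This covers disappearance and freshness only. It does nothing about hijacked data, since a refetch would simply replace a good copy with whatever the URI now returns.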
  32. Things I could be talking about…
     • If I had 2 more hours (at least) …
     • Flickr Commons
     • CommonTag
     • Google Fusion Tables
     • RDFa – indexed by Google & Yahoo
     • Dipity
     • Bibapp
     • Crowdsourcing
     • Mashups
  33. Thanks!
     • Jon Phipps
     • Metadata Management Associates http://managemetadata.com [email_address] +1 (607) 216-4899
     • The Open Metadata Registry “A place to register and manage vocabularies” http://metadataregistry.org
