Www2008, Semantic Marc
 

  • Final copy is in the proceedings of WWW2008.
  • It's running off this stuff. We do a bit of work to it... Looking at it, this makes my eyes go fuzzy.
  • Is that better? What we do is we keep the field data...
  • We've done work on leader elements, indicators and fixed format fields, but today to keep things simple I'm just taking basic field data. Let's throw the rest away for now.
  • When you see white boxes like this I mean that the data model contains literal text strings. Let's move them around a bit to make space for something.
  • Because we're describing the thing the record was about, rather than the record itself, we can use different labels. Let's take just one of these relationships.
  • The problem becomes clear if we move something out of the way, to make room for something else.
  • Something else has a title of "Harry Potter and the Order of the Phoenix". Unsurprisingly, something and something else have a lot in common.
  • This approach is what I would call record-centric; we haven't got any more meaning than was in the original record. Let's talk that through as an example. [click] The two books are both by JK Rowling, but this is not JK Rowling, this is a string of text. This is JK Rowling...
  • OK, so it's not JK Rowling, it's a picture of JK Rowling. I asked, but she wouldn't come. The point is that the words "Rowling, JK" and the author known as JK Rowling are different things. Not a new thought, I know...
  • This man made great use of new technology back in the 1800s. He represented authors, titles and subjects as abstract concepts, with each author being recognised as distinct even if they shared the same name. Subjects were recognised as different even if they shared common terms, and more. He wrote about it...
  • ...in this great book. Anyone read it? The technology he was taking advantage of?
  • The card... What I'm trying to do with the data is not really different to the introduction of card catalogs over bound volumes. So, let's get back to the present day...
  • Focussing on JK for the moment, let's throw the rest away.
  • Remember I said a resource could represent anything? Well, let's use one to represent the author...
  • Let's get rid of the strings and just leave the concepts; that makes things look simpler. Now, one of the things that has gone unsaid so far is that something and something else came from monograph records, so we know that they are both books.
  • dbpedia, geonames (Embrun, France), CIA Factbook, MusicBrainz/Zitgist (Diana Ross), DBLP (Advances in Orthonormalizing Computation.), RDF BookMashup (Weaving the Web), RDF BookMashup
  • URIs are a superset of what we can do with primary keys.
  • We take the data elements
  • Take the algorithms used for lookups. Turn them around to make hashes. Keep the essence of the natural data. Discard syntactic differences. If too long (or you like GUIDs), MD5 them.
  • Example data from Wikipedia disambiguation page for 'Stardust':
    • Stardust (novel), an illustrated novel by Neil Gaiman and Charles Vess
    • Stardust (2007 film), the film adaptation of the above book
    • Stardust (1990 novel), a 1990 novel by Robert B. Parker
    • Stardust (1974 film), a film about a rock star, starring David Essex
    • Stardust (1940 film), a film with Linda Darnell and John Payne
    • Stardust (game), a shoot 'em up computer game
    • Stardust (magazine), a popular Indian film magazine

Www2008, Semantic Marc Presentation Transcript

  • Semantic MARC. or, How Charles Ammi Cutter was using RDF in 1867
    • Rob Styles & Nadeem Shabir
    shared innovation ™ Photo Copyright © Karen Coyle, reproduced with permission
  • http://events.linkeddata.org/ldow2008/#program
  • Photos Copyright © Rob Styles, Nadeem Shabir and Danny Ayers, reproduced with permission
  • Photos Copyright © Rob Styles, Nadeem Shabir and Danny Ayers, reproduced with permission http://www.youtube.com/watch?v=6eGcsGPgUTw
    • Resources v Literals
    • Synthetic or Natural Keys
    • Dealing with Ambiguity
    • 00673nam a2200217 a 45040010033000000030009000330 0500170004200800410005901500190010002000170011903 5001700136040003100153082001600184100001900200245 0062002192600033002813000020003146500060003346500 031003946550030004259cbbe7fc3a7346d99c281979d45b6 79cUK-BiTAL20050705133033.0990831s1999    enk     j      000 ||eng|d  aGB99Y57412bnb  a0747542155 :   a()0747542155  aStDuBDScStDuBDSdUK­BiTAL04a823. 9142211 aRowling, J. K.00aHarry Potter and the Pr isoner of Azkaban /cJ.K. Rowling. aLondon :bBloom sbury,c1999.  a317p. ;c21 cm. 0aPotter, Harry (Fi ctitious character)vJuvenile fiction. 0aWizardsvJ uvenile fiction. 7aChildren's stories.2lcsh
    • =LDR 00673nam a2200217 a 4504
      =001 9cbbe7fc3a7346d99c281979d45b679c
      =003 UK-BiTAL
      =005 20050705133033.0
      =008 990831s1999enk j00||eng|d
      =015 aGB99Y5741$2bnb
      =020 a0747542155 :
      =035 a()0747542155
      =040 aStDuBDS$cStDuBDS$dUK-BiTAL
      =082 04$a823.914$221
      =100 1$aRowling, J. K.
      =245 00$aHarry Potter and the Prisoner of Azkaban /$cJ.K. Rowling.
      =260 aLondon :$bBloomsbury,$c1999.
      =300 a317p. ;$c21 cm.
      =650 $aPotter, Harry (Fictitious character)$vJuvenile fiction.
      =650 $aWizards$vJuvenile fiction.
      =655 7$aChildren's stories.$2lcsh
  • Rowling, J. K.; Harry Potter and the Prisoner of Azkaban; 0747542155; Potter, Harry (Fictitious character); Wizards; Juvenile fiction; Children's stories
  • something
    • title: Harry Potter and the Prisoner of Azkaban
    • author: Rowling, J. K.
    • isbn: 0747542155
    • subject: Potter, Harry (Fictitious character)
    • subject: Wizards
    • genre: Juvenile fiction
    • genre: Children's stories
  • something: Rowling, J. K.; Harry Potter and the Prisoner of Azkaban; 0747542155; Potter, Harry (Fictitious character); Wizards; Juvenile fiction; Children's stories
    something else: Juvenile fiction; Rowling, J. K.; Wizards; Children's stories; Potter, Harry (Fictitious character); Harry Potter and the Order of the Phoenix; 0747551006
  • J. K. Rowling, Copyright © Steven Hill, 2007 - Released under Creative Commons CC-BY-SA - http://commons.wikimedia.org/wiki/Image:Jk-rowling-crop.JPG Joanne K. Rowling
  • Photograph circa 1880, Photographer unknown. Charles Ammi Cutter (1837 – 1903)
  • Cutter, C. A., Rules for a Dictionary Catalog, Available at: http://digital.library.unt.edu/permalink/meta-dc-1048
  • Photo Copyright © Karen Coyle, reproduced with permission
  • something: Rowling, J. K.; Harry Potter and the Prisoner of Azkaban; 0747542155; Potter, Harry (Fictitious character); Wizards; Juvenile fiction; Children's stories
    something else: Juvenile fiction; Rowling, J. K.; Wizards; Children's stories; Potter, Harry (Fictitious character); Harry Potter and the Order of the Phoenix; 0747551006
  • Rowling, J. K.; something; something else; the author known as J. K. Rowling
  • Harry ... Phoenix; Rowling, J. K.; Potter, Harry (Fictitious character); Harry ... Azkaban; wizards
    • the words "Harry ... Phoenix" used as a title
    • something
    • something else
    • the author known as J. K. Rowling
    • the fictitious wizard known as Harry Potter
    • the words "Harry ... Azkaban" used as a title
    • the concept of magical wizards
    • book
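
The difference between pointing at a string and pointing at a resource can be sketched in a few lines of Python. This is only an illustration under assumptions: the `ex:` identifiers, the `author` and `name` property labels, and the helper `author_names` are all made up for the example and are not the data model used in the talk.

```python
# Triples are (subject, property, value).
# Values starting with "ex:" are hypothetical resource identifiers;
# every other value is a plain literal string.

# Record-centric: each book carries its own copy of the author string.
record_centric = [
    ("ex:something", "author", "Rowling, J. K."),
    ("ex:something-else", "author", "Rowling, J. K."),
]

# Resource-centric: both books point at one resource for the author,
# and the strings become names attached to that resource.
resource_centric = [
    ("ex:something", "author", "ex:author-jk-rowling"),
    ("ex:something-else", "author", "ex:author-jk-rowling"),
    ("ex:author-jk-rowling", "name", "Rowling, J. K."),
    ("ex:author-jk-rowling", "name", "Joanne K. Rowling"),
]


def is_resource(value):
    """Assumed convention for this sketch: resources are prefixed with 'ex:'."""
    return value.startswith("ex:")


def author_names(triples, book):
    """Collect every name known for the author(s) of a book."""
    names = set()
    for subject, prop, value in triples:
        if subject == book and prop == "author":
            if is_resource(value):
                # Follow the link and read the names hanging off the resource.
                names |= {v for s, p, v in triples if s == value and p == "name"}
            else:
                # A literal is a dead end: the string is all there is.
                names.add(value)
    return names


print(author_names(record_centric, "ex:something"))
print(author_names(resource_centric, "ex:something"))
```

With literals, nothing more can be said about the author than the string itself; with a resource, further statements (such as a second form of the name) attach to one place and are visible from both books.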
    • /resource/Dog
    • /3020251/
    • /factbook/resource/China
    • /music/artist/60d41417-feda-4734-bbbf-7dcc30e08a83
    • /dblp/resource/record/journals/ac/DavisR61
    • /rdf/usgov/geo/us/or
    • /bookmashup/books/006251587X
    • /bookmashup/doc/persons/Iain+M+Banks
    • The record is nothing but the content of its fields, just as an RDF node is nothing but the connections: the property values. The mapping is very direct: a record is an RDF node; the field (column) name is RDF propertyType; and the record field (table cell) is a value. Indeed, one of the main driving forces for the Semantic web, has always been the expression, on the Web, of the vast amount of relational database information in a way that can be processed by machines.
      • Relational Databases on the Semantic Web, Sir Tim Berners-Lee http://www.w3.org/DesignIssues/RDB-RDF.html
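
The mapping described in the quotation (record to node, field name to property, field value to value) can be sketched as below. This is a minimal illustration, not a real RDF library: `record_to_triples` is a hypothetical helper, the node name `something` is borrowed from the earlier slides, and the assignment of values to `subject` and `genre` is an assumption based on the 650 and 655 fields of the example record.

```python
def record_to_triples(node_id, record):
    """Turn one record (field name -> value or list of values) into triples.

    Each triple is (node, property, value): the record is the node,
    the field name is the property, and each field value is a value.
    """
    triples = []
    for field, value in record.items():
        values = value if isinstance(value, list) else [value]
        for v in values:
            triples.append((node_id, field, v))
    return triples


# Field data taken from the example record; repeatable fields are lists.
record = {
    "author": "Rowling, J. K.",
    "title": "Harry Potter and the Prisoner of Azkaban",
    "isbn": "0747542155",
    "subject": ["Potter, Harry (Fictitious character)", "Wizards"],
    "genre": ["Juvenile fiction", "Children's stories"],
}

for triple in record_to_triples("something", record):
    print(triple)
```

Every value here is still a literal string, which is the record-centric starting point the talk goes on to improve.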
  • URI ⊇ Primary/Foreign Keys
    • Using natural keys is the traditional approach, in line with Codd's original relational model. When you use them, you have only natural data that means something to users. This is good if users will ask ad hoc queries directly to the database in raw SQL. You can also often reduce the numbers of joins when using natural keys because you don't have to go to a lookup table to convert an ID to a description.
      • The Cost of GUIDs as Primary Keys Jimmy Nilsson http://www.informit.com/articles/article.aspx?p=25862
    • The fundamental issue is that keys are a significant source of coupling within a relational schema, and as a result they are difficult to change. The implication is that you generally want to avoid keys with business meaning because business meaning changes.
      • Choosing a Primary Key: Natural or Surrogate? Scott W Ambler http://www.agiledata.org/essays/keys.html
  • Rowling, J. K.; Harry Potter and the Prisoner of Azkaban; 0747542155; Potter, Harry (Fictitious character); Wizards; Juvenile fiction; Children's stories
  • Rowling, J. K. → /people/36082b69-ba77-486b-b27d-bf3ac3f1bfe7
    Harry Potter and the Prisoner of Azkaban → /titles/08944d4d-5b46-4bf5-9acf-3102b181de95
    0747542155 → urn:isbn:0747542155
    Potter, Harry (Fictitious character) → /character/e8b7ae0c-f465-4251-9bc9-bc4b6a61eb21
    Wizards → /topics/08f0fa23-0cb8-4a66-a310-dfd8ed95e0ae
    Juvenile fiction → /genres/ea65a567-bc36-4a23-a9de-bad053d18568
    Children's stories → /genres/f96eda4a-42ab-4d57-8fc9-96e6f6f81e98
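
The identifiers on this slide look like random UUIDs. Assuming they are minted that way, the sketch below shows what a synthetic key scheme involves: a lookup table that remembers which key was handed out for which value. The class `SyntheticKeyMinter` and the two "library" instances are hypothetical, introduced only for illustration.

```python
import uuid


class SyntheticKeyMinter:
    """Mints a random UUID per distinct value and remembers it locally."""

    def __init__(self, prefix):
        self.prefix = prefix
        self._table = {}  # value -> key; this table is private to one system

    def key_for(self, value):
        # Reuse the key if this system has seen the value before,
        # otherwise mint a fresh random one.
        if value not in self._table:
            self._table[value] = f"{self.prefix}{uuid.uuid4()}"
        return self._table[value]


# Two independent systems describing the same author.
library_a = SyntheticKeyMinter("/people/")
library_b = SyntheticKeyMinter("/people/")

key_a = library_a.key_for("Rowling, J. K.")
key_a_again = library_a.key_for("Rowling, J. K.")
key_b = library_b.key_for("Rowling, J. K.")

print(key_a == key_a_again)  # stable inside one system
print(key_a == key_b)        # different across systems
```

Inside one system the key is stable, but a second system has no way to arrive at the same key without access to the first system's table.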
  • Conclusion...
    • Synthetic Keys are a Closed-World Mechanism.
  • Conclusion...
    • Natural Keys are Open, difficult and require some additional thinking.
  • Example... Rowling, J. K.
    • R o w l i n g , J . K .
    • r o w l i n g j k
  • Example... r o w l i n g j k /people/
  • Example... Prisoner of Azkaban, Harry Potter and The; Harry Potter and The Prisoner of Azkaban
  • Example... prisoner of azkaban harry potter and the; harry potter and the prisoner of azkaban
  • Example... harry potter and the prisoner of azkaban; Prisoner of Azkaban, Harry Potter and The; Harry Potter and The Prisoner of Azkaban
  • Example... harry potter and the prisoner of azkaban /titles/
  • Example data from Wikipedia disambiguation page for 'Stardust': /titles/stardust
    • thing one: Novel, Neil Gaiman, Charles Vess
    • thing two: Film, 2007
    • thing three: Novel, Robert B Parker, 1990
    • thing four: Film, 1974, David Essex
    • thing five: Film, 1940, Linda Darnell
    • thing six: Magazine, India
    • thing seven: Computer Game
    • Relationship labels on the diagram: adaptation; is not
    • http://purl.org/vocab/frbr/core#
  • frbr:Work frbr:Expression frbr:Manifestation frbr:Item
  • http://www.loc.gov/marc/marc-functional-analysis/tool.html
  • http://www.loc.gov/marc/marc-functional-analysis/tool.html#table
  • rowlingjk andazkabanharryofpotterprisonerthe /works/
  • rowlingjk andazkabanharryofpotterprisonerthe /works/ MD5: /works/4e2fc306b548098b8277c07719176998
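
The natural key steps shown on the last few slides can be sketched roughly as follows. This is a reading of the slides rather than the authors' code: the function names are hypothetical, the word sorting is inferred from the string `andazkabanharryofpotterprisonerthe`, and the exact input that was hashed to produce the digest on the slide is not stated, so the sketch does not claim to reproduce it.

```python
import hashlib
import re


def normalise_name(name):
    """Lowercase and drop everything except ASCII letters and digits.

    Assumption: non-ASCII letters are simply discarded here; a real
    scheme would need a rule for them.
    """
    return re.sub(r"[^a-z0-9]", "", name.lower())


def normalise_title(title):
    """Lowercase, strip punctuation, then sort the words and join them.

    Sorting makes inverted and natural word orders produce the same key.
    """
    words = re.findall(r"[a-z0-9]+", title.lower())
    return "".join(sorted(words))


def work_key(author, title):
    """Build a /works/ key: MD5 of normalised author plus normalised title."""
    natural = normalise_name(author) + normalise_title(title)
    digest = hashlib.md5(natural.encode("utf-8")).hexdigest()
    return "/works/" + digest


print(normalise_name("Rowling, J. K."))
print(normalise_title("Prisoner of Azkaban , Harry Potter and The"))
print(work_key("Rowling, J. K.", "Harry Potter and The Prisoner of Azkaban"))
```

Because the key is derived only from the data, two systems that apply the same rules to the same record arrive at the same key without sharing a lookup table. MD5 is used here purely to shorten the key, not for security.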
  • Example data from The Library of Congress Name Authority File
    • Jones, Adrian, 1845-1938
    • Jones, Adrian, 1955-
    • Jones, Adrian, 1956-
    • Jones, Adrian, 1961-
    • Jones, Adrian, 1933-
    • Jones, Adrian
  • What's in a name? Slide, Copyright © 2006, Tim Berners-Lee, reproduced in fair-use
  • Conclusions
  • http://events.linkeddata.org/ldow2008/#program
  • Photos Copyright © 2008, Rob Styles, Nadeem Shabir and Danny Ayers, reproduced with permission
    • Rob Styles [email_address] aka mmmmmRob irc.freenode.net #talis
    • Nadeem Shabir [email_address] aka KiYanWang irc.freenode.net #talis
    • Danny Ayers [email_address] aka danja irc.freenode.net #talis
  • http://blogs.talis.com/nodalities
  • This work is Copyright © 2008 Talis Group Limited. It is licensed under the Creative Commons Attribution 3.0 Unported License. Full details at: http://creativecommons.org/licenses/by/3.0/
    • You are free: to Share (to copy, distribute and transmit the work) and to Remix (to adapt the work).
    • Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
    • For any reuse or distribution, you must make clear to others the license terms of this work.
    • Any of the above conditions can be waived if you get permission from the copyright holder.
    • Nothing in this license impairs or restricts the author's moral rights.
    • Some Content in the work may be licensed under different terms; this is noted separately.