Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

LD4L OCLC Data Strategy

1,804 views

Published on

Presentation to the LD4L Workshop at Stanford University 23rd February 2015

Published in: Technology

LD4L OCLC Data Strategy

  1. 1. Richard  Wallis OCLC  Data   Strategy Technology  Evangelist   @rjw LD4L  Workshop  –  Stanford  University  –  February  23rd  2015
  2. 2. Richard  Wallis OCLC  Data   Strategy Technology  Evangelist   @rjw LD4L  Workshop  –  Stanford  University  –  February  23rd  2015 Building  on  a  Web  of  Knowledge
  3. 3. The  Web  of  …
  4. 4. The  Web  of  … Documents Active  Documents Discovery ☌
  5. 5. The  Web  of  … Documents Active  Documents Discovery ☌ ✔ ✔
  6. 6. The  Web  of  … Documents Active  Documents Discovery ☌ ✔ ✔ ✗
  7. 7. The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔ ✔ ✗
  8. 8. The  Web  of  … Documents Active  Documents Discovery Data ☌☌ ✔ ✔ ✔✗ ✗
  9. 9. The  Web  of  … Documents Active  Documents Discovery Data Knowledge ☌☌ ✔ ✔ ✔✗ ✗☌
  10. 10. The  Web  of  … Documents Active  Documents Discovery Data Knowledge ☌☌ ✔ ✔ ✔✗ ✗ ? ☌
  11. 11. http://www.opte.org/ A  Web  of  Data  
  12. 12. http://www.opte.org/ The  Web  of  Data  
  13. 13. http://www.opte.org/ The  Web  of  Data   A  Library  Shaped  Black  Hole  ?
  14. 14. Entities  in  a   Knowledge  Graph
  15. 15. Entities  in  a   Knowledge  Graph
  16. 16. Open  Linked  Data  -­‐  Silos Library  Linked  Data Projects
  17. 17. British   Library German   National   Library French   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data Projects
  18. 18. British   Library German   National   Library French   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data Projects
  19. 19. British   Library German   National   Library French   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  20. 20. British   Library German   National   Library French   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Library  Linked  Data
  21. 21. British   Library German   National   Library French   National   Library Swedish   National   Library Open  Linked  Data  -­‐  Silos Behind  A  Vocabulary  Barrier Library  Linked  Data
  22. 22. A  general  purpose  vocabulary  for   describing  things  on  the  web
  23. 23. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" "15%  of  the  Web"
  24. 24. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y "15%  of  the  Web"
  25. 25. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   "15%  of  the  Web"
  26. 26. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML "15%  of  the  Web"
  27. 27. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD "15%  of  the  Web"
  28. 28. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD • Descriptive  data "15%  of  the  Web"
  29. 29. A  general  purpose  vocabulary  for   describing  things  on  the  web "Used  by  5  million   domains" "25%  of  pages  in  our   indexes" de  facto y • Linked  Data   • Embedded  in  HTML • RDFa,  Microdata,  JSON-­‐LD • Descriptive  data • Active  links "15%  of  the  Web"
  30. 30. THE  LIBRARY  KNOWLEDGE  GRAPH Towards person place object concept organization work
  31. 31. The  library  knowledge  graph
 A  graph  of  relationships person place object concept organization work
  32. 32. The  library  knowledge  graph
 A  graph  of  relationships person place object concept organization work start  here
  33. 33. The  library  knowledge  graph
 A  graph  of  relationships person place object concept organization work start  here
  34. 34. The  library  knowledge  graph
 A  graph  of  relationships person place object concept organization work start  here ILL  and  AnalyticsCataloging Discovery Integration  with  the  web The  library  knowledge  graph
 Putting  entities  in  library  workflows
  35. 35. ILL  and  AnalyticsCataloging Discovery Integration  with  the  web The  library  knowledge  graph
 Putting  entities  in  library  workflows
  36. 36. Entities  and  library  workflows
 Cataloging Improve  data  quality   • Link  to  authoritative  sources   A  new  approach  to  cataloging   • Point  and  click  cataloging   • Managing  entities  instead  of   managing  records   Consistent  with  RDA
  37. 37. Entities  and  library  workflows
 Cataloging Improve  data  quality   • Link  to  authoritative  sources   A  new  approach  to  cataloging   • Point  and  click  cataloging   • Managing  entities  instead  of   managing  records   Consistent  with  RDA
  38. 38. Entities  and  library  workflows
 Cataloging Improve  data  quality   • Link  to  authoritative  sources   A  new  approach  to  cataloging   • Point  and  click  cataloging   • Managing  entities  instead  of   managing  records   Consistent  with  RDA Entities  and  library  workflows
 Discovery
  39. 39. Entities  and  library  workflows
 Discovery
  40. 40. Entities  and  library  workflows
 DiscoveryEntities  and  library  workflows
 Web  exposure Be  found  on  the  web   Connect  your  users  to   unique  content   What  the  web  requires  for   web  exposure:   • Aggregation   • Familiar  structures   • A  Network  of  Links   • Entity  Identifiers
  41. 41. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work
  42. 42. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work
  43. 43. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work OCLC’s   Approach  to   Discoverable   Data Model  things  
 of  interest  to  the  web.  
  44. 44. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work OCLC’s   Approach  to   Discoverable   Data Model  things  
 of  interest  to  the  web.  
  45. 45. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work OCLC’s   Approach  to   Discoverable   Data Model  things  
 of  interest  to  the  web.   Make  those  things  available  via
 structures  familiar  to  the  web. Schema  Bib  Extend  –  http://www.w3.org/community/schemabibex
  46. 46. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work OCLC’s   Approach  to   Discoverable   Data Model  things  
 of  interest  to  the  web.   Make  those  things  available  via
 structures  familiar  to  the  web. Schema  Bib  Extend  –  http://www.w3.org/community/schemabibex BiblioGraph.net  –  http://bibliograph.net
  47. 47. WHAT’S  HAPPENING A  Library  Data  Revolution person place object concept organization work OCLC’s   Approach  to   Discoverable   Data Model  things  
 of  interest  to  the  web.   Make  those  things  available  via
 structures  familiar  to  the  web. Improve  library  workflows. Schema  Bib  Extend  –  http://www.w3.org/community/schemabibex BiblioGraph.net  –  http://bibliograph.net
  48. 48. ENTITIES  AND  WORLDCAT The  Library  Data  Revolution person place object concept organization work
  49. 49. Getting  from  here  to  there
  50. 50. Data  from  a
 converted  record  does   not  an  entity  make Transformation  into  Linked  Data  is  just  a  beginning  … Getting  from  here  to  there
  51. 51. Data  from  a
 converted  record  does   not  an  entity  make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  the  aggregate Getting  from  here  to  there
  52. 52. Data  from  a
 converted  record  does   not  an  entity  make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  the  aggregate • Identify,  map,  merge  -­‐  evidence  based Getting  from  here  to  there
  53. 53. Data  from  a
 converted  record  does   not  an  entity  make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  the  aggregate • Identify,  map,  merge  -­‐  evidence  based • Relate  to  external  sources Getting  from  here  to  there
  54. 54. Data  from  a
 converted  record  does   not  an  entity  make Transformation  into  Linked  Data  is  just  a  beginning  … • Mine  and  analyse  the  aggregate • Identify,  map,  merge  -­‐  evidence  based • Relate  to  external  sources • Share  authoritative  entities Getting  from  here  to  there
  55. 55. • 197+  million  Work  descriptions  and  URIs   • Schema.org  +  BiblioGraph.net   • RDF  Data  formats   • RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  manifestations   • Links  to  Dewey,  LCSH,  LCNAF,  VIAF,  FAST   • Open  Data  license  via  Linked  Data  Explorer   •  2015:  Discovery  API,  Metadata  API   • Released  April  2014 http://www.oclc.org/dataThe  Work  Entity
  56. 56. • 98+  million  Person  descriptions  and  URIs   • Person  entities  with  authority:  20.2  million   • Person  entities  without  authority:  78.3  million   • Schema.org  +  BiblioGraph.net   • Harvested  from  WorldCat  data  and  enriched  from  other  hubs  RDF   Data  formats   • RDF/XML,  Turtle,  Triples,  JSON-­‐LD   • Links  to  WorldCat  Works.    Added  links  from  WC  Works.   • Open  Data  license  via  Linked  Data  Explorer   •  2015:  Linked  Data  Explorer,  Discovery  API http://www.oclc.org/dataThe  Person  Entity
  57. 57. • Photo  credit:  http://measuringupblog.com/app/wp-­‐content/uploads/2013/11/blogpic2.jpg
  58. 58. • Photo  credit:  http://measuringupblog.com/app/wp-­‐content/uploads/2013/11/blogpic2.jpg Can  we  measure  impact?
  59. 59. Monthly  Unique  Visitors
  60. 60. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 2013 OCLC   Entity-­‐ Based  Data   Strategy
  61. 61. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 • Application  Integration   • WorldCat  Discovery   • Analytics   • Discovery  API   • Cataloging   ! … • More  Entities  Released   • Person   • Manifestation   • Organization   • Concept   ! ! • New  Products                 • Continuing  Evangelism   ! • New  Services   • Continuing  Innovation   ! 2013 OCLC   Entity-­‐ Based  Data   Strategy
  62. 62. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 • Application  Integration   • WorldCat  Discovery   • Analytics   • Discovery  API   • Cataloging   ! … • More  Entities  Released   • Person   • Manifestation   • Organization   • Concept   ! ! • New  Products                 • Continuing  Evangelism   ! • New  Services   • Continuing  Innovation   ! 2013 OCLC   Entity-­‐ Based  Data   Strategy
  63. 63. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 • Application  Integration   • WorldCat  Discovery   • Analytics   • Discovery  API   • Cataloging   ! … • More  Entities  Released   • Person   • Manifestation   • Organization   • Concept   ! ! • New  Products                 • Continuing  Evangelism   ! • New  Services   • Continuing  Innovation   ! 2013 OCLC   Entity-­‐ Based  Data   Strategy
  64. 64. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 • Application  Integration   • WorldCat  Discovery   • Analytics   • Discovery  API   • Cataloging   ! … • More  Entities  Released   • Person   • Manifestation   • Organization   • Concept   ! ! • New  Products                 • Continuing  Evangelism   ! • New  Services   • Continuing  Innovation   ! 2013 OCLC   Entity-­‐ Based  Data   Strategy
  65. 65. ✓ VIAF,  ISNI,  FAST  Publish  Linked  Data   ✓ WorldCat.org  Linked  Data  Release  –  using  Schema.org   ✓ Internal  agreement  on  data  strategy   ✓ Evangelism   ✓ Research  &  Design  with  Data  Architecture  Group   ✓ Data  mining  of  WorldCat  resources   ✓ WorldCat  Works  Released   2012   2014 • Application  Integration   • WorldCat  Discovery   • Analytics   • Discovery  API   • Cataloging   ! … • More  Entities  Released   • Person   • Manifestation   • Organization   • Concept   ! ! • New  Products                 • Continuing  Evangelism   ! • New  Services   • Continuing  Innovation   ! 2013 OCLC   Entity-­‐ Based  Data   Strategy
  66. 66. Richard  Wallis OCLC  Data   Strategy Technology  Evangelist   @rjw LD4L  Workshop  –  Stanford  University  –  February  23rd  2015
  67. 67. Richard  Wallis OCLC  Data   Strategy Technology  Evangelist   @rjw LD4L  Workshop  –  Stanford  University  –  February  23rd  2015 Building  on  a  Web  of  Knowledge

×