Role of Linked Data for Scholarly Publishers

878 views

Published on

Society of Scholarly Publishing Conference 2012 talk on "Making Semantics Work". Bernadette Hyland describes what publishers need to be paying attention to with respect to data reuse and sharing. She describes goals, approaches and platforms for the internal and external publishing of data as Linked Data for more efficient and effective integration, reuse and distribution.

Published in: Technology, Education
  • Be the first to comment

  • Be the first to like this

Role of Linked Data for Scholarly Publishers

  1. 1. Making Semantics Work The Role of Linked Data in Scholarly Publishing 1 June 2012 Arlington VA USA Brief by Bernadette Hyland, co-chair, W3C Government Linked Data Working Group CEO, 3 Round Stones, Inc Email. bhyland@3roundstones.com Twitter: @BernHyland This presentation: http://slideshare.net/3roundstonesWednesday, May 30, 12 1
  2. 2. What is the semantic web? from the W3C web site The Semantic Web is a web of data. The Semantic Web is about two things. It is about common formats for integration and combination of data drawn from diverse sources… It is also about language for recording how data relates to real world objects. Slide credit: Scott Brinker @chiefmartecWednesday, May 30, 12 2
  3. 3. Content Data is King Slide credit: Scott Brinker @chiefmartecWednesday, May 30, 12 3
  4. 4. Data is really flowing...Wednesday, May 30, 12 4
  5. 5. Linked data is about data that is reusable A simple yet revolutionary change in perspective.Wednesday, May 30, 12 5
  6. 6. We’re living in a golden age ... Photo credit: http://www.flickr.com/photos/sjungling/5974860/Wednesday, May 30, 12 6
  7. 7. “Knowledge is of two kinds. We know a subject our ourselves, or we know where we can find information upon it.” by Samuel Johnson 18th Century British author, linguist & lexicographerWednesday, May 30, 12 7
  8. 8. from: to: LinkedEnterpriseDataWednesday, May 30, 12 8
  9. 9. Wednesday, May 30, 12 9
  10. 10. Book Publishers Goals: Improve internal manuscript pipelines, expose additional ways of finding and using contentWednesday, May 30, 12 10
  11. 11. New MediaWednesday, May 30, 12 11
  12. 12. Scholars pain point ... #1 - Data access and reuse Large amounts of diverse data produced by complex experiments, simulations & observations • The growth rate of PubMed alone is one paper per minute • Hard to validate, reproduce & leverage scientific data • Not easily accessible nor interlinked (Exception is ‘omics’ research, deposit of sequences required for publication)Wednesday, May 30, 12 12
  13. 13. Publishers looking to ... 1. Lower costs of combining data silos 2. Control data quality & protect data/brand standards 3. Produce high quality data for external consumption 4. Leverage structured data increasingly available via the Web 5. Distribute & promote content (SEO++) 6. Increase paid subscriptions 7. Provide new data initiatives, i.e, a “kitchen” for mashupsWednesday, May 30, 12 13
  14. 14. Business decisions are yours... “Marketing” Determining how much data to share… …or not to share. “Legal” Slide courtesy of Scott Brinker @chiefmartecWednesday, May 30, 12 14
  15. 15. Some data may be better harnessed as an incentive for other business goals • For internal use • For external use by • new & existing authors • new editors • new subscribers • new partnersWednesday, May 30, 12 15
  16. 16. Wednesday, May 30, 12 16
  17. 17. Why Linked Data matters ... • It scales ... to Web-scale • Does not require a super model • Based on International Data Exchange Standards (RDF, SPARQL) • Lingua franca for data exchangeWednesday, May 30, 12 17
  18. 18. Wednesday, May 30, 12 18
  19. 19. Wednesday, May 30, 12 19
  20. 20. Wednesday, May 30, 12 20
  21. 21. • Linked Data is about publishing and consuming data using international data standards • Based on 20 year old idea • A system of linked information systemsWednesday, May 30, 12 21
  22. 22. Data landscape Semantic Technologies RDBMS Linked Linked Semantic Open Enterprise CRM Web Data Data BIWednesday, May 30, 12 22
  23. 23. Agreement on priorities may be contentious among stakeholders.Wednesday, May 30, 12 23
  24. 24. YouTube HDTV watch Better watch videos videos Publish videos Share videos Rate videos Discuss videosWednesday, May 30, 12 24
  25. 25. Linked Data RDBMS Use data Use data Publish data Share data Rate data Discuss dataWednesday, May 30, 12 25
  26. 26. Credit: Bradley P. Allen, Elsevier LabsWednesday, May 30, 12 26
  27. 27. Linked Data Management platformWednesday, May 30, 12 27
  28. 28. CONTENT LINKED DATA MANAGEMENT MANAGEMENT SYSTEM SYSTEM DATA TEXT UNSTRUCTURED STRUCTURED DATA TEXTWednesday, May 30, 12 28
  29. 29. • Callimachus is a framework for data-driven applications based on Linked Data principles • Callimachus allows Web developers to easily create data driven applications for the Web • Availableas Open Source (FLOSS) & commercially supported versionWednesday, May 30, 12 29
  30. 30. Espaces Semantiques US Government Printing OfficeWednesday, May 30, 12 30
  31. 31. Publishing Linked Data will require continual nurturing but the rewards are worth itWednesday, May 30, 12 31
  32. 32. Goal: Achieve balance between openness vs. protection, distributed vs. controlled, standardized vs. loosely- coupled data relationships. Slide credit: Scott Brinker @chiefmartecWednesday, May 30, 12 32
  33. 33. Recommendations • Seek balance for sharing and reuse • Data is king • Publish in reusable format (RDF family of standards) • Use OPEN vs proprietary in data formats • Define a URI Policy and Strategy, document it and ensure editors & authors use it • Best practices and vocabularies exist -- don’t recreate the wheelWednesday, May 30, 12 33
  34. 34. Bernadette Hyland bhyland@3roundstones.com @BernHyland http://3RoundStones.com Email info@3RoundStones.com Twitter @3RoundStonesWednesday, May 30, 12 34
  35. 35. This work is Copyright © 2011-2012 3 Round Stones Inc. It is licensed under the Creative Commons Attribution 3.0 Unported License Full details at: http://creativecommons.org/licenses/by/3.0/ You are free: to Share — to copy, distribute and transmit the work to Remix — to adapt the work Under the following conditions: Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.Wednesday, May 30, 12 35

×