Connecting Publications and Data


Connecting Scientific Resources Breakout
Science Online London 2010 - British Library

Session abstract: "Do you have data? Have you decided that you want to publish that data in a friendly way? Then this session is for you. Allowing your data to be linked to other data sets is an obvious way to make your data more useful, and to contribute back to the data community that you are a part of, but the mechanics of how you do that are not always so clear-cut. This session will discuss just that. With experts from the publishing world, the linked data community, and scientific data services, this is a unique opportunity to get an insight into how to create linked scientific data, and what you can do with it once you have created it."

Published in: Technology

  3. Traditional supplementary data has its place and remains valuable, but…
  4. …that is not what I will be speaking about today…
  5. …I am talking about integrating supplementary data in context. With PANGAEA, the data is not only shown in context but also visualized. Publishers need to integrate third-party data repositories into the context of related articles. But how can this scale? More information about the PANGAEA repository here: More information about this collaboration here: _01616
  6. Another example of supplementary data integrated and visualized in context.
  9. Manual entity linking, while it has its place, is difficult to scale and…
  10. …will not be what I am discussing today (but remember this example for later). Instead…
  11. …I will be discussing automated entity extraction, which has a number of benefits, including:
      • Easily extendable to new / other entities
      • Works retrospectively on older content
      • Does create recall / precision errors (not actually a benefit, but important to note)
      Furthermore…
  12. …with links directly in the text…
  13. …data and information are integrated in context…
  14. (1) Colours and tags gene, protein, and molecule names. (2) Clicking a tagged term shows a summary of its features (for example, sequence or 2D structure). (3) The user can click on links in the pop-up leading out to more information. The Reflect integration also links to information in context…
  15. Reflect integration in ScienceDirect was a result of EMBL winning the Elsevier Grand Challenge. “The Elsevier Grand Challenge: Knowledge Enhancement in the Life Sciences is a contest created to improve the way scientific information is communicated and used. The contest invites members of the scientific community to describe and prototype a tool to improve the interpretation and identification of meaning in (online) journals and text databases relating to the life sciences. Specifically we are looking for new ways to: improve the process/methods/results of creating, reviewing and editing scientific content; interpret, visualize or connect the knowledge more effectively; and/or provide tools/ideas for measuring the impact of these improvements.” En route to a scalable model?
  16. Remember this example from earlier? Integrating PDB content in context will greatly enhance the user experience (watch for this change). Things are improving fast, but how do we keep up with it all?
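
The automated entity extraction described on slides 11 to 14 can be illustrated with a small dictionary-based tagger. This is only a sketch under stated assumptions, not how Reflect or ScienceDirect is implemented: the three-entry dictionary, the function names, and the `[[term|type]]` marker syntax are all hypothetical. It shows why the approach extends easily (add dictionary entries), works retrospectively (rerun it over old text), and still needs precision and recall checked against hand-labelled spans.

```python
import re

# Hypothetical mini-dictionary of surface forms and entity types.
# A real system would load much larger vocabularies.
ENTITY_DICTIONARY = {
    "p53": "protein",
    "BRCA1": "gene",
    "caffeine": "molecule",
}


def extract_entities(text, dictionary=ENTITY_DICTIONARY):
    """Return sorted (start, end, surface form, entity type) tuples."""
    hits = []
    for term, entity_type in dictionary.items():
        # Word boundaries keep "p53" from matching inside "p530".
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            hits.append((match.start(), match.end(), match.group(0), entity_type))
    return sorted(hits)


def tag_in_context(text, hits):
    """Wrap each hit in an illustrative [[term|type]] marker.

    Hits are applied from the end of the text backwards so that earlier
    offsets stay valid. Overlapping hits are not handled in this sketch.
    """
    for start, end, surface, entity_type in sorted(hits, reverse=True):
        text = text[:start] + f"[[{surface}|{entity_type}]]" + text[end:]
    return text


def precision_recall(predicted, expected):
    """Compare predicted and hand-labelled collections of (start, end) spans."""
    predicted, expected = set(predicted), set(expected)
    true_positives = len(predicted & expected)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(expected) if expected else 0.0
    return precision, recall


if __name__ == "__main__":
    sentence = "Caffeine exposure altered p53 and BRCA1 expression."
    found = extract_entities(sentence)
    print(tag_in_context(sentence, found))
    # Hand-labelled spans would normally come from a curated test set.
    labelled = [(0, 8), (26, 29), (34, 39)]
    print(precision_recall([(s, e) for s, e, _, _ in found], labelled))
```

In a publishing platform the marker would presumably become a link or pop-up trigger rather than plain brackets; the bracket form is used here only to keep the example self-contained.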
  18. Q: How can we scale the PANGAEA, NextBio and Reflect model to its full potential? A: By opening up our platform so that third-party developers can build applications in context on Scopus and ScienceDirect. “SciVerse Applications is the place to find and market applications, or collaborate with the scientific community to create applications.”
  19. The platform is built on the open source Apache Shindig framework for OpenSocial gadgets and RESTful APIs. Customer entitlements for ScienceDirect and Scopus remain the same for users.
  20. Applications can be developed in context in multiple locations throughout ScienceDirect, Scopus, and Hub Beta.
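
For readers unfamiliar with the OpenSocial gadgets mentioned on slide 19, a gadget is described by a small XML document with a `Module` root, a `ModulePrefs` element for metadata, and a `Content` element holding the markup to render. The sketch below generates such a document with Python's standard library. It is a generic illustration: the title and HTML body are invented, the helper name `build_gadget_spec` is hypothetical, and any SciVerse-specific requirements are not covered by the slides and are not represented here.

```python
import xml.etree.ElementTree as ET


def build_gadget_spec(title, html_body):
    """Return a minimal OpenSocial-style gadget spec as an XML string."""
    module = ET.Element("Module")
    ET.SubElement(module, "ModulePrefs", {"title": title})
    content = ET.SubElement(module, "Content", {"type": "html"})
    # ElementTree escapes the markup as text; hand-written gadget specs
    # usually wrap the HTML in a CDATA section instead.
    content.text = html_body
    return ET.tostring(module, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical gadget that would show related data next to an article.
    print(build_gadget_spec("Related data sets", "<p>Data linked to this article</p>"))
```

A container such as Shindig would fetch a spec like this and render the `Content` body inside the host page.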