One Scientist’s Wish List for Scientific Publishers


Keynote presentation American Chemical Society Editors Meeting, La Jolla CA USA, Jan 6, 2012

Published in: Education, Technology
  • A PubCast is a scientific publication combined with a corresponding video. In practice, any video that is relevant to the work described in the publication can be combined with it to create a PubCast. The point is to draw attention to the research and promote the article’s readership, exposure and possible citation. There are many reasons why video combined with the paper helps accomplish this. To gain the greatest benefit from a PubCast, however, it is most beneficial if the publication starts out as an open access publication…
  • One Scientist’s Wish List for Scientific Publishers

    1. One Scientist’s Wish List for Scientific Publishers
       Philip E. Bourne, University of California San Diego
       [email_address]
       Conference of ACS Editors, Jan 6, 2012, La Jolla CA
    2. My {Biased} Perspective
       • I am a domain scientist (computational molecular biology)
       • I co-direct the RCSB Protein Data Bank
       • I co-founded and am EIC of one of the Public Library of Science (PLoS) open access journals – There must be a business model
       • I co-founded a company, SciVee Inc., that is attempting to leverage the perceived changes in scholarly communication
       • I support a small academic scholarly communication group
    3. What Drives Me… The Story of Meredith
    4. Meredith was Successful Because…
       • She is an exceptional individual
       • Much of the data and knowledge she needed she had to beg for; she used the UCSD library
       • She easily got access to computer time
       • I want others who are less smart, like myself, to be successful too
    5. What is Wrong Today?
       • Formal science communication:
           • Occurs too slowly
           • Reaches too few people
           • Costs too much
           • Ignores the data
           • Is very hard to reproduce
       • Is stuck in the era of the printing press – we need to move Beyond the PDF and use the power of the medium
    6. It’s a Start…
       • We have scratched the surface, but have yet to explore the core
       • But look what we could do…
    7. My Dream
       • User reads a paper (one view of the info)
       • Clicks on a figure which can be analyzed
       • Clicking the figure gives a composite database + journal view
       • This takes you to yet more papers or databases
       The Knowledge and Data Cycle:
       0. Full text of PLoS papers stored in a database
       1. A link brings up figures from the paper
       2. Clicking the paper figure retrieves data from the PDB which is analyzed
       3. A composite view of journal and database content results
       4. The composite view has links to pertinent blocks of literature text and back to the PDB
    8. The Dream Can Only be Realized in the Context of the Complete Research Enterprise
       [Diagram labels: Literature, Data, Methods]
    9. My Current Reality
    10. Let Me Try and Illustrate What is Possible Through Our Own Little Contributions
    11. Much of What We Try and Do Leverages Open Access Content
        2008 Open Access: Taking Full Advantage of the Content. PLoS Comp. Biol. 4(3) e1000037
    12. Literature: OA Biosciences
        • Meredith certainly benefited from the OA literature, but not enough …
        • … we have a long way to go in several respects
        [Figures: PubMed Central contents, Nov 8, 2011; PubMed contents, Nov 8, 2011]
    13. Literature: Semantically Enriched Version of PubMed Central
        NAR 2008 36(S2) W385-389
    14. Literature: So What has Happened with This?
        • The value seems to be in RESTful services that enable databases to recognize where in the literature their data are referenced
        • We use this in the PDB in the following way…
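
The idea of a database recognizing where its data are referenced in the literature can be illustrated with a minimal sketch. This is not the service described on the slide; it only assumes that classic PDB identifiers are four characters (a digit 1-9 followed by three alphanumerics), and the function name `find_pdb_mentions` and the sample text are hypothetical.

```python
import re

# Assumption: classic four-character PDB IDs, e.g. 4HHB. Requiring at least
# one letter after the leading digit filters out plain years such as 2011.
PDB_ID = re.compile(r"\b[1-9](?=[A-Za-z0-9]{0,2}[A-Za-z])[A-Za-z0-9]{3}\b")


def find_pdb_mentions(paragraphs):
    """Map each candidate PDB ID to the indices of the paragraphs mentioning it."""
    index = {}
    for i, text in enumerate(paragraphs):
        for match in PDB_ID.finditer(text):
            hits = index.setdefault(match.group().upper(), [])
            if i not in hits:  # record each paragraph once per ID
                hits.append(i)
    return index


# Hypothetical article text, for illustration only.
paragraphs = [
    "The structure of hemoglobin (PDB 4HHB) was compared with earlier models.",
    "Published in 2011; see also 1tim and 4hhb.",
]
mentions = find_pdb_mentions(paragraphs)
```

A pattern match only yields candidates; a real service would still need to validate each hit against the database itself.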
    15. Data & Literature: Integration
        BMC Bioinformatics 2010 11:220
    16. Literature: Word Add-in for Authors
        • Allows authors to add metadata as they write, before they submit the manuscript
        • Authors are assisted by automated term recognition
            • OBO ontologies
            • Database IDs
        • Metadata are embedded directly into the manuscript document via XML tags, OOXML format
            • Open
            • Machine-readable
        • Open source, Microsoft Public License
        BMC Bioinformatics 2010 11:103.
    17. Literature: Knowledge Discovery
        [Diagram labels: Shared Function, Immunology Literature, Cardiac Disease Literature]
    18. Let’s Turn Our Attention to Data …
    19. Data: Thoughts from the Biosciences
        • Growth – DNA sequence data are doubling every 5 months
        • NLM/NCBI is amazing
        • Other great resources – but a slow realization they are appearing stove piped
        • Data are undervalued (more on this)
        • We need a data registry for deep search and comparison
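
As a rough worked example of what "doubling every 5 months" implies, assuming steady exponential growth (the symbols $N_0$ for the starting volume and $t$ for elapsed months are introduced here for illustration only):

```latex
\begin{align*}
N(t) &= N_0 \cdot 2^{t/5}, \qquad t \text{ in months} \\
\frac{N(12)}{N_0} &= 2^{12/5} \approx 5.3 \quad \text{(growth over one year)} \\
\frac{N(60)}{N_0} &= 2^{12} = 4096 \quad \text{(growth over five years)}
\end{align*}
```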
    20. Data: Data Journals – Lots of Talk
        • Nature Publishing Group
        • F1000
        • PLoS at Large
        • eLife
        • Global Biodiversity Information Facility
        • Some fields do it naturally – Astronomy, Earth Sciences …
    21. Data: PLoS Data Pages
        • To be introduced later this year
        • Shared metadata description?
        • Data in Dryad or equivalent
        • History could repeat itself – JMS revisited
    22. … And Finally Methods
    23. Methods: Is Reproducibility a Myth?
        • My views of reproducibility:
            • We all express the importance, but the only time it is tested is when something is truly novel or error is suspected
            • Reproducibility covers a spectrum of meaning – by whom and with how much effort
            • The longer the time lag the less likely something is reproducible
        Nature Reviews Drug Discovery 10, 643-644 (September 2011)
    24. PLoS Comp Biol Software
        • Requires source be deposited in an open source public repository
        • Encourages a copy of record be deposited with the article
        • Requires that the reviewer be able to test the software if they wish - implies data, documentation, test parameters and output be provided for checking
        Motivation: S. Veretnik, J.L. Fink, and P.E. Bourne 2008 Computational Biology Resources Lack Persistence and Usability. PLoS Comp. Biol. 4(7): e1000136
    25. Methods: Workflow Tools Might be the Answer?
        • Taverna
        • Wings
    26. Methods: Workflows - Our Own Experience
        • It’s hard and embarrassing
        • We have a working prototype using Wings
        • I can feel the potential productivity gains
        • My students are more doubtful
        • It’s been a lot of fun and will enable us to improve our processes regardless of the workflow system itself
    27. Methods: Yes The Workflow is Real
    28. Methods: Problems with Publishing Workflows
        • Workflows are not linear
        • Workflow : paper is not 1:1
        • Confidentiality
        • Peer review
        • Infrastructure
        • Community acceptance
        • Reward system
        • No publisher seems willing to touch them
    29. And Now for Something Completely Different
    30. Experiments in Rich Media Mashups
    31. Pubcast – Video Integrated with the Full Text of the Paper
    32. Rich Media as Scholarship Products
        Application   Product                        Primary Customers
        Journals      PubCast                        Journals, publishers, societies
        Meetings      PosterCast, SlideCast          Societies, conference orgs.
        Comm.         PaperCast, Podcast, SlideCast  Societies, journals
        Education     PosterCast, SlideCast          Societies, universities
        Books         BookCast                       Publishers, book sellers
    33. Podium Capture
        Step 1: presenter starts PowerPoint
        Step 2: presenter starts recording on smart phone
        Step 3: presenter stops recording and initiates upload
        Step 4: slides are uploaded
        Step 5: slides and podcast are automatically synchronized
        Step 6: listener plays back synchronized presentation
        [Diagram labels: Android, iPhone, Windows Phone 7; Mac, PC; Slides, Website, Sync File, Podcast]
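
The synchronized playback in Step 5 and Step 6 can be sketched as a lookup from playback time to slide. The slide does not say what the sync file contains, so this sketch assumes it reduces to a sorted list of timestamps, one per slide, giving the second at which each slide first appeared; `slide_at` and the sample times are hypothetical.

```python
from bisect import bisect_right


def slide_at(sync_times, t):
    """Return the 0-based index of the slide on screen at playback time t (seconds).

    sync_times[i] is the time at which slide i first appeared. The list is
    assumed to be sorted and to start at 0 (an assumed sync-file layout).
    """
    if t < 0:
        raise ValueError("playback time must be non-negative")
    # bisect_right counts the slide changes at or before t; the last one wins.
    return bisect_right(sync_times, t) - 1


# Hypothetical recording: slide changes at 0 s, 42.5 s and 90 s.
sync_times = [0.0, 42.5, 90.0]
current = slide_at(sync_times, 60.0)
```

Storing only the change times keeps the sync data small, and the binary search lets a listener seek to any point in the audio and land on the right slide.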
    34. Nothing is Going to Change Unless The Reward System Changes
        [Diagram labels: The Right Thing To Do, Reward]
        P.E. Bourne 2011 Ten Simple Rules for Getting Ahead as a Computational Biologist in Academia. PLoS Comp. Biol. 7(1) e1002001.
    35. Interim Solution: Use the Traditional Reward System
        The Wikipedia Experiment – Topic Pages
        • Identify areas of Wikipedia that relate to the journal and are missing or are stubs
        • Develop a Wikipedia page in the sandbox
        • Have a Topic Page Editor review the page
        • Publish the copy of record with associated rewards
        • Release the living version into Wikipedia
    36. In Summary: I Do Not Want to Do Any of This – I Want You to Do It
        P.E. Bourne 2010 What Do I Want from the Publisher of the Future? PLoS Comp Biol 6(5): e1000787
    37. What Does That Mean? The “Publisher” becomes Part of the Scientific Workflow
        [Diagram labels: Scientist; Idea, Experiment, Data, Conclusions, Publish; Laboratory; Publisher]
        Maybe The Line is Somewhere Else?
    38. Maybe The Line is Somewhere Else?
        [Diagram labels: Scientist; Idea, Experiment, Data, Conclusions, Publish; Laboratory; Publisher; Institution?; Lab Notebook?]
    39. Maybe The Line is Somewhere Else?
        [Diagram labels: Scientist; Idea, Experiment, Data, Conclusions, Publish; Laboratory; Publisher; Institution?; Lab Notebook?]
    40. Questions? [email_address]