Revision is constantly occurring. Even primary data can be revised.
Science is interpretation
Much of scientific data is secondary datasets of interpretative, information.
Primary Data Primary Data Primary Data Secondary Curated Data Processed Data Secondary Curated Data Secondary Data Integrated data Processing details Capture details Update revise Update revise revise revise
Repeatable, reproducible, comparable and reusable research.
Sharing to propagates expertise and build reputation.
120 Simulation tools 1,200 Seminars, podcasts, etc. 77,000 Users worldwide 550 Contributors Developed by the NSF Network for Computational Nanotechnology Online since October 2002 [Michael McLennan] http://nanoHUB.org
Seamless integration between data and publications
From the Public Library of Science people.
1. A link brings up figures from the paper 0. Full text of PLoS papers stored in a database 2. Clicking the paper figure retrieves data from the PDB which is analyzed 3. A composite view of journal and database content results 4. The composite view has links to pertinent blocks of literature text and back to the PDB 1. 2. 3. 4. The Knowledge and Data Cycle http://biolit.ucsd.edu [Phil Bourne]
The reproducible and interactive research documents* Mixed stewardship research documents The recombinant, compound research documents The virtual research document Multi-versioned, dynamic research document *Papers, Books, whatever. 2020
Data, image, model, process, workflow, podcast, slideset* Finding, citation, peer review, preservation, identity, versioning, security, privacy, copyright management, format authority Authority on metadata descriptions Propagation of descriptions * Insert new research commodity type here 2020
What does this mean for library services? Seamless interlinking of data, literature and other research commodities Integrated search across external resources Selective quality curation Hell is other people’s (lack of) semantic metadata 2020
Supporting Paul, The Scientist Search/Discover Serendipitous Finding Collaborative Searching Structural Search Keeping Current Gather Collecting Manage Organizing Create Annotating Review & Rate Describe Write Share Publish Sharing Rights Integrated search Automatic paper download Continual queries Paper recommendation Alert Project and Personal Internal search Refereed and Grey literature Tag, annotate, rate Templates Multi-author authoring Bibliography management Version management Copyright tools (CC and SC) Linking up data, models and other components [Roger Barga]
Team science is increasingly composed of co-authors located at different universities.
“ virtual communities of scholars” produce higher impact work than comparable co-located teams or solo scientists.
True for all fields and team sizes.
Studies of 19.9 million research articles over 5 decades as recorded in the Web of Science database, and an additional 2.1 million patent records from 1975-2005. Using the Web of Science database to analyze the collaboration arrangements of over 4,000,000 papers over a 30 year period Sources: Wuchty, Jones, and Uzzi Noshir Contractor
Competitive advantage. Academic vanity. Reputation. Adoption. Scrutiny. Being scooped. Misinterpretation. New Reward Schemes Rewards Fears
What is the role of the library? Trusted curator Trusted data manager Quality arbiter Knowledge disseminator Format authority Add value content provider Metadata / controlled vocabulary provider Add value service provider 2020
Services Embedding into the Researchers Workflow The Cloud
Combine it with a Google Scholar search for number of citations
Mash the results into the PubMed results with a link to Google Scholar
What does this mean for library services? With not For Opening up to researcher’s tools and research environments for discovery, management and curation of research commodities Enabling and encouraging new services and new content to add new value Remove obstacles to interoperate and share Collaborate, don’t control
Prior to leaving home Paul, a Manchester graduate student, syncs his IPhone with the latest papers, delivered overnight by the library via a news syndication feed. On the bus he reviews the stream, selecting a paper close to his interest in HIV-1 proteases.
The data shows apparent anomalies with his own work, and the method, an automated script, looks suspect.
Being on-line he notices that a colleague in Madrid has also discovered the same paper through a blog discussion and they Instant Message, annotating the results together.
By the time the bus stops he has recomputed the results, proven the anomaly, made a rebuttal in the form of a pubcast to the Journal Editor, sent it to the journal and annotated the article with a comment and the pubcast.
Duncan Hull, Steve Pettifer, Doug Kell, Defrosting the digital library: bibliographic tools for the next generation web to appear in PLoS Computational Biology
Michael Nielsen, The Future of Science http://michaelnielsen.org/blog/?p=448
Philip Bourne Will a biological database be different from a biological journal, PLOS Computational Biology 1(3) www.ploscompbiol.org
James A. Evans Electronic Publication and the Narrowing of Science and Scholarship Science 18 July 2008: Vol. 321. no. 5887, pp. 395 - 399 http://www.sciencemag.org/cgi/content/abstract/321/5887/395
James Hendler Reinventing Academic Publishing, Editorials for IEEE Intelligent Systems http://www.mindswap.org/blog/2007/08/14/reinventing-academic-publishing-%E2%80%93-part-i/ http://www.mindswap.org/blog/2007/11/23/reinventing-academic-publishing-%E2%80%93-part-ii/ http://www.mindswap.org/blog/2008/01/03/reinventing-academic-publishing-%E2%80%93-part-iii/