Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

The future of scientific information & communication


Published on

Our access to scientific information has changed in ways that were hardly imagined even by the early pioneers of the internet. The immense quantities of data and the array of tools available to search and analyze online content continues to expand while the pace of change does not appear to be slowing. While scientists now have access to the enormous capacities and capability of the internet the vast majority of scientific communication continues to be through peer-reviewed scientific journals. The measure of a scientist’s contribution is primarily represented by their publication profile and the citations to their published works and offers an incomplete view of their activities. However, we are at the beginning of a new revolution where the ability to communicate offers the opportunity to embrace new forms of publishing and where scientific participation and influence will be measured in new ways. This presentation will provide an overview of our new generation of “openness” in which open source, open standards, open access and open data are proliferating. The future of scientific information and communication will be underpinned by these efforts, influenced by increasing participation from the scientific community and facilitated collaboration and ultimately accelerate scientific progress.

Published in: Technology, Education
  • Be the first to comment

The future of scientific information & communication

  1. 1. The future of scientificinformation & communication Antony Williams SUNY Potsdam, April 12th 2013
  2. 2. How does the internet influence you?• How many of you visit the internet/check your email less than a dozen times per day?• Where do you go for fact-checking?• How many on Facebook? How many on Twitter?• You know you have an online profile right?• Scientists…how many of you are working on building a scientific profile online?• How many of you online now???
  3. 3. Me….and my vanity!
  4. 4. Searching Antony Williams
  5. 5. Searching ChemConnector…
  6. 6.
  7. 7. Wikipedia
  8. 8. LinkedIn
  9. 9.
  10. 10. And Mendeley
  11. 11. And My Co-author Graph
  12. 12. And Videos–YouTube–SciVee–Vimeo–Slideshare
  13. 13. I am Quantified…
  14. 14. ResearchGate
  15. 15. Google Scholar Citations
  16. 16. LinkedIn
  17. 17. AltMetrics
  18. 18. Usage, Citations, Social Media…
  19. 19. Scientists are “Quantified”• Stats are gathered and analyzed• Employers can find them, tenure will depend on them, funding are affected by them• Scientists Impact Factors, H-index and many other variants• Science is both competitive and collaborative
  20. 20. If it was not just about me…• Together we might: – build an encyclopedia – …and rate restaurants – …share book reviews – …and movie reviews – …and reviews of service providers – …organize sit-ins and social action – …and more data might just be Open
  21. 21. If it was not just about me…• Together we might: – build an encyclopedia – …and rate restaurants – …provide book reviews to each other – …or movie reviews – …or reviews of service providers – …organize sit-ins and social action – …and more data might just be Open – …more scientists might collaborate and share
  22. 22. It is so difficult to navigate… IP? IP? What’s the What’s the structure? structure? Are they in Are they in our file? our file? What’s What’s similar? similar? What’s the What’s the Pharmacology Pharmacology target? target? data? data? Known Known Pathways? Pathways? Competitors? Competitors? Working On Working On Connections to Connections to Now? Now? disease? disease? Expressed in Expressed in right cell type? right cell type?
  23. 23. Let’s Change the World• Let’s map together all historical chemistry data and build systems to integrate new data• Heck, let’s integrate chemistry and biology data and add in disease data too• Lets model the data and see if we can extract new relationships – quantitative and qualitative• Let’s make it all available on the web
  24. 24. That’s a BIG Request
  25. 25. What About Something Smaller?• We’re going to map the world• We’re going to take photos of as many places as we can and link them together• We’ll let people annotate and curate the map• Then let’s make it available free on the web• We’ll make it available for decision making• Put it on Mobile Devices, Give it Away
  26. 26. Where am I from?
  27. 27. Wikipedia
  28. 28. Wikipedia
  29. 29. I care…I want to contribute…
  30. 30. The Power of Contribution
  31. 31. How do you spell Afonwen?
  32. 32. Whoa…• So the world can be mapped…• We can enter a 3D environment within the map• We can add annotations• We can use the data, we can reference it, we can extract it, we can make decisions with it• And we can do it on our lap, in our hands• Let’s crowdsource chemistry and biology!!!
  33. 33. Science is being Crowdsourced• Crowdsourcing science is happening… – Contribution of data • Our data, About us • Our data, generated in labs • Open Data, data validation and curation – Contribution of software • Open Source, Open Standards – Contribution of funding
  34. 34. If we can map the planet…• …then we should map the Galaxy!
  35. 35. GalaxyZoo
  36. 36. Various ways to contribute
  37. 37. Where Am I From?
  38. 38. Where Am I From?
  39. 39. What can be done with Big Data
  40. 40. Patients Like Me
  41. 41. Patients Like Me
  42. 42. I am Chemist
  43. 43. Back to this….• Let’s map together all historical chemistry data and build systems to integrate new data• Heck, let’s integrate chemistry and biology data and add in disease data too• Lets model the data and see if we can extract new relationships – quantitative and qualitative• Let’s make it all available on the web
  44. 44. How can I contribute to chemistry?• Publish data, share data, validate and curate data• Publish chemicals, syntheses and data• “Publish” – Papers, Blogs, Reports, Tweets, Presentations, Videos• Contribute to Wikipedia• Participate in chemistry communities• Contribute to the Big Data
  45. 45. About Me…as a Chemist• I’ve performed a few dozen chemical syntheses• I’ve run thousands of analytical spectra• I’ve generated thousands of NMR assignments• I’ve probably published <5% of all work• Most of it has been lost• But things can be different today….
  46. 46. Blog• Opinions, procedures, observations, experiences
  47. 47. PresentationsPresentations,Videos, Report,Pre-publications
  48. 48. YouTube/Vimeo/SciVee• Presentations are easy to turn into movies and publish to these services• Literally “gives you a voice”
  49. 49. Data as a Publication
  50. 50. Data as a Publication?
  51. 51.
  52. 52. Contributing to the “Big Data” Maps
  53. 53. My Data Contributions…
  54. 54. Data & Curations to ChemSpider• The Royal Society of Chemistry free database• 28.5 million chemicals and growing daily• Software interfaces to integrate to• Amenable to community contribution – Deposit structures, property data, spectral data – Data annotation, validation and curation
  55. 55. • 3-year Innovative Medicines Initiative project• Integrating chemistry and biology data using semantic web technologies• Open source code, open data and open standards• Academics, Pharma companies, Publishers….
  56. 56. The Publishers!?
  57. 57. (Some) Publishers are Changing?• Data cannot be copyrighted and we have lots• Scientists contribute data in document form • Most publishers are open to Open Access• Scientific publications are built on data so what can be done to release the data? Much data is not published? Many scientists will not share…
  58. 58. Publications - a summary of work• Scientific publications are a summary of work – Is all work reported? – How much science is lost to pruning? – What of value sits in notebooks and is lost?• How much data is lost? – How many compounds never reported? – How many syntheses fail or succeed? – How many characterization measurements?
  59. 59. Community Repository for Data• Funding agencies encourage sharing of data• Increasing availability of “Open Data”• Institutional repositories have no specific domain support• Why not develop a community repository for chemistry data – private, public, embargoed?• Provides data to develop models/algorithms?
  60. 60. Chemical DatabaseService• National Chemical Database Service for UK Academics• Integrating Commercial Databases and Services• Chemicals, analytical data, prediction algorithms• Development of data repository
  61. 61. Model Building with Community Data• Community data as a basis of model building – Consume data from available databases, community data, new publications and build predictive algorithms for the community – How many algorithms are reported and lost? How much repeat work is done in the domain of algorithmic development?
  62. 62. Pulling Data from our Archive• Our contribution to the world of chemistry data• DERA – digitally enabling the RSC archive – Text mining • Find chemicals, reactions, analytical data, properties – Algorithmic checking • Validate algorithmically what we can - robots – “Web 2.0 interfaces” for curating and validating
  63. 63. What if we could capture it all?Digitally Enhancing the RSC Archive
  64. 64. Human Validation and Curation
  65. 65. Web 2.0 Contribution• We have been contributing to the web for a along time already – but how much in chemistry?• A few blogs, an increasing amount of tweeting but what about data sharing in chemistry?
  66. 66. The Old Way of Challenging
  67. 67. Challenging Science…
  68. 68. Collaboration towards completion
  69. 69. Detailed constructive dialog
  70. 70. Oxidation by Sodium Hydride?
  71. 71. The Blogosphere Analyzes…
  72. 72. The Blogosphere Analyzes…
  73. 73. How much is in the archives?
  74. 74. Open Notebook Science Analysis
  75. 75. Oxidation by Sodium Hydride?
  76. 76. What is Hexacyclinol?
  77. 77. The Blogosphere “Discusses”…
  78. 78. What is real, what is fake?
  79. 79. Chemistry is Dangerous!
  80. 80. Chemistry is Dangerous• Florida DJs May Face Felony for April Fools Water Joke Worse Than Rubios “… told their listeners that "dihydrogen monoxide" was coming out of the taps throughout the Fort Myers area.”
  81. 81.
  82. 82. How do you recognize good vs bad?
  83. 83. Is this real?
  84. 84. Junk vs Real“We then established a collaboration withprofessor Sum Ting Wong, a fugitive from theNorth Korean University Hu Yu Hai Ding”“..identified as the new protein Wai So Dim”
  85. 85. What is real, what is fake?
  86. 86. Helping to change science• Participation and contribution• Immediacy of action• Platforms for contribution• Openness…whatever that is
  87. 87. Openness – Carries Licensing • Openness may be hard.. • Open Access flavors • Open Source licenses • Open Data licenses • Open Notebook Science
  88. 88. Getting Called Out in Public… Rules for Licensing Data
  89. 89. Challenged in the Twittersphere
  90. 90. Annotating Articles Today…
  91. 91. Attribution to me…
  92. 92. Remember Quantifying Scientists• Scientists Impact Factors. Science is both competitive and collaborative• Can we measure ALL contributions to science?
  93. 93. Article-Level metrics are here
  94. 94. The Alt-Metrics Manifesto•
  95. 95. ImpactStory
  96. 96. ImpactStory
  97. 97. Scientists AltMetrics
  98. 98. Detailed Usage Statistics
  99. 99. Usage, Citations, Social Media, Etc
  100. 100. Enabled by• Persistent unique digital identifier• Integrates to workflows such as manuscript and grant submission• Supports automated linkages with your professional activities
  101. 101. Micropublishing How much data is lost?• How many reactions never get published?• How much data could be shared?• How many properties are measured and lost?• What stands in the way of sharing? – Is it technology? – Permissions? “The Boss”, Licensing?
  102. 102. Micropublishing Syntheses
  103. 103. ChemSpider SyntheticPages
  104. 104. What is real, what is fake?
  105. 105. Profile
  106. 106. Interactive Data
  107. 107. Rewards and Recognition• The badgesonomy culture of recognition is growing.• Badges are commonplace – FourSquare – Klout
  108. 108. Rewards and Recognition• Rewards and Recognition starting with CSSP then expands to other platforms• Including paths to expose such recognition on AltMetrics platforms – in discussion…
  109. 109. Impact by Data Set onDataIC50 Measurements for 62 substituted benzoxazolesChemSpider Data Repository: DOI: 10.1356/CSID784.4
  110. 110. What Does the Future Hold?
  111. 111. The Data Deluge Will Not Go Away
  112. 112. The Linked Network Will Grow
  113. 113. We DON’T want this world..
  114. 114. Thanks Martin!
  115. 115. We’re not there yetYou can’t get there from here
  116. 116. Thank youEmail: williamsa@rsc.orgTwitter: ChemConnectorPersonal Blog: www.chemconnector.comSLIDES: