The Great Promise of Online Data for
    Chemistry and the Life Sciences

                           Antony J Williams
                      Silverchair Colloquium 2012
READ FAST – IT’S HAPPENING NOW

  20 minutes, >40 slides

Disruption Can be Cheap, Fast and Unexpectedly Successful
Online Chemistry Databases in 2007
A search gave LOTS of “info”..
What is Yohimbine?
For chemists…try filtering!
Why not Index the web of chemistry?
 Build a search engine for chemistry

 Index all public domain chemicals and link

 Build a structure searchable web

 Crowdsource new chemistry from the community

 Crowdsource curation and annotation
Create a structure-centric hub
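A minimal sketch of what "structure-centric" could mean in practice: records from different sources are merged under one key per chemical structure, so that any name resolves to the same entry. Everything here is an assumption made for illustration (the record layout, the source names, the placeholder keys `KEY-A` and `KEY-B`, and the helpers `build_hub` and `find_by_name`); it is not a description of how ChemSpider is implemented. In a real system the key would be a canonical structure identifier such as an InChIKey.

```python
from collections import defaultdict

# Hypothetical records from made-up sources. "structure_key" stands in for a
# canonical structure identifier; the values below are placeholders only.
RECORDS = [
    {"source": "encyclopedia", "structure_key": "KEY-A", "names": ["Yohimbine"]},
    {"source": "vendor", "structure_key": "KEY-A", "names": ["yohimbine", "Example trade name"]},
    {"source": "encyclopedia", "structure_key": "KEY-B", "names": ["Thymol Blue"]},
]


def build_hub(records):
    """Merge records so that each structure key has exactly one entry."""
    hub = defaultdict(lambda: {"names": set(), "sources": set()})
    for rec in records:
        entry = hub[rec["structure_key"]]
        entry["sources"].add(rec["source"])
        # Names are lower-cased so that lookups are case-insensitive.
        entry["names"].update(name.lower() for name in rec["names"])
    return dict(hub)


def find_by_name(hub, name):
    """Return the sorted structure keys whose entry carries the given name."""
    wanted = name.lower()
    return sorted(key for key, entry in hub.items() if wanted in entry["names"])


hub = build_hub(RECORDS)
print(find_by_name(hub, "YOHIMBINE"))
print(sorted(hub["KEY-A"]["sources"]))
```

Keying on the structure rather than on a name is the design choice being illustrated: synonyms and trade names become attributes of one entry instead of separate, unlinked records.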
Answering Real Questions
 Questions a chemist might ask…
   What is the melting point of n-heptanol?
   What is the chemical structure of Xanax?
   Chemically, what is phenolphthalein?
   What are the stereocenters of cholesterol?
   Where can I find publications about xylene?
   What are the different trade names for Ketoconazole?
   What is the NMR spectrum of Aspirin?
   What are the safety handling issues for Thymol Blue?
The World of Online Chemistry
   Safety data
   Toxicity data
   Blogs and Wikis
   Property databases
   Experimental results
   Scientific publications
   Compound aggregators
   Open Notebook Science
   Metabolic pathway databases
   Encyclopedic articles (Wikipedia)
Linked Data for Life Sciences growing…
Solve Real World Problems
 Provide programmable interface against content
 Provide a chemistry database tuned to integrators
RSC and ChemSpider – May 2009
Why RSC acquired ChemSpider
 Commitment to serve the community

 Bring cheminformatics expertise in-house

 Add additional data to publications

 Potential freemium model – web services, data

 Because data is critical to science
Making sense of data is overwhelming
Publications are Hosts to Data
Data has value, is Free, is Open
 Data cannot be copyrighted. A particular
  expression of data, such as a chart or table in a
  publication, can be.

 Data licensing is being dealt with and openness
  encouraged

 Research data mandates are starting…

 Who will manage the integration and curation
  and keep the access FREE?
Tell me about Yohimbine…
Of course it is out there…
SOME Chemistry Databases in 2012
Tell me more…but…
   Where can I find the electronic structure?
   Papers/Patents about Yohimbine?
   What are the side effects of Yohimbine?
   Where can I order Yohimbine?
   What are the physicochemical properties?
   What are the associated metabolic pathways?
   Different synonyms of Yohimbine?
   Are there side effects with Yohimbine?

 ChemSpider links all of this information and more
Yohimbine on ChemSpider
RSC Databases are Integrated
RSC Journals are Integrated
Patents are Linked
Google Books are Integrated
And so are…
   Chemical vendors
   Safety and Toxicity information
   Experimental and Predicted properties
   Analytical data
   Images and Movies

 And all for free…
And all “mobile”
Not only compounds but syntheses
And analytical data…
The world can take and contribute
 Scientists can deposit their data

 They can annotate and curate

 They can download data

 They can embed data in the social network

 They can integrate and connect
Integrate to electronic lab notebooks
Integrate to instruments and software
 Primary analytical instrumentation vendors integrate

   Agilent, Bruker, Thermo, Waters


 Cheminformatics vendors link to ChemSpider

   Accelrys, ACD/Labs, ChemAxon, iChemLabs
Publications are a summary of work
 Scientific publications are a summary of work
   Is all work reported?
   How much science is lost to pruning?
   What of value sits in notebooks and is lost?

 How much data is lost?
   How many compounds never reported?
   How many syntheses fail or succeed?
   How many characterization measurements?
What if we could capture it all?
Start with data in publications
But in the time of Big Data…it’s linked!
ONE example – data for life sciences
   IP?
   What’s the structure?
   Are they in our file?
   What’s similar?
   What’s the target?
   Pharmacology data?
   Known Pathways?
   Competitors?
   Working On Now?
   Connections to disease?
   Expressed in right cell type?
 Crowdsourcing across drug discovery
 Open PHACTS: partnership between European
  Community and European Pharma Companies
 22 partners, 8 pharmaceutical companies, 3
  biotechs working together for 3 years

 Freely accessible for knowledge discovery and
  verification.
    Data on chemistry and biology
    Pharmacological profiles
    Proprietary and public data sources.
All that glisters is not gold…
Crowdsourced Assertions
 The future of publishing will include generation
  and consumption of “nanopublications”




 http://www.nanopub.org/
Nanopublications??
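As a rough illustration of the idea, a nanopublication is usually described as a single small assertion packaged with its provenance and with information about the publication itself. The sketch below models that three-part shape in plain Python and groups crowdsourced nanopublications that make the same assertion. The class names, field names, and example values are all hypothetical placeholders chosen for this sketch; real nanopublications are expressed as RDF, which is not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assertion:
    """One subject-predicate-object statement (hashable, so it can be a dict key)."""
    subject: str
    predicate: str
    obj: str


@dataclass
class Nanopublication:
    assertion: Assertion
    provenance: dict        # how the assertion came about (placeholder fields)
    publication_info: dict  # who packaged the nanopublication (placeholder fields)


def group_by_assertion(nanopubs):
    """Collect nanopublications that make the same assertion."""
    grouped = {}
    for nanopub in nanopubs:
        grouped.setdefault(nanopub.assertion, []).append(nanopub)
    return grouped


# Placeholder data: two depositors assert the same thing, a third something else.
claim = Assertion("compound:example-1", "hasSynonym", "Example name")
other = Assertion("compound:example-2", "hasSynonym", "Another name")
nanopubs = [
    Nanopublication(claim, {"method": "manual curation"}, {"creator": "depositor-a"}),
    Nanopublication(claim, {"method": "text mining"}, {"creator": "depositor-b"}),
    Nanopublication(other, {"method": "manual curation"}, {"creator": "depositor-a"}),
]

grouped = group_by_assertion(nanopubs)
print(len(grouped[claim]), len(grouped[other]))
```

Counting independent sources behind the same assertion is one simple way a consumer might weigh crowdsourced claims, in the spirit of "all that glisters is not gold".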
So what’s the business model?
 Decisions are based on data

 Publications encapsulate, reference and link data

 More data is free and open. More services and
  APIs allow access – free or for fee. Ask Google

 The large-scale licensed content business model
  is at risk without interfaces to integrate and mine
Acknowledgments
 The RSC ChemSpider team

 Our users, our depositors, our curators

 GGA Software Services, OpenEye, ACD/Labs
  and a lot of Open Source code!

 And Al Gore for supporting the internet
http://en.wikipedia.org/wiki/Al_Gore_and_information_techn
Thank you

Email: williamsa@rsc.org
Twitter: ChemConnector
Personal Blog: www.chemconnector.com
SLIDES: www.slideshare.net/AntonyWilliams

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 


The Great Promise of Online Data for Chemistry and the Life Sciences

  • 1. The Great Promise of Online Data for Chemistry and the Life Sciences Antony J Williams Silverchair Colloquium 2012
  • 2. READ FAST – IT’S HAPPENING NOW 20 minutes, >40 slides Disruption Can be Cheap, Fast and Unexpectedly Successful
  • 4. A search gave LOTS of “info”.. What is Yohimbine?
  • 6. Why not Index the web of chemistry?
      • Build a search engine for chemistry
      • Index all public domain chemicals and link
      • Build a structure searchable web
      • Crowdsource new chemistry from the community
      • Crowdsource curation and annotation
  • 9. Answering Real Questions
      • Questions a chemist might ask…
          • What is the melting point of n-heptanol?
          • What is the chemical structure of Xanax?
          • Chemically, what is phenolphthalein?
          • What are the stereocenters of cholesterol?
          • Where can I find publications about xylene?
          • What are the different trade names for Ketoconazole?
          • What is the NMR spectrum of Aspirin?
          • What are the safety handling issues for Thymol Blue?
  • 10. The World of Online Chemistry
      • Safety data
      • Toxicity data
      • Blogs and Wikis
      • Property databases
      • Experimental results
      • Scientific publications
      • Compound aggregators
      • Open Notebook Science
      • Metabolic pathway databases
      • Encyclopedic articles (Wikipedia)
  • 11. Linked Data for Life Sciences growing…
  • 12. Solve Real World Problems
      • Provide programmable interface against content
      • Provide a chemistry database tuned to integrators
  • 13. RSC and ChemSpider – May 2009
  • 14. Why RSC acquired ChemSpider
      • Commitment to serve the community
      • Bring cheminformatics expertise in-house
      • Add additional data to publications
      • Potential freemium model – web services, data
      • Because data is critical to science
  • 15. Making sense of data is overwhelming
  • 17. Data has value, is Free, is Open
      • Data cannot be copyrighted. A particular expression of data, such as a chart or table in a publication, can be.
      • Data licensing is being dealt with and openness encouraged
      • Research data mandates are starting…
      • Who will manage the integration and curation and keep the access FREE!
  • 18. Tell me about Yohimbine…
  • 19. Of course it is out there…
  • 21. Tell me more…but…
      • Where can I find the electronic structure?
      • Papers/Patents about Yohimbine?
      • What are the side effects of Yohimbine?
      • Where can I order Yohimbine?
      • What are the physicochemical properties?
      • What are the associated metabolic pathways?
      • Different synonyms of Yohimbine?
      • Are there side effects with Yohimbine?
      • ChemSpider links all of this information and more
  • 23. RSC Databases are Integrated
  • 24. RSC Journals are Integrated
  • 26. Google Books are Integrated
  • 27. And so are…
      • Chemical vendors
      • Safety and Toxicity information
      • Experimental and Predicted properties
      • Analytical data
      • Images and Movies
      • And all for free…
  • 29. Not only compounds but syntheses
  • 31. The world can take and contribute
      • Scientists can deposit their data
      • They can annotate and curate
      • They can download data
      • They can embed data in the social network
      • They can integrate and connect
  • 32. Integrate to electronic lab notebooks
  • 33. Integrate to electronic lab notebooks
  • 34. Integrate to instruments and software
      • Primary analytical instrumentation vendors integrate
          • Agilent, Bruker, Thermo, Waters
      • Cheminformatics vendors link to ChemSpider
          • Accelrys, ACD/Labs, ChemAxon, iChemLabs
  • 35. Publications are a summary of work
      • Scientific publications are a summary of work
      • Is all work reported?
      • How much science is lost to pruning?
      • What of value sits in notebooks and is lost?
      • How much data is lost?
      • How many compounds never reported?
      • How many syntheses fail or succeed?
      • How many characterization measurements?
  • 36. What if we could capture it all?
  • 37. Start with data in publications
  • 38. But in the time of Big Data…it’s linked!
  • 39. ONE example – data for life sciences
      • What’s the structure?
      • Are they in our file?
      • What’s similar?
      • What’s the target?
      • Pharmacology data?
      • Known Pathways?
      • Working On Now?
      • Connections to disease?
      • Expressed in right cell type?
      • Competitors?
      • IP?
  • 40. Crowdsourcing across drug discovery
      • Open PHACTS: partnership between European Community and European Pharma Companies
      • 22 partners, 8 pharmaceutical companies, 3 biotechs working together for 3 years
      • Freely accessible for knowledge discovery and verification.
      • Data on chemistry and biology
      • Pharmacological profiles
      • Proprietary and public data sources.
  • 42. All that glisters is not gold…
  • 43. Crowdsourced Assertions
      • The future of publishing will include generation and consumption of “nanopublications”
      • http://www.nanopub.org/
  • 45. So what’s the business model?
      • Decisions are based on data
      • Publications encapsulate, reference and link data
      • More data is free and open. More services and APIs allow access, free or for a fee. Ask Google
      • The large-scale licensed content business model is at risk without interfaces to integrate and mine
  • 46. Acknowledgments
      • The RSC ChemSpider team
      • Our users, our depositors, our curators
      • GGA Software Services, OpenEye, ACD/Labs and a lot of Open Source code!
      • And Al Gore for supporting the internet http://en.wikipedia.org/wiki/Al_Gore_and_information_techn
  • 47. Thank you
      • Email: williamsa@rsc.org
      • Twitter: ChemConnector
      • Personal Blog: www.chemconnector.com
      • SLIDES: www.slideshare.net/AntonyWilliams