In less than a decade the internet has provided us access to enormous quantities of chemistry data. Chemists have embraced the web as a rich source of data and knowledge. However, all that glisters …
In less than a decade the internet has provided us access to enormous quantities of chemistry data. Chemists have embraced the web as a rich source of data and knowledge. However, all that glisters is not gold and while online searches can now provide us access to information associated with many tens of millions of chemicals, can allow us to traverse patents, publications and public domain databases the promise of high quality data on the web needs to be tempered with caution.
In recent years the crowdsourcing approach to developing curated content has been growing. Can such approaches allow us to bring to bear the collective wisdom of the crowd to validate and enhance the availability of trusted chemistry data online or are algorithms likely to be more powerful in terms of validating data? While it is now possible to search the web using a query language form natural to chemists – that of “structure searching the web” - increasingly scientists are likely going to have to accept joint responsibility for the quality of data online for the foreseeable future. Their participation is likely to come through engaging in open science, the provision of data under open licenses and by offering their skills to the community.
This presentation will provide an overview of the present state of chemistry data online, the challenges and risks of managing and accessing data in the wild and how an internet for chemistry continues to expand in scope and possibilities.