Slides from talk for ICZN/SHNH symposium in honour of Charles Sherbon: "Anchoring Biodiversity Information: From Sherborn to the 21st Century and Beyond"
ASIDIC Spring Meeting, Las Vegas, NV March 2008. This presentation presents the challenges for the researcher, publisher and the library as their collective worlds are constantly colliding as the need to be published, to maintain the subscription model
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can EditDario Taraborelli
Slides for my September 23 talk on Wikidata and WikiCite – NIH Frontiers in Data Science lecture series.
Persistent URL: https://dx.doi.org/10.6084/m9.figshare.3850821
Supporting the exploding dimensions of the chemical sciences via global netwo...Valery Tkachenko
The Royal Society of Chemistry is building is a comprehensive federated platform for chemical informatics in a Big Research Data world. The resulting platform is a blend of social, informatics and knowledge components which itself produces new dimensions in the chemical sciences to support activities such as Open Innovation and sharing of data. The platform itself would be isolated and insular unless a broad collaboration between societies, industry and universities is created in a federated and open way. In this presentation we will talk about one of these efforts, between RSC and Moscow State University, to facilitate the development, population and use of a global networking platform for the chemical sciences.
Open Access for Early Career ResearchersRoss Mounce
My talk for the University of Bath Open Access Week session; 23rd October 2013.
http://www.bath.ac.uk/learningandteaching/rdu/courses/pgskills/modules/RP00335.htm
Linked Open Data and Systematic TaxonomyJoel Richard
A short talk in which I briefly discuss the Smithsonian Libraries' plans for Linked Open Data related to our Taxonomic Literature II and Index Animalium digitization projects.
Slides from talk for ICZN/SHNH symposium in honour of Charles Sherbon: "Anchoring Biodiversity Information: From Sherborn to the 21st Century and Beyond"
ASIDIC Spring Meeting, Las Vegas, NV March 2008. This presentation presents the challenges for the researcher, publisher and the library as their collective worlds are constantly colliding as the need to be published, to maintain the subscription model
Wikidata: Verifiable, Linked Open Knowledge That Anyone Can EditDario Taraborelli
Slides for my September 23 talk on Wikidata and WikiCite – NIH Frontiers in Data Science lecture series.
Persistent URL: https://dx.doi.org/10.6084/m9.figshare.3850821
Supporting the exploding dimensions of the chemical sciences via global netwo...Valery Tkachenko
The Royal Society of Chemistry is building is a comprehensive federated platform for chemical informatics in a Big Research Data world. The resulting platform is a blend of social, informatics and knowledge components which itself produces new dimensions in the chemical sciences to support activities such as Open Innovation and sharing of data. The platform itself would be isolated and insular unless a broad collaboration between societies, industry and universities is created in a federated and open way. In this presentation we will talk about one of these efforts, between RSC and Moscow State University, to facilitate the development, population and use of a global networking platform for the chemical sciences.
Open Access for Early Career ResearchersRoss Mounce
My talk for the University of Bath Open Access Week session; 23rd October 2013.
http://www.bath.ac.uk/learningandteaching/rdu/courses/pgskills/modules/RP00335.htm
Linked Open Data and Systematic TaxonomyJoel Richard
A short talk in which I briefly discuss the Smithsonian Libraries' plans for Linked Open Data related to our Taxonomic Literature II and Index Animalium digitization projects.
This is a presentation given to the Royal Society General Assembly in Birmingham on November 20th 2009. This covers the present status and future vision for ChemSpider
The Project TIER Dataverse: Archiving and Sharing Replicable Student Research...datascienceiqss
Richard Ball and Norm Medeiros will demonstrate how Dataverse is used within their Project TIER (Teaching Integrity in Empirical Economics) initiative to organize and showcase student work for transparency and reproducibility. Richard and Norm will discuss the prospect of extending Dataverse to serve as a resource for the Project TIER network of institutions and instructors.
Open scholarship [a FOSTER open science talk]Ross Mounce
A talk by Dr Ross Mounce, given at the FOSTER Open Science event 4th September, King's College London http://www.fosteropenscience.eu/event/foster-discovering-open-practices-pgr-and-early-career-researchers-0
The Internet is the world’s publicly accessible container for a myriad of resources containing chemistry related data. Whether it be collections of millions of chemical compounds with their associated properties, interactive displays for analytical data, access to publications and patents or tapping into the increasing availability of online computational engines, the web has became the primary enabling technology to source information and data. Scientists collectively applaud and utilize the availability of such resources and an increasing proportion of the community are willing to support these resources by contributing both their data and skills to help curate and validate information on the web. This “crowdsourcing” has started to contribute large amounts of data to the commons and serves has a valuable platform for reference and, potentially, discovery.
ChemSpider is one of the chemistry community’s primary online resources and allows scientists to search across 25 million unique chemical compounds linked out to over 400 original data sources and has become a central hub for searching for chemistry-related data. The platform however offers much more to the community and has become a central repository for analytical data, specifically spectra, is a host for community-authored chemical syntheses and facilitates data curation and annotation by any of its users. This presentation will provide an overview of the ChemSpider platform in terms of available data and its efforts to act as a public repository and clearing ground for data curation. We will discuss how such a platform, when coupled with game-based approaches, facilitates both teaching and data validation and will discuss whether public domain resources such as ChemSpider will ultimately become authorities for chemistry.
This is a presentation I gave at the Library of Congress as part of a NFAIS/FLICC/CENDI meeting as outlined here: http://www.chemspider.com/blog/making-the-web-work-for-science-presentation-at-the-library-of-congress.html
The presentation provides an overview of some of the challenges the publishers face moving forward, how they are responding to it, how InChI is an enabling technology, how quality is important.
The increasing availability of free and open access resources for scientists on the internet presents us with a revolution in data availability. The Royal Society of Chemistry hosts ChemSpider, a free access website for chemists built with the intention of building community for chemists (http://www.chemspider.com/).
ChemSpider is an aggregator of chemistry related information, at present over 20 million unique chemical entities linked out to over 300 separate data sources, ChemSpider has taken on the task of both robotically and manually curating publicly available data sources. It is also a public deposition platform where chemists can deposit their own data including novel structures, analytical data, synthesis procedures and host data associated with the growing activities associated with Open Notebook Science.
This presentation will examine chemistry on the internet, the dubious quality of what is available and how the ChemSpider crowdsourced curation platform is fast becoming one of the centralized hubs for resourcing information about chemical entities.
We will also review our efforts to provide free resources for synthesis procedures, spectral data and structure-based searching of the chemistry literature and how chemists can contribute directly to each of these projects.
This is a presentation given to the Royal Society General Assembly in Birmingham on November 20th 2009. This covers the present status and future vision for ChemSpider
The Project TIER Dataverse: Archiving and Sharing Replicable Student Research...datascienceiqss
Richard Ball and Norm Medeiros will demonstrate how Dataverse is used within their Project TIER (Teaching Integrity in Empirical Economics) initiative to organize and showcase student work for transparency and reproducibility. Richard and Norm will discuss the prospect of extending Dataverse to serve as a resource for the Project TIER network of institutions and instructors.
Open scholarship [a FOSTER open science talk]Ross Mounce
A talk by Dr Ross Mounce, given at the FOSTER Open Science event 4th September, King's College London http://www.fosteropenscience.eu/event/foster-discovering-open-practices-pgr-and-early-career-researchers-0
The Internet is the world’s publicly accessible container for a myriad of resources containing chemistry related data. Whether it be collections of millions of chemical compounds with their associated properties, interactive displays for analytical data, access to publications and patents or tapping into the increasing availability of online computational engines, the web has became the primary enabling technology to source information and data. Scientists collectively applaud and utilize the availability of such resources and an increasing proportion of the community are willing to support these resources by contributing both their data and skills to help curate and validate information on the web. This “crowdsourcing” has started to contribute large amounts of data to the commons and serves has a valuable platform for reference and, potentially, discovery.
ChemSpider is one of the chemistry community’s primary online resources and allows scientists to search across 25 million unique chemical compounds linked out to over 400 original data sources and has become a central hub for searching for chemistry-related data. The platform however offers much more to the community and has become a central repository for analytical data, specifically spectra, is a host for community-authored chemical syntheses and facilitates data curation and annotation by any of its users. This presentation will provide an overview of the ChemSpider platform in terms of available data and its efforts to act as a public repository and clearing ground for data curation. We will discuss how such a platform, when coupled with game-based approaches, facilitates both teaching and data validation and will discuss whether public domain resources such as ChemSpider will ultimately become authorities for chemistry.
This is a presentation I gave at the Library of Congress as part of a NFAIS/FLICC/CENDI meeting as outlined here: http://www.chemspider.com/blog/making-the-web-work-for-science-presentation-at-the-library-of-congress.html
The presentation provides an overview of some of the challenges the publishers face moving forward, how they are responding to it, how InChI is an enabling technology, how quality is important.
The increasing availability of free and open access resources for scientists on the internet presents us with a revolution in data availability. The Royal Society of Chemistry hosts ChemSpider, a free access website for chemists built with the intention of building community for chemists (http://www.chemspider.com/).
ChemSpider is an aggregator of chemistry related information, at present over 20 million unique chemical entities linked out to over 300 separate data sources, ChemSpider has taken on the task of both robotically and manually curating publicly available data sources. It is also a public deposition platform where chemists can deposit their own data including novel structures, analytical data, synthesis procedures and host data associated with the growing activities associated with Open Notebook Science.
This presentation will examine chemistry on the internet, the dubious quality of what is available and how the ChemSpider crowdsourced curation platform is fast becoming one of the centralized hubs for resourcing information about chemical entities.
We will also review our efforts to provide free resources for synthesis procedures, spectral data and structure-based searching of the chemistry literature and how chemists can contribute directly to each of these projects.
This was a presentation I gave to an audience at Nature Publishing Group in New York on May 7th 2009. It's a long presentation and over an hour in length. Not much new here relative to other presentations...just a knitting together of many of the others on here.
There is an increasing availability of free and open access resources for scientists to use on the internet. Coupled with an increasing number of Open Source software programs we are in the middle of a revolution in data availability and tools to manipulate these data. ChemSpider is a free access website built with the intention of providing a structure centric community for chemists. As an aggregator of chemistry related information from many sources, at present over 21.5 million unique chemical entities from over 190 separate data sources, ChemSpider has taken on the task of both robotically and manually integrating and curating publicly available data sources. ChemSpider has also provided an environment for users to deposit, curate and annotate chemistry-related information. This has allowed the community to enhance ChemSpider by adding analytical data, associating synthetic pathways and publications and connecting to social networking resources. I will discuss how ChemSpider is fast becoming the premier curated platform and centralized hub for resourcing information about chemical entities and how the platform provides the foundation data for services allowing the analysis of analytical data and collaborative science.
The presentation of ChemSpider was to a groub of science librarians, specifically chemistry librarians, and was meant to provide an overview of the platform and answer the question posed: What is the difference between ChemSpider, CAS Scifinder and Reaxys.
The ChemSpider database is a resource hosted by the Royal Society of Chemistry. With over 28 million unique chemicals on the database linked out to over 400 data sources the platform provides access to experimental and predicted data (properties, spectra etc.), links to publications, patents and a myriad of other resources. The ChemSpider database has been used as the foundation of a number of other resources for chemists including ChemSpider SyntheticPages, the Learn Chemistry Wiki and the Spectral Game. This presentation will provide an overview of ChemSpider and discuss how chemists can both derive value from and contribute to the content available from the database and its related resources. We will also discuss our view of future platform for managing personal, institutional and public chemistry in a shared environment.
These are the slides I will be giving here at the Science Commons Symposium Pacific Northwest at the Microsoft Campus here in Redmond in about 5 minutes time
ChemSpider is a free access website for chemists built with the intention of providing a structure centric community for chemists. It was developed to index available sources of chemical structures and their associated data into a single searchable repository and making it available to everybody, at no charge. While there are a large number of databases containing chemical compounds and data available online their inherent quality, accuracy and completeness is severely lacking. ChemSpider has provided a platform so that the chemistry community could contribute to improving the quality of data online and expanding the information to include data such as reaction syntheses, analytical data, experimental properties and linkages to other valuable resources. It has grown into a resource containing over 21 million unique chemical structures from over 200 data sources.
This presentation will provide an overview of ChemSpider and its value to chemists as a search tool, as a public repository of information and how it can become one of the primary foundations of internet-based chemistry. I will also discuss the vision for ChemSpider and some of the lofty goals we are setting for the system moving forward.
ChemSpider was developed with the intention of aggregating and indexing available sources of chemical structures and their associated information into a single searchable repository and making it available to everybody, at no charge. There are many tens of chemical structure databases such as literature data, chemical vendor catalogs, molecular properties, environmental data, toxicity data, analytical data etc. and no single way to search across them. Despite the diversity of databases available online their inherent quality, accuracy and completeness is lacking in many regards. ChemSpider was established to provide a platform whereby the chemistry community could contribute to cleaning up the data, improving the quality of data online and expanding the information available to include data such as reaction syntheses, analytical data and experimental properties. ChemSpider has now grown into a database of over 20 million chemical substances integrated with over 300 disparate data sources, many of these directly supporting the Life Sciences. This presentation will provide an overview of our efforts to improve the quality of data online, to provide a foundation for the semantic web for chemistry and to provide access to a set online tools and services to support access to these data. I will also discuss how ChemSpider is being used to enhance Semantic Publishing in Chemistry at RSC.
This is a presentation given in Track 4, Open Access and Cheminformatics, at the Bio-IT Meeting in Boston on April 21st 2010. It is a general overview of ChemSpider activities to link together the internet for chemists and validate and curate data. We won the Bio-IT Best Practices Community Service Award that evening also.
ChemSpider is a free access website for chemists built with the vision of providing a structure centric community for chemists. Vision is great…execution is better. ChemSpider is now one of the internet’s primary portals for chemistry offering access to over 23 million unique chemical structures from over 200 data sources and expanding daily. Even though there are tens if not hundreds of chemical structure databases such as literature data, chemical vendor catalogs, molecular properties, environmental data, toxicity data, analytical data etc. there has been no single way to search across them. Despite the fact that there are a large number of databases containing chemical compounds and data available online their inherent quality, accuracy and completeness remains lacking in many regards. With ChemSpider we have provided a platform whereby the chemistry community could contribute to cleaning up the data, improving the quality of data online and expanding the information available to include data such as reaction syntheses, analytical data, experimental properties and linking to other valuable resources.
This presentation will provide an overview of ChemSpider and its value to chemists as a search tool, as a public repository of information and how it can become one of the primary foundations of internet-based chemistry. I will also discuss the vision for ChemSpider and some of the exciting goals we are setting for the system moving forward.
This is a presentation I gave at the FDA on December 1st 2009 in Wahington DC as part of a symposium involving PubChem, ChemIDPLus, PillBox, DailyMed and other related systems. The focus was, as usual, on the quality of data online and how to clean up the information and with a specific focus on the quality of data on the FDA's DailyMed and our efforts to apply semantic markup to the DailyMed articles
The internet has provided access to unprecedented quantities of data. In the domain of chemistry specifically over the past decade the web has become populated with tens of millions of chemical structures and related properties of assays together with tens of thousands of spectra and syntheses. The data have, to a large extent, remained disparate and disconnected. In recent years with the wave of Web 2.0 participation any chemist can contribute to both the sharing and validation of chemistry-related data whether it be via Wikipedia, the online encyclopedia, or one of the multiple public compound databases. The presentation will offer a perspective of what is available today, our experiences of building a public compound database to link together the internet and a suggested path forward for enabling even greater integration and connectivity for chemistry data for the masses to both use and participate in developing.
This is a general presentation about our efforts to build an internet based community for chemists using ChemSpider. A general overview of data quality online, crowdsourced deposition and curation and our progress to deliver a solution to the community for resourcing data.
Educators and students now have access to rich internet resources of information. RSC’s ChemSpider is a community resource of structure-based chemistry delivering data including chemical compound collections, reaction synthesis procedures, physicochemical property and various forms of spectral data. ChemSpider offers the opportunity for the community to participate in populating, annotating and curating the data on ChemSpider. We believe that ChemSpider offers an opportunity for educators and students to participate in the ongoing development of a rich resource for the chemistry community. This presentation will suggest some potential uses of the ChemSpider website in terms of integrating into lesson plans. We will also outline how students can expose their structure and reaction-based research work via the ChemSpider platform for the benefit of the community and their online scientific reputation.
With an intention to provide a high quality free internet resource of chemistry related data for the community, ChemSpider has aggregated almost 25 million compounds linked out to over 400 data sources and provided a platform for the community to both deposit and curate data. This experiment in crowdsourcing for chemistry has now been running for over three years. This presentation will review a number of aspects of the project including (a) the level of community participation in depositing and curating data; (b) the nature of data and content supplied by the community; (c) how ChemSpider is used by the community; (d) using game-based systems to assist in data curation; (e) algorithmic-based approaches to data validation and filtering; and (f) sharing data curation efforts with other online databases.
The original abstract for the talk is below BUT the talk changed based on a big interest in InChI and the possibilities to use in a Semantic Web for Chemistry
The increasing availability of free and open access resources for scientists on the internet presents us with a revolution in data availability. However, freedom costs and in many cases the cost is quality. ChemSpider is a free access website for chemists built with the intention of providing a structure centric community for chemists. As an aggregator of chemistry related information from many sources, at present over 21.5 million unique chemical entities from over 150 separate data sources, ChemSpider has taken on the task of both robotically and manually curating publicly available data sources. This presentation will provide an overview of how a curated platform can become the centralized hub for resourcing information about chemical entities. We will also present ChemMantis, an entity extraction platform for extracting chemical names and scientific terms in documents and providing a platform for structure-based searching of Open Access chemistry literature.
There is an increasing availability of free and open access resources for scientists to use on the internet. Coupled with the increasing availability of Open Source software tools we are in the middle of a revolution in data availability and tools to manipulate these data. ChemSpider is a free access website for chemists built with the intention of providing a structure centric community for chemists. As an aggregator of chemistry related information from many sources, at present over 21.5 million unique chemical entities from over 200 separate data sources, ChemSpider has taken on the task of both robotically and manually curating publicly available data sources. This presentation will provide an overview of the ChemSpider platform and how it is fast becoming the centralized hub for resourcing information about chemical entities.
This is a "vendor presentation" that I gave in New Orleans on June 14th 2010. It is a very general presentation on ChemSpider relevant to the needs of librarians.
Similar to Checking, Curating And Qualifying Chemistry (20)
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.