SlideShare a Scribd company logo
1 of 35
Download to read offline
Consultancy Services
Informatics Pick-n-Mix
Erin Bolstad
Global Consulting Team
Skills, Resources, Experience:
● Project Management
● Solutions/packaging
● Customizations/Integration of
existing/3rd party software
● Rich scientific background
● Software design and development
(Java, Groovy, JS, .NET, mobile, etc.)
● Toolkit development (ETL, DSL, API,
ORM, etc)
Consultants
AppScis
Dedicated consulting
developers
Consultants
AppScis
Developers (100+)
Dedicated consulting
developers
Hungary
USA
Czech
Republic
Argentina
Project Examples
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
Project Examples
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized developer and end-user training
Project Examples
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized solutions: thin clients, integration of existing and in-house tech
● Customized developer and end-user training
DuPont Doc2DB Database
● System to trawl DuPont ELN + stored documents (PDF, Doc, etc)
● WebApp for complex queries
● Based on Doc2DB technology
Novartis Reactions Database
● Reactions data warehouse
o Import from various legacy databases (ISIS)
o Live feed from CambridgeSoft ELN
● Database migration: ISIS -> ChemAxon
● CambridgeSoft ELN chemistry -> ChemAxon
● AJAX-style web application providing query and filtering capabilities
● Web services interface for fitting into SOA
Project Examples
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized solutions: thin clients, integration of existing and in-house tech
● Spotfire integration (GSK)
● Customized developer and end-user training
Project Examples
● Customized project incubator
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized solutions: thin clients, integration of existing and in-house tech
● Spotfire integration (GSK)
● Customized developer and end-user training
Customized Product Incubator
Compound Registration
● Customizable business rules
○ Tautomers, solvents, isotopes,
formulations, alternate
identifications
● Salt and solvent multiplicity
● Templated bulk registrations
● API for updating downstream
processes, integrating to other
products (like ELN)
● Sits on database with thin client
interface - IT required
● Enterprise level: 1000’s of users
MiniReg
Customized Product Incubator
● Simple compound registration system sitting
within IJC. No IT overhead or support needed.
● Chemistry handling
○ Single salts
○ Batches
○ Samples Properties
● Biology assay handling
○ Aggregation at db level
● Highly customizable at database level
● Small - medium companies
(currently in ~10 companies)
Project Examples
● Customized project incubator
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized solutions: thin clients, integration of existing and in-house tech
● Spotfire integration (GSK)
● Customized developer and end-user training
● Large scale global project management
Large Scale PM for Global Pharma
● GSK: IJC rollout as a global reporting tool, assisted/trained,
customization of IJC, additional admin software creation. (3+ yrs)
● BMS: IJC customization/development for global roll-out needs,
training, migration of data, consulting on data-mart integration. Thin
client development. (2+ yrs)
Project Examples
● Customized project incubator
● Migrations: database migration, PM of customized development, form and data-access design,
customized training.
● Customized solutions: thin clients, integration of existing and in-house tech
● Spotfire integration (GSK)
● Customized components of large initiatives (OIDD, IMI)
● Customized developer and end-user training
● Large scale global project management
European Lead Factory is part of IMI
● Innovative Medicines Initiative
● http://www.imi.europa.eu/
IMI supports collaborative research
projects and builds networks of
industrial and academic experts in
order to boost pharmaceutical
innovation in Europe.
Part of Framework 7
Approx 25 projects covering many
aspects of healthcare, also
including:
● eTox
● Open PHACTS
European Lead Factory
EFPIA members
AZ
Bayer
Lundbeck
Janssen
Merck KGaA
Sanofi
UCB
SME members
BioAscent
ChemAxon
Edelris
GABO:mi
Lead Discovery Center
Merachem
Pivot Park Screening
Centre
Sygnature Discovery
Syncom
Taros Chemicals
TI Pharma
Academic members
University of Dundee
Leiden University
Max Planck Institute of Molecular Physiology
Netherlands Cancer Institute
Radbound University, Nijmegen
University of Groningen
Technical University of Denmark
University of Duisburg-Essen
University of Leeds
University of Nottingham
VU University Amsterdam
An alliance of participants collaborating on
identifying novel leads
ELF philosophy
Novel
Leads
Novel
Compounds
300,000 from EFPIA
members internal
collections
200,000 from newly
synthesized libraries
Novel
Screens
13 Work packages
1. Programme Recruitment HTS
2. Compound Logistics
3. Assay Development
4. High Throughput Screening
5. Hit Characterisation
6. Medicinal Chemistry
7. Publication Policy / External Relations
8. Information Technology
9. Sourcing chemical library proposals
10. Review/selection of chemical library proposals
11. Experimental Validation
12. Library Production
13. Project Management
Library generation workflow
Crowdsourcing Consortium participant
Library selection committee
Assessed on:
1. Molecular properties
2. Structural features
3. Novelty
4. Diversity potential
5. Synthetic tractability
6. Innovative design
Experimental validation Design optimisation
Library production
Architecture
2 x Intel(R) Xeon(R) CPU E5-2620 v2 @ 2.10GHz (2 x 6 core + HT)
64GB RAM
Tomcat 7 MySQLFilesystem
Web App
Web
Browser Workflow engine
Job queuing system
Document store
Report generation
Automated email notification
Role management
Dynamically generated UI
Workflows and Roles
Three workflows for different types of user:
1. Consortium member
2. Non-EU/EAA citizen
3. EU/EAA citizen
Supported by workflow engine
Different levels of access
1. Submitter
2. LSC member
3. LSC Chair
4. ….
UI dynamically generated to ensure that you
only see what you’re supposed to see and
only do what you’re supposed to do
Step 1: registration
Users register themselves and
provide personal and affiliation
details
Step 2: enter library
Define your library in one of three ways:
1. Upload SDF
2. Upload MRV with Markush library
3. Pick R-groups from within pre-defined lists
and Markush Enumeration is used to
generate the library
Sketch scaffold with Marvin JS
Give it a title
Provide accompanying info including rationale and
synthesis validation info
Step 3: initial property
calculations
SSS for scaffold against reference set of 12M
structures using JChem Search
Exact match search for enumerated structures
against reference set of 12M structures
Property distributions of enumerated structures.
Properties generated with Chemical Terms
expressions with Calculator Plugins
Checking enumerated structures against UK MDA
legislation database from Compliance Checker
Matching enumerated structures against sets of
SMARTS filters provided by EFPIA members using
Chemical Terms match() and matchCount()
functions
Step 4: submit library
Further round of property calculations performed
and library now visible to Library Selection
Committee for consideration
At this stage user has handed over ownership of the
library idea to the European Lead Factory
Step 5: final property calculations
ECFP4 fingerprint
comparison with 12M
structures from reference
set
Fuzzy Pharmacophore fingerprint
comparison with 12M structures
from reference set
Generate histogram of most similar
structure in reference set for each
compound in the library.
~1000 x 12M x 2 comparisons.
“Old” molecular descriptor tables:
● For each search read descriptors from
database -> very IO bound.
● Single threaded.
● -> days for completion
“New” fast similarity search:
● Descriptors generated as binary file.
● Read into memory once
● Search fully multi-threaded.
● -> minutes for completion
● (1M x 1M completes in ~25 min)
Step 6: assessment
Approve Reject
Refine
Library Selection Committee
Status summary
"The Library Selection Committee has been using the web tool since the
start of 2014 to support the selection of library proposals. Since then, well
over 100 library proposals have been considered, around 80 of which have
been approved for synthetic validation. The procedure for assessing and
processing the proposals has been straightforward. The tool has allowed
the proposals to be handled confidentially and to be assessed rigorously; it
has saved huge time for both library proposal submitters and members of
the Library Selection Committee."
Adam Nelson
Chair Library Selection Committee
ChemAxon and Open Innovation
Professional Services (BINGO!)
● Product development
● Product customization
● Product Integration
● Workflow design/management
● Project management
● Creative solutions

More Related Content

What's hot

Making Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterMaking Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterEDB
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?andrea huang
 
Performance analysis of MongoDB and HBase
Performance analysis of MongoDB and HBasePerformance analysis of MongoDB and HBase
Performance analysis of MongoDB and HBaseSindhujanDhayalan
 
Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformGlobus
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...Peter Löwe
 
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recallICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recallDr. Haxel Consult
 

What's hot (9)

Making Postgres Central in Your Data Center
Making Postgres Central in Your Data CenterMaking Postgres Central in Your Data Center
Making Postgres Central in Your Data Center
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?
 
Performance analysis of MongoDB and HBase
Performance analysis of MongoDB and HBasePerformance analysis of MongoDB and HBase
Performance analysis of MongoDB and HBase
 
Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus Platform
 
HydraFS
HydraFSHydraFS
HydraFS
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...
 
Nosql databases
Nosql databasesNosql databases
Nosql databases
 
NISO Webinar: Metadata for Preservation: A Digital Object's Best Friend
NISO Webinar: Metadata for Preservation: A Digital Object's Best Friend NISO Webinar: Metadata for Preservation: A Digital Object's Best Friend
NISO Webinar: Metadata for Preservation: A Digital Object's Best Friend
 
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recallICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
 

Similar to USUGM 2014 - Erin Bolstad (ChemAxon): Consultancy report - New capabilities and reviewing some major projects

EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...
EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...
EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...ChemAxon
 
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...Alex Zeltov
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesMongoDB
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedRobert Grossman
 
Iochem.carles bo
Iochem.carles boIochem.carles bo
Iochem.carles bomaredata
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformYaoyu Wang
 
Engineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsEngineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsHisham Arafat
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation WorkflowsSCAPE Project
 
Data Science with the Help of Metadata
Data Science with the Help of MetadataData Science with the Help of Metadata
Data Science with the Help of MetadataJim Dowling
 
Solution Manager in Denodo Platform 7.0: Admin Made Simple
Solution Manager in Denodo Platform 7.0: Admin Made SimpleSolution Manager in Denodo Platform 7.0: Admin Made Simple
Solution Manager in Denodo Platform 7.0: Admin Made SimpleDenodo
 
Ict uses in libraries
Ict uses in librariesIct uses in libraries
Ict uses in librariesLiaquat Rahoo
 
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryMarcus Hanwell
 
C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1Bill Liu
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNeo4j
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019VMware Tanzu
 

Similar to USUGM 2014 - Erin Bolstad (ChemAxon): Consultancy report - New capabilities and reviewing some major projects (20)

EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...
EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...
EUGM 2014 - Erin Bolstad (ChemAxon): An Introduction to Global Consulting Ser...
 
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...Im symposium presentation -  OCR and Text analytics for Medical Chart Review ...
Im symposium presentation - OCR and Text analytics for Medical Chart Review ...
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
Iochem.carles bo
Iochem.carles boIochem.carles bo
Iochem.carles bo
 
RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
 
Service Integration to Enhance RDM
Service Integration to Enhance RDMService Integration to Enhance RDM
Service Integration to Enhance RDM
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud Platform
 
Engineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platformsEngineering patterns for implementing data science models on big data platforms
Engineering patterns for implementing data science models on big data platforms
 
Scalable Preservation Workflows
Scalable Preservation WorkflowsScalable Preservation Workflows
Scalable Preservation Workflows
 
Data Science with the Help of Metadata
Data Science with the Help of MetadataData Science with the Help of Metadata
Data Science with the Help of Metadata
 
Solution Manager in Denodo Platform 7.0: Admin Made Simple
Solution Manager in Denodo Platform 7.0: Admin Made SimpleSolution Manager in Denodo Platform 7.0: Admin Made Simple
Solution Manager in Denodo Platform 7.0: Admin Made Simple
 
Ict uses in libraries
Ict uses in librariesIct uses in libraries
Ict uses in libraries
 
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
 
C19013010 the tutorial to build shared ai services session 1
C19013010  the tutorial to build shared ai services session 1C19013010  the tutorial to build shared ai services session 1
C19013010 the tutorial to build shared ai services session 1
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4j
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019
AI on Greenplum Using
 Apache MADlib and MADlib Flow - Greenplum Summit 2019
 
Summit2013 eventos onto quad
Summit2013   eventos onto quadSummit2013   eventos onto quad
Summit2013 eventos onto quad
 

More from ChemAxon

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?ChemAxon
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemAxon
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive modelsChemAxon
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...ChemAxon
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data managementChemAxon
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseChemAxon
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation ChemAxon
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...ChemAxon
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...ChemAxon
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryChemAxon
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...ChemAxon
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloudChemAxon
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationChemAxon
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction ChemAxon
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...ChemAxon
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology ChemAxon
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem MicroservicesChemAxon
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choralChemAxon
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5ChemAxon
 

More from ChemAxon (20)

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 

Recently uploaded

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsJean Silva
 
Advantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxAdvantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxRTS corp
 
Effort Estimation Techniques used in Software Projects
Effort Estimation Techniques used in Software ProjectsEffort Estimation Techniques used in Software Projects
Effort Estimation Techniques used in Software ProjectsDEEPRAJ PATHAK
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics
 
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...Bert Jan Schrijver
 
Understanding Plagiarism: Causes, Consequences and Prevention.pptx
Understanding Plagiarism: Causes, Consequences and Prevention.pptxUnderstanding Plagiarism: Causes, Consequences and Prevention.pptx
Understanding Plagiarism: Causes, Consequences and Prevention.pptxSasikiranMarri
 
Key Steps in Agile Software Delivery Roadmap
Key Steps in Agile Software Delivery RoadmapKey Steps in Agile Software Delivery Roadmap
Key Steps in Agile Software Delivery RoadmapIshara Amarasekera
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Mastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxMastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxAS Design & AST.
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?Alexandre Beguel
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesVictoriaMetrics
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shardsChristopher Curtin
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfPros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfkalichargn70th171
 
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdfAndrey Devyatkin
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolsosttopstonverter
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogueitservices996
 
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfEnhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfRTS corp
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorTier1 app
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 

Recently uploaded (20)

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero results
 
Advantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptxAdvantages of Cargo Cloud Solutions.pptx
Advantages of Cargo Cloud Solutions.pptx
 
Effort Estimation Techniques used in Software Projects
Effort Estimation Techniques used in Software ProjectsEffort Estimation Techniques used in Software Projects
Effort Estimation Techniques used in Software Projects
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
 
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
 
Understanding Plagiarism: Causes, Consequences and Prevention.pptx
Understanding Plagiarism: Causes, Consequences and Prevention.pptxUnderstanding Plagiarism: Causes, Consequences and Prevention.pptx
Understanding Plagiarism: Causes, Consequences and Prevention.pptx
 
Key Steps in Agile Software Delivery Roadmap
Key Steps in Agile Software Delivery RoadmapKey Steps in Agile Software Delivery Roadmap
Key Steps in Agile Software Delivery Roadmap
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Mastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptxMastering Project Planning with Microsoft Project 2016.pptx
Mastering Project Planning with Microsoft Project 2016.pptx
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 Updates
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdfPros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
Pros and Cons of Selenium In Automation Testing_ A Comprehensive Assessment.pdf
 
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf
2024-04-09 - From Complexity to Clarity - AWS Summit AMS.pdf
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration tools
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogue
 
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdfEnhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
Enhancing Supply Chain Visibility with Cargo Cloud Solutions.pdf
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryError
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 

USUGM 2014 - Erin Bolstad (ChemAxon): Consultancy report - New capabilities and reviewing some major projects

  • 2. Global Consulting Team Skills, Resources, Experience: ● Project Management ● Solutions/packaging ● Customizations/Integration of existing/3rd party software ● Rich scientific background ● Software design and development (Java, Groovy, JS, .NET, mobile, etc.) ● Toolkit development (ETL, DSL, API, ORM, etc) Consultants AppScis Dedicated consulting developers Consultants AppScis Developers (100+) Dedicated consulting developers Hungary USA Czech Republic Argentina
  • 3. Project Examples ● Migrations: database migration, PM of customized development, form and data-access design, customized training.
  • 4. Project Examples ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized developer and end-user training
  • 5. Project Examples ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized solutions: thin clients, integration of existing and in-house tech ● Customized developer and end-user training
  • 6. DuPont Doc2DB Database ● System to trawl DuPont ELN + stored documents (PDF, Doc, etc) ● WebApp for complex queries ● Based on Doc2DB technology
  • 7.
  • 8.
  • 9. Novartis Reactions Database ● Reactions data warehouse o Import from various legacy databases (ISIS) o Live feed from CambridgeSoft ELN ● Database migration: ISIS -> ChemAxon ● CambridgeSoft ELN chemistry -> ChemAxon ● AJAX-style web application providing query and filtering capabilities ● Web services interface for fitting into SOA
  • 10.
  • 11. Project Examples ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized solutions: thin clients, integration of existing and in-house tech ● Spotfire integration (GSK) ● Customized developer and end-user training
  • 12.
  • 13.
  • 14. Project Examples ● Customized project incubator ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized solutions: thin clients, integration of existing and in-house tech ● Spotfire integration (GSK) ● Customized developer and end-user training
  • 15. Customized Product Incubator Compound Registration ● Customizable business rules ○ Tautomers, solvents, isotopes, formulations, alternate identifications ● Salt and solvent multiplicity ● Templated bulk registrations ● API for updating downstream processes, integrating to other products (like ELN) ● Sits on database with thin client interface - IT required ● Enterprise level: 1000’s of users
  • 16. MiniReg Customized Product Incubator ● Simple compound registration system sitting within IJC. No IT overhead or support needed. ● Chemistry handling ○ Single salts ○ Batches ○ Samples Properties ● Biology assay handling ○ Aggregation at db level ● Highly customizable at database level ● Small - medium companies (currently in ~10 companies)
  • 17. Project Examples ● Customized project incubator ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized solutions: thin clients, integration of existing and in-house tech ● Spotfire integration (GSK) ● Customized developer and end-user training ● Large scale global project management
  • 18. Large Scale PM for Global Pharma ● GSK: IJC rollout as a global reporting tool, assisted/trained, customization of IJC, additional admin software creation. (3+ yrs) ● BMS: IJC customization/development for global roll-out needs, training, migration of data, consulting on data-mart integration. Thin client development. (2+ yrs)
  • 19. Project Examples ● Customized project incubator ● Migrations: database migration, PM of customized development, form and data-access design, customized training. ● Customized solutions: thin clients, integration of existing and in-house tech ● Spotfire integration (GSK) ● Customized components of large initiatives (OIDD, IMI) ● Customized developer and end-user training ● Large scale global project management
  • 20. European Lead Factory is part of IMI ● Innovative Medicines Initiative ● http://www.imi.europa.eu/ IMI supports collaborative research projects and builds networks of industrial and academic experts in order to boost pharmaceutical innovation in Europe. Part of Framework 7 Approx 25 projects covering many aspects of healthcare, also including: ● eTox ● Open PHACTS
  • 21. European Lead Factory EFPIA members AZ Bayer Lundbeck Janssen Merck KGaA Sanofi UCB SME members BioAscent ChemAxon Edelris GABO:mi Lead Discovery Center Merachem Pivot Park Screening Centre Sygnature Discovery Syncom Taros Chemicals TI Pharma Academic members University of Dundee Leiden University Max Planck Institute of Molecular Physiology Netherlands Cancer Institute Radbound University, Nijmegen University of Groningen Technical University of Denmark University of Duisburg-Essen University of Leeds University of Nottingham VU University Amsterdam An alliance of participants collaborating on identifying novel leads
  • 22. ELF philosophy Novel Leads Novel Compounds 300,000 from EFPIA members internal collections 200,000 from newly synthesized libraries Novel Screens
  • 23. 13 Work packages 1. Programme Recruitment HTS 2. Compound Logistics 3. Assay Development 4. High Throughput Screening 5. Hit Characterisation 6. Medicinal Chemistry 7. Publication Policy / External Relations 8. Information Technology 9. Sourcing chemical library proposals 10. Review/selection of chemical library proposals 11. Experimental Validation 12. Library Production 13. Project Management
  • 24. Library generation workflow Crowdsourcing Consortium participant Library selection committee Assessed on: 1. Molecular properties 2. Structural features 3. Novelty 4. Diversity potential 5. Synthetic tractability 6. Innovative design Experimental validation Design optimisation Library production
  • 25. Architecture 2 x Intel(R) Xeon(R) CPU E5-2620 v2 @ 2.10GHz (2 x 6 core + HT) 64GB RAM Tomcat 7 MySQLFilesystem Web App Web Browser Workflow engine Job queuing system Document store Report generation Automated email notification Role management Dynamically generated UI
  • 26. Workflows and Roles Three workflows for different types of user: 1. Consortium member 2. Non-EU/EAA citizen 3. EU/EAA citizen Supported by workflow engine Different levels of access 1. Submitter 2. LSC member 3. LSC Chair 4. …. UI dynamically generated to ensure that you only see what you’re supposed to see and only do what you’re supposed to do
  • 27. Step 1: registration Users register themselves and provide personal and affiliation details
  • 28. Step 2: enter library Define your library in one of three ways: 1. Upload SDF 2. Upload MRV with Markush library 3. Pick R-groups from within pre-defined lists and Markush Enumeration is used to generate the library Sketch scaffold with Marvin JS Give it a title Provide accompanying info including rationale and synthesis validation info
  • 29. Step 3: initial property calculations SSS for scaffold against reference set of 12M structures using JChem Search Exact match search for enumerated structures against reference set of 12M structures Property distributions of enumerated structures. Properties generated with Chemical Terms expressions with Calculator Plugins Checking enumerated structures against UK MDA legislation database from Compliance Checker Matching enumerated structures against sets of SMARTS filters provided by EFPIA members using Chemical Terms match() and matchCount() functions
  • 30. Step 4: submit library Further round of property calculations performed and library now visible to Library Selection Committee for consideration At this stage user has handed over ownership of the library idea to the European Lead Factory
  • 31. Step 5: final property calculations ECFP4 fingerprint comparison with 12M structures from reference set Fuzzy Pharmacophore fingerprint comparison with 12M structures from reference set Generate histogram of most similar structure in reference set for each compound in the library. ~1000 x 12M x 2 comparisons. “Old” molecular descriptor tables: ● For each search read descriptors from database -> very IO bound. ● Single threaded. ● -> days for completion “New” fast similarity search: ● Descriptors generated as binary file. ● Read into memory once ● Search fully multi-threaded. ● -> minutes for completion ● (1M x 1M completes in ~25 min)
  • 32. Step 6: assessment Approve Reject Refine Library Selection Committee
  • 33. Status summary "The Library Selection Committee has been using the web tool since the start of 2014 to support the selection of library proposals. Since then, well over 100 library proposals have been considered, around 80 of which have been approved for synthetic validation. The procedure for assessing and processing the proposals has been straightforward. The tool has allowed the proposals to be handled confidentially and to be assessed rigorously; it has saved huge time for both library proposal submitters and members of the Library Selection Committee." Adam Nelson Chair Library Selection Committee
  • 34. ChemAxon and Open Innovation
  • 35. Professional Services (BINGO!) ● Product development ● Product customization ● Product Integration ● Workflow design/management ● Project management ● Creative solutions