Consistent Stereochemical
search at GSK
Support for v3000 mol format.
Richard Bolton
Stephen Swanson
Abstract
• GSK has recently upgraded its ChemAxon small-molecule registration system so that end users can
mark up and register molecules with complex atom-based stereocentres without registrar
intervention. These structures are stored in the V3000 mol format. The more fully described
structures will now be pushed to the GSK Chemistry ODS and indexed using the latest JChem
cartridge. This in turn will allow the structure query web services and IJC projects to be upgraded
to give accurate and consistent behaviour across IJC clients and other web-service-powered
structure search tools when running complex stereochemical queries. This presentation discusses
the current issues and how the move to the V3000 mol format resolves them. A timeline shows
when GSK will have moved to V3000 mol as its standard for the full representation of small
molecules; this solution also aligns with the future requirements of European legislation on
fully describing medicinal products.
19 May 2015, ChemAxon UGM Budapest
Agenda
1. Small molecule registration at GSK. Full atom mark-up and end user
registration checker. A status update.
2. Downstream inconsistency in structure search
3. Plans to use v3000 mol format as GSK’s new standard and how that fixes the
problems.
4. The work that needs doing and the timeline
5. Where next: IJC Browser and IDMP.
Small Molecule Registration. A reminder.
• Small Molecule Registration collaboration project with ChemAxon started in
March 2011 and went to production in mid 2012.
• Simplification of small molecule registration
– A Novel Approach to Pharmaceutical Registration: Charlie Wilkins, San Diego 2011
– Going live with registration as a service: Richard Bolton/Akos Papp, Budapest 2012
– Registration as a Service: the full story after half a year: Rama Bhamidipati/Akos Papp, Boston 2012
• End user tools to check registration were rolled out in mid 2014
Current Registration Process
[Flow diagram: current registration process]
• Submitters: Chemist, Contract Chemist, CCE Chemist, Other? Entry points: eLNB or WebReg.
• The chemist or contract coordinator enters the compounds to be registered and checks registerability before submitting; optionally, they request assistance from the registrar.
• Business Rules: compounds that are fully described with regard to stereochemistry and racemic content (fully specified compounds) are automatically registered in Registry.
• Staging: compounds with undefined attributes are sent to the registrars. Using the Registrar Tools, registrars modify compounds for consistency, confirm with the chemist, then register.
• Purchased compounds: Compound Collection Enhancement (CCE) purchases compounds and receives data files (SD Files) with structure information. CCE sends the SD File to Registry as received, and the compounds are registered as new entities (compound names CCE 001, CCE 002, CCE 003).
Small Molecule registration statistics
• Between April 2014 and April 2015
– 175K compounds registered (end-user singleton and bulk registration)
– 50K end-user singletons
– 24K with chiral centers
– 9.5K with 2-9 chiral centers
– 700 with >=10 chiral centers (more likely to be peptides or DNA, soon to go to our new Biological Registration system)
• So far in 2015
– Singletons autoregistered 78%
– Singletons registered manually by the submitting scientist (non-registrar manual) 8%
– Singletons registered manually by registrars 14%
Side-effects of full atom mark-up.
• Although the compound registration system now contains full atom mark-up,
downstream systems still rely on V2000 mol format copies and/or standard
SMILES strings.
• Structures can be overlaid with ‘pictures’ of the mark-up so users can tell
‘identical’ structures apart, but these are not query features.
• Running a query in IJC or using the GSK structure search web service can give
different results for molecules containing relative stereochemistry.
• Scientists agreed that this inconsistency needed to be resolved.
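The gap between the two formats can be made concrete. A V2000 molfile carries only a single molecule-level chiral flag, whereas a V3000 molfile can record per-atom enhanced stereo groups in a collection block (MDLV30/STEABS for absolute, MDLV30/STERACn for "and" groups, MDLV30/STERELn for "or" groups). The sketch below is a minimal illustration only, not GSK or ChemAxon code: the example molecule, the function name and the "and1"-style labels are assumptions, and V3000 line continuations are not handled.

```python
import re

# Matches enhanced-stereo collection lines such as
#   M  V30 MDLV30/STERAC1 ATOMS=(1 2)
_STEREO_LINE = re.compile(
    r"^M  V30 MDLV30/STE(ABS|RAC|REL)(\d*)\s+ATOMS=\(([\d\s]+)\)"
)

# Illustrative labels (assumption): abs, and<n>, or<n>
_LABELS = {"ABS": "abs", "RAC": "and", "REL": "or"}

# Hypothetical example molecule with one stereocentre (atom 2) in an "and1" group.
EXAMPLE_V3000 = """\

  hypothetical example

  0  0  0     0  0            999 V3000
M  V30 BEGIN CTAB
M  V30 COUNTS 5 4 0 0 0
M  V30 BEGIN ATOM
M  V30 1 C 0 0 0 0
M  V30 2 C 1.3 0.75 0 0 CFG=1
M  V30 3 O 1.3 2.25 0 0
M  V30 4 C 2.6 0 0 0
M  V30 5 C 3.9 0.75 0 0
M  V30 END ATOM
M  V30 BEGIN BOND
M  V30 1 1 1 2
M  V30 2 1 2 3 CFG=1
M  V30 3 1 2 4
M  V30 4 1 4 5
M  V30 END BOND
M  V30 BEGIN COLLECTION
M  V30 MDLV30/STERAC1 ATOMS=(1 2)
M  V30 END COLLECTION
M  V30 END CTAB
M  END
"""


def enhanced_stereo_groups(molblock):
    """Return {label: [atom indices]} for each enhanced stereo collection line."""
    groups = {}
    for line in molblock.splitlines():
        match = _STEREO_LINE.match(line.rstrip())
        if not match:
            continue
        kind, number, body = match.groups()
        # The first number inside the parentheses is the atom count.
        count, *atoms = (int(token) for token in body.split())
        if count != len(atoms):
            raise ValueError(f"atom count mismatch in line: {line!r}")
        groups[_LABELS[kind] + number] = atoms
    return groups


print(enhanced_stereo_groups(EXAMPLE_V3000))  # {'and1': [2]}
```

A V2000 copy or a standard SMILES string of the same molecule has nowhere to carry the "and1" group, which is why a downstream system relying on those formats cannot use the mark-up as a query feature.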
Examples 1: Generic
Problem 1: Searching With Unspecified Stereochemistry
[Figure: a query structure with unspecified stereochemistry returns 4921 compounds in IJC and 4917 compounds in Helium.]
Note: the 4917-compound set may not be a subset of the 4921 compounds found in IJC. Depending on
what is entered, it may contain other structures. It takes a long time to determine which hits
actually contain what you were looking for.
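Checking whether one hit list is a subset of the other is a simple set comparison once both lists are exported. The sketch below is illustrative only: the function name and the compound identifiers are hypothetical, and the two short lists stand in for the real IJC and Helium exports.

```python
def compare_hit_lists(hits_a, hits_b):
    """Split two hit lists into shared hits and hits unique to each source."""
    a, b = set(hits_a), set(hits_b)
    return {"common": a & b, "only_in_a": a - b, "only_in_b": b - a}


# Hypothetical identifiers standing in for exported hit lists.
ijc_hits = ["ID-1", "ID-2", "ID-3", "ID-4"]
helium_hits = ["ID-2", "ID-3", "ID-5"]

diff = compare_hit_lists(ijc_hits, helium_hits)
print(sorted(diff["only_in_a"]))          # ['ID-1', 'ID-4']
print(sorted(diff["only_in_b"]))          # ['ID-5']
print(set(helium_hits) <= set(ijc_hits))  # False: the smaller set is not a subset
```

A smaller hit count therefore says nothing by itself about containment; only the per-compound difference shows which tool returned what.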
Problem 2: Searching With Specified Stereochemistry
[Figure: a query structure with specified stereochemistry, run in Helium and in IJC. IJC returns 4921 compounds: all the structures with the unspecified stereochemistry, not the specific And1 designations!]
If the user is more specific in the search, IJC ignores the input (it searches with unspecified
stereochemistry) and Helium simply returns an error, leaving the scientist unsure of the results.
Problem 3: Data Integrity
[Figure: in Helium, Get GSK ID followed by Get Structure for GSK 12345A. This compound number defines a non-specific structure!]
Problem 3: Potential data integrity issue when converting structure to ID to structure
[Figure: in Helium, Structure X (absolute stereo) is converted with Get GSK ID to GSK 12345A; Get Structure on GSK 12345A then returns Structure Y (relative stereo). This compound number defines a non-specific structure!]
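The structure-to-ID-to-structure problem amounts to a failed round trip. The sketch below is a toy model under stated assumptions: the two dictionaries are hypothetical stand-ins for the Get GSK ID and Get Structure look-ups, and structures are represented by plain labels rather than real molfiles.

```python
def round_trip(structure, get_id, get_structure):
    """Convert structure -> ID -> structure and report whether it survived."""
    compound_id = get_id(structure)
    returned = get_structure(compound_id)
    return compound_id, returned, returned == structure


# Hypothetical stand-ins: both structures resolve to the same compound number,
# but the number resolves back to only one of them.
STRUCTURE_X = "structure X (absolute stereo)"
STRUCTURE_Y = "structure Y (relative stereo)"
ID_BY_STRUCTURE = {STRUCTURE_X: "GSK 12345A", STRUCTURE_Y: "GSK 12345A"}
STRUCTURE_BY_ID = {"GSK 12345A": STRUCTURE_Y}

result = round_trip(STRUCTURE_X, ID_BY_STRUCTURE.get, STRUCTURE_BY_ID.get)
print(result)  # ('GSK 12345A', 'structure Y (relative stereo)', False)
```

Running a check of this shape over a sample of registered structures is one way to measure how often the round trip changes the stereo description.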
Examples 2: More specific
Isomer Synthesis
A: 4 isomers (R,R; R,S; S,R; S,S)
B: 2 isomers (R,R; R,S)
C: 2 isomers (R,R; S,R)
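The isomer counts on this slide follow from simple enumeration: each undefined stereocentre doubles the number of possibilities, and fixing a centre removes that factor. The sketch below is a minimal illustration; the function name is hypothetical, and it deliberately ignores symmetry (meso forms), so it counts label combinations rather than distinct compounds.

```python
from itertools import product


def stereoisomers(n_centres, fixed=None):
    """List R/S label combinations; `fixed` maps a centre index to 'R' or 'S'."""
    fixed = fixed or {}
    choices = [
        [fixed[i]] if i in fixed else ["R", "S"]
        for i in range(n_centres)
    ]
    return [",".join(combo) for combo in product(*choices)]


print(stereoisomers(2))            # ['R,R', 'R,S', 'S,R', 'S,S']
print(stereoisomers(2, {0: "R"}))  # ['R,R', 'R,S']
print(stereoisomers(2, {1: "R"}))  # ['R,R', 'S,R']
```

The three calls reproduce the 4-isomer and the two 2-isomer sets listed above.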
4 Diastereomers – Registration (A)
Racemic starting materials + Isomer Separation
4 Diastereomers – Registration (C)
From (R) Amine + Isomer Separation
From (S) Amine + Isomer Separation
Substructure Searching Outcomes
Did we make this structure?
Did we include all relevant
structures in the patent?
Is this structure part of
an Alliance agreement?
Can I trust my SSS results?
Target and proposal
Target
• Ensure consistent and correct search results irrespective of the application being
used.
Proposal
• Cascade the fully described v3000 molfile that is stored in Registry to both IJC
and Chemistry Web Services.
What needs changing to achieve this?
• Upgrade the CODS and ACD databases from Oracle 10g to Oracle 12c and ensure they contain the V3000 mol
and index
• Update the JChem Cartridge in Chemistry ODS (CODS) and the Available Chemical Directory (ACD) to enable
full stereochemical searching
• Reconfigure IJC to search across and return from the new structure field (ChemAxon-provided script)
• Add methods to both the Structure Search and ID Lookup web services that search across and return V3000
results
• Maintain the existing V2000 methods and results to ensure that other downstream applications are not
impacted
• Upgrade Helium v3 to v4 and configure it to search across and return the V3000 results
• Ensure that Helium and IJC return identical results for identical V3000 queries
• Plan remediation of the other 34 apps dependent on the web services
Timeline
[Gantt chart, May to January]
• Milestones: Project Approval, Project Release, Warranty, Project Close
• Environments: Sandbox, Dev (TST/INT), Prd
• Workstreams (numbered 1 to 6 on the chart): JChem upgrade in CODS & ACD; Helium v4, then He 4 + V3000; UKxxx605 Oracle 12c upgrade & testing; Resolve Registry Issue; IJC & CL/SS; Migrate Chemistry Webservices to Java 8
What next...
IDMP requirements
– The full description of all materials in a medicinal product will be required by mid 2016.
– V3000 mol is a format that can fulfil that requirement.
IJC Browser
– ChemAxon are working inside the GSK environment to tune IJC Browser.
– Plans to move to Browser this year if performance is acceptable in our complex environment.
Thank you for your attention
Questions?

 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...Mudde &  Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
Mudde & Rovira Kaltwasser. - Populism in Europe and the Americas - Threat Or...
 
Anemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptxAnemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptx
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 

EUGM15 - Richard Bolton (GlaxoSmithKline): Consistent Stereochemical search at GlaxoSmithKline

  • 1. Consistent Stereochemical search at GSK: Support for v3000 mol format. Richard Bolton, Stephen Swanson
  • 2. Abstract: GSK has recently upgraded its ChemAxon small molecule registration system so that end users can mark up and register molecules with complex atom-based stereo centres with no registrar intervention. These are stored using the V3000 mol format. These more fully described structures will now be pushed to the GSK Chemistry ODS and indexed using the latest JChem cartridge. This in turn will allow the structure query web services and IJC projects to be upgraded to give accurate and consistent behaviour across IJC clients and other web-service-powered structure search tools when running complex stereochemical queries. This presentation will discuss the current issues and how the move to v3000 mol format resolves them. A timeline will be shared showing when GSK will have moved to v3000 mol as its standard for full representation of small molecules; this solution will also align with the future requirements of European legislation on fully describing medicinal products. 19 May 2015, ChemAxon UGM Budapest
  • 3. Agenda
      1. Small molecule registration at GSK: full atom mark-up and end-user registration checker. A status update.
      2. Downstream inconsistency in structure search.
      3. Plans to use v3000 mol format as GSK’s new standard, and how that fixes the problems.
      4. The work that needs doing and the timeline.
      5. Where next: IJC Browser and IDMP.
  • 4. Small Molecule Registration. A reminder.
      • Small Molecule Registration collaboration project with ChemAxon started in March 2011 and went to production in mid-2012.
      • Simplification of small molecule registration
          – A novel Approach to Pharmaceutical Registration: Charlie Wilkins, San Diego 2011
          – Going live with registration as a service: Richard Bolton/Akos Papp, Budapest 2012
          – Registration as a Service: the full story after half a year. Rama Bhamidipati/Akos Papp, Boston 2012
      • End-user tools to check registration were rolled out in mid-2014.
  • 5. Current Registration Process (flow diagram; recoverable labels only)
      • Submitters: Chemist, Contract Chemist, CCE Chemist (Compound Collection Enhancement), Other. Entry points: eLNB or WebReg, SD files.
      • Purchased compounds: Compound Collection Enhancement purchase compounds and receive data files with structure information; they send the SD file to Registry as received, where the compounds are registered as new entities, and can optionally request assistance from the registrar.
      • Chemist or contract coordinator enter the compounds to be registered and check registerability before submitting.
      • Business rules and staging: compounds that are fully described with regard to stereochemistry and racemic content are registered automatically.
      • Compounds with undefined attributes are sent to the registrars (Registrar Tools).
      • Registrars modify compounds for consistency, confirm with the chemist, then register.
  • 6. Small Molecule registration statistics
      • Between April 2014 and April 2015
          – 175K compounds registered (end-user singleton and bulk registration)
          – 50K end-user singletons
          – 24K with chiral centers
          – 9.5K with 2-9 chiral centers
          – 700 with >=10 chiral centers (more likely to be peptides or DNA, soon to go to our new Biological Registration system)
      • So far in 2015
          – Singletons autoregistered: 78%
          – Singletons registered manually by the submitting scientist (non-registrar manual): 8%
          – Singletons registered manually by registrars: 14%
  • 7. Side-effects of full atom mark-up.
      • Although the compound registration system now contains full atom mark-up, downstream systems still rely on v2000 mol format copies and/or standard SMILES strings.
      • Structures can be overlaid with ‘pictures’ of the mark-up so users can tell ‘identical’ structures apart, but these are not query features.
      • Running a query in IJC or using the GSK structure search web service can give different results for molecules containing relative stereochemistry.
      • Scientists agreed that this inconsistency needed to be resolved.
  • 9. Problem 1: Searching With Unspecified Stereochemistry
      • Query structure with unspecified stereochemistry: Helium returns 4917 compounds, IJC returns 4921 compounds.
      • Note the 4917 compound set may not be a subset of the 4921 found in IJC. It may contain other structures based upon what is entered. It takes a long time to determine which hits actually contain what you were looking for.
  • 10. Problem 2: Searching With Specified Stereochemistry
      • Query structure with specified stereochemistry: IJC returns 4921 compounds, that is, all the structures with unspecified stereochemistry, not the specific And1 designations!
      • If the user is more specific in the search, IJC ignores the input (it searches via unspecified stereochemistry) and Helium just returns an error, leaving the scientist unsure of the results.
  • 11. Problem 3: Data Integrity. Helium round trip: Get GSK ID (GSK 12345A), then Get Structure (GSK 12345A). This compound number defines a non-specific structure!
  • 12. Problem 3: Potential Data Integrity, converting structure to ID to structure. Helium round trip: Get GSK ID (GSK 12345A), then Get Structure (GSK 12345A). Structure X: absolute stereo. Structure Y: relative stereo. This compound number defines a non-specific structure!
  • 13. Examples 2: More specific
  • 14. Isomer Synthesis. A: 4 isomers (R,R; R,S; S,R; S,S). B: 2 isomers (R,R; R,S). C: 2 isomers (R,R; S,R).
  • 15. 4 Diastereomers: Registration (A). Racemic starting materials + isomer separation.
  • 16. 4 Diastereomers: Registration (C). From (R) amine + isomer separation; from (S) amine + isomer separation.
  • 17. Substructure Searching Outcomes
      • Did we make this structure?
      • Did we include all relevant structures in the patent?
      • Is this structure part of an Alliance agreement?
      • Can I trust my SSS results?
  • 18. Target and proposal
      • Target: ensure consistent and correct search results irrespective of the application being used.
      • Proposal: cascade the fully described v3000 molfile that is stored in Registry to both IJC and Chemistry Web Services.
  • 19. What needs changing to achieve this?
      • Upgrade the CODS and ACD databases from Oracle 10g to Oracle 12c and ensure they contain the v3000 mol and index.
      • Update the JChem Cartridge in the Chemistry ODS (CODS) and the Available Chemical Directory (ACD) to enable full stereochemical searching.
      • Reconfigure IJC to search across and return from the new structure field (ChemAxon-provided script).
      • Add methods to both the Structure Search and ID Lookup web services that search across and return V3000 results.
      • Maintain the existing V2000 methods and results to ensure that other downstream applications are not impacted.
      • Upgrade Helium v3 to v4 and configure it to search across and return the V3000 results.
      • Ensure that Helium and IJC return identical results for identical V3000 queries.
      • Plan remediation of the other 34 apps dependent on the web services.
  • 20. Timeline (Gantt chart, May to Jan; bar positions not recoverable). Items shown: Project Approval; JChem upgrade in CODS & ACD; Oracle 12c upgrade & testing (UKxxx605); Resolve Registry Issue; IJC & CL/SS; Helium v4 (He 4 + V3000); Migrate Chemistry Webservices to Java 8; Project Release; Warranty; Project Close. Environments: Sandbox, Dev (TST/INT), Prd.
  • 21. What next...
      • IDMP requirements
          – The full description of all materials in a medicinal product will be required by mid-2016.
          – V3000 mol is a format that can fulfil that requirement.
      • IJC Browser
          – ChemAxon are working inside the GSK environment to tune IJC Browser.
          – Plans to move to Browser this year if performance is acceptable in our complex environment.
  • 22. Thank you for your attention. Questions?
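
The "And1" designations in slide 10 and the v3000 molfile proposed in slide 18 can be illustrated with a small sketch. In the published V3000 CTfile format, enhanced stereochemistry is carried in a collection block as `MDLV30/STEABS` (absolute), `MDLV30/STERACn` ("and" groups) and `MDLV30/STERELn` ("or" groups); a V2000 molfile has no equivalent block, which is one way to see why the downstream V2000 copies cannot tell such structures apart. The example below is a minimal illustration, not GSK's or ChemAxon's implementation: the molfile fragment, the atom indices and the function name `enhanced_stereo_groups` are invented for the example, and continuation lines are not handled.

```python
import re

# Hypothetical fragment of a V3000 molfile: only the collection block is shown.
# The atom indices are invented for illustration.
V3000_COLLECTION = """\
M  V30 BEGIN COLLECTION
M  V30 MDLV30/STEABS ATOMS=(1 7)
M  V30 MDLV30/STERAC1 ATOMS=(2 2 5)
M  V30 MDLV30/STEREL1 ATOMS=(1 9)
M  V30 END COLLECTION
"""

# Matches e.g. "MDLV30/STERAC1 ATOMS=(2 2 5)":
# stereo kind, optional group number, then "count idx idx ...".
_STEREO_LINE = re.compile(
    r"MDLV30/(STEABS|STERAC|STEREL)(\d*)\s+ATOMS=\(([\d\s]+)\)"
)

# Short labels as they usually appear in structure drawings.
_LABELS = {"STEABS": "abs", "STERAC": "and", "STEREL": "or"}


def enhanced_stereo_groups(molblock):
    """Return (label, atom_indices) pairs found in a V3000 collection block.

    Simplified sketch: lines continued with a trailing '-' are not handled.
    """
    groups = []
    for line in molblock.splitlines():
        match = _STEREO_LINE.search(line)
        if match is None:
            continue
        kind, number, atoms = match.groups()
        # The first integer in the list is the number of atoms that follow.
        count, *indices = (int(token) for token in atoms.split())
        if count != len(indices):
            raise ValueError("atom count mismatch in line: " + line)
        groups.append((_LABELS[kind] + number, indices))
    return groups


if __name__ == "__main__":
    for label, atoms in enhanced_stereo_groups(V3000_COLLECTION):
        print(label, atoms)
```

A query tool that only ever sees a V2000 copy would get an empty list from a check like this, so an And1 query and an unspecified query become indistinguishable to it, which matches the behaviour described in Problems 1 and 2.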