SlideShare a Scribd company logo
Making workflows Work
Enterprise KNIME deployment at Lilly
James A. Lumley (Research IT UK)
ChemAxon UGM Budapest 2014
Making workflows Work!
1. Why KNIME?
2. Old meets New
3. Don’t mention structures
4. Better conversions
20/05/2014
Making workflows Work!
1. Why KNIME?
20/05/2014
KNIME@Lilly ‘Freemium’ turned ‘Premium’
• 2010: Strong usage including opensource contributions
by Mike Bodkins UK CompChem group
• 2012: Research IT consolidated workflow tools via
KNIME.com Enterprise license and built an infrastructure
to develop and deploy the tool globally
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 6
Java/Eclipse platform allows easy
creation of custom extensions
(including security model)
Server helps drive Sci/IT collab
and knowledge capture
Integration with existing legacy
systems & data (esp. via SOA)
Strong precedence for
Workflow Software in Pharma
Infrastructure to support the deployment
Company Confidential © 2014 Eli Lilly and Company
OpenSource Nodes:
+
• Due a ‘refresh’
• Chemaxon dependency in many
nodes including:
• Chemical structure handling
(conversions), sketcher (Marvin),
Molecule Difference check (testing)
and rendering (views)
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 8
Example Lilly Node
using Chemaxon:
• Multi-molecule sketcher
extension based on Marvin
• Configure to sketch and edit
multiple structures or reactions
• Output multiple structures
(port_0) or reactions (port_1)
on node execution
• Internally reuse code for
sketcher applet in webportal
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 9
2 years on, significant usage*:
• CompChem/MedChem, ADME Reporting, Analytical
Technologies Automation, Sample Management,
Automating Data ETL & Data Exploration…
http://www.knime.com/files/004_kuduk2013-jamesalumley-lilly.pdf
http://www.knime.com/knime-user-day-uk-2013-news
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 10
*
Making workflows Work!
2. Old Meets New:
KNIME working alongside legacy systems
Many nodes link legacy systems:
1. Retain ‘trusted’ status of internal data access tools
(e.g.: internal system for integrated data access,
Mobius)*
2. Retain power of in house legacy predictive modelling
code e.g.: SVM models unix code
3. Interface with new systems e.g.: AT Structure
Verification tools
50% of >100 internal nodes use SOA or similar to
serve analytics tools and data to KNIME
http://www.triconference.com/11/ird
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
*
Making workflows Work!
3. Don’t Mention Structures:
Getting KNIME to work with different data
security models
20/05/2014 Company Confidential © 2014 Eli Lilly and Company 14
Huge reliance on SOA to provide Tools and Data to KNIME:
+ moves data security issues to web service layer
+ reduces CPU load on ‘office’ laptops
- Services needs constant monitoring
- Large work effort adding NTLM Auth to Webservice nodes
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 15
• In application support page/tab
• Status of Webservices (separates node errors from service layer errors)
• Links to Webpages
• Known Bugs/Issues from Redmine
Making workflows Work!
4. Better conversions
Ensuring good interplay between the many
chemical data types in KNIME without users feeling
the pain
• Converter nodes in top 20 most commonly used nodes in analysis
of >2000 workflows on Lilly KNIME server
• Some workflows contain around 50% converter nodes
• New users confused by multiple molecule types and conversions
(Analysis from Summer 2012)
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
Lilly Matched Pairs node
requires RDKit type
Internal unix code
(service layer) requires
Smiles value
Property calculator
needs CDK type
Internal data
retrieval system
serves data and
molecule as chime
type
converter
converter
converter converter
• Different Chemical
Types don’t work well
together
• Users constantly
converting chemical
data from one ‘type’ to
another
• Worse for Lilly nodes
that utilise many formats
with no ‘standard’
vendor like
representation
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
Aim:
• Remove need for user to manually add chemical converter nodes
• Ensure Nodes that use different Chemical formats to work together
better
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
KNIME.com introduced “Adaptor Cell” in 2.9
• Container with several representations of same entity
• Node can add additional representations that can be re-used by
downstream nodes
• Avoids multiple conversions
• Original representation still present
• Vendor Specific! No pseudo standards such as SDF
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 21
SDF  RDKit  CDK  Indigo
Lilly Solution for (Pseudo) standards:
• Extension point for handling Molecule Type conversions
• Depends on Marvin library for Molecule conversions
+
• In development!
• Will be released opensource
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 22
Before
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 23
• Extension point moves conversions into Node configuration
• Workflow still documents explicit type conversions
• Still retains support for Converter nodes if/when appropriate
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 24
5/20/2014 Company Confidential © 2014 Eli Lilly and Company 25
Converters could be ‘chained’ if
direct conversion not available
(e.g.: InChI or Chime). Example
shown in dialogue:
Before
After
Requires SMILES
Making Workflows Work:
• Added many legacy tools and data services into
KNIME via custom nodes and SOA
• Aided usability by adding dashboard for service layer
monitoring
• Added authentication handling via NTLM Auth to
provide data authentication at source
• Adding molecule handling framework to reduce
number of molecule conversions users need
20/05/2014 Company Confidential © 2014 Eli Lilly and Company
Acknowledgements
Java Coding & Infrastructure (Lilly):
Luke Bullard, Tom Wilkin
Project Management, End User support, Expert Users (& Testers), Previous Developers etc.:
Derek Marren, Marnie Williams, Pip Turner, Matt Hirst, Dave Thorner, Dave Evans, Mike Bodkin,
Niko Fechner, Roger Robinson, Jibo Wang, Christos Nicolaou, Beth Wright, Gary Sharman,
Simon Richards, Stuart Morton, Jason Ochoada, Jim Hughes
(In no particular order!)
KNIME.com
Bernd, Thorsten, Thomas, Aaron ++

More Related Content

Similar to EUGM 2014 - James Lumley (Eli Lilly and Co.): Making Workflows Work: Enterprise deployment of KNIME at Lilly

DevOps and APIs: Great Alone, Better Together
DevOps and APIs: Great Alone, Better Together DevOps and APIs: Great Alone, Better Together
DevOps and APIs: Great Alone, Better Together
MuleSoft
 
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons LearnedCloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
VMware Tanzu
 
CloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
CloudExpo NY 2014: Moving Mission Critical Applications to the CloudCloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
CloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
Kacy Clarke
 
150601 gartner cloud_summit_vfinal
150601 gartner cloud_summit_vfinal150601 gartner cloud_summit_vfinal
150601 gartner cloud_summit_vfinal
Michael Burian
 
Bridging the Cloud Sign-On Gap
Bridging the Cloud Sign-On GapBridging the Cloud Sign-On Gap
Bridging the Cloud Sign-On Gap
OracleIDM
 
Data Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part FourData Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part Four
Hans Hultgren
 
Kovair Product Capabilities – An Overview
Kovair Product Capabilities – An OverviewKovair Product Capabilities – An Overview
Kovair Product Capabilities – An Overview
Kovair
 
DellEMC Forum NYC - DevOps and Digital Trans vPublic
DellEMC Forum NYC - DevOps and Digital Trans vPublicDellEMC Forum NYC - DevOps and Digital Trans vPublic
DellEMC Forum NYC - DevOps and Digital Trans vPublic
Don Demcsak
 
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
VMworld
 
Cloud Foundry Technical Overview at IBM Interconnect 2016
Cloud Foundry Technical Overview at IBM Interconnect 2016Cloud Foundry Technical Overview at IBM Interconnect 2016
Cloud Foundry Technical Overview at IBM Interconnect 2016
Stormy Peters
 
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
apidays
 
Building a DevOps Team that Isn't Evil
Building a DevOps Team that Isn't EvilBuilding a DevOps Team that Isn't Evil
Building a DevOps Team that Isn't Evil
IBM UrbanCode Products
 
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
VMware Tanzu
 
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
Amazon Web Services
 
Mobility and federation of Cloud computing
Mobility and federation of Cloud computingMobility and federation of Cloud computing
Mobility and federation of Cloud computing
David Wallom
 
DevOps Culture & Enablement with Postgres Plus Cloud Database
DevOps Culture & Enablement with Postgres Plus Cloud DatabaseDevOps Culture & Enablement with Postgres Plus Cloud Database
DevOps Culture & Enablement with Postgres Plus Cloud Database
EDB
 
Intel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data SuccessIntel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data Success
Cloudera, Inc.
 
Proven Practices for Office 365 Deployment, Security and Management
Proven Practices for Office 365 Deployment, Security and ManagementProven Practices for Office 365 Deployment, Security and Management
Proven Practices for Office 365 Deployment, Security and Management
Perficient, Inc.
 
Container Technologies and Transformational value
Container Technologies and Transformational valueContainer Technologies and Transformational value
Container Technologies and Transformational value
Mihai Criveti
 
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
VMware Tanzu
 

Similar to EUGM 2014 - James Lumley (Eli Lilly and Co.): Making Workflows Work: Enterprise deployment of KNIME at Lilly (20)

DevOps and APIs: Great Alone, Better Together
DevOps and APIs: Great Alone, Better Together DevOps and APIs: Great Alone, Better Together
DevOps and APIs: Great Alone, Better Together
 
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons LearnedCloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
Cloud Foundry Summit 2015: A Year of Innovation: Cloud Foundry Lessons Learned
 
CloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
CloudExpo NY 2014: Moving Mission Critical Applications to the CloudCloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
CloudExpo NY 2014: Moving Mission Critical Applications to the Cloud
 
150601 gartner cloud_summit_vfinal
150601 gartner cloud_summit_vfinal150601 gartner cloud_summit_vfinal
150601 gartner cloud_summit_vfinal
 
Bridging the Cloud Sign-On Gap
Bridging the Cloud Sign-On GapBridging the Cloud Sign-On Gap
Bridging the Cloud Sign-On Gap
 
Data Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part FourData Vault ReConnect Speed Presenting PM Part Four
Data Vault ReConnect Speed Presenting PM Part Four
 
Kovair Product Capabilities – An Overview
Kovair Product Capabilities – An OverviewKovair Product Capabilities – An Overview
Kovair Product Capabilities – An Overview
 
DellEMC Forum NYC - DevOps and Digital Trans vPublic
DellEMC Forum NYC - DevOps and Digital Trans vPublicDellEMC Forum NYC - DevOps and Digital Trans vPublic
DellEMC Forum NYC - DevOps and Digital Trans vPublic
 
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
VMworld 2013: How to Build a Hybrid Cloud in Less than a Day
 
Cloud Foundry Technical Overview at IBM Interconnect 2016
Cloud Foundry Technical Overview at IBM Interconnect 2016Cloud Foundry Technical Overview at IBM Interconnect 2016
Cloud Foundry Technical Overview at IBM Interconnect 2016
 
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
apidays LIVE Paris 2021 - APIs - How did we get here and where are we going n...
 
Building a DevOps Team that Isn't Evil
Building a DevOps Team that Isn't EvilBuilding a DevOps Team that Isn't Evil
Building a DevOps Team that Isn't Evil
 
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
Keynote: Architecting for Continuous Delivery (Pivotal Cloud Platform Roadshow)
 
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
Application Darwinism: Why Most Enterprise Apps Will Move to the Cloud (SVC20...
 
Mobility and federation of Cloud computing
Mobility and federation of Cloud computingMobility and federation of Cloud computing
Mobility and federation of Cloud computing
 
DevOps Culture & Enablement with Postgres Plus Cloud Database
DevOps Culture & Enablement with Postgres Plus Cloud DatabaseDevOps Culture & Enablement with Postgres Plus Cloud Database
DevOps Culture & Enablement with Postgres Plus Cloud Database
 
Intel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data SuccessIntel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data Success
 
Proven Practices for Office 365 Deployment, Security and Management
Proven Practices for Office 365 Deployment, Security and ManagementProven Practices for Office 365 Deployment, Security and Management
Proven Practices for Office 365 Deployment, Security and Management
 
Container Technologies and Transformational value
Container Technologies and Transformational valueContainer Technologies and Transformational value
Container Technologies and Transformational value
 
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
Unlock your VMWare Investment with Pivotal Cloud Foundry (VMworld 2014)
 

More from ChemAxon

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
ChemAxon
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
ChemAxon
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
ChemAxon
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
ChemAxon
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
ChemAxon
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
ChemAxon
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
ChemAxon
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
ChemAxon
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
ChemAxon
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
ChemAxon
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
ChemAxon
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
ChemAxon
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
ChemAxon
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
ChemAxon
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
ChemAxon
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
ChemAxon
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
ChemAxon
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
ChemAxon
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
ChemAxon
 

More from ChemAxon (20)

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 

Recently uploaded

Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
Pedro J. Molina
 
Alluxio Webinar | 10x Faster Trino Queries on Your Data Platform
Alluxio Webinar | 10x Faster Trino Queries on Your Data PlatformAlluxio Webinar | 10x Faster Trino Queries on Your Data Platform
Alluxio Webinar | 10x Faster Trino Queries on Your Data Platform
Alluxio, Inc.
 
Kubernetes at Scale: Going Multi-Cluster with Istio
Kubernetes at Scale:  Going Multi-Cluster  with IstioKubernetes at Scale:  Going Multi-Cluster  with Istio
Kubernetes at Scale: Going Multi-Cluster with Istio
Severalnines
 
42 Ways to Generate Real Estate Leads - Sellxpert
42 Ways to Generate Real Estate Leads - Sellxpert42 Ways to Generate Real Estate Leads - Sellxpert
42 Ways to Generate Real Estate Leads - Sellxpert
vaishalijagtap12
 
Using Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query PerformanceUsing Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query Performance
Grant Fritchey
 
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptxOperational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
sandeepmenon62
 
TMU毕业证书精仿办理
TMU毕业证书精仿办理TMU毕业证书精仿办理
TMU毕业证书精仿办理
aeeva
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
dakas1
 
Superpower Your Apache Kafka Applications Development with Complementary Open...
Superpower Your Apache Kafka Applications Development with Complementary Open...Superpower Your Apache Kafka Applications Development with Complementary Open...
Superpower Your Apache Kafka Applications Development with Complementary Open...
Paul Brebner
 
Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)
alowpalsadig
 
Liberarsi dai framework con i Web Component.pptx
Liberarsi dai framework con i Web Component.pptxLiberarsi dai framework con i Web Component.pptx
Liberarsi dai framework con i Web Component.pptx
Massimo Artizzu
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
kgyxske
 
DevOps Consulting Company | Hire DevOps Services
DevOps Consulting Company | Hire DevOps ServicesDevOps Consulting Company | Hire DevOps Services
DevOps Consulting Company | Hire DevOps Services
seospiralmantra
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
ToXSL Technologies
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
safelyiotech
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
kalichargn70th171
 
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
kalichargn70th171
 
Enhanced Screen Flows UI/UX using SLDS with Tom Kitt
Enhanced Screen Flows UI/UX using SLDS with Tom KittEnhanced Screen Flows UI/UX using SLDS with Tom Kitt
Enhanced Screen Flows UI/UX using SLDS with Tom Kitt
Peter Caitens
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
gapen1
 
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
Luigi Fugaro
 

Recently uploaded (20)

Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
 
Alluxio Webinar | 10x Faster Trino Queries on Your Data Platform
Alluxio Webinar | 10x Faster Trino Queries on Your Data PlatformAlluxio Webinar | 10x Faster Trino Queries on Your Data Platform
Alluxio Webinar | 10x Faster Trino Queries on Your Data Platform
 
Kubernetes at Scale: Going Multi-Cluster with Istio
Kubernetes at Scale:  Going Multi-Cluster  with IstioKubernetes at Scale:  Going Multi-Cluster  with Istio
Kubernetes at Scale: Going Multi-Cluster with Istio
 
42 Ways to Generate Real Estate Leads - Sellxpert
42 Ways to Generate Real Estate Leads - Sellxpert42 Ways to Generate Real Estate Leads - Sellxpert
42 Ways to Generate Real Estate Leads - Sellxpert
 
Using Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query PerformanceUsing Query Store in Azure PostgreSQL to Understand Query Performance
Using Query Store in Azure PostgreSQL to Understand Query Performance
 
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptxOperational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
Operational ease MuleSoft and Salesforce Service Cloud Solution v1.0.pptx
 
TMU毕业证书精仿办理
TMU毕业证书精仿办理TMU毕业证书精仿办理
TMU毕业证书精仿办理
 
一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理一比一原版(USF毕业证)旧金山大学毕业证如何办理
一比一原版(USF毕业证)旧金山大学毕业证如何办理
 
Superpower Your Apache Kafka Applications Development with Complementary Open...
Superpower Your Apache Kafka Applications Development with Complementary Open...Superpower Your Apache Kafka Applications Development with Complementary Open...
Superpower Your Apache Kafka Applications Development with Complementary Open...
 
Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)Photoshop Tutorial for Beginners (2024 Edition)
Photoshop Tutorial for Beginners (2024 Edition)
 
Liberarsi dai framework con i Web Component.pptx
Liberarsi dai framework con i Web Component.pptxLiberarsi dai framework con i Web Component.pptx
Liberarsi dai framework con i Web Component.pptx
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
 
DevOps Consulting Company | Hire DevOps Services
DevOps Consulting Company | Hire DevOps ServicesDevOps Consulting Company | Hire DevOps Services
DevOps Consulting Company | Hire DevOps Services
 
How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?How Can Hiring A Mobile App Development Company Help Your Business Grow?
How Can Hiring A Mobile App Development Company Help Your Business Grow?
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
 
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
The Power of Visual Regression Testing_ Why It Is Critical for Enterprise App...
 
Enhanced Screen Flows UI/UX using SLDS with Tom Kitt
Enhanced Screen Flows UI/UX using SLDS with Tom KittEnhanced Screen Flows UI/UX using SLDS with Tom Kitt
Enhanced Screen Flows UI/UX using SLDS with Tom Kitt
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
 
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
WMF 2024 - Unlocking the Future of Data Powering Next-Gen AI with Vector Data...
 

EUGM 2014 - James Lumley (Eli Lilly and Co.): Making Workflows Work: Enterprise deployment of KNIME at Lilly

  • 1. Making workflows Work Enterprise KNIME deployment at Lilly James A. Lumley (Research IT UK) ChemAxon UGM Budapest 2014
  • 2. Making workflows Work! 1. Why KNIME? 2. Old meets New 3. Don’t mention structures 4. Better conversions 20/05/2014
  • 3. Making workflows Work! 1. Why KNIME? 20/05/2014
  • 4.
  • 5. KNIME@Lilly ‘Freemium’ turned ‘Premium’ • 2010: Strong usage including opensource contributions by Mike Bodkins UK CompChem group • 2012: Research IT consolidated workflow tools via KNIME.com Enterprise license and built an infrastructure to develop and deploy the tool globally 20/05/2014 Company Confidential © 2014 Eli Lilly and Company
  • 6. 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 6 Java/Eclipse platform allows easy creation of custom extensions (including security model) Server helps drive Sci/IT collab and knowledge capture Integration with existing legacy systems & data (esp. via SOA) Strong precedence for Workflow Software in Pharma
  • 7. Infrastructure to support the deployment Company Confidential © 2014 Eli Lilly and Company
  • 8. OpenSource Nodes: + • Due a ‘refresh’ • Chemaxon dependency in many nodes including: • Chemical structure handling (conversions), sketcher (Marvin), Molecule Difference check (testing) and rendering (views) 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 8
  • 9. Example Lilly Node using Chemaxon: • Multi-molecule sketcher extension based on Marvin • Configure to sketch and edit multiple structures or reactions • Output multiple structures (port_0) or reactions (port_1) on node execution • Internally reuse code for sketcher applet in webportal 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 9
  • 10. 2 years on, significant usage*: • CompChem/MedChem, ADME Reporting, Analytical Technologies Automation, Sample Management, Automating Data ETL & Data Exploration… http://www.knime.com/files/004_kuduk2013-jamesalumley-lilly.pdf http://www.knime.com/knime-user-day-uk-2013-news 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 10 *
  • 11. Making workflows Work! 2. Old Meets New: KNIME working alongside legacy systems
  • 12. Many nodes link legacy systems: 1. Retain ‘trusted’ status of internal data access tools (e.g.: internal system for integrated data access, Mobius)* 2. Retain power of in house legacy predictive modelling code e.g.: SVM models unix code 3. Interface with new systems e.g.: AT Structure Verification tools 50% of >100 internal nodes use SOA or similar to serve analytics tools and data to KNIME http://www.triconference.com/11/ird 20/05/2014 Company Confidential © 2014 Eli Lilly and Company *
  • 13. Making workflows Work! 3. Don’t Mention Structures: Getting KNIME to work with different data security models
  • 14. 20/05/2014 Company Confidential © 2014 Eli Lilly and Company 14 Huge reliance on SOA to provide Tools and Data to KNIME: + moves data security issues to web service layer + reduces CPU load on ‘office’ laptops - Services needs constant monitoring - Large work effort adding NTLM Auth to Webservice nodes
  • 15. 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 15 • In application support page/tab • Status of Webservices (separates node errors from service layer errors) • Links to Webpages • Known Bugs/Issues from Redmine
  • 16. Making workflows Work! 4. Better conversions Ensuring good interplay between the many chemical data types in KNIME without users feeling the pain
  • 17. • Converter nodes in top 20 most commonly used nodes in analysis of >2000 workflows on Lilly KNIME server • Some workflows contain around 50% converter nodes • New users confused by multiple molecule types and conversions (Analysis from Summer 2012) 20/05/2014 Company Confidential © 2014 Eli Lilly and Company
  • 18. 20/05/2014 Company Confidential © 2014 Eli Lilly and Company Lilly Matched Pairs node requires RDKit type Internal unix code (service layer) requires Smiles value Property calculator needs CDK type Internal data retrieval system serves data and molecule as chime type converter converter converter converter
  • 19. • Different Chemical Types don’t work well together • Users constantly converting chemical data from one ‘type’ to another • Worse for Lilly nodes that utilise many formats with no ‘standard’ vendor like representation 20/05/2014 Company Confidential © 2014 Eli Lilly and Company
  • 20. Aim: • Remove need for user to manually add chemical converter nodes • Ensure Nodes that use different Chemical formats to work together better 20/05/2014 Company Confidential © 2014 Eli Lilly and Company
  • 21. KNIME.com introduced “Adaptor Cell” in 2.9 • Container with several representations of same entity • Node can add additional representations that can be re-used by downstream nodes • Avoids multiple conversions • Original representation still present • Vendor Specific! No pseudo standards such as SDF 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 21 SDF  RDKit  CDK  Indigo
  • 22. Lilly Solution for (Pseudo) standards: • Extension point for handling Molecule Type conversions • Depends on Marvin library for Molecule conversions + • In development! • Will be released opensource 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 22
  • 23. Before 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 23
  • 24. • Extension point moves conversions into Node configuration • Workflow still documents explicit type conversions • Still retains support for Converter nodes if/when appropriate 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 24
  • 25. 5/20/2014 Company Confidential © 2014 Eli Lilly and Company 25 Converters could be ‘chained’ if direct conversion not available (e.g.: InChI or Chime). Example shown in dialogue: Before After Requires SMILES
  • 26. Making Workflows Work: • Added many legacy tools and data services into KNIME via custom nodes and SOA • Aided usability by adding dashboard for service layer monitoring • Added authentication handling via NTLM Auth to provide data authentication at source • Adding molecule handling framework to reduce number of molecule conversions users need 20/05/2014 Company Confidential © 2014 Eli Lilly and Company
  • 27. Acknowledgements Java Coding & Infrastructure (Lilly): Luke Bullard, Tom Wilkin Project Management, End User support, Expert Users (& Testers), Previous Developers etc.: Derek Marren, Marnie Williams, Pip Turner, Matt Hirst, Dave Thorner, Dave Evans, Mike Bodkin, Niko Fechner, Roger Robinson, Jibo Wang, Christos Nicolaou, Beth Wright, Gary Sharman, Simon Richards, Stuart Morton, Jason Ochoada, Jim Hughes (In no particular order!) KNIME.com Bernd, Thorsten, Thomas, Aaron ++