SlideShare a Scribd company logo
1 of 31
Avogadro, Open Chemistry and Semantics

             August 21, 2012
            Skolnik Symposium

             Marcus D. Hanwell
             Kyle Lutz
                                         1	
  
Introduction to Kitware
•  Founded in 1998: 5 former GE Research employees
•  105 employees: more than 50 PhDs
•  Privately held, profitable from creation, no debt
•  Rapidly Growing: >30% in 2011, 7M web-visitors/quarter
•  Offices
                                           •  2011 Small Business
   –  Albany, NY                              Administration’s
   –  Carrboro, NC                            Tibbetts Award

   –  Santa Fe, NM                         •  HPCWire Readers
                                              and Editor’s Choice
   –  Lyon, France
                                           •  Inc’s 5000 List: 2008
   –  Bangalore, India                        to 2011
Avogadro
•  Project began in 2006
•  Split into library &
   application (plugin-based)
•  One of very few open source editors
•  Designed to be extensible from the start
•  Generates input & reads output from many
   codes
•  An active and growing community
•  Chemistry needs a free, open framework
             http://avogadro.openmolecules.net/
                                                  3	
  
Avogadro Paper Published 8/13/12




http://www.jcheminf.com/content/4/1/17

                                         4	
  
Structure to Input Deck




                          5	
  
Vision for the Future
•  Advancing the state-of-the-art
•  Tight integration is needed
   •    Computational codes
   •    Clusters/supercomputers
   •    Data repositories
   •    Reduce, reuse, recycle!
•  Facilitating sharing and
   searching of data
•  Embracing open data, cheminformatics

                                          6	
  
Opening Up Chemistry
•  One of the most closed sciences
•  Lots of black box proprietary codes
  –  Only a few have access to the code
  –  Publishing results from black box codes
  –  Many file formats in use, little agreement
•  More papers should be including data
•  Growing need for open standards
•  Open tools needed to make that happen
                                                  7	
  
Introduction to Open Chemistry
•  User-friendly integration with
  –  Computational codes
  –  HPC/cloud resources
  –  Database/informatics resources




                                      8	
  
Introduction
•  An open approach to chemistry software
                                          Build, Test
  –  Open source frameworks               & Package
                                                        Community
                                                         Review
  –  Developed openly
  –  Cross-platform
  –  Tested, verified
                              Software
  –  Contribution model      Repository

  –  Supported by Kitware experts                       Developers
                                                         & Users
•  BSD licensed to facilitate research/reuse

                                                              9	
  
Open Chemistry Development Team

•  Assembled an inter-disciplinary team
•  Domain specialists: quantum chemistry,
   biology, solid-state materials
•  Computer scientists: build systems,
   queuing, graphics, software process
•  Marcus, Kyle, David L., Chris, David C.



                                             10	
  
OpenChemistry.org
•    New website to promote open chemistry
•    Hosts project-specific pages
•    Provides an identity for related projects
•    Promotes shared ownership of projects
     –  Website
     –  Code submission/review
     –  Testing infrastructure
     –  Wiki, mailing lists, news, galleries

                                                 11	
  
12	
  
Applications Being Developed
•  Three independent applications
•  Communication handled with local sockets
•  Avogadro 2 – structure editing, input
   generation, output viewing, and analysis
•  MoleQueue – running local and remote
   jobs in standalone programs, management
•  ChemData – Storage of data, searching,
   entry, annotation
                                          13	
  
Open Frameworks
•  AvogadroLibs – core data structures and
   algorithms shared across codes
•  OpenQube – a collaboration platform for
   quantum data ingestion and visualization
•  Chemkit – file I/O, exploration and
   chemoinformatics analysis
•  VTK – specialized chemistry visualization/
   data structures, use of above

                                                14	
  
Project Diagram: Libraries/Apps
Core,	
  command	
  line	
          GUI/Visualiza:on	
            HPC	
  

                 OpenQube	
                     Avogadro	
  



   AvogadroLibs	
                     VTK	
                    MoleQueue	
  



                      Chemkit	
                 ChemData	
  




                                                                            15	
  
Typical Workflow
        Log File                      Input File
                   Edit/Analyze	
  




   Results	
           Data	
         Job	
  Submission	
  




                                        Local
                   Calcula:on	
        Remote

                                                          16	
  
Proposed Workflow

       Log File                      Input File
                  Edit/Analyze	
  



  Results	
           Data	
         Job	
  Submission	
  




                                       Local
                  Calcula:on	
        Remote

                                                         17	
  
Optimal Workflow

       Log File                         Input File
                    Avogadro	
  



  Results	
       ChemData	
            Job	
  Submission	
  



                  MoleQueue	
        Local

                                     Remote
                    Calcula:on	
  

                                                            18	
  
Avogadro2
•  Project began 2006
•  Split into library &
    application (plugin-based)
•  One of very few open source editors
•  Still using Qt, C++, Eigen, OpenGL
•  Uses AvogadroLibs and OpenQube for core data
•  Introduces client-server dataflow/patterns
•  Includes new, efficient rendering code
•  More liberally licensed – from GPL to BSD

                                              19	
  
Avogadro: Visualization
•    GPU-accelerated rendering
•    VTK for advanced visualization
•    Support for 2D and 3D data plots
•    Optimized data structures
     –  Large data
     –  Streaming
•  Reworked interface
     –  Tighter database/workflow integration

                                                20	
  
MoleQueue: Job Management
•  Tighter integration with remote queues
•  Integration with databases
  –  Retains full log of computational jobs
  –  Triggers actions on completion
•  Plugin-based system
  –  Easy addition of new codes
  –  Easy addition of new queuing systems
•  Provides a client API for applications

                                              21	
  
MoleQueue
•  Supports configuration of a variety of
   remote clusters and queuing software
New CML I/O
•  Development of modular CML code
•  Allows for multi-pass parsing of CML
•  Keeps the CML closer to application
•  Much faster, easier to extend and change
•  Moving from simple CML to full semantic
   documents that can be edited
•  Learned from previous work in VTK and
   Open Babel
                                              23	
  
File Format: CML & HDF5
•  Leverages our experience with XDMF
•  CML stores semantic data
  –  Name, formula, atoms, bonds
  –  Computational code, theory, basis set
•  HDF5 used to store heavy data
  –  Basis set, intermediate data
  –  Eigenvectors, SCF matrix
  –  Volumetric data (MOs, electron density)

                                               24	
  
Rethinking Input File Generation
•  Can we create a CML representation?
     –  Could be loaded directly by some codes
     –  Could be translated to input files for others
•    Would allow search on input and output
•    Could be stored and published
•    Makes it easier to set up calculations
•    Creates a more uniform experience


                                                        25	
  
Advanced Impostor Rendering
•  Using a scene, vertex buffer objects, and
   OpenGL shading language
•  Impostor techniques
  –  Sphere goes from 100s of triangles to 2!
  –  No artifacts from triangulation
  –  Scales to millions of spheres on modest GPU




                                                   26	
  
Impostor Sphere Rendering




                            27	
  
Building Community
•  Community around chemistry
                                                     Build, Test
   projects                                          & Package
                                                                   Community
•  Using Kitware’s software process                                 Review
   –  Ensuring quality with continuous
      testing
   –  Code contributions on the web
   –  Public mailing lists, bug trackers,
      code review
•  Promoting projects and                Software
                                        Repository
   participation
   –  Publications
                                                                   Developers
   –  Conferences                                                   & Users
   –  Workshops



                                                                            28	
  
Software Process
•  Source code publicly hosted using Git
•  Gerrit for code review
•  CTest/CDash for testing/summary
  –  Gerrit can use CDash@Home
     •  Test proposed changes before merging
•  CDash can now provide binaries
  –  Built nightly, available for direct download
•  Wiki, mailing list, bug tracker

                                                    29	
  
Conclusions
•  Real opportunity to make an impact
•  Improve research, industry and teaching
•  Semantic data at the center of our work
  –  Storage
  –  Search
  –  Interaction with computational codes
  –  Comparison with experimental data
•  Add support for iOS, Android and web

                                             30	
  
Acknowledgements
•  Google Summer of Code for initial summer funding
•  Avogadro developers: Geoffrey R. Hutchison, Donald E. Curtis,
   David C. Lonie, Tim Vandermeersch and many more contributors,
   users and supporters
•  Kitware, Inc. for their unique business model & support
•  The Engineer Research and Development Center’s Environmental
   Laboratory for recent funding
•  Open-source projects, standards and services we build on: Qt, Open
   Babel, GLEW, CML, CACTUS Resolver, many, many more projects
•  Support of many code developers including MOPAC, NWChem, Q-
   Chem and others
•  Support from Peter Murray-Rust and the Blue Obelisk



                                                                   31	
  

More Related Content

What's hot

C-13 NMR Spectroscopy
C-13 NMR SpectroscopyC-13 NMR Spectroscopy
C-13 NMR SpectroscopyMANISHSAHU106
 
energy minimization
energy minimizationenergy minimization
energy minimizationpradeep kore
 
Druggability of NCEs
Druggability of NCEsDruggability of NCEs
Druggability of NCEsDheeraj Kumar
 
Computer Aided Drug Design QSAR Related Methods
Computer Aided Drug Design QSAR Related MethodsComputer Aided Drug Design QSAR Related Methods
Computer Aided Drug Design QSAR Related MethodsJahan B Ghasemi
 
FISCHER AND NEWMAN PROJECTIONS
FISCHER AND NEWMAN PROJECTIONSFISCHER AND NEWMAN PROJECTIONS
FISCHER AND NEWMAN PROJECTIONSPRUTHVIRAJ K
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsarRahul B S
 
Potential Energy Surface & Molecular Graphics
Potential Energy Surface & Molecular GraphicsPotential Energy Surface & Molecular Graphics
Potential Energy Surface & Molecular GraphicsPrasanthperceptron
 
Molecular and Quantum Mechanics in drug design
Molecular and Quantum Mechanics in drug designMolecular and Quantum Mechanics in drug design
Molecular and Quantum Mechanics in drug designAjay Kumar
 
2D QSAR DESCRIPTORS
2D QSAR DESCRIPTORS2D QSAR DESCRIPTORS
2D QSAR DESCRIPTORSSmita Jain
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryAbhik Seal
 
Computer aided drug design
Computer aided drug designComputer aided drug design
Computer aided drug designROHIT
 
Ligand based drug design
Ligand based drug designLigand based drug design
Ligand based drug designSatyendra Yadav
 
Spin spin coupling and coupling constant
Spin spin coupling and coupling constantSpin spin coupling and coupling constant
Spin spin coupling and coupling constantHimal Barakoti
 

What's hot (20)

Drug likeness Properties
Drug likeness  PropertiesDrug likeness  Properties
Drug likeness Properties
 
C-13 NMR Spectroscopy
C-13 NMR SpectroscopyC-13 NMR Spectroscopy
C-13 NMR Spectroscopy
 
energy minimization
energy minimizationenergy minimization
energy minimization
 
Druggability of NCEs
Druggability of NCEsDruggability of NCEs
Druggability of NCEs
 
Mass spectrometry
Mass spectrometry Mass spectrometry
Mass spectrometry
 
Computer Aided Drug Design QSAR Related Methods
Computer Aided Drug Design QSAR Related MethodsComputer Aided Drug Design QSAR Related Methods
Computer Aided Drug Design QSAR Related Methods
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
FISCHER AND NEWMAN PROJECTIONS
FISCHER AND NEWMAN PROJECTIONSFISCHER AND NEWMAN PROJECTIONS
FISCHER AND NEWMAN PROJECTIONS
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsar
 
Potential Energy Surface & Molecular Graphics
Potential Energy Surface & Molecular GraphicsPotential Energy Surface & Molecular Graphics
Potential Energy Surface & Molecular Graphics
 
QSAR
QSARQSAR
QSAR
 
Molecular and Quantum Mechanics in drug design
Molecular and Quantum Mechanics in drug designMolecular and Quantum Mechanics in drug design
Molecular and Quantum Mechanics in drug design
 
2D QSAR DESCRIPTORS
2D QSAR DESCRIPTORS2D QSAR DESCRIPTORS
2D QSAR DESCRIPTORS
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug Discovery
 
Qsar
QsarQsar
Qsar
 
Docking
DockingDocking
Docking
 
Computer aided drug design
Computer aided drug designComputer aided drug design
Computer aided drug design
 
Woodward__fieser Rule
Woodward__fieser Rule Woodward__fieser Rule
Woodward__fieser Rule
 
Ligand based drug design
Ligand based drug designLigand based drug design
Ligand based drug design
 
Spin spin coupling and coupling constant
Spin spin coupling and coupling constantSpin spin coupling and coupling constant
Spin spin coupling and coupling constant
 

Viewers also liked

Chemistry project on casein in mik
Chemistry project on casein in mikChemistry project on casein in mik
Chemistry project on casein in mikVirat Prasad
 
Atom Economy - Comenius Project
Atom Economy - Comenius ProjectAtom Economy - Comenius Project
Atom Economy - Comenius Projectclasse4ach
 
chemistry Project class 12
chemistry Project class 12chemistry Project class 12
chemistry Project class 12Vighnesh Jm
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Rushil Aggarwal
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry ProjectMarcus Hanwell
 
Struktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijStruktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijFleurati
 
Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"classe4ach
 
Projekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseProjekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseIris Sakej
 
Chemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSChemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSNaveen R
 
Chemistry project for Class 12
Chemistry project for Class 12Chemistry project for Class 12
Chemistry project for Class 12Shahban Ali
 
chemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeychemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeyRadha Gupta
 
Chemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeChemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeShashvat Sharma
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Self-employed
 

Viewers also liked (15)

C7 lesson part four
C7 lesson part fourC7 lesson part four
C7 lesson part four
 
Chemistry project on casein in mik
Chemistry project on casein in mikChemistry project on casein in mik
Chemistry project on casein in mik
 
Atom Economy - Comenius Project
Atom Economy - Comenius ProjectAtom Economy - Comenius Project
Atom Economy - Comenius Project
 
chemistry Project class 12
chemistry Project class 12chemistry Project class 12
chemistry Project class 12
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry Project
 
Struktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijStruktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tij
 
Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"
 
Projekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseProjekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se Kimise
 
Chemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSChemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKS
 
Chemistry project for Class 12
Chemistry project for Class 12Chemistry project for Class 12
Chemistry project for Class 12
 
chemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeychemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honey
 
Chemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeChemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday life
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12
 
Chemistry project
Chemistry projectChemistry project
Chemistry project
 

Similar to Avogadro, Open Chemistry and Semantics

Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopMarcus Hanwell
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisMarcus Hanwell
 
Open Source Visualization of Scientific Data
Open Source Visualization of Scientific DataOpen Source Visualization of Scientific Data
Open Source Visualization of Scientific DataMarcus Hanwell
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Centre of Competence
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNeo4j
 
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Subbu Rama
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Christopher Curtin
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015WaveMaker, Inc.
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computingaccount inactive
 
Adopting OpenTelemetry
Adopting OpenTelemetryAdopting OpenTelemetry
Adopting OpenTelemetryVincent Behar
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open sourceTech Triveni
 
Zero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingZero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingplumbee
 
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...Jean Vanderdonckt
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realistsKarthik Gaekwad
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERIndrajit Poddar
 
Road to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopRoad to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopNeo4j
 
How static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codeHow static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codecppfrug
 
Architectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyArchitectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyComsysto Reply GmbH
 

Similar to Avogadro, Open Chemistry and Semantics (20)

Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the Desktop
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
 
Open Source Visualization of Scientific Data
Open Source Visualization of Scientific DataOpen Source Visualization of Scientific Data
Open Source Visualization of Scientific Data
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens Neudecker
 
Avoiding cloud lock-in
Avoiding cloud lock-inAvoiding cloud lock-in
Avoiding cloud lock-in
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4j
 
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computing
 
Adopting OpenTelemetry
Adopting OpenTelemetryAdopting OpenTelemetry
Adopting OpenTelemetry
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open source
 
Zero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingZero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is king
 
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realists
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
 
Road to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopRoad to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache Hop
 
How static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codeHow static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ code
 
Architectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyArchitectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and Consistently
 

More from Marcus Hanwell

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryMarcus Hanwell
 
Avogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryAvogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryMarcus Hanwell
 
Avogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryAvogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryMarcus Hanwell
 
Saving Science - Open Up or Perish
Saving Science - Open Up or PerishSaving Science - Open Up or Perish
Saving Science - Open Up or PerishMarcus Hanwell
 
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceOpen Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceMarcus Hanwell
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceMarcus Hanwell
 
CMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessCMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessMarcus Hanwell
 

More from Marcus Hanwell (7)

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
 
Avogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryAvogadro 2 and Open Chemistry
Avogadro 2 and Open Chemistry
 
Avogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryAvogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational Chemistry
 
Saving Science - Open Up or Perish
Saving Science - Open Up or PerishSaving Science - Open Up or Perish
Saving Science - Open Up or Perish
 
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceOpen Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open Science
 
CMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessCMake: Improving Software Quality and Process
CMake: Improving Software Quality and Process
 

Recently uploaded

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Avogadro, Open Chemistry and Semantics

  • 1. Avogadro, Open Chemistry and Semantics August 21, 2012 Skolnik Symposium Marcus D. Hanwell Kyle Lutz 1  
  • 2. Introduction to Kitware •  Founded in 1998: 5 former GE Research employees •  105 employees: more than 50 PhDs •  Privately held, profitable from creation, no debt •  Rapidly Growing: >30% in 2011, 7M web-visitors/quarter •  Offices •  2011 Small Business –  Albany, NY Administration’s –  Carrboro, NC Tibbetts Award –  Santa Fe, NM •  HPCWire Readers and Editor’s Choice –  Lyon, France •  Inc’s 5000 List: 2008 –  Bangalore, India to 2011
  • 3. Avogadro •  Project began in 2006 •  Split into library & application (plugin-based) •  One of very few open source editors •  Designed to be extensible from the start •  Generates input & reads output from many codes •  An active and growing community •  Chemistry needs a free, open framework http://avogadro.openmolecules.net/ 3  
  • 4. Avogadro Paper Published 8/13/12 http://www.jcheminf.com/content/4/1/17 4  
  • 5. Structure to Input Deck 5  
  • 6. Vision for the Future •  Advancing the state-of-the-art •  Tight integration is needed •  Computational codes •  Clusters/supercomputers •  Data repositories •  Reduce, reuse, recycle! •  Facilitating sharing and searching of data •  Embracing open data, cheminformatics 6  
  • 7. Opening Up Chemistry •  One of the most closed sciences •  Lots of black box proprietary codes –  Only a few have access to the code –  Publishing results from black box codes –  Many file formats in use, little agreement •  More papers should be including data •  Growing need for open standards •  Open tools needed to make that happen 7  
  • 8. Introduction to Open Chemistry •  User-friendly integration with –  Computational codes –  HPC/cloud resources –  Database/informatics resources 8  
  • 9. Introduction •  An open approach to chemistry software Build, Test –  Open source frameworks & Package Community Review –  Developed openly –  Cross-platform –  Tested, verified Software –  Contribution model Repository –  Supported by Kitware experts Developers & Users •  BSD licensed to facilitate research/reuse 9  
  • 10. Open Chemistry Development Team •  Assembled an inter-disciplinary team •  Domain specialists: quantum chemistry, biology, solid-state materials •  Computer scientists: build systems, queuing, graphics, software process •  Marcus, Kyle, David L., Chris, David C. 10  
  • 11. OpenChemistry.org •  New website to promote open chemistry •  Hosts project-specific pages •  Provides an identity for related projects •  Promotes shared ownership of projects –  Website –  Code submission/review –  Testing infrastructure –  Wiki, mailing lists, news, galleries 11  
  • 12. 12  
  • 13. Applications Being Developed •  Three independent applications •  Communication handled with local sockets •  Avogadro 2 – structure editing, input generation, output viewing, and analysis •  MoleQueue – running local and remote jobs in standalone programs, management •  ChemData – Storage of data, searching, entry, annotation 13  
  • 14. Open Frameworks •  AvogadroLibs – core data structures and algorithms shared across codes •  OpenQube – a collaboration platform for quantum data ingestion and visualization •  Chemkit – file I/O, exploration and chemoinformatics analysis •  VTK – specialized chemistry visualization/ data structures, use of above 14  
  • 15. Project Diagram: Libraries/Apps Core,  command  line   GUI/Visualiza:on   HPC   OpenQube   Avogadro   AvogadroLibs   VTK   MoleQueue   Chemkit   ChemData   15  
  • 16. Typical Workflow Log File Input File Edit/Analyze   Results   Data   Job  Submission   Local Calcula:on   Remote 16  
  • 17. Proposed Workflow Log File Input File Edit/Analyze   Results   Data   Job  Submission   Local Calcula:on   Remote 17  
  • 18. Optimal Workflow Log File Input File Avogadro   Results   ChemData   Job  Submission   MoleQueue   Local Remote Calcula:on   18  
  • 19. Avogadro2 •  Project began 2006 •  Split into library & application (plugin-based) •  One of very few open source editors •  Still using Qt, C++, Eigen, OpenGL •  Uses AvogadroLibs and OpenQube for core data •  Introduces client-server dataflow/patterns •  Includes new, efficient rendering code •  More liberally licensed – from GPL to BSD 19  
  • 20. Avogadro: Visualization •  GPU-accelerated rendering •  VTK for advanced visualization •  Support for 2D and 3D data plots •  Optimized data structures –  Large data –  Streaming •  Reworked interface –  Tighter database/workflow integration 20  
  • 21. MoleQueue: Job Management •  Tighter integration with remote queues •  Integration with databases –  Retains full log of computational jobs –  Triggers actions on completion •  Plugin-based system –  Easy addition of new codes –  Easy addition of new queuing systems •  Provides a client API for applications 21  
  • 22. MoleQueue •  Supports configuration of a variety of remote clusters and queuing software
  • 23. New CML I/O •  Development of modular CML code •  Allows for multi-pass parsing of CML •  Keeps the CML closer to application •  Much faster, easier to extend and change •  Moving from simple CML to full semantic documents that can be edited •  Learned from previous work in VTK and Open Babel 23  
  • 24. File Format: CML & HDF5 •  Leverages our experience with XDMF •  CML stores semantic data –  Name, formula, atoms, bonds –  Computational code, theory, basis set •  HDF5 used to store heavy data –  Basis set, intermediate data –  Eigenvectors, SCF matrix –  Volumetric data (MOs, electron density) 24  
  • 25. Rethinking Input File Generation •  Can we create a CML representation? –  Could be loaded directly by some codes –  Could be translated to input files for others •  Would allow search on input and output •  Could be stored and published •  Makes it easier to set up calculations •  Creates a more uniform experience 25  
  • 26. Advanced Impostor Rendering •  Using a scene, vertex buffer objects, and OpenGL shading language •  Impostor techniques –  Sphere goes from 100s of triangles to 2! –  No artifacts from triangulation –  Scales to millions of spheres on modest GPU 26  
  • 28. Building Community •  Community around chemistry Build, Test projects & Package Community •  Using Kitware’s software process Review –  Ensuring quality with continuous testing –  Code contributions on the web –  Public mailing lists, bug trackers, code review •  Promoting projects and Software Repository participation –  Publications Developers –  Conferences & Users –  Workshops 28  
  • 29. Software Process •  Source code publicly hosted using Git •  Gerrit for code review •  CTest/CDash for testing/summary –  Gerrit can use CDash@Home •  Test proposed changes before merging •  CDash can now provide binaries –  Built nightly, available for direct download •  Wiki, mailing list, bug tracker 29  
  • 30. Conclusions •  Real opportunity to make an impact •  Improve research, industry and teaching •  Semantic data at the center of our work –  Storage –  Search –  Interaction with computational codes –  Comparison with experimental data •  Add support for iOS, Android and web 30  
  • 31. Acknowledgements •  Google Summer of Code for initial summer funding •  Avogadro developers: Geoffrey R. Hutchison, Donald E. Curtis, David C. Lonie, Tim Vandermeersch and many more contributors, users and supporters •  Kitware, Inc. for their unique business model & support •  The Engineer Research and Development Center’s Environmental Laboratory for recent funding •  Open-source projects, standards and services we build on: Qt, Open Babel, GLEW, CML, CACTUS Resolver, many, many more projects •  Support of many code developers including MOPAC, NWChem, Q- Chem and others •  Support from Peter Murray-Rust and the Blue Obelisk 31