SlideShare a Scribd company logo
1 of 31
Avogadro, Open Chemistry and Semantics

             August 21, 2012
            Skolnik Symposium

             Marcus D. Hanwell
             Kyle Lutz
                                         1	
  
Introduction to Kitware
•  Founded in 1998: 5 former GE Research employees
•  105 employees: more than 50 PhDs
•  Privately held, profitable from creation, no debt
•  Rapidly Growing: >30% in 2011, 7M web-visitors/quarter
•  Offices
                                           •  2011 Small Business
   –  Albany, NY                              Administration’s
   –  Carrboro, NC                            Tibbetts Award

   –  Santa Fe, NM                         •  HPCWire Readers
                                              and Editor’s Choice
   –  Lyon, France
                                           •  Inc’s 5000 List: 2008
   –  Bangalore, India                        to 2011
Avogadro
•  Project began in 2006
•  Split into library &
   application (plugin-based)
•  One of very few open source editors
•  Designed to be extensible from the start
•  Generates input & reads output from many
   codes
•  An active and growing community
•  Chemistry needs a free, open framework
             http://avogadro.openmolecules.net/
                                                  3	
  
Avogadro Paper Published 8/13/12




http://www.jcheminf.com/content/4/1/17

                                         4	
  
Structure to Input Deck




                          5	
  
Vision for the Future
•  Advancing the state-of-the-art
•  Tight integration is needed
   •    Computational codes
   •    Clusters/supercomputers
   •    Data repositories
   •    Reduce, reuse, recycle!
•  Facilitating sharing and
   searching of data
•  Embracing open data, cheminformatics

                                          6	
  
Opening Up Chemistry
•  One of the most closed sciences
•  Lots of black box proprietary codes
  –  Only a few have access to the code
  –  Publishing results from black box codes
  –  Many file formats in use, little agreement
•  More papers should be including data
•  Growing need for open standards
•  Open tools needed to make that happen
                                                  7	
  
Introduction to Open Chemistry
•  User-friendly integration with
  –  Computational codes
  –  HPC/cloud resources
  –  Database/informatics resources




                                      8	
  
Introduction
•  An open approach to chemistry software
                                          Build, Test
  –  Open source frameworks               & Package
                                                        Community
                                                         Review
  –  Developed openly
  –  Cross-platform
  –  Tested, verified
                              Software
  –  Contribution model      Repository

  –  Supported by Kitware experts                       Developers
                                                         & Users
•  BSD licensed to facilitate research/reuse

                                                              9	
  
Open Chemistry Development Team

•  Assembled an inter-disciplinary team
•  Domain specialists: quantum chemistry,
   biology, solid-state materials
•  Computer scientists: build systems,
   queuing, graphics, software process
•  Marcus, Kyle, David L., Chris, David C.



                                             10	
  
OpenChemistry.org
•    New website to promote open chemistry
•    Hosts project-specific pages
•    Provides an identity for related projects
•    Promotes shared ownership of projects
     –  Website
     –  Code submission/review
     –  Testing infrastructure
     –  Wiki, mailing lists, news, galleries

                                                 11	
  
12	
  
Applications Being Developed
•  Three independent applications
•  Communication handled with local sockets
•  Avogadro 2 – structure editing, input
   generation, output viewing, and analysis
•  MoleQueue – running local and remote
   jobs in standalone programs, management
•  ChemData – Storage of data, searching,
   entry, annotation
                                          13	
  
Open Frameworks
•  AvogadroLibs – core data structures and
   algorithms shared across codes
•  OpenQube – a collaboration platform for
   quantum data ingestion and visualization
•  Chemkit – file I/O, exploration and
   chemoinformatics analysis
•  VTK – specialized chemistry visualization/
   data structures, use of above

                                                14	
  
Project Diagram: Libraries/Apps
Core,	
  command	
  line	
          GUI/Visualiza:on	
            HPC	
  

                 OpenQube	
                     Avogadro	
  



   AvogadroLibs	
                     VTK	
                    MoleQueue	
  



                      Chemkit	
                 ChemData	
  




                                                                            15	
  
Typical Workflow
        Log File                      Input File
                   Edit/Analyze	
  




   Results	
           Data	
         Job	
  Submission	
  




                                        Local
                   Calcula:on	
        Remote

                                                          16	
  
Proposed Workflow

       Log File                      Input File
                  Edit/Analyze	
  



  Results	
           Data	
         Job	
  Submission	
  




                                       Local
                  Calcula:on	
        Remote

                                                         17	
  
Optimal Workflow

       Log File                         Input File
                    Avogadro	
  



  Results	
       ChemData	
            Job	
  Submission	
  



                  MoleQueue	
        Local

                                     Remote
                    Calcula:on	
  

                                                            18	
  
Avogadro2
•  Project began 2006
•  Split into library &
    application (plugin-based)
•  One of very few open source editors
•  Still using Qt, C++, Eigen, OpenGL
•  Uses AvogadroLibs and OpenQube for core data
•  Introduces client-server dataflow/patterns
•  Includes new, efficient rendering code
•  More liberally licensed – from GPL to BSD

                                              19	
  
Avogadro: Visualization
•    GPU-accelerated rendering
•    VTK for advanced visualization
•    Support for 2D and 3D data plots
•    Optimized data structures
     –  Large data
     –  Streaming
•  Reworked interface
     –  Tighter database/workflow integration

                                                20	
  
MoleQueue: Job Management
•  Tighter integration with remote queues
•  Integration with databases
  –  Retains full log of computational jobs
  –  Triggers actions on completion
•  Plugin-based system
  –  Easy addition of new codes
  –  Easy addition of new queuing systems
•  Provides a client API for applications

                                              21	
  
MoleQueue
•  Supports configuration of a variety of
   remote clusters and queuing software
New CML I/O
•  Development of modular CML code
•  Allows for multi-pass parsing of CML
•  Keeps the CML closer to application
•  Much faster, easier to extend and change
•  Moving from simple CML to full semantic
   documents that can be edited
•  Learned from previous work in VTK and
   Open Babel
                                              23	
  
File Format: CML & HDF5
•  Leverages our experience with XDMF
•  CML stores semantic data
  –  Name, formula, atoms, bonds
  –  Computational code, theory, basis set
•  HDF5 used to store heavy data
  –  Basis set, intermediate data
  –  Eigenvectors, SCF matrix
  –  Volumetric data (MOs, electron density)

                                               24	
  
Rethinking Input File Generation
•  Can we create a CML representation?
     –  Could be loaded directly by some codes
     –  Could be translated to input files for others
•    Would allow search on input and output
•    Could be stored and published
•    Makes it easier to set up calculations
•    Creates a more uniform experience


                                                        25	
  
Advanced Impostor Rendering
•  Using a scene, vertex buffer objects, and
   OpenGL shading language
•  Impostor techniques
  –  Sphere goes from 100s of triangles to 2!
  –  No artifacts from triangulation
  –  Scales to millions of spheres on modest GPU




                                                   26	
  
Impostor Sphere Rendering




                            27	
  
Building Community
•  Community around chemistry
                                                     Build, Test
   projects                                          & Package
                                                                   Community
•  Using Kitware’s software process                                 Review
   –  Ensuring quality with continuous
      testing
   –  Code contributions on the web
   –  Public mailing lists, bug trackers,
      code review
•  Promoting projects and                Software
                                        Repository
   participation
   –  Publications
                                                                   Developers
   –  Conferences                                                   & Users
   –  Workshops



                                                                            28	
  
Software Process
•  Source code publicly hosted using Git
•  Gerrit for code review
•  CTest/CDash for testing/summary
  –  Gerrit can use CDash@Home
     •  Test proposed changes before merging
•  CDash can now provide binaries
  –  Built nightly, available for direct download
•  Wiki, mailing list, bug tracker

                                                    29	
  
Conclusions
•  Real opportunity to make an impact
•  Improve research, industry and teaching
•  Semantic data at the center of our work
  –  Storage
  –  Search
  –  Interaction with computational codes
  –  Comparison with experimental data
•  Add support for iOS, Android and web

                                             30	
  
Acknowledgements
•  Google Summer of Code for initial summer funding
•  Avogadro developers: Geoffrey R. Hutchison, Donald E. Curtis,
   David C. Lonie, Tim Vandermeersch and many more contributors,
   users and supporters
•  Kitware, Inc. for their unique business model & support
•  The Engineer Research and Development Center’s Environmental
   Laboratory for recent funding
•  Open-source projects, standards and services we build on: Qt, Open
   Babel, GLEW, CML, CACTUS Resolver, many, many more projects
•  Support of many code developers including MOPAC, NWChem, Q-
   Chem and others
•  Support from Peter Murray-Rust and the Blue Obelisk



                                                                   31	
  

More Related Content

What's hot

Molecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptxMolecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptxHassanShah396906
 
Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)Sreekanth Gali
 
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptx
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptxPREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptx
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptxMO.SHAHANAWAZ
 
Advanced spectral analysis
Advanced spectral analysisAdvanced spectral analysis
Advanced spectral analysisMangalKamble2
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screeningDeependra Ban
 
Crystal structure
Crystal structureCrystal structure
Crystal structureShivam Jain
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARM
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARMDENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARM
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARMShikha Popali
 
Cheminformatics
CheminformaticsCheminformatics
Cheminformaticsbaoilleach
 
Understanding Smiles
Understanding Smiles Understanding Smiles
Understanding Smiles Abhik Seal
 
Fragment based drug design
Fragment based drug designFragment based drug design
Fragment based drug designEkta Tembhare
 
Autodock Made Easy with MGL Tools - Molecular Docking
Autodock Made Easy with MGL Tools - Molecular DockingAutodock Made Easy with MGL Tools - Molecular Docking
Autodock Made Easy with MGL Tools - Molecular DockingGirinath Pillai
 

What's hot (20)

Pharmacophore identification
Pharmacophore identificationPharmacophore identification
Pharmacophore identification
 
Chem draw software
Chem draw software Chem draw software
Chem draw software
 
Molecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptxMolecular dynamics Simulation.pptx
Molecular dynamics Simulation.pptx
 
Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)Pharmacoinformatics Database basics(sree)
Pharmacoinformatics Database basics(sree)
 
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptx
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptxPREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptx
PREDICTION AND ANALYSIS OF ADMET PROPERTIES OF NEW.pptx
 
Chem draW
Chem draWChem draW
Chem draW
 
Advanced spectral analysis
Advanced spectral analysisAdvanced spectral analysis
Advanced spectral analysis
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
 
Crystal structure
Crystal structureCrystal structure
Crystal structure
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARM
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARMDENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARM
DENOVO DRUG DESIGN AS PER PCI SYLLABUS M.PHARM
 
3 D QSAR Approaches and Contour Map Analysis
3 D QSAR Approaches and Contour Map Analysis3 D QSAR Approaches and Contour Map Analysis
3 D QSAR Approaches and Contour Map Analysis
 
docking
docking docking
docking
 
Chemoinformatic
Chemoinformatic Chemoinformatic
Chemoinformatic
 
Cheminformatics
CheminformaticsCheminformatics
Cheminformatics
 
Homogenous catalyst
Homogenous catalyst Homogenous catalyst
Homogenous catalyst
 
Hetcor
HetcorHetcor
Hetcor
 
Understanding Smiles
Understanding Smiles Understanding Smiles
Understanding Smiles
 
Fragment based drug design
Fragment based drug designFragment based drug design
Fragment based drug design
 
Autodock Made Easy with MGL Tools - Molecular Docking
Autodock Made Easy with MGL Tools - Molecular DockingAutodock Made Easy with MGL Tools - Molecular Docking
Autodock Made Easy with MGL Tools - Molecular Docking
 
Molecular Orbital Theory
Molecular Orbital Theory  Molecular Orbital Theory
Molecular Orbital Theory
 

Viewers also liked

Chemistry project on casein in mik
Chemistry project on casein in mikChemistry project on casein in mik
Chemistry project on casein in mikVirat Prasad
 
Atom Economy - Comenius Project
Atom Economy - Comenius ProjectAtom Economy - Comenius Project
Atom Economy - Comenius Projectclasse4ach
 
chemistry Project class 12
chemistry Project class 12chemistry Project class 12
chemistry Project class 12Vighnesh Jm
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Rushil Aggarwal
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry ProjectMarcus Hanwell
 
Struktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijStruktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijFleurati
 
Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"classe4ach
 
Projekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseProjekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseIris Sakej
 
Chemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSChemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSNaveen R
 
Chemistry project for Class 12
Chemistry project for Class 12Chemistry project for Class 12
Chemistry project for Class 12Shahban Ali
 
chemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeychemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeyRadha Gupta
 
Chemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeChemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeShashvat Sharma
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Self-employed
 

Viewers also liked (15)

C7 lesson part four
C7 lesson part fourC7 lesson part four
C7 lesson part four
 
Chemistry project on casein in mik
Chemistry project on casein in mikChemistry project on casein in mik
Chemistry project on casein in mik
 
Atom Economy - Comenius Project
Atom Economy - Comenius ProjectAtom Economy - Comenius Project
Atom Economy - Comenius Project
 
chemistry Project class 12
chemistry Project class 12chemistry Project class 12
chemistry Project class 12
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry Project
 
Struktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tijStruktura e atomit dhe bërthama e tij
Struktura e atomit dhe bërthama e tij
 
Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"Atom economy - "Green Chemistry Project"
Atom economy - "Green Chemistry Project"
 
Projekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se KimiseProjekt Kimi - Historiku i shkences se Kimise
Projekt Kimi - Historiku i shkences se Kimise
 
Chemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKSChemistry Investigatory Project on COLD DRINKS
Chemistry Investigatory Project on COLD DRINKS
 
Chemistry project for Class 12
Chemistry project for Class 12Chemistry project for Class 12
Chemistry project for Class 12
 
chemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honeychemistry project for class 12 on analysis of honey
chemistry project for class 12 on analysis of honey
 
Chemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday lifeChemistry project on chemistry in everyday life
Chemistry project on chemistry in everyday life
 
Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12Chemistry Investigatory Project Class 12
Chemistry Investigatory Project Class 12
 
Chemistry project
Chemistry projectChemistry project
Chemistry project
 

Similar to Avogadro, Open Chemistry and Semantics

Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopMarcus Hanwell
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisMarcus Hanwell
 
Open Source Visualization of Scientific Data
Open Source Visualization of Scientific DataOpen Source Visualization of Scientific Data
Open Source Visualization of Scientific DataMarcus Hanwell
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Centre of Competence
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNeo4j
 
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Subbu Rama
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Christopher Curtin
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015WaveMaker, Inc.
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computingaccount inactive
 
Adopting OpenTelemetry
Adopting OpenTelemetryAdopting OpenTelemetry
Adopting OpenTelemetryVincent Behar
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open sourceTech Triveni
 
Zero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingZero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingplumbee
 
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...Jean Vanderdonckt
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realistsKarthik Gaekwad
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERIndrajit Poddar
 
Road to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopRoad to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopNeo4j
 
How static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codeHow static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codecppfrug
 
Architectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyArchitectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyComsysto Reply GmbH
 

Similar to Avogadro, Open Chemistry and Semantics (20)

Chemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the DesktopChemical Databases and Open Chemistry on the Desktop
Chemical Databases and Open Chemistry on the Desktop
 
Open Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & AnalysisOpen Chemistry: Input Preparation, Data Visualization & Analysis
Open Chemistry: Input Preparation, Data Visualization & Analysis
 
Open Source Visualization of Scientific Data
Open Source Visualization of Scientific DataOpen Source Visualization of Scientific Data
Open Source Visualization of Scientific Data
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens Neudecker
 
Avoiding cloud lock-in
Avoiding cloud lock-inAvoiding cloud lock-in
Avoiding cloud lock-in
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4j
 
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
Kafka 0.8.0 Presentation to Atlanta Java User's Group March 2013
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computing
 
Adopting OpenTelemetry
Adopting OpenTelemetryAdopting OpenTelemetry
Adopting OpenTelemetry
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open source
 
Zero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is kingZero to ten million daily users in four weeks: sustainable speed is king
Zero to ten million daily users in four weeks: sustainable speed is king
 
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
An Open Source Workbench for Prototyping Multimodal Interactions Based on Off...
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realists
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
 
Road to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache HopRoad to NODES - Handling Neo4j Data with Apache Hop
Road to NODES - Handling Neo4j Data with Apache Hop
 
How static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ codeHow static analysis supports quality over 50 million lines of C++ code
How static analysis supports quality over 50 million lines of C++ code
 
Architectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and ConsistentlyArchitectural Decisions: Smoothly and Consistently
Architectural Decisions: Smoothly and Consistently
 

More from Marcus Hanwell

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryMarcus Hanwell
 
Avogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryAvogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryMarcus Hanwell
 
Avogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryAvogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryMarcus Hanwell
 
Saving Science - Open Up or Perish
Saving Science - Open Up or PerishSaving Science - Open Up or Perish
Saving Science - Open Up or PerishMarcus Hanwell
 
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceOpen Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceMarcus Hanwell
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceMarcus Hanwell
 
CMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessCMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessMarcus Hanwell
 

More from Marcus Hanwell (7)

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
 
Avogadro 2 and Open Chemistry
Avogadro 2 and Open ChemistryAvogadro 2 and Open Chemistry
Avogadro 2 and Open Chemistry
 
Avogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational ChemistryAvogadro: Open Source Libraries and Application for Computational Chemistry
Avogadro: Open Source Libraries and Application for Computational Chemistry
 
Saving Science - Open Up or Perish
Saving Science - Open Up or PerishSaving Science - Open Up or Perish
Saving Science - Open Up or Perish
 
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open SourceOpen Chemistry: Realizing Open Data, Open Standards, and Open Source
Open Chemistry: Realizing Open Data, Open Standards, and Open Source
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open Science
 
CMake: Improving Software Quality and Process
CMake: Improving Software Quality and ProcessCMake: Improving Software Quality and Process
CMake: Improving Software Quality and Process
 

Recently uploaded

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Recently uploaded (20)

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 

Avogadro, Open Chemistry and Semantics

  • 1. Avogadro, Open Chemistry and Semantics August 21, 2012 Skolnik Symposium Marcus D. Hanwell Kyle Lutz 1  
  • 2. Introduction to Kitware •  Founded in 1998: 5 former GE Research employees •  105 employees: more than 50 PhDs •  Privately held, profitable from creation, no debt •  Rapidly Growing: >30% in 2011, 7M web-visitors/quarter •  Offices •  2011 Small Business –  Albany, NY Administration’s –  Carrboro, NC Tibbetts Award –  Santa Fe, NM •  HPCWire Readers and Editor’s Choice –  Lyon, France •  Inc’s 5000 List: 2008 –  Bangalore, India to 2011
  • 3. Avogadro •  Project began in 2006 •  Split into library & application (plugin-based) •  One of very few open source editors •  Designed to be extensible from the start •  Generates input & reads output from many codes •  An active and growing community •  Chemistry needs a free, open framework http://avogadro.openmolecules.net/ 3  
  • 4. Avogadro Paper Published 8/13/12 http://www.jcheminf.com/content/4/1/17 4  
  • 5. Structure to Input Deck 5  
  • 6. Vision for the Future •  Advancing the state-of-the-art •  Tight integration is needed •  Computational codes •  Clusters/supercomputers •  Data repositories •  Reduce, reuse, recycle! •  Facilitating sharing and searching of data •  Embracing open data, cheminformatics 6  
  • 7. Opening Up Chemistry •  One of the most closed sciences •  Lots of black box proprietary codes –  Only a few have access to the code –  Publishing results from black box codes –  Many file formats in use, little agreement •  More papers should be including data •  Growing need for open standards •  Open tools needed to make that happen 7  
  • 8. Introduction to Open Chemistry •  User-friendly integration with –  Computational codes –  HPC/cloud resources –  Database/informatics resources 8  
  • 9. Introduction •  An open approach to chemistry software Build, Test –  Open source frameworks & Package Community Review –  Developed openly –  Cross-platform –  Tested, verified Software –  Contribution model Repository –  Supported by Kitware experts Developers & Users •  BSD licensed to facilitate research/reuse 9  
  • 10. Open Chemistry Development Team •  Assembled an inter-disciplinary team •  Domain specialists: quantum chemistry, biology, solid-state materials •  Computer scientists: build systems, queuing, graphics, software process •  Marcus, Kyle, David L., Chris, David C. 10  
  • 11. OpenChemistry.org •  New website to promote open chemistry •  Hosts project-specific pages •  Provides an identity for related projects •  Promotes shared ownership of projects –  Website –  Code submission/review –  Testing infrastructure –  Wiki, mailing lists, news, galleries 11  
  • 12. 12  
  • 13. Applications Being Developed •  Three independent applications •  Communication handled with local sockets •  Avogadro 2 – structure editing, input generation, output viewing, and analysis •  MoleQueue – running local and remote jobs in standalone programs, management •  ChemData – Storage of data, searching, entry, annotation 13  
  • 14. Open Frameworks •  AvogadroLibs – core data structures and algorithms shared across codes •  OpenQube – a collaboration platform for quantum data ingestion and visualization •  Chemkit – file I/O, exploration and chemoinformatics analysis •  VTK – specialized chemistry visualization/ data structures, use of above 14  
  • 15. Project Diagram: Libraries/Apps Core,  command  line   GUI/Visualiza:on   HPC   OpenQube   Avogadro   AvogadroLibs   VTK   MoleQueue   Chemkit   ChemData   15  
  • 16. Typical Workflow Log File Input File Edit/Analyze   Results   Data   Job  Submission   Local Calcula:on   Remote 16  
  • 17. Proposed Workflow Log File Input File Edit/Analyze   Results   Data   Job  Submission   Local Calcula:on   Remote 17  
  • 18. Optimal Workflow Log File Input File Avogadro   Results   ChemData   Job  Submission   MoleQueue   Local Remote Calcula:on   18  
  • 19. Avogadro2 •  Project began 2006 •  Split into library & application (plugin-based) •  One of very few open source editors •  Still using Qt, C++, Eigen, OpenGL •  Uses AvogadroLibs and OpenQube for core data •  Introduces client-server dataflow/patterns •  Includes new, efficient rendering code •  More liberally licensed – from GPL to BSD 19  
  • 20. Avogadro: Visualization •  GPU-accelerated rendering •  VTK for advanced visualization •  Support for 2D and 3D data plots •  Optimized data structures –  Large data –  Streaming •  Reworked interface –  Tighter database/workflow integration 20  
  • 21. MoleQueue: Job Management •  Tighter integration with remote queues •  Integration with databases –  Retains full log of computational jobs –  Triggers actions on completion •  Plugin-based system –  Easy addition of new codes –  Easy addition of new queuing systems •  Provides a client API for applications 21  
  • 22. MoleQueue •  Supports configuration of a variety of remote clusters and queuing software
  • 23. New CML I/O •  Development of modular CML code •  Allows for multi-pass parsing of CML •  Keeps the CML closer to application •  Much faster, easier to extend and change •  Moving from simple CML to full semantic documents that can be edited •  Learned from previous work in VTK and Open Babel 23  
  • 24. File Format: CML & HDF5 •  Leverages our experience with XDMF •  CML stores semantic data –  Name, formula, atoms, bonds –  Computational code, theory, basis set •  HDF5 used to store heavy data –  Basis set, intermediate data –  Eigenvectors, SCF matrix –  Volumetric data (MOs, electron density) 24  
  • 25. Rethinking Input File Generation •  Can we create a CML representation? –  Could be loaded directly by some codes –  Could be translated to input files for others •  Would allow search on input and output •  Could be stored and published •  Makes it easier to set up calculations •  Creates a more uniform experience 25  
  • 26. Advanced Impostor Rendering •  Using a scene, vertex buffer objects, and OpenGL shading language •  Impostor techniques –  Sphere goes from 100s of triangles to 2! –  No artifacts from triangulation –  Scales to millions of spheres on modest GPU 26  
  • 28. Building Community •  Community around chemistry Build, Test projects & Package Community •  Using Kitware’s software process Review –  Ensuring quality with continuous testing –  Code contributions on the web –  Public mailing lists, bug trackers, code review •  Promoting projects and Software Repository participation –  Publications Developers –  Conferences & Users –  Workshops 28  
  • 29. Software Process •  Source code publicly hosted using Git •  Gerrit for code review •  CTest/CDash for testing/summary –  Gerrit can use CDash@Home •  Test proposed changes before merging •  CDash can now provide binaries –  Built nightly, available for direct download •  Wiki, mailing list, bug tracker 29  
  • 30. Conclusions •  Real opportunity to make an impact •  Improve research, industry and teaching •  Semantic data at the center of our work –  Storage –  Search –  Interaction with computational codes –  Comparison with experimental data •  Add support for iOS, Android and web 30  
  • 31. Acknowledgements •  Google Summer of Code for initial summer funding •  Avogadro developers: Geoffrey R. Hutchison, Donald E. Curtis, David C. Lonie, Tim Vandermeersch and many more contributors, users and supporters •  Kitware, Inc. for their unique business model & support •  The Engineer Research and Development Center’s Environmental Laboratory for recent funding •  Open-source projects, standards and services we build on: Qt, Open Babel, GLEW, CML, CACTUS Resolver, many, many more projects •  Support of many code developers including MOPAC, NWChem, Q- Chem and others •  Support from Peter Murray-Rust and the Blue Obelisk 31