SlideShare a Scribd company logo
1 of 32
Download to read offline
Practicing
Open Science


William J Schroeder, Kitware, Inc.

Brian Wylie, Sandia National Labs

Marcus Hanwell, Kitware, Inc.
Speakers & Topics

§  William Schroeder, President & CEO, Kitware, Inc.
 -  The whys and hows of Open Science


§  Dr. Marcus Hanwell, R&D Engineer, Kitware, Inc.
 -  Building an open-source research program (in Chemistry)


§  Brian Wylie, Sandia National Labs
 -  Research collaborations from a government perspective
The Scientific Method



                        •  Document	
  
                        	
  
                        •  Share	
  
                             •  Data	
  
                             •  Methodology	
  

                         •  Archive	
  

                                                  Galileo Galilei 1613
Open Science
                             Ensuring reproducibility	

§  Open Documents
 -  Hypothesis
 -  Descriptions                            REPRODUCIBILITY

 -  Results
                             Positive Evidence          Negative Evidence

§  Open Data
                               Accumulate                   Disproof
                                Support                    Hypothesis
§  Open Methodology
 -  Experimental apparatus
 -  Software                 If it isn’t reproducible, it
 -  Workflow                         isn’t science	

 -  Parameter Sets
Example: OSA Interactive Science Publishing (ISP)
§  Augmented PDF
§  Contains links to executable viewer
§  Downloads data and viewer as necessary to reproduce
    paper images (results)
Example: Insight Journal
§  Timely publishing of publications, data, and software
§  Evaluated automatically; further reviewed by community




                 PDF doc
                                      Journal Git
                                      Repository
                  Code


                   Input
    Author         Data


                  Results      Web              Build
                   Data        Site            Machines
Benefits of Open Science

§  Collaboration                                  “…much of our intelligence and creativity
                                                   results from interactions with tools and
 -  Leveraging international communities           artifacts and from collaborating with other
    and expertize                                  individuals.”
                                                                                    -- Shneiderman


§  Agile Innovation
 -  Facilitate technology mashups
 -  Move science to application faster
 -  More focus on technology; less on protection


§  Business Models
 -  Growing the pie, creating new opportunities
 -  Customization, software integration
Example: Collaboration
§  NIH National Center of Biomedical Computing NA-MIC
§  Developing the OS NA-MIC Kit; 3D Slicer application
Example: Agile Innovation (Open Source for Medical Imaging)
Creating VTK (Visualization
Toolkit)




                              Led to the creation of:
                                -  ITK
                                -  VolView                 and finally…

                                -  BioImageXD
                                -  Osirix
                                -  MedINRIA
                                -  VisTrails
                                -  NIH / NCI caBIG – XIP
                                -  VR-Renderer
                                -  IGSTK
                                -  ParaView
                                -  Etc….
Example: Business Models

§  Kitware: Building open source collaboration
    platforms
 -  The usual support and training
 -  Consulting
 -  Engaging in collaborative R&D
                                                  CMake	



 -  Providing technology integration services,
    aka creating custom solutions
                                                   CDash
The Open Technology Highway

§  Provide an open infrastructure
 -  Support research, teaching, non-profit
    and commercial activities
   -  Any (legal) activity can hang off of the highway
 -  Spur innovation, create opportunities
 -  Get from idea to product faster
   -  Do not have to replicate technology
   -  Too many toll gates (i.e., closed systems,
      unreasonable IP) slows everything down
   -  Prefer non-reciprocal licenses
Next Up

§  Marcus: Building a research program for chemistry


§  Brian: open science and research collaboration from a government perspective
Open Chemistry
Growing a Research Program Through Open
Source

Dr. Marcus Hanwell, Kitware, Inc.
Grass Roots Effort
§  Bootstrapped several efforts without funding
 -  Spare time
 -  Parts of other projects when possible


§  Formed an “unorganization” – Blue Obelisk
 -  Published first article in 2005
 -  Open data, open standards and open source
 -  Meet at ACS and other conferences when possible
 -  Follow-up article currently in press


§  Quixote collaboration more recently
 -  Provide meaningful data storage and exchange
 -  Principally targeting computational chemistry
The Early Years
§  Avogadro projected started in 2006


§  First funded work in 2007 by Marcus Hanwell
 -  Google Summer of Code student
 -  Final year of Ph.D. spent the summer coding
 -  Funded as part of KDE project – Kalzium editor


§  Built on several other open source projects
 -  Qt, Eigen, Open Babel, Blue Obelisk Data Repository


§  Also uses open standards, such as OpenGL for rendering


§  Cross platform, open source stack
Community Tools, Standards and Resources
§  Make extensive use of Qt for standard GUI elements
 -  Much more than just GUI – multithreading, web resources
 -  Avogadro chosen as an outstanding example of “Qt in Use”
 -  Marcus Hanwell recently chosen as a “Qt Ambassador”
§  OpenGL for cross platform 3D rendering
 -  Accelerated rendering of 3D molecular geometry
 -  Facilitates interacting with the scene
 -  Use of GLSL for impressive, fast rendering
§  Open Babel for chemical input/output and more
 -  There are a lot of chemical file formats…
 -  Has a lot of chemical knowledge, e.g. bond perception
§  Git for distributed version control
 -  We work across multiple sites, time zones and institutions
 -  Gerrit for code review more recently – improving code quality
Evangelizing: Getting the Message Out
§  Traditional social media used to communicate
 -  Blogs, Planets, Twitter, Identi.ca, Friendfeed, Google+


§  Talks and posters at conferences
 -  Open source conferences talking about chemistry
 -  Chemistry conferences talking about open source chemistry


§  Several meetings and workshops about open chemistry
 -  Daresbury Laboratory: Chemical Visualization and Quixote
 -  NIH National Cancer Institute – Databases and Open Chemistry


§  Publications in the traditional journals
§  Screencasts showing off what the software can do
§  In person workshops and training sessions
Bringing About Real Change
§  2011 is the ”International Year of Chemistry”
§  Chemistry has been quite closed traditionally
§  We are working hard to change this
§  Recently led a Phase I SBIR to develop “open chemistry tools”
 -  GUI acting as the center of the chemical workflow
 -  Database application using MongoDB, chemically aware
 -  Cluster integration on the desktop – submit, monitor and retrieve
§  Chemical simulation/calculation now biggest HPC user in military
§  Open tools can use both open and closed computational codes
 -  Largely written in Fortran to run on clusters
 -  NWChem recently open sourced – PNNL quantum code
 -  Already work with GAMESS, GAMESS-UK, Q-Chem, Gaussian…
§  The time is right for change in chemistry
 -  Opportunity to accelerate the rate of research
Funding Open Chemistry Tools
§  Kitware’s core business is based on “open collaboration platforms”
§  Led a Phase I Small Business Innovation Research project (US Army)
 -  Invited to apply for Phase II funding, currently pending
§  Make use of Apache and BSD licenses
 -  Allow for participation of a wider cross-section of the community
 -  Reduced licensing complications
 -  Important for industry and government collaboration
§  Successfully taken part in Google Summer of Code – funded students
 -  Student in 2007 working on Avogadro and Kalzium
 -  Mentor for KDE in 2008-2010
 -  VTK organization administrator and mentor in 2011
§  Looking to other funding agencies and collaborations in future
Developing in Niche Areas
§  The population of active researchers in chemistry is relatively small
 -  The number of those researchers who code is even smaller
 -  Of those, the number that wish to contribute to open source is tiny


§  Developing and nurturing these communities can be challenging


§  Some students develop a feature in a summer and disappear


§  Other professors might develop code over the summers


§  Have to lower the barrier to entry as much as possible


§  Often need to help with tools, build systems, etc
Enabling Technologies in Chemistry
§  Large number of computational chemistry codes
 -  Many do not have dedicated user interfaces
 -  Forming a new area enabling chemical workflows
 -  Some of the open source codes that can benefit
   -  NWChem – quantum chemistry code
   -  Quantum Espresso – plane wave code
 -  Free for use codes such as GAMESS
 -  Commercial codes such as Molpro, Q-Chem, others
 -  These codes are executed in a separate process


§  Libraries that can be used in the GUI:
 -  The Visualization Toolkit (VTK) provides advanced rendering
 -  ParaView library provides client-server technology for large data
Working With Academia, Industry and Government
§  In the past licensing has not been ideal
 -  Some form of GPL or non-commercial only license fine for most academics
 -  Industry and government need more liberal licenses in general, e.g. BSD, Apache 2




§  Can be challenging to ensure everyone gets something out of the deal
§  Avoiding the trap of dual-licensing – often kills community and shared ownership
§  Funders can find it harder to understand commercialization
§  We normally employ a services/consulting role
Government	
  Open	
  Source	
  
 	
   	
   	
   	
   	
   	
   	
  Collabora'ons	
  
Brian Wylie
Sandia National Laboratories




           Sandia	
  Na7onal	
  Laboratories	
  is	
  a	
  mul7-­‐program	
  laboratory	
  managed	
  and	
  operated	
  by	
  Sandia	
  
        Corpora7on,	
  a	
  wholly	
  owned	
  subsidiary	
  of	
  Lockheed	
  Mar7n	
  Corpora7on,	
  for	
  the	
  U.S.	
  Department	
  of	
  
              Energy’s	
  Na7onal	
  Nuclear	
  Security	
  Administra7on	
  under	
  contract	
  DE-­‐AC04-­‐94AL85000.	
  
Government Open Source
                                                                                       Resources	
  


•  GOSCON	
  Government	
  Open	
  Source	
  Conference	
  (goscon.org)	
  

•  Open	
  Source	
  Center:	
  Foreign	
  open	
  source	
  intelligence	
  data	
  (opensource.gov)	
  

•  Open	
  Source	
  SoQware	
  Ins7tute:	
  Non-­‐profit	
  corp/govt/acad	
  (oss-­‐ins7tute.org)	
  

•  Government	
  Open	
  Source	
  SoQware	
  Resource	
  Centre	
  (gossrc.org)	
  	
  

•  Center	
  for	
  Strategic	
  and	
  Interna7onal	
  Studies	
  (tracks	
  open	
  source	
  legisla7on	
  csis.org)	
  	
  
Government Open Source
                                                                           Around	
  the	
  World	
  
      180	
  
                                Open	
  Source	
  Ini'a'ves	
  by	
  Region	
  (2000-­‐2009)	
  
      160	
  
      140	
  
      120	
  
      100	
  
                                                                                                   Failed	
  
        80	
                                                                                       Proposed	
  
        60	
                                                                                       Approved	
  
        40	
  
        20	
  
          0	
  
                  Europe	
     Asia	
      La7n	
      North	
   Africa	
       Middle	
  
                                          America	
   America	
                  East	
  
             Data	
  Courtesy	
  of	
  the	
  Center	
  for	
  Strategic	
  and	
  Interna'onal	
  Studies	
  
Government Open Source
                                    Example	
  Projects	
  
     Sandia	
  	
  
   Los	
  Alamos	
  



      Kitware	
  



  University	
  of	
  
     Utah	
  
                         Open	
  source	
  data	
  analysis	
  and	
  visualiza7on	
  pla[orm	
  
Government Open Source
                    Example	
  Projects	
  

    Sandia	
  




   Kitware	
  


   Indiana	
  
  University	
  
  Stanford	
  
Government Open Source
                                            Collabora'on	
  Benefits	
  
                    No	
  specific	
  vendor	
  “lock-­‐in/out”	
  
                    Allows	
  a	
  diversified	
  development	
  team	
  
   Government	
     Known	
  code	
  base	
  (strengths	
  and	
  weaknesses)	
  
                    Typically	
  easier	
  to	
  integra7on	
  with	
  other	
  OS	
  tools	
  
                    Improvement	
  of	
  the	
  OS	
  project	
  
                    	
  
                    Money	
  
   Commercial	
     Leveraging	
  project	
  for	
  other/future	
  work	
  
                    Improvement	
  of	
  the	
  OS	
  project	
  
                    	
  
                    	
  
                    Student/Professor	
  support	
  
    Academic	
      Publishing/Sharing	
  
                    Improvement	
  of	
  the	
  OS	
  project	
  
                    	
  
                    	
  
Government Open Source
                                                  Collabora'on	
  Issues	
  
                    Need	
  to	
  relax	
  into	
  exis7ng	
  OS	
  license*	
  
   Government	
     New	
  projects	
  should	
  pick	
  a	
  liberal	
  OS	
  license	
  
                    Funding	
  source	
  may	
  hesitate	
  on	
  Open	
  Source	
  
                    Proprietary	
  projects	
  /	
  Intellectual	
  Property	
  
                    	
  
                    Government	
  bureaucracy	
  
   Commercial	
     Mixed	
  soQware	
  skill	
  set	
  
                    Deliverables	
  can	
  get	
  distorted	
  
                    *	
  No	
  gov’t	
  sell	
  back	
  clause	
  
                    	
  
                    	
  

    Academic	
      Work	
  may	
  not	
  be	
  publica7on	
  material	
  
                    If	
  you	
  do	
  publish,	
  it	
  may	
  be	
  a	
  joint	
  publica7on	
  
Government Open Source
                    Ques'ons	
  Sec'on	
  
Contact Information

§  Will Schroeder will.schroeder@kitware.com


§  Brian Wylie bnwylie@sandia.gov


§  Marcus Hanwell marcus.hanwell@kitware.com
(view included video)

More Related Content

What's hot

Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Projectvty
 
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC,  Service QA and DataverseIntegration of WORSICA’s thematic service in EOSC,  Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataversevty
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyvty
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...vty
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014aceas13tern
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...Vyacheslav Tykhonov
 
LDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationLDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationTanu Malik
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataversevty
 
PTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityPTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityTanu Malik
 
Cytoscape: Now and Future
Cytoscape: Now and FutureCytoscape: Now and Future
Cytoscape: Now and FutureKeiichiro Ono
 
WBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBenchWBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBencht_ivanov
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...University of California, San Diego
 
What's New in Cytoscape
What's New in CytoscapeWhat's New in Cytoscape
What's New in CytoscapeKeiichiro Ono
 
Ipaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanIpaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanBoris Glavic
 
Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsGeoffrey Fox
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Ricard de la Vega
 
GlobusWorld 2015
GlobusWorld 2015GlobusWorld 2015
GlobusWorld 2015Tanu Malik
 
Primers or Reminders? The Effects of Existing Review Comments on Code Review
Primers or Reminders? The Effects of Existing Review Comments on Code ReviewPrimers or Reminders? The Effects of Existing Review Comments on Code Review
Primers or Reminders? The Effects of Existing Review Comments on Code ReviewDelft University of Technology
 

What's hot (20)

Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC,  Service QA and DataverseIntegration of WORSICA’s thematic service in EOSC,  Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
 
Bioinformatics on Azure
Bioinformatics on AzureBioinformatics on Azure
Bioinformatics on Azure
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhy
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...
 
LDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationLDV: Light-weight Database Virtualization
LDV: Light-weight Database Virtualization
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataverse
 
PTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityPTU: Using Provenance for Repeatability
PTU: Using Provenance for Repeatability
 
Cytoscape: Now and Future
Cytoscape: Now and FutureCytoscape: Now and Future
Cytoscape: Now and Future
 
WBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBenchWBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBench
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
 
What's New in Cytoscape
What's New in CytoscapeWhat's New in Cytoscape
What's New in Cytoscape
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
Ipaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanIpaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, Ian
 
Big Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other thingsBig Data HPC Convergence and a bunch of other things
Big Data HPC Convergence and a bunch of other things
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
GlobusWorld 2015
GlobusWorld 2015GlobusWorld 2015
GlobusWorld 2015
 
Primers or Reminders? The Effects of Existing Review Comments on Code Review
Primers or Reminders? The Effects of Existing Review Comments on Code ReviewPrimers or Reminders? The Effects of Existing Review Comments on Code Review
Primers or Reminders? The Effects of Existing Review Comments on Code Review
 

Viewers also liked

Laying out the Principles of Open Science
Laying out the Principles of Open ScienceLaying out the Principles of Open Science
Laying out the Principles of Open ScienceKaitlin Thaney
 
The Open Science Grid
The Open Science GridThe Open Science Grid
The Open Science GridRob Gardner
 
Open Science and Data Sharing - CERF
Open Science and Data Sharing - CERFOpen Science and Data Sharing - CERF
Open Science and Data Sharing - CERFKaitlin Thaney
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Hilmar Lapp
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data ManagementLibrary_Connect
 
Columbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceColumbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceJean-Claude Bradley
 
Building Capacity for Open Science
Building Capacity for Open ScienceBuilding Capacity for Open Science
Building Capacity for Open ScienceKaitlin Thaney
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Reme Melero
 
The Future of Open Science
The Future of Open ScienceThe Future of Open Science
The Future of Open SciencePhilip Bourne
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Rene Von schomberg
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Sciencepetermurrayrust
 
Directions in Open Science
Directions in Open ScienceDirections in Open Science
Directions in Open ScienceMike Travers
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open scienceReme Melero
 
What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?Leslie Chan
 
Winning research proposals with open science
Winning research proposals with open scienceWinning research proposals with open science
Winning research proposals with open scienceIvo Grigorov
 

Viewers also liked (20)

Laying out the Principles of Open Science
Laying out the Principles of Open ScienceLaying out the Principles of Open Science
Laying out the Principles of Open Science
 
The Open Science Grid
The Open Science GridThe Open Science Grid
The Open Science Grid
 
Open Science and Data Sharing - CERF
Open Science and Data Sharing - CERFOpen Science and Data Sharing - CERF
Open Science and Data Sharing - CERF
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
Open science
Open scienceOpen science
Open science
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data Management
 
Columbia Talk on Open Notebook Science
Columbia Talk on Open Notebook ScienceColumbia Talk on Open Notebook Science
Columbia Talk on Open Notebook Science
 
Building Capacity for Open Science
Building Capacity for Open ScienceBuilding Capacity for Open Science
Building Capacity for Open Science
 
Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020 Open Science and European Access Policies in H2020
Open Science and European Access Policies in H2020
 
The Future of Open Science
The Future of Open ScienceThe Future of Open Science
The Future of Open Science
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Relationships between Open Science, Science 2.0, and Social Media
Relationships between Open Science, Science 2.0, and Social MediaRelationships between Open Science, Science 2.0, and Social Media
Relationships between Open Science, Science 2.0, and Social Media
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts';
 
Open data and Open Science
Open data and Open ScienceOpen data and Open Science
Open data and Open Science
 
Directions in Open Science
Directions in Open ScienceDirections in Open Science
Directions in Open Science
 
Introduction to open science
Introduction to open scienceIntroduction to open science
Introduction to open science
 
What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?What is Open Science and what role does it play in Development?
What is Open Science and what role does it play in Development?
 
Open science
Open scienceOpen science
Open science
 
Open Science: What, why, how?
Open Science: What, why, how? Open Science: What, why, how?
Open Science: What, why, how?
 
Winning research proposals with open science
Winning research proposals with open scienceWinning research proposals with open science
Winning research proposals with open science
 

Similar to Oscon 2011 Practicing Open Science

Oscon 2011 chemistry_hanwell
Oscon 2011 chemistry_hanwellOscon 2011 chemistry_hanwell
Oscon 2011 chemistry_hanwellwill-schroeder
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012Lee Dirks
 
Simbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational ResearchSimbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational Researchjpk
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC smarru
 
SGCI at Earth Science Information Partners meeting
SGCI at Earth Science Information Partners meetingSGCI at Earth Science Information Partners meeting
SGCI at Earth Science Information Partners meetingNancy Wilkins-Diehr
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchYehia El-khatib
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Blue BRIDGE
 
Open Source Movement
Open Source MovementOpen Source Movement
Open Source MovementMesut Yılmaz
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011Lee Dirks
 
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...Chris Dagdigian
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...Ig Bittencourt
 
Usability, Reusability and Reproducibility of Bioinformatic Applications
 Usability, Reusability and Reproducibility of Bioinformatic Applications  Usability, Reusability and Reproducibility of Bioinformatic Applications
Usability, Reusability and Reproducibility of Bioinformatic Applications Sandra Gesing
 
Open Source Scientific Software
Open Source Scientific SoftwareOpen Source Scientific Software
Open Source Scientific SoftwareGael Varoquaux
 

Similar to Oscon 2011 Practicing Open Science (20)

Oscon 2011 schroeder
Oscon 2011 schroederOscon 2011 schroeder
Oscon 2011 schroeder
 
Oscon 2011 chemistry_hanwell
Oscon 2011 chemistry_hanwellOscon 2011 chemistry_hanwell
Oscon 2011 chemistry_hanwell
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Sgci sc18-11-14-18
Sgci sc18-11-14-18Sgci sc18-11-14-18
Sgci sc18-11-14-18
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Simbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational ResearchSimbios - Open Science in Biocomputational Research
Simbios - Open Science in Biocomputational Research
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC
 
SGCI at Earth Science Information Partners meeting
SGCI at Earth Science Information Partners meetingSGCI at Earth Science Information Partners meeting
SGCI at Earth Science Information Partners meeting
 
Open Science
Open ScienceOpen Science
Open Science
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
Open Source Movement
Open Source MovementOpen Source Movement
Open Source Movement
 
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011Lynch & Dirks  - Platforms for Open Research - Charleston Conference 2011
Lynch & Dirks - Platforms for Open Research - Charleston Conference 2011
 
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
 
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...From Research to Innovation: Linked Open Data and Gamification to Design Inte...
From Research to Innovation: Linked Open Data and Gamification to Design Inte...
 
Usability, Reusability and Reproducibility of Bioinformatic Applications
 Usability, Reusability and Reproducibility of Bioinformatic Applications  Usability, Reusability and Reproducibility of Bioinformatic Applications
Usability, Reusability and Reproducibility of Bioinformatic Applications
 
Open Source Scientific Software
Open Source Scientific SoftwareOpen Source Scientific Software
Open Source Scientific Software
 

Recently uploaded

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Recently uploaded (20)

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Oscon 2011 Practicing Open Science

  • 1. Practicing Open Science William J Schroeder, Kitware, Inc. Brian Wylie, Sandia National Labs Marcus Hanwell, Kitware, Inc.
  • 2. Speakers & Topics §  William Schroeder, President & CEO, Kitware, Inc. -  The whys and hows of Open Science §  Dr. Marcus Hanwell, R&D Engineer, Kitware, Inc. -  Building an open-source research program (in Chemistry) §  Brian Wylie, Sandia National Labs -  Research collaborations from a government perspective
  • 3. The Scientific Method •  Document     •  Share   •  Data   •  Methodology   •  Archive   Galileo Galilei 1613
  • 4. Open Science Ensuring reproducibility §  Open Documents -  Hypothesis -  Descriptions REPRODUCIBILITY -  Results Positive Evidence Negative Evidence §  Open Data Accumulate Disproof Support Hypothesis §  Open Methodology -  Experimental apparatus -  Software If it isn’t reproducible, it -  Workflow isn’t science -  Parameter Sets
  • 5. Example: OSA Interactive Science Publishing (ISP) §  Augmented PDF §  Contains links to executable viewer §  Downloads data and viewer as necessary to reproduce paper images (results)
  • 6. Example: Insight Journal §  Timely publishing of publications, data, and software §  Evaluated automatically; further reviewed by community PDF doc Journal Git Repository Code Input Author Data Results Web Build Data Site Machines
  • 7. Benefits of Open Science §  Collaboration “…much of our intelligence and creativity results from interactions with tools and -  Leveraging international communities artifacts and from collaborating with other and expertize individuals.” -- Shneiderman §  Agile Innovation -  Facilitate technology mashups -  Move science to application faster -  More focus on technology; less on protection §  Business Models -  Growing the pie, creating new opportunities -  Customization, software integration
  • 8. Example: Collaboration §  NIH National Center of Biomedical Computing NA-MIC §  Developing the OS NA-MIC Kit; 3D Slicer application
  • 9. Example: Agile Innovation (Open Source for Medical Imaging) Creating VTK (Visualization Toolkit) Led to the creation of: -  ITK -  VolView and finally… -  BioImageXD -  Osirix -  MedINRIA -  VisTrails -  NIH / NCI caBIG – XIP -  VR-Renderer -  IGSTK -  ParaView -  Etc….
  • 10. Example: Business Models §  Kitware: Building open source collaboration platforms -  The usual support and training -  Consulting -  Engaging in collaborative R&D CMake -  Providing technology integration services, aka creating custom solutions CDash
  • 11. The Open Technology Highway §  Provide an open infrastructure -  Support research, teaching, non-profit and commercial activities -  Any (legal) activity can hang off of the highway -  Spur innovation, create opportunities -  Get from idea to product faster -  Do not have to replicate technology -  Too many toll gates (i.e., closed systems, unreasonable IP) slows everything down -  Prefer non-reciprocal licenses
  • 12. Next Up §  Marcus: Building a research program for chemistry §  Brian: open science and research collaboration from a government perspective
  • 13. Open Chemistry Growing a Research Program Through Open Source Dr. Marcus Hanwell, Kitware, Inc.
  • 14. Grass Roots Effort §  Bootstrapped several efforts without funding -  Spare time -  Parts of other projects when possible §  Formed an “unorganization” – Blue Obelisk -  Published first article in 2005 -  Open data, open standards and open source -  Meet at ACS and other conferences when possible -  Follow-up article currently in press §  Quixote collaboration more recently -  Provide meaningful data storage and exchange -  Principally targeting computational chemistry
  • 15. The Early Years §  Avogadro projected started in 2006 §  First funded work in 2007 by Marcus Hanwell -  Google Summer of Code student -  Final year of Ph.D. spent the summer coding -  Funded as part of KDE project – Kalzium editor §  Built on several other open source projects -  Qt, Eigen, Open Babel, Blue Obelisk Data Repository §  Also uses open standards, such as OpenGL for rendering §  Cross platform, open source stack
  • 16. Community Tools, Standards and Resources §  Make extensive use of Qt for standard GUI elements -  Much more than just GUI – multithreading, web resources -  Avogadro chosen as an outstanding example of “Qt in Use” -  Marcus Hanwell recently chosen as a “Qt Ambassador” §  OpenGL for cross platform 3D rendering -  Accelerated rendering of 3D molecular geometry -  Facilitates interacting with the scene -  Use of GLSL for impressive, fast rendering §  Open Babel for chemical input/output and more -  There are a lot of chemical file formats… -  Has a lot of chemical knowledge, e.g. bond perception §  Git for distributed version control -  We work across multiple sites, time zones and institutions -  Gerrit for code review more recently – improving code quality
  • 17. Evangelizing: Getting the Message Out §  Traditional social media used to communicate -  Blogs, Planets, Twitter, Identi.ca, Friendfeed, Google+ §  Talks and posters at conferences -  Open source conferences talking about chemistry -  Chemistry conferences talking about open source chemistry §  Several meetings and workshops about open chemistry -  Daresbury Laboratory: Chemical Visualization and Quixote -  NIH National Cancer Institute – Databases and Open Chemistry §  Publications in the traditional journals §  Screencasts showing off what the software can do §  In person workshops and training sessions
  • 18. Bringing About Real Change §  2011 is the ”International Year of Chemistry” §  Chemistry has been quite closed traditionally §  We are working hard to change this §  Recently led a Phase I SBIR to develop “open chemistry tools” -  GUI acting as the center of the chemical workflow -  Database application using MongoDB, chemically aware -  Cluster integration on the desktop – submit, monitor and retrieve §  Chemical simulation/calculation now biggest HPC user in military §  Open tools can use both open and closed computational codes -  Largely written in Fortran to run on clusters -  NWChem recently open sourced – PNNL quantum code -  Already work with GAMESS, GAMESS-UK, Q-Chem, Gaussian… §  The time is right for change in chemistry -  Opportunity to accelerate the rate of research
  • 19. Funding Open Chemistry Tools §  Kitware’s core business is based on “open collaboration platforms” §  Led a Phase I Small Business Innovation Research project (US Army) -  Invited to apply for Phase II funding, currently pending §  Make use of Apache and BSD licenses -  Allow for participation of a wider cross-section of the community -  Reduced licensing complications -  Important for industry and government collaboration §  Successfully taken part in Google Summer of Code – funded students -  Student in 2007 working on Avogadro and Kalzium -  Mentor for KDE in 2008-2010 -  VTK organization administrator and mentor in 2011 §  Looking to other funding agencies and collaborations in future
  • 20. Developing in Niche Areas §  The population of active researchers in chemistry is relatively small -  The number of those researchers who code is even smaller -  Of those, the number that wish to contribute to open source is tiny §  Developing and nurturing these communities can be challenging §  Some students develop a feature in a summer and disappear §  Other professors might develop code over the summers §  Have to lower the barrier to entry as much as possible §  Often need to help with tools, build systems, etc
  • 21. Enabling Technologies in Chemistry §  Large number of computational chemistry codes -  Many do not have dedicated user interfaces -  Forming a new area enabling chemical workflows -  Some of the open source codes that can benefit -  NWChem – quantum chemistry code -  Quantum Espresso – plane wave code -  Free for use codes such as GAMESS -  Commercial codes such as Molpro, Q-Chem, others -  These codes are executed in a separate process §  Libraries that can be used in the GUI: -  The Visualization Toolkit (VTK) provides advanced rendering -  ParaView library provides client-server technology for large data
  • 22. Working With Academia, Industry and Government §  In the past licensing has not been ideal -  Some form of GPL or non-commercial only license fine for most academics -  Industry and government need more liberal licenses in general, e.g. BSD, Apache 2 §  Can be challenging to ensure everyone gets something out of the deal §  Avoiding the trap of dual-licensing – often kills community and shared ownership §  Funders can find it harder to understand commercialization §  We normally employ a services/consulting role
  • 23. Government  Open  Source                Collabora'ons   Brian Wylie Sandia National Laboratories Sandia  Na7onal  Laboratories  is  a  mul7-­‐program  laboratory  managed  and  operated  by  Sandia   Corpora7on,  a  wholly  owned  subsidiary  of  Lockheed  Mar7n  Corpora7on,  for  the  U.S.  Department  of   Energy’s  Na7onal  Nuclear  Security  Administra7on  under  contract  DE-­‐AC04-­‐94AL85000.  
  • 24. Government Open Source Resources   •  GOSCON  Government  Open  Source  Conference  (goscon.org)   •  Open  Source  Center:  Foreign  open  source  intelligence  data  (opensource.gov)   •  Open  Source  SoQware  Ins7tute:  Non-­‐profit  corp/govt/acad  (oss-­‐ins7tute.org)   •  Government  Open  Source  SoQware  Resource  Centre  (gossrc.org)     •  Center  for  Strategic  and  Interna7onal  Studies  (tracks  open  source  legisla7on  csis.org)    
  • 25. Government Open Source Around  the  World   180   Open  Source  Ini'a'ves  by  Region  (2000-­‐2009)   160   140   120   100   Failed   80   Proposed   60   Approved   40   20   0   Europe   Asia   La7n   North   Africa   Middle   America   America   East   Data  Courtesy  of  the  Center  for  Strategic  and  Interna'onal  Studies  
  • 26. Government Open Source Example  Projects   Sandia     Los  Alamos   Kitware   University  of   Utah   Open  source  data  analysis  and  visualiza7on  pla[orm  
  • 27. Government Open Source Example  Projects   Sandia   Kitware   Indiana   University   Stanford  
  • 28. Government Open Source Collabora'on  Benefits   No  specific  vendor  “lock-­‐in/out”   Allows  a  diversified  development  team   Government   Known  code  base  (strengths  and  weaknesses)   Typically  easier  to  integra7on  with  other  OS  tools   Improvement  of  the  OS  project     Money   Commercial   Leveraging  project  for  other/future  work   Improvement  of  the  OS  project       Student/Professor  support   Academic   Publishing/Sharing   Improvement  of  the  OS  project      
  • 29. Government Open Source Collabora'on  Issues   Need  to  relax  into  exis7ng  OS  license*   Government   New  projects  should  pick  a  liberal  OS  license   Funding  source  may  hesitate  on  Open  Source   Proprietary  projects  /  Intellectual  Property     Government  bureaucracy   Commercial   Mixed  soQware  skill  set   Deliverables  can  get  distorted   *  No  gov’t  sell  back  clause       Academic   Work  may  not  be  publica7on  material   If  you  do  publish,  it  may  be  a  joint  publica7on  
  • 30. Government Open Source Ques'ons  Sec'on  
  • 31. Contact Information §  Will Schroeder will.schroeder@kitware.com §  Brian Wylie bnwylie@sandia.gov §  Marcus Hanwell marcus.hanwell@kitware.com