1
Digital Science
Reproducibility and Visibility in Astronomy
José Enrique Ruiz on behalf of the Wf4Ever Team
SCIOPS 2013
ESAC, FRIDAY 13th SEPTEMBER 2013
2
Wf4Ever
Digital Science - Reproducibility and Visibility in Astronomy
1.  Intelligent Software Components (ISOCO, Spain)
2.  University of Manchester (UNIMAN, UK)
3.  Universidad Politécnica de Madrid (UPM, Spain)
4.  Poznan Supercomputing and Networking Centre (Poland)
5.  University of Oxford and OeRC (OXF, UK)
6.  Instituto Astrofísica Andalucía (IAA-CSIC, Spain)
7.  Leiden University Medical Centre (LUMC, NL)
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science
2011 - 2013
3
Astronomy Research Lifecycle
The astronomy research lifecycle is entirely digital
»  Observation proposals
»  Data reduction pipelines
»  Analysis of science-ready data
»  Catalogs of objects and data archives
»  Publication process
›  Final data results
›  Experiment in DL
ADS/arXiv
Reproducible research is still not possible in a digital world
A rich infrastructure of data is not efficiently used
A normalized preservation of methodology is needed
Tools
4
Reproducibility and The Scientific Method
http://xkcd.com/242/
Benefits
»  Publishing knowledge, not advertising
»  The author, the referee, the re-user
»  Reputation, prestige and respect
»  Higher quality of publications
›  Authors will be more careful
›  Many eyes to check results
Challenges
»  Hard and time consuming
»  Need incentives – not rewarded now
5
Reproducibility and The Scientific Method
I don’t know how!
6
Visibility, Efficiency and Reuse
Optimize return on investments made on big facilities
»  Avoid duplication of efforts and reinvention
»  How to discover and not duplicate?
»  How to re-use and not duplicate?
»  How to make use of best practices?
»  How to use the rich infrastructure of data?
»  Intellectual contributions are encoded in software
More data in archives does not imply more knowledge
»  Expose complete scientific record, not the story
»  Allow easy discovery of methods and tools
7
Visibility and Social Discovery
Paper discovery: the social dimension
8
The Executable Paper
Time has come to go beyond the PDF
9
Digital Astronomy in the Local Desktop
Going beyond automation
Organization!
10
Digital Astronomy in the Local Desktop
Workflows to Access and Massage VO Data
# CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag
1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383
2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157
3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457
4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685
5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128
72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735
Capture
Actions, Tasks, Dependencies, Provenance
Improve
Clarity and Reproducibility
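One way to picture what capturing actions, tasks, dependencies and provenance could mean on the local desktop is the minimal Python sketch below. It is not Taverna or any Wf4Ever tool. The task names (fetch_table, select_nearby, mean_bmag), the 95.0 cut and the layout of the provenance records are hypothetical; the three sample rows borrow the CIG, Dist and Bmag values of the table above, assuming the header columns line up with the first values of each row.

```python
import hashlib
import json
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def fetch_table():
    # Hypothetical stand-in for a VO query: rows of (CIG, Dist, Bmag).
    return [(1, 96.9, 14.167), (2, 94.7, 15.722), (4, 31.9, 12.818)]


def select_nearby(table, max_dist=95.0):
    # Keep only the rows whose Dist value is below max_dist.
    return [row for row in table if row[1] < max_dist]


def mean_bmag(table):
    # Average Bmag of the selected rows.
    return sum(row[2] for row in table) / len(table)


# Each task maps to (function, names of the tasks whose outputs it consumes).
TASKS = {
    "fetch_table": (fetch_table, []),
    "select_nearby": (select_nearby, ["fetch_table"]),
    "mean_bmag": (mean_bmag, ["select_nearby"]),
}


def digest(value):
    # Short fingerprint of a result, so a later re-run can be compared with this one.
    return hashlib.sha256(json.dumps(value).encode()).hexdigest()[:12]


def run_workflow(tasks):
    """Run the tasks in dependency order and record what ran on which inputs."""
    graph = {name: set(deps) for name, (_, deps) in tasks.items()}
    results, provenance = {}, []
    for name in TopologicalSorter(graph).static_order():
        func, deps = tasks[name]
        results[name] = func(*(results[d] for d in deps))
        provenance.append(
            {"task": name, "inputs": list(deps), "output_digest": digest(results[name])}
        )
    return results, provenance


results, provenance = run_workflow(TASKS)
for record in provenance:
    print(record)
```

The provenance list is the part that a plain script would not leave behind: it states which task produced which result from which inputs, which is the information someone needs in order to re-run or audit the analysis.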
11
Scientific Workflows
Living Tutorials
Templates for Re-use
Expedite Training
Reduce time to insight
Avoid reinvention
Digital Libraries of workflows may boost the use of the existing infrastructure of data (VO)
12
Scientific Workflows
Software
›  Taverna
›  Kepler
›  Pegasus
›  Triana
›  ESO Reflex
Related Initiatives
›  ER-Flow
›  VAMDC
›  HELIO
›  Cyber-SKA
›  IceCore
›  Montage
›  Astro-WISE
›  AstroGrid
IVOA
›  AstroGrid
›  Grid&WS WG
›  VO France Wf WG
Self descriptive WS
›  PDL
›  SimDAL, S3
13
Astronomical Research Objects in Action
AstroTaverna: Create, annotate and run a workflow
http://amiga.iaa.es/p/290-astrotaverna.htm
15
The next generation of archives
Prof. Kevin Vinsen
ASKAP Datacubes
16
The next generation of archives
Prof. Kevin Vinsen
SKA Datacubes
17
The next generation of archives
Much wider FoV and spectral coverage
»  Large volumes for a single observed dataset
Automated surveys
»  Huge amounts of tabular data
We are moving into a world where
»  computing and storage are cheap
»  data movement is death
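The "data movement is death" point can be made concrete with a rough back-of-the-envelope estimate. The sketch below computes how long it takes to move a dataset over a network link. The dataset size and link speed are purely illustrative assumptions, not figures from the talk or from any facility.

```python
def transfer_time_hours(size_terabytes, link_gigabits_per_s, efficiency=1.0):
    """Hours needed to move size_terabytes over a link of the given nominal speed.

    efficiency is the fraction of the nominal bandwidth actually achieved
    (1.0 is an idealised upper bound).
    """
    size_bits = size_terabytes * 1e12 * 8  # decimal terabytes -> bits
    rate_bits_per_s = link_gigabits_per_s * 1e9 * efficiency
    return size_bits / rate_bits_per_s / 3600.0


# Hypothetical example: a 10 TB datacube over a 1 Gbit/s link.
hours = transfer_time_hours(10, 1)
print(f"{hours:.1f} hours")  # prints "22.2 hours"
```

Even under the idealised assumption of a fully used link, a single dataset of this hypothetical size takes most of a day to move, whereas the code that analyses it is typically a few kilobytes.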
Extraction of scientifically relevant information from a multidimensional parameter space
»  Exploration services
»  Anomaly detection
»  Cross-matching data
»  Dimensionality reduction
Detailed inspection and subsetting
»  Filtering
»  Extraction
»  Re-Projection
»  Analysis services
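As an illustration of one of the services listed above, positional cross-matching pairs each source in one catalog with the nearest source in another, provided the two lie within a tolerance on the sky. The sketch below is a brute-force version in plain Python. The catalog entries, the source names and the 2 arcsecond radius are made up for the example, and a real service would use a spatial index rather than compare every pair.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(a))))


def cross_match(cat_a, cat_b, radius_arcsec=2.0):
    """Return (name_a, name_b) pairs: the nearest cat_b source within the radius."""
    matches = []
    for name_a, ra_a, dec_a in cat_a:
        best_name, best_sep = None, radius_arcsec / 3600.0
        for name_b, ra_b, dec_b in cat_b:
            sep = angular_separation_deg(ra_a, dec_a, ra_b, dec_b)
            if sep <= best_sep:
                best_name, best_sep = name_b, sep
        if best_name is not None:
            matches.append((name_a, best_name))
    return matches


# Hypothetical catalogs: (name, RA in degrees, Dec in degrees).
optical = [("opt-1", 10.0000, 20.0000), ("opt-2", 150.0000, -5.0000)]
radio = [("rad-1", 10.0003, 20.0001), ("rad-2", 200.0000, 45.0000)]

print(cross_match(optical, radio))
```

Run next to the data, a task like this returns only the list of matched pairs, which is far smaller than either input catalog.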
18
The next generation of archives
Archives should evolve from data providers into
»  Virtual Data providers
»  Software Tasks providers
Astronomy of multiple archives, facilities and wavelengths
Interconnected and interoperable archives
»  Data -> Virtual Observatory
»  Software Tasks
The move-computing-to-data paradigm
»  A cloud of Web Services
»  Archives speaking Web Services
Preservation
Processes should benefit from the same privileges acquired by data
Preserving the method ensures replication of final results at any moment
19
Research Objects
Distributed
Technical Objects Social Objects
Expose experimental context in a structured way in order to be understood
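A minimal sketch of what exposing experimental context in a structured way might look like: a small manifest that aggregates the resources of an experiment and attaches annotations to them. The class names, field names, file names and JSON layout below are hypothetical and do not reproduce the Wf4Ever Research Object model.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Resource:
    path: str  # where the resource lives inside the bundle
    role: str  # e.g. "workflow", "input-data", "result", "document"


@dataclass
class ResearchObject:
    title: str
    creator: str
    resources: list = field(default_factory=list)
    annotations: list = field(default_factory=list)

    def aggregate(self, path, role):
        # Add a resource to the aggregation.
        self.resources.append(Resource(path, role))

    def annotate(self, path, text):
        # Attach a free-text note to an already aggregated resource.
        if path not in {r.path for r in self.resources}:
            raise KeyError(f"{path} is not part of this research object")
        self.annotations.append({"about": path, "text": text})

    def to_json(self):
        # Serialise the whole bundle description, nested resources included.
        return json.dumps(asdict(self), indent=2)


# Hypothetical example bundle.
ro = ResearchObject(title="Example experiment", creator="A. Researcher")
ro.aggregate("workflow/crossmatch.t2flow", "workflow")
ro.aggregate("data/input_catalog.vot", "input-data")
ro.annotate("data/input_catalog.vot", "Table retrieved from a VO service")
print(ro.to_json())
```

The point of the structure is that a reader, or a program, can tell which file is the method, which is the input and what was said about each, without having to reconstruct that from a paper.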
20
Research Objects
IPython Notebook solutions
»  Web-browser as the working desktop
»  Python code, plots and data, living with rich-text documentation
»  Cloud-based adaptive scalable computing environment
»  Fully shareable, re-usable and executable wikis
»  Social platform and Git versioning
21
Research Objects
ADSLabs
ADO Linked Components
»  Authors
»  Publications
»  Journals
»  Objects (SIMBAD)
»  Tabular data behind the plots (CDS)
»  ASCL reference of used software
»  Observing time proposals
»  Used facilities, surveys or missions
http://labs.adsabs.harvard.edu/
Incentives
Similar initiative to ESO telbib
22
Research Objects
The Incentive
Papers with data links are cited more than those without
Effect of E-printing on Citation Rates in Astronomy and Physics
2006. Edwin A. Henneken et al.
24
Conclusions
»  Reproducibility is at the very heart of the scientific method
»  Improving visibility is key in order to avoid reinvention
»  Social dimension of science stressed in the discovery process
»  Highly specialized science needs re-use to achieve efficiency
»  In a digital world, publish decomposable executable papers
»  Capture provenance and structure in the local desktop
»  Scientific workflows go beyond automation: provide clarity and structure
»  Transfer rate is more than an issue for the next generation of archives
»  The move-computing-to-data paradigm -> back to old terminals
»  Processes should benefit from the same privileges acquired by data
»  Digital libraries of web-services-based workflows
»  The distributed digital workflow-centric Research Object
»  Preserving knowledge - not only data or advertising
jer@iaa.es

More Related Content

What's hot

Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLourdes Verdes-Montenegro
 
A Biological Internet?: Eywa
A Biological Internet?: EywaA Biological Internet?: Eywa
A Biological Internet?: EywaEugene Siow
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilitiesIan Foster
 
Introduction NL-HUG (April)
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)Evert Lammerts
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationIan Foster
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformMario Juric
 
Big Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeBig Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeLiana Ye
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudAmazon Web Services
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceUniversity of Washington
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterIan Foster
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!Ian Foster
 
Big data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at KitwareBig data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at Kitwarebigdataviz_bay
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualizationbigdataviz_bay
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Ian Foster
 
NERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardNERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardPacificResearchPlatform
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science ServicesIan Foster
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaUniversity of Washington
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 

What's hot (20)

Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 version
 
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
 
A Biological Internet?: Eywa
A Biological Internet?: EywaA Biological Internet?: Eywa
A Biological Internet?: Eywa
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilities
 
Introduction NL-HUG (April)
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science Platform
 
Big Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeBig Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No Code
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the Cloud
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScience
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Big data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at KitwareBig data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at Kitware
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualization
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013
 
NERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardNERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie Bard
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science Services
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and Myria
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 

Similar to Digital Science: Reproducibility and Visibility in Astronomy

Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Andrea Scharnhorst
 
Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview   Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview Jennifer D'Souza
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDavid De Roure
 
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDigital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDavid De Roure
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingAndrea Scharnhorst
 
Rare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesRare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesAndrea Scharnhorst
 
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkDatabricks
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402vrij
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Kerstin Lehnert
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentationdri_ireland
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and researchkchine3
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 

Similar to Digital Science: Reproducibility and Visibility in Astronomy (20)

Digital Science
Digital ScienceDigital Science
Digital Science
 
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
 
Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview   Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social Machines
 
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDigital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research funding
 
Rare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesRare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studies
 
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache Spark
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 
If only I had a map!
If only I had a map!If only I had a map!
If only I had a map!
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 
Knowledge Graphs for Scholarly Communication
Knowledge Graphs for Scholarly CommunicationKnowledge Graphs for Scholarly Communication
Knowledge Graphs for Scholarly Communication
 
Christine borgman keynote
Christine borgman keynoteChristine borgman keynote
Christine borgman keynote
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentation
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and research
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 

More from Jose Enrique Ruiz

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroidsJose Enrique Ruiz
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationJose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesJose Enrique Ruiz
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflowsJose Enrique Ruiz
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataJose Enrique Ruiz
 
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsJose Enrique Ruiz
 
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital ExperimentsJose Enrique Ruiz
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCAJose Enrique Ruiz
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VOJose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropJose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 

More from Jose Enrique Ruiz (14)

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow Preservation
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
 
Workflow Preservation
Workflow PreservationWorkflow Preservation
Workflow Preservation
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D data
 
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital Experiments
 
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital Experiments
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 

Recently uploaded

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Install Stable Diffusion in windows machine


Digital Science: Reproducibility and Visibility in Astronomy

  • 1. Digital Science: Reproducibility and Visibility in Astronomy. José Enrique Ruiz on behalf of the Wf4Ever Team. SCIOPS 2013, ESAC, Friday 13th September 2013
  • 2. Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science, 2011 - 2013
      Digital Science - Reproducibility and Visibility in Astronomy
      1.  Intelligent Software Components (ISOCO, Spain)
      2.  University of Manchester (UNIMAN, UK)
      3.  Universidad Politécnica de Madrid (UPM, Spain)
      4.  Poznan Supercomputing and Networking Centre (Poland)
      5.  University of Oxford and OeRC (OXF, UK)
      6.  Instituto Astrofísica Andalucía (IAA-CSIC, Spain)
      7.  Leiden University Medical Centre (LUMC, NL)
  • 3. Astronomy Research Lifecycle
      Astronomy research lifecycle is entirely digital
      »  Observation proposals
      »  Data reduction pipelines
      »  Analysis of science ready data
      »  Catalogs of objects and data archives
      »  Publish process
         ›  Final data results
         ›  Experiment in DL
      (diagram labels: ADS/arXiv, Tools)
      Reproducible research is still not possible in a digital world
      A rich infrastructure of data is not efficiently used
      A normalized preservation of methodology is needed
  • 4. Reproducibility and The Scientific Method
      http://xkcd.com/242/
      Benefits
      »  Publishing knowledge, not advertising
      »  The author, the referee, the re-user
      »  Reputation, prestige and respect
      »  Higher quality of publications
         ›  Authors will be more careful
         ›  Many eyes to check results
      Challenges
      »  Hard and time consuming
      »  Need incentives – not rewarded now
  • 5. Reproducibility and The Scientific Method: I don’t know how!
  • 6. Visibility, Efficiency and Reuse
      Optimize return on investments made on big facilities
      »  Avoid duplication of efforts and reinvention
      »  How to discover and not duplicate?
      »  How to re-use and not duplicate?
      »  How to make use of best practices?
      »  How to use the rich infrastructure of data?
      »  Intellectual contributions are encoded in software
      More data in archives does not imply more knowledge
      »  Expose complete scientific record, not the story
      »  Allow easy discovery of methods and tools
  • 7. Visibility and Social Discovery. Paper discovery: the social dimension
  • 8. The Executable Paper. Time has come to go beyond the PDF
  • 9. Digital Astronomy in the Local Desktop. Going beyond automation: Organization!
  • 10. Digital Astronomy in the Local Desktop
      Workflows to Access and Massage VO Data
      # CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag
      1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383
      2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157
      3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457
      4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685
      5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128
      72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735
      Capture: Actions, Tasks, Dependencies, Provenance
      Improve: Clarity and Reproducibility
  • 11. Scientific Workflows
      »  Living Tutorials
      »  Templates for Re-use
      »  Expedite Training
      »  Reduce time to insight
      »  Avoid reinvention
      Digital Libraries of workflows may boost the use of the existing infrastructure of data (VO)
  • 12. Scientific Workflows
      Software
         ›  Taverna
         ›  Kepler
         ›  Pegasus
         ›  Triana
         ›  ESO Reflex
      Related Initiatives
         ›  ER-Flow
         ›  VAMDC
         ›  HELIO
         ›  Cyber-SKA
         ›  IceCore
         ›  Montage
         ›  Astro-WISE
         ›  AstroGrid
      IVOA
         ›  AstroGrid
         ›  Grid&WS WG
         ›  VO France Wf WG
      Self descriptive WS
         ›  PDL
         ›  SimDAL, S3
  • 13. Astronomical Research Objects in Action. AstroTaverna: Create, annotate and run a workflow. http://amiga.iaa.es/p/290-astrotaverna.htm
  • 14. Astronomical Research Objects in Action. AstroTaverna: Create, annotate and run a workflow. http://amiga.iaa.es/p/290-astrotaverna.htm
  • 15. The next generation of archives. Prof. Kevin Vinsen: ASKAP Datacubes
  • 16. The next generation of archives. Prof. Kevin Vinsen: SKA Datacubes
  • 17. The next generation of archives
      Much wider FoV and spectral coverage
      »  Large volumes for a single observed dataset
      Automated surveys
      »  Huge amounts of tabular data
      We are moving into a world where
      »  computing and storage are cheap
      »  data movement is death
      Extraction of scientifically relevant info from a multi-dimensional parameter space
      »  Exploration services
      »  Anomaly detection
      »  Cross-matching data
      »  Dimensionality reduction
      Detailed inspection and subset
      »  Filtering
      »  Extraction
      »  Re-Projection
      »  Analysis services
  • 18. The next generation of archives
      The move-computing-to-data paradigm
      »  A cloud of Web Services
      »  Archives speaking Web Services
      Preservation
      »  Process should benefit from the same privileges acquired by data
      »  Preserving the method ensures replication of final results at any moment
      Archives should evolve from data providers into
      »  Virtual Data providers
      »  Software Tasks providers
      Astronomy of multi archives/facilities/wavelength
      Interconnected and interoperable archives
      »  Data -> Virtual Observatory
      »  Software Tasks
  • 19. Research Objects
      Distributed Technical Objects
      Social Objects
      Expose experimental context in a structured way in order to be understood
  • 20. Research Objects
      IPython Notebook solutions
      »  Web-browser as the working desktop
      »  Python code, plots and data, living with rich-text documentation
      »  Cloud-based adaptive scalable computing environment
      »  Fully shareable, re-usable and executable wikis
      »  Social platform and Git versioning
  • 21. Research Objects
      ADSLabs ADO
      Linked Components
      »  Authors
      »  Publications
      »  Journals
      »  Objects (SIMBAD)
      »  Tabular data behind the plots (CDS)
      »  ASCL reference of used software
      »  Observing time Proposals
      »  Used facilities, surveys or missions
      http://labs.adsabs.harvard.edu/
      Incentives
      Similar Initiative to ESO Telbib!
  • 22. Research Objects. The Incentive: Papers with data links are cited more than those without. Effect of E-printing on Citation Rates in Astronomy and Physics, 2006. Edwin A. Henneken et al.
  • 23. Research Objects. The Incentive: Papers with data links are cited more than those without. Effect of E-printing on Citation Rates in Astronomy and Physics, 2006. Edwin A. Henneken et al.
  • 24. Conclusions
      »  Reproducibility is at the very heart of the scientific method
      »  Improving visibility is key in order to avoid reinvention
      »  Social dimension of science stressed in the discovery process
      »  Highly specialized science needs re-use to achieve efficiency
      »  In a digital world, publish decomposable executable papers
      »  Capture provenance and structure in the local desktop
      »  Scientific workflows go beyond automation: provide clarity and structure
      »  Transfer rate is more than an issue for next generation of archives
      »  The move-computing-to-data paradigm -> back to old terminals
      »  Process should benefit from the same privileges acquired by data
      »  Digital libraries of web-services-based workflows
      »  The distributed digital workflow-centric Research Object
      »  Preserving knowledge - not only data or advertising
      jer@iaa.es
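
The slides ask for workflows that capture actions, tasks, dependencies and provenance on the local desktop. As a rough illustration only, that idea could be sketched in plain Python as below. This is not AstroTaverna or any Wf4Ever tool: `run_workflow`, `fingerprint`, the two task functions and the four sample rows are hypothetical names and values chosen for the example, and the provenance record format is an assumption.

```python
import hashlib
import json
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def fingerprint(value):
    """Return a short, stable SHA-256 digest of a JSON-serializable value."""
    payload = json.dumps(value, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


def run_workflow(tasks, inputs):
    """Run tasks in dependency order and record one provenance entry per task.

    tasks:  maps a task name to (function, list of names it depends on).
    inputs: maps input names to their initial values.
    """
    graph = {name: set(deps) for name, (_, deps) in tasks.items()}
    results = dict(inputs)
    provenance = []
    for name in TopologicalSorter(graph).static_order():
        if name in inputs:
            continue  # raw inputs are data, not tasks
        func, deps = tasks[name]
        args = [results[d] for d in deps]
        results[name] = func(*args)
        provenance.append({
            "task": name,
            "action": func.__name__,
            "depends_on": list(deps),
            "input_digests": [fingerprint(a) for a in args],
            "output_digest": fingerprint(results[name]),
        })
    return results, provenance


# Hypothetical tasks for illustration.
def drop_missing(rows):
    """Keep only rows that have a velocity value."""
    return [r for r in rows if r["vhel"] is not None]


def mean_velocity(rows):
    """Average the velocity column of the cleaned rows."""
    return sum(r["vhel"] for r in rows) / len(rows)


# Illustrative sample rows (assumed values, one with a missing velocity).
catalog = [
    {"cig": 1, "vhel": 7299.0},
    {"cig": 2, "vhel": 6983.0},
    {"cig": 3, "vhel": None},
    {"cig": 4, "vhel": 2310.0},
]
tasks = {
    "clean": (drop_missing, ["catalog"]),
    "mean_vhel": (mean_velocity, ["clean"]),
}
results, provenance = run_workflow(tasks, {"catalog": catalog})
for record in provenance:
    print(record["task"], "<-", record["depends_on"], record["output_digest"])
```

Because each record stores digests of its inputs and output, a re-run on the same data can be compared step by step, which is the kind of check a reproducible workflow should make possible.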