VO Web-services-based astronomy workflows

Jose Enrique Ruiz
IAA - CSIC

Manchester, 13th July 2011

Curating and preserving collaborative digital experiments

1.  Intelligent Software Components (ISOCO, Spain)
2.  University of Manchester (UNIMAN, UK)
3.  Universidad Politécnica de Madrid (UPM, Spain)
4.  Poznan Supercomputing and Networking Centre (PSNC, Poland)
5.  University of Oxford (OXF, UK)
6.  Instituto de Astrofísica de Andalucía (IAA-CSIC, Spain)
7.  Leiden University Medical Centre (LUMC, NL)

Who are you?

The AMIGA Group
Analysis of the interstellar Medium of Isolated Galaxies
    Statistical baseline of isolated galaxies to compare
    with the behaviour of galaxies in denser environments

    Multi-wavelength study of ~1000 galaxies

Instituto de Astrofísica de Andalucía - CSIC,
Univ. Granada, Obs. Marseille, Obs. Paris,
NAOJ, FCRAO, UNAM, Univ. Edinburgh,
IRAM, ESO, Kapteyn Astronomical Institute.
P.I. Lourdes Verdes-Montenegro

VO Virtual Observatory
•  International Virtual Observatory Alliance (IVOA)
•  Interoperability and Discovery
•  Publishing and Accessing Data
•  Service Oriented Architecture (SOA)
•  Integration of Software and Data
•  Distributed Resources
•  Panchromatic Astronomy

•  Data Models
•  Web Services
•  Semantics

The AMIGA VO Catalog
The Data Provider

Radio Astronomy Data Model for Single-Dish telescopes

RADAMS Implementation

VO Archives Developments

Robledo DSS-63
•  Madrid Deep Space Communication Complex (MDSCC)
•  70m single dish in Robledo de Chavela (Madrid)
•  5% operational time for observations
•  K band Spectra (18 - 26 GHz)
•  H2O Masers, methanol, NH3, ...

TAPAS – IRAM 30m
•  Telescope Archive for Public Access System
•  Bolometric observations, maps, spectra
•  Rotational molecular transitions
•  ~200 scientific projects / year, 1TB

Radio Astronomy DAta Model for Single-dish telescopes

The AMIGA Group
Analysis of the interstellar Medium of Isolated Galaxies
    Statistical baseline of isolated galaxies to compare
    with the behaviour of galaxies in denser environments

    Multi-wavelength study of ~1000 galaxies
    Need for intensive and complex analysis of 3D data
    2D spatial + 1 Velocity

Velocity Datacubes

M. Krips – ESO 3D2008 Workshop – Garching

Groningen Image Processing SYstem

Connectivity
•  VO Archives
•  VO Software
•  Usability GUI
•  VO Web Services
Kapteyn Astronomical Institute
IAA - CSIC

B0DEGA Below 0 DEgrees GAlaxies
P.I.: D. Espada
Legacy project of the Submillimeter Array (SMA) interferometer
CfA (Harvard-Smithsonian Center for Astrophysics)
ASIAA (Academia Sinica Institute of Astronomy and Astrophysics)
    Molecular gas properties of a survey of nearby galaxies.

    30 processed and reduced datacubes of galaxies

The B0DEGA 3D VO Catalog
The Data and Service provider

Aladin VO Software

The Virtual Observatory
Infrastructure of interoperable data and services. Standards for:
•  Providers to share data and services
•  Developers to discover the services, find and access the data
Goal: astronomers use this infrastructure in a seamless way

Standards for Web Services
•  Most of the Web Services in Astronomy
•  They are registered and curated
    •  VO Registry
•  WS for Humans
    •  Data discovery and data access
    •  Accessed with local software (Europe)
    •  Integrated in web portals (USA)
•  WS for Machines
    •  Storage, transport, authentication, etc.

The VO Registry
•  If you are not registered, you are not in the VO
•  Web forms to register services
•  Three VO Registries
    •  Euro-VO
    •  National Virtual Observatory (USA)
    •  AstroGrid (UK)
•  Harvesting among registries
•  A VO Registry registers resources
    •  Organizations
    •  Authorities
    •  Data collections
    •  Services

WS for Humans
•  Most WS provide “just” Data Discovery and Access
•  Associated with a very specific Archive
•  Designed to discover
    •  VO Services
    •  Catalogs
    •  Images
    •  Spectra
•  Parameters-based -> Standards
•  Responses are always VOTables
    •  Characterization of data
    •  Actual data values
        •  List of services
        •  Spreadsheets for catalogues
        •  Links to binaries for images and spectra

WS for Humans
•  Sesame name resolver is one of the most used
    •  Resolves object names into coordinates
    •  Provided by Centre de Données de Strasbourg (CDS)
•  Data Discovery and Access (RESTful)
    •  ConeSearch
    •  Simple Image Access
    •  Simple Spectra Access
        •  Parameters: RA, DEC, SIZE
    •  Table Access Protocol (TAP), OpenSkyQuery, SkyNodes
        •  Astronomical Data Query Language (ADQL) requests
•  Sparse complex services (SOAP)
    •  Mosaicing of images, footprint of regions, spectral
       building and fitting, principal components analysis in
    •  Common Execution Architecture (AstroGrid) - did not take off
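
As a minimal sketch of the parameter-based discovery services listed above, the following Python builds a ConeSearch request URL. It assumes the parameter names of the IVOA Simple Cone Search standard (RA, DEC and SR, all in decimal degrees); SIZE is the corresponding search-size parameter in Simple Image Access and Simple Spectra Access. The function name and the base URL are hypothetical placeholders, not a real service.

```python
from urllib.parse import urlencode


def cone_search_url(base_url, ra_deg, dec_deg, radius_deg):
    """Return a cone search URL for a position and radius in decimal degrees."""
    if not 0.0 <= ra_deg < 360.0:
        raise ValueError("RA must be in [0, 360) degrees")
    if not -90.0 <= dec_deg <= 90.0:
        raise ValueError("DEC must be in [-90, 90] degrees")
    if radius_deg < 0.0:
        raise ValueError("search radius must be non-negative")

    query = urlencode({"RA": ra_deg, "DEC": dec_deg, "SR": radius_deg})
    # Services are often published with a query string already attached.
    if base_url.endswith(("?", "&")):
        separator = ""
    elif "?" in base_url:
        separator = "&"
    else:
        separator = "?"
    return f"{base_url}{separator}{query}"


# Hypothetical endpoint, used only to show the shape of the request.
url = cone_search_url("http://example.org/scs", 10.5, -20.25, 0.1)
```

The response to such a request would be a VOTable, which the calling workflow then has to parse.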

WS for Machines
•  Implementation in progress
    •  More standards than implemented services
•  Universal Worker Service (Grid oriented)
    •  asynchronous
    •  stateful
    •  job oriented services
•  VOSpace
    •  distributed storage
    •  will be provided for Big Data archives
•  Single Sign-On and Credential Delegation
•  Registry Interfaces: services acting on the Registry

•  VO Services Support Interface (VOSI, REST binding)
•  Implementation in progress
•  Provides interoperability among services
•  Common Contract for all VO services
•  Self-descriptive services
    -  operations and data
         /capabilities /tables
    -  state of the service
         /availability /upSince /downAt /backAt /note
•  XML/VOTable VOSI files
•  VOSI files stored on the service provider's server
•  Files are scanned by VO Registries
•  Also provide the state of the service
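
The self-description idea above can be sketched in Python as follows. This is an illustration under stated assumptions: the endpoint layout simply appends the names from the slide to a base URL (real services may advertise different locations), the sample availability document is hand-written, and its namespace URI is assumed rather than taken from the slides. Element names are matched by local name only, so the sketch does not depend on that namespace.

```python
import xml.etree.ElementTree as ET

# Hand-written sample for illustration; a real document comes from a service.
SAMPLE_AVAILABILITY = """\
<avail:availability xmlns:avail="http://www.ivoa.net/xml/VOSIAvailability/v1.0">
  <avail:available>true</avail:available>
  <avail:upSince>2011-07-01T09:00:00Z</avail:upSince>
  <avail:note>Hypothetical sample service</avail:note>
</avail:availability>
"""


def vosi_endpoints(base_url):
    """Map each support interface named on the slide to a URL (assumed layout)."""
    root = base_url.rstrip("/")
    return {name: f"{root}/{name}" for name in ("capabilities", "tables", "availability")}


def parse_availability(xml_text):
    """Extract the service state from an availability document."""
    root = ET.fromstring(xml_text)
    # Strip the "{namespace}" prefix so only the local element name is used.
    fields = {child.tag.rsplit("}", 1)[-1]: (child.text or "").strip() for child in root}
    return {
        "available": fields.get("available", "").lower() in ("true", "1"),
        "upSince": fields.get("upSince"),
        "downAt": fields.get("downAt"),
        "backAt": fields.get("backAt"),
        "note": fields.get("note"),
    }


endpoints = vosi_endpoints("http://example.org/service/")  # hypothetical base URL
status = parse_availability(SAMPLE_AVAILABILITY)
```

A workflow engine could run a check like this before invoking a service, and skip or substitute services that report themselves as unavailable.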

XML Format
•  Characterization of Data
    •  Semantics
        •  UCDs (Universal Content Descriptors)
    •  Data Models
        •  UTypes
•  Actual Data
    •  Tabular data
    •  Links to binary data
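
To make the split between characterization and actual data concrete, here is a small Python sketch that reads a hand-written VOTable with the standard library. The sample table, its column names and the function name are illustrative assumptions. The sketch only handles a single table in the TABLEDATA serialization and ignores binary serializations and links.

```python
import xml.etree.ElementTree as ET

# Hand-written sample; the UCDs describe what each column means.
SAMPLE_VOTABLE = """\
<VOTABLE version="1.2" xmlns="http://www.ivoa.net/xml/VOTable/v1.2">
  <RESOURCE>
    <TABLE>
      <FIELD name="RA" datatype="double" unit="deg" ucd="pos.eq.ra;meta.main"/>
      <FIELD name="DEC" datatype="double" unit="deg" ucd="pos.eq.dec;meta.main"/>
      <FIELD name="name" datatype="char" arraysize="*" ucd="meta.id;meta.main"/>
      <DATA><TABLEDATA>
        <TR><TD>10.5</TD><TD>-20.25</TD><TD>Galaxy A</TD></TR>
        <TR><TD>150.0</TD><TD>2.5</TD><TD>Galaxy B</TD></TR>
      </TABLEDATA></DATA>
    </TABLE>
  </RESOURCE>
</VOTABLE>
"""


def local_name(tag):
    """Drop the "{namespace}" prefix that ElementTree adds to tags."""
    return tag.rsplit("}", 1)[-1]


def read_votable(xml_text):
    """Return (fields, rows): column metadata and rows keyed by column name."""
    root = ET.fromstring(xml_text)
    fields, rows = [], []
    # iter() walks in document order, so FIELDs are seen before any TR.
    for elem in root.iter():
        tag = local_name(elem.tag)
        if tag == "FIELD":
            fields.append({
                "name": elem.get("name"),
                "ucd": elem.get("ucd"),
                "utype": elem.get("utype"),
            })
        elif tag == "TR":
            cells = [(td.text or "").strip() for td in elem if local_name(td.tag) == "TD"]
            rows.append(dict(zip((f["name"] for f in fields), cells)))
    return fields, rows


fields, rows = read_votable(SAMPLE_VOTABLE)
```

Selecting columns by UCD rather than by name is what lets one workflow step work with tables from different providers.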

Ontologies, SKOS Vocabularies

VO Software
Aladin Sky Atlas
A Cloud of Services

The next generation of archives
    Much wider FoV and spectral coverage
    •  Large volumes for an observed datacube
    •  Subproducts are Virtual Data generated on-the-fly

    Automated surveys
    •  Huge amounts of tabular data
    •  Services for Knowledge Discovery in Databases

Cube sizes

ASKAP Cubes
Prof. Kevin Vinsen
The overall picture
Distributed, scalable and flexible infrastructure
•  Grid + Cloud may solve storage and processing
•  Bandwidth is the issue

Big Data Science performance is highly dependent
upon I/O data rates (local and transfer)
The data is the infrastructure
•  Interconnected and interoperable archives
•  Distributed, multi-wavelength and multi-facility
Archives speaking Web Services
ALMA, LSST, ASKAP, MeerKAT, LOFAR, Apertif, ...
The overall picture
We are moving into a world where
•  computing and storage are cheap
•  data movement is death
Archives should evolve from data providers into virtual data
and services providers, where web services may help to solve
bandwidth issues.
Web Services
•  Smaller virtual data subproducts
•  Distributed, multi-archive, multi-wavelength astronomy
•  Workflows as a disruptive working methodology

3D Data Services
•  Cutout
•  Resample
•  Spectrum extraction
•  2D slice extraction
•  Dimensional reduction
•  Filtering/Flagging
•  2D Moments
•  Complex transformations
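
A few of the operations listed above can be sketched on a toy cube in plain Python. This is only an illustration of what such services compute: the cube is assumed to be a nested list indexed as cube[channel][y][x], the function names are hypothetical, and a real service would operate on FITS files with an array library rather than on lists.

```python
def extract_spectrum(cube, x, y):
    """Return the intensity at pixel (x, y) for every velocity channel."""
    return [plane[y][x] for plane in cube]


def moment0(cube, channel_width):
    """Integrate over the velocity axis to get a 2D map (zeroth moment)."""
    ny, nx = len(cube[0]), len(cube[0][0])
    return [
        [channel_width * sum(plane[j][i] for plane in cube) for i in range(nx)]
        for j in range(ny)
    ]


def cutout(cube, x0, x1, y0, y1):
    """Return the spatial sub-cube covering x0 <= x < x1 and y0 <= y < y1."""
    return [[row[x0:x1] for row in plane[y0:y1]] for plane in cube]


# Toy cube: 3 velocity channels of 2 x 2 pixels.
cube = [
    [[1, 2], [3, 4]],
    [[10, 20], [30, 40]],
    [[100, 200], [300, 400]],
]
spectrum = extract_spectrum(cube, x=1, y=0)
integrated = moment0(cube, channel_width=2.0)
small = cutout(cube, 0, 1, 0, 2)
```

Each result is much smaller than the cube it was derived from, which is the point of running these operations next to the archive instead of downloading the whole cube.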
Scientific Use Cases

Exploration services
KDD - Knowledge Discovery in Databases
Understand what information is contained within the
data in order to know how we can efficiently extract it

•  Anomaly detection
•  Cross-matching data
•  Dimensionality reduction

Extraction of scientifically relevant information from a
multidimensional parameter space.

Figure: VisIt software

Data Mining
Some key astronomy problems that can be addressed with data mining:
•  Cross-Match objects from different catalogues
•  The distance problem (e.g., Photometric Redshift estimators)
•  Star-Galaxy Separation
•  Cosmic-Ray Detection in images
•  Supernova Detection and Classification
•  Morphological Classification (galaxies, AGN, gravitational lenses, ...)
•  Class and Subclass Discovery (brown dwarf stars, ...)
•  Dimension Reduction = Correlation Discovery
•  Learning Rules for improved classifiers
•  Classification of massive data streams
•  Real-time Classification of Astronomical Events
•  Clustering of massive data collections
•  Novelty, Anomaly, Outlier Detection in massive databases
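
The first item in the list, cross-matching objects from different catalogues, can be sketched in Python as a nearest-neighbour search on the sky. This is a brute-force illustration with made-up catalogue entries; the function names are hypothetical, and tools built for large catalogues use spatial indexing instead of comparing every pair.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    sin_ddec = math.sin((dec2 - dec1) / 2.0)
    sin_dra = math.sin((ra2 - ra1) / 2.0)
    a = sin_ddec ** 2 + math.cos(dec1) * math.cos(dec2) * sin_dra ** 2
    # Clamp against rounding errors that would push sqrt(a) above 1.
    return math.degrees(2.0 * math.asin(min(1.0, math.sqrt(a))))


def crossmatch(cat_a, cat_b, radius_deg):
    """For each (name, ra, dec) in cat_a, find the closest cat_b entry within radius."""
    matches = []
    for name_a, ra_a, dec_a in cat_a:
        best = None
        for name_b, ra_b, dec_b in cat_b:
            sep = angular_separation_deg(ra_a, dec_a, ra_b, dec_b)
            if sep <= radius_deg and (best is None or sep < best[1]):
                best = (name_b, sep)
        if best is not None:
            matches.append((name_a, best[0], best[1]))
    return matches


# Made-up positions in decimal degrees, for illustration only.
cat_a = [("A1", 10.0, 20.0), ("A2", 150.0, -5.0)]
cat_b = [("B1", 10.001, 20.0), ("B2", 200.0, 45.0)]
matches = crossmatch(cat_a, cat_b, radius_deg=5.0 / 3600.0)  # 5 arcsec
```

Only A1 has a counterpart within the 5 arcsec radius here, so A2 is left unmatched rather than paired with a distant source.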

Multidimensional Clustering

Cepheid Variables
Cosmic yardsticks
-- One Correlation
-- Two Classes!

Outlier detection

Self Organizing Map
Organizing information in complex data collections
Find hidden relationships and patterns
Based on links among keywords and metadata

The time domain
•  VO Sky Event reporting metadata
•  What, Where, Who, How?
•  Stellar flares, GRBs, solar and atmospheric particle bursts, ...
The Helio-VO Project

The VO-Experiment
•  Data Mining Oriented
•  VO Services
    •  Discovery
    •  Access
    •  Waiting for analysis services
•  Local software (also some Web portals)
    •  Crossmatching
    •  Inspection
    •  Visualization
•  Web services associated with archives of big facilities
    •  Hinders cross-boundary science
XMM Observations of the AMIGA Sample
TOPCAT Hands-On
Let’s do some science!
    Slightly brighter
    Brighter in FIR
    Excess in longer

Why Workflows?
Web-services-based vs. Pipelines
•  Expose the scientific methodology
•  Keep the provenance
•  Pack the experiment
•  Enable
    •  repeatable results
    •  reproducibility
    •  reuse, repurpose
    •  cross-boundary science
    •  preservation

Workflows Preservation
All components related to the
research lifecycle should be available,
preserved and easily retrievable
•  Proposals
•  Data
•  Processes
•  Workflows
•  Publications


Open questions for Web Services
In the Virtual Observatory
•  Curation and preservation (identifiers)
•  Discovery (semantics) of web services
•  Characterization: inputs, outputs, functionality, etc.
•  Copies (authenticity) or similar services used as alternates
•  Permissions (authentication), licenses, platform, costs, ...
•  Metrics for quality: popularity, use stats, logs, uptime, etc.
•  Versioning and authoring (referenced and acknowledged)
In a cloud of services and data, Web Services should benefit
from the same privileges already granted to Data.

IVOA Note on Workflows

•  No VO services-based workflows
•  Helio Project workflows
•  VOTable parsing
•  Internal services

•  Querying Catalogue

Working with the v2.3

Simple AMIGA ConeSearch

•  XPath plugin not useful for extracting info from VOTable
•  Helio-VO beanshell used instead (Thanks!)
•  Visualization of results... (VOTables)

XMM Multi-ConeSearch

•  A lot of VOTable parsing needed beforehand...
•  The response is 1051 VOTables!
•  VOTable merging tool needed
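
The merging step asked for above can be sketched in Python with the standard library. This is not the tool used in the workflow, just an illustration under stated assumptions: every response is a single-table VOTable in the TABLEDATA serialization with identical columns, the sample documents are hand-written, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def _local(tag):
    """Drop the "{namespace}" prefix that ElementTree adds to tags."""
    return tag.rsplit("}", 1)[-1]


def _read_table(xml_text):
    """Return (column names, rows as lists of cell strings) for one VOTable."""
    root = ET.fromstring(xml_text)
    names = [e.get("name") for e in root.iter() if _local(e.tag) == "FIELD"]
    rows = [
        [(td.text or "").strip() for td in tr if _local(td.tag) == "TD"]
        for tr in root.iter()
        if _local(tr.tag) == "TR"
    ]
    return names, rows


def merge_votables(documents):
    """Concatenate the rows of several VOTables that share the same columns."""
    merged_names, merged_rows = None, []
    for index, xml_text in enumerate(documents):
        names, rows = _read_table(xml_text)
        if merged_names is None:
            merged_names = names
        elif names != merged_names:
            raise ValueError(f"document {index} has different columns: {names}")
        merged_rows.extend(rows)
    return merged_names or [], merged_rows


# Hand-written responses; the second one is an empty result.
TEMPLATE = (
    '<VOTABLE><RESOURCE><TABLE>'
    '<FIELD name="RA" datatype="double"/><FIELD name="DEC" datatype="double"/>'
    '<DATA><TABLEDATA>{rows}</TABLEDATA></DATA>'
    '</TABLE></RESOURCE></VOTABLE>'
)
doc_a = TEMPLATE.format(rows="<TR><TD>10.5</TD><TD>-20.25</TD></TR>")
doc_b = TEMPLATE.format(rows="")
doc_c = TEMPLATE.format(rows="<TR><TD>150.0</TD><TD>2.5</TD></TR>")
names, rows = merge_votables([doc_a, doc_b, doc_c])
```

Empty responses contribute no rows, so a multi-query over a sample where many positions have no match still produces one compact table.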

AMIGA Multi-ConeSearch

•  A lot of beanshells for VOTable and CSV parsing...
•  Beanshell development needed for splitting lists into values
•  STILTS Library needed for VOTable crossmatching

The VO-experiment
•  Discover Services
•  Multi-query
•  Crossmatching
•  Inspection
•  Visualization and Comparison
Proposed shortcuts for Taverna
•  VORegistry Access Perspective
•  STILTS VOTable Library
•  SAMP (Connectivity with VO Software)
•  Python-based beanshells
•  Simple standard astronomy functions
Thanks!
Wf4Ever @ Manchester
•  Carole Goble
•  Sean Bechhofer
•  Jiten Bhagat
•  Stian Soiland-Reyes
•  Khalid Belhajjame
•  John Brooke
•  Donal Fellows
•  Anja Le Blanc

More Related Content

What's hot

A Recommender Story: Improving Backend Data Quality While Reducing Costs
A Recommender Story: Improving Backend Data Quality While Reducing CostsA Recommender Story: Improving Backend Data Quality While Reducing Costs
A Recommender Story: Improving Backend Data Quality While Reducing Costs
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
Ian Foster
Overview of the W3C Semantic Sensor Network (SSN) ontology
Overview of the W3C Semantic Sensor Network (SSN) ontologyOverview of the W3C Semantic Sensor Network (SSN) ontology
Overview of the W3C Semantic Sensor Network (SSN) ontology
Raúl García Castro
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Ian Foster
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Laurent Lefort
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
Mario Juric
Scaling People, Not Just Systems, to Take On Big Data Challenges
Scaling People, Not Just Systems, to Take On Big Data ChallengesScaling People, Not Just Systems, to Take On Big Data Challenges
Scaling People, Not Just Systems, to Take On Big Data Challenges
Matthew Vaughn
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
VHIR Vall d’Hebron Institut de Recerca
From Data to Knowledge with Workflows & Provenance
From Data to Knowledge with Workflows & ProvenanceFrom Data to Knowledge with Workflows & Provenance
From Data to Knowledge with Workflows & Provenance
Bertram Ludäscher
Toward Semantic Sensor Data Archives on the Web
Toward Semantic Sensor Data Archives on the WebToward Semantic Sensor Data Archives on the Web
Toward Semantic Sensor Data Archives on the Web
Jean-Paul Calbimonte
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
Ian Foster
Data Infrastructure Development for SKA/Jasper Horrell
Data Infrastructure Development for SKA/Jasper HorrellData Infrastructure Development for SKA/Jasper Horrell
Data Infrastructure Development for SKA/Jasper Horrell
African Open Science Platform
Quick Introduction to Cytoscape for Undergraduates
Quick Introduction to Cytoscape for UndergraduatesQuick Introduction to Cytoscape for Undergraduates
Quick Introduction to Cytoscape for Undergraduates
Keiichiro Ono
Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)
Krishna Sankar
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
Gaignard Alban
Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.  Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.
catherine roussey
Scalable Data Science and Deep Learning with H2O
Scalable Data Science and Deep Learning with H2OScalable Data Science and Deep Learning with H2O
Scalable Data Science and Deep Learning with H2O
The Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years InThe Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years In
Larry Smarr
Spark streaming
Spark streamingSpark streaming
Spark streaming
Noam Shaish
The Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentThe Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environment
Rutger Vos

What's hot (20)

A Recommender Story: Improving Backend Data Quality While Reducing Costs
A Recommender Story: Improving Backend Data Quality While Reducing CostsA Recommender Story: Improving Backend Data Quality While Reducing Costs
A Recommender Story: Improving Backend Data Quality While Reducing Costs
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
Overview of the W3C Semantic Sensor Network (SSN) ontology
Overview of the W3C Semantic Sensor Network (SSN) ontologyOverview of the W3C Semantic Sensor Network (SSN) ontology
Overview of the W3C Semantic Sensor Network (SSN) ontology
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy SciencesDiscovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
Semantically-Enabling the Web of Things: The W3C Semantic Sensor Network Onto...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
Scaling People, Not Just Systems, to Take On Big Data Challenges
Scaling People, Not Just Systems, to Take On Big Data ChallengesScaling People, Not Just Systems, to Take On Big Data Challenges
Scaling People, Not Just Systems, to Take On Big Data Challenges
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, B...
From Data to Knowledge with Workflows & Provenance
From Data to Knowledge with Workflows & ProvenanceFrom Data to Knowledge with Workflows & Provenance
From Data to Knowledge with Workflows & Provenance
Toward Semantic Sensor Data Archives on the Web
Toward Semantic Sensor Data Archives on the WebToward Semantic Sensor Data Archives on the Web
Toward Semantic Sensor Data Archives on the Web
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
Data Infrastructure Development for SKA/Jasper Horrell
Data Infrastructure Development for SKA/Jasper HorrellData Infrastructure Development for SKA/Jasper Horrell
Data Infrastructure Development for SKA/Jasper Horrell
Quick Introduction to Cytoscape for Undergraduates
Quick Introduction to Cytoscape for UndergraduatesQuick Introduction to Cytoscape for Undergraduates
Quick Introduction to Cytoscape for Undergraduates
Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.  Weather Station Data Publication at Irstea: an implementation Report.
Weather Station Data Publication at Irstea: an implementation Report.
Scalable Data Science and Deep Learning with H2O
Scalable Data Science and Deep Learning with H2OScalable Data Science and Deep Learning with H2O
Scalable Data Science and Deep Learning with H2O
The Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years InThe Pacific Research Platform
 Two Years In
The Pacific Research Platform
 Two Years In
Spark streaming
Spark streamingSpark streaming
Spark streaming
The Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environmentThe Galaxy bioinformatics workflow environment
The Galaxy bioinformatics workflow environment

Similar to VO web-services-based astronomy workflows

Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataJose Enrique Ruiz
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationJose Enrique Ruiz
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesJose Enrique Ruiz
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsJose Enrique Ruiz
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VOJose Enrique Ruiz
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital ExperimentsJose Enrique Ruiz
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Joint ALMA Observatory
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache Spark
Danis biosystematics2011
Danis biosystematics2011Danis biosystematics2011
Danis biosystematics2011
Bruno Danis
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
Linked Data Access Goes Mobile: Context Aware Authorization for Graph Stores
Linked Data Access Goes Mobile: Context Aware Authorization for Graph StoresLinked Data Access Goes Mobile: Context Aware Authorization for Graph Stores
Linked Data Access Goes Mobile: Context Aware Authorization for Graph StoresLuca Costabello
Biomedical Atlas Centre
Biomedical Atlas CentreBiomedical Atlas Centre
Biomedical Atlas Centre
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropJose Enrique Ruiz
Context-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph StoresContext-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph Stores
Serena Villata
SemsorGrid4Env (Newsfromthefront 2010)
SemsorGrid4Env (Newsfromthefront 2010)SemsorGrid4Env (Newsfromthefront 2010)
SemsorGrid4Env (Newsfromthefront 2010)STI International
Our World is Socio-technical
Our World is Socio-technicalOur World is Socio-technical
Our World is Socio-technical
Markus Luczak-Rösch

Similar to VO web-services-based astronomy workflows (20)

Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D data
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow Preservation
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital Experiments
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
Workflow Preservation
Workflow PreservationWorkflow Preservation
Workflow Preservation
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital Experiments
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
Roberts leiden110213
Roberts leiden110213Roberts leiden110213
Roberts leiden110213
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache Spark
Danis biosystematics2011
Danis biosystematics2011Danis biosystematics2011
Danis biosystematics2011
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
Linked Data Access Goes Mobile: Context Aware Authorization for Graph Stores
Linked Data Access Goes Mobile: Context Aware Authorization for Graph StoresLinked Data Access Goes Mobile: Context Aware Authorization for Graph Stores
Linked Data Access Goes Mobile: Context Aware Authorization for Graph Stores
Biomedical Atlas Centre
Biomedical Atlas CentreBiomedical Atlas Centre
Biomedical Atlas Centre
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
Context-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph StoresContext-Aware Access Control for RDF Graph Stores
Context-Aware Access Control for RDF Graph Stores
SemsorGrid4Env (Newsfromthefront 2010)
SemsorGrid4Env (Newsfromthefront 2010)SemsorGrid4Env (Newsfromthefront 2010)
SemsorGrid4Env (Newsfromthefront 2010)
Our World is Socio-technical
Our World is Socio-technicalOur World is Socio-technical
Our World is Socio-technical

More from Jose Enrique Ruiz

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
Jose Enrique Ruiz
IPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutablesIPython Notebooks - Hacia los papers ejecutables
IPython Notebooks - Hacia los papers ejecutables
Jose Enrique Ruiz
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
Jose Enrique Ruiz
Open Science and Executable Papers
Open Science and Executable PapersOpen Science and Executable Papers
Open Science and Executable Papers
Jose Enrique Ruiz
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paper
Jose Enrique Ruiz
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
Jose Enrique Ruiz
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCAJose Enrique Ruiz
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz

VO web-services-based astronomy workflows

  • 1. VO Web-services-based astronomy workflows. Jose Enrique Ruiz, IAA - CSIC. Manchester, 13th July 2011.
  • 3. Wf4Ever: Curating and preserving collaborative digital experiments. 1. Intelligent Software Components (ISOCO, Spain) 2. University of Manchester (UNIMAN, UK) 3. Universidad Politécnica de Madrid (UPM, Spain) 4. Poznan Supercomputing and Networking Centre (PSNC, Poland) 5. University of Oxford (OXF, UK) 6. Instituto de Astrofísica de Andalucía (IAA-CSIC, Spain) 7. Leiden University Medical Centre (LUMC, NL)
  • 4. Who are you? The AMIGA Group: Analysis of the interstellar Medium of Isolated Galaxies. Statistical baseline of isolated galaxies to compare with the behaviour of galaxies in denser environments. Multi-wavelength study of ~1000 galaxies. Instituto de Astrofísica de Andalucía - CSIC, Univ. Granada, Obs. Marseille, Obs. Paris, NAOJ, FCRAO, UNAM, Univ. Edinburgh, IRAM, ESO, Kapteyn Astronomical Institute. P.I. Lourdes Verdes-Montenegro.
  • 5. Who are you? VO Virtual Observatory: • International Virtual Observatory Alliance (IVOA) • Interoperability and Discovery • Publishing and Accessing Data • Service Oriented Architecture (SOA) • Integration of Software and Data • Distributed Resources • Panchromatic Astronomy • Data Models • Web Services • Semantics
  • 6. Who are you? VO Virtual Observatory
  • 7. Who are you? The AMIGA VO Catalog: The Data Provider
  • 8. Who are you? RADAMS: Radio Astronomy Data Model for Single-Dish telescopes
  • 9. Who are you? RADAMS Implementation
  • 10. Who are you? VO Archives Developments. Robledo DSS-63: • Madrid Deep Space Communication Complex (MDSCC) • 70m single dish in Robledo de Chavela (Madrid) • 5% operational time for observations • K band Spectra (18 - 26 GHz) • H2O Masers, methanol, NH3, ... TAPAS – IRAM 30m: • Telescope Archive for Public Access System • Bolometric observations, maps, spectra • Rotational molecular transitions • ~200 scientific projects / year, 1TB. Radio Astronomy DAta Model for Single-dish telescopes (RADAMS).
  • 11. Who are you? The AMIGA Group: Analysis of the interstellar Medium of Isolated Galaxies. Statistical baseline of isolated galaxies to compare with the behaviour of galaxies in denser environments. Multi-wavelength study of ~1000 galaxies + need for intensive and complex analysis of 3D data (2D spatial + 1 velocity).
  • 12. Who are you? Velocity Datacubes. M. Krips – ESO 3D2008 Workshop – Garching
  • 13. Who are you? GIPSY: Groningen Image Processing SYstem. Connectivity: • VO Archives • VO Software. Accessibility: • Usability GUI • VO Web Services. Kapteyn Astronomical Institute, IAA - CSIC.
  • 14. Who are you? B0DEGA: Below 0 DEgrees GAlaxies. P.I.: D. Espada. Legacy project of the Submillimeter Array interferometer (SMA). IAA-CSIC, CfA (Harvard-Smithsonian Center for Astrophysics), ASIAA (Academia Sinica Institute of Astronomy and Astrophysics). Molecular gas properties of a survey of nearby galaxies. 30 processed and reduced datacubes of galaxies.
  • 15. Who are you? The B0DEGA 3D VO Catalog: The Data and Service provider. Aladin VO Software.
  • 16. The Virtual Observatory: Infrastructure of interoperable data and services. Standards for: • Providers to share data and services • Developers to discover the services, find and access the data. Goal: astronomers use this infrastructure in a seamless way.
  • 17. The Virtual Observatory: Standards for Web Services. • Most of the Web Services in Astronomy • They are registered and curated • VO Registry • WS for Humans • Data discovery and data access • Accessed with local software (Europe) • Integrated in web portals (USA) • WS for Machines • Storage, transport, authentication, etc.
  • 18. The Virtual Observatory: The VO Registry. • If you are not registered, you are not in the VO • Web forms to register services • Three VO Registries • Euro-VO • National Virtual Observatory (USA) • AstroGrid (UK) • Harvesting among registries • A VO Registry registers resources • Organizations • Authorities • Data collections • Services
  • 19. The Virtual Observatory: WS for Humans. • Most WS provide “just” Data Discovery and Access • Associated with a very specific Archive • Designed to discover • VO Services • Catalogs • Images • Spectra • Parameters-based -> Standards • Responses are always VOTables • Characterization of data • Actual data values • List of services • Spreadsheets for catalogues • Links to binaries for images and spectra
  • 20. The Virtual Observatory: WS for Humans. • Sesame name resolver is one of the most used • Resolves object names into coordinates • Provided by the Centre de Données astronomiques de Strasbourg (CDS) • Data Discovery and Access (RESTful) • ConeSearch • Simple Image Access • Simple Spectral Access • Parameters: RA, DEC, SIZE • Table Access Protocol (TAP), OpenSkyQuery, SkyNodes • Astronomical Data Query Language (ADQL) requests • Sparse complex services (SOAP) • Mosaicing of images, footprint of regions, spectral building and fitting, principal components analysis in spectra, ... • Common Execution Architecture (AstroGrid): did not take off
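The parameter-based access on slide 20 can be illustrated with a minimal sketch that only builds the request URL and does not contact any service. It assumes the Simple Cone Search convention, where the position is passed as `RA` and `DEC` and the radius as `SR`, all in decimal degrees (`SIZE` is the corresponding parameter name in Simple Image Access). The base URL and the function name `cone_search_url` are hypothetical.

```python
from urllib.parse import urlencode


def cone_search_url(base_url, ra_deg, dec_deg, radius_deg):
    """Build a Simple Cone Search GET URL (RA, DEC, SR in decimal degrees)."""
    if not 0.0 <= ra_deg < 360.0:
        raise ValueError("RA must be in [0, 360) degrees")
    if not -90.0 <= dec_deg <= 90.0:
        raise ValueError("DEC must be in [-90, 90] degrees")
    if radius_deg < 0.0:
        raise ValueError("search radius must be non-negative")
    query = urlencode({"RA": ra_deg, "DEC": dec_deg, "SR": radius_deg})
    # Registered base URLs may already end in '?' or '&', or carry fixed parameters.
    if base_url.endswith(("?", "&")):
        separator = ""
    elif "?" in base_url:
        separator = "&"
    else:
        separator = "?"
    return f"{base_url}{separator}{query}"


# Hypothetical endpoint, for illustration only.
url = cone_search_url("http://example.org/conesearch", 10.68, 41.27, 0.1)
print(url)
```

The response to such a request would be a VOTable, which is why VOTable parsing shows up in every workflow later in the talk.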
  • 21. The Virtual Observatory: WS for Machines. • Implementation in progress • More standards than implemented services • Universal Worker Service (Grid oriented) • asynchronous • stateful • job oriented services • VOSpace • distributed storage • will be provided for Big Data archives • Single Sign-On and Credential Delegation • Registry Interfaces: services acting on the Registry
  • 22. The Virtual Observatory: VOSI. • VO Services Support Interface (REST binding) • Implementation in progress • Provides interoperability among services • Common Contract for all VO services • Self-descriptive services - operations and data: /capabilities /tables - state of the service: /availability /upSince /downAt /backAt /note • XML/VOTable VOSI files • VOSI files stored in service provider server • Files are scanned by VO Registries • Provide also state of the service
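As a rough illustration of the service-state information on slide 22, the sketch below reads an availability document and collects the `available`, `upSince`, `downAt`, `backAt` and `note` fields. The sample XML is invented for this example, and the parser matches elements by local name so it does not depend on a particular namespace prefix; a real service response may differ in detail.

```python
import xml.etree.ElementTree as ET

# Invented sample document; the service it describes is hypothetical.
SAMPLE_AVAILABILITY = """
<vosi:availability xmlns:vosi="http://www.ivoa.net/xml/VOSIAvailability/v1.0">
  <vosi:available>true</vosi:available>
  <vosi:upSince>2011-07-01T08:00:00Z</vosi:upSince>
  <vosi:note>Hypothetical service, for illustration only</vosi:note>
</vosi:availability>
""".strip()


def parse_availability(xml_text):
    """Return the service state as a dict; absent fields stay None."""
    root = ET.fromstring(xml_text)
    status = {"available": None, "upSince": None, "downAt": None,
              "backAt": None, "notes": []}
    for child in root:
        name = child.tag.rsplit("}", 1)[-1]  # drop the '{namespace}' prefix
        text = (child.text or "").strip()
        if name == "available":
            status["available"] = text.lower() in ("true", "1")
        elif name == "note":
            status["notes"].append(text)
        elif name in status:
            status[name] = text
    return status


status = parse_availability(SAMPLE_AVAILABILITY)
print(status)
```

A workflow engine could run a check like this before invoking a service and fall back to an alternate when `available` is false.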
  • 23. The Virtual Observatory: VOTables. XML Format • Characterization of Data • Semantics • UCDs (Unified Content Descriptors) • Data Models • UTypes • Actual Data • Tabular data • Links to binary data
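The split between characterization (FIELD metadata with UCDs) and actual data (table rows) on slide 23 can be seen in a small sketch that reads a TABLEDATA-serialized VOTable with the standard library only. The catalogue name, object names and values are invented, and binary serializations or linked data are not handled; in practice a dedicated library such as STILTS would do this work.

```python
import xml.etree.ElementTree as ET

# Invented VOTable content, for illustration only.
SAMPLE_VOTABLE = """
<VOTABLE version="1.2" xmlns="http://www.ivoa.net/xml/VOTable/v1.2">
  <RESOURCE>
    <TABLE name="hypothetical_catalogue">
      <FIELD name="name" datatype="char" arraysize="*" ucd="meta.id;meta.main"/>
      <FIELD name="ra" datatype="double" unit="deg" ucd="pos.eq.ra;meta.main"/>
      <FIELD name="dec" datatype="double" unit="deg" ucd="pos.eq.dec;meta.main"/>
      <DATA>
        <TABLEDATA>
          <TR><TD>GAL-A</TD><TD>10.5</TD><TD>41.2</TD></TR>
          <TR><TD>GAL-B</TD><TD>150.1</TD><TD>-2.3</TD></TR>
        </TABLEDATA>
      </DATA>
    </TABLE>
  </RESOURCE>
</VOTABLE>
""".strip()


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts on element tags."""
    return tag.rsplit("}", 1)[-1]


def read_votable(xml_text):
    """Return (fields, rows): FIELD attributes and rows keyed by field name."""
    root = ET.fromstring(xml_text)
    fields = [dict(el.attrib) for el in root.iter() if local_name(el.tag) == "FIELD"]
    names = [field["name"] for field in fields]
    rows = []
    for tr in root.iter():
        if local_name(tr.tag) != "TR":
            continue
        values = [(td.text or "").strip() for td in tr if local_name(td.tag) == "TD"]
        rows.append(dict(zip(names, values)))
    return fields, rows


fields, rows = read_votable(SAMPLE_VOTABLE)
# The UCD says what a column means, independently of its name.
ra_column = next(f["name"] for f in fields if f["ucd"].startswith("pos.eq.ra"))
print(ra_column, rows)
```

Selecting columns by UCD instead of by name is what lets the same workflow step run against catalogues from different providers.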
  • 24. The Virtual Observatory: Ontologies, SKOS Vocabularies. M16
  • 31. A Cloud of Services: The next generation of archives. Much wider FoV and spectral coverage • Large volumes for an observed datacube • Subproducts are Virtual Data generated on-the-fly. Automated surveys • Huge amounts of tabular data • Services for Knowledge Discovery in Databases
  • 32. A Cloud of Services: Cube sizes. ASKAP Cubes. Prof. Kevin Vinsen
  • 33. A Cloud of Services: The overall picture. Distributed, scalable and flexible infrastructure • Grid + Cloud may solve storage and processing • Bandwidth is the issue. Big Data Science performance is highly dependent upon I/O data rates (local and transfer). The data is the infrastructure • Interconnected and interoperable archives • Distributed, multi-wavelength and multi-facilities. Archives speaking Web Services: ALMA, LSST, ASKAP, MeerKAT, LOFAR, Apertif, ...
  • 34. A Cloud of Services: The overall picture. We are moving into a world where • computing and storage are cheap • data movement is death. Archives should evolve from data providers into virtual data and services providers, where web services may help to solve bandwidth issues. Web Services • Smaller virtual data subproducts • Distributed, multi-archive, multi-wavelength astronomy • Workflows as a disruptive working methodology
  • 35. A Cloud of Services: 3D Data Services. • Cutout • Resample • Spectrum extraction • 2D slice extraction • Dimensional reduction • Filtering/Flagging • 2D Moments • Complex transformations
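A few of the 3D data services listed on slide 35 can be sketched on a toy velocity cube held as nested lists indexed `cube[channel][y][x]`. The cube values, velocities and channel width below are invented, and the function names are hypothetical; the point is only that each service returns a subproduct much smaller than the cube itself.

```python
def cutout(cube, y0, y1, x0, x1):
    """Spatial cutout: keep rows y0..y1-1 and columns x0..x1-1 in every channel."""
    return [[row[x0:x1] for row in plane[y0:y1]] for plane in cube]


def extract_spectrum(cube, y, x):
    """Spectrum extraction: intensity along the velocity axis at one pixel."""
    return [plane[y][x] for plane in cube]


def moment0(cube, channel_width):
    """Moment 0 map: intensity integrated over velocity for each pixel."""
    ny, nx = len(cube[0]), len(cube[0][0])
    return [[sum(extract_spectrum(cube, y, x)) * channel_width
             for x in range(nx)] for y in range(ny)]


def moment1(cube, velocities):
    """Moment 1 map: intensity-weighted mean velocity (None where there is no emission)."""
    ny, nx = len(cube[0]), len(cube[0][0])
    result = []
    for y in range(ny):
        row = []
        for x in range(nx):
            spectrum = extract_spectrum(cube, y, x)
            total = sum(spectrum)
            if total == 0:
                row.append(None)
            else:
                row.append(sum(i * v for i, v in zip(spectrum, velocities)) / total)
        result.append(row)
    return result


# Toy cube: 3 velocity channels of 2x2 pixels (invented values).
cube = [
    [[0.0, 1.0], [0.0, 0.0]],
    [[2.0, 1.0], [0.0, 4.0]],
    [[0.0, 3.0], [0.0, 0.0]],
]
velocities = [100.0, 200.0, 300.0]  # km/s, one per channel

spectrum = extract_spectrum(cube, 0, 1)
m0 = moment0(cube, channel_width=100.0)
m1 = moment1(cube, velocities)
print(spectrum, m0, m1)
```

Served remotely, only the spectrum or the 2D moment map would cross the network, which is the bandwidth argument made on the preceding slides.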
  • 36. Scientific Use Cases: Exploration services. KDD - Knowledge Discovery in Databases. Understand what information is contained within the data in order to know how we can efficiently extract it • Anomaly detection • Cross-matching data • Dimensionality reduction. Extraction of scientifically relevant information from a multidimensional parameter space. VisIt software.
  • 37. Scientific Use Cases: Data Mining. Some key astronomy problems that can be addressed with data mining techniques: • Cross-Match objects from different catalogues • The distance problem (e.g., Photometric Redshift estimators) • Star-Galaxy Separation • Cosmic-Ray Detection in images • Supernova Detection and Classification • Morphological Classification (galaxies, AGN, gravitational lenses, ...) • Class and Subclass Discovery (brown dwarf stars, ...) • Dimension Reduction = Correlation Discovery • Learning Rules for improved classifiers • Classification of massive data streams • Real-time Classification of Astronomical Events • Clustering of massive data collections • Novelty, Anomaly, Outlier Detection in massive databases
  • 41. Scientific Use Cases: Clustering. Cepheid Variables: Cosmic yardsticks. One Correlation, Two Classes!
  • 43. Scientific Use Cases: Self Organizing Map. Organizing information in complex data collections. Find hidden relationships and patterns. Based on links among keywords and metadata.
  • 44. Scientific Use Cases: The time domain. • VO Sky Event reporting metadata • What, Where, Who, How? • Stellar flares, GRBs, solar and atmospheric particle bursts, ... The Helio-VO Project.
  • 45. Scientific Use Cases: The VO-Experiment. • Data Mining Oriented • VO Services • Discovery • Access • Waiting for analysis services • Local software (also some Web portals) • Crossmatching • Inspection • Visualization • Web services associated with archives of big facilities • Hinders cross-boundary science
  • 46. Scientific Use Cases: XMM Observations of the AMIGA Sample. TOPCAT Hands-On. Let’s do some science!
  • 47. Scientific Use Cases: XMM Observations of the AMIGA Sample. Slightly brighter
  • 48. Scientific Use Cases: XMM Observations of the AMIGA Sample. Slightly brighter. Closer
  • 49. Scientific Use Cases: XMM Observations of the AMIGA Sample. Slightly brighter. Closer. Brighter in FIR
  • 50. Scientific Use Cases: XMM Observations of the AMIGA Sample. Slightly brighter. Closer. Brighter in FIR. Excess in longer
  • 51. Wf4Ever: Why Workflows? Web-services-based vs. Pipelines. • Expose the scientific methodology • Keep the provenance • Pack the experiment • Enable • repeatable results • reproducibility • reuse, repurpose • cross-boundary science • preservation
  • 52. Wf4Ever: Workflows Preservation. All components related to the research lifecycle should be available, preserved and easily retrievable: • Proposals • Data • Processes • Workflows • Publications
  • 53. IVOA Wf: Open questions for Web Services in the Virtual Observatory. • Curation and preservation (identifiers) • Discovery (semantics) of web services • Characterization: inputs, outputs, functionality, etc. • Copies (authenticity) or similar services used as alternates • Permissions (authentication), licenses, platform, costs, ... • Metrics for quality: popularity, use stats, logs, uptime, etc. • Versioning and authoring (referenced and acknowledged). In a cloud of services and data, Web Services should benefit from the same privileges acquired by Data.
  • 54. IVOA Wf: IVOA Note on Workflows
  • 55. MyExperiment: Astronomy • No VO services-based Wfs • Helio Project Wfs • VOTables parsing • Internal services. Amiga • Querying Catalogue
  • 57. Taverna: Simple AMIGA ConeSearch. • XPath plugin not useful for extracting info from VOTables • Helio-VO beanshell used instead (Thanks!) • Visualization of results (VOTables)
  • 58. Taverna: XMM Multi-ConeSearch. • A lot of prior VOTable parsing • The response is 1051 VOTables! • VOTable merging tool needed
  • 59. Taverna: AMIGA Multi-ConeSearch. • A lot of beanshells for VOTable and CSV parsing • Beanshell development needed for splitting lists into values • STILTS Library needed for VOTable crossmatching
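To make the crossmatching step on slide 59 concrete, here is a minimal positional crossmatch on two small in-memory catalogues. It is a brute-force nearest-neighbour sketch, not the algorithm STILTS uses, and the source names, coordinates and the 5 arcsec radius are invented for illustration.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula, inputs in degrees)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(a))))


def crossmatch(cat_a, cat_b, radius_deg):
    """For each source in cat_a, keep the nearest cat_b source within radius_deg.

    Brute force, O(len(cat_a) * len(cat_b)); fine for small tables only.
    """
    matches = []
    for a in cat_a:
        best, best_sep = None, radius_deg
        for b in cat_b:
            sep = angular_separation_deg(a["ra"], a["dec"], b["ra"], b["dec"])
            if sep <= best_sep:
                best, best_sep = b, sep
        if best is not None:
            matches.append((a["name"], best["name"], best_sep))
    return matches


# Invented catalogues: an optical sample and an X-ray source list.
galaxies = [
    {"name": "A1", "ra": 10.0, "dec": 20.0},
    {"name": "A2", "ra": 200.0, "dec": -5.0},
]
xray_sources = [
    {"name": "X1", "ra": 10.001, "dec": 20.0},
    {"name": "X2", "ra": 50.0, "dec": 50.0},
]

matches = crossmatch(galaxies, xray_sources, radius_deg=5.0 / 3600.0)
print(matches)
```

With thousands of cone-search responses to merge, an indexed matcher would be needed in place of the double loop, which is the motivation for calling a dedicated library from the workflow.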
  • 60. Taverna: The VO-experiment. • Discover Services • Multi-query • Crossmatching • Inspection • Visualization and Comparison. Proposed shortcuts for Taverna: • VORegistry Access Perspective • STILTS VOTable Library • SAMP (Connectivity with VO Software) • Python based beanshells • Simple standard astronomy functions
  • 61. Thanks! Wf4Ever @ Manchester: • Carole Goble • Sean Bechhofer • Jiten Bhagat • Stian Soiland-Reyes • Khalid Belhajjame. Helio-VO: • John Brooke • Donal Fellows • Anja Le Blanc