Your SlideShare is downloading. ×
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed Experience and Perspectives
Upcoming SlideShare
Loading in...5

Thanks for flagging this SlideShare!

Oops! An error has occurred.

Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed Experience and Perspectives


Published on

Published in: Technology

  • Be the first to comment

  • Be the first to like this

No Downloads
Total Views
On Slideshare
From Embeds
Number of Embeds
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

No notes for slide
  • Transcript

    • 1. Work Group Meeting on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed Experience and Perspectives Rudolf B. Husar CAPITA, Washington University, St. Louis, MO Ispra, JRC, March 14. 2004
      • Data handling approach
      • Software tools
      • Participant involvement
      • Data dissemination
      • Integration problems
      • Summary
    • 2. Policy Guidance for HTAP IT
      • Terry Keating, HTAP Task Force Co-Chair:
      • Transparency of the HTAP Process
        • Acceptance of the Tech Report will depend on openness
      • Inclusiveness and Ease of Participation
        • Facilitate participation by smaller contributors
    • 3. Current Air Quality Information ‘Ecosystem’
      • AQ info is distributed over many ‘dimensions’: Geography, Content, Agency, Program…
      • AQ info content includes: emissions , ambient & satellite data and model outputs
      • Info is provided and consumed by different agencies , (NASA, NOAA, EPA…)
      • Providers have different access protocols , formats, and information usage conditions
      Lack of Interoperability Poor data & model utilization Less informed decision making
    • 4. GEOSS Info Flow Architecture A General Framework Accepted by Members Model Model Data Data
    • 5. Federated Network for Air Quality Data and Processing Services Project Team: Software Architecture: R. Husar Software Implementation: K. Höijärvi Data and Applications: S. Falke, R. Husar Data Handling Approach
    • 6. Federated Data System for Air Quality
      • The challenge is to design a general supportive infrastructure
      • Simply connecting the relevant provides and users for each info product is messy
      • The info system infrastructure needs to facilitate the creation of info products
      AQ Compliance Nowcast/Forecast Status & Trends Find Data Gaps ID New Problems ……… Info Needs Reports
      • Providers supply the ‘raw material’ (data and models) for ‘refined’ info products
      Emission Surface Satellite Model Single Datasets Providers Wrappers Where? What? When? Federate Data Structuring
      • Structuring the heterogeneous data into where-when-what ‘cubes’ simplifies the mess
      Slice & Dice Explore Data Viewers
      • The ‘cubed’ data can be accessed and explored by slicing-dicing tools
      Programs Integrate Understand
      • More elaborate data integration and fusion can be done by web service chaining
      Non-intrusive Linking & Mediation Data Users Data Providers
    • 7. Data Handling Approach: DataFed
      •   Approach: Mediation Between Users and Data Providers
      • DataFed assumes autonomous data management ( a la Internet)
      • Non-intrusive third-party data wrapping for unified web service (WS) access
      • End-user programming of applications through chaining of WS components
      • Applications
      • Building browsers and analysis tools for distributed monitoring data   
      • Serve as data and service resources for user programs (science, GIS tools)
      • Support application projects , e.g. FASTNET, Exceptional Event Rule
    • 8. Typical DataFed AQ Analysis Tools Consoles : Data from diverse sources are displayed to create a rich context for exploration and analysis CATT : Combined Aerosol Trajectory Tool for the browsing backtrajectories for specified chemical conditions Viewer: General purpose spatio-temporal data browser and view editor applicable for all DataFed datasets
    • 9. Quebec Smoke July, 7, 2002 SeaWiFS Satellite Aerosol Chemical Air Trajectory Map Boarder Web Service Composition
    • 10. Software Tools: Demo: Networked Data Access, Processing and Fusion AeroCOM – VIEWS SO4
      • http:// =VIEWS_KYU
    • 11. Visualization Goal: 4D, User-selectable layers Image below completed in 1998 Satellite Data Layers: Land Reflectance (SeaWiFS) Fire Pixels (ASTR Night) High Cloud (GOES, Meteosat, GMS2) Aerosol AOT (AVHRR)
    • 12. 4D Dynamic Visualization Demonstrate interactions, allow exploration ‘Google Earth’ for Earth Science is now possible
    • 13. Earth Science Information Partners Air Quality Cluster 1. Serve as facilitator to Earth Science information community . 2. Promote efficient flow of ES data from collection to end-use . 3. Improve quality and usability of ES data and info systems 4. Expand the use of Earth science information Participant Involvement Partners
      • N ASA
      • NOAA
      • EPA
      • USGS
      • DOE
      • NSF
      • Industry…
    • 14. Data Dissemination & Use – Service Based
      • Provide Catalog Services (Publish, Find data)
      • Allow Data Access (‘Bind’) – Use International Standards
      • Add tools for exploration and analysis
      Near Real Time Data Integration Delayed Data Integration Surface Air Quality AIRNOW O3, PM25 ASOS_STI Visibility, 300 sites METAR Visibility, 1200 sites VIEWS_OL 40+ Aerosol Parameters Satellite MODIS_AOT AOT, Idea Project GASP Reflectance, AOT TOMS Absorption Indx, Refl. SEAW_US Reflectance, AOT Model Output NAAPS Dust, Smoke, Sulfate, AOT WRF Sulfate Fire Data HMS_Fire Fire Pixels MODIS_Fire Fire Pixels Surface Meteorology RADAR NEXTRAD SURF_MET Temp, Dewp, Humidity… SURF_WIND Wind vectors ATAD Trajectory, VIEWS locs.
    • 15. WCS - Interoperable Data Access Service netCDF – Machine independent encoding ncML – XML data description CF – Naming, structure convention OGC Web Coverage Service - Interoperable Data Access Query Data Syntax + Semantics Coverage (parameter) BBOX Time Range netCDF Example Profile
      • SERVICE=wcs ‘service
      • REQUEST=GetCoverage,VER=1.0 ‘service method
      • COVERAGE=AIRNOW.pmfine ‘what
      • CRS=EPSG:4326 ‘projection
      • BBOX=-125,22,-61,51,0,0 ‘where
      • TIME=2005-06-6T15:00:00Z ‘when
      • FORMAT=netCDF ‘return format
    • 16. GALEON Interoperability Experiment Geo-interface for Atmosphere, Land, Earth, and Ocean netCDF Unify Earth Science & GIS Data Flows Strong European Participation IT – S. Nativi, L. Bigagli UK – Andrew Wolf DE – Peter Bauman) B. Domenico B. Domenico GALEON UNIDATA U Florence/CNR-IMMA WCS Server
    • 17. OGC WCS Demonstration: THREDDS_GFS 4Dim Dataset
      • Lat/Lon Box Elev Range Time Range
      • Map: BBOX=-180,-90,180,90, 1350,1350& TIME=2005-12-06/2005-12-06/PT3H
      • Time: BBOX=-34,49.05,-34,49.05, 1350,1350& TIME=2005-12-05/2005-12-08/PT3H
      • Elev: BBOX=-34,49.05,-34,49.05, 0,18000 & TIME=2005-12-06/2005-12-06/PT3H
      The form of the WCS query is the same for all slices through the data cube (views) The only difference in the views is the thickness of the slices in each dimension Return grid is in multiple formats (NetCDF, CSV, GML, PNG, … ) Map View Services WCS Query Time View Services WCS Query Elevation View Services WCS Query
    • 18. Summary
      • Current systems data & model analysis are heterogeneous
      • Standardization is a key need for agile IT systems
      • Non-intrusive mediators can achieve virtual standardization
      • Technologies are currently available for dynamic NETWORKING
      • We eager to share our networked data, tools and methods
    • 19. OGC WCS Demonstration: Grid, Image, Station Data Types
      • Coverage=THEEDDS.T& BBOX=-126,24,-65,52,0,0 &TIME=2002-07-07/2002-07-07&FORMAT=NetCDF
      • Coverage=SURF.Bext& BBOX=-126,24,-65,52,0,0 &TIME=2002-07-07/2002-07-07&FORMAT=NetCDF-table
      • Coverage=SEAW.Refl& BBOX=-126,24,-65,52,0,0 &TIME=2002-07-07/2002-07-07&FORMAT=GeoTIFF
      • COVERAGE=sst& BBOX=-126,24,-65,52,0,0 &TIME=2001-01-01,2001-01-01&FORMAT=NetCDF
      UNIDATA – THREDDS/GALEON WCS DataFed GALEON WCS U Florence, It GALEON WCS DataFed GALEON WCS Grid Grid Image Station Services WCS Query Services WCS Query Services WCS Query Services WCS Query
    • 20. Single Data Model for All AQ Data
      • Most Views are slices through a cube of data organized by lat, lon, altitude, and time (X,Y,Z,T)
      Multidimensional Data Cube
    • 21. OGC Web Coverage Service (WCS) Specification
      • HTTP GET/POST based interfaces
      • Services have XML service descriptions (“Capabilities”, “Description”)
      • Filter parameters allow selection of subsets of source data
      • Output formats advertised by each service instance
      OGC WCS getCoverage Schema Suitable for wrapping with SOAP envelope, WSDL access, loose coupling WCS is for "coverages" – information representing space -time- varying phenomena WCS describes, requests and delivers coverages in spatio-temporal domain WCS version 1.1 is limited to grids/"simple” coverages with homogeneous range sets
    • 22. AQ Monitoring Network Data Storage and Delivery through OGC Protocols Relational Data Model Star Schema WMS WCS SOS SensorML WFS Observations Station Info. Param/Sensor/Method Data View Services WMS Stations Par-Meth Observations SOS WCS Data Access
    • 23. Technology Support for ‘Integrated Solutions’ Air Quality Information System
    • 24. Data Access through Adapters through DataFed SOAP,HTTP Get OGC WCS HTTP Get, Post OGC WMS HTTP Get Station-Point SQL Server, Files… Sequence Image, file nDim Grid OpenDAP NetCDF, … Other Traject., Event, Pic Sources Diverse formats Many data models Data Wrapper Data into geo-cubes Queries to views Virtual Data Cube Global geo-cube data model Makes queries data-neutral Others? e.g. OpenDAP Output Protocol dependent User specified GeoTable CSV,XLS,GML GeoGrid GML,NetCDF.. GeoImage GeoTIFF, PNG.. Other MS Dataset.. Query Adapter Maps query to protocol User selects protocols