Integration and Management of Diverse Environmental Data Sets

968 views
891 views

Published on

Presentation I gave as part of the New Frontiers in Data Integration session at Summit 09 in Banff on Oct. 14, 2009. It discusses some current work that the Grid Research Centre is doing in relation to data management and integration.

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
968
On SlideShare
0
From Embeds
0
Number of Embeds
9
Actions
Shares
0
Downloads
8
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Integration and Management of Diverse Environmental Data Sets

  1. 1. Integration and Management of Diverse Environmental Data Sets Cameron Kiddle Research Fellow Grid Research Centre University of Calgary
  2. 2. Outline <ul><li>Data Challenges </li></ul><ul><li>GeoChronos (Spectral Libraries) </li></ul><ul><li>Spectral Library Demonstration </li></ul><ul><li>Cloud Services for Water Management </li></ul><ul><li>Cloud Services for Water Management Demonstration </li></ul>Summit 09 Oct. 14, 2009
  3. 3. Data Challenges - Acquisition <ul><li>Many different data sources </li></ul><ul><li>Different regulations/mechanisms for accessing data </li></ul><ul><li>Lack of automation </li></ul><ul><li>Finding the right data </li></ul>Summit 09 Oct. 14, 2009
  4. 4. Data Challenges - Management <ul><li>Scattered and unorganized data </li></ul><ul><li>Inadequate tools for recording/maintaining metadata </li></ul><ul><ul><li>Data without metadata is meaningless </li></ul></ul><ul><ul><li>Lack of suitable metadata standards </li></ul></ul><ul><ul><li>Validation of metadata </li></ul></ul><ul><li>Tracking provenance of data </li></ul>Summit 09 Oct. 14, 2009
  5. 5. Data Challenges – Pre-processing <ul><li>Raw data typically cannot be directly analyzed </li></ul><ul><li>Significant amount of time spent preparing data for analysis </li></ul><ul><li>Lack of automation </li></ul>Summit 09 Oct. 14, 2009
  6. 6. GeoChronos <ul><li>Partners </li></ul><ul><ul><li>CANARIE (NEP-1) </li></ul></ul><ul><ul><li>Center for Earth Observation Sciences, University of Alberta </li></ul></ul><ul><ul><li>Cybera </li></ul></ul><ul><ul><li>Grid Research Centre, University of Calgary </li></ul></ul>Summit 09 Oct. 14, 2009
  7. 7. GeoChronos <ul><li>An on-line platform (http://geochronos.org/) </li></ul><ul><ul><li>For: </li></ul></ul><ul><ul><ul><li>Earth Observation Scientists </li></ul></ul></ul><ul><ul><li>Facilitating: </li></ul></ul><ul><ul><ul><li>Collaboration between scientists </li></ul></ul></ul><ul><ul><ul><li>Application access, management and sharing </li></ul></ul></ul><ul><ul><ul><li>Data access, management and sharing </li></ul></ul></ul><ul><ul><li>Leveraging: </li></ul></ul><ul><ul><ul><li>Web 2.0 and social networking technologies </li></ul></ul></ul><ul><ul><ul><li>Cloud computing technologies </li></ul></ul></ul><ul><ul><ul><li>Semantic Web technologies </li></ul></ul></ul>Summit 09 Oct. 14, 2009
  8. 8. GeoChronos <ul><li>Data Solutions - Spectral Libraries </li></ul><ul><ul><li>Store, share and browse spectral data </li></ul></ul><ul><ul><li>View spectral plots, metadata, ancillary data and maps </li></ul></ul><ul><ul><li>Manage and generate metadata for spectra </li></ul></ul><ul><ul><li>Create and share metadata schemas </li></ul></ul><ul><li>Technology </li></ul><ul><ul><li>iRODS ( http://www.irods.org/ ) for data storage/management </li></ul></ul><ul><ul><li>Semantic Web technologies such as RDF (Resource Description Framework) to link/relate data </li></ul></ul><ul><li>Next Steps </li></ul><ul><ul><li>Generalization of spectral library solution - acquire, store, manage, browse and share other types of data (i.e., satellite, flux, phenology, meteorological, etc.) </li></ul></ul><ul><ul><li>Automate data workflows (i.e., mosaic, reproject and subset MODIS data) using cloud-based services </li></ul></ul>Summit 09 Oct. 14, 2009
  9. 9. Spectral Library - Demonstration Summit 09 Oct. 14, 2009
  10. 10. Spectral Library - Browse Summit 09 Oct. 14, 2009
  11. 11. Spectral Library - Metadata Summit 09 Oct. 14, 2009
  12. 12. Spectral Library - Ancillary Data Summit 09 Oct. 14, 2009
  13. 13. Cloud Services for Water Management <ul><li>Partners </li></ul><ul><ul><li>Alberta Advanced Education and Technology </li></ul></ul><ul><ul><li>Alberta WaterSMART </li></ul></ul><ul><ul><li>Cybera </li></ul></ul><ul><ul><li>Geosensor Web Lab, University of Calgary </li></ul></ul><ul><ul><li>Grid Research Centre, University of Calgary </li></ul></ul><ul><ul><li>Tesera Systems Inc </li></ul></ul>Summit 09 Oct. 14, 2009
  14. 14. Cloud Services for Water Management <ul><li>In preliminary stages of project </li></ul><ul><li>Explore use of cloud services to store, manipulate and expose data related to water management </li></ul><ul><li>Link and correlate a wide variety of data from a large number of sources </li></ul><ul><li>Cloud-based analysis and visualization tools </li></ul><ul><li>Investigate integration with existing Alberta WaterPortal (http://www.albertawater.com/) </li></ul>Summit 09 Oct. 14, 2009
  15. 15. Cloud Services for Water Management - Demonstration Summit 09 Oct. 14, 2009
  16. 16. Data Types and Locations Displayed on Google Earth Summit 09 Oct. 14, 2009
  17. 17. Water Flow Data Shown using Google Visualization API Summit 09 Oct. 14, 2009
  18. 18. Integrating and Visualizing Different Data Sets From Calgary’s Flood in 2005 on Google Earth Summit 09 Oct. 14, 2009
  19. 19. Contact Information Summit 09 Oct. 14, 2009 Cameron Kiddle [email_address] http://pages.cspc.ucalgary.ca/~kiddlec/ http://grid.ucalgary.ca/

×