SlideShare a Scribd company logo
1 of 18
Download to read offline
Dataset Independent Subsetting
A Dataset Independent Subsetting Prototype
http://minnie.cs.uah.edu/
Matthew R. Smith - matt.smith@msfc.nasa.gov
Bruce Beaumont
Dr. Sara J. Graves
The University of Alabama in Huntsville
Information Technology & Systems Laboratory

UAH

The University of Alabama in Huntsville

8-10 September 1997
Outline
Context
Purpose
Design
Functionality
Web pages
Future
Summary

UAH

The University of Alabama in Huntsville

8-10 September 1997
Context
NASA’s Mission to Planet Earth (MTPE)
Earth Observing System (EOS)
Data and Information System (DIS)
EOSDIS Core System (ECS) Contractor:
Hughes Information Technology Systems

Design and Implement a prototype datasetindependent subsetter

UAH

The University of Alabama in Huntsville

8-10 September 1997
Subsetting?
l Goal:

to provide a science data user with only
the data they request as quickly as possible.

l

Benefits science data users and data centers:
- reduces analysis time by reducing amount of data
- reduces time for data delivery
- reduces resources (network, personnel, media, etc.)

l

Steps:
- locate spatial, temporal, and spectral area of interest
- extract data
- re-assemble for distribution

UAH

The University of Alabama in Huntsville

8-10 September 1997
Design
Web-based
Dataset - independent
HDF-EOS formatted data
HDF-EOS software library
Data types
Swath
Grid

UAH

The University of Alabama in Huntsville

8-10 September 1997
Functionality
Front-end ( user interface )
Forms-based Web application - obtains subsetting
selection criteria
criteria file (ODL)

Back-end ( subsetter )
C software using HDF-EOS and HDF libraries
executed in batch mode

UAH

The University of Alabama in Huntsville

8-10 September 1997
User Interface
File selection
Parameters/channels
Geographic bounding box
Time range
Subsampling stride
Non-geolocated objects

UAH

The University of Alabama in Huntsville

8-10 September 1997
Summary of Current
Functionality
Subsetter Functionality
Can subset grid and swath data
Files may contain multiple grids and/or swaths; user may select
any or all for subsetting
Subset swath data on latitude/longitude and/or time
Subset grid data on latitude/longitude
Non-geolocated data may be included or excluded
Output is HDF-EOS file using same data types
“Back-end” runs as a batch job at archive center
User may check status of job and/or cancel it
E-mail sent to user when complete
Data retrieved via FTP

UAH

The University of Alabama in Huntsville

8-10 September 1997
Restrictions
Number of subsettable datasets limited by HDF-EOS library
subsetting functions:
Latitude must be “Latitude” or “Colatitude”
Longitude must be “Longitude”
Latitude and longitude must be FLOAT32 or FLOAT64
Latitude and longitude must be 1- or 2-dimensional
Latitude and longitude must have identical dimensions
Time must be “Time”
Time must be FLOAT64 in TAI93 format
Time must be 1- or 2-dimensional
“Track” must be slowest varying dimension in geo fields
Grid data must be in one of six supported projections

UAH

The University of Alabama in Huntsville

8-10 September 1997
Future Plans
Relax requirements for latitude/longitude and time in
swath datasets
Provide Java-based GUI for area-of-interest selection
Allow user to apply one subset specification to multiple
input files
Study integrating subsetter with a data visualization tool
Study separating structural metadata from data

UAH

The University of Alabama in Huntsville

8-10 September 1997
What is Needed
More test datasets in HDF-EOS format
Additional support for modifications to HDF-EOS calls
Accurate HDF-EOS documentation (internal and external)
Functional Java map applet
Resolution of metadata issues
Publication of official metadata standards
Name, content, and format of granule metadata

UAH

The University of Alabama in Huntsville

8-10 September 1997
Risks
HDF-EOS not currently in widespread use
HDF-EOS requirements for dataset-independent
subsetting not widely known to data producers
Legacy datasets are not in HDF-EOS format
Converting to HDF-EOS may increase storage
requirements
Many datasets are on non-volatile media

UAH

The University of Alabama in Huntsville

8-10 September 1997
Summary
A prototype Web-based dataset-independent subsetter has
been developed by UAH.
Allows spatial, temporal, and spectral subsetting and
subsampling of HDF-EOS datasets
Benefits science data users and data centers
Great potential. but limited current use

UAH

The University of Alabama in Huntsville

8-10 September 1997

More Related Content

What's hot

Components of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISComponents of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISKaium Chowdhury
 
QUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISQUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISDEVANG KAPADIA
 
CCLS Internship Presentation
CCLS Internship PresentationCCLS Internship Presentation
CCLS Internship PresentationCharles Naut
 
ppt spatial data
ppt spatial datappt spatial data
ppt spatial dataRahul Kumar
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data miningKrish_ver2
 
2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowskiGIS in the Rockies
 
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...guest3f5c39
 
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...Nikos Koutsoupias
 
EcoTas13 Caddy-Retalic TERN Infrastructure
EcoTas13 Caddy-Retalic TERN InfrastructureEcoTas13 Caddy-Retalic TERN Infrastructure
EcoTas13 Caddy-Retalic TERN InfrastructureTERN Australia
 
Using R for Climate Raster Data Extraction
Using R for Climate Raster Data ExtractionUsing R for Climate Raster Data Extraction
Using R for Climate Raster Data ExtractionMichele Tobias
 
070726 Igarss07 Barcelona
070726 Igarss07 Barcelona070726 Igarss07 Barcelona
070726 Igarss07 BarcelonaRudolf Husar
 
Query optimization and challenges in DDBMS with Review Algorithms.
Query optimization and challenges in DDBMS with Review Algorithms.Query optimization and challenges in DDBMS with Review Algorithms.
Query optimization and challenges in DDBMS with Review Algorithms.Beingprp
 
EOSC for Physics & Astronomy: Radio Astronomy Competence Centre
EOSC for Physics & Astronomy: Radio Astronomy Competence CentreEOSC for Physics & Astronomy: Radio Astronomy Competence Centre
EOSC for Physics & Astronomy: Radio Astronomy Competence CentreEOSC-hub project
 
DARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and SpaceDARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and SpaceMatteo Romanello
 

What's hot (20)

Components of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISComponents of Spatial Data Quality in GIS
Components of Spatial Data Quality in GIS
 
QUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GISQUERY AND NETWORK ANALYSIS IN GIS
QUERY AND NETWORK ANALYSIS IN GIS
 
CCLS Internship Presentation
CCLS Internship PresentationCCLS Internship Presentation
CCLS Internship Presentation
 
ppt spatial data
ppt spatial datappt spatial data
ppt spatial data
 
4.2 spatial data mining
4.2 spatial data mining4.2 spatial data mining
4.2 spatial data mining
 
EWGAT
EWGATEWGAT
EWGAT
 
2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski
 
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...
Aggressive Subchannel Allocation Algorithm For Efficient Dynamic Channel Allo...
 
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...
2019 ifcs Koutsoupias Mikelis - Document Clustering via Multiple Corresponden...
 
EcoTas13 Caddy-Retalic TERN Infrastructure
EcoTas13 Caddy-Retalic TERN InfrastructureEcoTas13 Caddy-Retalic TERN Infrastructure
EcoTas13 Caddy-Retalic TERN Infrastructure
 
TASS Poster
TASS PosterTASS Poster
TASS Poster
 
Using R for Climate Raster Data Extraction
Using R for Climate Raster Data ExtractionUsing R for Climate Raster Data Extraction
Using R for Climate Raster Data Extraction
 
070726 Igarss07 Barcelona
070726 Igarss07 Barcelona070726 Igarss07 Barcelona
070726 Igarss07 Barcelona
 
Spatial Data Model
Spatial Data ModelSpatial Data Model
Spatial Data Model
 
Lessons Learned in Generating Crosswalks for Earth Science Metadata
Lessons Learned in Generating Crosswalks for Earth Science MetadataLessons Learned in Generating Crosswalks for Earth Science Metadata
Lessons Learned in Generating Crosswalks for Earth Science Metadata
 
HDF Town Hall
HDF Town HallHDF Town Hall
HDF Town Hall
 
ADAPTER
ADAPTERADAPTER
ADAPTER
 
Query optimization and challenges in DDBMS with Review Algorithms.
Query optimization and challenges in DDBMS with Review Algorithms.Query optimization and challenges in DDBMS with Review Algorithms.
Query optimization and challenges in DDBMS with Review Algorithms.
 
EOSC for Physics & Astronomy: Radio Astronomy Competence Centre
EOSC for Physics & Astronomy: Radio Astronomy Competence CentreEOSC for Physics & Astronomy: Radio Astronomy Competence Centre
EOSC for Physics & Astronomy: Radio Astronomy Competence Centre
 
DARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and SpaceDARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and Space
 

Viewers also liked

Viewers also liked (20)

The Application of LinkWinds to EOS
The Application of LinkWinds to EOSThe Application of LinkWinds to EOS
The Application of LinkWinds to EOS
 
IBM Visualization Data Explorer
IBM Visualization Data ExplorerIBM Visualization Data Explorer
IBM Visualization Data Explorer
 
An Introduction to HDF (1997)
An Introduction to HDF (1997)An Introduction to HDF (1997)
An Introduction to HDF (1997)
 
HDF-EOS Software Developer/Vendor Workshop Wrapup
HDF-EOS Software Developer/Vendor Workshop WrapupHDF-EOS Software Developer/Vendor Workshop Wrapup
HDF-EOS Software Developer/Vendor Workshop Wrapup
 
Incorporating ISO Metadata Using HDF Product Designer
Incorporating ISO Metadata Using HDF Product DesignerIncorporating ISO Metadata Using HDF Product Designer
Incorporating ISO Metadata Using HDF Product Designer
 
Current HDF Tools (1997)
Current HDF Tools (1997)Current HDF Tools (1997)
Current HDF Tools (1997)
 
HDF And HDF-EOS Tools
HDF And HDF-EOS ToolsHDF And HDF-EOS Tools
HDF And HDF-EOS Tools
 
PCMDI Software System
PCMDI Software SystemPCMDI Software System
PCMDI Software System
 
Pilot Project for HDF5 Metadata Structures for SWOT
Pilot Project for HDF5 Metadata Structures for SWOTPilot Project for HDF5 Metadata Structures for SWOT
Pilot Project for HDF5 Metadata Structures for SWOT
 
NEON HDF5
NEON HDF5NEON HDF5
NEON HDF5
 
SPD and KEA: HDF5 based file formats for Earth Observation
SPD and KEA: HDF5 based file formats for Earth ObservationSPD and KEA: HDF5 based file formats for Earth Observation
SPD and KEA: HDF5 based file formats for Earth Observation
 
Utilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud ComputingUtilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud Computing
 
HDF Update 2016
HDF Update 2016HDF Update 2016
HDF Update 2016
 
ICESat-2 Metadata and Status
ICESat-2 Metadata and StatusICESat-2 Metadata and Status
ICESat-2 Metadata and Status
 
HDF Cloud Services
HDF Cloud ServicesHDF Cloud Services
HDF Cloud Services
 
Scientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDFScientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDF
 
An HDF-EOS DataBlade using Informix's Object-Relational Database
An HDF-EOS DataBlade using Informix's Object-Relational DatabaseAn HDF-EOS DataBlade using Informix's Object-Relational Database
An HDF-EOS DataBlade using Informix's Object-Relational Database
 
Breakthrough Listen
Breakthrough ListenBreakthrough Listen
Breakthrough Listen
 
Matlab, Big Data, and HDF Server
Matlab, Big Data, and HDF ServerMatlab, Big Data, and HDF Server
Matlab, Big Data, and HDF Server
 
Metadata Requirements for EOSDIS Data Providers
Metadata Requirements for EOSDIS Data ProvidersMetadata Requirements for EOSDIS Data Providers
Metadata Requirements for EOSDIS Data Providers
 

Similar to Dataset Independent Subsetting

The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Fr1T101-Kuo-20110729 IGARSS ESC.pptx
Fr1T101-Kuo-20110729 IGARSS ESC.pptxFr1T101-Kuo-20110729 IGARSS ESC.pptx
Fr1T101-Kuo-20110729 IGARSS ESC.pptxgrssieee
 
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...The HDF-EOS Tools and Information Center
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentationTERN Australia
 
2004-11-13 Supersite Relational Database Project: (Data Portal?)
2004-11-13 Supersite Relational Database Project: (Data Portal?)2004-11-13 Supersite Relational Database Project: (Data Portal?)
2004-11-13 Supersite Relational Database Project: (Data Portal?)Rudolf Husar
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Smr Fastnet Presentation Take2 Pubs
Smr Fastnet Presentation Take2 PubsSmr Fastnet Presentation Take2 Pubs
Smr Fastnet Presentation Take2 PubsRudolf Husar
 
Supercharging your Apache OODT deployments with the Process Control System
Supercharging your Apache OODT deployments with the Process Control SystemSupercharging your Apache OODT deployments with the Process Control System
Supercharging your Apache OODT deployments with the Process Control SystemChris Mattmann
 
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...chrismalzone
 
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...Larry Smarr
 
Building a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureBuilding a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureLarry Smarr
 
060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 IspraRudolf Husar
 
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...Rudolf Husar
 
Managing research data at Bristol
Managing research data at BristolManaging research data at Bristol
Managing research data at BristolSimon Price
 
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusRecent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusGlobus
 

Similar to Dataset Independent Subsetting (20)

Subsetting
SubsettingSubsetting
Subsetting
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Fr1T101-Kuo-20110729 IGARSS ESC.pptx
Fr1T101-Kuo-20110729 IGARSS ESC.pptxFr1T101-Kuo-20110729 IGARSS ESC.pptx
Fr1T101-Kuo-20110729 IGARSS ESC.pptx
 
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...
Implementation of OGC Web Coverage Service Using HDF5/HDF-EOS5 as the Base Fi...
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
 
2004-11-13 Supersite Relational Database Project: (Data Portal?)
2004-11-13 Supersite Relational Database Project: (Data Portal?)2004-11-13 Supersite Relational Database Project: (Data Portal?)
2004-11-13 Supersite Relational Database Project: (Data Portal?)
 
Srds Pres011120
Srds Pres011120Srds Pres011120
Srds Pres011120
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Welcome to HDF Workshop V
Welcome to HDF Workshop VWelcome to HDF Workshop V
Welcome to HDF Workshop V
 
Smr Fastnet Presentation Take2 Pubs
Smr Fastnet Presentation Take2 PubsSmr Fastnet Presentation Take2 Pubs
Smr Fastnet Presentation Take2 Pubs
 
Supercharging your Apache OODT deployments with the Process Control System
Supercharging your Apache OODT deployments with the Process Control SystemSupercharging your Apache OODT deployments with the Process Control System
Supercharging your Apache OODT deployments with the Process Control System
 
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...
Moving beyond the 4th Dimension of Quantifying, Analyzing & Visualizing A...
 
Grid1
Grid1Grid1
Grid1
 
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...
A High-Performance Campus-Scale Cyberinfrastructure: The Technical, Political...
 
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
CLIM Program: Remote Sensing Workshop, Satellites and Stovepipes - Jay Morris...
 
Building a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureBuilding a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration Infrastructure
 
060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra060314 Ispra Htap Presentations Husar 060314 Ispra
060314 Ispra Htap Presentations Husar 060314 Ispra
 
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
2006-03-14 WG on HTAP-Relevant IT Techniques, Tools and Philosophies: DataFed...
 
Managing research data at Bristol
Managing research data at BristolManaging research data at Bristol
Managing research data at Bristol
 
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusRecent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
 

More from The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

More from The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 

Recently uploaded

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Dataset Independent Subsetting

  • 1. Dataset Independent Subsetting A Dataset Independent Subsetting Prototype http://minnie.cs.uah.edu/ Matthew R. Smith - matt.smith@msfc.nasa.gov Bruce Beaumont Dr. Sara J. Graves The University of Alabama in Huntsville Information Technology & Systems Laboratory UAH The University of Alabama in Huntsville 8-10 September 1997
  • 3. Context NASA’s Mission to Planet Earth (MTPE) Earth Observing System (EOS) Data and Information System (DIS) EOSDIS Core System (ECS) Contractor: Hughes Information Technology Systems Design and Implement a prototype datasetindependent subsetter UAH The University of Alabama in Huntsville 8-10 September 1997
  • 4. Subsetting? l Goal: to provide a science data user with only the data they request as quickly as possible. l Benefits science data users and data centers: - reduces analysis time by reducing amount of data - reduces time for data delivery - reduces resources (network, personnel, media, etc.) l Steps: - locate spatial, temporal, and spectral area of interest - extract data - re-assemble for distribution UAH The University of Alabama in Huntsville 8-10 September 1997
  • 5. Design Web-based Dataset - independent HDF-EOS formatted data HDF-EOS software library Data types Swath Grid UAH The University of Alabama in Huntsville 8-10 September 1997
  • 6. Functionality Front-end ( user interface ) Forms-based Web application - obtains subsetting selection criteria criteria file (ODL) Back-end ( subsetter ) C software using HDF-EOS and HDF libraries executed in batch mode UAH The University of Alabama in Huntsville 8-10 September 1997
  • 7. User Interface File selection Parameters/channels Geographic bounding box Time range Subsampling stride Non-geolocated objects UAH The University of Alabama in Huntsville 8-10 September 1997
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13. Summary of Current Functionality Subsetter Functionality Can subset grid and swath data Files may contain multiple grids and/or swaths; user may select any or all for subsetting Subset swath data on latitude/longitude and/or time Subset grid data on latitude/longitude Non-geolocated data may be included or excluded Output is HDF-EOS file using same data types “Back-end” runs as a batch job at archive center User may check status of job and/or cancel it E-mail sent to user when complete Data retrieved via FTP UAH The University of Alabama in Huntsville 8-10 September 1997
  • 14. Restrictions Number of subsettable datasets limited by HDF-EOS library subsetting functions: Latitude must be “Latitude” or “Colatitude” Longitude must be “Longitude” Latitude and longitude must be FLOAT32 or FLOAT64 Latitude and longitude must be 1- or 2-dimensional Latitude and longitude must have identical dimensions Time must be “Time” Time must be FLOAT64 in TAI93 format Time must be 1- or 2-dimensional “Track” must be slowest varying dimension in geo fields Grid data must be in one of six supported projections UAH The University of Alabama in Huntsville 8-10 September 1997
  • 15. Future Plans Relax requirements for latitude/longitude and time in swath datasets Provide Java-based GUI for area-of-interest selection Allow user to apply one subset specification to multiple input files Study integrating subsetter with a data visualization tool Study separating structural metadata from data UAH The University of Alabama in Huntsville 8-10 September 1997
  • 16. What is Needed More test datasets in HDF-EOS format Additional support for modifications to HDF-EOS calls Accurate HDF-EOS documentation (internal and external) Functional Java map applet Resolution of metadata issues Publication of official metadata standards Name, content, and format of granule metadata UAH The University of Alabama in Huntsville 8-10 September 1997
  • 17. Risks HDF-EOS not currently in widespread use HDF-EOS requirements for dataset-independent subsetting not widely known to data producers Legacy datasets are not in HDF-EOS format Converting to HDF-EOS may increase storage requirements Many datasets are on non-volatile media UAH The University of Alabama in Huntsville 8-10 September 1997
  • 18. Summary A prototype Web-based dataset-independent subsetter has been developed by UAH. Allows spatial, temporal, and spectral subsetting and subsampling of HDF-EOS datasets Benefits science data users and data centers Great potential. but limited current use UAH The University of Alabama in Huntsville 8-10 September 1997