SlideShare a Scribd company logo
1 of 32
Download to read offline
HDF-EOS Subsetting:
HEW and other tools
HDF and HDF-EOS Workshop VI
San Francisco, CA – December 4-5, 2002

Dr. Sara Graves, Bruce Beaumont,
Helen Conover, Matt Smith
Presented by Steve Tanner
University of Alabama in Huntsville, Information Technology and Systems Center
SUBSET.ORG
Agenda
• Subsetting
• HDF-EOS Web-based subsetter
(HEW)
• HEW Subsetting Appliance:
Integration with ECS
• HSE: A new Subsetting Engine
• Software availability options
• HEW Demonstration
December 4-5, 2002

HDF-EOS Workshop VI
Subsetting
• Goal: to provide a science data user with only the data they
request as quickly as possible.
• Benefits science data users and data centers:
- reduces analysis time by reducing amount of data
- reduces time for data delivery
- reduces resources (network, personnel, media, etc.)
• Steps:
- locate spatial / temporal / spectral area of interest
- extract
- re-assemble for distribution/use

December 4-5, 2002

HDF-EOS Workshop VI
Currently
Available/Planned
Subsetting Applications

• HDF-EOS Subsetting - HEW
–
–
–
–
–
–
–

Complete System
Subsetting Back-end Only
SPOT - Subsettability Checker
Subsetting Center at UAH
HEW Integration with ECS
HEW Subsetting Engine
Subsetting as a Web Service

(available)
(available)
(available)
(available)
(in work)
(in work)
(planned)

• Customized Subsetting
–
–

MODIS tools
Coarse-grain SSM/I Subsetter

(available)
(available)

• General Purpose Customizable Subsetting
–
–

December 4-5, 2002

Based on ADaM Data Mining Engine
Subsetting Tool using ESML

HDF-EOS Workshop VI

(available)
(in work)
HEW
• HDF-EOS Web-based Subsetter
–
–

Dataset-independent for HDF-EOS
files
Optional Front-end/GUI
• Uses HTML forms and JavaScript

–

Back-end
• Needs subset criteria and HDF-EOS data
• Performs subsetting as a “batch” job

–

http://subset.org

December 4-5, 2002

HDF-EOS Workshop VI
HEW Back End
•
•
•
•
•
•

Uses HDF-EOS (and HDF) library
Instructed via a subset criteria file (ODL)
Handles multiple similar files
Handles Swath and/or Grid objects
Unix (SGI & Sun) executables available
Subsetted output files contain:
•
•
•
•

StructMetadata (HDF-EOS)
ArchiveMetadata*
ProductMetadata (added by HEW
ODL file)
CoreMetadata* (w/ modified bounding box &
time info)
•

optionally placed in .met file
* if present in parent file

December 4-5, 2002

HDF-EOS Workshop VI
HEW Complete System
The User Interface checks the HDF-EOS file and presents the
attributes to the user.
The user interacts with the browser to specify the subsetting
criteria.
The User Interface creates the subsetting criteria file.
The HEW Back-end uses the subset criteria file with the input HDFEOS file to create the subset HDF-EOS file.
User’s
Brows
er
(HTML
)

User
Interface
(CGI)
Subsetting
API (ODL)

HEW Backend
December 4-5, 2002

Input
file

HDF-EOS Workshop VI

Output
file
HEW Back-end Only

Usersupplied
front-end

Subsetting
API (ODL)

Input
file

HEW Backend

Output
file

A user-supplied program or other application creates the
subsetting criteria file.
The HEW Back-end uses the subset criteria file with the input
HDF-EOS file to create the subsetted HDF-EOS file.

December 4-5, 2002

HDF-EOS Workshop VI
Example Subset Criteria File
GROUP = SUBSET
PARENT_FILE =(“/AQUA/AMSR/AE_L2A.hdfeos”)
LATITUDE_RANGE = (35.000000, 40.000000)
LONGITUDE_RANGE = (-77.000000, -72.000000)
EMAIL = “matt.smith@msfc.nasa.gov”
MET_FILE = YES
GROUP = SPOG
NAME = “swath_1”
TYPE = “SWATH”
PARAMETERS = (“89.0V_Res.1_TB”,
“89.0V_Res.2_TB”)
SUBSAMPLING = (“GeoTrack”, 2,
“GeoXtrack”, 1)
END_GROUP = SPOG
END_GROUP = SUBSET
END

December 4-5, 2002

HDF-EOS Workshop VI
HSE: HEW Subsetting Engine
Similar functionality to HEW back end, but contained
within a callable function
User Application Code
Subse tting crit e

Return cod

HEW Subset ting Engine
function

Input
file (s)

HSE_LogMsg
callback

Output
file(s)

HSE_StatusMs
callback

• User application
code builds
subsetting criteria
structure
• Subsetting engine
function calls user’s
functions for status
and log messages
• No e-mail is sent

• Beta version and documentation available now
Supports only grid subsetting at this time
– BCEA and SOM grids cannot be subsetted yet
–

• Swath subsetting available first quarter 2003
December 4-5, 2002

HDF-EOS Workshop VI
HEW Subsettable
data
EOS DATASETS
• Terra
MODIS
MOPITT

OTHERS
• TRMM
TMI
• NOAA-15,16
AMSU-A

ASTER
• Aqua
AMSR-E
AIRS
• Aura
HIRDLS

December 4-5, 2002

•

any other HDF-EOS data written
with HDF-EOS library subsetting
calls in mind

HDF-EOS Workshop VI
SPOT
• Subsettability “checker”
–
–
–
–
–
–

Displays content/structure of HDF-EOS
files
Examines files for subsettability by HEW
Simple command-line interface
Stand-alone operation
v1.4 now available for SGI and Sun
Available at subset.org

December 4-5, 2002

HDF-EOS Workshop VI
Subsetting Center
Remote user pushes file (or subsetting center could pull file) to
subsetting center
Remote user interacts with HEW to perform subsetting
Remote user pulls file (or subsetting center could push file) to remote
site

Remote User
ftp

Subsetting Center
HEW
Input
disk

December 4-5, 2002

HDF-EOS Workshop VI

Output
disk
Subsetting as a Web
Service (Planned)
Subsetting Center
Subset request
Subsetted data
Science
User
URL to
data

Data Store
December 4-5, 2002

HDF-EOS Workshop VI
HSA: HEW Subsetting
Appliance
• UAH/ITSC has been working with ESDIS, ECS
(EOSDIS Core System), and DAACs to
incorporate subsetting appliance into the
ECS/EOS Data Gateway (EDG) system.
• Provides for subsetting as part of ECS
automated order filling process.
• Infrastructure written by ITSC/UAH to allow
other appliances (subsetting, reprojecting,
reformatting, etc.) to be added later by
other data centers.

December 4-5, 2002

HDF-EOS Workshop VI
ECS integration plans
•
•
•
•
•

EDG v3.4 has basic subsetting options
Testing at NSIDC, operational in Jan ‘03
Testing at LPDAAC (EDC)
Testing will begin at GDAAC in ‘03
Further enhancements as requested by
DAACs

December 4-5, 2002

HDF-EOS Workshop VI
Tools developed for MODIS
Scientists
• MODIS – Land, Quality Assessment
– modland – subsetter for MODIS
gridded data
– stitcher – pieces together 2 or 4
contiguous MODIS tiles

• MODIS – Atmosphere
–

modair – specialized subsetter
for MODIS swaths

December 4-5, 2002

HDF-EOS Workshop VI
Subsetting Tool using ESML (in
work)
HDF-EOS ESML
file

Binary/
ASCII

ESML
file

Network
For HDF-EOS data not
formatted for
subsetting: ESML file
can be used to
correct the semantic
tag required to subset
HDF-EOS data without
the need to recreate
the data file
December 4-5, 2002

ESML Library
Subsetting Algorithm

Subsetted Data
HDF-EOS Workshop VI

Other
Formats

ESML
file
Subsetting web-site
subset.org
The subsetting “portal” is being
created for everyone involved in
subsetting
Advertising
Forums
Data
Software
Glossary
Tutorials
Links to specialized subsetters
December 4-5, 2002

HDF-EOS Workshop VI
Subsetting Demonstration

December 4-5, 2002

HDF-EOS Workshop VI
Select TMI
files for
subsetting
Display
combined
geographic
coverage
Select
subset
parameter
Combined
geographic
coverage
of input
files
Geographic
Select
geographic
subset
criteria
subset
criteria
Output
directory
for this
job
Display
combined
geographicc
coverage of
subset
results
Combined
geographic
coverage
of subset
results

More Related Content

What's hot

Disaster Recovery in the Hadoop Ecosystem: Preparing for the Improbable
Disaster Recovery in the Hadoop Ecosystem: Preparing for the ImprobableDisaster Recovery in the Hadoop Ecosystem: Preparing for the Improbable
Disaster Recovery in the Hadoop Ecosystem: Preparing for the ImprobableStefan Kupstaitis-Dunkler
 
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterAdvanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterEdureka!
 
Hadoop security
Hadoop securityHadoop security
Hadoop securityBiju Nair
 
HP Helion OpenStack Community Edition Deployment
HP Helion OpenStack Community Edition DeploymentHP Helion OpenStack Community Edition Deployment
HP Helion OpenStack Community Edition DeploymentMarton Kiss
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Andrew Underwood
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseDataWorks Summit
 
HPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and WorkflowsHPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and Workflowsinside-BigData.com
 
DataCore vFilO lets users locate files across multiple NAS systems with a sin...
DataCore vFilO lets users locate files across multiple NAS systems with a sin...DataCore vFilO lets users locate files across multiple NAS systems with a sin...
DataCore vFilO lets users locate files across multiple NAS systems with a sin...Principled Technologies
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.Yousef Fadila
 
Peter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of IrelandPeter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of Irelanddri_ireland
 
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureDataWorks Summit
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNDataWorks Summit
 
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...NETWAYS
 

What's hot (20)

Disaster Recovery in the Hadoop Ecosystem: Preparing for the Improbable
Disaster Recovery in the Hadoop Ecosystem: Preparing for the ImprobableDisaster Recovery in the Hadoop Ecosystem: Preparing for the Improbable
Disaster Recovery in the Hadoop Ecosystem: Preparing for the Improbable
 
Advanced Security In Hadoop Cluster
Advanced Security In Hadoop ClusterAdvanced Security In Hadoop Cluster
Advanced Security In Hadoop Cluster
 
Hadoop security
Hadoop securityHadoop security
Hadoop security
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
HP Helion OpenStack Community Edition Deployment
HP Helion OpenStack Community Edition DeploymentHP Helion OpenStack Community Edition Deployment
HP Helion OpenStack Community Edition Deployment
 
Google Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOSGoogle Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOS
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
HPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and WorkflowsHPC Storage and IO Trends and Workflows
HPC Storage and IO Trends and Workflows
 
DataCore vFilO lets users locate files across multiple NAS systems with a sin...
DataCore vFilO lets users locate files across multiple NAS systems with a sin...DataCore vFilO lets users locate files across multiple NAS systems with a sin...
DataCore vFilO lets users locate files across multiple NAS systems with a sin...
 
Big Data
Big DataBig Data
Big Data
 
Parallel Computing with HDF Server
Parallel Computing with HDF ServerParallel Computing with HDF Server
Parallel Computing with HDF Server
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 
co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.co-Hadoop: Data co-location on Hadoop.
co-Hadoop: Data co-location on Hadoop.
 
Peter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of IrelandPeter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of Ireland
 
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data ArchitectureRunning Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
 
HDF Status and Development
HDF Status and DevelopmentHDF Status and Development
HDF Status and Development
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
 
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...
OSDC 2014: Sebastian Harl - SysDB the system management and inventory collect...
 
Beginning hive and_apache_pig
Beginning hive and_apache_pigBeginning hive and_apache_pig
Beginning hive and_apache_pig
 

Viewers also liked (9)

Subsetting
SubsettingSubsetting
Subsetting
 
Joseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data ManagementJoseph Ours - The Scourge Of Testing: Test Data Management
Joseph Ours - The Scourge Of Testing: Test Data Management
 
Utilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud ComputingUtilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud Computing
 
HDF Cloud Services
HDF Cloud ServicesHDF Cloud Services
HDF Cloud Services
 
Scientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDFScientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDF
 
SPD and KEA: HDF5 based file formats for Earth Observation
SPD and KEA: HDF5 based file formats for Earth ObservationSPD and KEA: HDF5 based file formats for Earth Observation
SPD and KEA: HDF5 based file formats for Earth Observation
 
Test data management a case study Presented at SiGIST
Test data management a case study Presented at SiGISTTest data management a case study Presented at SiGIST
Test data management a case study Presented at SiGIST
 
Matlab, Big Data, and HDF Server
Matlab, Big Data, and HDF ServerMatlab, Big Data, and HDF Server
Matlab, Big Data, and HDF Server
 
Breakthrough Listen
Breakthrough ListenBreakthrough Listen
Breakthrough Listen
 

Similar to HDF-EOS Subsetting: HEW and other tools

Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...The HDF-EOS Tools and Information Center
 

Similar to HDF-EOS Subsetting: HEW and other tools (20)

Subsetting at UAH
Subsetting at UAHSubsetting at UAH
Subsetting at UAH
 
HDFView With an HDF-EOS Plug-in
HDFView With an HDF-EOS Plug-inHDFView With an HDF-EOS Plug-in
HDFView With an HDF-EOS Plug-in
 
Easy Access of NASA HDF data via OPeNDAP
Easy Access of NASA HDF data via OPeNDAPEasy Access of NASA HDF data via OPeNDAP
Easy Access of NASA HDF data via OPeNDAP
 
Introduction to HDF5
Introduction to HDF5Introduction to HDF5
Introduction to HDF5
 
HDF5 iRODS
HDF5 iRODSHDF5 iRODS
HDF5 iRODS
 
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout MapsEnsuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
 
ESDIS Status (2002)
ESDIS Status (2002)ESDIS Status (2002)
ESDIS Status (2002)
 
HDF-EOS APIs, tools, etc.
HDF-EOS APIs, tools, etc.HDF-EOS APIs, tools, etc.
HDF-EOS APIs, tools, etc.
 
HDF-EOS Maintenance, Current Development and Tools
HDF-EOS Maintenance, Current Development and ToolsHDF-EOS Maintenance, Current Development and Tools
HDF-EOS Maintenance, Current Development and Tools
 
Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...
 
Easy Remote Access Via OPeNDAP
Easy Remote Access Via OPeNDAPEasy Remote Access Via OPeNDAP
Easy Remote Access Via OPeNDAP
 
HDF Updae
HDF UpdaeHDF Updae
HDF Updae
 
HDF OPeNDAP project update and demo
HDF OPeNDAP project update and demoHDF OPeNDAP project update and demo
HDF OPeNDAP project update and demo
 
HDF-Java Products
HDF-Java ProductsHDF-Java Products
HDF-Java Products
 
HDF4 Mapping Project Update
HDF4 Mapping Project UpdateHDF4 Mapping Project Update
HDF4 Mapping Project Update
 
Introduction to HDF5
Introduction to HDF5Introduction to HDF5
Introduction to HDF5
 
Data Interoperability
Data InteroperabilityData Interoperability
Data Interoperability
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Status of HDF-EOS, Related Software, and Tools
Status of HDF-EOS, Related Software, and ToolsStatus of HDF-EOS, Related Software, and Tools
Status of HDF-EOS, Related Software, and Tools
 
Support for NPP/NPOESS/JPSS by The HDF Group
 Support for NPP/NPOESS/JPSS by The HDF Group Support for NPP/NPOESS/JPSS by The HDF Group
Support for NPP/NPOESS/JPSS by The HDF Group
 

More from The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

More from The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
Leveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software TestingLeveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software Testing
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
 
HDF Status Update
HDF Status UpdateHDF Status Update
HDF Status Update
 

Recently uploaded

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Recently uploaded (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

HDF-EOS Subsetting: HEW and other tools

  • 1. HDF-EOS Subsetting: HEW and other tools HDF and HDF-EOS Workshop VI San Francisco, CA – December 4-5, 2002 Dr. Sara Graves, Bruce Beaumont, Helen Conover, Matt Smith Presented by Steve Tanner University of Alabama in Huntsville, Information Technology and Systems Center SUBSET.ORG
  • 2. Agenda • Subsetting • HDF-EOS Web-based subsetter (HEW) • HEW Subsetting Appliance: Integration with ECS • HSE: A new Subsetting Engine • Software availability options • HEW Demonstration December 4-5, 2002 HDF-EOS Workshop VI
  • 3. Subsetting • Goal: to provide a science data user with only the data they request as quickly as possible. • Benefits science data users and data centers: - reduces analysis time by reducing amount of data - reduces time for data delivery - reduces resources (network, personnel, media, etc.) • Steps: - locate spatial / temporal / spectral area of interest - extract - re-assemble for distribution/use December 4-5, 2002 HDF-EOS Workshop VI
  • 4. Currently Available/Planned Subsetting Applications • HDF-EOS Subsetting - HEW – – – – – – – Complete System Subsetting Back-end Only SPOT - Subsettability Checker Subsetting Center at UAH HEW Integration with ECS HEW Subsetting Engine Subsetting as a Web Service (available) (available) (available) (available) (in work) (in work) (planned) • Customized Subsetting – – MODIS tools Coarse-grain SSM/I Subsetter (available) (available) • General Purpose Customizable Subsetting – – December 4-5, 2002 Based on ADaM Data Mining Engine Subsetting Tool using ESML HDF-EOS Workshop VI (available) (in work)
  • 5. HEW • HDF-EOS Web-based Subsetter – – Dataset-independent for HDF-EOS files Optional Front-end/GUI • Uses HTML forms and JavaScript – Back-end • Needs subset criteria and HDF-EOS data • Performs subsetting as a “batch” job – http://subset.org December 4-5, 2002 HDF-EOS Workshop VI
  • 6. HEW Back End • • • • • • Uses HDF-EOS (and HDF) library Instructed via a subset criteria file (ODL) Handles multiple similar files Handles Swath and/or Grid objects Unix (SGI & Sun) executables available Subsetted output files contain: • • • • StructMetadata (HDF-EOS) ArchiveMetadata* ProductMetadata (added by HEW ODL file) CoreMetadata* (w/ modified bounding box & time info) • optionally placed in .met file * if present in parent file December 4-5, 2002 HDF-EOS Workshop VI
  • 7. HEW Complete System The User Interface checks the HDF-EOS file and presents the attributes to the user. The user interacts with the browser to specify the subsetting criteria. The User Interface creates the subsetting criteria file. The HEW Back-end uses the subset criteria file with the input HDFEOS file to create the subset HDF-EOS file. User’s Brows er (HTML ) User Interface (CGI) Subsetting API (ODL) HEW Backend December 4-5, 2002 Input file HDF-EOS Workshop VI Output file
  • 8. HEW Back-end Only Usersupplied front-end Subsetting API (ODL) Input file HEW Backend Output file A user-supplied program or other application creates the subsetting criteria file. The HEW Back-end uses the subset criteria file with the input HDF-EOS file to create the subsetted HDF-EOS file. December 4-5, 2002 HDF-EOS Workshop VI
  • 9. Example Subset Criteria File GROUP = SUBSET PARENT_FILE =(“/AQUA/AMSR/AE_L2A.hdfeos”) LATITUDE_RANGE = (35.000000, 40.000000) LONGITUDE_RANGE = (-77.000000, -72.000000) EMAIL = “matt.smith@msfc.nasa.gov” MET_FILE = YES GROUP = SPOG NAME = “swath_1” TYPE = “SWATH” PARAMETERS = (“89.0V_Res.1_TB”, “89.0V_Res.2_TB”) SUBSAMPLING = (“GeoTrack”, 2, “GeoXtrack”, 1) END_GROUP = SPOG END_GROUP = SUBSET END December 4-5, 2002 HDF-EOS Workshop VI
  • 10. HSE: HEW Subsetting Engine Similar functionality to HEW back end, but contained within a callable function User Application Code Subse tting crit e Return cod HEW Subset ting Engine function Input file (s) HSE_LogMsg callback Output file(s) HSE_StatusMs callback • User application code builds subsetting criteria structure • Subsetting engine function calls user’s functions for status and log messages • No e-mail is sent • Beta version and documentation available now Supports only grid subsetting at this time – BCEA and SOM grids cannot be subsetted yet – • Swath subsetting available first quarter 2003 December 4-5, 2002 HDF-EOS Workshop VI
  • 11. HEW Subsettable data EOS DATASETS • Terra MODIS MOPITT OTHERS • TRMM TMI • NOAA-15,16 AMSU-A ASTER • Aqua AMSR-E AIRS • Aura HIRDLS December 4-5, 2002 • any other HDF-EOS data written with HDF-EOS library subsetting calls in mind HDF-EOS Workshop VI
  • 12. SPOT • Subsettability “checker” – – – – – – Displays content/structure of HDF-EOS files Examines files for subsettability by HEW Simple command-line interface Stand-alone operation v1.4 now available for SGI and Sun Available at subset.org December 4-5, 2002 HDF-EOS Workshop VI
  • 13. Subsetting Center Remote user pushes file (or subsetting center could pull file) to subsetting center Remote user interacts with HEW to perform subsetting Remote user pulls file (or subsetting center could push file) to remote site Remote User ftp Subsetting Center HEW Input disk December 4-5, 2002 HDF-EOS Workshop VI Output disk
  • 14. Subsetting as a Web Service (Planned) Subsetting Center Subset request Subsetted data Science User URL to data Data Store December 4-5, 2002 HDF-EOS Workshop VI
  • 15. HSA: HEW Subsetting Appliance • UAH/ITSC has been working with ESDIS, ECS (EOSDIS Core System), and DAACs to incorporate subsetting appliance into the ECS/EOS Data Gateway (EDG) system. • Provides for subsetting as part of ECS automated order filling process. • Infrastructure written by ITSC/UAH to allow other appliances (subsetting, reprojecting, reformatting, etc.) to be added later by other data centers. December 4-5, 2002 HDF-EOS Workshop VI
  • 16. ECS integration plans • • • • • EDG v3.4 has basic subsetting options Testing at NSIDC, operational in Jan ‘03 Testing at LPDAAC (EDC) Testing will begin at GDAAC in ‘03 Further enhancements as requested by DAACs December 4-5, 2002 HDF-EOS Workshop VI
  • 17. Tools developed for MODIS Scientists • MODIS – Land, Quality Assessment – modland – subsetter for MODIS gridded data – stitcher – pieces together 2 or 4 contiguous MODIS tiles • MODIS – Atmosphere – modair – specialized subsetter for MODIS swaths December 4-5, 2002 HDF-EOS Workshop VI
  • 18. Subsetting Tool using ESML (in work) HDF-EOS ESML file Binary/ ASCII ESML file Network For HDF-EOS data not formatted for subsetting: ESML file can be used to correct the semantic tag required to subset HDF-EOS data without the need to recreate the data file December 4-5, 2002 ESML Library Subsetting Algorithm Subsetted Data HDF-EOS Workshop VI Other Formats ESML file
  • 19. Subsetting web-site subset.org The subsetting “portal” is being created for everyone involved in subsetting Advertising Forums Data Software Glossary Tutorials Links to specialized subsetters December 4-5, 2002 HDF-EOS Workshop VI
  • 20. Subsetting Demonstration December 4-5, 2002 HDF-EOS Workshop VI
  • 26.
  • 27.
  • 28.
  • 31.