SlideShare a Scribd company logo
NPP/ NPOESS
Product Data Format

Richard E. Ullman
NOAA/NESDIS/IPO • NASA/GSFC/NPP
Algorithm Division • System Engineering
Data/Information Architecture richard.
ullman@nasa.gov
NPOESS•INTEGRATED PROGRAM OFFICE
Scope
•

•

The file profile described in this presentation applies to
all NPOESS delivered environmental remote sensing
products of the following types:
–
–
–
–
–
–

SDR • Sensor Data Record
TDR • Temperature Data Record
EDR • Environmental Data Record
IP • Intermediate Product
ARP • Application Related Product
GEO • Geolocation

This profile does not apply to other types such as:

– RDR • Raw Data Record
– DDR • Data Delivery Record
– Mission notices, status notices, documentation, software
deliveries, etc

NPOESS•INTEGRATED PROGRAM OFFICE

2

The HDF Workshop 6, 2007
Nov
HDF5 for NPOESS
•

•

•
•
•
•

Hierarchical Data Format 5 (HDF5) is the format for delivery of
processed products from the National Polar-orbiting Operational
Environmental Satellite System (NPOESS) and for the NPOESS
Preparatory Program (NPP).
HDF5 is a general purpose library and file format for storing
scientific data. Two primary objects:
• Dataset, a multidimensional array of data elements
• Group, a structure for organizing objects
Efficient storage and I/O, including parallel I/O.
Free, open source software, multiple platforms.
Data stored in HDF5 is used in many fields from computational fluid
dynamics to film making.
Data can be stored in HDF5 in an endless variety of ways, so it is
important to standardize how NPOESS product data is organized in
HDF5.

NPOESS•INTEGRATED PROGRAM OFFICE

3

The HDF Workshop 6, 2007
Nov
Format Strengths
•

Straight HDF5.

•

Consistent HDF5 group structure

•

Allows for flexible temporal aggregation

– No need for additional libraries.
– Organization for each product is the same as all
others.
– Data “payload” is always in a product group within
All_Data group.
– Granules are appended by extending dataset
dimension.

NPOESS•INTEGRATED PROGRAM OFFICE

4

The HDF Workshop 6, 2007
Nov
Format Challenges
•
•
•
•
•

Geolocation appears in a separate product
group and may be in separate HDF5 file.
Field metadata, used to interpret data (similar to
netCDF CF) are in separate product profile file.
Quality flags must be parsed before they can be
interpreted.
Information needed for un-scaling scaled
integers is not obvious.
HDF5 indirect reference link API, used to link
metadata to the data in NPOESS’ use is
complex and not supported by all analysis COTS
implementations.

NPOESS•INTEGRATED PROGRAM OFFICE

5

The HDF Workshop 6, 2007
Nov
Information
Model UML
Diagram

NPOESS•INTEGRATED PROGRAM OFFICE

6

The HDF Workshop 6, 2007
Nov
An Example Product Group
•

In this example product group:
–
–
–
–
–
–
–

Five datasets constitute the product.
There are two common dimensions.
There are three congruent datasets.
Two datasets contain scale and offset values.
One dataset contains quality flags by element.
There are two granules in this aggregation.
Dimension “alongTrack” crosses the “granule
boundary.”

NPOESS•INTEGRATED PROGRAM OFFICE

7

The HDF Workshop 6, 2007
Nov
Example extracted from VIIRS Sea
Surface Temperature EDR
Field Name

Description

Data Type

Dimensions

Units

BulkTemp

Sea Surface Bulk
Temperature

16-bit unsigned
integer

[ N*768, 3200 ]

Kelvin /
Unitless

SkinTemp

Sea Surface Skin
Temperature

16-bit unsigned
integer

[ N*768, 3200 ]

Kelvin /
Unitless

QF1_VIIRSSSTEDR

Land/Water
Background

1-bit

[N*768, 3200 ]

Unitless

SST Skin Quality

2-bit

Unitless

SST Bulk Quality

2-bit

Unitless

Aerosol Correction

3-bit

Unitless

Bulk SST Scale

32-bit float

Bulk SST Offset

32-bit float

Skin SST Scale

32-bit float

Skin SST Offset

32-bit float

SSTBulkFactors

SSTSkinFactors

[ N*2]

Unitless
Kelvin

[ N*2]

Unitless
Kelvin

N in Dimension is number of granules
NPOESS•INTEGRATED PROGRAM OFFICE

8

The HDF Workshop 6, 2007
Nov
Example Product Group
NPOESS Product Group
alo
Tr ng
ac
k

crossTrack

Granule 0
BulkTemp
Granule 0

SSTBulkFactors

Granule 1
SkinTemp
Granule 1

S
O
S
O

Granule 0

QF1_VIIRSSSTEDR
Granule 1

NPOESS•INTEGRATED PROGRAM OFFICE

Granule 0
Granule 1

SSTSkinFactors
S
O
S
O

Granule 0
Granule 1

9

The HDF Workshop 6, 2007
Nov
Dimensions
•
•
•
•
•

Dimensions are defined for each field.
Fields are related by congruency and common
dimensions.
Common dimensions are given the same name.
One dimension crosses the granule boundary.
When multiple granules are “aggregated” the
“granule boundary” dimension is extended.
Dimension names and attributes are provided in
the product profile.

NPOESS•INTEGRATED PROGRAM OFFICE

10

The HDF Workshop 6, 2007
Nov
Scaled Integer Storage
•
•
•
•
•

For storage efficiency floating point data values may be
stored as scaled integers.
To re-generate the data value, the dataset element must
be multiplied by a supplied scale factor and an integer
offset added.
The scale factor and offset are provided, one pair for
each granule as a separate dataset.
The scale and offset value is the same for all granules
produced with a given version of an algorithm - not
dynamic scaling.
The fact that a dataset is a scaled value and the
association between the data dataset and the scale
factor dataset is contained in the product profile.

NPOESS•INTEGRATED PROGRAM OFFICE

11

The HDF Workshop 6, 2007
Nov
Quality Flags by Element
•
•
•

Most NPOESS products contain multiple indicators of
quality on an element by element basis.
Quality flags are associated by congruency (shared
dimension) with a data array.
Multiple Flags of less than 8-bits are “packed” into
structures aligned on 8-bit boundaries.
1
1

Quality Flags

0
1
0

1-bit
2-bit
2-bit

0
0

3-bit

1

NPOESS•INTEGRATED PROGRAM OFFICE

12

The HDF Workshop 6, 2007
Nov
Geolocation
•
•
•

Geolocation products are constructed using the same
conventions as SDRs and EDRs.
Geolocation datasets have a congruence relationship
with the same dimensions as the datasets to which they
apply.
The association between a data product with its
geolocation product is made on one of two ways:
– The geolocation product may be packaged as a separate
product group within the same HDF5 file.
– The name of a separate geolocation product file may be stored
in the N_GEO_Ref attribute on the root HDF group.
– Choice of “as a product group” or “as a separate file” is made
upon order from the NPOESS IDPS.

NPOESS•INTEGRATED PROGRAM OFFICE

13

The HDF Workshop 6, 2007
Nov
Common Geolocation Fields
for VIIRS Products
Field Name

Comments

Dimensions

Units

Data Type

StartTime

since epoch 1/1/1958 [per scan or swath]

microseconds 64-bit signed integer

MidTime

since epoch 1/1/1958 [per scan or swath]

microseconds 64-bit signed integer

SCPosition

ECR coordinates

[per scan or swath]

meters

SCVelocity

ECR coordinates

[per scan or swath] meters/second

32-bit float
32-bit float

Latitude

[per cell]

degrees

32-bit float

Longitude

[per cell]

degrees

32-bit float

SolarZenithAngle

[per cell]

degrees

32-bit float

SolarAzimuthAngle

[per cell]

degrees

32-bit float

SensorZenithAngle

[per cell]

degrees

32-bit float

SensorAzimuthAngle

[per cell]

degrees

32-bit float

[per cell]

meters

32-bit float

[per cell]

meters

32-bit float

Height
SatelliteRange

geoid or terrain

NPOESS•INTEGRATED PROGRAM OFFICE

14

The HDF Workshop 6, 2007
Nov
Product Profiles
•

XML documents provide definition of product
fields.
– Product Profile is delivered as part of the product
documentation.
– Contains metadata such as units of measure,
dimension names, legend entries, etc
– A separate profile per product, but each conforms to
the same NPOESS Product document type definition
(dtd) and XML schema definition (xsd).
– A style sheet is provided that can render the profile
for a web browser.
– Example:VIIRI_IIT_II D
RIxml

NPOESS•INTEGRATED PROGRAM OFFICE

15

The HDF Workshop 6, 2007
Nov
17 field attributes in XML
product profile (1..9)
Attribute Name

Type

DataType

String

Description

String
Set of
Boolean
set of
string
String
Set of
string
Set of
number
Set of
string
Set of
number

Dimension_GranuleBoundary
Dimension_Name
Field_Name
FillValue_Name
FillValue_Value
LegendEntry_Name
LegendEntry_Value

Comments
String format is:
“%d-bit %s”, where %d is the number of bits and %s is one of:
o
signed integer
o
unsigned integer
o
floating point
o
<blank> (a bitfield)
A descriptive text
True (1) indicates that this is dimension extends when granules
are appended.
Name match indicates that this dimension is congruent with the
same dimension names in other datasets in this product group.
The name of the HDF5 dataset that contains the field values.

Data type matches type of dataset.

Data type matches type of dataset.

NPOESS•INTEGRATED PROGRAM OFFICE

16

The HDF Workshop 6, 2007
Nov
17 field attributes in XML
product profile (10..17)
Attribute Name

Type

Comments

MeasurementUnits

String

Consistent with SI naming and Unidata’s “udunits” package

NumberOfDimensions

Integer

Integer greater than zero.

NumberOfFillValues

Integer

NumberOfLegendEntries

Integer

RangeMax

Number

RangeMin

Number

Scaled

Boolean

ScaleFactorName

String

If zero, then no FillValue_Name and FillValue_Value attributes
are present. Fill Values are used for primary data fields only.
If zero, then no LegendEntry_Name and LegendEntry_Value
attributes are present. Legend entries are used for quality fields
only.
Maximum expected value of field elements in the product, not
just this dataset instance. Data type matches type of dataset.
Minimum expected value of field elements in the product, not
just this instance. Data type matches type of dataset.
True indicates that the dataset is scaled. Note that fill values are
in the dataset type and so must be tested before un-scaling.
the name of the HDF5 dataset that contains scaling coefficients.
To un-scale the elements, first multiply the scaled element by
the first element and then add the second element. If the dataset
is not scaled, Scale_AttributeName will not exist.

NPOESS•INTEGRATED PROGRAM OFFICE

17

The HDF Workshop 6, 2007
Nov

More Related Content

Similar to NPP/NPOESS Product Data Format

Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
The HDF-EOS Tools and Information Center
 
UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5
The HDF-EOS Tools and Information Center
 
ICESat-2 H5-ES Product Development Strategy
ICESat-2 H5-ES Product Development StrategyICESat-2 H5-ES Product Development Strategy
ICESat-2 H5-ES Product Development Strategy
The HDF-EOS Tools and Information Center
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
The HDF-EOS Tools and Information Center
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
The HDF-EOS Tools and Information Center
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
The HDF-EOS Tools and Information Center
 
From allotrope to reference master data management
From allotrope to reference master data management From allotrope to reference master data management
From allotrope to reference master data management
OSTHUS
 
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking AlgorithmPerformance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
IRJET Journal
 
Ranjitbanshpal1
Ranjitbanshpal1Ranjitbanshpal1
Ranjitbanshpal1
ranjit banshpal
 
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
IRJET Journal
 
Introduction to HDF5
Introduction to HDF5Introduction to HDF5
Profile of HDF-EOS5 Files
Profile of HDF-EOS5 FilesProfile of HDF-EOS5 Files
Profile of HDF-EOS5 Files
The HDF-EOS Tools and Information Center
 
HDF5 for NPOESS Data Products
HDF5 for NPOESS Data ProductsHDF5 for NPOESS Data Products
HDF5 for NPOESS Data Products
The HDF-EOS Tools and Information Center
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
EDINA, University of Edinburgh
 
Range Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map ReduceRange Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map Reduce
IJMER
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
Robert Grossman
 
Non-Specialized File Format Extension
Non-Specialized File Format ExtensionNon-Specialized File Format Extension
Non-Specialized File Format Extension
CSCJournals
 
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
The HDF-EOS Tools and Information Center
 
GI2013 ppt kafka&team-inspire in pocket
GI2013 ppt kafka&team-inspire in  pocketGI2013 ppt kafka&team-inspire in  pocket
GI2013 ppt kafka&team-inspire in pocket
IGN Vorstand
 
Inspire in pocket dresden 2
Inspire in  pocket dresden 2Inspire in  pocket dresden 2
Inspire in pocket dresden 2
Karel Charvat
 

Similar to NPP/NPOESS Product Data Format (20)

Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
Content Framework for Operational Environmental Remote Sensing Data Sets: NPO...
 
UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5
 
ICESat-2 H5-ES Product Development Strategy
ICESat-2 H5-ES Product Development StrategyICESat-2 H5-ES Product Development Strategy
ICESat-2 H5-ES Product Development Strategy
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
 
From allotrope to reference master data management
From allotrope to reference master data management From allotrope to reference master data management
From allotrope to reference master data management
 
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking AlgorithmPerformance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
 
Ranjitbanshpal1
Ranjitbanshpal1Ranjitbanshpal1
Ranjitbanshpal1
 
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
 
Introduction to HDF5
Introduction to HDF5Introduction to HDF5
Introduction to HDF5
 
Profile of HDF-EOS5 Files
Profile of HDF-EOS5 FilesProfile of HDF-EOS5 Files
Profile of HDF-EOS5 Files
 
HDF5 for NPOESS Data Products
HDF5 for NPOESS Data ProductsHDF5 for NPOESS Data Products
HDF5 for NPOESS Data Products
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
Range Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map ReduceRange Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map Reduce
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
Non-Specialized File Format Extension
Non-Specialized File Format ExtensionNon-Specialized File Format Extension
Non-Specialized File Format Extension
 
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
Harmonization of HDF with National Imagery Transfer Format (NITF) 2.1
 
GI2013 ppt kafka&team-inspire in pocket
GI2013 ppt kafka&team-inspire in  pocketGI2013 ppt kafka&team-inspire in  pocket
GI2013 ppt kafka&team-inspire in pocket
 
Inspire in pocket dresden 2
Inspire in  pocket dresden 2Inspire in  pocket dresden 2
Inspire in pocket dresden 2
 

More from The HDF-EOS Tools and Information Center

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
The HDF-EOS Tools and Information Center
 
The State of HDF
The State of HDFThe State of HDF
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
The HDF-EOS Tools and Information Center
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
The HDF-EOS Tools and Information Center
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
The HDF-EOS Tools and Information Center
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
The HDF-EOS Tools and Information Center
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
The HDF-EOS Tools and Information Center
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
The HDF-EOS Tools and Information Center
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
The HDF-EOS Tools and Information Center
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
The HDF-EOS Tools and Information Center
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
The HDF-EOS Tools and Information Center
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
The HDF-EOS Tools and Information Center
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
The HDF-EOS Tools and Information Center
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
The HDF-EOS Tools and Information Center
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
The HDF-EOS Tools and Information Center
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
The HDF-EOS Tools and Information Center
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020

More from The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 

Recently uploaded

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 

Recently uploaded (20)

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 

NPP/NPOESS Product Data Format

  • 1. NPP/ NPOESS Product Data Format Richard E. Ullman NOAA/NESDIS/IPO • NASA/GSFC/NPP Algorithm Division • System Engineering Data/Information Architecture richard. ullman@nasa.gov NPOESS•INTEGRATED PROGRAM OFFICE
  • 2. Scope • • The file profile described in this presentation applies to all NPOESS delivered environmental remote sensing products of the following types: – – – – – – SDR • Sensor Data Record TDR • Temperature Data Record EDR • Environmental Data Record IP • Intermediate Product ARP • Application Related Product GEO • Geolocation This profile does not apply to other types such as: – RDR • Raw Data Record – DDR • Data Delivery Record – Mission notices, status notices, documentation, software deliveries, etc NPOESS•INTEGRATED PROGRAM OFFICE 2 The HDF Workshop 6, 2007 Nov
  • 3. HDF5 for NPOESS • • • • • • Hierarchical Data Format 5 (HDF5) is the format for delivery of processed products from the National Polar-orbiting Operational Environmental Satellite System (NPOESS) and for the NPOESS Preparatory Program (NPP). HDF5 is a general purpose library and file format for storing scientific data. Two primary objects: • Dataset, a multidimensional array of data elements • Group, a structure for organizing objects Efficient storage and I/O, including parallel I/O. Free, open source software, multiple platforms. Data stored in HDF5 is used in many fields from computational fluid dynamics to film making. Data can be stored in HDF5 in an endless variety of ways, so it is important to standardize how NPOESS product data is organized in HDF5. NPOESS•INTEGRATED PROGRAM OFFICE 3 The HDF Workshop 6, 2007 Nov
  • 4. Format Strengths • Straight HDF5. • Consistent HDF5 group structure • Allows for flexible temporal aggregation – No need for additional libraries. – Organization for each product is the same as all others. – Data “payload” is always in a product group within All_Data group. – Granules are appended by extending dataset dimension. NPOESS•INTEGRATED PROGRAM OFFICE 4 The HDF Workshop 6, 2007 Nov
  • 5. Format Challenges • • • • • Geolocation appears in a separate product group and may be in separate HDF5 file. Field metadata, used to interpret data (similar to netCDF CF) are in separate product profile file. Quality flags must be parsed before they can be interpreted. Information needed for un-scaling scaled integers is not obvious. HDF5 indirect reference link API, used to link metadata to the data in NPOESS’ use is complex and not supported by all analysis COTS implementations. NPOESS•INTEGRATED PROGRAM OFFICE 5 The HDF Workshop 6, 2007 Nov
  • 6. Information Model UML Diagram NPOESS•INTEGRATED PROGRAM OFFICE 6 The HDF Workshop 6, 2007 Nov
  • 7. An Example Product Group • In this example product group: – – – – – – – Five datasets constitute the product. There are two common dimensions. There are three congruent datasets. Two datasets contain scale and offset values. One dataset contains quality flags by element. There are two granules in this aggregation. Dimension “alongTrack” crosses the “granule boundary.” NPOESS•INTEGRATED PROGRAM OFFICE 7 The HDF Workshop 6, 2007 Nov
  • 8. Example extracted from VIIRS Sea Surface Temperature EDR Field Name Description Data Type Dimensions Units BulkTemp Sea Surface Bulk Temperature 16-bit unsigned integer [ N*768, 3200 ] Kelvin / Unitless SkinTemp Sea Surface Skin Temperature 16-bit unsigned integer [ N*768, 3200 ] Kelvin / Unitless QF1_VIIRSSSTEDR Land/Water Background 1-bit [N*768, 3200 ] Unitless SST Skin Quality 2-bit Unitless SST Bulk Quality 2-bit Unitless Aerosol Correction 3-bit Unitless Bulk SST Scale 32-bit float Bulk SST Offset 32-bit float Skin SST Scale 32-bit float Skin SST Offset 32-bit float SSTBulkFactors SSTSkinFactors [ N*2] Unitless Kelvin [ N*2] Unitless Kelvin N in Dimension is number of granules NPOESS•INTEGRATED PROGRAM OFFICE 8 The HDF Workshop 6, 2007 Nov
  • 9. Example Product Group NPOESS Product Group alo Tr ng ac k crossTrack Granule 0 BulkTemp Granule 0 SSTBulkFactors Granule 1 SkinTemp Granule 1 S O S O Granule 0 QF1_VIIRSSSTEDR Granule 1 NPOESS•INTEGRATED PROGRAM OFFICE Granule 0 Granule 1 SSTSkinFactors S O S O Granule 0 Granule 1 9 The HDF Workshop 6, 2007 Nov
  • 10. Dimensions • • • • • Dimensions are defined for each field. Fields are related by congruency and common dimensions. Common dimensions are given the same name. One dimension crosses the granule boundary. When multiple granules are “aggregated” the “granule boundary” dimension is extended. Dimension names and attributes are provided in the product profile. NPOESS•INTEGRATED PROGRAM OFFICE 10 The HDF Workshop 6, 2007 Nov
  • 11. Scaled Integer Storage • • • • • For storage efficiency floating point data values may be stored as scaled integers. To re-generate the data value, the dataset element must be multiplied by a supplied scale factor and an integer offset added. The scale factor and offset are provided, one pair for each granule as a separate dataset. The scale and offset value is the same for all granules produced with a given version of an algorithm - not dynamic scaling. The fact that a dataset is a scaled value and the association between the data dataset and the scale factor dataset is contained in the product profile. NPOESS•INTEGRATED PROGRAM OFFICE 11 The HDF Workshop 6, 2007 Nov
  • 12. Quality Flags by Element • • • Most NPOESS products contain multiple indicators of quality on an element by element basis. Quality flags are associated by congruency (shared dimension) with a data array. Multiple Flags of less than 8-bits are “packed” into structures aligned on 8-bit boundaries. 1 1 Quality Flags 0 1 0 1-bit 2-bit 2-bit 0 0 3-bit 1 NPOESS•INTEGRATED PROGRAM OFFICE 12 The HDF Workshop 6, 2007 Nov
  • 13. Geolocation • • • Geolocation products are constructed using the same conventions as SDRs and EDRs. Geolocation datasets have a congruence relationship with the same dimensions as the datasets to which they apply. The association between a data product with its geolocation product is made on one of two ways: – The geolocation product may be packaged as a separate product group within the same HDF5 file. – The name of a separate geolocation product file may be stored in the N_GEO_Ref attribute on the root HDF group. – Choice of “as a product group” or “as a separate file” is made upon order from the NPOESS IDPS. NPOESS•INTEGRATED PROGRAM OFFICE 13 The HDF Workshop 6, 2007 Nov
  • 14. Common Geolocation Fields for VIIRS Products Field Name Comments Dimensions Units Data Type StartTime since epoch 1/1/1958 [per scan or swath] microseconds 64-bit signed integer MidTime since epoch 1/1/1958 [per scan or swath] microseconds 64-bit signed integer SCPosition ECR coordinates [per scan or swath] meters SCVelocity ECR coordinates [per scan or swath] meters/second 32-bit float 32-bit float Latitude [per cell] degrees 32-bit float Longitude [per cell] degrees 32-bit float SolarZenithAngle [per cell] degrees 32-bit float SolarAzimuthAngle [per cell] degrees 32-bit float SensorZenithAngle [per cell] degrees 32-bit float SensorAzimuthAngle [per cell] degrees 32-bit float [per cell] meters 32-bit float [per cell] meters 32-bit float Height SatelliteRange geoid or terrain NPOESS•INTEGRATED PROGRAM OFFICE 14 The HDF Workshop 6, 2007 Nov
  • 15. Product Profiles • XML documents provide definition of product fields. – Product Profile is delivered as part of the product documentation. – Contains metadata such as units of measure, dimension names, legend entries, etc – A separate profile per product, but each conforms to the same NPOESS Product document type definition (dtd) and XML schema definition (xsd). – A style sheet is provided that can render the profile for a web browser. – Example:VIIRI_IIT_II D RIxml NPOESS•INTEGRATED PROGRAM OFFICE 15 The HDF Workshop 6, 2007 Nov
  • 16. 17 field attributes in XML product profile (1..9) Attribute Name Type DataType String Description String Set of Boolean set of string String Set of string Set of number Set of string Set of number Dimension_GranuleBoundary Dimension_Name Field_Name FillValue_Name FillValue_Value LegendEntry_Name LegendEntry_Value Comments String format is: “%d-bit %s”, where %d is the number of bits and %s is one of: o signed integer o unsigned integer o floating point o <blank> (a bitfield) A descriptive text True (1) indicates that this is dimension extends when granules are appended. Name match indicates that this dimension is congruent with the same dimension names in other datasets in this product group. The name of the HDF5 dataset that contains the field values. Data type matches type of dataset. Data type matches type of dataset. NPOESS•INTEGRATED PROGRAM OFFICE 16 The HDF Workshop 6, 2007 Nov
  • 17. 17 field attributes in XML product profile (10..17) Attribute Name Type Comments MeasurementUnits String Consistent with SI naming and Unidata’s “udunits” package NumberOfDimensions Integer Integer greater than zero. NumberOfFillValues Integer NumberOfLegendEntries Integer RangeMax Number RangeMin Number Scaled Boolean ScaleFactorName String If zero, then no FillValue_Name and FillValue_Value attributes are present. Fill Values are used for primary data fields only. If zero, then no LegendEntry_Name and LegendEntry_Value attributes are present. Legend entries are used for quality fields only. Maximum expected value of field elements in the product, not just this dataset instance. Data type matches type of dataset. Minimum expected value of field elements in the product, not just this instance. Data type matches type of dataset. True indicates that the dataset is scaled. Note that fill values are in the dataset type and so must be tested before un-scaling. the name of the HDF5 dataset that contains scaling coefficients. To un-scale the elements, first multiply the scaled element by the first element and then add the second element. If the dataset is not scaled, Scale_AttributeName will not exist. NPOESS•INTEGRATED PROGRAM OFFICE 17 The HDF Workshop 6, 2007 Nov