SlideShare a Scribd company logo
1 of 16
Content Framework for Operational
Environmental Remote Sensing Data
Sets: NPOESS Concepts
Alan M. Goldberg
agoldber@mitre.org
NOTICE
This technical data was produced for the U.S. Government under Contract No.
50-SPNA-9-00010, and is subject to the Rights in Technical Data - General
clause at FAR 52.227-14 (JUN 1987)
© 2004 The MITRE Corporation

Approved for public release; distribution unlimited
Motivation
NPOESS will collect an unprecedented quantity and variety of satellite environmental data from a
constellation of satellites carrying multiple remote sensing and in situ sensors.
NPOESS realized the need for standardization and good metadata in connection with
environmental data sets. Global data sets, such as those produced by satellites or used in
modeling and simulation, are at the leading edge of information stewardship challenges.
Data sets will be delivered to operational users by pre-subscription through Interface Data
Processors (IDPS) at the four US processing Centrals, and through direct readout Field
Terminals (FTS) worldwide. Data will be retrieved by near- and long-term researchers
through NOAA’s data archives.
NPOESS committed early to using existing standards in a rational manner to support our mission
goals.
– Standards-based data products ease product usability.
– Comprehensive metadata contributes to maximizing the continuing value of large,
complex data sets. Standards-based metadata ease product understanding.
– Working from a clean sheet of paper, NPOESS can serve as a “workshop” for best
practices in data stewardship.
This paper presents the results of planning for the NPOESS data product delivery system. It
presents a view of the optimal data product design, much of which is being implemented in
NPOESS.
End-to-End Process
NPOESS-Unique
Collection
Space
Segment

NPOESS-Unique
Processing

Demodulator

User
Terminal

Antenna
RF
Electronics

Application Processing,
Display, & Dissemination

FT
Processor FTS
Element

External I/F

Other
Data

IDPS

C3S/DRR

Central Element

Central

The National Polar-orbiting Operational Environmental Satellite System (NPOESS) is managed by the
NOAA Integrated Program Office, on behalf of the U.S. Dept. of Commerce, Dept. of Defense, and
NASA. Northrop-Grumman Space Technology is prime contractor, with Raytheon the subcontractor
for ground processing & operations
Architectural Considerations
Sizing
•
Overall data rates have increased by two orders of magnitude
•
more products
•
impact the on existing user systems and applications
•
decision to “loosely” couple
•
simple data interface with flexibility on both sides
•
decided to deliver relatively short-duration data granules, containing typically 30 seconds of data
Users and use patterns: designed to serve operational users, current science users, and future archival
researchers
•
Operational users need effectively all of the data as soon as possible
•
Research users need current environmental information, but usually have time and resources to improve
product quality with post-processing
•
Archival researchers look for highly selective data sets
•
Users via Centrals or via field terminals
Sensor complexity
•
multiple versions of the data
•
the raw bitstreams originating from the sensors (RDRs)
•
calibrated fluxes measured by the sensor (SDRs)
•
environmental variables estimated at the source (EDRs)
•
sensors themselves produce a wide variety of data types
•
various techniques to maximize performance within bandwidth
Anticipating change
•
detailed formats, contents, product lists, and interfaces that must be accommodated by NPOESS data
product framework format through the mission lifetime
SENSORS

CCSDS (mux, code, frame) &Encrypt

Delivered Raw

Packetization
Compression
Aux.
Sensor
Data

Cal.
Source

ENVIRONMENTAL
SOURCE
COMPONENTS

Filtration
A/D Conversion
Detection
Flux
Manipulation

Comm
Processing

C3S

Comm
Receiver

RDR
Production

RDR Level
TDR Level

SDR
Production

SDR Level

EDR Level

EDR
Production

IDPS produces mission data sets which recreate or estimate signals at 4 points in the sensing
chain. Ancillary data, brought from other systems and used in EDR processing, is captured.
Auxiliary data, produced within the NPOESS system to support processing, is kept with the
mission data or incorporated in documentation. Metadatata provided for all.

IDPS

Comm
Xmitter

Data
Store

OTHER
SUBSYSTEMS

SPACE SEGMENT

Data Products Are More Than Scenes
Typical Data Organization
RDRs
SDRs
EDRs
Ancillary

Time-Series of
Packet Types

Binary
Headers

u
v
w

Multispectral
Imagery

Vector
Flux

Data
Volume
Geolocation

Calibration
Table

Slit
Spectra

Quality
Column Data
Thematic Layers
Geolocation

?

Abstract
External Data

Sounding
FT
Spectra

Imagery
First Decision – File Format
The NPOESS program identified several key characteristics for the file format within which data products
would be delivered. Hierarchical Data Format ver.5 (HDF5) was found to be the best solution within
technical and programmatic constraints:
•
A single format with proven ability to handle environmental data products (EDRs); more abstract data
structures in RDRs, TDRs & SDRs; and other products delivered to users and archives
•
Capability to incorporate full metadata
•
Supported by the user community and other institutional support, with an adequate practical lifetime;
interoperable with DoD standards
•
Ability to handle large data sets (such as full orbits) and small data sets (individual granules) with
acceptable efficiency
•
Ability to handle multiple arrays and array types within the same granule, such as observational data arrays
and geolocation arrays
•
High efficiency for reading and writing; built-in compression function; capability to “chunk” large data
arrays to access prestructured subsets
•
Sufficiently self-documenting to permit variable formats
•
Available with development tools to expedite file definition and applications
•
Acceptable licensing terms
•
Supported on all likely user platforms & operating systems
•
Support for all likely atomic data types
•
Simple data objects and groups which permit application-specific structures to be created
File Structure Implemented in HDF5
B

Dataset

A

File
file attributes

Header
Granule

Granule

Data
Dataspace

int16

current = 12e-9

Datatype
2
Rank

Dim_3=2
Dim_2=4

Dim_1=5
Dimensions

Dataset

Dataset

temp = 56
time = 32.4

Dataset

Dataset

Attributes
chunked; compressed
Storage layout

granule attributes

granule attributes

HDF5 provides a simple, logical file structure based on a Dataset, comprising a data array and a header
which describes the dataspace. Datasets can be structured hierarchically. NPOESS datasets and
additional attributes – incorporating the metadata – combine into granules, and granules combine into
product files. Granules are concatenated in a file in such a way that they can be addressed either
collectively or individually. Individual datasets will be created for elements such as mission data, quality
assessment, geospatial location, time, illumination, and viewing geometry. Users may access subsets of
the full data using HDF5 utilities.
Second Decision - Metadata
Based on extensive prior experiences in the earth sciences and other sciences, NPOESS highest
level operational requirements specified that comprehensive metadata would be delivered
with the data. In our context, metadata incorporates the following “data about data”:
• Identification
• Content summary
• Content meaning
• Content structure and format
• Acquisition and processing history – provenance
• Distribution and availability
The National Spatial Data Infrastructure (NSDI) provides basic guidance for metadata. The
Federal Geospatial Data Committee (FGDC) sets content standards. The program establishes
compatible extensions to the standard. The program also defines the representation. Over one
hundred metadata items have been defined for NPOESS.
Basic identification metadata is duplicated in a User Block at the front of the file, where it can be
read without HDF software.
FGDC Metadata Base & RS Standards

Metadata

1

2

Identification

3

Spatial
Data
Organization

Data
Quality

6

Distribution
Information

Base Standard
RSE Standard

4

7

Metadata
Reference
Information

5

Spatial
Reference

8

Platform &
Mission
Information

Entity &
Attribute
Information

9

Instrument
Information
Quasistatic
NSDI
metadata
Complete
granule
metadata

Dynamic
NSDI
metadata
Dynamic
detail
metadata

eHandbook

external reference
Identification (UB)
Aggregate metadata
Common metadata

file

Unique metadata
granule

Concept

Analysis

Repositories

NPOESS will create all applicable metadata identified in the FGDC Base Standard and Remote
Sensing Extensions, plus mission-unique. Comprehensive metadata at the granule level will be
captured in different ways, consistent with usage patterns. Metadata which changes infrequently will
be collected in an external online reference. Metadata which changes with each granule is saved with
it. Metadata which describes a file or all the granules in the file is extracted and stored with the file.
Third Decision – File Organization
Primary characteristics of a design
•Systematic data organization
•Simplifies data maintenance, enhancement, documentation, retrieval, visualization, and exploitation
•NPOESS evaluation looked at lessons learned from EOS, and best practices elsewhere
•Must work with abstract, generalized scientific, and specifically geospatial data sets
Result: Specific derived requirements closely match the “Climate & Forecast Conventions”, primarily
developed for use with netCDF. It provides guidelines for consistent, complete, and clear data entity and
attribute definition. NPOESS is attempting to implement this data design.
Need clear identification of truly independent and dependent variable contents:
•For most unresampled data, the natural independent variables are the index attributes defining the discrete
points in space and time at which the data samples were collected. Index might be a time-series sample number,
detector number, energy or spectral bin.
•Usually include an associated independent variable for each index attribute, such as time, direction, position, or
energy level. These relate to indices by calibration, known to be very accurate.
•Some associated independent variables may be multidimensional. E.g., polar geolocation or solar elevation is
a deterministic function of spatial indices.
•Always a primary dependent variable: function of independent variable(s). May be an abstract binary object,
function of time or place. Usually, well defined variables which are a function of temporal, spatial, or spectral
indices.
•Often 1+ associated dependent variable arrays, such as quality estimates or telemetry values, associated by
design with the primary independant variable
•Optional supplementary arrays, such as calibrations, are functions of 1+ index attributes used in the primary
and associated data arrays.
Concatenation: One index attribute, usually time or time-like, can often be defined as the index which
establishes continuity from one granule to the next.
Clear Index & Array Definitions
Primary
Index

Primary Array
e.g., Flux, Brightness,
Counts, NDVI

Index
Attribute

n-Dimensional Dependant
Variable (Entity) Array
2-D Independent
Variable Arrays
e.g., lat/lon, sun
alt/az, land mask

Associated
Independent
Variable(s)
Putting Together the Framework

Granule Boundaries
Time

Supporting Timeline Data
Sensor Data Quality Flag Bits
Sensor Data Arrays

Geolocation Arrays
Putting Together the Framework
With careful design, and based on lessons learned from previous programs, a comprehensive data
product design can be achieved. The design eases development and maintenance, by
providing a common approach to data and metadata.
Granules form the basic unit of production and cataloguing. They are essentially self-contained.
Each granule contains the primary data arrays, associated data arrays, and descriptive attributes –
including metadata – needed to understand the information content.
To facilitate efficient delivery, multiple granules of the same type are combined into product files.
Common attributes of all granules in a file may be extracted to the file level as common
metadata. Summary and identification attributes are created and added to the file. Basic
identification metadata is extracted from the HDF format and saved as an ASCII ‘user block’
at the physical start of the file.
Finally, metadata which changes only rarely is maintained separately as an electronic handbook,
and is incorporated in the granules by reference.
eHandbook

HDF File
Block

Root

File
Attributes

User
Block

File Metadata
Common Metadata

Data
Product

Other File Attributes

Granule

Granule

Granule

Granule
Attributes
Granule
Metadata
Datasets

Index Variables

Primary Array

Associated Arrays

Other
Granule
Attributes

Granule
Attributes
Granule
Metadata
Datasets

Associated Arrays

Granule
Metadata

Other
Granule
Attributes

Index Variables

Primary Array

Granule
Attributes

Datasets

Index Variables

Primary Array

Associated Arrays

Other
Granule
Attributes

More Related Content

What's hot

Method and apparatus for automated negotiation for resources on a switched un...
Method and apparatus for automated negotiation for resources on a switched un...Method and apparatus for automated negotiation for resources on a switched un...
Method and apparatus for automated negotiation for resources on a switched un...Tal Lavian Ph.D.
 
Presentation Ispass 2012 Session6 Presentation1
Presentation Ispass 2012 Session6 Presentation1Presentation Ispass 2012 Session6 Presentation1
Presentation Ispass 2012 Session6 Presentation1sairahul321
 
Method and apparatus for scheduling resources on a switched underlay network
Method and apparatus for scheduling resources on a switched underlay networkMethod and apparatus for scheduling resources on a switched underlay network
Method and apparatus for scheduling resources on a switched underlay networkTal Lavian Ph.D.
 
The Gordon Data-intensive Supercomputer. Enabling Scientific Discovery
The Gordon Data-intensive Supercomputer. Enabling Scientific DiscoveryThe Gordon Data-intensive Supercomputer. Enabling Scientific Discovery
The Gordon Data-intensive Supercomputer. Enabling Scientific DiscoveryIntel IT Center
 
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET Journal
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...Alexander Decker
 
A novel cache resolution technique for cooperative caching in wireless mobile...
A novel cache resolution technique for cooperative caching in wireless mobile...A novel cache resolution technique for cooperative caching in wireless mobile...
A novel cache resolution technique for cooperative caching in wireless mobile...csandit
 
Fota Delta Size Reduction Using FIle Similarity Algorithms
Fota Delta Size Reduction Using FIle Similarity AlgorithmsFota Delta Size Reduction Using FIle Similarity Algorithms
Fota Delta Size Reduction Using FIle Similarity AlgorithmsShivansh Gaur
 
White Paper: Hadoop in Life Sciences — An Introduction
White Paper: Hadoop in Life Sciences — An Introduction   White Paper: Hadoop in Life Sciences — An Introduction
White Paper: Hadoop in Life Sciences — An Introduction EMC
 
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGIHadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGIAllen Day, PhD
 
مشروع قواعد البيانات
مشروع قواعد البيانات مشروع قواعد البيانات
مشروع قواعد البيانات Safiya Najeh
 
Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...
 Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst... Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...
Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...hydrologyproject001
 
Dynamic Metadata Management in Semantic File Systems
Dynamic Metadata Management in Semantic File SystemsDynamic Metadata Management in Semantic File Systems
Dynamic Metadata Management in Semantic File SystemsIJERA Editor
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014aceas13tern
 

What's hot (19)

hadoop
hadoophadoop
hadoop
 
Method and apparatus for automated negotiation for resources on a switched un...
Method and apparatus for automated negotiation for resources on a switched un...Method and apparatus for automated negotiation for resources on a switched un...
Method and apparatus for automated negotiation for resources on a switched un...
 
Presentation Ispass 2012 Session6 Presentation1
Presentation Ispass 2012 Session6 Presentation1Presentation Ispass 2012 Session6 Presentation1
Presentation Ispass 2012 Session6 Presentation1
 
Method and apparatus for scheduling resources on a switched underlay network
Method and apparatus for scheduling resources on a switched underlay networkMethod and apparatus for scheduling resources on a switched underlay network
Method and apparatus for scheduling resources on a switched underlay network
 
The Gordon Data-intensive Supercomputer. Enabling Scientific Discovery
The Gordon Data-intensive Supercomputer. Enabling Scientific DiscoveryThe Gordon Data-intensive Supercomputer. Enabling Scientific Discovery
The Gordon Data-intensive Supercomputer. Enabling Scientific Discovery
 
Computer Forensic
Computer ForensicComputer Forensic
Computer Forensic
 
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
 
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
 
An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...An asynchronous replication model to improve data available into a heterogene...
An asynchronous replication model to improve data available into a heterogene...
 
D04501036040
D04501036040D04501036040
D04501036040
 
A novel cache resolution technique for cooperative caching in wireless mobile...
A novel cache resolution technique for cooperative caching in wireless mobile...A novel cache resolution technique for cooperative caching in wireless mobile...
A novel cache resolution technique for cooperative caching in wireless mobile...
 
Fota Delta Size Reduction Using FIle Similarity Algorithms
Fota Delta Size Reduction Using FIle Similarity AlgorithmsFota Delta Size Reduction Using FIle Similarity Algorithms
Fota Delta Size Reduction Using FIle Similarity Algorithms
 
White Paper: Hadoop in Life Sciences — An Introduction
White Paper: Hadoop in Life Sciences — An Introduction   White Paper: Hadoop in Life Sciences — An Introduction
White Paper: Hadoop in Life Sciences — An Introduction
 
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGIHadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
 
مشروع قواعد البيانات
مشروع قواعد البيانات مشروع قواعد البيانات
مشروع قواعد البيانات
 
Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...
 Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst... Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...
Download-manuals-ground water-manual-gw-volume9operationmanualdatatransferst...
 
Dynamic Metadata Management in Semantic File Systems
Dynamic Metadata Management in Semantic File SystemsDynamic Metadata Management in Semantic File Systems
Dynamic Metadata Management in Semantic File Systems
 
Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014Ben Evans SPEDDEXES 2014
Ben Evans SPEDDEXES 2014
 
Bv25430436
Bv25430436Bv25430436
Bv25430436
 

Similar to Content Framework for Operational Environmental Remote Sensing Data Sets: NPOESS Concepts

MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...grssieee
 
Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Lizzy_Rolando
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014aceas13tern
 
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET Journal
 
Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceeSAT Publishing House
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentationTERN Australia
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataRobert Grossman
 
pacs picture archeving comunication system instrumentation
pacs picture archeving comunication system instrumentationpacs picture archeving comunication system instrumentation
pacs picture archeving comunication system instrumentationDarshan Reddy
 
Sector Cloudcom Tutorial
Sector Cloudcom TutorialSector Cloudcom Tutorial
Sector Cloudcom Tutoriallilyco
 
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...Andreas Schreiber
 

Similar to Content Framework for Operational Environmental Remote Sensing Data Sets: NPOESS Concepts (20)

Hdg geo discussion
Hdg geo discussionHdg geo discussion
Hdg geo discussion
 
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
MO3.L10 - NATIONAL POLAR-ORBITING OPERATIONAL ENVIRONMENTAL SATELLITE SYSTEM ...
 
Overview and Status of HDF in NPOESS & NPP
Overview and Status of HDF in NPOESS & NPPOverview and Status of HDF in NPOESS & NPP
Overview and Status of HDF in NPOESS & NPP
 
Welcome to HDF Workshop V
Welcome to HDF Workshop VWelcome to HDF Workshop V
Welcome to HDF Workshop V
 
Data Management Planning - 02/21/13
Data Management Planning - 02/21/13Data Management Planning - 02/21/13
Data Management Planning - 02/21/13
 
Hdf5
Hdf5Hdf5
Hdf5
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014
 
UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5UML Representation of NPOESS Data Products in HDF5
UML Representation of NPOESS Data Products in HDF5
 
HDF Lessons from NPOESS & Future Opportunities
HDF Lessons from NPOESS  & Future OpportunitiesHDF Lessons from NPOESS  & Future Opportunities
HDF Lessons from NPOESS & Future Opportunities
 
MEDIN Discovery Metadata Standard
MEDIN Discovery Metadata StandardMEDIN Discovery Metadata Standard
MEDIN Discovery Metadata Standard
 
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS ...
 
Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduce
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
 
Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
RFCs for HDF5 and HDF-EOS5 Status Update
RFCs for HDF5 and HDF-EOS5 Status UpdateRFCs for HDF5 and HDF-EOS5 Status Update
RFCs for HDF5 and HDF-EOS5 Status Update
 
pacs picture archeving comunication system instrumentation
pacs picture archeving comunication system instrumentationpacs picture archeving comunication system instrumentation
pacs picture archeving comunication system instrumentation
 
Sector Cloudcom Tutorial
Sector Cloudcom TutorialSector Cloudcom Tutorial
Sector Cloudcom Tutorial
 
NPP/NPOESS Product Data Format
NPP/NPOESS Product Data FormatNPP/NPOESS Product Data Format
NPP/NPOESS Product Data Format
 
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...
PyModESt: A Python Framework for Staging of Geo-referenced Data on the Coll...
 

More from The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

More from The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 

Recently uploaded

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Content Framework for Operational Environmental Remote Sensing Data Sets: NPOESS Concepts

  • 1. Content Framework for Operational Environmental Remote Sensing Data Sets: NPOESS Concepts Alan M. Goldberg agoldber@mitre.org NOTICE This technical data was produced for the U.S. Government under Contract No. 50-SPNA-9-00010, and is subject to the Rights in Technical Data - General clause at FAR 52.227-14 (JUN 1987) © 2004 The MITRE Corporation Approved for public release; distribution unlimited
  • 2. Motivation NPOESS will collect an unprecedented quantity and variety of satellite environmental data from a constellation of satellites carrying multiple remote sensing and in situ sensors. NPOESS realized the need for standardization and good metadata in connection with environmental data sets. Global data sets, such as those produced by satellites or used in modeling and simulation, are at the leading edge of information stewardship challenges. Data sets will be delivered to operational users by pre-subscription through Interface Data Processors (IDPS) at the four US processing Centrals, and through direct readout Field Terminals (FTS) worldwide. Data will be retrieved by near- and long-term researchers through NOAA’s data archives. NPOESS committed early to using existing standards in a rational manner to support our mission goals. – Standards-based data products ease product usability. – Comprehensive metadata contributes to maximizing the continuing value of large, complex data sets. Standards-based metadata ease product understanding. – Working from a clean sheet of paper, NPOESS can serve as a “workshop” for best practices in data stewardship. This paper presents the results of planning for the NPOESS data product delivery system. It presents a view of the optimal data product design, much of which is being implemented in NPOESS.
  • 3. End-to-End Process NPOESS-Unique Collection Space Segment NPOESS-Unique Processing Demodulator User Terminal Antenna RF Electronics Application Processing, Display, & Dissemination FT Processor FTS Element External I/F Other Data IDPS C3S/DRR Central Element Central The National Polar-orbiting Operational Environmental Satellite System (NPOESS) is managed by the NOAA Integrated Program Office, on behalf of the U.S. Dept. of Commerce, Dept. of Defense, and NASA. Northrop-Grumman Space Technology is prime contractor, with Raytheon the subcontractor for ground processing & operations
  • 4. Architectural Considerations Sizing • Overall data rates have increased by two orders of magnitude • more products • impact the on existing user systems and applications • decision to “loosely” couple • simple data interface with flexibility on both sides • decided to deliver relatively short-duration data granules, containing typically 30 seconds of data Users and use patterns: designed to serve operational users, current science users, and future archival researchers • Operational users need effectively all of the data as soon as possible • Research users need current environmental information, but usually have time and resources to improve product quality with post-processing • Archival researchers look for highly selective data sets • Users via Centrals or via field terminals Sensor complexity • multiple versions of the data • the raw bitstreams originating from the sensors (RDRs) • calibrated fluxes measured by the sensor (SDRs) • environmental variables estimated at the source (EDRs) • sensors themselves produce a wide variety of data types • various techniques to maximize performance within bandwidth Anticipating change • detailed formats, contents, product lists, and interfaces that must be accommodated by NPOESS data product framework format through the mission lifetime
  • 5. SENSORS CCSDS (mux, code, frame) &Encrypt Delivered Raw Packetization Compression Aux. Sensor Data Cal. Source ENVIRONMENTAL SOURCE COMPONENTS Filtration A/D Conversion Detection Flux Manipulation Comm Processing C3S Comm Receiver RDR Production RDR Level TDR Level SDR Production SDR Level EDR Level EDR Production IDPS produces mission data sets which recreate or estimate signals at 4 points in the sensing chain. Ancillary data, brought from other systems and used in EDR processing, is captured. Auxiliary data, produced within the NPOESS system to support processing, is kept with the mission data or incorporated in documentation. Metadatata provided for all. IDPS Comm Xmitter Data Store OTHER SUBSYSTEMS SPACE SEGMENT Data Products Are More Than Scenes
  • 6. Typical Data Organization RDRs SDRs EDRs Ancillary Time-Series of Packet Types Binary Headers u v w Multispectral Imagery Vector Flux Data Volume Geolocation Calibration Table Slit Spectra Quality Column Data Thematic Layers Geolocation ? Abstract External Data Sounding FT Spectra Imagery
  • 7. First Decision – File Format The NPOESS program identified several key characteristics for the file format within which data products would be delivered. Hierarchical Data Format ver.5 (HDF5) was found to be the best solution within technical and programmatic constraints: • A single format with proven ability to handle environmental data products (EDRs); more abstract data structures in RDRs, TDRs & SDRs; and other products delivered to users and archives • Capability to incorporate full metadata • Supported by the user community and other institutional support, with an adequate practical lifetime; interoperable with DoD standards • Ability to handle large data sets (such as full orbits) and small data sets (individual granules) with acceptable efficiency • Ability to handle multiple arrays and array types within the same granule, such as observational data arrays and geolocation arrays • High efficiency for reading and writing; built-in compression function; capability to “chunk” large data arrays to access prestructured subsets • Sufficiently self-documenting to permit variable formats • Available with development tools to expedite file definition and applications • Acceptable licensing terms • Supported on all likely user platforms & operating systems • Support for all likely atomic data types • Simple data objects and groups which permit application-specific structures to be created
  • 8. File Structure Implemented in HDF5 B Dataset A File file attributes Header Granule Granule Data Dataspace int16 current = 12e-9 Datatype 2 Rank Dim_3=2 Dim_2=4 Dim_1=5 Dimensions Dataset Dataset temp = 56 time = 32.4 Dataset Dataset Attributes chunked; compressed Storage layout granule attributes granule attributes HDF5 provides a simple, logical file structure based on a Dataset, comprising a data array and a header which describes the dataspace. Datasets can be structured hierarchically. NPOESS datasets and additional attributes – incorporating the metadata – combine into granules, and granules combine into product files. Granules are concatenated in a file in such a way that they can be addressed either collectively or individually. Individual datasets will be created for elements such as mission data, quality assessment, geospatial location, time, illumination, and viewing geometry. Users may access subsets of the full data using HDF5 utilities.
  • 9. Second Decision - Metadata Based on extensive prior experiences in the earth sciences and other sciences, NPOESS highest level operational requirements specified that comprehensive metadata would be delivered with the data. In our context, metadata incorporates the following “data about data”: • Identification • Content summary • Content meaning • Content structure and format • Acquisition and processing history – provenance • Distribution and availability The National Spatial Data Infrastructure (NSDI) provides basic guidance for metadata. The Federal Geospatial Data Committee (FGDC) sets content standards. The program establishes compatible extensions to the standard. The program also defines the representation. Over one hundred metadata items have been defined for NPOESS. Basic identification metadata is duplicated in a User Block at the front of the file, where it can be read without HDF software.
  • 10. FGDC Metadata Base & RS Standards Metadata 1 2 Identification 3 Spatial Data Organization Data Quality 6 Distribution Information Base Standard RSE Standard 4 7 Metadata Reference Information 5 Spatial Reference 8 Platform & Mission Information Entity & Attribute Information 9 Instrument Information
  • 11. Quasistatic NSDI metadata Complete granule metadata Dynamic NSDI metadata Dynamic detail metadata eHandbook external reference Identification (UB) Aggregate metadata Common metadata file Unique metadata granule Concept Analysis Repositories NPOESS will create all applicable metadata identified in the FGDC Base Standard and Remote Sensing Extensions, plus mission-unique. Comprehensive metadata at the granule level will be captured in different ways, consistent with usage patterns. Metadata which changes infrequently will be collected in an external online reference. Metadata which changes with each granule is saved with it. Metadata which describes a file or all the granules in the file is extracted and stored with the file.
  • 12. Third Decision – File Organization Primary characteristics of a design •Systematic data organization •Simplifies data maintenance, enhancement, documentation, retrieval, visualization, and exploitation •NPOESS evaluation looked at lessons learned from EOS, and best practices elsewhere •Must work with abstract, generalized scientific, and specifically geospatial data sets Result: Specific derived requirements closely match the “Climate & Forecast Conventions”, primarily developed for use with netCDF. It provides guidelines for consistent, complete, and clear data entity and attribute definition. NPOESS is attempting to implement this data design. Need clear identification of truly independent and dependent variable contents: •For most unresampled data, the natural independent variables are the index attributes defining the discrete points in space and time at which the data samples were collected. Index might be a time-series sample number, detector number, energy or spectral bin. •Usually include an associated independent variable for each index attribute, such as time, direction, position, or energy level. These relate to indices by calibration, known to be very accurate. •Some associated independent variables may be multidimensional. E.g., polar geolocation or solar elevation is a deterministic function of spatial indices. •Always a primary dependent variable: function of independent variable(s). May be an abstract binary object, function of time or place. Usually, well defined variables which are a function of temporal, spatial, or spectral indices. •Often 1+ associated dependent variable arrays, such as quality estimates or telemetry values, associated by design with the primary independant variable •Optional supplementary arrays, such as calibrations, are functions of 1+ index attributes used in the primary and associated data arrays. Concatenation: One index attribute, usually time or time-like, can often be defined as the index which establishes continuity from one granule to the next.
  • 13. Clear Index & Array Definitions Primary Index Primary Array e.g., Flux, Brightness, Counts, NDVI Index Attribute n-Dimensional Dependant Variable (Entity) Array 2-D Independent Variable Arrays e.g., lat/lon, sun alt/az, land mask Associated Independent Variable(s)
  • 14. Putting Together the Framework Granule Boundaries Time Supporting Timeline Data Sensor Data Quality Flag Bits Sensor Data Arrays Geolocation Arrays
  • 15. Putting Together the Framework With careful design, and based on lessons learned from previous programs, a comprehensive data product design can be achieved. The design eases development and maintenance, by providing a common approach to data and metadata. Granules form the basic unit of production and cataloguing. They are essentially self-contained. Each granule contains the primary data arrays, associated data arrays, and descriptive attributes – including metadata – needed to understand the information content. To facilitate efficient delivery, multiple granules of the same type are combined into product files. Common attributes of all granules in a file may be extracted to the file level as common metadata. Summary and identification attributes are created and added to the file. Basic identification metadata is extracted from the HDF format and saved as an ASCII ‘user block’ at the physical start of the file. Finally, metadata which changes only rarely is maintained separately as an electronic handbook, and is incorporated in the granules by reference.
  • 16. eHandbook HDF File Block Root File Attributes User Block File Metadata Common Metadata Data Product Other File Attributes Granule Granule Granule Granule Attributes Granule Metadata Datasets Index Variables Primary Array Associated Arrays Other Granule Attributes Granule Attributes Granule Metadata Datasets Associated Arrays Granule Metadata Other Granule Attributes Index Variables Primary Array Granule Attributes Datasets Index Variables Primary Array Associated Arrays Other Granule Attributes

Editor's Notes

  1. The National Polar-orbiting Operational Environmental Satellite System (NPOESS) is managed by the NOAA Integrated Program Office, on behalf of the U.S. Dept. of Commerce, Dept. of Defense, and NASA. Northrop-Grumman Space Technology is prime contractor, with Raytheon the subcontractor for ground processing &amp; operations
  2. IDPS produces mission data sets which recreate or estimate signals at 4 points in the sensing chain. Ancillary data, brought from other systems and used in EDR processing, is captured. Auxiliary data, produced within the NPOESS system to support processing, is kept with the mission data or incorporated in documentation. Metadatata provided for all.
  3. HDF5 provides a simple, logical file structure based on a Dataset, comprising a data array and a header which describes the dataspace. Datasets can be structured hierarchically. NPOESS datasets and additional attributes – incorporating the metadata – combine into granules, and granules combine into product files. Granules are concatenated in a file in such a way that they can be addressed either collectively or individually. Individual datasets will be created for elements such as mission data, quality assessment, geospatial location, time, illumination, and viewing geometry. Users may access subsets of the full data using HDF5 utilities.
  4. NPOESS will create all applicable metadata identified in the FGDC Base Standard and Remote Sensing Extensions, plus mission-unique. Comprehensive metadata at the granule level will be captured in different ways, consistent with usage patterns. Metadata which changes infrequently will be collected in an external online reference. Metadata which changes with each granule is saved with it. Metadata which describes a file or all the granules in the file is extracted and stored with the file.
  5. Conceptual representation of the relationship among granule data components. Mission or scientific data is kept as arrays organized by the collection indices. Auxiliary data (e.g., attitude, ephemeris, and scanner angle encoder readings), much of as a time sequence, are kept in a similar manner. Derived parameters (e.g., polar and cartesian geolocation, and illumination and viewing geometry) and processing flags and status bits are added. Each granule is complete by itself. When desired, multiple granules can be concatenated along the time dimension. This structure can be retained until the observed or processed data is resampled onto another coordinate system.