SlideShare a Scribd company logo
1 of 15
Conf-DDDD-IN
Hierarchical Data Format
(HDF) Status Update
ESIP Summer 2018
This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C.
This document does not contain technology or Technical Data controlled under either the U.S. International Traffic
in Arms Regulations or the U.S. Export Administration Regulations.
Elena Pourmal
EED2 Technical Lead
epourmal@hdfgroup.org
Conf-DDDD-IN
2
• Update on current HDF releases
– New features
• Moving to HDF5 1.10 series
– Controlling HDF5 file versioning
– Taking advantage of HDF compression
• What is coming in HDF5 1.12?
– Non-POSIX I/O and new defaults
• Getting help with HDF software and data
Outline
Conf-DDDD-IN
3
• HDF5 1.8.21 (June 2018)
– Vulnerability patches
– Tools fixes
– Support for Intel Fortran v 18 compiler on
Windows
– There will be one more maintenance release of
HDF5 1.8 version. It is time to move to HDF5 1.10
series!
• HDFView 3.0 (June 2018)
• For more info see https://hdfgroup.org
Current HDF releases
Conf-DDDD-IN
4
• HDF5 1.10.2 ( March 2018)
– Vulnerability patches
– Enabling control over the HDF5 file
versioning
– Enabling compression for parallel (MPI I/O)
writes
• HDF5 1.10.3 is coming later this year
– Parallel compression enhancements
• See https://hdfgroup.org for details
Current HDF releases
Conf-DDDD-IN
5
• Controlling HDF5 file versioning
– HDF5 library is ALWAYS backward
compatible
• New version of the library will always read files
created by the earlier versions
– HDF5 library is forward compatible
• By default the library will create objects in a file
that can be read by the earlier versions of the
library
– HDF5 file does not have a version
– Versioning is done on an object level
Moving to HDF5 1.10 series
Conf-DDDD-IN
6
• Q: How one can assure that HDF5 files created by
HDF5 library version 1.10 and later will be read by
the applications based on HDF5 1.8 and earlier?
• A: Use H5Pset_libver_bounds( hid_t fapl_id,
H5F_libver_t low, H5F_libver_t high ) in
applications
• Uses file access property to specify the features that can
be created by the library specified by the “high”
parameter and latest versions of the objects available in
the library specified by the “low” parameter.
• H5F_LIBVER_EARLIEST, H5F_LIBVER_V18,
H5F_LIBVER_V110 , H5F_LIBVER_LATEST
Moving to HDF5 1.10 series
Conf-DDDD-IN
7
• Taking advantage of HDF5 compression
– Compression works for both sequential and
parallel (MPI I/O) writes/reads
– HDF5 supports GZIP and SZIP compressions
• Open Source and free SZIP from German Climate
Computing Center
https://www.dkrz.de/redmine/projects/aec/wiki/Downloads
• Fully compatible with SZIP provided by The HDF Group
(encoder is not free for commercial data usage)
– Multiple third-party compressions available as
plugins; see
https://portal.hdfgroup.org/display/support/Contributions
– One compression doesn’t fit all data!
Moving to HDF5 1.10 series
Conf-DDDD-IN
8
• Using compression with Sentinel Data
– HDF5 file that was created by converting Sentinel 1
GeoTiff file.
– File contains one 32-bit integer array with dimensions
20256x25478; dimensions correspond to the number
of image strips stored in the original Sentinel 1
GeoTiff file.
HDF5 Compression
Compression Compression ratio File size in bytes
No compression 1 2065283096 (2GB)
SZIP 1.062 1944126897
GZIP 1.966 1049969129
SHUFFLE + GZIP 2.192 941879752 (< 1GB)
Conf-DDDD-IN
9
• Using compression with SeaSat Data
– HDF5 file contained 3 datasets
– Table below shows CR for each dataset when using
GZIP, SZIP and combinations of SHUFFLE and GZIP
– Different compressions (highlighted) can be applied to
get compression ratio (CR) of 1.9
HDF5 Compression
Compression CR HH CR latitude CR
longitude
Total file
size in bytes
TCR
Original file
(GZIP)
1.167 2.693 2.747 407848072 1
SZIP 1.337 3.789 4.423 317040127 1.29
SHUFFLE +
GZIP
1.329 20.049 24.003 216176244 1.89
Conf-DDDD-IN
10
• Using compression with SeaSat Data
– Compression and decompression will differ depending
on the method
– Table below shows elapsed times for the h5repack to
encode data and h5dump to display data for SeaSat
file.
HDF5 Compression
Compression TCR Time to compress
using h5repack
Time to decompress with
h5dump
SZIP 1.29 0:11.34 6:21.98
BLOSC 1.59 0:13.87 6:15.92
SHUFFLE + GZIP 1.89 0:20.91 6:31.29
Conf-DDDD-IN
11
• New defaults and file format changes
– UTF-8 encoding for strings (vs. current ASCII
encoding)
– Setting “low” to H5F_LIBVER_V18 (vs.
H5F_LIBVER_EARLIEST in
H5Pset_libver_bounds( hid_t fapl_id,
H5F_libver_t low, H5F_libver_t high )
• Better performance for groups and attributes
traversals
• No limitation on the attribute sizes
– File format extensions to address misc. file
format issues (e.g., 64-bit dataspaces encoding)
What is coming in HDF5 1.12?
Conf-DDDD-IN
12
• Virtual Object Layer to perform I/O to any
storage including Object Storage
– Plugin architecture for VOL plugins
• REST VOL plugin
• VOL plugins in progress:
– RADOS: Reliable Autonomic Distributed Object Store
is part of CEPH distributed storage system.
– DAOS: Distributed Asynchronous Object Storage
(DAOS) is an open-source software-defined object
store.
What is coming in HDF5 1.12?
Conf-DDDD-IN
13
Virtual Object Layer
HDF5 APIs
VOL Layer
DAOS
plugin
HDF5
plugin
REST
plugin
ADIOS
plugin
DAOS Object Store
HDF5 library internals
Virtual File Driver
Amazon Cloud
ADIOS File on POSIX File System
HDF5 File on POSIX File System
MPI I/O SEC2 S3
Conf-DDDD-IN
14
Questions?
Conf-DDDD-IN
15
This work was supported by NASA/GSFC under
Raytheon Co. contract number NNG15HZ39C.
in partnership with

More Related Content

What's hot

AdvFS/Advanced File System Ccncepts
AdvFS/Advanced File System CcnceptsAdvFS/Advanced File System Ccncepts
AdvFS/Advanced File System CcnceptsJustin Goldberg
 
Kernel Recipes 2016 - New hwmon device registration API - Jean Delvare
Kernel Recipes 2016 -  New hwmon device registration API - Jean DelvareKernel Recipes 2016 -  New hwmon device registration API - Jean Delvare
Kernel Recipes 2016 - New hwmon device registration API - Jean DelvareAnne Nicolas
 
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, Datatypes
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, DatatypesHDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, Datatypes
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, DatatypesThe HDF-EOS Tools and Information Center
 
A Kernel of Truth: Intrusion Detection and Attestation with eBPF
A Kernel of Truth: Intrusion Detection and Attestation with eBPFA Kernel of Truth: Intrusion Detection and Attestation with eBPF
A Kernel of Truth: Intrusion Detection and Attestation with eBPFoholiab
 
Effective Linux Development Using PetaLinux Tools 2017.4
Effective Linux Development Using PetaLinux Tools 2017.4Effective Linux Development Using PetaLinux Tools 2017.4
Effective Linux Development Using PetaLinux Tools 2017.4Zach Pfeffer
 
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxing
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxingKernel Recipes 2016 - Landlock LSM: Unprivileged sandboxing
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxingAnne Nicolas
 
Comprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesComprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesNetronome
 

What's hot (10)

AdvFS/Advanced File System Ccncepts
AdvFS/Advanced File System CcnceptsAdvFS/Advanced File System Ccncepts
AdvFS/Advanced File System Ccncepts
 
Performance Tuning in HDF5
Performance Tuning in HDF5 Performance Tuning in HDF5
Performance Tuning in HDF5
 
Kernel Recipes 2016 - New hwmon device registration API - Jean Delvare
Kernel Recipes 2016 -  New hwmon device registration API - Jean DelvareKernel Recipes 2016 -  New hwmon device registration API - Jean Delvare
Kernel Recipes 2016 - New hwmon device registration API - Jean Delvare
 
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, Datatypes
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, DatatypesHDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, Datatypes
HDF5 Advanced Topics - Object's Properties, Storage Methods, Filters, Datatypes
 
A Kernel of Truth: Intrusion Detection and Attestation with eBPF
A Kernel of Truth: Intrusion Detection and Attestation with eBPFA Kernel of Truth: Intrusion Detection and Attestation with eBPF
A Kernel of Truth: Intrusion Detection and Attestation with eBPF
 
Effective Linux Development Using PetaLinux Tools 2017.4
Effective Linux Development Using PetaLinux Tools 2017.4Effective Linux Development Using PetaLinux Tools 2017.4
Effective Linux Development Using PetaLinux Tools 2017.4
 
HDF4 and HDF5 Performance Preliminary Results
HDF4 and HDF5 Performance Preliminary ResultsHDF4 and HDF5 Performance Preliminary Results
HDF4 and HDF5 Performance Preliminary Results
 
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxing
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxingKernel Recipes 2016 - Landlock LSM: Unprivileged sandboxing
Kernel Recipes 2016 - Landlock LSM: Unprivileged sandboxing
 
HDF5 I/O Performance
HDF5 I/O PerformanceHDF5 I/O Performance
HDF5 I/O Performance
 
Comprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesComprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge Cases
 

Similar to HDF Status Update

Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsInteroperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsThe HDF-EOS Tools and Information Center
 

Similar to HDF Status Update (20)

HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
Efficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAPEfficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAP
 
Hierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) UpdateHierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) Update
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)
 
HDF Update 2016
HDF Update 2016HDF Update 2016
HDF Update 2016
 
Transition from HDF4 to HDF5
Transition from HDF4 to HDF5 Transition from HDF4 to HDF5
Transition from HDF4 to HDF5
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Parallel HDF5
Parallel HDF5Parallel HDF5
Parallel HDF5
 
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsInteroperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
Parallel HDF5 Developments
Parallel HDF5 DevelopmentsParallel HDF5 Developments
Parallel HDF5 Developments
 
HDF Updae
HDF UpdaeHDF Updae
HDF Updae
 
HDF5 Tools
HDF5 ToolsHDF5 Tools
HDF5 Tools
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF Project Update
HDF Project UpdateHDF Project Update
HDF Project Update
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Introduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming ModelsIntroduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming Models
 
Support for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF GroupSupport for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF Group
 

More from The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

More from The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
Leveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software TestingLeveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software Testing
 
Google Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOSGoogle Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOS
 
Parallel Computing with HDF Server
Parallel Computing with HDF ServerParallel Computing with HDF Server
Parallel Computing with HDF Server
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
 
NASA Terra Data Fusion
NASA Terra Data FusionNASA Terra Data Fusion
NASA Terra Data Fusion
 

Recently uploaded

Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 

Recently uploaded (20)

Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 

HDF Status Update

  • 1. Conf-DDDD-IN Hierarchical Data Format (HDF) Status Update ESIP Summer 2018 This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C. This document does not contain technology or Technical Data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations. Elena Pourmal EED2 Technical Lead epourmal@hdfgroup.org
  • 2. Conf-DDDD-IN 2 • Update on current HDF releases – New features • Moving to HDF5 1.10 series – Controlling HDF5 file versioning – Taking advantage of HDF compression • What is coming in HDF5 1.12? – Non-POSIX I/O and new defaults • Getting help with HDF software and data Outline
  • 3. Conf-DDDD-IN 3 • HDF5 1.8.21 (June 2018) – Vulnerability patches – Tools fixes – Support for Intel Fortran v 18 compiler on Windows – There will be one more maintenance release of HDF5 1.8 version. It is time to move to HDF5 1.10 series! • HDFView 3.0 (June 2018) • For more info see https://hdfgroup.org Current HDF releases
  • 4. Conf-DDDD-IN 4 • HDF5 1.10.2 ( March 2018) – Vulnerability patches – Enabling control over the HDF5 file versioning – Enabling compression for parallel (MPI I/O) writes • HDF5 1.10.3 is coming later this year – Parallel compression enhancements • See https://hdfgroup.org for details Current HDF releases
  • 5. Conf-DDDD-IN 5 • Controlling HDF5 file versioning – HDF5 library is ALWAYS backward compatible • New version of the library will always read files created by the earlier versions – HDF5 library is forward compatible • By default the library will create objects in a file that can be read by the earlier versions of the library – HDF5 file does not have a version – Versioning is done on an object level Moving to HDF5 1.10 series
  • 6. Conf-DDDD-IN 6 • Q: How one can assure that HDF5 files created by HDF5 library version 1.10 and later will be read by the applications based on HDF5 1.8 and earlier? • A: Use H5Pset_libver_bounds( hid_t fapl_id, H5F_libver_t low, H5F_libver_t high ) in applications • Uses file access property to specify the features that can be created by the library specified by the “high” parameter and latest versions of the objects available in the library specified by the “low” parameter. • H5F_LIBVER_EARLIEST, H5F_LIBVER_V18, H5F_LIBVER_V110 , H5F_LIBVER_LATEST Moving to HDF5 1.10 series
  • 7. Conf-DDDD-IN 7 • Taking advantage of HDF5 compression – Compression works for both sequential and parallel (MPI I/O) writes/reads – HDF5 supports GZIP and SZIP compressions • Open Source and free SZIP from German Climate Computing Center https://www.dkrz.de/redmine/projects/aec/wiki/Downloads • Fully compatible with SZIP provided by The HDF Group (encoder is not free for commercial data usage) – Multiple third-party compressions available as plugins; see https://portal.hdfgroup.org/display/support/Contributions – One compression doesn’t fit all data! Moving to HDF5 1.10 series
  • 8. Conf-DDDD-IN 8 • Using compression with Sentinel Data – HDF5 file that was created by converting Sentinel 1 GeoTiff file. – File contains one 32-bit integer array with dimensions 20256x25478; dimensions correspond to the number of image strips stored in the original Sentinel 1 GeoTiff file. HDF5 Compression Compression Compression ratio File size in bytes No compression 1 2065283096 (2GB) SZIP 1.062 1944126897 GZIP 1.966 1049969129 SHUFFLE + GZIP 2.192 941879752 (< 1GB)
  • 9. Conf-DDDD-IN 9 • Using compression with SeaSat Data – HDF5 file contained 3 datasets – Table below shows CR for each dataset when using GZIP, SZIP and combinations of SHUFFLE and GZIP – Different compressions (highlighted) can be applied to get compression ratio (CR) of 1.9 HDF5 Compression Compression CR HH CR latitude CR longitude Total file size in bytes TCR Original file (GZIP) 1.167 2.693 2.747 407848072 1 SZIP 1.337 3.789 4.423 317040127 1.29 SHUFFLE + GZIP 1.329 20.049 24.003 216176244 1.89
  • 10. Conf-DDDD-IN 10 • Using compression with SeaSat Data – Compression and decompression will differ depending on the method – Table below shows elapsed times for the h5repack to encode data and h5dump to display data for SeaSat file. HDF5 Compression Compression TCR Time to compress using h5repack Time to decompress with h5dump SZIP 1.29 0:11.34 6:21.98 BLOSC 1.59 0:13.87 6:15.92 SHUFFLE + GZIP 1.89 0:20.91 6:31.29
  • 11. Conf-DDDD-IN 11 • New defaults and file format changes – UTF-8 encoding for strings (vs. current ASCII encoding) – Setting “low” to H5F_LIBVER_V18 (vs. H5F_LIBVER_EARLIEST in H5Pset_libver_bounds( hid_t fapl_id, H5F_libver_t low, H5F_libver_t high ) • Better performance for groups and attributes traversals • No limitation on the attribute sizes – File format extensions to address misc. file format issues (e.g., 64-bit dataspaces encoding) What is coming in HDF5 1.12?
  • 12. Conf-DDDD-IN 12 • Virtual Object Layer to perform I/O to any storage including Object Storage – Plugin architecture for VOL plugins • REST VOL plugin • VOL plugins in progress: – RADOS: Reliable Autonomic Distributed Object Store is part of CEPH distributed storage system. – DAOS: Distributed Asynchronous Object Storage (DAOS) is an open-source software-defined object store. What is coming in HDF5 1.12?
  • 13. Conf-DDDD-IN 13 Virtual Object Layer HDF5 APIs VOL Layer DAOS plugin HDF5 plugin REST plugin ADIOS plugin DAOS Object Store HDF5 library internals Virtual File Driver Amazon Cloud ADIOS File on POSIX File System HDF5 File on POSIX File System MPI I/O SEC2 S3
  • 15. Conf-DDDD-IN 15 This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C. in partnership with