SlideShare a Scribd company logo
1 of 15
DAOS For Applications
Mohamad Chaarawi
Extreme Scale Architecture & Development
Intel CorporationCloud & Enterprise Solutions Group 2
2
DAOS Storage Architecture
2
DAOS Storage Engine
Intel® Optane Memory
SPDK
NVMe
Interface
Metadata, low-latency I/Os &
indexing/query
Bulk data
3D-NAND/Optane SSD
PMDK
Memory
Interface
HDD
AI/Analytics/Simulation Workflow
DAOS library
POSIX I/O HDF5 Spark…
Compute Nodes
MPI-I/O Python
Libfabric • Low-latency & high-message-rate communications
• Native support for RDMA & scalable collective operations
• Support for iWarp, RoCE, Infiniband, OPA, Slingshot, …
• DAOS library directly linked with the applications
• No need for dedicated cores
• Low memory/CPU footprint
• End-to-end OS bypass
• Non-blocking, lockless, snapshot support, …
• Fine-grained I/O with media selection strategy
• Only application data on SSD to maximize throughput
• Small I/Os aggregated in pmem & migrated to SSD in large
chunks
• Full userspace model with no system calls on I/O path
• Built-in storage management infrastructure (control plane)
• NFSv4-like ACL
Delivers high-IOPS, high-bandwidth and low-latency
storage with advanced features in a single tier
Storage Nodes
Intel CorporationCloud & Enterprise Solutions Group 3
Aggregate related datasets into manageable and
coherent entities
• Distributed consistency & automated recovery
• Full Versioning
• Simplified data management
• Snapshot
• Cross-tier Migration
• Indexing
Storage Containers
DAOS Container
datadatadatafile
dir
datadatafile
dir
datadatadatadatafile
dir
root
Encapsulated POSIX Namespace File-per-process
DAOS Container
datadatadatadatafile
datadatadatadatafile
datadatadatadatafile
datadatadatadatafile
DAOS Container
datadatadatadataset
group
datadatadataset
group
datadatadatadatadataset
group
group
HDF5 « File » Key-value store
Graph
DAOS Container
valuekey
valuekey
valuekey
valuekey valuekey
DAOS Container
node
node
node
node
node
node
DAOS Container
Columnar Database
key
key
key
key
Value
Value
Value
Value
Value
Value
Value
Value
Intel CorporationCloud & Enterprise Solutions Group 4
 Native support for structured, semi-structured & unstructured data
models
• Built on top of DCPMM
• Unconstrained by POSIX serialization
• Custom attributes
• Data access time orders of magnitude faster (µs)
• Scalable concurrent updates & high IOPS
• Non-blocking
• Enable in-storage computing
DAOS Objects
DAOS Storage Engine
Open Source Apache 2.0 License
Data Model Library/Framework
Array KV Store Multi-level KV Store
key1
val1
key3
val3
@
@
Application
NVMe SSD
DAOS
key1
val1
root
@
key3
val3
@
val2
key2
@@
@
@
val2 con’d
val2
key2
@
Application
Intel CorporationCloud & Enterprise Solutions Group 5
5
DAOS Tools
Tool dmg daos
Target Administrators Users
Lustre Equivalent lctl/mkfs/mount/IML lfs
Functionality • Storage provisioning
• Burn-in
• Firmware update
• Data plane mgmt & monitoring
• Configure/monitor scrubbing
• Pool mgmt
• Telemetry
• Pool query
• Container mgmt
• Unified namespace mgmt
• Container user attributes
• Snapshots
• Object debugging
• POSIX container configuration
Intel CorporationCloud & Enterprise Solutions Group 6
6
Application Interface
Dataset
Mover
Capacity Tier
PFS, S3, HSM, …
DAOS Storage Engine
Open Source Apache 2.0 License
POSIX I/O
HPC APPs
HDF5 MPI-IO Python
Apache
Spark
Apache
Arrow
Analytics/AI APPs
TensorFlowSEGY
Intel CorporationCloud & Enterprise Solutions Group 7
 DAOS API is new and very flexible
• Multi-level keys
• Different value types supported
• Can build all data models / IO middleware on top of it
 Most applications still based on POSIX
• Need a smooth migration path with little to no application changes
• Quick path to realize performance of DCPMM & DAOS
 POSIX implemented as a middleware instead of being the
building block of all data models.
POSIX I/O Support
Application / Framework
DAOS library (libdaos)
DAOS File System (libdfs)
Interception Library
dfuse
Single process address space
DAOS Storage Engine
RPC RDMA
End-to-end
user space
No system calls
Intel® QLC
3D Nand
SSD
Intel CorporationCloud & Enterprise Solutions Group 8
MPI-IO Driver for DAOS
The DAOS MPI-IO driver is implemented within the
I/O library in MPICH (ROMIO).
• Added as an ADIO driver
• Portable to Open-MPI, Intel MPI, etc.
• https://github.com/pmodels/mpich
MPI Files use the same DFS mapping to the
DAOS Object Model
• MPI Files can be accessed through the DFS
API
• MPI Files can be accessed through regular
POSIX with a dfuse mount over the container.
Application works seamlessly by just specifying the use of the driver by appending “daos:” to the path.
MPI-IO ROMIO driver (https://github.com/pmodels/mpich/tree/master/src/mpi/romio/adio/ad_daos)
POSIX / MPI-IO File
DAOS Byte Array Object
Special DAOS Object:
 1 Level Key
 1 Byte records
 Configurable Chunk Size
Intel CorporationCloud & Enterprise Solutions Group 9The information on this page is subject to the use and disclosure restrictions provided on the second page to this document.
HDF5 VOL Architecture



 Three main components:
• HDF5 Library
• DAOS VOL Connector
• (External) HDF5 Test Suite
HDF5 API
VOL Layer
VFD Layer
Native VOL
DAOS
VOL
SEC2
MPIO
File System DAOS
HDF5 Tools Test Suite
New Component
…
…
DAOS APIPOSIX API
Core HDF5
Library
External
Test
Suite
External
VOL
Connector
Enhanced Component
Native Component
Through
MPI I/O
Intel CorporationCloud & Enterprise Solutions Group 10
 No longer requires separate version of HDF5
• Compatible with main develop branch of HDF5
• Compatible with 1.12.x release series of HDF5 with VOL support
 Currently supported features
• New H5M MAP API to expose K/V interface to HDF5 users
• Variable length datatypes are now also supported
• Chunking is recommended storage layout to get most of DAOS performance
• Tools support (h5dump, h5ls, h5repack etc)
 Coming by end of the year
• Independent metadata writes (= independent object creation)
• Asynchronous I/O
 Available from: https://git.hdfgroup.org/projects/HDF5VOL/repos/daos-vol/
• See user’s guide for more detailed list of supported features
HDF5 DAOS VOL Connector – Current Status
Intel CorporationCloud & Enterprise Solutions Group 11
11
Unified Namespace
DAOS Object Store:
 Addresses pools, containers with uuids; objects with 128-bit IDs.
 Applications/Users are used to access files / directories in a traditional namespace
Unified Namespace allows users to create links between a file/dir in a system
namespace to DAOS pools & containers:
 daos container create --path=/mnt/project1/userA/NS1 --pool=uuid --
type=POSIX/HDF5/etc.
 Path created above becomes a special file or directory (depending on container type) with
an extended attribute with the pool and container information.
 Accessing that path from DAOS aware middleware will make the link on the fly with the
DAOS UNS library.
Intel CorporationCloud & Enterprise Solutions Group 12
12
Spark Input/Output Support
DAOS
FileSystem
DAOS POSIX API
Java Wrapper
HDFS
Hadoop FileSystem Abstract Class
Disks
DAOS
Storage
DAOS Library, DPDK/PMDK/SPDK used, Kernel bypass, zero copy
DAOS/Arrow
Wrapper
Apache Arrow Data Source
DAOS API
Wrapper
DAOS Native Data Source
(Parquet, ORC, etc.)
Scan/Parse
Project, Join, ML, etc.
Native Scan/Parse
(CSV, Parquet, ORC, etc.)
Current Spark
Implemented
Planned
Intel CorporationCloud & Enterprise Solutions Group 13
 Pythonic bindings called pydaos
• Export key-value store objects
• Integrated with python dictionaries
• Support python iterator, direct assignments, …
• Scalable & performant
• Bulk insert/retrieve
• Core written in C
• Python 2.7 & 3 support
 Python integration
• dbm
• pyprob
 TODO
• Expose snapshots
• Integration with NumPy, …
Python Support
Intel CorporationCloud & Enterprise Solutions Group 14
Resources
 Source code on GitHub
• https://github.com/daos-stack/daos
 Documentation
• http://daos.io
Community mailing list
• https://daos.groups.io
DAOS solution brief
• https://www.intel.com/content/www/us/en/high-performance-computing/
15

More Related Content

What's hot

Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Scylla Summit 2022: Scylla 5.0 New Features, Part 1Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Scylla Summit 2022: Scylla 5.0 New Features, Part 1ScyllaDB
 
From cache to in-memory data grid. Introduction to Hazelcast.
From cache to in-memory data grid. Introduction to Hazelcast.From cache to in-memory data grid. Introduction to Hazelcast.
From cache to in-memory data grid. Introduction to Hazelcast.Taras Matyashovsky
 
Presto Summit 2018 - 09 - Netflix Iceberg
Presto Summit 2018  - 09 - Netflix IcebergPresto Summit 2018  - 09 - Netflix Iceberg
Presto Summit 2018 - 09 - Netflix Icebergkbajda
 
Apache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsApache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsAlluxio, Inc.
 
RocksDB detail
RocksDB detailRocksDB detail
RocksDB detailMIJIN AN
 
Apache Spark Core – Practical Optimization
Apache Spark Core – Practical OptimizationApache Spark Core – Practical Optimization
Apache Spark Core – Practical OptimizationDatabricks
 
Cosco: An Efficient Facebook-Scale Shuffle Service
Cosco: An Efficient Facebook-Scale Shuffle ServiceCosco: An Efficient Facebook-Scale Shuffle Service
Cosco: An Efficient Facebook-Scale Shuffle ServiceDatabricks
 
SeaweedFS introduction
SeaweedFS introductionSeaweedFS introduction
SeaweedFS introductionchrislusf
 
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...Databricks
 
Optimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache SparkOptimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache SparkDatabricks
 
Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Edureka!
 
How to understand and analyze Apache Hive query execution plan for performanc...
How to understand and analyze Apache Hive query execution plan for performanc...How to understand and analyze Apache Hive query execution plan for performanc...
How to understand and analyze Apache Hive query execution plan for performanc...DataWorks Summit/Hadoop Summit
 
Storing time series data with Apache Cassandra
Storing time series data with Apache CassandraStoring time series data with Apache Cassandra
Storing time series data with Apache CassandraPatrick McFadin
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsDatabricks
 
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)Ryan Blue
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversScyllaDB
 
The Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization OpportunitiesThe Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization OpportunitiesDatabricks
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDatabricks
 
Time-Series Apache HBase
Time-Series Apache HBaseTime-Series Apache HBase
Time-Series Apache HBaseHBaseCon
 

What's hot (20)

Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Scylla Summit 2022: Scylla 5.0 New Features, Part 1Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
 
From cache to in-memory data grid. Introduction to Hazelcast.
From cache to in-memory data grid. Introduction to Hazelcast.From cache to in-memory data grid. Introduction to Hazelcast.
From cache to in-memory data grid. Introduction to Hazelcast.
 
Presto Summit 2018 - 09 - Netflix Iceberg
Presto Summit 2018  - 09 - Netflix IcebergPresto Summit 2018  - 09 - Netflix Iceberg
Presto Summit 2018 - 09 - Netflix Iceberg
 
Apache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsApache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic Datasets
 
Log Structured Merge Tree
Log Structured Merge TreeLog Structured Merge Tree
Log Structured Merge Tree
 
RocksDB detail
RocksDB detailRocksDB detail
RocksDB detail
 
Apache Spark Core – Practical Optimization
Apache Spark Core – Practical OptimizationApache Spark Core – Practical Optimization
Apache Spark Core – Practical Optimization
 
Cosco: An Efficient Facebook-Scale Shuffle Service
Cosco: An Efficient Facebook-Scale Shuffle ServiceCosco: An Efficient Facebook-Scale Shuffle Service
Cosco: An Efficient Facebook-Scale Shuffle Service
 
SeaweedFS introduction
SeaweedFS introductionSeaweedFS introduction
SeaweedFS introduction
 
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
 
Optimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache SparkOptimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache Spark
 
Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala Big Data Processing with Spark and Scala
Big Data Processing with Spark and Scala
 
How to understand and analyze Apache Hive query execution plan for performanc...
How to understand and analyze Apache Hive query execution plan for performanc...How to understand and analyze Apache Hive query execution plan for performanc...
How to understand and analyze Apache Hive query execution plan for performanc...
 
Storing time series data with Apache Cassandra
Storing time series data with Apache CassandraStoring time series data with Apache Cassandra
Storing time series data with Apache Cassandra
 
Fine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark JobsFine Tuning and Enhancing Performance of Apache Spark Jobs
Fine Tuning and Enhancing Performance of Apache Spark Jobs
 
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
The Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization OpportunitiesThe Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization Opportunities
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache Spark
 
Time-Series Apache HBase
Time-Series Apache HBaseTime-Series Apache HBase
Time-Series Apache HBase
 

Similar to DAOS Middleware overview

VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld
 
DELLEMC_Portfolio_hyperlinks_Complete
DELLEMC_Portfolio_hyperlinks_CompleteDELLEMC_Portfolio_hyperlinks_Complete
DELLEMC_Portfolio_hyperlinks_CompleteDELLEMC Technologies
 
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016MLconf
 
Hortonworks Data Platform with IBM Spectrum Scale
Hortonworks Data Platform with IBM Spectrum ScaleHortonworks Data Platform with IBM Spectrum Scale
Hortonworks Data Platform with IBM Spectrum ScaleAbhishek Sood
 
Red Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewRed Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewMarcel Hergaarden
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
DUG'20: 13 - HPE’s DAOS Solution Plans
DUG'20: 13 - HPE’s DAOS Solution PlansDUG'20: 13 - HPE’s DAOS Solution Plans
DUG'20: 13 - HPE’s DAOS Solution PlansAndrey Kudryavtsev
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big DataRommel Garcia
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big DataGreat Wide Open
 
Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red_Hat_Storage
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFSUSE Italy
 
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio ManfredOSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio ManfredNETWAYS
 
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020Redis Labs
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSandeep Patil
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Trishali Nayar
 
State of the Container Ecosystem
State of the Container EcosystemState of the Container Ecosystem
State of the Container EcosystemVinay Rao
 
IBM Platform Computing Elastic Storage
IBM Platform Computing  Elastic StorageIBM Platform Computing  Elastic Storage
IBM Platform Computing Elastic StoragePatrick Bouillaud
 
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for women
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for womenHadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for women
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for womenmaharajothip1
 

Similar to DAOS Middleware overview (20)

VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
 
DELLEMC_Portfolio_hyperlinks_Complete
DELLEMC_Portfolio_hyperlinks_CompleteDELLEMC_Portfolio_hyperlinks_Complete
DELLEMC_Portfolio_hyperlinks_Complete
 
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
 
Hortonworks Data Platform with IBM Spectrum Scale
Hortonworks Data Platform with IBM Spectrum ScaleHortonworks Data Platform with IBM Spectrum Scale
Hortonworks Data Platform with IBM Spectrum Scale
 
Hadoop data management
Hadoop data managementHadoop data management
Hadoop data management
 
Red Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) OverviewRed Hat Storage 2014 - Product(s) Overview
Red Hat Storage 2014 - Product(s) Overview
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
DUG'20: 13 - HPE’s DAOS Solution Plans
DUG'20: 13 - HPE’s DAOS Solution PlansDUG'20: 13 - HPE’s DAOS Solution Plans
DUG'20: 13 - HPE’s DAOS Solution Plans
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big Data
 
Open Source Security Tools for Big Data
Open Source Security Tools for Big DataOpen Source Security Tools for Big Data
Open Source Security Tools for Big Data
 
Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance Red Hat Storage Day Dallas - Defiance of the Appliance
Red Hat Storage Day Dallas - Defiance of the Appliance
 
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMFGestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
Gestione gerarchica dei dati con SUSE Enterprise Storage e HPE DMF
 
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio ManfredOSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
OSDC 2010 | Use Distributed Filesystem as a Storage Tier by Fabrizio Manfred
 
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
 
Spectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN CachingSpectrum Scale Unified File and Object with WAN Caching
Spectrum Scale Unified File and Object with WAN Caching
 
Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...Software Defined Analytics with File and Object Access Plus Geographically Di...
Software Defined Analytics with File and Object Access Plus Geographically Di...
 
State of the Container Ecosystem
State of the Container EcosystemState of the Container Ecosystem
State of the Container Ecosystem
 
IBM Platform Computing Elastic Storage
IBM Platform Computing  Elastic StorageIBM Platform Computing  Elastic Storage
IBM Platform Computing Elastic Storage
 
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for women
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for womenHadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for women
Hadoop Maharajathi,II-M.sc.,Computer Science,Bonsecours college for women
 
{code} and containers
{code} and containers{code} and containers
{code} and containers
 

More from Andrey Kudryavtsev

DUG'20: 12 - DAOS in Lenovo’s HPC Innovation Center
DUG'20: 12 - DAOS in Lenovo’s HPC Innovation CenterDUG'20: 12 - DAOS in Lenovo’s HPC Innovation Center
DUG'20: 12 - DAOS in Lenovo’s HPC Innovation CenterAndrey Kudryavtsev
 
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...Andrey Kudryavtsev
 
DUG'20: 10 - Storage Orchestration for Composable Storage Architectures
DUG'20: 10 - Storage Orchestration for Composable Storage ArchitecturesDUG'20: 10 - Storage Orchestration for Composable Storage Architectures
DUG'20: 10 - Storage Orchestration for Composable Storage ArchitecturesAndrey Kudryavtsev
 
DUG'20: 09 - DAOS Middleware Update
DUG'20: 09 - DAOS Middleware UpdateDUG'20: 09 - DAOS Middleware Update
DUG'20: 09 - DAOS Middleware UpdateAndrey Kudryavtsev
 
DUG'20: 08 - DAOS-SEGY Mapping
DUG'20: 08 - DAOS-SEGY MappingDUG'20: 08 - DAOS-SEGY Mapping
DUG'20: 08 - DAOS-SEGY MappingAndrey Kudryavtsev
 
DUG'20: 07 - Storing High-Energy Physics data in DAOS
DUG'20: 07 - Storing High-Energy Physics data in DAOSDUG'20: 07 - Storing High-Energy Physics data in DAOS
DUG'20: 07 - Storing High-Energy Physics data in DAOSAndrey Kudryavtsev
 
DUG'20: 06 - DAOS Adventures at CERN Openlab
DUG'20: 06 - DAOS Adventures at CERN OpenlabDUG'20: 06 - DAOS Adventures at CERN Openlab
DUG'20: 06 - DAOS Adventures at CERN OpenlabAndrey Kudryavtsev
 
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS Testbed
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS TestbedDUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS Testbed
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS TestbedAndrey Kudryavtsev
 
DUG'20: 04 - DAOS Feature Update
DUG'20: 04 - DAOS Feature UpdateDUG'20: 04 - DAOS Feature Update
DUG'20: 04 - DAOS Feature UpdateAndrey Kudryavtsev
 
DUG'20: 03 - Online compression with QAT in DAOS
DUG'20: 03 - Online compression with QAT in DAOSDUG'20: 03 - Online compression with QAT in DAOS
DUG'20: 03 - Online compression with QAT in DAOSAndrey Kudryavtsev
 
DUG'20: 02 - Accelerating apache spark with DAOS on Aurora
DUG'20: 02 - Accelerating apache spark with DAOS on AuroraDUG'20: 02 - Accelerating apache spark with DAOS on Aurora
DUG'20: 02 - Accelerating apache spark with DAOS on AuroraAndrey Kudryavtsev
 
DUG'20: 01 - Welcome & DAOS Update
DUG'20: 01 - Welcome & DAOS UpdateDUG'20: 01 - Welcome & DAOS Update
DUG'20: 01 - Welcome & DAOS UpdateAndrey Kudryavtsev
 

More from Andrey Kudryavtsev (12)

DUG'20: 12 - DAOS in Lenovo’s HPC Innovation Center
DUG'20: 12 - DAOS in Lenovo’s HPC Innovation CenterDUG'20: 12 - DAOS in Lenovo’s HPC Innovation Center
DUG'20: 12 - DAOS in Lenovo’s HPC Innovation Center
 
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...
DUG'20: 11 - Platform Performance Evolution from bring-up to reaching link sa...
 
DUG'20: 10 - Storage Orchestration for Composable Storage Architectures
DUG'20: 10 - Storage Orchestration for Composable Storage ArchitecturesDUG'20: 10 - Storage Orchestration for Composable Storage Architectures
DUG'20: 10 - Storage Orchestration for Composable Storage Architectures
 
DUG'20: 09 - DAOS Middleware Update
DUG'20: 09 - DAOS Middleware UpdateDUG'20: 09 - DAOS Middleware Update
DUG'20: 09 - DAOS Middleware Update
 
DUG'20: 08 - DAOS-SEGY Mapping
DUG'20: 08 - DAOS-SEGY MappingDUG'20: 08 - DAOS-SEGY Mapping
DUG'20: 08 - DAOS-SEGY Mapping
 
DUG'20: 07 - Storing High-Energy Physics data in DAOS
DUG'20: 07 - Storing High-Energy Physics data in DAOSDUG'20: 07 - Storing High-Energy Physics data in DAOS
DUG'20: 07 - Storing High-Energy Physics data in DAOS
 
DUG'20: 06 - DAOS Adventures at CERN Openlab
DUG'20: 06 - DAOS Adventures at CERN OpenlabDUG'20: 06 - DAOS Adventures at CERN Openlab
DUG'20: 06 - DAOS Adventures at CERN Openlab
 
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS Testbed
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS TestbedDUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS Testbed
DUG'20: 05 - Very Early Experiences with a 0.5 PByte DAOS Testbed
 
DUG'20: 04 - DAOS Feature Update
DUG'20: 04 - DAOS Feature UpdateDUG'20: 04 - DAOS Feature Update
DUG'20: 04 - DAOS Feature Update
 
DUG'20: 03 - Online compression with QAT in DAOS
DUG'20: 03 - Online compression with QAT in DAOSDUG'20: 03 - Online compression with QAT in DAOS
DUG'20: 03 - Online compression with QAT in DAOS
 
DUG'20: 02 - Accelerating apache spark with DAOS on Aurora
DUG'20: 02 - Accelerating apache spark with DAOS on AuroraDUG'20: 02 - Accelerating apache spark with DAOS on Aurora
DUG'20: 02 - Accelerating apache spark with DAOS on Aurora
 
DUG'20: 01 - Welcome & DAOS Update
DUG'20: 01 - Welcome & DAOS UpdateDUG'20: 01 - Welcome & DAOS Update
DUG'20: 01 - Welcome & DAOS Update
 

Recently uploaded

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 

Recently uploaded (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 

DAOS Middleware overview

  • 1. DAOS For Applications Mohamad Chaarawi Extreme Scale Architecture & Development
  • 2. Intel CorporationCloud & Enterprise Solutions Group 2 2 DAOS Storage Architecture 2 DAOS Storage Engine Intel® Optane Memory SPDK NVMe Interface Metadata, low-latency I/Os & indexing/query Bulk data 3D-NAND/Optane SSD PMDK Memory Interface HDD AI/Analytics/Simulation Workflow DAOS library POSIX I/O HDF5 Spark… Compute Nodes MPI-I/O Python Libfabric • Low-latency & high-message-rate communications • Native support for RDMA & scalable collective operations • Support for iWarp, RoCE, Infiniband, OPA, Slingshot, … • DAOS library directly linked with the applications • No need for dedicated cores • Low memory/CPU footprint • End-to-end OS bypass • Non-blocking, lockless, snapshot support, … • Fine-grained I/O with media selection strategy • Only application data on SSD to maximize throughput • Small I/Os aggregated in pmem & migrated to SSD in large chunks • Full userspace model with no system calls on I/O path • Built-in storage management infrastructure (control plane) • NFSv4-like ACL Delivers high-IOPS, high-bandwidth and low-latency storage with advanced features in a single tier Storage Nodes
  • 3. Intel CorporationCloud & Enterprise Solutions Group 3 Aggregate related datasets into manageable and coherent entities • Distributed consistency & automated recovery • Full Versioning • Simplified data management • Snapshot • Cross-tier Migration • Indexing Storage Containers DAOS Container datadatadatafile dir datadatafile dir datadatadatadatafile dir root Encapsulated POSIX Namespace File-per-process DAOS Container datadatadatadatafile datadatadatadatafile datadatadatadatafile datadatadatadatafile DAOS Container datadatadatadataset group datadatadataset group datadatadatadatadataset group group HDF5 « File » Key-value store Graph DAOS Container valuekey valuekey valuekey valuekey valuekey DAOS Container node node node node node node DAOS Container Columnar Database key key key key Value Value Value Value Value Value Value Value
  • 4. Intel CorporationCloud & Enterprise Solutions Group 4  Native support for structured, semi-structured & unstructured data models • Built on top of DCPMM • Unconstrained by POSIX serialization • Custom attributes • Data access time orders of magnitude faster (µs) • Scalable concurrent updates & high IOPS • Non-blocking • Enable in-storage computing DAOS Objects DAOS Storage Engine Open Source Apache 2.0 License Data Model Library/Framework Array KV Store Multi-level KV Store key1 val1 key3 val3 @ @ Application NVMe SSD DAOS key1 val1 root @ key3 val3 @ val2 key2 @@ @ @ val2 con’d val2 key2 @ Application
  • 5. Intel CorporationCloud & Enterprise Solutions Group 5 5 DAOS Tools Tool dmg daos Target Administrators Users Lustre Equivalent lctl/mkfs/mount/IML lfs Functionality • Storage provisioning • Burn-in • Firmware update • Data plane mgmt & monitoring • Configure/monitor scrubbing • Pool mgmt • Telemetry • Pool query • Container mgmt • Unified namespace mgmt • Container user attributes • Snapshots • Object debugging • POSIX container configuration
  • 6. Intel CorporationCloud & Enterprise Solutions Group 6 6 Application Interface Dataset Mover Capacity Tier PFS, S3, HSM, … DAOS Storage Engine Open Source Apache 2.0 License POSIX I/O HPC APPs HDF5 MPI-IO Python Apache Spark Apache Arrow Analytics/AI APPs TensorFlowSEGY
  • 7. Intel CorporationCloud & Enterprise Solutions Group 7  DAOS API is new and very flexible • Multi-level keys • Different value types supported • Can build all data models / IO middleware on top of it  Most applications still based on POSIX • Need a smooth migration path with little to no application changes • Quick path to realize performance of DCPMM & DAOS  POSIX implemented as a middleware instead of being the building block of all data models. POSIX I/O Support Application / Framework DAOS library (libdaos) DAOS File System (libdfs) Interception Library dfuse Single process address space DAOS Storage Engine RPC RDMA End-to-end user space No system calls Intel® QLC 3D Nand SSD
  • 8. Intel CorporationCloud & Enterprise Solutions Group 8 MPI-IO Driver for DAOS The DAOS MPI-IO driver is implemented within the I/O library in MPICH (ROMIO). • Added as an ADIO driver • Portable to Open-MPI, Intel MPI, etc. • https://github.com/pmodels/mpich MPI Files use the same DFS mapping to the DAOS Object Model • MPI Files can be accessed through the DFS API • MPI Files can be accessed through regular POSIX with a dfuse mount over the container. Application works seamlessly by just specifying the use of the driver by appending “daos:” to the path. MPI-IO ROMIO driver (https://github.com/pmodels/mpich/tree/master/src/mpi/romio/adio/ad_daos) POSIX / MPI-IO File DAOS Byte Array Object Special DAOS Object:  1 Level Key  1 Byte records  Configurable Chunk Size
  • 9. Intel CorporationCloud & Enterprise Solutions Group 9The information on this page is subject to the use and disclosure restrictions provided on the second page to this document. HDF5 VOL Architecture     Three main components: • HDF5 Library • DAOS VOL Connector • (External) HDF5 Test Suite HDF5 API VOL Layer VFD Layer Native VOL DAOS VOL SEC2 MPIO File System DAOS HDF5 Tools Test Suite New Component … … DAOS APIPOSIX API Core HDF5 Library External Test Suite External VOL Connector Enhanced Component Native Component Through MPI I/O
  • 10. Intel CorporationCloud & Enterprise Solutions Group 10  No longer requires separate version of HDF5 • Compatible with main develop branch of HDF5 • Compatible with 1.12.x release series of HDF5 with VOL support  Currently supported features • New H5M MAP API to expose K/V interface to HDF5 users • Variable length datatypes are now also supported • Chunking is recommended storage layout to get most of DAOS performance • Tools support (h5dump, h5ls, h5repack etc)  Coming by end of the year • Independent metadata writes (= independent object creation) • Asynchronous I/O  Available from: https://git.hdfgroup.org/projects/HDF5VOL/repos/daos-vol/ • See user’s guide for more detailed list of supported features HDF5 DAOS VOL Connector – Current Status
  • 11. Intel CorporationCloud & Enterprise Solutions Group 11 11 Unified Namespace DAOS Object Store:  Addresses pools, containers with uuids; objects with 128-bit IDs.  Applications/Users are used to access files / directories in a traditional namespace Unified Namespace allows users to create links between a file/dir in a system namespace to DAOS pools & containers:  daos container create --path=/mnt/project1/userA/NS1 --pool=uuid -- type=POSIX/HDF5/etc.  Path created above becomes a special file or directory (depending on container type) with an extended attribute with the pool and container information.  Accessing that path from DAOS aware middleware will make the link on the fly with the DAOS UNS library.
  • 12. Intel CorporationCloud & Enterprise Solutions Group 12 12 Spark Input/Output Support DAOS FileSystem DAOS POSIX API Java Wrapper HDFS Hadoop FileSystem Abstract Class Disks DAOS Storage DAOS Library, DPDK/PMDK/SPDK used, Kernel bypass, zero copy DAOS/Arrow Wrapper Apache Arrow Data Source DAOS API Wrapper DAOS Native Data Source (Parquet, ORC, etc.) Scan/Parse Project, Join, ML, etc. Native Scan/Parse (CSV, Parquet, ORC, etc.) Current Spark Implemented Planned
  • 13. Intel CorporationCloud & Enterprise Solutions Group 13  Pythonic bindings called pydaos • Export key-value store objects • Integrated with python dictionaries • Support python iterator, direct assignments, … • Scalable & performant • Bulk insert/retrieve • Core written in C • Python 2.7 & 3 support  Python integration • dbm • pyprob  TODO • Expose snapshots • Integration with NumPy, … Python Support
  • 14. Intel CorporationCloud & Enterprise Solutions Group 14 Resources  Source code on GitHub • https://github.com/daos-stack/daos  Documentation • http://daos.io Community mailing list • https://daos.groups.io DAOS solution brief • https://www.intel.com/content/www/us/en/high-performance-computing/
  • 15. 15

Editor's Notes

  1. Overall DAOS architecture Distributed object store built from the ground up for NVM technology. Includes 2 types of storage: SSD with different media, 3d-NAND, Optane. Available with NVMe interface, provide fast block-based storage Intel developing new user-space nvme driver, SPDK Pmem, Optane pmem. Fine grained storage directly memory mapped. Can do direct load store to the storage. No block abstraction Intel developing new storage stack PMDK to manage transactional update to pmem We don’t have any Object store to leverage these object stores. Lustre, GPFS built into kernel. Ceph still uses kernel IOs by going through Linux block layer. Here everything in userspace. DAOS new OS built on top of pmdk and spdk. highIOPS high BW. More advanced storage API, not just byte arrays, KV, etc. Still need to look into communication. Reuse several decade effort to optimize latency and BW for MPI. Re-use same communication middleware as MPI (libfabric). Client side, very lightweight with support to many middleware. All metadata stored on pmem. Small IOs land in pmem and later aggregated into SSD in the background to free space in Pmem. Application large data go in the SSD.
  2. Baseline API based on Storage container -> Object address space inside a pool. Container can be encapsulated POSIX namespace, or just a flat one. HDF5 has it’s own representations for a single HDF5. KV store can be one. Database, ACG 1 graph Container provides a baseline API with object. Unit of storage management is the container not a file. When you have millions of file, data migration is done on all those. We Simplified that. Fewer number of containers than files, so we can index them. Data migration of entire container. unit of Snapshot. Snapshot not typically available traditionally (used to be at FS level) but now at container level and exposed to user
  3. POSIX not the foundation any more. New key-array store API provided by DAOS with advanced features (ad-hoc concurrency control, snapshot support, asynchronous API, ..). DAOS API to be directly integrated with rich data model for better performance and extended capabilities. POSIX (i.e. File) support built as a middleware over the DAOS API. DAOS has a native KV API which means that the wire protocol is KV fetch/lookup. On the server, the tree structure is maintained in DCPM and values are either stored in DCPM or SSD depending on sizes. No file serialization is required. A client can directly lookup or insert a key/value pair over the wire and this can happen from many clients concurrently. Keys can be ordered, so we support range query. KV store are partitioned/sharded/replicated/erasure coded over multiple servers for performance & resilience. DAOS can also support complex query since it understand the key-value structure. The native indexing can also be augmented with ad-hoc application-driven index.
  4. Spark Integration with DAOS – different team at intel Written Hadoop filesystem abstraction class for DAOS. Integrated in DAOS source code. Use DAOS in user-space through libdfs transparently with this Hadoop connector. More value/acceleration to Spark, by integrating DAOS as a datasource through parquet / apache arrow, or through native DAOS API.
  5. Native Python bindings (pydaos) integrate KV store object directly with python dictionaries. Very pythonic API, you can have iterator, direct assignments, bulk insertion/retrieve Allow people who are not very familiar with C using high level language. Some integration with dbm, pyprob. More work to do