In this deck from the DDN User Group at SC19, Andrey Kudryavtsev from Intel presents: Reimagining HPC Compute and Storage Architecture with Intel Optane Technology.
"In the face of unrelenting data growth, rising numbers of high-performance computing (HPC) workloads are memory bound. Caught between the high cost and limited capacity of DRAM and the lower performance of 3D NAND SSDs, HPC users increasingly find that despite workarounds, they’re unable to keep pace with skyrocketing data volumes and increasingly complex challenges. Intel Optane technology is designed to address these challenges. Available as Intel Optane DC persistent memory and Intel Optane DC Solid State Drives (Intel Optane SSDs), this technology closes the capacity, cost, and performance gaps between DRAM and 3D NAND SSDs, providing opportunities to advance data-intensive workloads while increasing uptime and flexibility in the HPC data center."
Watch the video: https://wp.me/p3RLHQ-ljl
Learn more: https://www.intel.com/content/dam/www/public/us/en/documents/technology-briefs/what-is-optane-technology-brief.pdf
and
https://www.ddn.com/company/events/user-group-sc/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
3. 3
A complete hierarchy
Brings more data into
Memory
Bring storage closer to
the processor
Brings more Data into
Solid State StorageIntel® 3D NAND
4. What is Intel® Optane™ Technology?
Transistor-less Design
Data is written at a bit level,
so each cell's state can be
changed from a 0 or 1
independently of other cells
4
Intel® Optane™
Memory Media
Intel® Optane™ Technology design is fundamentally different from NAND
5. Intel® Optane™ SSD design is fundamentally different
Every NAND
SSD is slowed
by garbage
collection
256K NAND block
Write in pagesPages get stale
Copy to
Empty block
Empty block
Permission to use truck icon made by monkik from www.flaticon.com with reference
5
6. Intel® Optane™ SSD DC P4800x. The ideal caching solution.
6
+ =
Lower latency + higher endurance = greater SDS system efficiency
lower & more consistent latency
Average Read Latency under Random Write Workload1
Higher endurance
Drive Writes Per Day (DWPD)2
More efficient
Cache as a % of Storage Capacity3
Intel®
Optane™ SSD
DC P4800X
Intel® SSD
DC P4600
(3D NAND)3.0
DWPD
Intel® SSD DC P4600Intel® Optane™ SSD DC P4800X
Up to
60.0
DWPD
Intel®
Optane™
SSD DC
P4800X
as cache
Intel® SSD DC
P4600 (3D NAND)
as cache
Storage
Storage
Intel® SSD DC P4600Intel® Optane™ SSD DC P4800X
See appendix b FOR 1,2 & 3
7. Big and Affordable
memory
128, 256, 512GB Modules
DDR4 pin compatible
Byte addressable
Direct load/store access
High performance
storage
Native persistence
High reliability and
security
8. Persistent Memory: In a league of its own
HIGHER BANDWIDTH
Up to 3.7X read/write
bandwidth vs NVMe SSDs,
with one module; more with
multiple modules
LOWER LATENCY
Orders of magnitude lower
latency than NVMe SSDs
• 1000X lower latency than
NAND NVMe SSD at 1GB/s
Performance results are based on testing as of Feb 22, 2019 and may not reflect all publicly available security updates. No product or component can be absolutely secure.
Results have been estimated based on tests conducted on pre-production systems, and provided to you for informational purposes. Software and workloads used in performance tests may have been optimized for performance only on Intel
microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You
should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to
www.intel.com/benchmarks.
2, 3 Based on internal testing by Intel . Configuration: see slides 38 and 39.
3.7
X
HIGHER
bandwidth
up
to
10
00
X
LOWER
latency
up
to
Intel DC P4610
NVMe SSD
Intel Optane
DC SSD
P4800X
9. Operational Modes for Flexibility
7
Total Volatile Memory
Memory Mode
AFFORDABLE MEMORY CAPACITY
FOR MANY APPLICATIONS
Total Volatile Memory
Total Persistent Memory
App Direct Mode
VOLATILE AND PERSISTANT USAGE MODELS
MAX CAPACITY
Configurations shown are examples only; other configurations
supported – please refer to Intel Optane DC persistent memory
population guides
10. 6 slots per CPU
Least number of DIMM slots to
utilize max memory bandwidth
8 slots per CPU
Trade DDR bandwidth for smaller
board real estate on the DIMM slots
12 slots per CPU
Max memory capacity and
bandwidth
Purley Memory DIMM Slot Configuration Examples†
10
2-2-2 2-1-1* 1-1-1
* No difference on functionality or performance when 2nd DIMM slot is in channel 0, 1 or 2 for that integrated memory controller (IMC)
† DIMM slots shown. While DRAM DIMMs can populate all slots shown, DCPMM is only populated in slot closest to CPU in each channel.
10 slots per CPU
Trade DDR bandwidth for smaller
board real estate on the DIMM slots
2-2-1
11. Intel® Optane™ DC Technology enhanced workloads
Best fit is shown, both products may be viable
*Other names and brands may be
claimed as the property of others.
11
HPCAI / AnalyticsStorage Infrastructure CommsDatabase
MACHINE LEARNING ANALYTICS
ApacheSpark*
REAL-TIME ANALYTICS
SAS*
RDMA/REPLICATION
SDS
Ceph*
MEMORY
VMware ESXi*
MSFTHyper-V*
KVM*
RedisLabs*
Memcached*
VDI
CONTENT DELIVERY
NETWORK (CDN)
CommsSPcustom
CACHING/
PERSISTENCE
SAPHANA*
MS-SQL*
Aerospike*
Redis*
RocksDB
HYPER-CONVERGED (HCI)
STORAGE
VMware vSAN*
MicrosoftS2D
Nutanix*
MEMORY
VMWareESXi*
MSFTHyper-V*
KVM*
SCRATCH & IO
NODES
HPCFlexMemory
Animated slide to show the gaps in the pyramid. The final image, after the animations run, is useful as a ‘before pyramid’. It shows the pyramid with all three gaps.
Intel is innovating in the memory and storage space to help reduce these gaps.
In that top gap there is a real need to extend memory. Intel® Optane™ DC Persistent Memory increases the size of available memory and adds persistent data retention. Persistence is important because if there is a power loss, the data is protected and the memory reload time on restart is eliminated. There is also a performance gain from persistence.
For the storage gap, Intel has introduced Intel® Optane™ DC SSDs. Optane-based SSDs are different than NAND-based SSDs providing increased IOPs and high write performance, and consistency. The need for speed and writing/rewriting of data, like caching, logging, and journaling solutions, are the ideal scenarios for the use of Intel Optane SSDs.
The key to understand Intel's store more strategy and these two amazing Intel products is realizing over decades architects and developers have been limited to two distinct places where data resides (memory/storage) and there is a huge gap between these too. In order to have a truly workload optimized system architectures, the gap needs to be filled so that we can more efficiently turn data from a burden an asset.
To fill the memory capacity gap we saw when DRAM was not scaling – Intel Optane DC Persistent Memory in the Data Center, and in the future for Client and Commercial systems is the big bet to fill that gap.
To fill the storage performance gap to bring much lower latencies and faster access to data sets at the top of the storage layer.
To fill the cost performance gap with our differentiated and synergistic 3D NAND SSD Portfolio in the capacity tier, bringing massive density at low cost, in the warm and pushing into the cold tiers of the storage hierarchy currently serviced by HDDs
Intel Optane Technology utilizes Optane storage and memory media, which represents a radically different memory design versus standard NAND. NAND technology uses transistors or gates where the cells that contain the bit as being a 0 or 1 are arranged in blocks, so if any cell needs to be changed the entire block must be re-written.
Optane technology is a transistor-less design where data is written at a bit level so each cell's state can be changed from a 0 or 1 independently of the other cells. The cells can be changed to a high or low resistance state by applying different voltages that change the bit to a 0 or 1. Since the physical state of the cell material has changed, they can hold their values indefinitely, even without power. This more efficient design radically increases performance, improves durability, enhances capacity, and helps to lower Data Center power consumption.
Based on 3D XPoint memory media, the Intel Optane family boasts a new generation of solid-state drive (SSD) products built from the ground up and is the biggest breakthrough in Intel chip engineering in 25 years. Optane provides memory and storage that is fast, dense and non-volatile, helping to eliminate data bottlenecks.
The performance of every NAND SSD suffers from the overhead process known as garbage collection.
<CLICK> Due to the physics of NAND media, everybody's NAND media reads and writes in pages, but erases only in blocks. For performance reasons, page updates typically are written to a new unused block. As you update and write new data across the drive, old pages become stale. Stale pages can add up pretty fast on an SSD, and at some point a significant chunk of the block is obsolete. Ever been stuck behind a garbage truck? Data doesn’t like it either. For the sake of simplification we’re showing you a sequential write scenario, but know that a more typical random read/write scenario follows the same rules. When your block is obsolete, the garbage collection process kicks in. Garbage collection is a cumbersome 3-step process.
<CLICK> In the first step, current pages are copied to an empty block.
<CLICK> Step 2 consists of cleaning up the entire block by erasing both obsolete pages and the pages you just moved.
<CLICK> Finally, the block you emptied is available for new writes, and the NAND program/erase cycle starts all over again. This cumbersome process is why you’ll see reputable companies (like Intel) reporting drive performance numbers using “pre-conditioned” SSDs – meaning there is data already on the drive before the test starts, versus a fresh-out-of-the-box drive where data can be written anywhere with no garbage collection.
Slide 7: HPE has broadly qualified Intel® Optane™ SSD DC P4800X to be the go to cache option in your software defined storage environments. Intel® Optane™ DC SSDs provides the low latency, endurance, and efficiency needed from a cache drive. Let’s look at them individually:
On the far left we show latency.
Here we compared Optane SSD with our best NAND SSD. The grey (stairs like) line represents the random writes.
The Green waves, represent the NAND; notice how the average speed of read back (or latency) got higher as the load got bigger.
Notice the blue line, it never changed. This line is Intel Optane SSD and not the X-axis.
The middle sections shows endurance
Drive write per day is the number of times we can write to the drive and read it back w/o error, per day. Again, we are comparing Optane SSD with our own NAND SSD. We see that Optane SSD is 60 drive writes per day, or 20X better than most NAND SSDs on the market.
On the far right we see efficiency
When the CPU and memory are not limiting the workload, then lower latency, higher IOPS, and better endurance allow each server to support more virtual machines. Additionally, you no longer need 10% of your total capacity in cache. Many times you only need 1 or 2, 375GB Optane SSDs. The bottom line is, you don’t need to purchase many gigabytes and you can support more virtual machines per server with Intel® Optane™ DC SSDs.
Today we are launching a way to close that gap.
Big and Affordable Memory
128, 256, 512GB DDR4 Compatible Modules
Direct Load/Store Byte-Addressable Access
Super high-performance storage
High-reliability and Security
Strong ECC + Sparing Capability
256-bit AES-XTP Encryption on board
5-year warranty
Two Operational Modes
Two modes of operation…Memory Mode and App Direct. Each module can operate in each of these modes (as well as one sub mode of App Direct), or can be partitioned at boot to operate in both modes (aka Mixed Mode). We will be coming back to these modes several times over throughout our presentations to cover mode details, but at this early stage, I want to impress upon you the major points:
Memory Mode does not require SW modifications – you can operate the modules in this mode and take advantage of the large capacity, but the module acts as though it were volatile in order to pretend to be DRAM. When in this mode, you just have a single large pool of volatile memory that is seen by the system. This is useful as you can use with many legacy SW applications, and gain immediate advantage.
App Direct Mode allows you to take full advantage of the product, and its persistent nature, as it will be seen as a separate pool of PERSISTENT MEMORY by the system, and treated as such. In order to operate in this mode, the SW needs to be PM-aware, meaning that some part of it, the data access layer that interfaces with the module, needs to know there is such a thing as persistent memory. We will spend a lot more time on this later today, and in the afternoon…but for now, what is important to remember is that there are two distinct memory pools that are recognized and used by persistent memory aware SW.
We are really excited about the opportunity for both Intel® Optane™ Technologies, and what they can do for application efficiencies.
So now that we have two flavors of Intel® Optane™ Technology for the data center, which one should we use for which workloads?
Some of these are obvious once you know the state of different partner software. And of course SSD for storage and persistent memory for memory.
Specifically Intel® Optane™ DC SSDs for storage like CEPH (which is an open Filesystem) and VMware vSAN or Microsoft S2D – Storage Spaces Direct.
And then Intel® Optane™ DC Persistent Memory for things like IMDB (In Memory Database).
We have put together a general guideline for several workloads with the knowledge that in several cases either Intel® Optane™ Technology could produce a solution and the particulars will depend on the particulars of each customer.
This gives our partners a starting point in their conversations with customers based on our experience and initial testing results.
Intel® Optane™ DC SSD has been adopted by many leading companies as a caching or write buffer accelerator.
We also want to point out that this is NOT an “either or” situation.
AS you can see in the hyper-converged section there is opportunity for both SSD and persistent memory to play a role, and we can envision systems that use both technologies in a single performance centric platform.