SlideShare a Scribd company logo
1 of 7
CMM-H
Tiered Memory Solution with Built-in DRAM
Dr. Shuyi Pei
Ph.D., Sr. Engineer @Memory Solutions Lab.
Samsung Semiconductor Inc.
Larger capacity memory device at lower TCO
best suited for tiered memory solutions
Speed comparable to DRAM with NAND storage backed
and external battery power supply
Persistent memory option
CMM-H (CXL Memory Module, H: Hybrid)
Better system TCO
64-byte cache-granular fine grained access
to meet modern AI/ML workload needs
Small granularity access
Expanding capacity and utilization of memory for AI
• DRAM cache to move/store small-
sized data chunks suitable for AI/ML
Applications
• Improve data store efficiency by
writing data at the DRAM speed
• Low latency enabled by CXL.
mem protocol
Optimized for AI workloads
CMM-H Architecture
Computer System
Normal
I/O
Small
I/O
CXL.memory
DRAM Cache
CXL.io
NAND Flash
4KB
64 Bytes
128 Bytes
0
1
Title in Samsung Sharp Sans Bold (34)
Body text in Samsung Sharp Sans Medium (16)
Insert more text here.
Use this page when Samsung fonts are available.
Subtitle in Samsung Sharp Sans Bold (24)
**Compared to PCIe Gen4 NVMe SSD
• Small granularity data access
enable performance scales
with cache hits
• Direct memory access
advantage
• Large memory capacity at
lower TCO
Memory Reads per Second (Million)
Tiered Memory
8.0
2.2
1.6
1.4
1.3
1.2
1.1
1.0
1.0
0.9
16.3
3.6
2.4
2.0
1.7
1.5
1.4
1.2
1.1
1.0
1.1 1.2 1.3
1.5
1.8
2.2
2.7
3.6
5.9
32.7
43
9.9
4.9
3.2
2.4
1.9
1.66
1.4
1.2
1.1
512B
256B
128B
64B
100.0
10.0
1.0
0.1
10 20 30 40 50 60 70 80 90 100
Cache Hit Rate (%)
0
1
Title in Samsung Sharp Sans Bold (34)
Body text in Samsung Sharp Sans Medium (16)
Insert more text here.
Use this page when Samsung fonts are available.
Subtitle in Samsung Sharp Sans Bold (24)
5
• Battery-backed DRAM with
speed comparable to DDR5
• Persistence achieved with data
dumps to NAND flash
• Supports flush-on-fail with CXL
2.0 GPF feature
Persistent Memory
Operations per Second (Million)
0
35
70
105
140
100% Write 50% Write: 50% Read 10% Write: 90% Reads
DDR5 DRAM
CMM-H
Persistent Memory
Persistent Memory Competitor
0
1
Title in Samsung Sharp Sans Bold (34)
Body text in Samsung Sharp Sans Medium (16)
Insert more text here.
Use this page when Samsung fonts are available.
Subtitle in Samsung Sharp Sans Bold (24)
6
**Compared to PCIe Gen4 NVMe SSD
• Direct memory access
advantage; no software cache
overhead
• Up to ~10x better end-to-end
performance with FPGA-based
PoC**
0
12500
25000
37500
50000
0 28 55 83 110
Inferences
per
Second
Cache Hit Ratio (%)
End-to-End Performance
Block IO
CMM-H
Block IO + Host Memory Cache
DRAM Memory
0
1
Title in Samsung Sharp Sans Bold (34)
Body text in Samsung Sharp Sans Medium (16)
Insert more text here.
Use this page when Samsung fonts are available.
Subtitle in Samsung Sharp Sans Bold (24)
7
Movie Recommendation System Demo

More Related Content

What's hot

Chapter 6: Expansion Buses
Chapter 6: Expansion BusesChapter 6: Expansion Buses
Chapter 6: Expansion Buses
askme
 
DDR, GDDR, HBM SDRAM Memory
DDR, GDDR, HBM SDRAM MemoryDDR, GDDR, HBM SDRAM Memory
DDR, GDDR, HBM SDRAM Memory
Subhajit Sahu
 
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSDHigh-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
inside-BigData.com
 

What's hot (20)

Moving to PCI Express based SSD with NVM Express
Moving to PCI Express based SSD with NVM ExpressMoving to PCI Express based SSD with NVM Express
Moving to PCI Express based SSD with NVM Express
 
NVMe overview
NVMe overviewNVMe overview
NVMe overview
 
Parallel Processing.pptx
Parallel Processing.pptxParallel Processing.pptx
Parallel Processing.pptx
 
3D V-Cache
3D V-Cache 3D V-Cache
3D V-Cache
 
PCIe and PCIe driver in WEC7 (Windows Embedded compact 7)
PCIe and PCIe driver in WEC7 (Windows Embedded compact 7)PCIe and PCIe driver in WEC7 (Windows Embedded compact 7)
PCIe and PCIe driver in WEC7 (Windows Embedded compact 7)
 
Pci express technology 3.0
Pci express technology 3.0Pci express technology 3.0
Pci express technology 3.0
 
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptxParallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
Parallel Processing & Pipelining in Computer Architecture_Prof.Sumalatha.pptx
 
Usb 3.0 technology mindshare
Usb 3.0 technology mindshareUsb 3.0 technology mindshare
Usb 3.0 technology mindshare
 
DDR, GDDR, HBM Memory : Presentation
DDR, GDDR, HBM Memory : PresentationDDR, GDDR, HBM Memory : Presentation
DDR, GDDR, HBM Memory : Presentation
 
Andes RISC-V processor solutions
Andes RISC-V processor solutionsAndes RISC-V processor solutions
Andes RISC-V processor solutions
 
Solid state drives
Solid state drivesSolid state drives
Solid state drives
 
Chapter 6: Expansion Buses
Chapter 6: Expansion BusesChapter 6: Expansion Buses
Chapter 6: Expansion Buses
 
Differnce of two processors
Differnce of two processorsDiffernce of two processors
Differnce of two processors
 
Multiprocessor
MultiprocessorMultiprocessor
Multiprocessor
 
Dual port ram
Dual port ramDual port ram
Dual port ram
 
DDR, GDDR, HBM SDRAM Memory
DDR, GDDR, HBM SDRAM MemoryDDR, GDDR, HBM SDRAM Memory
DDR, GDDR, HBM SDRAM Memory
 
FPGA Accelerated Computing Using Amazon EC2 F1 Instances - CMP308 - re:Invent...
FPGA Accelerated Computing Using Amazon EC2 F1 Instances - CMP308 - re:Invent...FPGA Accelerated Computing Using Amazon EC2 F1 Instances - CMP308 - re:Invent...
FPGA Accelerated Computing Using Amazon EC2 F1 Instances - CMP308 - re:Invent...
 
Processor Organization and Architecture
Processor Organization and ArchitectureProcessor Organization and Architecture
Processor Organization and Architecture
 
Double data rate (ddr)
Double data rate (ddr)Double data rate (ddr)
Double data rate (ddr)
 
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSDHigh-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
High-Performance Big Data Analytics with RDMA over NVM and NVMe-SSD
 

Similar to Samsung: CMM-H Tiered Memory Solution with Built-in DRAM

Introduction of ram ddr3
Introduction of ram ddr3Introduction of ram ddr3
Introduction of ram ddr3
Technocratz
 

Similar to Samsung: CMM-H Tiered Memory Solution with Built-in DRAM (20)

AMD: 4th Generation EPYC CXL Demo
AMD: 4th Generation EPYC CXL DemoAMD: 4th Generation EPYC CXL Demo
AMD: 4th Generation EPYC CXL Demo
 
Sc19 ibm hms final
Sc19 ibm hms finalSc19 ibm hms final
Sc19 ibm hms final
 
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
 
diskmfr
diskmfrdiskmfr
diskmfr
 
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
Q1 Memory Fabric Forum: Breaking Through the Memory WallQ1 Memory Fabric Forum: Breaking Through the Memory Wall
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
 
Introduction of ram ddr3
Introduction of ram ddr3Introduction of ram ddr3
Introduction of ram ddr3
 
Introduction of ram ddr3
Introduction of ram ddr3Introduction of ram ddr3
Introduction of ram ddr3
 
CXL Memory Expansion, Pooling, Sharing, FAM Enablement, and Switching
CXL Memory Expansion, Pooling, Sharing, FAM Enablement, and SwitchingCXL Memory Expansion, Pooling, Sharing, FAM Enablement, and Switching
CXL Memory Expansion, Pooling, Sharing, FAM Enablement, and Switching
 
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
 
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXLQ1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
 
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
 
Towards Software Defined Persistent Memory
Towards Software Defined Persistent MemoryTowards Software Defined Persistent Memory
Towards Software Defined Persistent Memory
 
SanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and CassandraSanDisk: Persistent Memory and Cassandra
SanDisk: Persistent Memory and Cassandra
 
Big Memory for HPC
Big Memory for HPCBig Memory for HPC
Big Memory for HPC
 
RedisConf18 - Re-architecting Redis-on-Flash with Intel 3DX Point™ Memory
RedisConf18 - Re-architecting Redis-on-Flash with Intel 3DX Point™ MemoryRedisConf18 - Re-architecting Redis-on-Flash with Intel 3DX Point™ Memory
RedisConf18 - Re-architecting Redis-on-Flash with Intel 3DX Point™ Memory
 
Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation        Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation
 
Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure
Astera Labs:  Intelligent Connectivity for Cloud and AI InfrastructureAstera Labs:  Intelligent Connectivity for Cloud and AI Infrastructure
Astera Labs: Intelligent Connectivity for Cloud and AI Infrastructure
 
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and DevicesQ1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
 
diskmfr
diskmfrdiskmfr
diskmfr
 
Q1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: SMART CXL Product LineupQ1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: SMART CXL Product Lineup
 

More from Memory Fabric Forum

More from Memory Fabric Forum (20)

H3 Platform CXL Solution_Memory Fabric Forum.pptx
H3 Platform CXL Solution_Memory Fabric Forum.pptxH3 Platform CXL Solution_Memory Fabric Forum.pptx
H3 Platform CXL Solution_Memory Fabric Forum.pptx
 
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IPQ1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
 
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptxQ1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
 
Q1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: About MindShare TrainingQ1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: About MindShare Training
 
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL-Related Activities within OCPQ1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
 
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: CXL Controller by Montage TechnologyQ1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
 
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin LabsQ1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
 
Q1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: CXL Form Factor PrimerQ1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: CXL Form Factor Primer
 
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
Q1 Memory Fabric Forum: Memory Fabric in a Composable SystemQ1 Memory Fabric Forum: Memory Fabric in a Composable System
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
 
Q1 Memory Fabric Forum: Big Memory Computing for AI
Q1 Memory Fabric Forum: Big Memory Computing for AIQ1 Memory Fabric Forum: Big Memory Computing for AI
Q1 Memory Fabric Forum: Big Memory Computing for AI
 
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory ModulesQ1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
 
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 UpdateQ1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
 
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
 
Q1 Memory Fabric Forum: XConn CXL Switches for AI
Q1 Memory Fabric Forum: XConn CXL Switches for AIQ1 Memory Fabric Forum: XConn CXL Switches for AI
Q1 Memory Fabric Forum: XConn CXL Switches for AI
 
Q1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory VisionQ1 Memory Fabric Forum: VMware Memory Vision
Q1 Memory Fabric Forum: VMware Memory Vision
 
MemVerge: Memory Expansion Without Breaking the Budget
MemVerge: Memory Expansion Without Breaking the BudgetMemVerge: Memory Expansion Without Breaking the Budget
MemVerge: Memory Expansion Without Breaking the Budget
 
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
Micron - CXL Enabling New Pliability in the Modern Data Center.pptxMicron - CXL Enabling New Pliability in the Modern Data Center.pptx
Micron - CXL Enabling New Pliability in the Modern Data Center.pptx
 
MemVerge: Past Present and Future of CXL
MemVerge: Past Present and Future of CXLMemVerge: Past Present and Future of CXL
MemVerge: Past Present and Future of CXL
 
Photowave Presentation Slides - 11.8.23.pptx
Photowave Presentation Slides - 11.8.23.pptxPhotowave Presentation Slides - 11.8.23.pptx
Photowave Presentation Slides - 11.8.23.pptx
 
TE Connectivity: Card Edge Interconnects
TE Connectivity: Card Edge InterconnectsTE Connectivity: Card Edge Interconnects
TE Connectivity: Card Edge Interconnects
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Samsung: CMM-H Tiered Memory Solution with Built-in DRAM

  • 1. CMM-H Tiered Memory Solution with Built-in DRAM Dr. Shuyi Pei Ph.D., Sr. Engineer @Memory Solutions Lab. Samsung Semiconductor Inc.
  • 2. Larger capacity memory device at lower TCO best suited for tiered memory solutions Speed comparable to DRAM with NAND storage backed and external battery power supply Persistent memory option CMM-H (CXL Memory Module, H: Hybrid) Better system TCO 64-byte cache-granular fine grained access to meet modern AI/ML workload needs Small granularity access Expanding capacity and utilization of memory for AI
  • 3. • DRAM cache to move/store small- sized data chunks suitable for AI/ML Applications • Improve data store efficiency by writing data at the DRAM speed • Low latency enabled by CXL. mem protocol Optimized for AI workloads CMM-H Architecture Computer System Normal I/O Small I/O CXL.memory DRAM Cache CXL.io NAND Flash 4KB 64 Bytes 128 Bytes
  • 4. 0 1 Title in Samsung Sharp Sans Bold (34) Body text in Samsung Sharp Sans Medium (16) Insert more text here. Use this page when Samsung fonts are available. Subtitle in Samsung Sharp Sans Bold (24) **Compared to PCIe Gen4 NVMe SSD • Small granularity data access enable performance scales with cache hits • Direct memory access advantage • Large memory capacity at lower TCO Memory Reads per Second (Million) Tiered Memory 8.0 2.2 1.6 1.4 1.3 1.2 1.1 1.0 1.0 0.9 16.3 3.6 2.4 2.0 1.7 1.5 1.4 1.2 1.1 1.0 1.1 1.2 1.3 1.5 1.8 2.2 2.7 3.6 5.9 32.7 43 9.9 4.9 3.2 2.4 1.9 1.66 1.4 1.2 1.1 512B 256B 128B 64B 100.0 10.0 1.0 0.1 10 20 30 40 50 60 70 80 90 100 Cache Hit Rate (%)
  • 5. 0 1 Title in Samsung Sharp Sans Bold (34) Body text in Samsung Sharp Sans Medium (16) Insert more text here. Use this page when Samsung fonts are available. Subtitle in Samsung Sharp Sans Bold (24) 5 • Battery-backed DRAM with speed comparable to DDR5 • Persistence achieved with data dumps to NAND flash • Supports flush-on-fail with CXL 2.0 GPF feature Persistent Memory Operations per Second (Million) 0 35 70 105 140 100% Write 50% Write: 50% Read 10% Write: 90% Reads DDR5 DRAM CMM-H Persistent Memory Persistent Memory Competitor
  • 6. 0 1 Title in Samsung Sharp Sans Bold (34) Body text in Samsung Sharp Sans Medium (16) Insert more text here. Use this page when Samsung fonts are available. Subtitle in Samsung Sharp Sans Bold (24) 6 **Compared to PCIe Gen4 NVMe SSD • Direct memory access advantage; no software cache overhead • Up to ~10x better end-to-end performance with FPGA-based PoC** 0 12500 25000 37500 50000 0 28 55 83 110 Inferences per Second Cache Hit Ratio (%) End-to-End Performance Block IO CMM-H Block IO + Host Memory Cache DRAM Memory
  • 7. 0 1 Title in Samsung Sharp Sans Bold (34) Body text in Samsung Sharp Sans Medium (16) Insert more text here. Use this page when Samsung fonts are available. Subtitle in Samsung Sharp Sans Bold (24) 7 Movie Recommendation System Demo

Editor's Notes

  1. 1 TB MS-SSD memory with 8GB internal cache; Prototype performance scales for smaller granular memory accesses also as cache hit rate increases
  2. 16GB MS-SSD persistent memory; FPGA-based Prototype performance better than competitors (Optane) and close to DDR5 performance
  3. End-to-end recommendation inference performance also scales as cache hit rate increases and comes closer to higher performance and cost DDR5
  4. Movie Recommendation system is one good example to show MS-SSD’s performance. MS-SSD is HW device with Cache based ; most cost and power efficient AI recommendation system ** 40X better IO performance : PCIe Gen4 NVMe SSD 4KB Random read(0.9M IOPS) vs MS-SSD 64Byte (42.9M IOPS) (Embedding table column size for DLRM is 64Byte) https://ai.facebook.com/blog/dlrm-an-advanced-open-source-deep-learning-recommendation-model/