SlideShare a Scribd company logo
1 of 13
| AMD | Data Center Group| 2022
[Public]
AGENDA
◢ Paradigm Shift and Memory Composability Progression
◢ Runtime Memory Management
◢ Tiered Memory
◢ NUMA domains and Page Migration
◢ Runtime Memory Pooling
| AMD | Data Center Group| 2022
[Public]
PARADIGM SHIFT
◢ Scalable, high-speed CXL™ Interconnect and
PIM (Processing in Memory) contribute to the
paradigm shift in memory intensive computations
◢ Efficiency Boost of the next generation data
center
◢ Management of the Host/Accelerator
subsystems combined with the terabytes of the
Fabric Attached Memory
◢ Reduced complexity of the SW stack combined
with direct access to multiple memory
technologies
| AMD | Data Center Group| 2022
[Public]
MEMORY COMPOSABILITY PROGRESSION
Host R
P
Buffer
Host R
P
End
Point
View
Mem Direct Attach Memory Scale-Out Mem Pooling & Disaggregation
• Addresses the cost and
underutilization of the memory
• Multi-domain Pooled Memory -
memory in the pool is allocated/
released when required
• Workloads/ applications
benefiting from memory capacity
• Design optimization for {BW/$,
Memory Capacity/$, BW/core}
| AMD | Data Center Group| 2022
[Public]
RUNTIME MEMORY MANAGEMENT
| AMD | Data Center Group| 2022
[Public]
TIERED MEMORY
NUMA Domains
Page Migration
| AMD | Data Center Group| 2022
[Public]
TIERED MEMORY
NUMA DOMAINS
• Exposed to the HV, Guest OS, Apps
• OS-assisted optimization of the
memory subsystem
• Base on ACPI objects -
SRAT/SLIT/HMAT
| AMD | Data Center Group| 2022
[Public]
TIERED MEMORY
PAGE MIGRATION
CCD CCD CCD CCD
IOD
CCD CCD CCD CCD
IOD
Near
Mem
Far Mem
NUMA domains
PROC
CXL mem
Far mem
CXL
CXL
Far Mem
Near Mem
Memory Expansion
PROC
Far Mem
Near Mem
Mem as a Cache
CCD CCD CCD CCD
IOD
CCD CCD CCD CCD
IOD
Near Mem
Far Mem CXL mem
Far Mem
CXL
CXL
Near Mem
NUMA domains
MISS
Shorter latency Longer latency
Near
Mem
‒ Active page migration between Far and Near memories
‒ HV/Guest migrates hot pages into Near Mem and retire cold
pages into Far Mem
‒ Focused DMA to transfer required datasets from the Far to
Near Mem
SW Assisted Page Migration
‒ HW managed Hot Dataset
‒ Near Mem Miss redirected to the Far Mem
‒ App/ HV unawareness
DRAM as a cache optimization
| AMD | Data Center Group| 2022
[Public]
TIERED MEMORY
SW ASSISTED PAGE MIGRATION
Combined HW /SW tracking of the
Memory Page Activity/ “hotness”
Detecting Page(s) candidates for migration
Requesting HV/Guest permission to
migrate
HV/Guest API to Security Processor to
Migrate the Page(s)
Migration – stalling accessed to specific
pages/ copying the data
Page “hotness” –combined action
of the HW and SW tracking
HV/Guest authorization of the
migration
Security Processor as a root of
trust for performing the migration
| AMD | Data Center Group| 2022
[Public]
RUNTIME MEMORY ALLOCATION/POOLING
FABRIC ATTACHED MEMORY
Host Host
Tier2 Mem
Multi-Headed CXL
controller
 Multiple structures serve for fabric level memory pooling
 Combination of the private (dedicated to specific host) and shareable memory ranges
 Protection of the memory regions from unauthorized guests and hypervisor
 Allocation/Pooling of the memory ranges between Hosts is regulated by the fabric
aware SW layer (i.e., Fabric Manager)
| AMD | Data Center Group| 2022
[Public]
RUNTIME MEMORY ALLOCATION/POOLING
FABRIC ATTACHED MEMORY
 Memory Allocation Layer – communicates
<new memory allocation per Host> based
on the system/apps needs
 Fabric Manager – adjusts the fabric settings
and communicates new memory allocations
to the Host SW
 Host SW - Invokes Hot Add/Hot Removal
method to increase/ reduce (or offline) an
amount of memory allocated to the Host
 In some instances, Host SW can directly invoke
SP to adjust the memory size allocated to the
Host
 On–die Security Processor (Root of Trust) is
involved in securing an exclusive access to
the memory range
| AMD | Data Center Group| 2022
[Public]
SUMMARY
Composable Disaggregated Memory is the key approach to address
the cost and underutilization of the System Memory
Further investment in the Runtime Management of the Composable &
Multi-Type memory structures is required to maximize the system level
performance across multiple use-cases
Application Transparency is another goal of efficient Runtime
Management by abstracting away an underlying fabric/memory
infrastructure
AMD - RUNTIME MANAGEMENT OF THE DECOMPOSABLE MEMORY

More Related Content

Similar to AMD - RUNTIME MANAGEMENT OF THE DECOMPOSABLE MEMORY

CtrlS: Cloud Solutions for Retail & eCommerce
CtrlS: Cloud Solutions for Retail & eCommerceCtrlS: Cloud Solutions for Retail & eCommerce
CtrlS: Cloud Solutions for Retail & eCommerce
eTailing India
 
Realizing Exabyte-scale PM Centric Architectures and Memory Fabrics
Realizing Exabyte-scale PM Centric Architectures and Memory FabricsRealizing Exabyte-scale PM Centric Architectures and Memory Fabrics
Realizing Exabyte-scale PM Centric Architectures and Memory Fabrics
inside-BigData.com
 
Dsmp Whitepaper V5
Dsmp Whitepaper V5Dsmp Whitepaper V5
Dsmp Whitepaper V5
gelfstrom
 

Similar to AMD - RUNTIME MANAGEMENT OF THE DECOMPOSABLE MEMORY (20)

Provisioning server high_availability_considerations2
Provisioning server high_availability_considerations2Provisioning server high_availability_considerations2
Provisioning server high_availability_considerations2
 
VMworld 2013: Performance and Capacity Management of DRS Clusters
VMworld 2013: Performance and Capacity Management of DRS Clusters VMworld 2013: Performance and Capacity Management of DRS Clusters
VMworld 2013: Performance and Capacity Management of DRS Clusters
 
IBM MQ Disaster Recovery
IBM MQ Disaster RecoveryIBM MQ Disaster Recovery
IBM MQ Disaster Recovery
 
Removing Storage Related Barriers to Server and Desktop Virtualization
Removing Storage Related Barriers to Server and Desktop VirtualizationRemoving Storage Related Barriers to Server and Desktop Virtualization
Removing Storage Related Barriers to Server and Desktop Virtualization
 
Virtualization for Emerging Memory Devices
Virtualization for Emerging Memory DevicesVirtualization for Emerging Memory Devices
Virtualization for Emerging Memory Devices
 
Del 1
Del 1Del 1
Del 1
 
Video Server
Video ServerVideo Server
Video Server
 
Challenges in Managing IT Infrastructure
Challenges in Managing IT InfrastructureChallenges in Managing IT Infrastructure
Challenges in Managing IT Infrastructure
 
cloud_and_virtualization_concepts.pdf
cloud_and_virtualization_concepts.pdfcloud_and_virtualization_concepts.pdf
cloud_and_virtualization_concepts.pdf
 
SQL PASS Taiwan 七月份聚會-1
SQL PASS Taiwan 七月份聚會-1SQL PASS Taiwan 七月份聚會-1
SQL PASS Taiwan 七月份聚會-1
 
Performance management in the virtual data center
Performance management in the virtual data centerPerformance management in the virtual data center
Performance management in the virtual data center
 
A Study Of Disaggregated Memory Management Techniques With Hypervisor Based T...
A Study Of Disaggregated Memory Management Techniques With Hypervisor Based T...A Study Of Disaggregated Memory Management Techniques With Hypervisor Based T...
A Study Of Disaggregated Memory Management Techniques With Hypervisor Based T...
 
Storage Considerations for VDI - Scalar presentation at Toronto VMUG 2014
Storage Considerations for VDI - Scalar presentation at Toronto VMUG 2014Storage Considerations for VDI - Scalar presentation at Toronto VMUG 2014
Storage Considerations for VDI - Scalar presentation at Toronto VMUG 2014
 
CtrlS: Cloud Solutions for Retail & eCommerce
CtrlS: Cloud Solutions for Retail & eCommerceCtrlS: Cloud Solutions for Retail & eCommerce
CtrlS: Cloud Solutions for Retail & eCommerce
 
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
IMC Summit 2016 Keynote - Arthur Sainio - NVDIMM: Changes are Here So What’s ...
 
Realizing Exabyte-scale PM Centric Architectures and Memory Fabrics
Realizing Exabyte-scale PM Centric Architectures and Memory FabricsRealizing Exabyte-scale PM Centric Architectures and Memory Fabrics
Realizing Exabyte-scale PM Centric Architectures and Memory Fabrics
 
The Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux KernelThe Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux Kernel
 
Dsmp Whitepaper V5
Dsmp Whitepaper V5Dsmp Whitepaper V5
Dsmp Whitepaper V5
 
DATASHEET▶ Enterprise Cloud Backup & Recovery with Symantec NetBackup
DATASHEET▶ Enterprise Cloud Backup & Recovery with Symantec NetBackupDATASHEET▶ Enterprise Cloud Backup & Recovery with Symantec NetBackup
DATASHEET▶ Enterprise Cloud Backup & Recovery with Symantec NetBackup
 
Avoid the SAN Trap
Avoid the SAN TrapAvoid the SAN Trap
Avoid the SAN Trap
 

More from Memory Fabric Forum

More from Memory Fabric Forum (20)

H3 Platform CXL Solution_Memory Fabric Forum.pptx
H3 Platform CXL Solution_Memory Fabric Forum.pptxH3 Platform CXL Solution_Memory Fabric Forum.pptx
H3 Platform CXL Solution_Memory Fabric Forum.pptx
 
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
Q1 Memory Fabric Forum: ZeroPoint. Remove the waste. Release the power.
 
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IPQ1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
Q1 Memory Fabric Forum: Building Fast and Secure Chips with CXL IP
 
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptxQ1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
Q1 Memory Fabric Forum: Using CXL with AI Applications - Steve Scargall.pptx
 
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and DevicesQ1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
Q1 Memory Fabric Forum: Memory expansion with CXL-Ready Systems and Devices
 
Q1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: About MindShare TrainingQ1 Memory Fabric Forum: About MindShare Training
Q1 Memory Fabric Forum: About MindShare Training
 
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL-Related Activities within OCPQ1 Memory Fabric Forum: CXL-Related Activities within OCP
Q1 Memory Fabric Forum: CXL-Related Activities within OCP
 
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: CXL Controller by Montage TechnologyQ1 Memory Fabric Forum: CXL Controller by Montage Technology
Q1 Memory Fabric Forum: CXL Controller by Montage Technology
 
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin LabsQ1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
Q1 Memory Fabric Forum: Teledyne LeCroy | Austin Labs
 
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
Q1 Memory Fabric Forum: Breaking Through the Memory WallQ1 Memory Fabric Forum: Breaking Through the Memory Wall
Q1 Memory Fabric Forum: Breaking Through the Memory Wall
 
Q1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: SMART CXL Product LineupQ1 Memory Fabric Forum: SMART CXL Product Lineup
Q1 Memory Fabric Forum: SMART CXL Product Lineup
 
Q1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: CXL Form Factor PrimerQ1 Memory Fabric Forum: CXL Form Factor Primer
Q1 Memory Fabric Forum: CXL Form Factor Primer
 
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
Q1 Memory Fabric Forum: Memory Fabric in a Composable SystemQ1 Memory Fabric Forum: Memory Fabric in a Composable System
Q1 Memory Fabric Forum: Memory Fabric in a Composable System
 
Q1 Memory Fabric Forum: Big Memory Computing for AI
Q1 Memory Fabric Forum: Big Memory Computing for AIQ1 Memory Fabric Forum: Big Memory Computing for AI
Q1 Memory Fabric Forum: Big Memory Computing for AI
 
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXLQ1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
Q1 Memory Fabric Forum: Memory Processor Interface 2023, Focus on CXL
 
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory ModulesQ1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
Q1 Memory Fabric Forum: Micron CXL-Compatible Memory Modules
 
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 UpdateQ1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
Q1 Memory Fabric Forum: Compute Express Link (CXL) 3.1 Update
 
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
Q1 Memory Fabric Forum: Advantages of Optical CXL​ for Disaggregated Compute ...
 
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
Q1 Memory Fabric Forum: Intel Enabling Compute Express Link (CXL)
 
Q1 Memory Fabric Forum: XConn CXL Switches for AI
Q1 Memory Fabric Forum: XConn CXL Switches for AIQ1 Memory Fabric Forum: XConn CXL Switches for AI
Q1 Memory Fabric Forum: XConn CXL Switches for AI
 

Recently uploaded

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
UK Journal
 

Recently uploaded (20)

WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 

AMD - RUNTIME MANAGEMENT OF THE DECOMPOSABLE MEMORY

  • 1.
  • 2. | AMD | Data Center Group| 2022 [Public] AGENDA ◢ Paradigm Shift and Memory Composability Progression ◢ Runtime Memory Management ◢ Tiered Memory ◢ NUMA domains and Page Migration ◢ Runtime Memory Pooling
  • 3. | AMD | Data Center Group| 2022 [Public] PARADIGM SHIFT ◢ Scalable, high-speed CXL™ Interconnect and PIM (Processing in Memory) contribute to the paradigm shift in memory intensive computations ◢ Efficiency Boost of the next generation data center ◢ Management of the Host/Accelerator subsystems combined with the terabytes of the Fabric Attached Memory ◢ Reduced complexity of the SW stack combined with direct access to multiple memory technologies
  • 4. | AMD | Data Center Group| 2022 [Public] MEMORY COMPOSABILITY PROGRESSION Host R P Buffer Host R P End Point View Mem Direct Attach Memory Scale-Out Mem Pooling & Disaggregation • Addresses the cost and underutilization of the memory • Multi-domain Pooled Memory - memory in the pool is allocated/ released when required • Workloads/ applications benefiting from memory capacity • Design optimization for {BW/$, Memory Capacity/$, BW/core}
  • 5. | AMD | Data Center Group| 2022 [Public] RUNTIME MEMORY MANAGEMENT
  • 6. | AMD | Data Center Group| 2022 [Public] TIERED MEMORY NUMA Domains Page Migration
  • 7. | AMD | Data Center Group| 2022 [Public] TIERED MEMORY NUMA DOMAINS • Exposed to the HV, Guest OS, Apps • OS-assisted optimization of the memory subsystem • Base on ACPI objects - SRAT/SLIT/HMAT
  • 8. | AMD | Data Center Group| 2022 [Public] TIERED MEMORY PAGE MIGRATION CCD CCD CCD CCD IOD CCD CCD CCD CCD IOD Near Mem Far Mem NUMA domains PROC CXL mem Far mem CXL CXL Far Mem Near Mem Memory Expansion PROC Far Mem Near Mem Mem as a Cache CCD CCD CCD CCD IOD CCD CCD CCD CCD IOD Near Mem Far Mem CXL mem Far Mem CXL CXL Near Mem NUMA domains MISS Shorter latency Longer latency Near Mem ‒ Active page migration between Far and Near memories ‒ HV/Guest migrates hot pages into Near Mem and retire cold pages into Far Mem ‒ Focused DMA to transfer required datasets from the Far to Near Mem SW Assisted Page Migration ‒ HW managed Hot Dataset ‒ Near Mem Miss redirected to the Far Mem ‒ App/ HV unawareness DRAM as a cache optimization
  • 9. | AMD | Data Center Group| 2022 [Public] TIERED MEMORY SW ASSISTED PAGE MIGRATION Combined HW /SW tracking of the Memory Page Activity/ “hotness” Detecting Page(s) candidates for migration Requesting HV/Guest permission to migrate HV/Guest API to Security Processor to Migrate the Page(s) Migration – stalling accessed to specific pages/ copying the data Page “hotness” –combined action of the HW and SW tracking HV/Guest authorization of the migration Security Processor as a root of trust for performing the migration
  • 10. | AMD | Data Center Group| 2022 [Public] RUNTIME MEMORY ALLOCATION/POOLING FABRIC ATTACHED MEMORY Host Host Tier2 Mem Multi-Headed CXL controller  Multiple structures serve for fabric level memory pooling  Combination of the private (dedicated to specific host) and shareable memory ranges  Protection of the memory regions from unauthorized guests and hypervisor  Allocation/Pooling of the memory ranges between Hosts is regulated by the fabric aware SW layer (i.e., Fabric Manager)
  • 11. | AMD | Data Center Group| 2022 [Public] RUNTIME MEMORY ALLOCATION/POOLING FABRIC ATTACHED MEMORY  Memory Allocation Layer – communicates <new memory allocation per Host> based on the system/apps needs  Fabric Manager – adjusts the fabric settings and communicates new memory allocations to the Host SW  Host SW - Invokes Hot Add/Hot Removal method to increase/ reduce (or offline) an amount of memory allocated to the Host  In some instances, Host SW can directly invoke SP to adjust the memory size allocated to the Host  On–die Security Processor (Root of Trust) is involved in securing an exclusive access to the memory range
  • 12. | AMD | Data Center Group| 2022 [Public] SUMMARY Composable Disaggregated Memory is the key approach to address the cost and underutilization of the System Memory Further investment in the Runtime Management of the Composable & Multi-Type memory structures is required to maximize the system level performance across multiple use-cases Application Transparency is another goal of efficient Runtime Management by abstracting away an underlying fabric/memory infrastructure