SlideShare a Scribd company logo
1 of 43
CRITICAL MEMORY
PERFORMANCE METRICS FOR
DDR4 SYSTEMS
Barbara Aichinger
Vice President New Business Development
FuturePlus Systems Corporation
Outline
• How can we monitor the very fast low voltage DDR4
memory without effecting the system?
• Traditional Performance Metrics
– Bandwidth, Latency and Power Management
• New Performance Metrics
– Bus Mode Analysis
– Page Hit Analysis
– Multiple Open Banks Analysis
– Bank Utilization Analysis
– Row Hammer (excessive Activate) Detection
• Measured DDR4 Performance in a real target
How to Monitor the DDR4 Memory
• Use a slot interposer to ‘listen’ to the
traffic between the DIMM and the
Memory Controller
– A small amount of current is ‘tapped’ off the bus
– Only the Address, Command and Control bus needs
to be monitored
– Same probing method can be used for SODIMM and
‘memory down’ using a BGA adapter
DDR4 DIMM Interposer
Monitoring the DDR4 Memory
• Highly attenuated signals are then
amplified and the bus is ‘replicated’ inside
the equipment
The system boots and runs never
knowing the equipment is present
Performance Metrics
• Billions of clock cycles every second
• Old methods of measurement captures
small amount of time in large trace buffers
• New method uses sophisticated event
counters to:
– Never miss a clock cycle
– Count Performance Metrics for long periods
of time REAL TIME
Work Smarter not Harder
• For Performance metrics the DDR Detective® uses
counters instead the traditional trace memory
– To capture a second of DDR4 traffic would take
4.5Gbytes of logic analyzer/protocol analyzer trace
depth $$$$!
• 1 hour = 270 Gbytes of trace depth and then time to sift
through it and post process!
– By using large counters and counting events and the
time between events we can achieve hours and days
worth of metrics with no trace buffer memory and
with no time consuming post processing
4 Memory
Channels
each
channel is 2
slots
ASUS X99
DDR4
Motherboard
Traditional Measurements
• Bandwidth
– Command Bus Utilization
– Data Bus Utilization
• Power Management
• Latency
Bandwidth
• Overhead
– Any use of the bus other than a Read or a Write
– Command Bus Utilization
• Data Bus
– Utilization: the % of the time that Read or Write
Data is being transferred
– Bandwidth: the amount of data transferred per
second
Command Bus Utilization
2400MT/s
Command Bus Utilization
ACT
4% PRE
4%
RD-BL8
7%
WR-BL8
4%
PREA
0%
REF
0%ZQCS
0%
DES
81%
Data Bus Utilization
2400MT/s
DDR4 Bandwidth
Power Managementwhile running Google StressApp
Power Management AnalysisUbuntu Boot
Embedded Video
Power Management
• ~50M servers Servers World Wide
• Each Server averages 16-24 DIMMs
– 800M to 1.2B DIMMs
• Even small power savings per DIMM can add up
Every time Facebook’s data center engineers figure out a way to reduce server
consumption by a single watt, the improvement, at Facebook’s scale, has the
potential to add millions of dollars to the company’s bottom line.
Yevgeny Sverdlik
Editor in Chief
Data Center Knowledge
Latency
• Several Jedec Paramters apply:
– RD to WR same rank tSR_RTW
– RD to PRE/PREA same Rank tRTP
– WR to PRE(SB) or PREA (SR) tWR
– Read to Read different Rank tDR_RTR
– Read to Write different Rank DR_RTW
– Write to Read different Rank tDR_WTR
– Write to Write different Rank tDR_WTW
Measuring Latency
Latency Measurements
measurement made at 1867
V# Parameter Description Spec Measured
V2 tSR_RTW RD to WR same Rank 8 10
V11 tRTP RD to PRE same Rank 8 8
V12 tWR WR to PRE SB or PREA
SR
31 31
V53 tDR_RTR RD to RD diff Rank 5 6
V57 tDR_WTR WR to RD diff Rank 3 6
V59 tDR_WTW WR to WR diff Rank 5 8
Intervening Commands
Latency Measurements
V# Parameter Description Spec Measured
V1 tCCD_L RD to RD Same Bank Group 5 6
V3 tCCD_L WR to WR Same Bank
Group
5 6
V4 tCCD_S RD to RD diff Bank Group 4 4
V5 tCCD_S WR to WR diff Bank Group 4 4
V6 tRRD_L ACT to ACT Same Bank
Group
5 5
V7 tWTR_L ACT to ACT diff Bank Group 4 4
V9 tWTR_L WR to RD Same Bank Group 22 23
V10 tWTR_S WR to RD Diff Bank Group 17 19
Latency
• Good designs operate on the edge of the
spec
• Architectural tradeoffs will occur
• Do I need margin?
– Design for the worst case and buy quality
parts
New Performance Metrics
• Page Hit Analysis
– Read Hit: Page was Open
– Read Miss : Page was not Open, Transaction was preceded by an ACT
– Write Hit: Page was Open
– Write Miss: Page was not Open, Transaction was preceded by an ACT
– Unused: Page was opened and closed and never accessed
• Multiple Open Banks
– Open Banks make for faster access IF your going to that bank on the
next access…performance hit if your not
– Power hit when banks are open
• Bank Group Analysis
– New for DDR4: Back to back access to same bank is a performance hit
– Faster to have back to back accesses to different bank groups
Ubuntu Boot @ 2133
Running Google StressApp @2133
Multiple Open Banks
How many are open at any one time
0
4
8
Bank Group Access Analysis
• tCCD_L
– Takes longer for back to back RD/WR accesses
to the same bank group
• tCCD_S
– Can reduce latency by going to different bank
groups
Bank Group Access Analysis
Relative to the previous transaction how many times did the following
transaction go to the same/different bank group
More Performance Metrics!
• Bank Utilization
– What happens during a chip kill or page retirement
scenario?
– How does the traffic reallocate?
– What are the performance implications?
• Do I have system hot spots?
– Row Hammer (excessive Activates)
• Fast Boot
– Why does the system take so long to boot?
Bank Utilization
Percentage of total cycles the banks are open (running Google Stress App)
Bank Utilization
Boot Analysis
Row Hammer
Excessive ACTIVATE commands to a single Row
• An identified failure mechanism in DDR3
memory
• The result of charge leakage
• Current work around is to increase the refresh
rate
– Burns power, lowers performance
– Lowers the statistical probability of an error
Critical Applications cannot
tolerate Row Hammer failures
• Memory Vendors are specifying how many
ACTIVATES per Retention cycle to a single row the
memory can tolerate
• However there is no way the memory controller or
the memory can count this
• Critical Software applications should be tested to
ensure that the Row Hammer Threshold is not
reached
One method to detect Row Hammer
Detecting Excessive ACTIVATE Commands
Critical DDR4 Performance Metrics
• Memory Controller/System Architecture
– Can this insight lead to better designs?
– Benchmark Servers Memory Performance
• Workload Analysis
– Should the Memory Controller settings be based on criteria set by the
workload?
– Can compilers be made better?
– Can critical applications be written to avoid Row Hammer?
• Do we all need a DDR5?
– Work Smarter not Harder and understand what we have
Summary
• Power Management, Bandwidth, Latency
• NEW Metrics:
– Page Hit Analysis
– Multiple Open Banks
– Bank Group Analysis
– Bank Utilization
– Boot Analysis
– Row Hammer Detection
– New Measurements give insight into new designs
and better architectures
Contact Information
Barbara Aichinger
Vice President New Business Development
FuturePlus Systems
Barb.Aichinger@FuturePlus.com
www.FuturePlus.com
Check out our new website dedicated to
DDR Memory! www.DDRDetective.com

More Related Content

What's hot

Doc 2011101412020074
Doc 2011101412020074Doc 2011101412020074
Doc 2011101412020074
Rhythm Sun
 
High Endurance Last Level Hybrid Cache Design
High Endurance Last Level Hybrid Cache DesignHigh Endurance Last Level Hybrid Cache Design
High Endurance Last Level Hybrid Cache Design
keerthi thallam
 

What's hot (20)

Symantec Public Meetup
Symantec Public MeetupSymantec Public Meetup
Symantec Public Meetup
 
Intelligent ram
Intelligent ramIntelligent ram
Intelligent ram
 
Intel Optane Data Center Persistent Memory
Intel Optane Data Center Persistent MemoryIntel Optane Data Center Persistent Memory
Intel Optane Data Center Persistent Memory
 
Doc 2011101412020074
Doc 2011101412020074Doc 2011101412020074
Doc 2011101412020074
 
I ram
I ramI ram
I ram
 
03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides
 
Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...
Ibm spectrum scale fundamentals workshop for americas part 7 spectrumscale el...
 
Intelligent RAM
Intelligent RAMIntelligent RAM
Intelligent RAM
 
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning AcceleratorDeep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
 
8665814
86658148665814
8665814
 
Mainframe Optimization in 2017
Mainframe Optimization in 2017Mainframe Optimization in 2017
Mainframe Optimization in 2017
 
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference ChipSpring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
 
Lakefield: Hybrid Cores in 3D Package
Lakefield: Hybrid Cores in 3D PackageLakefield: Hybrid Cores in 3D Package
Lakefield: Hybrid Cores in 3D Package
 
05. performance-concepts-26-slides
05. performance-concepts-26-slides05. performance-concepts-26-slides
05. performance-concepts-26-slides
 
Mainframe Optimization in 2017
Mainframe Optimization in 2017Mainframe Optimization in 2017
Mainframe Optimization in 2017
 
Mis lesson5 PPT
Mis lesson5 PPTMis lesson5 PPT
Mis lesson5 PPT
 
Factors influencing the success of computer architecture
Factors influencing the success of computer architectureFactors influencing the success of computer architecture
Factors influencing the success of computer architecture
 
Mainframe Education Webcast: Innovations in Your Mainframe Sort
Mainframe Education Webcast: Innovations in Your Mainframe SortMainframe Education Webcast: Innovations in Your Mainframe Sort
Mainframe Education Webcast: Innovations in Your Mainframe Sort
 
High Endurance Last Level Hybrid Cache Design
High Endurance Last Level Hybrid Cache DesignHigh Endurance Last Level Hybrid Cache Design
High Endurance Last Level Hybrid Cache Design
 
Resource Monitoring and Management II
Resource Monitoring and Management IIResource Monitoring and Management II
Resource Monitoring and Management II
 

Viewers also liked

RAM and ROM Memory Overview
RAM and ROM Memory OverviewRAM and ROM Memory Overview
RAM and ROM Memory Overview
Pankaj Khodifad
 

Viewers also liked (9)

A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
A 64Gb/s PAM-4 Transmitter with 4-Tap FFE and 2.26pJ/b Energy Efficiency in 2...
 
Introducing the next generation memory DDR4
Introducing the next generation memory DDR4Introducing the next generation memory DDR4
Introducing the next generation memory DDR4
 
Improve your PC performance with DDR4 Memory
Improve your PC performance with DDR4 MemoryImprove your PC performance with DDR4 Memory
Improve your PC performance with DDR4 Memory
 
Double data rate (ddr)
Double data rate (ddr)Double data rate (ddr)
Double data rate (ddr)
 
Understanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM TechnologyUnderstanding and Testing DDR4 R-DIMM and LR-DIMM Technology
Understanding and Testing DDR4 R-DIMM and LR-DIMM Technology
 
Eliminate Pitfalls of DDR Memory Testing
Eliminate Pitfalls of DDR Memory TestingEliminate Pitfalls of DDR Memory Testing
Eliminate Pitfalls of DDR Memory Testing
 
Innovation and Architecture
Innovation and ArchitectureInnovation and Architecture
Innovation and Architecture
 
RAM and ROM Memory Overview
RAM and ROM Memory OverviewRAM and ROM Memory Overview
RAM and ROM Memory Overview
 
DDR3
DDR3DDR3
DDR3
 

Similar to Critical Performance Metrics for DDR4 based Systems

Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performance
Keshav Murthy
 
Scalability designprinciples-v2-130718023602-phpapp02 (1)
Scalability designprinciples-v2-130718023602-phpapp02 (1)Scalability designprinciples-v2-130718023602-phpapp02 (1)
Scalability designprinciples-v2-130718023602-phpapp02 (1)
Minal Patil
 
COMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptxCOMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptx
ssusere16bd9
 

Similar to Critical Performance Metrics for DDR4 based Systems (20)

Performance Measurements: Improving Latency and Bandwidth of your DDR4 System
Performance Measurements:  Improving Latency and Bandwidth of your DDR4 SystemPerformance Measurements:  Improving Latency and Bandwidth of your DDR4 System
Performance Measurements: Improving Latency and Bandwidth of your DDR4 System
 
Right-Sizing your SQL Server Virtual Machine
Right-Sizing your SQL Server Virtual MachineRight-Sizing your SQL Server Virtual Machine
Right-Sizing your SQL Server Virtual Machine
 
Application Performance Management
Application Performance ManagementApplication Performance Management
Application Performance Management
 
05. performance-concepts
05. performance-concepts05. performance-concepts
05. performance-concepts
 
Building data intensive applications
Building data intensive applicationsBuilding data intensive applications
Building data intensive applications
 
Play With Streams
Play With StreamsPlay With Streams
Play With Streams
 
Performance tuning in sql server
Performance tuning in sql serverPerformance tuning in sql server
Performance tuning in sql server
 
Informix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performanceInformix & IWA : Operational analytics performance
Informix & IWA : Operational analytics performance
 
Scalability Design Principles - Internal Session
Scalability Design Principles - Internal SessionScalability Design Principles - Internal Session
Scalability Design Principles - Internal Session
 
Scalability designprinciples-v2-130718023602-phpapp02 (1)
Scalability designprinciples-v2-130718023602-phpapp02 (1)Scalability designprinciples-v2-130718023602-phpapp02 (1)
Scalability designprinciples-v2-130718023602-phpapp02 (1)
 
Operational-Analytics
Operational-AnalyticsOperational-Analytics
Operational-Analytics
 
EnterpriseDB's Best Practices for Postgres DBAs
EnterpriseDB's Best Practices for Postgres DBAsEnterpriseDB's Best Practices for Postgres DBAs
EnterpriseDB's Best Practices for Postgres DBAs
 
IBM Maximo Performance Tuning
IBM Maximo Performance TuningIBM Maximo Performance Tuning
IBM Maximo Performance Tuning
 
Capacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB ClusterCapacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB Cluster
 
Capacityplanning
Capacityplanning Capacityplanning
Capacityplanning
 
071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen
 
Performance tuning Grails applications SpringOne 2GX 2014
Performance tuning Grails applications SpringOne 2GX 2014Performance tuning Grails applications SpringOne 2GX 2014
Performance tuning Grails applications SpringOne 2GX 2014
 
SVC / Storwize: cache partition analysis (BVQ howto)
SVC / Storwize: cache partition analysis  (BVQ howto)   SVC / Storwize: cache partition analysis  (BVQ howto)
SVC / Storwize: cache partition analysis (BVQ howto)
 
COMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptxCOMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptx
 
ICONUK 2016: Back From the Dead: How Bad Code Kills a Good Server
ICONUK 2016: Back From the Dead: How Bad Code Kills a Good ServerICONUK 2016: Back From the Dead: How Bad Code Kills a Good Server
ICONUK 2016: Back From the Dead: How Bad Code Kills a Good Server
 

Recently uploaded

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
saurabvyas476
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
yulianti213969
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
pwgnohujw
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
23050636
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
zifhagzkk
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
mikehavy0
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
LuisMiguelPaz5
 

Recently uploaded (20)

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptx
 
sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444sourabh vyas1222222222222222222244444444
sourabh vyas1222222222222222222244444444
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSDBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
 
Pentesting_AI and security challenges of AI
Pentesting_AI and security challenges of AIPentesting_AI and security challenges of AI
Pentesting_AI and security challenges of AI
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
 
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 

Critical Performance Metrics for DDR4 based Systems

  • 1. CRITICAL MEMORY PERFORMANCE METRICS FOR DDR4 SYSTEMS Barbara Aichinger Vice President New Business Development FuturePlus Systems Corporation
  • 2. Outline • How can we monitor the very fast low voltage DDR4 memory without effecting the system? • Traditional Performance Metrics – Bandwidth, Latency and Power Management • New Performance Metrics – Bus Mode Analysis – Page Hit Analysis – Multiple Open Banks Analysis – Bank Utilization Analysis – Row Hammer (excessive Activate) Detection • Measured DDR4 Performance in a real target
  • 3. How to Monitor the DDR4 Memory • Use a slot interposer to ‘listen’ to the traffic between the DIMM and the Memory Controller – A small amount of current is ‘tapped’ off the bus – Only the Address, Command and Control bus needs to be monitored – Same probing method can be used for SODIMM and ‘memory down’ using a BGA adapter
  • 5. Monitoring the DDR4 Memory • Highly attenuated signals are then amplified and the bus is ‘replicated’ inside the equipment
  • 6. The system boots and runs never knowing the equipment is present
  • 7. Performance Metrics • Billions of clock cycles every second • Old methods of measurement captures small amount of time in large trace buffers • New method uses sophisticated event counters to: – Never miss a clock cycle – Count Performance Metrics for long periods of time REAL TIME
  • 8. Work Smarter not Harder • For Performance metrics the DDR Detective® uses counters instead the traditional trace memory – To capture a second of DDR4 traffic would take 4.5Gbytes of logic analyzer/protocol analyzer trace depth $$$$! • 1 hour = 270 Gbytes of trace depth and then time to sift through it and post process! – By using large counters and counting events and the time between events we can achieve hours and days worth of metrics with no trace buffer memory and with no time consuming post processing
  • 9. 4 Memory Channels each channel is 2 slots ASUS X99 DDR4 Motherboard
  • 10. Traditional Measurements • Bandwidth – Command Bus Utilization – Data Bus Utilization • Power Management • Latency
  • 11. Bandwidth • Overhead – Any use of the bus other than a Read or a Write – Command Bus Utilization • Data Bus – Utilization: the % of the time that Read or Write Data is being transferred – Bandwidth: the amount of data transferred per second
  • 13. Command Bus Utilization ACT 4% PRE 4% RD-BL8 7% WR-BL8 4% PREA 0% REF 0%ZQCS 0% DES 81%
  • 16.
  • 17.
  • 18. Power Managementwhile running Google StressApp
  • 19. Power Management AnalysisUbuntu Boot Embedded Video
  • 20. Power Management • ~50M servers Servers World Wide • Each Server averages 16-24 DIMMs – 800M to 1.2B DIMMs • Even small power savings per DIMM can add up Every time Facebook’s data center engineers figure out a way to reduce server consumption by a single watt, the improvement, at Facebook’s scale, has the potential to add millions of dollars to the company’s bottom line. Yevgeny Sverdlik Editor in Chief Data Center Knowledge
  • 21. Latency • Several Jedec Paramters apply: – RD to WR same rank tSR_RTW – RD to PRE/PREA same Rank tRTP – WR to PRE(SB) or PREA (SR) tWR – Read to Read different Rank tDR_RTR – Read to Write different Rank DR_RTW – Write to Read different Rank tDR_WTR – Write to Write different Rank tDR_WTW
  • 23. Latency Measurements measurement made at 1867 V# Parameter Description Spec Measured V2 tSR_RTW RD to WR same Rank 8 10 V11 tRTP RD to PRE same Rank 8 8 V12 tWR WR to PRE SB or PREA SR 31 31 V53 tDR_RTR RD to RD diff Rank 5 6 V57 tDR_WTR WR to RD diff Rank 3 6 V59 tDR_WTW WR to WR diff Rank 5 8
  • 25. Latency Measurements V# Parameter Description Spec Measured V1 tCCD_L RD to RD Same Bank Group 5 6 V3 tCCD_L WR to WR Same Bank Group 5 6 V4 tCCD_S RD to RD diff Bank Group 4 4 V5 tCCD_S WR to WR diff Bank Group 4 4 V6 tRRD_L ACT to ACT Same Bank Group 5 5 V7 tWTR_L ACT to ACT diff Bank Group 4 4 V9 tWTR_L WR to RD Same Bank Group 22 23 V10 tWTR_S WR to RD Diff Bank Group 17 19
  • 26. Latency • Good designs operate on the edge of the spec • Architectural tradeoffs will occur • Do I need margin? – Design for the worst case and buy quality parts
  • 27. New Performance Metrics • Page Hit Analysis – Read Hit: Page was Open – Read Miss : Page was not Open, Transaction was preceded by an ACT – Write Hit: Page was Open – Write Miss: Page was not Open, Transaction was preceded by an ACT – Unused: Page was opened and closed and never accessed • Multiple Open Banks – Open Banks make for faster access IF your going to that bank on the next access…performance hit if your not – Power hit when banks are open • Bank Group Analysis – New for DDR4: Back to back access to same bank is a performance hit – Faster to have back to back accesses to different bank groups
  • 30. Multiple Open Banks How many are open at any one time 0 4 8
  • 31. Bank Group Access Analysis • tCCD_L – Takes longer for back to back RD/WR accesses to the same bank group • tCCD_S – Can reduce latency by going to different bank groups
  • 32. Bank Group Access Analysis Relative to the previous transaction how many times did the following transaction go to the same/different bank group
  • 33. More Performance Metrics! • Bank Utilization – What happens during a chip kill or page retirement scenario? – How does the traffic reallocate? – What are the performance implications? • Do I have system hot spots? – Row Hammer (excessive Activates) • Fast Boot – Why does the system take so long to boot?
  • 34. Bank Utilization Percentage of total cycles the banks are open (running Google Stress App)
  • 37. Row Hammer Excessive ACTIVATE commands to a single Row • An identified failure mechanism in DDR3 memory • The result of charge leakage • Current work around is to increase the refresh rate – Burns power, lowers performance – Lowers the statistical probability of an error
  • 38. Critical Applications cannot tolerate Row Hammer failures • Memory Vendors are specifying how many ACTIVATES per Retention cycle to a single row the memory can tolerate • However there is no way the memory controller or the memory can count this • Critical Software applications should be tested to ensure that the Row Hammer Threshold is not reached
  • 39. One method to detect Row Hammer
  • 41. Critical DDR4 Performance Metrics • Memory Controller/System Architecture – Can this insight lead to better designs? – Benchmark Servers Memory Performance • Workload Analysis – Should the Memory Controller settings be based on criteria set by the workload? – Can compilers be made better? – Can critical applications be written to avoid Row Hammer? • Do we all need a DDR5? – Work Smarter not Harder and understand what we have
  • 42. Summary • Power Management, Bandwidth, Latency • NEW Metrics: – Page Hit Analysis – Multiple Open Banks – Bank Group Analysis – Bank Utilization – Boot Analysis – Row Hammer Detection – New Measurements give insight into new designs and better architectures
  • 43. Contact Information Barbara Aichinger Vice President New Business Development FuturePlus Systems Barb.Aichinger@FuturePlus.com www.FuturePlus.com Check out our new website dedicated to DDR Memory! www.DDRDetective.com