SlideShare a Scribd company logo
1 of 25
Download to read offline
DDR Compliance Testing
Its time has come!Its time has come!
Server Forum 2014 Copyright © 2014 FuturePlus Systems
Barbara P. Aichinger
FuturePlus Systems Corporation
DDR Compliance TestingDDR Compliance Testing
• If not then why now?
– Continued growth and reliance on
DDR memory
– Critical applications increasing– Critical applications increasing
– Errors don’t scale well
• Large Data Centers replacing DIMMs
every hour
Several Publications point toSeveral Publications point to
error rate much larger thanerror rate much larger than
expectedexpected• DRAM Errors in the Wild: A Large-Scale Field Study Sigmetrics 2009
• Cosmic Rays Don’t Strike Twice: Understanding the Nature of DRAM
Errors and the Implications for System Design by Andy Hwang, Ioan
Stefanovici and Bianca Schroeder)
• A Field Study of DRAM Errors 2012 by Sridharan, Liberty, RAS
Architecture AMD
• Reliability, Serviceability and Availability: Intel XEON Processor E7
Family April 2014Family April 2014
• Characterizing Application Memory Error Vulnerability to Optimize
Datacenter Cost via Hetrogeneous-Reliability Memory June 2014
Microsoft
• Avoiding server downtime from hardware errors in system memory
with HP Memory Quarantine HP Technology Brief January 2012
• Flipping bits in Memory Without Accessing Them: An Experimental
Study of DRAM Disturbance Errors by Yoongu Ki, et al Carnegie
Mellon University and Intel Labs July 2014
Open Compute ProjectOpen Compute Project
• Ready Certification
• Compliant Certification
– Will point at other standards…if they
exist
• Certification labs
– University of Texas San Antonio
– ITRI (Taiwan)
• You can Join! www.OCP.com
National Institute of StandardsNational Institute of Standards
and Technologyand Technology
• Information Technology Laboratories
– Cloud Computing Program
“…It is considered critical that government and industry begin adoption of this
technology in response to difficult economic constraints. …..Cloud computing istechnology in response to difficult economic constraints. …..Cloud computing is
currently being used; however, security, interoperability, and portability are cited
as major barriers to broader adoption….The long term goal is to provide thought
leadership and guidance around the cloud computing paradigm to catalyze its use
within industry and government. NIST aims to shorten the adoption cycle….
NIST aims to foster cloud computing systems and practices that support
interoperability, portability, and security requirements that are appropriate
and achievable for important usage scenarios.”
What would DDR4 ComplianceWhat would DDR4 Compliance
Testing Look Like?Testing Look Like?
• DRAM specific
– DIMM/SODIMM/LRDIMM
• Memory Controller specific
• The path between the two
• A Compliance Test Specification• A Compliance Test Specification
– T&M Vendors can produce MOI (Method of
Implementation)
– End users can then request this testing
• Compliance Lab Testing
DDR4 Compliance TestingDDR4 Compliance Testing
• Memory Controller Specific
– RX/TX eye specification
• Bit Error Rate
– Protocol Violations– Protocol Violations
– Timing Violations
– Performance Specifications
Memory Controller RX/TXMemory Controller RX/TX
• Compliance Masks
• Measurement can be made with an
interposer
– Qualitative -> Quantitative
Memory ControllerMemory Controller
Protocol ViolationsProtocol Violations
• The DDR4 JEDEC spec contains
rules on event ordering
– Examples
• Do not ACTIVATE a bank that is already• Do not ACTIVATE a bank that is already
open
• Do not PRECHARGE a bank that is
already closed
• Do not RD/WR a non open page
Memory ControllerMemory Controller
Timing ViolationsTiming Violations
• Clock edge boundary
– Things can not be too close together
or too far apart
– Examples– Examples
• tREFI Average refresh interval
• tRC ACT ot ACT or REF
• tMOD MRS to PDE
• tCCD_L RD to RD to Same Bank Group
65 violations identified with65 violations identified with
over 1000+ simultaneousover 1000+ simultaneous
checkschecks
Protocol and TimingProtocol and Timing
ComplianceCompliance
‘in the wild’‘in the wild’
JEDEC Specification ViolationJEDEC Specification Violation
Memory ControllerMemory Controller
Timing ViolationsTiming Violations
• Timing that occurs between clock
edges
– Setup/Hold– Setup/Hold
– Eye Mask
– Jitter
Several Scope Vendors haveSeveral Scope Vendors have
DDR4 Compliance ProductsDDR4 Compliance Products
Performance MetricsPerformance Metrics
• BER testing at particular transfer
rates
– 1E-16
• Which power management• Which power management
features are implemented
– Is the clock stopped in Self Refresh?
– Is Max Power Down implemented?
DDR4 Compliance TestingDDR4 Compliance Testing
• DRAM Specific
– DIMM/SODIMM/LRDIMM
– RX/TX eye specification
• Bit Error Rate
– Data Integrity
• Row Hammer (Excessive Activates)
– Functional/Timing Verification
– Performance Verification
Making the MeasurementMaking the Measurement
Photos Courtesy of Keysight Technologies Photos Courtesy of FuturePlus Systems
Row Hammer: ExcessiveRow Hammer: Excessive
ActivatesActivates
• Yes its real…..easy to reproduce
with software memory tests
• CMU study is the most recent
technical paper on the topictechnical paper on the topic
• Tools now look for it
• Data Centers are seeing it
Row Hammer DetectionRow Hammer Detection
Testing a Server for Row Hammer using
the DDR Detective® with software from
ThirdIO
Row Hammer detection feature of the DDR
Detective® lists the row address when the
number of ACT commands exceeds a
threshold within 64ms. Two programmable
thresholds are available
The electrical path between theThe electrical path between the
DRAM and the MemoryDRAM and the Memory
ControllerController
• End to End Measurements to verify
channel integrity
• Require Slot/Channel loading
configurations based on speed/eyeconfigurations based on speed/eye
size
• Memory channel layout/board files
with simulation results?
– Similar to what is done today with raw
cards
What about the LP DDRWhat about the LP DDR
specifications?specifications?
• Many of the tests are applicable
and can be reused
• Probing becomes more difficult but
can be accomplishedcan be accomplished
• Measuring Power Management
features becomes more critical
LPDDRxLPDDRx Compliance TestCompliance Test
Advantages to having aAdvantages to having a
Compliance Test SpecCompliance Test Spec
• It is going to help our industry
– End users are asking for this!
• Helps eliminate ambiguities in the
specificationspecification
– Ex: tREFI=Average Refresh
time…over how many cycles?
• Makes high quality validation
easier and less costly
My Contact InformationMy Contact Information
Barbara P Aichinger
Vice President
FuturePlus Systems Corporation
Barb.Aichinger@FuturePlus.comBarb.Aichinger@FuturePlus.com
Member JC42 and JC45
www.FuturePlus.com
www.DDRDetective.com

More Related Content

What's hot

Configlets, compliance, RBAC & reports - Network Configuration Manager
Configlets, compliance, RBAC & reports - Network Configuration ManagerConfiglets, compliance, RBAC & reports - Network Configuration Manager
Configlets, compliance, RBAC & reports - Network Configuration ManagerManageEngine, Zoho Corporation
 
Doc 2011101412020074
Doc 2011101412020074Doc 2011101412020074
Doc 2011101412020074Rhythm Sun
 
High performance PHP: Scaling and getting the most out of your infrastructure
High performance PHP: Scaling and getting the most out of your infrastructureHigh performance PHP: Scaling and getting the most out of your infrastructure
High performance PHP: Scaling and getting the most out of your infrastructuremkherlakian
 
Top 10 Tricks and Tools of an Oracle EPM Administrator
Top 10 Tricks and Tools of an Oracle EPM AdministratorTop 10 Tricks and Tools of an Oracle EPM Administrator
Top 10 Tricks and Tools of an Oracle EPM Administratornking821
 
03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slidesMuhammad Ahad
 
KScope14 Oracle EPM Troubleshooting
KScope14 Oracle EPM TroubleshootingKScope14 Oracle EPM Troubleshooting
KScope14 Oracle EPM TroubleshootingAlithya
 
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManager
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManagerGulf Chemicals & Metallurgy manages 1700 interfaces with OpManager
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManagerManageEngine, Zoho Corporation
 
Virtualising Tier 1 Apps
Virtualising Tier 1 AppsVirtualising Tier 1 Apps
Virtualising Tier 1 AppsIwan Rahabok
 
UKOUG APPS 14: Optimizing Performance for Oracle EPM Systems
UKOUG APPS 14: Optimizing Performance for Oracle EPM SystemsUKOUG APPS 14: Optimizing Performance for Oracle EPM Systems
UKOUG APPS 14: Optimizing Performance for Oracle EPM SystemsAlithya
 
Network performance management using OpManager
Network performance management using OpManagerNetwork performance management using OpManager
Network performance management using OpManagerManageEngine
 
Designing Highly-Available Architectures for OTM
Designing Highly-Available Architectures for OTMDesigning Highly-Available Architectures for OTM
Designing Highly-Available Architectures for OTMMavenWire
 
Casino Entertainment Company from USA manages 3000 devices using OpManager
Casino Entertainment Company from USA manages 3000 devices using OpManagerCasino Entertainment Company from USA manages 3000 devices using OpManager
Casino Entertainment Company from USA manages 3000 devices using OpManagerManageEngine, Zoho Corporation
 
EPM Infrastructure: An Investigation
EPM Infrastructure: An InvestigationEPM Infrastructure: An Investigation
EPM Infrastructure: An InvestigationAlithya
 
Storage, Virtual, and Server Profiler Training
Storage, Virtual, and Server Profiler TrainingStorage, Virtual, and Server Profiler Training
Storage, Virtual, and Server Profiler TrainingSolarWinds
 
Using Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and ReliabilityUsing Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and ReliabilityTimothy Williams
 
Ofnet:SDN Controller Testing Framework
Ofnet:SDN Controller Testing FrameworkOfnet:SDN Controller Testing Framework
Ofnet:SDN Controller Testing Frameworkmonjurul88
 
Theory Review II (pps)
Theory Review II (pps)Theory Review II (pps)
Theory Review II (pps)mladenr
 

What's hot (20)

Symantec Public Meetup
Symantec Public MeetupSymantec Public Meetup
Symantec Public Meetup
 
Configlets, compliance, RBAC & reports - Network Configuration Manager
Configlets, compliance, RBAC & reports - Network Configuration ManagerConfiglets, compliance, RBAC & reports - Network Configuration Manager
Configlets, compliance, RBAC & reports - Network Configuration Manager
 
Doc 2011101412020074
Doc 2011101412020074Doc 2011101412020074
Doc 2011101412020074
 
High performance PHP: Scaling and getting the most out of your infrastructure
High performance PHP: Scaling and getting the most out of your infrastructureHigh performance PHP: Scaling and getting the most out of your infrastructure
High performance PHP: Scaling and getting the most out of your infrastructure
 
Top 10 Tricks and Tools of an Oracle EPM Administrator
Top 10 Tricks and Tools of an Oracle EPM AdministratorTop 10 Tricks and Tools of an Oracle EPM Administrator
Top 10 Tricks and Tools of an Oracle EPM Administrator
 
03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides03. non-functional-attributes-introduction-4-slides
03. non-functional-attributes-introduction-4-slides
 
KScope14 Oracle EPM Troubleshooting
KScope14 Oracle EPM TroubleshootingKScope14 Oracle EPM Troubleshooting
KScope14 Oracle EPM Troubleshooting
 
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManager
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManagerGulf Chemicals & Metallurgy manages 1700 interfaces with OpManager
Gulf Chemicals & Metallurgy manages 1700 interfaces with OpManager
 
Virtualising Tier 1 Apps
Virtualising Tier 1 AppsVirtualising Tier 1 Apps
Virtualising Tier 1 Apps
 
UKOUG APPS 14: Optimizing Performance for Oracle EPM Systems
UKOUG APPS 14: Optimizing Performance for Oracle EPM SystemsUKOUG APPS 14: Optimizing Performance for Oracle EPM Systems
UKOUG APPS 14: Optimizing Performance for Oracle EPM Systems
 
Network performance management using OpManager
Network performance management using OpManagerNetwork performance management using OpManager
Network performance management using OpManager
 
Designing Highly-Available Architectures for OTM
Designing Highly-Available Architectures for OTMDesigning Highly-Available Architectures for OTM
Designing Highly-Available Architectures for OTM
 
Casino Entertainment Company from USA manages 3000 devices using OpManager
Casino Entertainment Company from USA manages 3000 devices using OpManagerCasino Entertainment Company from USA manages 3000 devices using OpManager
Casino Entertainment Company from USA manages 3000 devices using OpManager
 
EPM Infrastructure: An Investigation
EPM Infrastructure: An InvestigationEPM Infrastructure: An Investigation
EPM Infrastructure: An Investigation
 
Storage, Virtual, and Server Profiler Training
Storage, Virtual, and Server Profiler TrainingStorage, Virtual, and Server Profiler Training
Storage, Virtual, and Server Profiler Training
 
scada
scadascada
scada
 
Using Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and ReliabilityUsing Six Sigma to Optimize Performance and Reliability
Using Six Sigma to Optimize Performance and Reliability
 
Ofnet:SDN Controller Testing Framework
Ofnet:SDN Controller Testing FrameworkOfnet:SDN Controller Testing Framework
Ofnet:SDN Controller Testing Framework
 
Theory Review II (pps)
Theory Review II (pps)Theory Review II (pps)
Theory Review II (pps)
 
OpManager - Technical overview
OpManager - Technical overviewOpManager - Technical overview
OpManager - Technical overview
 

Similar to DDR Compliance Testing Key Points

Google Study: Could those failures be caused by design flaws
Google Study: Could those failures be caused by design flawsGoogle Study: Could those failures be caused by design flaws
Google Study: Could those failures be caused by design flawsBarbara Aichinger
 
OCP Server Memory Channel Testing DRAFT
OCP Server Memory Channel Testing DRAFTOCP Server Memory Channel Testing DRAFT
OCP Server Memory Channel Testing DRAFTBarbara Aichinger
 
Problem management foundation - Engineering
Problem management foundation - EngineeringProblem management foundation - Engineering
Problem management foundation - EngineeringRonald Bartels
 
Tokyo azure meetup #12 service fabric internals
Tokyo azure meetup #12   service fabric internalsTokyo azure meetup #12   service fabric internals
Tokyo azure meetup #12 service fabric internalsTokyo Azure Meetup
 
New Memory Solutions for Enterprise Computing
New Memory Solutions for Enterprise ComputingNew Memory Solutions for Enterprise Computing
New Memory Solutions for Enterprise ComputingIntel IT Center
 
Key Note Session IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...
Key Note Session  IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...Key Note Session  IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...
Key Note Session IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...Surekha Parekh
 
Adding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance TestAdding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance TestRodolfo Kohn
 
Privacy preserving public auditing for regenerating-code-based
Privacy preserving public auditing for regenerating-code-basedPrivacy preserving public auditing for regenerating-code-based
Privacy preserving public auditing for regenerating-code-basedNagamalleswararao Tadikonda
 
Java/Hybris performance monitoring and optimization
Java/Hybris performance monitoring and optimizationJava/Hybris performance monitoring and optimization
Java/Hybris performance monitoring and optimizationEPAM Lviv
 
CompTIA Cloud+ Objectives
CompTIA Cloud+ Objectives CompTIA Cloud+ Objectives
CompTIA Cloud+ Objectives sombat nirund
 
A Time Traveller's Guide to DB2: Technology Themes for 2014 and Beyond
A Time Traveller's Guide to DB2: Technology Themes for 2014 and BeyondA Time Traveller's Guide to DB2: Technology Themes for 2014 and Beyond
A Time Traveller's Guide to DB2: Technology Themes for 2014 and BeyondLaura Hood
 
Integration strategies best practices- Mulesoft meetup April 2018
Integration strategies   best practices- Mulesoft meetup April 2018Integration strategies   best practices- Mulesoft meetup April 2018
Integration strategies best practices- Mulesoft meetup April 2018Rohan Rasane
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkDellNMS
 
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...Deepak Shankar
 
The Diabolical Developers Guide to Performance Tuning
The Diabolical Developers Guide to Performance TuningThe Diabolical Developers Guide to Performance Tuning
The Diabolical Developers Guide to Performance TuningjClarity
 
DDS, the US Navy, and the Need for Distributed Software
DDS, the US Navy,  and the Need for Distributed SoftwareDDS, the US Navy,  and the Need for Distributed Software
DDS, the US Navy, and the Need for Distributed SoftwareGerardo Pardo-Castellote
 
Scaling apps for the big time
Scaling apps for the big timeScaling apps for the big time
Scaling apps for the big timeproitconsult
 
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITAsk The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITRightScale
 
Big Memory for HPC
Big Memory for HPCBig Memory for HPC
Big Memory for HPCMemVerge
 

Similar to DDR Compliance Testing Key Points (20)

Google Study: Could those failures be caused by design flaws
Google Study: Could those failures be caused by design flawsGoogle Study: Could those failures be caused by design flaws
Google Study: Could those failures be caused by design flaws
 
OCP Server Memory Channel Testing DRAFT
OCP Server Memory Channel Testing DRAFTOCP Server Memory Channel Testing DRAFT
OCP Server Memory Channel Testing DRAFT
 
Problem management foundation - Engineering
Problem management foundation - EngineeringProblem management foundation - Engineering
Problem management foundation - Engineering
 
Tokyo azure meetup #12 service fabric internals
Tokyo azure meetup #12   service fabric internalsTokyo azure meetup #12   service fabric internals
Tokyo azure meetup #12 service fabric internals
 
Univa Presentation at DAC 2020
Univa Presentation at DAC 2020 Univa Presentation at DAC 2020
Univa Presentation at DAC 2020
 
New Memory Solutions for Enterprise Computing
New Memory Solutions for Enterprise ComputingNew Memory Solutions for Enterprise Computing
New Memory Solutions for Enterprise Computing
 
Key Note Session IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...
Key Note Session  IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...Key Note Session  IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...
Key Note Session IDUG DB2 Seminar, 16th April London - Julian Stuhler .Trito...
 
Adding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance TestAdding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance Test
 
Privacy preserving public auditing for regenerating-code-based
Privacy preserving public auditing for regenerating-code-basedPrivacy preserving public auditing for regenerating-code-based
Privacy preserving public auditing for regenerating-code-based
 
Java/Hybris performance monitoring and optimization
Java/Hybris performance monitoring and optimizationJava/Hybris performance monitoring and optimization
Java/Hybris performance monitoring and optimization
 
CompTIA Cloud+ Objectives
CompTIA Cloud+ Objectives CompTIA Cloud+ Objectives
CompTIA Cloud+ Objectives
 
A Time Traveller's Guide to DB2: Technology Themes for 2014 and Beyond
A Time Traveller's Guide to DB2: Technology Themes for 2014 and BeyondA Time Traveller's Guide to DB2: Technology Themes for 2014 and Beyond
A Time Traveller's Guide to DB2: Technology Themes for 2014 and Beyond
 
Integration strategies best practices- Mulesoft meetup April 2018
Integration strategies   best practices- Mulesoft meetup April 2018Integration strategies   best practices- Mulesoft meetup April 2018
Integration strategies best practices- Mulesoft meetup April 2018
 
Visualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your NetworkVisualizing Your Network Health - Know your Network
Visualizing Your Network Health - Know your Network
 
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
Mastering IoT Design: Sense, Process, Connect: Processing: Turning IoT Data i...
 
The Diabolical Developers Guide to Performance Tuning
The Diabolical Developers Guide to Performance TuningThe Diabolical Developers Guide to Performance Tuning
The Diabolical Developers Guide to Performance Tuning
 
DDS, the US Navy, and the Need for Distributed Software
DDS, the US Navy,  and the Need for Distributed SoftwareDDS, the US Navy,  and the Need for Distributed Software
DDS, the US Navy, and the Need for Distributed Software
 
Scaling apps for the big time
Scaling apps for the big timeScaling apps for the big time
Scaling apps for the big time
 
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid ITAsk The Architect: RightScale & AWS Dive Deep into Hybrid IT
Ask The Architect: RightScale & AWS Dive Deep into Hybrid IT
 
Big Memory for HPC
Big Memory for HPCBig Memory for HPC
Big Memory for HPC
 

DDR Compliance Testing Key Points

  • 1. DDR Compliance Testing Its time has come!Its time has come! Server Forum 2014 Copyright © 2014 FuturePlus Systems Barbara P. Aichinger FuturePlus Systems Corporation
  • 2. DDR Compliance TestingDDR Compliance Testing • If not then why now? – Continued growth and reliance on DDR memory – Critical applications increasing– Critical applications increasing – Errors don’t scale well • Large Data Centers replacing DIMMs every hour
  • 3. Several Publications point toSeveral Publications point to error rate much larger thanerror rate much larger than expectedexpected• DRAM Errors in the Wild: A Large-Scale Field Study Sigmetrics 2009 • Cosmic Rays Don’t Strike Twice: Understanding the Nature of DRAM Errors and the Implications for System Design by Andy Hwang, Ioan Stefanovici and Bianca Schroeder) • A Field Study of DRAM Errors 2012 by Sridharan, Liberty, RAS Architecture AMD • Reliability, Serviceability and Availability: Intel XEON Processor E7 Family April 2014Family April 2014 • Characterizing Application Memory Error Vulnerability to Optimize Datacenter Cost via Hetrogeneous-Reliability Memory June 2014 Microsoft • Avoiding server downtime from hardware errors in system memory with HP Memory Quarantine HP Technology Brief January 2012 • Flipping bits in Memory Without Accessing Them: An Experimental Study of DRAM Disturbance Errors by Yoongu Ki, et al Carnegie Mellon University and Intel Labs July 2014
  • 4. Open Compute ProjectOpen Compute Project • Ready Certification • Compliant Certification – Will point at other standards…if they exist • Certification labs – University of Texas San Antonio – ITRI (Taiwan) • You can Join! www.OCP.com
  • 5. National Institute of StandardsNational Institute of Standards and Technologyand Technology • Information Technology Laboratories – Cloud Computing Program “…It is considered critical that government and industry begin adoption of this technology in response to difficult economic constraints. …..Cloud computing istechnology in response to difficult economic constraints. …..Cloud computing is currently being used; however, security, interoperability, and portability are cited as major barriers to broader adoption….The long term goal is to provide thought leadership and guidance around the cloud computing paradigm to catalyze its use within industry and government. NIST aims to shorten the adoption cycle…. NIST aims to foster cloud computing systems and practices that support interoperability, portability, and security requirements that are appropriate and achievable for important usage scenarios.”
  • 6. What would DDR4 ComplianceWhat would DDR4 Compliance Testing Look Like?Testing Look Like? • DRAM specific – DIMM/SODIMM/LRDIMM • Memory Controller specific • The path between the two • A Compliance Test Specification• A Compliance Test Specification – T&M Vendors can produce MOI (Method of Implementation) – End users can then request this testing • Compliance Lab Testing
  • 7. DDR4 Compliance TestingDDR4 Compliance Testing • Memory Controller Specific – RX/TX eye specification • Bit Error Rate – Protocol Violations– Protocol Violations – Timing Violations – Performance Specifications
  • 8. Memory Controller RX/TXMemory Controller RX/TX • Compliance Masks • Measurement can be made with an interposer – Qualitative -> Quantitative
  • 9. Memory ControllerMemory Controller Protocol ViolationsProtocol Violations • The DDR4 JEDEC spec contains rules on event ordering – Examples • Do not ACTIVATE a bank that is already• Do not ACTIVATE a bank that is already open • Do not PRECHARGE a bank that is already closed • Do not RD/WR a non open page
  • 10. Memory ControllerMemory Controller Timing ViolationsTiming Violations • Clock edge boundary – Things can not be too close together or too far apart – Examples– Examples • tREFI Average refresh interval • tRC ACT ot ACT or REF • tMOD MRS to PDE • tCCD_L RD to RD to Same Bank Group
  • 11. 65 violations identified with65 violations identified with over 1000+ simultaneousover 1000+ simultaneous checkschecks
  • 12. Protocol and TimingProtocol and Timing ComplianceCompliance ‘in the wild’‘in the wild’
  • 13. JEDEC Specification ViolationJEDEC Specification Violation
  • 14. Memory ControllerMemory Controller Timing ViolationsTiming Violations • Timing that occurs between clock edges – Setup/Hold– Setup/Hold – Eye Mask – Jitter
  • 15. Several Scope Vendors haveSeveral Scope Vendors have DDR4 Compliance ProductsDDR4 Compliance Products
  • 16. Performance MetricsPerformance Metrics • BER testing at particular transfer rates – 1E-16 • Which power management• Which power management features are implemented – Is the clock stopped in Self Refresh? – Is Max Power Down implemented?
  • 17. DDR4 Compliance TestingDDR4 Compliance Testing • DRAM Specific – DIMM/SODIMM/LRDIMM – RX/TX eye specification • Bit Error Rate – Data Integrity • Row Hammer (Excessive Activates) – Functional/Timing Verification – Performance Verification
  • 18. Making the MeasurementMaking the Measurement Photos Courtesy of Keysight Technologies Photos Courtesy of FuturePlus Systems
  • 19. Row Hammer: ExcessiveRow Hammer: Excessive ActivatesActivates • Yes its real…..easy to reproduce with software memory tests • CMU study is the most recent technical paper on the topictechnical paper on the topic • Tools now look for it • Data Centers are seeing it
  • 20. Row Hammer DetectionRow Hammer Detection Testing a Server for Row Hammer using the DDR Detective® with software from ThirdIO Row Hammer detection feature of the DDR Detective® lists the row address when the number of ACT commands exceeds a threshold within 64ms. Two programmable thresholds are available
  • 21. The electrical path between theThe electrical path between the DRAM and the MemoryDRAM and the Memory ControllerController • End to End Measurements to verify channel integrity • Require Slot/Channel loading configurations based on speed/eyeconfigurations based on speed/eye size • Memory channel layout/board files with simulation results? – Similar to what is done today with raw cards
  • 22. What about the LP DDRWhat about the LP DDR specifications?specifications? • Many of the tests are applicable and can be reused • Probing becomes more difficult but can be accomplishedcan be accomplished • Measuring Power Management features becomes more critical
  • 24. Advantages to having aAdvantages to having a Compliance Test SpecCompliance Test Spec • It is going to help our industry – End users are asking for this! • Helps eliminate ambiguities in the specificationspecification – Ex: tREFI=Average Refresh time…over how many cycles? • Makes high quality validation easier and less costly
  • 25. My Contact InformationMy Contact Information Barbara P Aichinger Vice President FuturePlus Systems Corporation Barb.Aichinger@FuturePlus.comBarb.Aichinger@FuturePlus.com Member JC42 and JC45 www.FuturePlus.com www.DDRDetective.com