SlideShare a Scribd company logo
1 of 28
Design Tradeoffs for SSD Reliability
Speaker: Po-Chuan, Chen
Contents
1. Abstract
2. Introduction
3. Background
4. Design Tradeoffs for SSD Reliability
5. Evaluation of SSD Reliability
6. Holistic Reliability Management
7. Related Work
8. Conclusion
Abstract
• Non of modern SSDs offer a one size fits all solution when considering
the multi-dimensional requirement.
• Examining the design tradeoffs
• Proposing a holistic reliability management scheme
1. Selectively employs redundancy
2. Conditionally re-reads
3. Judiciously selects data to scrub
Introduction
• Flash memory-based SSDs has become a mainstream storage device
• But, the drive for high storage density has caused the flash memory
to become less reliable and more error prone
• The high error rates in today’s flash memory are caused by various
reasons
 wear-and-tear
 gradual charge leakage
 data disturbance
This variety is caused by the fact that there is no one size fits all solution for data
protection and recovery:
each technique has a multi-dimensional design tradeoff that makes it necessary to
compositionally combine complementary solutions
But, these reliability enhancements, in fact, cause performance degradation in SSD
When?
 The use of data re-read mechanisms should be managed
 In the absence of random and sporadic errors, the overheads of
intra-SSD redundancy outweigh its benefits in terms of
performance, write amplification, and reliability.
 SSD-internal scrubbing reduces the error-induced long-tail latencies,
but it increases the internal traffic that negates its benefits
How ?
• A holistic reliability management scheme
 Conditionally uses data re-read mechanism to reduce the effects of
read disturbance
 Judiciously selects data to scrub so that the internal relocation
traffic is managed.
 Redundancy is applied only to infrequently accessed cold data to
reduce write amplification
 Frequently read read-hot data are selected for scrubbing based on a
cost-benefit analysis
Error in Flash Memory
• Three major sources of flash memory errors
 Wear
 Retention loss
 Disturbance
Disturbance and retention errors are opposing error mechanisms, but
they do not necessarily cancel each other out
始誤碼率RBER(Raw Bit Error Rate)
確切反應Nand Flash的初始可靠性狀況
SSD Reliability Enhancement Techniques
讀取數據後,將存儲的 ECC 代碼與讀取數據時生成的 ECC 代碼進行比較。
如果在任何情況下都存在不匹配,則奇偶校驗位對其進行解密以確定哪個位有錯誤並立即糾正
Design Tradeoffs for SSD Reliability
 Error-Prone Flash Memory
 Mechanism in Flash Memory Controller
 Role of Flash Translation Layer
Error-Prone Flash Memory
Mechanism in Flash Memory Controller
• Flash memory controller not only abstracts the operational details of
flash memory, but also handles common case error correction.
• Errors beyond the hard-decision ECC’s correction strength are
subsequently handled by the flash memory controller with data re-
reads.
• If the data cannot be recovered after max retry count, the firmware is
notified of a read error.
Solution
Modeling threshold voltage tuning and soft-decision decoding in a way
that each successive reads effectively reduces the RBER of the data by
retry scale factor.
Role of Flash Translation Layer
The flash translation layer (FTL) consists of a number of SSD-internal
housekeeping tasks that collectively hide the quirks of flash memory
and provide an illusion of a traditional block device
 Intra-SSD redundancy (reconstruct data when ECC fails)
 Data scrubbing (prevent errors from accumulating by relocating
them in the background)
Evaluation of SSD Reliability
DiskSim environment (simulate SSD environment)
• Error Correction Code
• Intra-SSD Redundancy
• Background Scrubbing
• Retention Test
Error Correction Code
The average response time for read requests for the three types of
SSDs at various wear states
Write amplification
實際寫入的物理資料量是寫入資料量的多倍
Ref : https://en.wikipedia.org/wiki/Write_amplification
Intra-SSD Redundancy
• It performs no better, accelerates wear through increased write
amplification, and, what’s worse, may not fully recover data due to
correlated failures.
Background Scrubbing
The scrubber’s performance overhead is less than the redundancy
scheme, and the increase in write amplification only occurs towards
the end-of-life phase.
Retention Test
In this section, exploring the effects of data loss due to charge leakage
by initializing a non-zero time-since-written value for each data.
Discussion
• Data re-read :
Degrade performance, increase the average response time
• Intra-SSD redundancy :
Bad for performance, write amplification, and reliability
But the only way to recover data when encountering a random chip and
word line failure
• Data scrubbing :
Reduce the performance degradation but not a cure-for-all
However
 The data re-read mechanism we modeled is too optimistic, as it
eventually corrects errors given enough re-read.
 Second, the short 1 hour experiments are insufficient to show UBER
< 10−15. I/Os in the order of petascale are required to experimentally
show this level of reliability.
 Real flash memories nevertheless exhibit random and sporadic fault
Holistic Reliability Management
• Redundancy should be selectively applied only to infrequently accessed
cold data to reduce write amplification while providing protection against
retention errors
• Frequently read read-hot data should be
relocated through scrubbing to reduce
the data re-reads, but the benefit of
scrubbing should be compared
against the cost of data relocation
Workload and Test Settings
Use which one ?
 ∞-bit ECC : corrects all errors (baseline)
(performance degradation due to queueing delays & garbage collection)
 ECC + re-read : flash memory controller
(repeatedly re-read until correct)
 Oracle scrub : knows the errors and preventively relocates them before
errors accumulate
(if fails → ECC + re-read approach)
 HRM (best one)
Experimental Results
Related work
 Reliability Enhancement
 Voltage prediction
 Intra-SSD redundancy
 QoS Performance
 AutoSSD
 RLGC
 ttFlash
Conclusion
• Proposing a reliability management scheme that selectively applies
appropriate techniques to different data
 The limitation of our study reveals the necessity to integrate the
SSD-level design framework
 Mathematically model the effectiveness of data rereads and data
scrubbing

More Related Content

Similar to Design Tradeoffs for SSD Reliability.pptx

Using preferred read groups in oracle asm michael ault
Using preferred read groups in oracle asm michael aultUsing preferred read groups in oracle asm michael ault
Using preferred read groups in oracle asm michael ault
Louis liu
 
SSD WhitePaper by Houman Shabani
SSD WhitePaper  by Houman ShabaniSSD WhitePaper  by Houman Shabani
SSD WhitePaper by Houman Shabani
Houman Shabani
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
Don Brizendine
 

Similar to Design Tradeoffs for SSD Reliability.pptx (20)

A Study of SSD Reliability in Large Scale Enterprise Storage Deployments.pptx
A Study of SSD Reliability in Large Scale Enterprise Storage Deployments.pptxA Study of SSD Reliability in Large Scale Enterprise Storage Deployments.pptx
A Study of SSD Reliability in Large Scale Enterprise Storage Deployments.pptx
 
How Do I Know My SQL & Virtual Environments Are Ready for SSD?
How Do I Know My SQL & Virtual Environments Are Ready for SSD?How Do I Know My SQL & Virtual Environments Are Ready for SSD?
How Do I Know My SQL & Virtual Environments Are Ready for SSD?
 
Reliability of NAND-Based SSDs What Field Studies Tell Us.pptx
Reliability of NAND-Based SSDs What Field Studies Tell Us.pptxReliability of NAND-Based SSDs What Field Studies Tell Us.pptx
Reliability of NAND-Based SSDs What Field Studies Tell Us.pptx
 
FlashEmbedding: Storing Embedding Tables in SSD for Large-Scale Recommender S...
FlashEmbedding: Storing Embedding Tables in SSD for Large-Scale Recommender S...FlashEmbedding: Storing Embedding Tables in SSD for Large-Scale Recommender S...
FlashEmbedding: Storing Embedding Tables in SSD for Large-Scale Recommender S...
 
Using preferred read groups in oracle asm michael ault
Using preferred read groups in oracle asm michael aultUsing preferred read groups in oracle asm michael ault
Using preferred read groups in oracle asm michael ault
 
Ceph - High Performance Without High Costs
Ceph - High Performance Without High CostsCeph - High Performance Without High Costs
Ceph - High Performance Without High Costs
 
Reliable Hydra SSD Architecture for General Purpose Controllers
Reliable Hydra SSD Architecture for General Purpose ControllersReliable Hydra SSD Architecture for General Purpose Controllers
Reliable Hydra SSD Architecture for General Purpose Controllers
 
Oracle RAC Internals - The Cache Fusion Edition
Oracle RAC Internals - The Cache Fusion EditionOracle RAC Internals - The Cache Fusion Edition
Oracle RAC Internals - The Cache Fusion Edition
 
SSD WhitePaper by Houman Shabani
SSD WhitePaper  by Houman ShabaniSSD WhitePaper  by Houman Shabani
SSD WhitePaper by Houman Shabani
 
Clustering van IT-componenten
Clustering van IT-componentenClustering van IT-componenten
Clustering van IT-componenten
 
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
Hands-on Lab: How to Unleash Your Storage Performance by Using NVM Express™ B...
 
CaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value StoreCaSSanDra: An SSD Boosted Key-Value Store
CaSSanDra: An SSD Boosted Key-Value Store
 
Dell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage mythsDell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage myths
 
[db tech showcase Tokyo 2018] #dbts2018 #B17 『オラクル パフォーマンス チューニング - 神話、伝説と解決策』
[db tech showcase Tokyo 2018] #dbts2018 #B17 『オラクル パフォーマンス チューニング - 神話、伝説と解決策』[db tech showcase Tokyo 2018] #dbts2018 #B17 『オラクル パフォーマンス チューニング - 神話、伝説と解決策』
[db tech showcase Tokyo 2018] #dbts2018 #B17 『オラクル パフォーマンス チューニング - 神話、伝説と解決策』
 
Towards User-Defined SLA in Cloud Flash Storage.pptx
Towards User-Defined SLA in Cloud Flash Storage.pptxTowards User-Defined SLA in Cloud Flash Storage.pptx
Towards User-Defined SLA in Cloud Flash Storage.pptx
 
Tips and Tricks for SAP Sybase IQ
Tips and Tricks for SAP  Sybase IQTips and Tricks for SAP  Sybase IQ
Tips and Tricks for SAP Sybase IQ
 
A better storage solution
A better storage solutionA better storage solution
A better storage solution
 
Webinar: All-Flash For Databases: 5 Reasons Why Current Systems Are Off Target
Webinar: All-Flash For Databases: 5 Reasons Why Current Systems Are Off TargetWebinar: All-Flash For Databases: 5 Reasons Why Current Systems Are Off Target
Webinar: All-Flash For Databases: 5 Reasons Why Current Systems Are Off Target
 
Breaking the Sound Barrier with Persistent Memory
Breaking the Sound Barrier with Persistent Memory Breaking the Sound Barrier with Persistent Memory
Breaking the Sound Barrier with Persistent Memory
 
Demartek lenovo s3200_sql_server_evaluation_2016-01
Demartek lenovo s3200_sql_server_evaluation_2016-01Demartek lenovo s3200_sql_server_evaluation_2016-01
Demartek lenovo s3200_sql_server_evaluation_2016-01
 

More from Po-Chuan Chen

Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
Effective Structured Prompting by Meta-Learning and Representative Verbalizer...Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
Po-Chuan Chen
 
Quark: Controllable Text Generation with Reinforced [Un]learning.pdf
Quark: Controllable Text Generation with Reinforced [Un]learning.pdfQuark: Controllable Text Generation with Reinforced [Un]learning.pdf
Quark: Controllable Text Generation with Reinforced [Un]learning.pdf
Po-Chuan Chen
 
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
Po-Chuan Chen
 
On the Effectiveness of Offline RL for Dialogue Response Generation.pdf
On the Effectiveness of Offline RL for Dialogue Response Generation.pdfOn the Effectiveness of Offline RL for Dialogue Response Generation.pdf
On the Effectiveness of Offline RL for Dialogue Response Generation.pdf
Po-Chuan Chen
 
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
Po-Chuan Chen
 
A Statistical Perspective on Retrieval-Based Models.pdf
A Statistical Perspective on Retrieval-Based Models.pdfA Statistical Perspective on Retrieval-Based Models.pdf
A Statistical Perspective on Retrieval-Based Models.pdf
Po-Chuan Chen
 
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdfAdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
Po-Chuan Chen
 
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
Po-Chuan Chen
 
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdf
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdfOffline Reinforcement Learning for Informal Summarization in Online Domains.pdf
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdf
Po-Chuan Chen
 
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdfCold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
Po-Chuan Chen
 
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Po-Chuan Chen
 
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdfA Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
Po-Chuan Chen
 
Is Reinforcement Learning (Not) for Natural Language Processing.pdf
Is Reinforcement Learning (Not) for Natural
Language Processing.pdfIs Reinforcement Learning (Not) for Natural
Language Processing.pdf
Is Reinforcement Learning (Not) for Natural Language Processing.pdf
Po-Chuan Chen
 
HyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
HyperPrompt:Prompt-based Task-Conditioning of TransformerspdfHyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
HyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
Po-Chuan Chen
 

More from Po-Chuan Chen (20)

E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.pdf
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.pdfE-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.pdf
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation.pdf
 
Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
Effective Structured Prompting by Meta-Learning and Representative Verbalizer...Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
Effective Structured Prompting by Meta-Learning and Representative Verbalizer...
 
Quark: Controllable Text Generation with Reinforced [Un]learning.pdf
Quark: Controllable Text Generation with Reinforced [Un]learning.pdfQuark: Controllable Text Generation with Reinforced [Un]learning.pdf
Quark: Controllable Text Generation with Reinforced [Un]learning.pdf
 
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible...
 
On the Effectiveness of Offline RL for Dialogue Response Generation.pdf
On the Effectiveness of Offline RL for Dialogue Response Generation.pdfOn the Effectiveness of Offline RL for Dialogue Response Generation.pdf
On the Effectiveness of Offline RL for Dialogue Response Generation.pdf
 
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transfor...
 
A Statistical Perspective on Retrieval-Based Models.pdf
A Statistical Perspective on Retrieval-Based Models.pdfA Statistical Perspective on Retrieval-Based Models.pdf
A Statistical Perspective on Retrieval-Based Models.pdf
 
A Neural Corpus Indexer for Document Retrieval.pdf
A Neural Corpus Indexer for Document Retrieval.pdfA Neural Corpus Indexer for Document Retrieval.pdf
A Neural Corpus Indexer for Document Retrieval.pdf
 
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdfAdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning.pdf
 
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attent...
 
Active Retrieval Augmented Generation.pdf
Active Retrieval Augmented Generation.pdfActive Retrieval Augmented Generation.pdf
Active Retrieval Augmented Generation.pdf
 
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdf
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdfOffline Reinforcement Learning for Informal Summarization in Online Domains.pdf
Offline Reinforcement Learning for Informal Summarization in Online Domains.pdf
 
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdfCold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
Cold_Start_Reinforcement_Learning_with_Softmax_Policy_Gradient.pdf
 
Image_to_Prompts.pdf
Image_to_Prompts.pdfImage_to_Prompts.pdf
Image_to_Prompts.pdf
 
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
 
Evaluating Parameter Efficient Learning for Generation.pdf
Evaluating Parameter Efficient Learning for Generation.pdfEvaluating Parameter Efficient Learning for Generation.pdf
Evaluating Parameter Efficient Learning for Generation.pdf
 
Off-Policy Deep Reinforcement Learning without Exploration.pdf
Off-Policy Deep Reinforcement Learning without Exploration.pdfOff-Policy Deep Reinforcement Learning without Exploration.pdf
Off-Policy Deep Reinforcement Learning without Exploration.pdf
 
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdfA Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
A Mixture-of-Expert Approach to RL-based Dialogue Management.pdf
 
Is Reinforcement Learning (Not) for Natural Language Processing.pdf
Is Reinforcement Learning (Not) for Natural
Language Processing.pdfIs Reinforcement Learning (Not) for Natural
Language Processing.pdf
Is Reinforcement Learning (Not) for Natural Language Processing.pdf
 
HyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
HyperPrompt:Prompt-based Task-Conditioning of TransformerspdfHyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
HyperPrompt:Prompt-based Task-Conditioning of Transformerspdf
 

Recently uploaded

Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 

Recently uploaded (20)

Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 

Design Tradeoffs for SSD Reliability.pptx

  • 1. Design Tradeoffs for SSD Reliability Speaker: Po-Chuan, Chen
  • 2. Contents 1. Abstract 2. Introduction 3. Background 4. Design Tradeoffs for SSD Reliability 5. Evaluation of SSD Reliability 6. Holistic Reliability Management 7. Related Work 8. Conclusion
  • 3. Abstract • Non of modern SSDs offer a one size fits all solution when considering the multi-dimensional requirement. • Examining the design tradeoffs • Proposing a holistic reliability management scheme 1. Selectively employs redundancy 2. Conditionally re-reads 3. Judiciously selects data to scrub
  • 4. Introduction • Flash memory-based SSDs has become a mainstream storage device • But, the drive for high storage density has caused the flash memory to become less reliable and more error prone • The high error rates in today’s flash memory are caused by various reasons  wear-and-tear  gradual charge leakage  data disturbance
  • 5. This variety is caused by the fact that there is no one size fits all solution for data protection and recovery: each technique has a multi-dimensional design tradeoff that makes it necessary to compositionally combine complementary solutions But, these reliability enhancements, in fact, cause performance degradation in SSD
  • 6. When?  The use of data re-read mechanisms should be managed  In the absence of random and sporadic errors, the overheads of intra-SSD redundancy outweigh its benefits in terms of performance, write amplification, and reliability.  SSD-internal scrubbing reduces the error-induced long-tail latencies, but it increases the internal traffic that negates its benefits
  • 7. How ? • A holistic reliability management scheme  Conditionally uses data re-read mechanism to reduce the effects of read disturbance  Judiciously selects data to scrub so that the internal relocation traffic is managed.  Redundancy is applied only to infrequently accessed cold data to reduce write amplification  Frequently read read-hot data are selected for scrubbing based on a cost-benefit analysis
  • 8. Error in Flash Memory • Three major sources of flash memory errors  Wear  Retention loss  Disturbance Disturbance and retention errors are opposing error mechanisms, but they do not necessarily cancel each other out 始誤碼率RBER(Raw Bit Error Rate) 確切反應Nand Flash的初始可靠性狀況
  • 9. SSD Reliability Enhancement Techniques 讀取數據後,將存儲的 ECC 代碼與讀取數據時生成的 ECC 代碼進行比較。 如果在任何情況下都存在不匹配,則奇偶校驗位對其進行解密以確定哪個位有錯誤並立即糾正
  • 10. Design Tradeoffs for SSD Reliability  Error-Prone Flash Memory  Mechanism in Flash Memory Controller  Role of Flash Translation Layer
  • 12. Mechanism in Flash Memory Controller • Flash memory controller not only abstracts the operational details of flash memory, but also handles common case error correction. • Errors beyond the hard-decision ECC’s correction strength are subsequently handled by the flash memory controller with data re- reads. • If the data cannot be recovered after max retry count, the firmware is notified of a read error.
  • 13. Solution Modeling threshold voltage tuning and soft-decision decoding in a way that each successive reads effectively reduces the RBER of the data by retry scale factor.
  • 14. Role of Flash Translation Layer The flash translation layer (FTL) consists of a number of SSD-internal housekeeping tasks that collectively hide the quirks of flash memory and provide an illusion of a traditional block device  Intra-SSD redundancy (reconstruct data when ECC fails)  Data scrubbing (prevent errors from accumulating by relocating them in the background)
  • 15. Evaluation of SSD Reliability DiskSim environment (simulate SSD environment) • Error Correction Code • Intra-SSD Redundancy • Background Scrubbing • Retention Test
  • 16. Error Correction Code The average response time for read requests for the three types of SSDs at various wear states
  • 18. Intra-SSD Redundancy • It performs no better, accelerates wear through increased write amplification, and, what’s worse, may not fully recover data due to correlated failures.
  • 19. Background Scrubbing The scrubber’s performance overhead is less than the redundancy scheme, and the increase in write amplification only occurs towards the end-of-life phase.
  • 20. Retention Test In this section, exploring the effects of data loss due to charge leakage by initializing a non-zero time-since-written value for each data.
  • 21. Discussion • Data re-read : Degrade performance, increase the average response time • Intra-SSD redundancy : Bad for performance, write amplification, and reliability But the only way to recover data when encountering a random chip and word line failure • Data scrubbing : Reduce the performance degradation but not a cure-for-all
  • 22. However  The data re-read mechanism we modeled is too optimistic, as it eventually corrects errors given enough re-read.  Second, the short 1 hour experiments are insufficient to show UBER < 10−15. I/Os in the order of petascale are required to experimentally show this level of reliability.  Real flash memories nevertheless exhibit random and sporadic fault
  • 23. Holistic Reliability Management • Redundancy should be selectively applied only to infrequently accessed cold data to reduce write amplification while providing protection against retention errors • Frequently read read-hot data should be relocated through scrubbing to reduce the data re-reads, but the benefit of scrubbing should be compared against the cost of data relocation
  • 24. Workload and Test Settings
  • 25. Use which one ?  ∞-bit ECC : corrects all errors (baseline) (performance degradation due to queueing delays & garbage collection)  ECC + re-read : flash memory controller (repeatedly re-read until correct)  Oracle scrub : knows the errors and preventively relocates them before errors accumulate (if fails → ECC + re-read approach)  HRM (best one)
  • 27. Related work  Reliability Enhancement  Voltage prediction  Intra-SSD redundancy  QoS Performance  AutoSSD  RLGC  ttFlash
  • 28. Conclusion • Proposing a reliability management scheme that selectively applies appropriate techniques to different data  The limitation of our study reveals the necessity to integrate the SSD-level design framework  Mathematically model the effectiveness of data rereads and data scrubbing