SlideShare a Scribd company logo
1 of 27
Download to read offline
Defect-Aware Design
   Paradigm for Reconfigurable
   Architectures

                Rahul Jain, Anindita Mukherjee, Kolin Paul
                      Indian Institute of Technology, Delhi

Conference Travel Supported by XILINX-CMC, Hydrabad,
India
Introduction
VLSI Technology progress
for finer dimensions and
larger chip area has lead to:
  More Complex Fabrication
  Techniques
  High defect densities
  High Manufacturing Costs
Decrease in Critical Defect
Size
Increase in defect/fault-       Milton Godwin et al, “Examining
                                upcoming yield enhancement
sourcing complexity factor      challenges in the 2001 roadmap”,
                                Micro Magazine, 2002 Issue
Lower Yield
Introduction
 Low Yield problem can be overcome by
   Fault Tolerance
   Defect Tolerance
 This Relaxes Stringent Constraints on
 Manufacturing Processes
   Lowers Cost
 A tradeoff between process technology
 complexity in terms of yield and post-
 fabrication complexity in terms of defect-
 tolerance
Motivation
 Defect-tolerance more important as we near
 lithographic limits
 We are moving towards era of molecular electronics
 CAEN- Chemically assembled electronic
 nanotechnology
   Expected to have about 10% defects
 Reconfigurable Devices (e.g. FPGA’s) obvious
 choice to incorporate post-fabrication defect-
 tolerance
   Area Penalty
   Time Penalty
Defect Aware Design Flow

 FPGA used as the Reconfigurable Fabric
 PnR now expects a Defect Map as input
 Observations:
  Interconnect defects are the bottleneck as they
  occupy the most area
  Defect Distribution is clustered [C. H. Stapper 89]
  Congestion Aware Placement gives best Results
Experimental Setup

 VPR used as the FPGA PnR infrastructure
  placement and routing tool for array-based
  FPGAs from University of Toronto
 VPR modified to accept Defect Map as input
 Defects mapped to CLBs and interconnects
 Defective components assigned ZERO
 capacity, hence cannot be used in PnR
Motivational Example

 Conducted to Study the PnR behaviour for
 different Defect Densities
 Uniform Random Defect distribution for CLB’s
 and Routing Resources (RR) assumed
 Expected Behaviour:
  Placement always possible if enough defect-free
  CLB’s available
  As we increase the defect density of RR the circuit
  gets unroutable
Motivational Example
 Observed Behavior
  Expected behavior most of the time
  But some cases where Circuit is routable for Higher CLB
  defects with same or even Higher RR defects
  Reason for this is that at Higher CLB defects the
  Placement is more distributed and hence no/less
  congestion
 Conclusion
  Area Penalty Inevitable
  Placement is more distributive hence Time Penalty incurred
  Congestion Awareness needed at the Placement Step
Defect Map Generation

 Simulation Model by C.H Stapper used
 Model takes chip as a 2D Grid Array
 Uses a Symmetrical Bivariate Gaussian
 Distribution
 Cluster Shaping done on this Distribution
 Clusters randomly Rotated and Displaced
Mapping Defects on FPGA Fabric

 FPGA Fabric divided into a
 2-D Array of cells of side
 equal to interconnect width
 Interconnect modeled as 1-
 D Array of cells
 CLB modeled as 2-D array
 of cells
 Defect Map Generated for
 this FPGA 2D-Array
 If any defect maps to any
 Resource, Assign Zero
 capacity to the Resource
VPR Cost Functions

 Experiment Conducted to find the best PnR
 methodology in Defect Aware Environment
 VPR uses Simulated Annealing for Placement and
 Path_Finder Algorithm for routing
 4 types of Placement Cost Functions
   Bounding Box with Linear Cost function (BB_L)
   Bounding Box with Non-Linear Cost function (BB_NL)
   Net_timing_driven (Net)
   Path_timing_ driven (Path)
VPR Cost Functions
          Comparison of 4 Cost Functions
                                   T_Crit (10-8 s)
                    chan
 Bench      nx/ny
                     wd    BB_NL   BB_L        Net    path

  gcd        15      6     5.70    5.49        NA     NA

  scf        23     11     5.57    NA          NA     NA

 table3      27     11     7.58    8.32        6.28   6.0

 table5      22     11     8.07    8.14        NA     NA

 apex1       21      7     8.55    NA          NA     NA
RISA

 Bounding Box, Non Linear Algorithm gives
 the best results
 It is an implementation of cost-function RISA
 [Cheng, ICCAD’94]
 RISA balances demand and supply over an
 array of NxN regions
 RISA performs better than others because it
 is congestion aware
 RISA takes routability into account while
 doing placement
RISA

      Demand:
                    w×l                       w× l
D i , horiz = q ×         ; D i ,vert = q ×
                    Y×L                       X ×W
  k                             k




                                                     Chih-Liang Eric Cheng, “RISA:
      Supply:                                        accurate and efficient placement
                                                     routability modeling”, Proceedings of

       v   Sij = Tv / N     h   Sij = Th / N         the 1994 IEEE/ACM international
                                                     conference on Computer-aided design
CA-RISA

 Supply reduced due to defects
 Penalize defects with a weighing factor W
 Large value of W pushes the placements
 away from highly defective regions

    h    Sij = Th / N − W * ∑∑ N hdef
                           i   j

     v   Sij = Tv / N − W * ∑∑ N vdef
                           i   j
A Defective FPGA Fabric
                          Scf Benchmark
                          CLB’s = 418
                          nx/ny = 26
                          chan_wd = 16
                          % Def CLB’s = 2.7
                          % Def Int. = 9.8
Placement by RISA
Congestion Map for RISA Placement
Placement by CA-RISA



Routable /
Zero
Congestion
Base Cases- Defect Free Environment

                 Base Cost
 Bench    CLB        T_Crit    Total Area
   gcd     220      5.70e-08     34596
   scf     418      5.57e-08     78400
 table3    480      7.57e-08    148225
 table5    485      8.07e-08    161604
 apex1     700      8.55e-08    220900
  ex5p    1064      1.10e-07    422500
  alu4    1522      1.12e-07    454276
Current Design Methodology
              Defective Fabric
         No.of     Defect (%)        Penalty (%)
 Bench
          CLB's   clb      int     Time      Area
gcd         220     2.5     14.2     52.5      192
scf         418     1.3      7.4     39.6      382
table3      480     1.9     11.0     20.1      251
table5      485     3.7     12.6     -4.4      246
apex1       700     2.4     15.8      2.5      369
ex5p       1064     2.8     15.5      9.2      462
alu4       1522     3.2     17.6     12.6      610
Performance Evaluation of CA-RISA
                Least Time Penalty
                %Def        % Penalty
 Bench                                   % Imp
          clb      int    Time    Area
   gcd    2.5      14.2   45.6     192    4.50
   scf    1.3      7.4     8.6     382   22.15
 table3   1.9      11.0    1.1     251   15.79
 table5   1.8      12.0   -6.6     248    2.33
 apex1    2.4      15.8   -3.1     369    5.45
  ex5p    2.8      15.5    7.3     462    1.8
  alu4    3.2      17.6    4.3     610    8.0
Performance Evaluation of CA-RISA
                Least Area Penalty
                %Def        % Penalty
 Bench                                   %Imp
          clb       int   Time    Area
   gcd    2.5      14.2   45.6    192      0
   scf    2.7      9.8    30.1    341     8.59
 table3   4.0      12.1   14.6    180    20.44
 table5   2.9      13.9   15.6    200    13.42
 apex1    1.5      9.8    24.1    340     6.19
  ex5p    2.3      14.9   26.5    437     5.8
  alu4    2.8      18.3   16.5    553     10.4
RISA vs CA-RISA
                                  Congestion
Bench    nx,ny   chan_wd
                           RISA           CA-RISA
 gcd      17       12       0                  0
 scf      26       17       0                  0
 scf      26       16      578                 0
table3    25       19      373                 0
table3    24       18      280                 10
table5    25       21      692                 0
table5    26       20      738                 0
apex1     30       26      591                 0
apex1     31       25      669                 2
ex5p      34       33      1466                0
ex5p      35       32      1324                34
alu4      43       32       0                  0
alu4      42       31      578                 0
Future Work
 Proposed Methodology requires the circuit to
 be Placed and Routed for Each Chip
 Separately
 Not a Feasible Option
 Incorporate Defect Map into the Flow as late
 as possible
 One Possibility is to do PnR for non-defective
 fabric and at the last step incrementally map
 the solution to the defective fabric
Conclusion

 CA-RISA proposed and compared to existing
 methodology
 This is not a Complete Solution
 A motivation for Defect Aware Design
 Paradigm
 Tries to highlight the bottlenecks of the
 Present Methodology
Thank You

Questions

More Related Content

Viewers also liked

фейм испания кривые.Indd
фейм испания кривые.Inddфейм испания кривые.Indd
фейм испания кривые.Indddda_krasnodar
 
Estrategia empresarial redes sociales
Estrategia empresarial redes socialesEstrategia empresarial redes sociales
Estrategia empresarial redes socialesJuana Matmer
 
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...Rahul Jain
 
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...Rahul Jain
 
EstlGd.DataPartitioning
EstlGd.DataPartitioningEstlGd.DataPartitioning
EstlGd.DataPartitioningDaren Bieniek
 
Introduktion til nyt Intranet i Folkesparekassen
Introduktion til nyt Intranet i FolkesparekassenIntroduktion til nyt Intranet i Folkesparekassen
Introduktion til nyt Intranet i FolkesparekassenJesper Mørkenborg Kjær
 
гостиничный буклет
гостиничный буклетгостиничный буклет
гостиничный буклетdda_krasnodar
 
Vfrench slides persuasive
Vfrench slides persuasiveVfrench slides persuasive
Vfrench slides persuasiveValonnaFrench
 
Prehistory – Mediterranean Trade
Prehistory – Mediterranean TradePrehistory – Mediterranean Trade
Prehistory – Mediterranean TradeJanine_EIT
 
A Power Efficient Architecture for 2-D Discrete Wavelet Transform
A Power Efficient Architecture for 2-D Discrete Wavelet TransformA Power Efficient Architecture for 2-D Discrete Wavelet Transform
A Power Efficient Architecture for 2-D Discrete Wavelet TransformRahul Jain
 
Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Rahul Jain
 
Geomechanics services
Geomechanics servicesGeomechanics services
Geomechanics servicesJeffT10
 
Effect of capital adequacy on the profitability of the
Effect of capital adequacy on the profitability of theEffect of capital adequacy on the profitability of the
Effect of capital adequacy on the profitability of theolufemiadebayo
 
English for life reading pre intermediate a2
English for life reading pre intermediate a2English for life reading pre intermediate a2
English for life reading pre intermediate a2Da Solustri
 

Viewers also liked (16)

Menu fishka
Menu fishkaMenu fishka
Menu fishka
 
фейм испания кривые.Indd
фейм испания кривые.Inddфейм испания кривые.Indd
фейм испания кривые.Indd
 
Estrategia empresarial redes sociales
Estrategia empresarial redes socialesEstrategia empresarial redes sociales
Estrategia empresarial redes sociales
 
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...
Memory Architecture Exploration for Power-Efficient 2D-Discrete Wavelet Trans...
 
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...
An Efficient Pipelined VLSI Architecture for Lifting-Based 2D-Discrete Wavele...
 
EstlGd.DataPartitioning
EstlGd.DataPartitioningEstlGd.DataPartitioning
EstlGd.DataPartitioning
 
Introduktion til nyt Intranet i Folkesparekassen
Introduktion til nyt Intranet i FolkesparekassenIntroduktion til nyt Intranet i Folkesparekassen
Introduktion til nyt Intranet i Folkesparekassen
 
project_real_wp
project_real_wpproject_real_wp
project_real_wp
 
гостиничный буклет
гостиничный буклетгостиничный буклет
гостиничный буклет
 
Vfrench slides persuasive
Vfrench slides persuasiveVfrench slides persuasive
Vfrench slides persuasive
 
Prehistory – Mediterranean Trade
Prehistory – Mediterranean TradePrehistory – Mediterranean Trade
Prehistory – Mediterranean Trade
 
A Power Efficient Architecture for 2-D Discrete Wavelet Transform
A Power Efficient Architecture for 2-D Discrete Wavelet TransformA Power Efficient Architecture for 2-D Discrete Wavelet Transform
A Power Efficient Architecture for 2-D Discrete Wavelet Transform
 
Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000Low Power Architecture for JPEG2000
Low Power Architecture for JPEG2000
 
Geomechanics services
Geomechanics servicesGeomechanics services
Geomechanics services
 
Effect of capital adequacy on the profitability of the
Effect of capital adequacy on the profitability of theEffect of capital adequacy on the profitability of the
Effect of capital adequacy on the profitability of the
 
English for life reading pre intermediate a2
English for life reading pre intermediate a2English for life reading pre intermediate a2
English for life reading pre intermediate a2
 

Similar to Defect-Aware Design Paradigm for Reconfigurable Architectures

A Comparative Analysis on Parameters of Different Adder Topologies
A Comparative Analysis on Parameters of Different Adder TopologiesA Comparative Analysis on Parameters of Different Adder Topologies
A Comparative Analysis on Parameters of Different Adder TopologiesIRJET Journal
 
Optimization MVSIS vs AIG Rewriting (ABC)
Optimization MVSIS vs AIG Rewriting (ABC)Optimization MVSIS vs AIG Rewriting (ABC)
Optimization MVSIS vs AIG Rewriting (ABC)IJEEE
 
Formal estimation of worst case communication latency in a network on chip fi...
Formal estimation of worst case communication latency in a network on chip fi...Formal estimation of worst case communication latency in a network on chip fi...
Formal estimation of worst case communication latency in a network on chip fi...Vinita Palaniveloo
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphspione30
 
C-SAW: A Framework for Graph Sampling and Random Walk on GPUs
C-SAW: A Framework for Graph Sampling and Random Walk on GPUsC-SAW: A Framework for Graph Sampling and Random Walk on GPUs
C-SAW: A Framework for Graph Sampling and Random Walk on GPUsPandey_G
 
Project report on design & implementation of high speed carry select adder
Project report on design & implementation of high speed carry select adderProject report on design & implementation of high speed carry select adder
Project report on design & implementation of high speed carry select adderssingh7603
 
CFD Cornell Energy Workshop - M.F. Campuzano Ochoa
CFD Cornell Energy Workshop - M.F. Campuzano OchoaCFD Cornell Energy Workshop - M.F. Campuzano Ochoa
CFD Cornell Energy Workshop - M.F. Campuzano OchoaMario Felipe Campuzano Ochoa
 
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-Flop
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-FlopIRJET- Metastability Mitigation & Error Masking of High Speed Flip-Flop
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-FlopIRJET Journal
 
Design and Implementation of Different types of Carry skip adder
Design and Implementation of Different types of Carry skip adderDesign and Implementation of Different types of Carry skip adder
Design and Implementation of Different types of Carry skip adderIRJET Journal
 
Design and minimization of reversible programmable logic arrays and its reali...
Design and minimization of reversible programmable logic arrays and its reali...Design and minimization of reversible programmable logic arrays and its reali...
Design and minimization of reversible programmable logic arrays and its reali...Sajib Mitra
 
Low power & area efficient carry select adder
Low power & area efficient carry select adderLow power & area efficient carry select adder
Low power & area efficient carry select adderSai Vara Prasad P
 
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiLazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiDatabricks
 
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...Xiaoyu Shi
 
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...RISC-V International
 
1 s2.0-s2090447917300291-main
1 s2.0-s2090447917300291-main1 s2.0-s2090447917300291-main
1 s2.0-s2090447917300291-mainVIT-AP University
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mappingsatrajit
 

Similar to Defect-Aware Design Paradigm for Reconfigurable Architectures (20)

A Comparative Analysis on Parameters of Different Adder Topologies
A Comparative Analysis on Parameters of Different Adder TopologiesA Comparative Analysis on Parameters of Different Adder Topologies
A Comparative Analysis on Parameters of Different Adder Topologies
 
Optimization MVSIS vs AIG Rewriting (ABC)
Optimization MVSIS vs AIG Rewriting (ABC)Optimization MVSIS vs AIG Rewriting (ABC)
Optimization MVSIS vs AIG Rewriting (ABC)
 
Formal estimation of worst case communication latency in a network on chip fi...
Formal estimation of worst case communication latency in a network on chip fi...Formal estimation of worst case communication latency in a network on chip fi...
Formal estimation of worst case communication latency in a network on chip fi...
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
 
C-SAW: A Framework for Graph Sampling and Random Walk on GPUs
C-SAW: A Framework for Graph Sampling and Random Walk on GPUsC-SAW: A Framework for Graph Sampling and Random Walk on GPUs
C-SAW: A Framework for Graph Sampling and Random Walk on GPUs
 
Project report on design & implementation of high speed carry select adder
Project report on design & implementation of high speed carry select adderProject report on design & implementation of high speed carry select adder
Project report on design & implementation of high speed carry select adder
 
CFD Cornell Energy Workshop - M.F. Campuzano Ochoa
CFD Cornell Energy Workshop - M.F. Campuzano OchoaCFD Cornell Energy Workshop - M.F. Campuzano Ochoa
CFD Cornell Energy Workshop - M.F. Campuzano Ochoa
 
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-Flop
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-FlopIRJET- Metastability Mitigation & Error Masking of High Speed Flip-Flop
IRJET- Metastability Mitigation & Error Masking of High Speed Flip-Flop
 
Design and Implementation of Different types of Carry skip adder
Design and Implementation of Different types of Carry skip adderDesign and Implementation of Different types of Carry skip adder
Design and Implementation of Different types of Carry skip adder
 
Design and minimization of reversible programmable logic arrays and its reali...
Design and minimization of reversible programmable logic arrays and its reali...Design and minimization of reversible programmable logic arrays and its reali...
Design and minimization of reversible programmable logic arrays and its reali...
 
Low power & area efficient carry select adder
Low power & area efficient carry select adderLow power & area efficient carry select adder
Low power & area efficient carry select adder
 
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
 
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiLazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
 
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
 
Fmcad08
Fmcad08Fmcad08
Fmcad08
 
Ds36715716
Ds36715716Ds36715716
Ds36715716
 
ARPS Architecture 1
ARPS Architecture 1ARPS Architecture 1
ARPS Architecture 1
 
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
 
1 s2.0-s2090447917300291-main
1 s2.0-s2090447917300291-main1 s2.0-s2090447917300291-main
1 s2.0-s2090447917300291-main
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mapping
 

Recently uploaded

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 

Defect-Aware Design Paradigm for Reconfigurable Architectures

  • 1. Defect-Aware Design Paradigm for Reconfigurable Architectures Rahul Jain, Anindita Mukherjee, Kolin Paul Indian Institute of Technology, Delhi Conference Travel Supported by XILINX-CMC, Hydrabad, India
  • 2. Introduction VLSI Technology progress for finer dimensions and larger chip area has lead to: More Complex Fabrication Techniques High defect densities High Manufacturing Costs Decrease in Critical Defect Size Increase in defect/fault- Milton Godwin et al, “Examining upcoming yield enhancement sourcing complexity factor challenges in the 2001 roadmap”, Micro Magazine, 2002 Issue Lower Yield
  • 3. Introduction Low Yield problem can be overcome by Fault Tolerance Defect Tolerance This Relaxes Stringent Constraints on Manufacturing Processes Lowers Cost A tradeoff between process technology complexity in terms of yield and post- fabrication complexity in terms of defect- tolerance
  • 4. Motivation Defect-tolerance more important as we near lithographic limits We are moving towards era of molecular electronics CAEN- Chemically assembled electronic nanotechnology Expected to have about 10% defects Reconfigurable Devices (e.g. FPGA’s) obvious choice to incorporate post-fabrication defect- tolerance Area Penalty Time Penalty
  • 5. Defect Aware Design Flow FPGA used as the Reconfigurable Fabric PnR now expects a Defect Map as input Observations: Interconnect defects are the bottleneck as they occupy the most area Defect Distribution is clustered [C. H. Stapper 89] Congestion Aware Placement gives best Results
  • 6. Experimental Setup VPR used as the FPGA PnR infrastructure placement and routing tool for array-based FPGAs from University of Toronto VPR modified to accept Defect Map as input Defects mapped to CLBs and interconnects Defective components assigned ZERO capacity, hence cannot be used in PnR
  • 7. Motivational Example Conducted to Study the PnR behaviour for different Defect Densities Uniform Random Defect distribution for CLB’s and Routing Resources (RR) assumed Expected Behaviour: Placement always possible if enough defect-free CLB’s available As we increase the defect density of RR the circuit gets unroutable
  • 8. Motivational Example Observed Behavior Expected behavior most of the time But some cases where Circuit is routable for Higher CLB defects with same or even Higher RR defects Reason for this is that at Higher CLB defects the Placement is more distributed and hence no/less congestion Conclusion Area Penalty Inevitable Placement is more distributive hence Time Penalty incurred Congestion Awareness needed at the Placement Step
  • 9. Defect Map Generation Simulation Model by C.H Stapper used Model takes chip as a 2D Grid Array Uses a Symmetrical Bivariate Gaussian Distribution Cluster Shaping done on this Distribution Clusters randomly Rotated and Displaced
  • 10. Mapping Defects on FPGA Fabric FPGA Fabric divided into a 2-D Array of cells of side equal to interconnect width Interconnect modeled as 1- D Array of cells CLB modeled as 2-D array of cells Defect Map Generated for this FPGA 2D-Array If any defect maps to any Resource, Assign Zero capacity to the Resource
  • 11. VPR Cost Functions Experiment Conducted to find the best PnR methodology in Defect Aware Environment VPR uses Simulated Annealing for Placement and Path_Finder Algorithm for routing 4 types of Placement Cost Functions Bounding Box with Linear Cost function (BB_L) Bounding Box with Non-Linear Cost function (BB_NL) Net_timing_driven (Net) Path_timing_ driven (Path)
  • 12. VPR Cost Functions Comparison of 4 Cost Functions T_Crit (10-8 s) chan Bench nx/ny wd BB_NL BB_L Net path gcd 15 6 5.70 5.49 NA NA scf 23 11 5.57 NA NA NA table3 27 11 7.58 8.32 6.28 6.0 table5 22 11 8.07 8.14 NA NA apex1 21 7 8.55 NA NA NA
  • 13. RISA Bounding Box, Non Linear Algorithm gives the best results It is an implementation of cost-function RISA [Cheng, ICCAD’94] RISA balances demand and supply over an array of NxN regions RISA performs better than others because it is congestion aware RISA takes routability into account while doing placement
  • 14. RISA Demand: w×l w× l D i , horiz = q × ; D i ,vert = q × Y×L X ×W k k Chih-Liang Eric Cheng, “RISA: Supply: accurate and efficient placement routability modeling”, Proceedings of v Sij = Tv / N h Sij = Th / N the 1994 IEEE/ACM international conference on Computer-aided design
  • 15. CA-RISA Supply reduced due to defects Penalize defects with a weighing factor W Large value of W pushes the placements away from highly defective regions h Sij = Th / N − W * ∑∑ N hdef i j v Sij = Tv / N − W * ∑∑ N vdef i j
  • 16. A Defective FPGA Fabric Scf Benchmark CLB’s = 418 nx/ny = 26 chan_wd = 16 % Def CLB’s = 2.7 % Def Int. = 9.8
  • 18. Congestion Map for RISA Placement
  • 19. Placement by CA-RISA Routable / Zero Congestion
  • 20. Base Cases- Defect Free Environment Base Cost Bench CLB T_Crit Total Area gcd 220 5.70e-08 34596 scf 418 5.57e-08 78400 table3 480 7.57e-08 148225 table5 485 8.07e-08 161604 apex1 700 8.55e-08 220900 ex5p 1064 1.10e-07 422500 alu4 1522 1.12e-07 454276
  • 21. Current Design Methodology Defective Fabric No.of Defect (%) Penalty (%) Bench CLB's clb int Time Area gcd 220 2.5 14.2 52.5 192 scf 418 1.3 7.4 39.6 382 table3 480 1.9 11.0 20.1 251 table5 485 3.7 12.6 -4.4 246 apex1 700 2.4 15.8 2.5 369 ex5p 1064 2.8 15.5 9.2 462 alu4 1522 3.2 17.6 12.6 610
  • 22. Performance Evaluation of CA-RISA Least Time Penalty %Def % Penalty Bench % Imp clb int Time Area gcd 2.5 14.2 45.6 192 4.50 scf 1.3 7.4 8.6 382 22.15 table3 1.9 11.0 1.1 251 15.79 table5 1.8 12.0 -6.6 248 2.33 apex1 2.4 15.8 -3.1 369 5.45 ex5p 2.8 15.5 7.3 462 1.8 alu4 3.2 17.6 4.3 610 8.0
  • 23. Performance Evaluation of CA-RISA Least Area Penalty %Def % Penalty Bench %Imp clb int Time Area gcd 2.5 14.2 45.6 192 0 scf 2.7 9.8 30.1 341 8.59 table3 4.0 12.1 14.6 180 20.44 table5 2.9 13.9 15.6 200 13.42 apex1 1.5 9.8 24.1 340 6.19 ex5p 2.3 14.9 26.5 437 5.8 alu4 2.8 18.3 16.5 553 10.4
  • 24. RISA vs CA-RISA Congestion Bench nx,ny chan_wd RISA CA-RISA gcd 17 12 0 0 scf 26 17 0 0 scf 26 16 578 0 table3 25 19 373 0 table3 24 18 280 10 table5 25 21 692 0 table5 26 20 738 0 apex1 30 26 591 0 apex1 31 25 669 2 ex5p 34 33 1466 0 ex5p 35 32 1324 34 alu4 43 32 0 0 alu4 42 31 578 0
  • 25. Future Work Proposed Methodology requires the circuit to be Placed and Routed for Each Chip Separately Not a Feasible Option Incorporate Defect Map into the Flow as late as possible One Possibility is to do PnR for non-defective fabric and at the last step incrementally map the solution to the defective fabric
  • 26. Conclusion CA-RISA proposed and compared to existing methodology This is not a Complete Solution A motivation for Defect Aware Design Paradigm Tries to highlight the bottlenecks of the Present Methodology