SlideShare a Scribd company logo
1 of 25
Download to read offline
Hao Zhuang1, 2, Wenjian Yu1*, Gang Hu1, Zuochang Ye3 
1 Department of Computer Science and Technology, 3 Institute of Microelectronics, Tsinghua University, Beijing, China 
2 School of Electronics Engineering and Computer Science, 
Peking University, Beijing, China 
Speaker: Hao Zhuang 
Numerical Characterization of Multi-Dielectric Green’s Function for 3-D Capacitance Extraction with Floating Random Walk Algorithm
Outline 
ī‚—Background 
ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction 
ī‚—Numerical characterization of multi-layer Green’s functions by FDM 
ī‚—FDM & FRW’s Numerical Results 
ī‚—Conclusions 
2
Background 
ī‚—Field Solver on Capacitance Extraction based on 
ī‚—Discretization-based method (like FastCap): 
ī‚—fast and accurate 
ī‚—not scalable to large structure due to 
ī‚—the large demand of computational time or 
ī‚—the bottleneck of memory usage. 
ī‚—Discretization-free method 
ī‚—like Floating Random Walk Algorithm (FRW) in this paper 
ī‚—Advantages: 
ī‚—lower memory usage 
ī‚—more scalability for large structures and 
ī‚—tunable accuracy 
ī‚—FRW algorithm evolved to commercial capacitance solvers like QuickCap of Magma Inc. 
ī‚—Recent advances for variation-aware capacitance extraction [ICCAD09] by MIT 
3
Backgrounds 
ī‚—Challenges 
ī‚—Little literature reveals the algorithm details of the 3-D FRW for multi-dielectric capacitance extraction. 
ī‚—CAPEM is a FRW solver to deal with these problems, but not published and only binary code available. 
ī‚—Recently, we’ve developed FRW to handle multi-dielectric structure, by sphere transition domain to go across dielectrics interface [another article in ASICON’12]. 
However, extraction of VLSI interconnects embedded in 5~10 layers of dielectrics, the efficiency would be largely lost. (see later in the talk) 
4
Outline 
ī‚—Background 
ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction 
ī‚—Numerical characterization of multi-layer Green’s functions 
ī‚—FDM & FRW’s Numerical Results 
ī‚—Conclusions 
5
3-D FRW Algorithm for Capacitance Extraction 
ī‚—Fundamental formula is potential calculation, 
is the electric potential on point r, S is a closed surface surrounding r. is called the Green’s function, 
ī‚—Recursion to express 
ī‚—Can be solved by Monte Carlo (MC) Integration 
6
3-D FRW Algorithm for Capacitance Extraction 
ī‚—For capacitance problem, set master conductor with 1 volt, other with 0 volt, calculate the charge accumulated in conductors, 
Gi is the Gaussian surface containing only master conductor inside. D(r) is the field displacement in r, F(r) is dielectric constant at r, n(r) is normal vector at r from Gaussian surface 
ī‚—Transform (3),obtain 
is weight function. 
7
3-D FRW Algorithm for Capacitance Extraction 
Fig. Transition domain’s PDF pre-computed 
Gi 
8
3-D FRW Algorithm for Capacitance Extraction 
ī‚—It is a homogeneous case in last slide. To my best of knowledge, the analytical equation for transition domain with dielectrics is not available. 
ī‚—Recently, The FRW we’ve developed handles multi-dielectric structure, by introducing sphere transition domain when hitting interface. (Algo1) 
Gaussian Surface 
Only equation we can use analytically 
9
3-D FRW Algorithm for Capacitance Extraction 
ī‚—Lost efficiency in 5~10 layers of dielectrics 
ī‚—Interface is really a problem 
Gaussian Surface 
walk stops frequently approaching dielectric interface 
increase hops! 
Only equation we can use analytically 
10 
ī‚—It is a homogeneous case in last slide. To my best of knowledge, the analytical equation for transition domain with dielectrics is not available. 
ī‚—Recently, The FRW we’ve developed handles multi-dielectric structure, by introducing sphere transition domain when hitting interface. (Algo1)
3-D FRW Algorithm for Capacitance Extraction 
11 
ī‚—The modified FRW in this paper (Algo2) 
ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability 
ī‚—and store them in GF Tables 
ī‚—to aid random walk to cross the interface
3-D FRW Algorithm for Capacitance Extraction 
ī‚—The modified FRW in this paper (Algo2) 
ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability 
ī‚—and store them in GF Tables 
ī‚—to aid random walk to cross the interface 
ī‚—Finite Set V.S infinite online walk 
īƒ  Mismatch? 
Store them in GFTs 
Gaussian Surface 
12
3-D FRW Algorithm for Capacitance Extraction 
ī‚—The modified FRW in this paper (Algo2) 
ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability 
ī‚—and store them in GF Tables 
ī‚—to aid random walk to cross the interface 
ī‚—Mismatch? Shrink the size of domain 
ī‚—Trade-off between memory & speed 
Store them in GFTs 
Gaussian Surface 
13
3-D FRW Algorithm for Capacitance Extraction 
ī‚—The modified FRW in this paper (Algo2) 
ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability 
ī‚—and store them in GF Tables 
ī‚—to aid random walk to cross the interface 
ī‚—Mismatch? Shrink the size of domain 
ī‚—Trade-off between memory & speed 
Q 
Question: How can we get the probability for transition? 
Store them in GFTs 
Gaussian Surface 
14
Outline 
ī‚—Background 
ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction 
ī‚—Numerical characterization of multi-layer Green’s functions 
ī‚—FDM & FRW’s Numerical Results 
ī‚—Conclusions 
15
Numerical characterization of multi-layer Green’s functions 
ī‚—Problem Formulation 
ī‚—Free charge space 
ī‚—Interface with continuous condition 
ī‚—Use Finite Difference method 
16
Numerical characterization of multi-layer Green’s functions 
ī‚—Matrix Formulation 
ī‚—Potential value at inner grids 
ī‚—The k-th grid’s potential by multiple a vector with 1 in k-th position and 0 (otherwise) 
ī‚—Eliminate the boundary condition vector, This is the transition probability we want! It describe the relation between center point and boundary points 
Inner grids 
Boundary points 
Points reside at interface grids 
Boundary condition 
17
Numerical characterization of multi-layer Green’s functions 
ī‚—Coefficient of inner grids and continuous condition to avoid mismatch of numeric error order 
ī‚—(a) use normal 7 point scheme 
ī‚—(b) eq(12) 
ī‚—(c) u0: eq(13) 
ī‚—And the coefficient on interface 
18
Numerical characterization of multi-layer Green’s functions 
ī‚—The situation when walk hits the interface requires interface in the middle layer of domain 
19
Outline 
ī‚—Background 
ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction 
ī‚—Numerical characterization of multi-layer Green’s functions 
ī‚—FDM & FRW’s Numerical Results 
ī‚—Conclusions 
20
FDM & FRW’s numerical result PDF Distribution solved by FDM 
21
FDM & FRW Numerical Results The efficiency of FDM 
ī‚—Comparison with the same solver utilized by CAPEM* 
* M. P. Desai, “The Capacitance Extraction Tool,” http://www.ee.iitb.ac.in/~microel/download. 
4X Speedups 
22
FDM & FRW’s Numerical Results FRW results Compared to Algo1 
ī‚—The3 layers belongs to 5 layers without thin dielectrics 
2.1X Speedups 
h 
The3 layers belongs to 9 layers without thin dielectrics 
3.5X Speedups 
ī‚—Increase only 6MB memory overhead 
41 wires in the 3 layers 
Placed in the brown zone 
23
Conclusions 
ī‚—By using pre-computed 2-layer Green’s function for cube transition domain will accelerate FRW in multi-dielectric cases around 2X~4X 
ī‚—Our generator is faster than CAPEM’s 
24
Thank you 
Q&A 
The END

More Related Content

What's hot

Flexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticFlexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticIeee Xpert
 
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...VLSICS Design
 
Reza Talk En Kf 09
Reza Talk En Kf 09Reza Talk En Kf 09
Reza Talk En Kf 09rezatavakoli
 
Graph based transistor network generation method for supergate design
Graph based transistor network generation method for supergate designGraph based transistor network generation method for supergate design
Graph based transistor network generation method for supergate designjpstudcorner
 
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...IJERA Editor
 
FPGA Implementation of SubByte & Inverse SubByte for AES Algorithm
FPGA Implementation of SubByte & Inverse SubByte for AES AlgorithmFPGA Implementation of SubByte & Inverse SubByte for AES Algorithm
FPGA Implementation of SubByte & Inverse SubByte for AES Algorithmijsrd.com
 
h264_publication_1
h264_publication_1h264_publication_1
h264_publication_1Nan Ma
 
Flexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticFlexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticNexgen Technology
 
Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Sangamesh Ragate
 
Relay Vehicle Formations for Optimizing Communication Quality in Robot Networks
Relay Vehicle Formations for Optimizing Communication Quality in Robot NetworksRelay Vehicle Formations for Optimizing Communication Quality in Robot Networks
Relay Vehicle Formations for Optimizing Communication Quality in Robot NetworksMd Mahbubur Rahman
 
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSORCOUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSORIJNSA Journal
 
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...VIT-AP University
 
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...IJRES Journal
 
Enhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulationsEnhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulationsAlexander Pozdneev
 
Efficient Design of Reversible Multiplexers with Low Quantum Cost
Efficient Design of Reversible Multiplexers with Low Quantum CostEfficient Design of Reversible Multiplexers with Low Quantum Cost
Efficient Design of Reversible Multiplexers with Low Quantum CostIJERA Editor
 
Power and Delay Analysis of Logic Circuits Using Reversible Gates
Power and Delay Analysis of Logic Circuits Using Reversible GatesPower and Delay Analysis of Logic Circuits Using Reversible Gates
Power and Delay Analysis of Logic Circuits Using Reversible GatesRSIS International
 
Area efficient parallel LFSR for cyclic redundancy check
Area efficient parallel LFSR for cyclic redundancy check  Area efficient parallel LFSR for cyclic redundancy check
Area efficient parallel LFSR for cyclic redundancy check IJECEIAES
 

What's hot (19)

Flexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticFlexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmetic
 
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...
SIMULTANEOUS OPTIMIZATION OF STANDBY AND ACTIVE ENERGY FOR SUB-THRESHOLD CIRC...
 
Reza Talk En Kf 09
Reza Talk En Kf 09Reza Talk En Kf 09
Reza Talk En Kf 09
 
Graph based transistor network generation method for supergate design
Graph based transistor network generation method for supergate designGraph based transistor network generation method for supergate design
Graph based transistor network generation method for supergate design
 
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
Comparative Study of Low Power Low Area Bypass Multipliers for Signal Process...
 
J0166875
J0166875J0166875
J0166875
 
FPGA Implementation of SubByte & Inverse SubByte for AES Algorithm
FPGA Implementation of SubByte & Inverse SubByte for AES AlgorithmFPGA Implementation of SubByte & Inverse SubByte for AES Algorithm
FPGA Implementation of SubByte & Inverse SubByte for AES Algorithm
 
h264_publication_1
h264_publication_1h264_publication_1
h264_publication_1
 
carry select adder
carry select addercarry select adder
carry select adder
 
Flexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmeticFlexible dsp accelerator architecture exploiting carry save arithmetic
Flexible dsp accelerator architecture exploiting carry save arithmetic
 
Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)
 
Relay Vehicle Formations for Optimizing Communication Quality in Robot Networks
Relay Vehicle Formations for Optimizing Communication Quality in Robot NetworksRelay Vehicle Formations for Optimizing Communication Quality in Robot Networks
Relay Vehicle Formations for Optimizing Communication Quality in Robot Networks
 
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSORCOUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
 
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...
EVOLUTION OF STRUCTURE OF SOME BINARY GROUP-BASED N-BIT COMPARATOR, N-TO-2N D...
 
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...
Direct Design of Reversible Combinational and Sequential Circuits Using PSDRM...
 
Enhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulationsEnhanced MPSM3 for applications to quantum biological simulations
Enhanced MPSM3 for applications to quantum biological simulations
 
Efficient Design of Reversible Multiplexers with Low Quantum Cost
Efficient Design of Reversible Multiplexers with Low Quantum CostEfficient Design of Reversible Multiplexers with Low Quantum Cost
Efficient Design of Reversible Multiplexers with Low Quantum Cost
 
Power and Delay Analysis of Logic Circuits Using Reversible Gates
Power and Delay Analysis of Logic Circuits Using Reversible GatesPower and Delay Analysis of Logic Circuits Using Reversible Gates
Power and Delay Analysis of Logic Circuits Using Reversible Gates
 
Area efficient parallel LFSR for cyclic redundancy check
Area efficient parallel LFSR for cyclic redundancy check  Area efficient parallel LFSR for cyclic redundancy check
Area efficient parallel LFSR for cyclic redundancy check
 

Similar to RWCap ASCION2011

Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerTELKOMNIKA JOURNAL
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveIJORCS
 
A Low power and area efficient CLA adder design using Full swing GDI technique
A Low power and area efficient CLA adder design using Full swing GDI techniqueA Low power and area efficient CLA adder design using Full swing GDI technique
A Low power and area efficient CLA adder design using Full swing GDI techniqueIJERA Editor
 
Study on self resetting logic with gate diffusion input (SRL-GDI)
Study on self resetting logic with gate diffusion input (SRL-GDI)Study on self resetting logic with gate diffusion input (SRL-GDI)
Study on self resetting logic with gate diffusion input (SRL-GDI)shubham jha
 
Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite eSAT Journals
 
Efficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliersEfficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multiplierseSAT Publishing House
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Derryck Lamptey, MPhil, CISSP
 
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...Ealwan Lee
 
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNEL
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNELNEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNEL
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNELijcseit
 
EUCAP 2021_presentation (7)
EUCAP 2021_presentation (7)EUCAP 2021_presentation (7)
EUCAP 2021_presentation (7)Hamdi Bilel
 
Analysis Of Transmission Line Using MATLAB Software
Analysis Of Transmission Line Using MATLAB SoftwareAnalysis Of Transmission Line Using MATLAB Software
Analysis Of Transmission Line Using MATLAB SoftwareAllison Thompson
 
THD Optimization in 13 level photovoltaic inverter using Genetic Algorithm
THD Optimization in 13 level photovoltaic inverter using Genetic AlgorithmTHD Optimization in 13 level photovoltaic inverter using Genetic Algorithm
THD Optimization in 13 level photovoltaic inverter using Genetic AlgorithmSuman Debnath
 
JGrass: the Horton machine (FOSS4G2008)
JGrass: the Horton machine (FOSS4G2008)JGrass: the Horton machine (FOSS4G2008)
JGrass: the Horton machine (FOSS4G2008)Andrea Antonello
 
Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)	Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET) Benyamin Moadab
 

Similar to RWCap ASCION2011 (20)

AllPosters
AllPostersAllPosters
AllPosters
 
Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf Optimizer
 
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A RetrospectiveFPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
 
wcnc05
wcnc05wcnc05
wcnc05
 
A Low power and area efficient CLA adder design using Full swing GDI technique
A Low power and area efficient CLA adder design using Full swing GDI techniqueA Low power and area efficient CLA adder design using Full swing GDI technique
A Low power and area efficient CLA adder design using Full swing GDI technique
 
Study on self resetting logic with gate diffusion input (SRL-GDI)
Study on self resetting logic with gate diffusion input (SRL-GDI)Study on self resetting logic with gate diffusion input (SRL-GDI)
Study on self resetting logic with gate diffusion input (SRL-GDI)
 
PRESTO POWER
PRESTO POWERPRESTO POWER
PRESTO POWER
 
PRESTO POWER
PRESTO POWERPRESTO POWER
PRESTO POWER
 
4g lte matlab
4g lte matlab4g lte matlab
4g lte matlab
 
Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite
 
Efficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliersEfficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliers
 
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
 
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...
A Refined Skew Matrix Model of the CIM3 in the Up-Mixer Extending the Duality...
 
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNEL
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNELNEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNEL
NEW BER ANALYSIS OF OFDM SYSTEM OVER NAKAGAMI-n (RICE) FADING CHANNEL
 
EUCAP 2021_presentation (7)
EUCAP 2021_presentation (7)EUCAP 2021_presentation (7)
EUCAP 2021_presentation (7)
 
Analysis Of Transmission Line Using MATLAB Software
Analysis Of Transmission Line Using MATLAB SoftwareAnalysis Of Transmission Line Using MATLAB Software
Analysis Of Transmission Line Using MATLAB Software
 
THD Optimization in 13 level photovoltaic inverter using Genetic Algorithm
THD Optimization in 13 level photovoltaic inverter using Genetic AlgorithmTHD Optimization in 13 level photovoltaic inverter using Genetic Algorithm
THD Optimization in 13 level photovoltaic inverter using Genetic Algorithm
 
JGrass: the Horton machine (FOSS4G2008)
JGrass: the Horton machine (FOSS4G2008)JGrass: the Horton machine (FOSS4G2008)
JGrass: the Horton machine (FOSS4G2008)
 
Solution(1)
Solution(1)Solution(1)
Solution(1)
 
Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)	Ship Ad-hoc Network (SANET)
Ship Ad-hoc Network (SANET)
 

Recently uploaded

CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptJasonTagapanGulla
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsSachinPawar510423
 
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitterShivangiSharma879191
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 

Recently uploaded (20)

young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.ppt
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documents
 
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡ī¸9711147426✨Call In girls Gurgaon Sector 51 escort service
 
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTACÂŽ CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
Call Us â‰Ŋ 8377877756 â‰ŧ Call Girls In Shastri Nagar (Delhi)
 

RWCap ASCION2011

  • 1. Hao Zhuang1, 2, Wenjian Yu1*, Gang Hu1, Zuochang Ye3 1 Department of Computer Science and Technology, 3 Institute of Microelectronics, Tsinghua University, Beijing, China 2 School of Electronics Engineering and Computer Science, Peking University, Beijing, China Speaker: Hao Zhuang Numerical Characterization of Multi-Dielectric Green’s Function for 3-D Capacitance Extraction with Floating Random Walk Algorithm
  • 2. Outline ī‚—Background ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction ī‚—Numerical characterization of multi-layer Green’s functions by FDM ī‚—FDM & FRW’s Numerical Results ī‚—Conclusions 2
  • 3. Background ī‚—Field Solver on Capacitance Extraction based on ī‚—Discretization-based method (like FastCap): ī‚—fast and accurate ī‚—not scalable to large structure due to ī‚—the large demand of computational time or ī‚—the bottleneck of memory usage. ī‚—Discretization-free method ī‚—like Floating Random Walk Algorithm (FRW) in this paper ī‚—Advantages: ī‚—lower memory usage ī‚—more scalability for large structures and ī‚—tunable accuracy ī‚—FRW algorithm evolved to commercial capacitance solvers like QuickCap of Magma Inc. ī‚—Recent advances for variation-aware capacitance extraction [ICCAD09] by MIT 3
  • 4. Backgrounds ī‚—Challenges ī‚—Little literature reveals the algorithm details of the 3-D FRW for multi-dielectric capacitance extraction. ī‚—CAPEM is a FRW solver to deal with these problems, but not published and only binary code available. ī‚—Recently, we’ve developed FRW to handle multi-dielectric structure, by sphere transition domain to go across dielectrics interface [another article in ASICON’12]. However, extraction of VLSI interconnects embedded in 5~10 layers of dielectrics, the efficiency would be largely lost. (see later in the talk) 4
  • 5. Outline ī‚—Background ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction ī‚—Numerical characterization of multi-layer Green’s functions ī‚—FDM & FRW’s Numerical Results ī‚—Conclusions 5
  • 6. 3-D FRW Algorithm for Capacitance Extraction ī‚—Fundamental formula is potential calculation, is the electric potential on point r, S is a closed surface surrounding r. is called the Green’s function, ī‚—Recursion to express ī‚—Can be solved by Monte Carlo (MC) Integration 6
  • 7. 3-D FRW Algorithm for Capacitance Extraction ī‚—For capacitance problem, set master conductor with 1 volt, other with 0 volt, calculate the charge accumulated in conductors, Gi is the Gaussian surface containing only master conductor inside. D(r) is the field displacement in r, F(r) is dielectric constant at r, n(r) is normal vector at r from Gaussian surface ī‚—Transform (3),obtain is weight function. 7
  • 8. 3-D FRW Algorithm for Capacitance Extraction Fig. Transition domain’s PDF pre-computed Gi 8
  • 9. 3-D FRW Algorithm for Capacitance Extraction ī‚—It is a homogeneous case in last slide. To my best of knowledge, the analytical equation for transition domain with dielectrics is not available. ī‚—Recently, The FRW we’ve developed handles multi-dielectric structure, by introducing sphere transition domain when hitting interface. (Algo1) Gaussian Surface Only equation we can use analytically 9
  • 10. 3-D FRW Algorithm for Capacitance Extraction ī‚—Lost efficiency in 5~10 layers of dielectrics ī‚—Interface is really a problem Gaussian Surface walk stops frequently approaching dielectric interface increase hops! Only equation we can use analytically 10 ī‚—It is a homogeneous case in last slide. To my best of knowledge, the analytical equation for transition domain with dielectrics is not available. ī‚—Recently, The FRW we’ve developed handles multi-dielectric structure, by introducing sphere transition domain when hitting interface. (Algo1)
  • 11. 3-D FRW Algorithm for Capacitance Extraction 11 ī‚—The modified FRW in this paper (Algo2) ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability ī‚—and store them in GF Tables ī‚—to aid random walk to cross the interface
  • 12. 3-D FRW Algorithm for Capacitance Extraction ī‚—The modified FRW in this paper (Algo2) ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability ī‚—and store them in GF Tables ī‚—to aid random walk to cross the interface ī‚—Finite Set V.S infinite online walk īƒ  Mismatch? Store them in GFTs Gaussian Surface 12
  • 13. 3-D FRW Algorithm for Capacitance Extraction ī‚—The modified FRW in this paper (Algo2) ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability ī‚—and store them in GF Tables ī‚—to aid random walk to cross the interface ī‚—Mismatch? Shrink the size of domain ī‚—Trade-off between memory & speed Store them in GFTs Gaussian Surface 13
  • 14. 3-D FRW Algorithm for Capacitance Extraction ī‚—The modified FRW in this paper (Algo2) ī‚—Pre-characterize the transition domain by Green’s Function (GF) to obtain transition probability ī‚—and store them in GF Tables ī‚—to aid random walk to cross the interface ī‚—Mismatch? Shrink the size of domain ī‚—Trade-off between memory & speed Q Question: How can we get the probability for transition? Store them in GFTs Gaussian Surface 14
  • 15. Outline ī‚—Background ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction ī‚—Numerical characterization of multi-layer Green’s functions ī‚—FDM & FRW’s Numerical Results ī‚—Conclusions 15
  • 16. Numerical characterization of multi-layer Green’s functions ī‚—Problem Formulation ī‚—Free charge space ī‚—Interface with continuous condition ī‚—Use Finite Difference method 16
  • 17. Numerical characterization of multi-layer Green’s functions ī‚—Matrix Formulation ī‚—Potential value at inner grids ī‚—The k-th grid’s potential by multiple a vector with 1 in k-th position and 0 (otherwise) ī‚—Eliminate the boundary condition vector, This is the transition probability we want! It describe the relation between center point and boundary points Inner grids Boundary points Points reside at interface grids Boundary condition 17
  • 18. Numerical characterization of multi-layer Green’s functions ī‚—Coefficient of inner grids and continuous condition to avoid mismatch of numeric error order ī‚—(a) use normal 7 point scheme ī‚—(b) eq(12) ī‚—(c) u0: eq(13) ī‚—And the coefficient on interface 18
  • 19. Numerical characterization of multi-layer Green’s functions ī‚—The situation when walk hits the interface requires interface in the middle layer of domain 19
  • 20. Outline ī‚—Background ī‚—3-D Floating Random Walk Algorithm for Capacitance Extraction ī‚—Numerical characterization of multi-layer Green’s functions ī‚—FDM & FRW’s Numerical Results ī‚—Conclusions 20
  • 21. FDM & FRW’s numerical result PDF Distribution solved by FDM 21
  • 22. FDM & FRW Numerical Results The efficiency of FDM ī‚—Comparison with the same solver utilized by CAPEM* * M. P. Desai, “The Capacitance Extraction Tool,” http://www.ee.iitb.ac.in/~microel/download. 4X Speedups 22
  • 23. FDM & FRW’s Numerical Results FRW results Compared to Algo1 ī‚—The3 layers belongs to 5 layers without thin dielectrics 2.1X Speedups h The3 layers belongs to 9 layers without thin dielectrics 3.5X Speedups ī‚—Increase only 6MB memory overhead 41 wires in the 3 layers Placed in the brown zone 23
  • 24. Conclusions ī‚—By using pre-computed 2-layer Green’s function for cube transition domain will accelerate FRW in multi-dielectric cases around 2X~4X ī‚—Our generator is faster than CAPEM’s 24
  • 25. Thank you Q&A The END