SlideShare a Scribd company logo
High Level Design & ESL
How design cost is driving innovation in
system-level designs?
Rajesh Gupta
University of California, San Diego
mesl . ucsd . edu
FMCAD, Portland, Nov. 17, 2008
My main point
 At various time VLSI design has been driven by
 Area, timing, power, reliability, manufacturing variability
 Cost of design is likely to be the driver for future
innovations in how we architect, design and
implement future ICs in each of these areas:
 Tools, Methods
 Architectures
 Programming models and methods
3
12/18/03 R. Gupta, UC San Diego
The Technology and Its Industry
ComponentsSystems
Tools
Masks
Mask data
More Silicon to More Boxes…
 Of the 72 distinct application markets that rely on value
added IC designs (ASIC, ASSP, FPGA, SOC)
 over 50% are less than $500M, 75% are less than $1B
 The rising fabless, fablite
 The US has 56% of over 1K design houses…
 …and accounts for 76% of industry revenues
 (Wireless 27%, networking 25%, consumer 20%)
 Cost is increasingly the driver for fabless
 Only 17% of designs above 500 MHz
 67% of ASIC designs are 299 MHz and lower
 Sizes pretty much evenly distributed from 100K to 5M gates Source: IBS
Is there a problem here?
WW Market Forecast : ASIC vs. FPGA
0
5000
10000
15000
20000
25000
30000
35000
2003 2004 2005 2006 2007 2008 2009 2010 2011
$(millions)
Total ASIC
Total FPGA
Source: Gartner Dataquest “ASIC and FPGA WW Market Forecast, January 2008”
More & Moore
 Most things in real-life do not
scale anywhere close to this
 Battery energy, power sources
 Size, Space, Spectrum
 Design time.
 Dealing with the effects of
Moore
 “Embedded Systems”
486
Pad limited die:
200 pins
52 mm2
0 1 2 3 4 5 6
16x
14x
12x
10x
8x
6x
4x
2x
1x
Improvement(comparedtoyear0)
Time (years)
A Tale of Two Consequence
1. EDA: Raise abstractions
 Raising abstraction has always been part of the solution strategy to
lower design costs.
 In design modeling, design synthesis, design verification
1. Architecture: Raise programmability
 Holy Grail: ASIC efficiency with CPU programmability.
 The tremendous space of architectural innovations between ASIC
and FPGA
► Let us take a look at the two sides from a familiar
perspective
FPGA v. ASIC: Cost v. Volume
FPGA
ASIC
Structured ASIC, SA
cf
New Fabric, T
xf xa
ct
ca
Volume
Total Cost
A good solution:
xf  0 or better ASIC, ct  cf
xa  infinity or better FPGA, mtma
 Currently we are: cf = 2 ca ; mf = 20 ma
 Fixed cost of FPGA design = 2 * ASIC design costs
 Per part cost of FPGAs rises 20x cost of ASIC.

ASIC/FPGA Tradeoff
F
A
SA
cf
T
xf xa
ct
ca
Volume
Total Cost
A good solution:
xf  0 or better ASIC, ct  cf
xa  infinity or better FPGA,
mtma
Better ASIC or Better FPGA?
F
A
cf
ca
Volume
Total Cost
Improved Area Utilization
Reduced
Design Cost;
Chip
implementation,
Shuttles, etc.
Space of ‘synthetic’ solutions
F
A
cf
ca
Volume
TotalCost
Better area utilization
in FPGA, 7x target
F
A
cf
ca
Better synthesis,
EDA, 2x target
F
A
cf
ca
Design for synthesis,
3x cost increase
Technical Dimensions of the
Problem
 SE: Silicon Efficiency
 Inherently better circuit implementation styles, levels, logic:
Asynchronous, GALS
 AE: Architectural Efficiency
 Inherently improved application-level performance or performance
independent of mapping methods
 PA: Programmer Accessibility
 Use existing programming models/methods to ensure IP availability
and integration.
 DP: Designer Productivity
ITRS, last updated 2006
Designer Productivity is Challenge #1
Verification
Predictable
Implementation
Embedded SW
Distributed
design, AMS
Impact on Designer Productivity
Design Technology Year Productivity
Delta
gates/DY
Comments
Physical Design (APR) 1993 38.9% 5.55K PD integration
Tall-thin Engineer 1995 63.6% 9.1K Chip/circuit/PD/Verif.
Small block reuse 1997 340% 40K 2.5K-75K gates
Large block reuse 1999 38.9% 56K 75K-1M gates
IC implementation suits 2001 63.6% 91K RTL-GDSII integration
RTL functional verification 2003 37.5% 125K SW development verif.
ES Methodology 2005 60% 200K Behavioral above RTL
Very large block reuse 2007 200% 600K >1M gates, IP cores
Homogenous parallel
processing
2009 100-200%
1.2M
Many identical cores around a main processor
Intelligent test bench 2011 37.5%2.4M Automation of verification partitioning
Concurrent SW compiler 2013 60% 3.3M Enables SW in parallel SOCs
Heterogenous massive
parallel processing
2015 100-200%
5.3M
Specialized cores around a main processor
System-level DA and
executable specs
2017-
19
100-200%
10.5M
On/off-chip integration of functions.
Total 264,000%
Raising Verification
Golden
Reference
Model
Property Checker
Refinement or
Equivalence Checker
Verification Techniques
 Scalable techniques for automatic verification
of system designs
Architecture Level
Transaction Level Model (TLM)
(Non-Synthesizable Subset)
Register Transfer Level (RTL)
Micro-architecture Level
(Synthesizable Subset)
Mostly Manual
High Level
Synthesis
Translation
Validation
Automated
Theorem
Proving
Relational Approach
Verification Techniques
Partial
Order
Reduction
Explicit Stateless
Search
Automatic Test Generation
Property checker
Refinement/Equivalence checker
Refinement Checking
Input Program
(Specification)
Transformed
Program
(Implementation)
Transformations
Refinement
Or Equivalent
CheckerChecker
Prototype Implementation -
ARCCoS
CSP
Specification
CSP
Specification
Front End Parser
Specification (CFG)
CSP
Implementation
CSP
Implementation
Implementation (CFG)
Automated
Theorem Prover
(Simplify)
A
R
C
C
o
S
Simulation
Relation
Simulation
RelationInference Engine
Partial Order
Reduction Engine
Checking Engine
Results from ARCCoS
Descriptions #Process Time (no PO)
(min:sec)
Time (PO)
(min:sec)Spec Impl Total
Simple buffer 3 4 7 00:00 00:00
Simple vending machine 1 1 2 00:00 00:00
Cyclic scheduler 3 3 6 01:01 00:49
College student tracking system 1 2 3 00:01 00:01
Single communication link 3 8 11 00:01 00:01
2 parallel communication links 6 12 18 01:28 00:04
3 parallel communication links 9 16 25 514:52 00:21
4 parallel communication links 12 20 32 DNT 01:11
5 parallel communication links 15 24 39 DNT 02:32
6 parallel communication links 18 28 46 DNT 08:29
7 parallel communication links 21 32 53 DNT 37:28
Hardware refinement 3 5 8 00:00 00:00
EP2 System 1 2 3 01:51 01:47
Example
i2: k = p
i1: sum = 0
i3: (k < 10) i6: ¬ (k < 10)
i4: k = k + 1
i5: sum = sum + k
a2
a3
a4
a5
a1
a6
a0
i7: return sum
(a) Specification
+ + <Resource Allocation:
b1
b2 b3
b4
b0
j1: sum = 0
j2: k = p
j41: t = p + 1
j4: k = t
j5: sum = sum + t
j42: t = t + 1
j7: return sum
j6: ¬ (k < 10)j3: (k < 10)
(b) Implementation
(l1, l2) 1st
Pass 2nd
Pass
1. (a0, b0) ps = pi ps = pi
2. (a2, b1) ks = ki ks = ki Λ sums = sumi Λ (ks + 1) = ti
3. (a5, b3) sums = sumi sums = sumi
sum = ∑10
i
p+1
Loop pipelining
Copy propagation
On going work
Intermediate
Representation
Static Analysis
Partial Order
Information
Explicit Stateless Model Checker
Query
Engine
Explore
Engine
SystemC
Design
Test Bench
SystemC Simulator
Satya
Closing Thoughts
 ASIC design cost is the new driver
 Solution space is expanded to include not only tools but
also architectures
 A time for tremendous creativity
F
A
cf
ca
Volume
TotalCost
Better area utilization
in FPGA, 7x target
F
A
cf
ca
Volume
TotalCost
Better area utilization
in FPGA, 7x target
F
A
cf
ca
Better synthesis,
EDA, 2x target
F
A
cf
ca
Better synthesis,
EDA, 2x target
F
A
cf
ca
Design for synthesis,
3x cost increase
F
A
cf
ca
Design for synthesis,
3x cost increase

More Related Content

What's hot

GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
NVIDIA Taiwan
 
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
Edge AI and Vision Alliance
 
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ..."Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
Edge AI and Vision Alliance
 
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop ComputingMichael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
Michael Gschwind
 
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta..."The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
Edge AI and Vision Alliance
 
Test Amplification 
in the Pharo Smalltalk Ecosystem
Test Amplification 
in the Pharo Smalltalk EcosystemTest Amplification 
in the Pharo Smalltalk Ecosystem
Test Amplification 
in the Pharo Smalltalk Ecosystem
ESUG
 
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from..."Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
Edge AI and Vision Alliance
 
NI Delivers Next-Generation Test Systems with PXI
NI Delivers Next-Generation Test Systems with PXINI Delivers Next-Generation Test Systems with PXI
NI Delivers Next-Generation Test Systems with PXI
Hank Lydick
 
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- IntroLec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Hsien-Hsin Sean Lee, Ph.D.
 

What's hot (9)

GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
GTC Taiwan 2017 如何在充滿未知的巨量數據時代中建構一個數據中心
 
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
"Designing a Stereo IP Camera From Scratch," a Presentation from ELVEES
 
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ..."Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
"Optimizing SSD Object Detection for Low-power Devices," a Presentation from ...
 
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop ComputingMichael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
Michael Gschwind, Blue Gene/Q: Design for Sustained Multi-Petaflop Computing
 
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta..."The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
 
Test Amplification 
in the Pharo Smalltalk Ecosystem
Test Amplification 
in the Pharo Smalltalk EcosystemTest Amplification 
in the Pharo Smalltalk Ecosystem
Test Amplification 
in the Pharo Smalltalk Ecosystem
 
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from..."Developing Real-time Video Applications with CoaXPress," A Presentation from...
"Developing Real-time Video Applications with CoaXPress," A Presentation from...
 
NI Delivers Next-Generation Test Systems with PXI
NI Delivers Next-Generation Test Systems with PXINI Delivers Next-Generation Test Systems with PXI
NI Delivers Next-Generation Test Systems with PXI
 
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- IntroLec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
Lec1 Intro to Computer Engineering by Hsien-Hsin Sean Lee Georgia Tech -- Intro
 

Similar to Fmcad08

Chapter_01 Course Introduction.pdf
Chapter_01 Course Introduction.pdfChapter_01 Course Introduction.pdf
Chapter_01 Course Introduction.pdf
VoThanhPhong3
 
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Intel® Software
 
hetshah_resume
hetshah_resumehetshah_resume
hetshah_resume
het shah
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
 
VLSI Design Methodologies
VLSI Design MethodologiesVLSI Design Methodologies
VLSI Design Methodologies
Keshav
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor Design
Sri Prasanna
 
Semiconductor overview
Semiconductor overviewSemiconductor overview
Semiconductor overview
Nabil Chouba
 
11 Synchoricity as the basis for going Beyond Moore
11 Synchoricity as the basis for going Beyond Moore11 Synchoricity as the basis for going Beyond Moore
11 Synchoricity as the basis for going Beyond Moore
RCCSRENKEI
 
Hard IP Core design | Convolution Encoder
Hard IP Core design | Convolution EncoderHard IP Core design | Convolution Encoder
Hard IP Core design | Convolution Encoder
Archit Vora
 
Track e magma redefining mixed so c chipex2011 - magma da
Track e  magma redefining mixed so c chipex2011 - magma daTrack e  magma redefining mixed so c chipex2011 - magma da
Track e magma redefining mixed so c chipex2011 - magma da
chiportal
 
09.50 Ernst Vrolijks
09.50 Ernst Vrolijks09.50 Ernst Vrolijks
09.50 Ernst Vrolijks
Themadagen
 
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
Architectural Optimizations for High Performance and Energy Efficient Smith-W...Architectural Optimizations for High Performance and Energy Efficient Smith-W...
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
NECST Lab @ Politecnico di Milano
 
Bharat gargi final project report
Bharat gargi final project reportBharat gargi final project report
Bharat gargi final project report
Bharat Biyani
 
Application Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsApplication Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systems
Ganesan Narayanasamy
 
Ag o product overview
Ag o product overviewAg o product overview
Ag o product overview
Manoj Nagesh
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC design
AishwaryaRavishankar8
 
Trends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient PerformanceTrends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient Performance
inside-BigData.com
 
CASFPGA1.ppt
CASFPGA1.pptCASFPGA1.ppt
CASFPGA1.ppt
AswiniSamantray2
 
An Introduction to Field Programmable Gate Arrays
An Introduction to Field Programmable Gate ArraysAn Introduction to Field Programmable Gate Arrays
An Introduction to Field Programmable Gate Arrays
KingshukDas35
 
Ananthprofilepln
AnanthprofileplnAnanthprofilepln
Ananthprofilepln
ananthch
 

Similar to Fmcad08 (20)

Chapter_01 Course Introduction.pdf
Chapter_01 Course Introduction.pdfChapter_01 Course Introduction.pdf
Chapter_01 Course Introduction.pdf
 
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
 
hetshah_resume
hetshah_resumehetshah_resume
hetshah_resume
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
VLSI Design Methodologies
VLSI Design MethodologiesVLSI Design Methodologies
VLSI Design Methodologies
 
Parallelism Processor Design
Parallelism Processor DesignParallelism Processor Design
Parallelism Processor Design
 
Semiconductor overview
Semiconductor overviewSemiconductor overview
Semiconductor overview
 
11 Synchoricity as the basis for going Beyond Moore
11 Synchoricity as the basis for going Beyond Moore11 Synchoricity as the basis for going Beyond Moore
11 Synchoricity as the basis for going Beyond Moore
 
Hard IP Core design | Convolution Encoder
Hard IP Core design | Convolution EncoderHard IP Core design | Convolution Encoder
Hard IP Core design | Convolution Encoder
 
Track e magma redefining mixed so c chipex2011 - magma da
Track e  magma redefining mixed so c chipex2011 - magma daTrack e  magma redefining mixed so c chipex2011 - magma da
Track e magma redefining mixed so c chipex2011 - magma da
 
09.50 Ernst Vrolijks
09.50 Ernst Vrolijks09.50 Ernst Vrolijks
09.50 Ernst Vrolijks
 
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
Architectural Optimizations for High Performance and Energy Efficient Smith-W...Architectural Optimizations for High Performance and Energy Efficient Smith-W...
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
 
Bharat gargi final project report
Bharat gargi final project reportBharat gargi final project report
Bharat gargi final project report
 
Application Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsApplication Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systems
 
Ag o product overview
Ag o product overviewAg o product overview
Ag o product overview
 
Trends and challenges in IP based SOC design
Trends and challenges in IP based SOC designTrends and challenges in IP based SOC design
Trends and challenges in IP based SOC design
 
Trends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient PerformanceTrends in Systems and How to Get Efficient Performance
Trends in Systems and How to Get Efficient Performance
 
CASFPGA1.ppt
CASFPGA1.pptCASFPGA1.ppt
CASFPGA1.ppt
 
An Introduction to Field Programmable Gate Arrays
An Introduction to Field Programmable Gate ArraysAn Introduction to Field Programmable Gate Arrays
An Introduction to Field Programmable Gate Arrays
 
Ananthprofilepln
AnanthprofileplnAnanthprofilepln
Ananthprofilepln
 

More from Rajesh Gupta

Gupta datamule
Gupta datamuleGupta datamule
Gupta datamule
Rajesh Gupta
 
Calit2 intro12
Calit2 intro12Calit2 intro12
Calit2 intro12
Rajesh Gupta
 
Bob conn2013
Bob conn2013Bob conn2013
Bob conn2013
Rajesh Gupta
 
Nr cgupta
Nr cguptaNr cgupta
Nr cgupta
Rajesh Gupta
 
Embedded Intro India05
Embedded Intro India05Embedded Intro India05
Embedded Intro India05
Rajesh Gupta
 
Isat06 Rev2
Isat06 Rev2Isat06 Rev2
Isat06 Rev2
Rajesh Gupta
 
Green Radio08
Green Radio08Green Radio08
Green Radio08
Rajesh Gupta
 
Access
AccessAccess
Access
Rajesh Gupta
 
Gli07
Gli07Gli07

More from Rajesh Gupta (9)

Gupta datamule
Gupta datamuleGupta datamule
Gupta datamule
 
Calit2 intro12
Calit2 intro12Calit2 intro12
Calit2 intro12
 
Bob conn2013
Bob conn2013Bob conn2013
Bob conn2013
 
Nr cgupta
Nr cguptaNr cgupta
Nr cgupta
 
Embedded Intro India05
Embedded Intro India05Embedded Intro India05
Embedded Intro India05
 
Isat06 Rev2
Isat06 Rev2Isat06 Rev2
Isat06 Rev2
 
Green Radio08
Green Radio08Green Radio08
Green Radio08
 
Access
AccessAccess
Access
 
Gli07
Gli07Gli07
Gli07
 

Recently uploaded

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 

Recently uploaded (20)

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 

Fmcad08

  • 1. High Level Design & ESL How design cost is driving innovation in system-level designs? Rajesh Gupta University of California, San Diego mesl . ucsd . edu FMCAD, Portland, Nov. 17, 2008
  • 2. My main point  At various time VLSI design has been driven by  Area, timing, power, reliability, manufacturing variability  Cost of design is likely to be the driver for future innovations in how we architect, design and implement future ICs in each of these areas:  Tools, Methods  Architectures  Programming models and methods
  • 3. 3 12/18/03 R. Gupta, UC San Diego The Technology and Its Industry ComponentsSystems Tools Masks Mask data
  • 4. More Silicon to More Boxes…  Of the 72 distinct application markets that rely on value added IC designs (ASIC, ASSP, FPGA, SOC)  over 50% are less than $500M, 75% are less than $1B  The rising fabless, fablite  The US has 56% of over 1K design houses…  …and accounts for 76% of industry revenues  (Wireless 27%, networking 25%, consumer 20%)  Cost is increasingly the driver for fabless  Only 17% of designs above 500 MHz  67% of ASIC designs are 299 MHz and lower  Sizes pretty much evenly distributed from 100K to 5M gates Source: IBS
  • 5. Is there a problem here? WW Market Forecast : ASIC vs. FPGA 0 5000 10000 15000 20000 25000 30000 35000 2003 2004 2005 2006 2007 2008 2009 2010 2011 $(millions) Total ASIC Total FPGA Source: Gartner Dataquest “ASIC and FPGA WW Market Forecast, January 2008”
  • 6. More & Moore  Most things in real-life do not scale anywhere close to this  Battery energy, power sources  Size, Space, Spectrum  Design time.  Dealing with the effects of Moore  “Embedded Systems” 486 Pad limited die: 200 pins 52 mm2 0 1 2 3 4 5 6 16x 14x 12x 10x 8x 6x 4x 2x 1x Improvement(comparedtoyear0) Time (years)
  • 7. A Tale of Two Consequence 1. EDA: Raise abstractions  Raising abstraction has always been part of the solution strategy to lower design costs.  In design modeling, design synthesis, design verification 1. Architecture: Raise programmability  Holy Grail: ASIC efficiency with CPU programmability.  The tremendous space of architectural innovations between ASIC and FPGA ► Let us take a look at the two sides from a familiar perspective
  • 8. FPGA v. ASIC: Cost v. Volume FPGA ASIC Structured ASIC, SA cf New Fabric, T xf xa ct ca Volume Total Cost A good solution: xf  0 or better ASIC, ct  cf xa  infinity or better FPGA, mtma  Currently we are: cf = 2 ca ; mf = 20 ma  Fixed cost of FPGA design = 2 * ASIC design costs  Per part cost of FPGAs rises 20x cost of ASIC. 
  • 9. ASIC/FPGA Tradeoff F A SA cf T xf xa ct ca Volume Total Cost A good solution: xf  0 or better ASIC, ct  cf xa  infinity or better FPGA, mtma
  • 10. Better ASIC or Better FPGA? F A cf ca Volume Total Cost Improved Area Utilization Reduced Design Cost; Chip implementation, Shuttles, etc. Space of ‘synthetic’ solutions
  • 11. F A cf ca Volume TotalCost Better area utilization in FPGA, 7x target F A cf ca Better synthesis, EDA, 2x target F A cf ca Design for synthesis, 3x cost increase
  • 12. Technical Dimensions of the Problem  SE: Silicon Efficiency  Inherently better circuit implementation styles, levels, logic: Asynchronous, GALS  AE: Architectural Efficiency  Inherently improved application-level performance or performance independent of mapping methods  PA: Programmer Accessibility  Use existing programming models/methods to ensure IP availability and integration.  DP: Designer Productivity
  • 13. ITRS, last updated 2006 Designer Productivity is Challenge #1 Verification Predictable Implementation Embedded SW Distributed design, AMS
  • 14. Impact on Designer Productivity Design Technology Year Productivity Delta gates/DY Comments Physical Design (APR) 1993 38.9% 5.55K PD integration Tall-thin Engineer 1995 63.6% 9.1K Chip/circuit/PD/Verif. Small block reuse 1997 340% 40K 2.5K-75K gates Large block reuse 1999 38.9% 56K 75K-1M gates IC implementation suits 2001 63.6% 91K RTL-GDSII integration RTL functional verification 2003 37.5% 125K SW development verif. ES Methodology 2005 60% 200K Behavioral above RTL Very large block reuse 2007 200% 600K >1M gates, IP cores Homogenous parallel processing 2009 100-200% 1.2M Many identical cores around a main processor Intelligent test bench 2011 37.5%2.4M Automation of verification partitioning Concurrent SW compiler 2013 60% 3.3M Enables SW in parallel SOCs Heterogenous massive parallel processing 2015 100-200% 5.3M Specialized cores around a main processor System-level DA and executable specs 2017- 19 100-200% 10.5M On/off-chip integration of functions. Total 264,000%
  • 15. Raising Verification Golden Reference Model Property Checker Refinement or Equivalence Checker Verification Techniques  Scalable techniques for automatic verification of system designs Architecture Level Transaction Level Model (TLM) (Non-Synthesizable Subset) Register Transfer Level (RTL) Micro-architecture Level (Synthesizable Subset) Mostly Manual High Level Synthesis Translation Validation Automated Theorem Proving Relational Approach Verification Techniques Partial Order Reduction Explicit Stateless Search Automatic Test Generation Property checker Refinement/Equivalence checker
  • 17. Prototype Implementation - ARCCoS CSP Specification CSP Specification Front End Parser Specification (CFG) CSP Implementation CSP Implementation Implementation (CFG) Automated Theorem Prover (Simplify) A R C C o S Simulation Relation Simulation RelationInference Engine Partial Order Reduction Engine Checking Engine
  • 18. Results from ARCCoS Descriptions #Process Time (no PO) (min:sec) Time (PO) (min:sec)Spec Impl Total Simple buffer 3 4 7 00:00 00:00 Simple vending machine 1 1 2 00:00 00:00 Cyclic scheduler 3 3 6 01:01 00:49 College student tracking system 1 2 3 00:01 00:01 Single communication link 3 8 11 00:01 00:01 2 parallel communication links 6 12 18 01:28 00:04 3 parallel communication links 9 16 25 514:52 00:21 4 parallel communication links 12 20 32 DNT 01:11 5 parallel communication links 15 24 39 DNT 02:32 6 parallel communication links 18 28 46 DNT 08:29 7 parallel communication links 21 32 53 DNT 37:28 Hardware refinement 3 5 8 00:00 00:00 EP2 System 1 2 3 01:51 01:47
  • 19. Example i2: k = p i1: sum = 0 i3: (k < 10) i6: ¬ (k < 10) i4: k = k + 1 i5: sum = sum + k a2 a3 a4 a5 a1 a6 a0 i7: return sum (a) Specification + + <Resource Allocation: b1 b2 b3 b4 b0 j1: sum = 0 j2: k = p j41: t = p + 1 j4: k = t j5: sum = sum + t j42: t = t + 1 j7: return sum j6: ¬ (k < 10)j3: (k < 10) (b) Implementation (l1, l2) 1st Pass 2nd Pass 1. (a0, b0) ps = pi ps = pi 2. (a2, b1) ks = ki ks = ki Λ sums = sumi Λ (ks + 1) = ti 3. (a5, b3) sums = sumi sums = sumi sum = ∑10 i p+1 Loop pipelining Copy propagation
  • 20. On going work Intermediate Representation Static Analysis Partial Order Information Explicit Stateless Model Checker Query Engine Explore Engine SystemC Design Test Bench SystemC Simulator Satya
  • 21. Closing Thoughts  ASIC design cost is the new driver  Solution space is expanded to include not only tools but also architectures  A time for tremendous creativity F A cf ca Volume TotalCost Better area utilization in FPGA, 7x target F A cf ca Volume TotalCost Better area utilization in FPGA, 7x target F A cf ca Better synthesis, EDA, 2x target F A cf ca Better synthesis, EDA, 2x target F A cf ca Design for synthesis, 3x cost increase F A cf ca Design for synthesis, 3x cost increase

Editor's Notes

  1. In this work I propose a verification framenwork consisting of scalable techniques for automatic verification of system designs. In particular I propose techniques for two components a Property checker and a Refinement or equivalence checker. The property checker takes as input a high-level model and through a reapeted process of checking and correction produces a golden reference model. The refinement or the equivalent checker takes as input a specification and a implementation and checks if the implementation preserves certain properties of the specification. In short, we use a combination of techniques from … to do property checking and a combination of techniques from … to equivalence checking Our results shows that our techniques are efficient and useful.
  2. Consider an Input program called Specification which goes through a series of transformation to form a transformed program called Implementation. The problem at hand is that we want to make sure that the implementation preserves some properties of the specification. One way to do this is to use a checker and prove once for all that the transformations used will always produce correct results. However, proving refinements using this method is usually very hard. Another way, and the one that we use is to prove it for each pair of programs Here we prove that the implementation is a refinement of the specification or it is equivalent to the spec. By refinement we mean that the possible set of traces of the implementation is a subset of the traces of the Specification.
  3. We have implemented these two algorithms in a system called ARCCoS. In ARCCoS we have also implemented a very simple PO technique to reduce the no of interleavings in the control state of the 2 programs
  4. Next we applied our tool through a set of benchmarks the result of which is shown in this table. The most significant result here is that we were able to run our tool in a example with 53 threads in around 37 mins We also ran our tool over parts of an industrial example namely EP2 system and was also able to infer the simulation relation pretty quickly.
  5. Next lets see an example of the transformation done by spark This program computes the sum from p+1 to 10. SPARK takes this input and applies loop pipelining and copy propagation and some other heuristics to get the scheduled IR which is shown in the right side. In particular, the instruction i4: k = k+1 is duplicated and one copy is put above the loop and one at the end of the loop. Then some instruction are coupled together based on the resource alocation to be executed concurrently. Our goal is to show that the specification and impl are equivalent. So we applied our algorithm to validate it. The result of the forward pass finds the points of interest and the local conditions The 2 nd pass or the backward pass propagates these local conditions backwards using weakest pre condition till we reach a fixpoint.
  6. This slide shows the big picture of our approach: We are building a framework called Satya that takes as input a systemC design and then converts it to an intermediate representation. We then do static analysis to generate the POR info and an initial test bench. These informations are used by an explicit stateless model checker that explores all possible behaviors of the design. It then if required generates another test bench and explore again for that test bench.