SlideShare a Scribd company logo
POWER AND CLOCK GATING
MODELLING
IN
COARSE GRAINED
RECONFIGURABLE SYSTEMS
Francesca Palumbo
University of Sassari
PolComIng
Information Engineering Unit
Tiziana Fanni, Carlo Sau, Paolo Meloni
and Luigi Raffo
University of Cagliari
Dept. of Electrical and Electronics Eng.
tiziana.fanni@diee.unica.it
ACM International Conference on
Computing Frontiers 2016
E
O
L
A
B
OUTLINE
 Problem Statement
 Background
 Dataflow Model of Computation
 Coarse Grain Reconfiguration
 Multi-Dataflow Composer Tool
 Proposed approach
 Parameters Analysis
 Power Estimation Model
 Logic Regions Analysis
 Metodology assesment
 Conclusions
2
PROBLEM STATEMENT
Consumers need:





 3
PROBLEM STATEMENT
Consumers need:
 Integrated complex and fancy
resurce-intensive applications




 3
PROBLEM STATEMENT
Consumers need:
 Integrated complex and fancy
resurce-intensive applications
 Long battery life



 3
PROBLEM STATEMENT
Consumers need:
 Integrated complex and fancy
resurce-intensive applications
 Long battery life




Possible solutions:
3
PROBLEM STATEMENT
Consumers need:
 Integrated complex and fancy
resurce-intensive applications
 Long battery life
 Dataflow Model of Computation
 Modularity and parallelism 
 INTEGRATION AND RE-USABILITY


Possible solutions:
3
PROBLEM STATEMENT
Consumers need:
 Integrated complex and fancy
resurce-intensive applications
 Long battery life
 Dataflow Model of Computation
 Modularity and parallelism 
 INTEGRATION AND RE-USABILITY
 Coarse-graine reconfiguration
 Flexibility and resource sharing 
 MULTI-APPLICATION PORTABLE DEVICES
Possible solutions:
3
BACKGROUND
Dataflow Model of Computation
 DATAFLOW FORMALISM
 Directed graph of actors
(functional units).
 Actors exchange tokens
(data packets) through
dedicated channels


 4
BACKGROUND
Dataflow Model of Computation
 DATAFLOW FORMALISM
 Directed graph of actors
(functional units).
 Actors exchange tokens
(data packets) through
dedicated channels



actions
state
4
BACKGROUND
Dataflow Model of Computation
 DATAFLOW FORMALISM
 Directed graph of actors
(functional units).
 Actors exchange tokens
(data packets) through
dedicated channels
 CHARACTERISTICS
 Explicit the intrinsic application parallelism.
 Modularity favours model
re-usability/adaptivity.
actions
state
4
BACKGROUND
Coarse-Graine Reconfiguration
A
in1
D
in2
F
in3
E
G
B
Sbox_1
Sbox_2
out1
C
Sbox_0
ID net=1
LUTCoarse Grained Reconfigurable Platform
A B C
D E C
A
F
CG
Dataflow Descriptions
N:1
α
β
γ
α
β
γ
0 0
0
5
BACKGROUND
Coarse-Graine Reconfiguration
A
in1
D
in2
F
in3
E
G
B
Sbox_1
Sbox_2
out1
C
Sbox_0
ID net=3
LUTCoarse Grained Reconfigurable Platform
A B C
D E C
A
F
CG
Dataflow Descriptions
N:1
α
β
γ
α
β
γ
1 1
0



6
BACKGROUND
Coarse-Graine Reconfiguration
A
in1
D
in2
F
in3
E
G
B
Sbox_1
Sbox_2
out1
C
Sbox_0
ID net=3
LUTCoarse Grained Reconfigurable Platform
A B C
D E C
A
F
CG
Dataflow Descriptions
N:1
α
β
γ
α
β
γ
1 1
0
 Large Power consumption  Power Saving
Methodologies
 Dynamic  Clock-gating
 Static  Power-gating
6
BACKGROUND
http://sites.unica.it/rpct/Multi-Dataflow Composer Tool
A
in1
D
in2
F
in3
E
G
B
Sbox_1
Sbox_2
out1
C
LR1
LR2: Always ON LR
LR3
LR4
LR5
LR4
RTN
RTN
RTN
RTN
RTN
RTN
ISOL
ISOL
ISOLISOL
Sbox_0
ISOL
VDD
SW
VDD
SW
VDD
SW
ISOL
VDD
SW
ID net
Power
Controller
CG_en1
CG_en4
CG_en5
CG_en3
clock
7
PROPOSED APPROACH
Parameters analysis:










8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters









8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters
 Actors
 RAMs
 Combinatorial Logic
 Sequential Logic…





8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters
 Actors
 RAMs
 Combinatorial Logic
 Sequential Logic…
 Functional parameters




8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters
 Actors
 RAMs
 Combinatorial Logic
 Sequential Logic…
 Functional parameters
 Input Data
 LR’s Activation Time


8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters
 Actors
 RAMs
 Combinatorial Logic
 Sequential Logic…
 Functional parameters
 Input Data
 LR’s Activation Time
 Technological parameters

8
PROPOSED APPROACH
Parameters analysis:
 Architectural parameters
 Actors
 RAMs
 Combinatorial Logic
 Sequential Logic…
 Functional parameters
 Input Data
 LR’s Activation Time
 Technological parameters
 As transistors get smaller the contribution of
the static contribute gets larger and not negligible.
8
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i



 9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i



 9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i



 9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Combinational
Logic
Standard Cost
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Combinational
Logic
Sequential
Logic
Standard Cost
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Retention CellsCombinational
Logic
Sequential
Logic
Standard Cost Overhead
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Retention Cells
Isolation Cells
Combinational
Logic
Sequential
Logic
Standard Cost Overhead
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Retention Cells
Isolation Cells
Power Controller
Combinational
Logic
Sequential
Logic
Standard Cost Overhead
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Retention Cells
Isolation Cells
Power Controller
Clock Gating Cells
Combinational
Logic
Sequential
Logic
Standard Cost Overhead
9
PROPOSED APPROACH
 Power gating.
 Clock gating.
Power Estimation Model
]*)(*)([
]*)(*)([
]*)*#(*)*#([
*]/#)#(#*)()*#()([
)(_)()(
iOFFOFFiONON
iOFFOFFiONON
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TContrPTContrP
TisoISOPTisoISOP
TregrtnregregPrtnRCPcmbP
LROverExtLRPLRP
i






]*)(*)([
]*)()([
)(_)()(
iOFFOFFiONON
iON
LRactors
iiONi
TCGPTCGP
TregPcmbP
LROverExtLRPLRP
i




Retention Cells
Isolation Cells
Power Controller
Clock Gating Cells
Combinational
Logic
Sequential
Logic
Standard Cost Overhead
9
Term not present
in CG static model!
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
Power PG: +2%  Evaluate CG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
Power PG: +2%  Evaluate CG
Power CG: -5%  Apply CG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
Power PG: +2%  Evaluate CG
Power CG: -5%  Apply CG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
Power PG: +2%  Evaluate CG
Power CG: -5%  Apply CG
Area > Area_th Evaluate PG
10
PROPOSED APPROACH
LRs analysis
NO
YES
NO
YES
NO
YES
NO
YES
AREA
THRESHOLDING
ESTIMATE PG ESTIMATE CG
Select PDi to be
Implement with PG
Select PDi to be
Implement with CG
START
i=0
Increment i
STOP
1
2 3
∀PDi
i<N+1
Is Above?
Is there
Saving%?
Is there
Saving%?
LRs LR1 LR2 LR3
Area 2% 5.8% 23%
PG --- +2% -18%
CG +1% -5% ---
Area_th: 5%
Area < Area_th Evaluate CG
Power CG: +1%  Discard
Area > Area_th Evaluate PG
Power PG: +2%  Evaluate CG
Power CG: -5%  Apply CG
Area > Area_th Evaluate PG
Power CG: -18%  Apply PG
10
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
11
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
PDs TiON #reg Area
%
PD1 33% 1024 62.44
PD2 67% 512 0.46
PD3 37% 256 15.84
PD4 4% 0 0.41
PD5 21% 0 0.25
PD6 42% 0 0.43
PD7 58% 128 7.93
PD8 96% 512 1.36 11
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
PDs TiON #reg Area
%
PD1 33% 1024 62.44
PD2 67% 512 0.46
PD3 37% 256 15.84
PD4 4% 0 0.41
PD5 21% 0 0.25
PD6 42% 0 0.43
PD7 58% 128 7.93
PD8 96% 512 1.36
Highly active
11
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
PDs TiON #reg Area
%
PD1 33% 1024 62.44
PD2 67% 512 0.46
PD3 37% 256 15.84
PD4 4% 0 0.41
PD5 21% 0 0.25
PD6 42% 0 0.43
PD7 58% 128 7.93
PD8 96% 512 1.36
CombinatorialHighly active
11
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
PDs TiON #reg Area
%
PD1 33% 1024 62.44
PD2 67% 512 0.46
PD3 37% 256 15.84
PD4 4% 0 0.41
PD5 21% 0 0.25
PD6 42% 0 0.43
PD7 58% 128 7.93
PD8 96% 512 1.36
CombinatorialHighly active Best candidate
11
METODOLOGY ASSESMENT
Design Under Test: ASIC 90nm technology
host
processor
FFT
coprocessor
INTERCONNECTION LAYER
APPLICATION # KERNEL #LRs
FFT 4 8
PDs TiON #reg Area
%
PD1 33% 1024 62.44
PD2 67% 512 0.46
PD3 37% 256 15.84
PD4 4% 0 0.41
PD5 21% 0 0.25
PD6 42% 0 0.43
PD7 58% 128 7.93
PD8 96% 512 1.36
Small LRsCombinatorialHighly active Best candidate
11
METODOLOGY ASSESMENT
Estimated Saving
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
Area_th: 10% PG_2
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
Area_th: 10% PG_2
PG PDs: PD1, PD3
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
Area_th: 10% PG_2
PG PDs: PD1, PD3
CG PDs: PD2, PD7, PD8
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
Area_th: 10% PG_2
PG PDs: PD1, PD3
CG PDs: PD2, PD7, PD8
Err. 0.33% Err. 1.41% Err. 1.08%
Err. 0.88% Err. 11.62% Err. 39.67%
12
METODOLOGY ASSESMENT
Estimated Saving
Area_th: 5% PG_1
PG PDs: PD1, PD3, PD7
CG PDs: PD2, PD8
Area_th: 10% PG_2
PG PDs: PD1, PD3
CG PDs: PD2, PD7, PD8
Err. 0.33% Err. 1.41% Err. 1.08%
Err. 0.88% Err. 11.62% Err. 39.67%
0.16% Estimated Vs
0.11% Real
12
METODOLOGY ASSESMENT
Designs Comparison
13
METODOLOGY ASSESMENT
Designs Comparison
13
METODOLOGY ASSESMENT
Designs Comparison
13
CONCLUSIONS
Advantages of the proposed approach
 Given a CGR design with n kernel and k LRs






14
CONCLUSIONS
Advantages of the proposed approach
 Given a CGR design with n kernel and k LRs
 The proposed approach requires uniquely the
analysis of the baseline CGR design:
 one synthesis
 n simulations (one foreach kernel)



14
CONCLUSIONS
Advantages of the proposed approach
 Given a CGR design with n kernel and k LRs
 The proposed approach requires uniquely the
analysis of the baseline CGR design:
 one synthesis
 n simulations (one foreach kernel)
 Otherwise:
 (2*k+1) (one PG and one CG for each LR + one of
Base the design)
 (2*k+1)*n simulation (n simulation for each design) 14
ACKNOWLEDGEMENTS
 The research leading to these results has
received funding from:
 the Region of Sardinia L.R.7/2007 under grant
agreement CRP-18324 [RPCT Project].
 the Region of Sardinia, Young Researchers Grant, POR
Sardegna FSE 2007-2013, L.R.7/2007 “Promotion of the
scientific research and technological innovation in
Sardinia”
15
Francesca Palumbo
University of Sassari
PolComIng
Information Engineering Unit
Tiziana Fanni, Carlo Sau, Paolo Meloni
and Luigi Raffo
University of Cagliari
Dept. of Electrical and Electronics Eng.
E
O
L
A
B
?
ACM International Conference on
Computing Frontiers 2016
tiziana.fanni@diee.unica.it
READ FULL ARTICLE
 ACM Digital Library
http://dl.acm.org/citation.cfm?id=2911713
 RPCT Project Website:
http://sites.unica.it/rpct/references/
REFERENCE
 @inproceedings{Fanni_CF2016,
author = {Fanni, Tiziana and Sau, Carlo and Meloni, Paolo
and Raffo, Luigi and Palumbo, Francesca},
title = {Power and Clock Gating Modelling in Coarse
Grained Reconfigurable Systems},
booktitle = {Proceedings of the ACM International
Conference on Computing Frontiers},
series = {CF '16},
year = {2016},
location = {Como, Italy},
doi = {10.1145/2903150.2911713},
publisher = {ACM},
address = {New York, NY, USA}, }
BibTex

More Related Content

What's hot

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift RegisterArea Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
IJMTST Journal
 
04 final - hobbs lave wvm solar portfolios - pvpmc
04 final - hobbs lave wvm solar portfolios - pvpmc04 final - hobbs lave wvm solar portfolios - pvpmc
04 final - hobbs lave wvm solar portfolios - pvpmc
Sandia National Laboratories: Energy & Climate: Renewables
 
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
Sandia National Laboratories: Energy & Climate: Renewables
 
power flow and optimal power flow
power flow and optimal power flowpower flow and optimal power flow
power flow and optimal power flow
Ahmed M. Elkholy
 
Optimal load scheduling
Optimal load schedulingOptimal load scheduling
Optimal load scheduling
Mayank Sharma
 
A presentation on economic load dispatch
A presentation on economic load dispatchA presentation on economic load dispatch
A presentation on economic load dispatch
souravsahoo28
 
Economic load dispatch(with and without losses)
Economic load dispatch(with and without losses)Economic load dispatch(with and without losses)
Economic load dispatch(with and without losses)
Asha Anu Kurian
 
Hydrothermal scheduling
Hydrothermal schedulingHydrothermal scheduling
Hydrothermal scheduling
ASHIRBAD BARIK
 
OPAL-RT RT13 Conference: New OP4500 hardware
OPAL-RT RT13 Conference: New OP4500 hardwareOPAL-RT RT13 Conference: New OP4500 hardware
OPAL-RT RT13 Conference: New OP4500 hardware
OPAL-RT TECHNOLOGIES
 
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
Sandia National Laboratories: Energy & Climate: Renewables
 
Voltage stability Analysis using GridCal
Voltage stability Analysis using GridCalVoltage stability Analysis using GridCal
Voltage stability Analysis using GridCal
Anmol Dwivedi
 
Economic load dispatch
Economic load dispatch Economic load dispatch
Economic load dispatch
MANGESHKULKARNI72
 
26 corbellini random forest for mismatch
26 corbellini random forest for mismatch26 corbellini random forest for mismatch
26 corbellini random forest for mismatch
Sandia National Laboratories: Energy & Climate: Renewables
 
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
CSCJournals
 
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UMLECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
Luca Berardinelli
 
19 characterizing pv modules using microinverter data final
19 characterizing pv modules using microinverter data   final19 characterizing pv modules using microinverter data   final
19 characterizing pv modules using microinverter data final
Sandia National Laboratories: Energy & Climate: Renewables
 
Next105 Ases Power Point Presentation Internal Final
Next105 Ases Power Point Presentation Internal FinalNext105 Ases Power Point Presentation Internal Final
Next105 Ases Power Point Presentation Internal Final
Nextronex Inc
 
Economic dispatch
Economic dispatch  Economic dispatch
Economic dispatch
Hussain Ali
 
Recent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor ModelRecent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor Model
Sandia National Laboratories: Energy & Climate: Renewables
 

What's hot (20)

Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift RegisterArea Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
Area Efficient Pulsed Clock Generator Using Pulsed Latch Shift Register
 
04 final - hobbs lave wvm solar portfolios - pvpmc
04 final - hobbs lave wvm solar portfolios - pvpmc04 final - hobbs lave wvm solar portfolios - pvpmc
04 final - hobbs lave wvm solar portfolios - pvpmc
 
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
2014 PV Distribution System Modeling Workshop: Capabilities of Advanced Inver...
 
power flow and optimal power flow
power flow and optimal power flowpower flow and optimal power flow
power flow and optimal power flow
 
Optimal load scheduling
Optimal load schedulingOptimal load scheduling
Optimal load scheduling
 
A presentation on economic load dispatch
A presentation on economic load dispatchA presentation on economic load dispatch
A presentation on economic load dispatch
 
5 1 voss_panasonic_sandia_epri_160509_3
5 1 voss_panasonic_sandia_epri_160509_35 1 voss_panasonic_sandia_epri_160509_3
5 1 voss_panasonic_sandia_epri_160509_3
 
Economic load dispatch(with and without losses)
Economic load dispatch(with and without losses)Economic load dispatch(with and without losses)
Economic load dispatch(with and without losses)
 
Hydrothermal scheduling
Hydrothermal schedulingHydrothermal scheduling
Hydrothermal scheduling
 
OPAL-RT RT13 Conference: New OP4500 hardware
OPAL-RT RT13 Conference: New OP4500 hardwareOPAL-RT RT13 Conference: New OP4500 hardware
OPAL-RT RT13 Conference: New OP4500 hardware
 
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
2014 PV Performance Modeling Workshop: Optimizing PV Designs with HelioScope:...
 
Voltage stability Analysis using GridCal
Voltage stability Analysis using GridCalVoltage stability Analysis using GridCal
Voltage stability Analysis using GridCal
 
Economic load dispatch
Economic load dispatch Economic load dispatch
Economic load dispatch
 
26 corbellini random forest for mismatch
26 corbellini random forest for mismatch26 corbellini random forest for mismatch
26 corbellini random forest for mismatch
 
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
Design of High Speed Low Power 15-4 Compressor Using Complementary Energy Pat...
 
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UMLECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
ECMFA 2015 - Energy Consumption Analysis and Design with Foundational UML
 
19 characterizing pv modules using microinverter data final
19 characterizing pv modules using microinverter data   final19 characterizing pv modules using microinverter data   final
19 characterizing pv modules using microinverter data final
 
Next105 Ases Power Point Presentation Internal Final
Next105 Ases Power Point Presentation Internal FinalNext105 Ases Power Point Presentation Internal Final
Next105 Ases Power Point Presentation Internal Final
 
Economic dispatch
Economic dispatch  Economic dispatch
Economic dispatch
 
Recent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor ModelRecent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor Model
 

Viewers also liked

Clock Gating
Clock GatingClock Gating
Clock Gating
Mahesh Dananjaya
 
A power gating scheme for improvred
A power gating scheme for improvredA power gating scheme for improvred
A power gating scheme for improvredIAEME Publication
 
Power Gating
Power GatingPower Gating
Power Gating
Mahesh Dananjaya
 
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...Amit Bakshi
 
Clock gating
Clock gatingClock gating
Clock gating
Mahi
 
Low Power Design Approach in VLSI
Low Power Design Approach in VLSILow Power Design Approach in VLSI
Low Power Design Approach in VLSI
Silicon Mentor
 
Low Power VLSI Design
Low Power VLSI DesignLow Power VLSI Design
Low Power VLSI Design
Mahesh Dananjaya
 
Low power vlsi design ppt
Low power vlsi design pptLow power vlsi design ppt
Low power vlsi design ppt
Anil Yadav
 
Low Power Techniques
Low Power TechniquesLow Power Techniques
Low Power Techniques
keshava murali
 

Viewers also liked (11)

Clock Gating
Clock GatingClock Gating
Clock Gating
 
A power gating scheme for improvred
A power gating scheme for improvredA power gating scheme for improvred
A power gating scheme for improvred
 
SNUG 2011 paper
SNUG 2011 paperSNUG 2011 paper
SNUG 2011 paper
 
Power Gating
Power GatingPower Gating
Power Gating
 
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...
Implementation of Power Gating Technique in CMOS Full Adder Cell to Reduce Le...
 
Clock gating
Clock gatingClock gating
Clock gating
 
Low Power Design Approach in VLSI
Low Power Design Approach in VLSILow Power Design Approach in VLSI
Low Power Design Approach in VLSI
 
Low Power VLSI Design
Low Power VLSI DesignLow Power VLSI Design
Low Power VLSI Design
 
12 low power techniques
12 low power techniques12 low power techniques
12 low power techniques
 
Low power vlsi design ppt
Low power vlsi design pptLow power vlsi design ppt
Low power vlsi design ppt
 
Low Power Techniques
Low Power TechniquesLow Power Techniques
Low Power Techniques
 

Similar to Power and Clock Gating Modelling in Coarse Grained Reconfigurable Systems

IRJET- FPGA Implementation of Efficient Muf Gate based Multipliers
IRJET-  	  FPGA Implementation of Efficient Muf Gate based MultipliersIRJET-  	  FPGA Implementation of Efficient Muf Gate based Multipliers
IRJET- FPGA Implementation of Efficient Muf Gate based Multipliers
IRJET Journal
 
IRJET- Design of Memristor based Multiplier
IRJET- Design of Memristor based MultiplierIRJET- Design of Memristor based Multiplier
IRJET- Design of Memristor based Multiplier
IRJET Journal
 
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
IJERA Editor
 
IRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
IRJET- Re-Configuration Topology for On-Chip Networks by Back-TrackingIRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
IRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
IRJET Journal
 
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCMIRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET Journal
 
Design a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
Design a Low Power Flip-Flop Based on a Signal Feed-Through SchemeDesign a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
Design a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
IRJET Journal
 
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
IRJET Journal
 
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
IRJET Journal
 
IRJET- Review on Performance of OTA Structure
IRJET- Review on Performance of OTA StructureIRJET- Review on Performance of OTA Structure
IRJET- Review on Performance of OTA Structure
IRJET Journal
 
FINAL VERSION sss.pptx
FINAL VERSION sss.pptxFINAL VERSION sss.pptx
FINAL VERSION sss.pptx
DeanAcademicsRamacha
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networks
TELKOMNIKA JOURNAL
 
Parallel Processing Technique for Time Efficient Matrix Multiplication
Parallel Processing Technique for Time Efficient Matrix MultiplicationParallel Processing Technique for Time Efficient Matrix Multiplication
Parallel Processing Technique for Time Efficient Matrix Multiplication
IJERA Editor
 
16nm bulk cmos docccii based configurable analog
16nm bulk cmos docccii based configurable analog16nm bulk cmos docccii based configurable analog
16nm bulk cmos docccii based configurable analog
eSAT Publishing House
 
High Performance Layout Design of SR Flip Flop using NAND Gates
High Performance Layout Design of SR Flip Flop using NAND Gates High Performance Layout Design of SR Flip Flop using NAND Gates
High Performance Layout Design of SR Flip Flop using NAND Gates
IJEEE
 
Low Power High-Performance Computing on the BeagleBoard Platform
Low Power High-Performance Computing on the BeagleBoard PlatformLow Power High-Performance Computing on the BeagleBoard Platform
Low Power High-Performance Computing on the BeagleBoard Platform
a3labdsp
 
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET Journal
 
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAYPOWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
ecij
 
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAYPOWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
ecij
 
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
IRJET Journal
 
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
ijiert bestjournal
 

Similar to Power and Clock Gating Modelling in Coarse Grained Reconfigurable Systems (20)

IRJET- FPGA Implementation of Efficient Muf Gate based Multipliers
IRJET-  	  FPGA Implementation of Efficient Muf Gate based MultipliersIRJET-  	  FPGA Implementation of Efficient Muf Gate based Multipliers
IRJET- FPGA Implementation of Efficient Muf Gate based Multipliers
 
IRJET- Design of Memristor based Multiplier
IRJET- Design of Memristor based MultiplierIRJET- Design of Memristor based Multiplier
IRJET- Design of Memristor based Multiplier
 
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
Efficient Design of Ripple Carry Adder and Carry Skip Adder with Low Quantum ...
 
IRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
IRJET- Re-Configuration Topology for On-Chip Networks by Back-TrackingIRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
IRJET- Re-Configuration Topology for On-Chip Networks by Back-Tracking
 
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCMIRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
 
Design a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
Design a Low Power Flip-Flop Based on a Signal Feed-Through SchemeDesign a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
Design a Low Power Flip-Flop Based on a Signal Feed-Through Scheme
 
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
Performance analysis of NOR CAM cell using CMOS-HP, CMOS-LP and FinFET 16nm t...
 
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
IRJET- High Speed Multi-Rate Approach based Adaptive Filter using Multiplier-...
 
IRJET- Review on Performance of OTA Structure
IRJET- Review on Performance of OTA StructureIRJET- Review on Performance of OTA Structure
IRJET- Review on Performance of OTA Structure
 
FINAL VERSION sss.pptx
FINAL VERSION sss.pptxFINAL VERSION sss.pptx
FINAL VERSION sss.pptx
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networks
 
Parallel Processing Technique for Time Efficient Matrix Multiplication
Parallel Processing Technique for Time Efficient Matrix MultiplicationParallel Processing Technique for Time Efficient Matrix Multiplication
Parallel Processing Technique for Time Efficient Matrix Multiplication
 
16nm bulk cmos docccii based configurable analog
16nm bulk cmos docccii based configurable analog16nm bulk cmos docccii based configurable analog
16nm bulk cmos docccii based configurable analog
 
High Performance Layout Design of SR Flip Flop using NAND Gates
High Performance Layout Design of SR Flip Flop using NAND Gates High Performance Layout Design of SR Flip Flop using NAND Gates
High Performance Layout Design of SR Flip Flop using NAND Gates
 
Low Power High-Performance Computing on the BeagleBoard Platform
Low Power High-Performance Computing on the BeagleBoard PlatformLow Power High-Performance Computing on the BeagleBoard Platform
Low Power High-Performance Computing on the BeagleBoard Platform
 
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...IRJET -  	  High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
IRJET - High Speed Inexact Speculative Adder using Carry Look Ahead Adder...
 
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAYPOWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
 
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAYPOWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
POWER GATING STRUCTURE FOR REVERSIBLE PROGRAMMABLE LOGIC ARRAY
 
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
A Comparative Study of PID and Fuzzy Controller for Speed Control of Brushles...
 
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
AN EFFICIENT COUPLED GENETIC ALGORITHM AND LOAD FLOW ALGORITHM FOR OPTIMAL PL...
 

More from MDC_UNICA

RVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer ToolRVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer Tool
MDC_UNICA
 
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform ComposerThe Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
MDC_UNICA
 
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
MDC_UNICA
 
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable SystemsDSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
MDC_UNICA
 
Automatic Generation of Dataflow-Based Reconfigurable Co-processing Units
Automatic Generation of Dataflow-Based Reconfigurable Co-processing UnitsAutomatic Generation of Dataflow-Based Reconfigurable Co-processing Units
Automatic Generation of Dataflow-Based Reconfigurable Co-processing Units
MDC_UNICA
 
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
MDC_UNICA
 
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
MDC_UNICA
 
Automated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
Automated Power Gating Methodology for Dataflow-Based Reconfigurable SystemsAutomated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
Automated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
MDC_UNICA
 
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC DomainReconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
MDC_UNICA
 
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable SystemsPower Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
MDC_UNICA
 
Adaptable AES Implementation with Power-Gating Support
Adaptable AES Implementation with Power-Gating SupportAdaptable AES Implementation with Power-Gating Support
Adaptable AES Implementation with Power-Gating Support
MDC_UNICA
 
Reconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool ProjectReconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool Project
MDC_UNICA
 

More from MDC_UNICA (12)

RVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer ToolRVC: A Multi-Decoder CAL Composer Tool
RVC: A Multi-Decoder CAL Composer Tool
 
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform ComposerThe Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
The Multi-Dataflow Composer Tool: a Runtime Reconfigurable HDL Platform Composer
 
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
 
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable SystemsDSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
DSE and Profiling of Multi-Context Coarse-Grained Reconfigurable Systems
 
Automatic Generation of Dataflow-Based Reconfigurable Co-processing Units
Automatic Generation of Dataflow-Based Reconfigurable Co-processing UnitsAutomatic Generation of Dataflow-Based Reconfigurable Co-processing Units
Automatic Generation of Dataflow-Based Reconfigurable Co-processing Units
 
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
 
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
Power-Awarness in Coarse-Grained Reconfigurable Designs: a Dataflow Based Str...
 
Automated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
Automated Power Gating Methodology for Dataflow-Based Reconfigurable SystemsAutomated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
Automated Power Gating Methodology for Dataflow-Based Reconfigurable Systems
 
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC DomainReconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
Reconfigurable Coprocessors Synthesis in the MPEG-RVC Domain
 
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable SystemsPower Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
Power Modelling for Saving Strategies in Coarse Grained Reconfigurable Systems
 
Adaptable AES Implementation with Power-Gating Support
Adaptable AES Implementation with Power-Gating SupportAdaptable AES Implementation with Power-Gating Support
Adaptable AES Implementation with Power-Gating Support
 
Reconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool ProjectReconfigurable Platform Composer Tool Project
Reconfigurable Platform Composer Tool Project
 

Recently uploaded

web-tech-lab-manual-final-abhas.pdf. Jer
web-tech-lab-manual-final-abhas.pdf. Jerweb-tech-lab-manual-final-abhas.pdf. Jer
web-tech-lab-manual-final-abhas.pdf. Jer
freshgammer09
 
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
andreassenrolf537
 
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
Peter Gallagher
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
aozcue
 
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
aozcue
 
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
PinkySharma900491
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
peuce
 

Recently uploaded (7)

web-tech-lab-manual-final-abhas.pdf. Jer
web-tech-lab-manual-final-abhas.pdf. Jerweb-tech-lab-manual-final-abhas.pdf. Jer
web-tech-lab-manual-final-abhas.pdf. Jer
 
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
欧洲杯冠军-欧洲杯冠军网站-欧洲杯冠军|【​网址​🎉ac123.net🎉​】领先全球的买球投注平台
 
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
Building a Raspberry Pi Robot with Dot NET 8, Blazor and SignalR - Slides Onl...
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证如何办理
 
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
一比一原版(UCSB毕业证)圣塔芭芭拉社区大学毕业证如何办理
 
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
MATHEMATICS BRIDGE COURSE (TEN DAYS PLANNER) (FOR CLASS XI STUDENTS GOING TO ...
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证如何办理
 

Power and Clock Gating Modelling in Coarse Grained Reconfigurable Systems

  • 1. POWER AND CLOCK GATING MODELLING IN COARSE GRAINED RECONFIGURABLE SYSTEMS Francesca Palumbo University of Sassari PolComIng Information Engineering Unit Tiziana Fanni, Carlo Sau, Paolo Meloni and Luigi Raffo University of Cagliari Dept. of Electrical and Electronics Eng. tiziana.fanni@diee.unica.it ACM International Conference on Computing Frontiers 2016 E O L A B
  • 2. OUTLINE  Problem Statement  Background  Dataflow Model of Computation  Coarse Grain Reconfiguration  Multi-Dataflow Composer Tool  Proposed approach  Parameters Analysis  Power Estimation Model  Logic Regions Analysis  Metodology assesment  Conclusions 2
  • 4. PROBLEM STATEMENT Consumers need:  Integrated complex and fancy resurce-intensive applications      3
  • 5. PROBLEM STATEMENT Consumers need:  Integrated complex and fancy resurce-intensive applications  Long battery life     3
  • 6. PROBLEM STATEMENT Consumers need:  Integrated complex and fancy resurce-intensive applications  Long battery life     Possible solutions: 3
  • 7. PROBLEM STATEMENT Consumers need:  Integrated complex and fancy resurce-intensive applications  Long battery life  Dataflow Model of Computation  Modularity and parallelism   INTEGRATION AND RE-USABILITY   Possible solutions: 3
  • 8. PROBLEM STATEMENT Consumers need:  Integrated complex and fancy resurce-intensive applications  Long battery life  Dataflow Model of Computation  Modularity and parallelism   INTEGRATION AND RE-USABILITY  Coarse-graine reconfiguration  Flexibility and resource sharing   MULTI-APPLICATION PORTABLE DEVICES Possible solutions: 3
  • 9. BACKGROUND Dataflow Model of Computation  DATAFLOW FORMALISM  Directed graph of actors (functional units).  Actors exchange tokens (data packets) through dedicated channels    4
  • 10. BACKGROUND Dataflow Model of Computation  DATAFLOW FORMALISM  Directed graph of actors (functional units).  Actors exchange tokens (data packets) through dedicated channels    actions state 4
  • 11. BACKGROUND Dataflow Model of Computation  DATAFLOW FORMALISM  Directed graph of actors (functional units).  Actors exchange tokens (data packets) through dedicated channels  CHARACTERISTICS  Explicit the intrinsic application parallelism.  Modularity favours model re-usability/adaptivity. actions state 4
  • 12. BACKGROUND Coarse-Graine Reconfiguration A in1 D in2 F in3 E G B Sbox_1 Sbox_2 out1 C Sbox_0 ID net=1 LUTCoarse Grained Reconfigurable Platform A B C D E C A F CG Dataflow Descriptions N:1 α β γ α β γ 0 0 0 5
  • 13. BACKGROUND Coarse-Graine Reconfiguration A in1 D in2 F in3 E G B Sbox_1 Sbox_2 out1 C Sbox_0 ID net=3 LUTCoarse Grained Reconfigurable Platform A B C D E C A F CG Dataflow Descriptions N:1 α β γ α β γ 1 1 0    6
  • 14. BACKGROUND Coarse-Graine Reconfiguration A in1 D in2 F in3 E G B Sbox_1 Sbox_2 out1 C Sbox_0 ID net=3 LUTCoarse Grained Reconfigurable Platform A B C D E C A F CG Dataflow Descriptions N:1 α β γ α β γ 1 1 0  Large Power consumption  Power Saving Methodologies  Dynamic  Clock-gating  Static  Power-gating 6
  • 15. BACKGROUND http://sites.unica.it/rpct/Multi-Dataflow Composer Tool A in1 D in2 F in3 E G B Sbox_1 Sbox_2 out1 C LR1 LR2: Always ON LR LR3 LR4 LR5 LR4 RTN RTN RTN RTN RTN RTN ISOL ISOL ISOLISOL Sbox_0 ISOL VDD SW VDD SW VDD SW ISOL VDD SW ID net Power Controller CG_en1 CG_en4 CG_en5 CG_en3 clock 7
  • 17. PROPOSED APPROACH Parameters analysis:  Architectural parameters          8
  • 18. PROPOSED APPROACH Parameters analysis:  Architectural parameters  Actors  RAMs  Combinatorial Logic  Sequential Logic…      8
  • 19. PROPOSED APPROACH Parameters analysis:  Architectural parameters  Actors  RAMs  Combinatorial Logic  Sequential Logic…  Functional parameters     8
  • 20. PROPOSED APPROACH Parameters analysis:  Architectural parameters  Actors  RAMs  Combinatorial Logic  Sequential Logic…  Functional parameters  Input Data  LR’s Activation Time   8
  • 21. PROPOSED APPROACH Parameters analysis:  Architectural parameters  Actors  RAMs  Combinatorial Logic  Sequential Logic…  Functional parameters  Input Data  LR’s Activation Time  Technological parameters  8
  • 22. PROPOSED APPROACH Parameters analysis:  Architectural parameters  Actors  RAMs  Combinatorial Logic  Sequential Logic…  Functional parameters  Input Data  LR’s Activation Time  Technological parameters  As transistors get smaller the contribution of the static contribute gets larger and not negligible. 8
  • 23. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     9
  • 24. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     9
  • 25. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     9
  • 26. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Combinational Logic Standard Cost 9
  • 27. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Combinational Logic Sequential Logic Standard Cost 9
  • 28. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Retention CellsCombinational Logic Sequential Logic Standard Cost Overhead 9
  • 29. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Retention Cells Isolation Cells Combinational Logic Sequential Logic Standard Cost Overhead 9
  • 30. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Retention Cells Isolation Cells Power Controller Combinational Logic Sequential Logic Standard Cost Overhead 9
  • 31. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Retention Cells Isolation Cells Power Controller Clock Gating Cells Combinational Logic Sequential Logic Standard Cost Overhead 9
  • 32. PROPOSED APPROACH  Power gating.  Clock gating. Power Estimation Model ]*)(*)([ ]*)(*)([ ]*)*#(*)*#([ *]/#)#(#*)()*#()([ )(_)()( iOFFOFFiONON iOFFOFFiONON iOFFOFFiONON iON LRactors iiONi TCGPTCGP TContrPTContrP TisoISOPTisoISOP TregrtnregregPrtnRCPcmbP LROverExtLRPLRP i       ]*)(*)([ ]*)()([ )(_)()( iOFFOFFiONON iON LRactors iiONi TCGPTCGP TregPcmbP LROverExtLRPLRP i     Retention Cells Isolation Cells Power Controller Clock Gating Cells Combinational Logic Sequential Logic Standard Cost Overhead 9 Term not present in CG static model!
  • 33. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? 10
  • 34. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- 10
  • 35. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% 10
  • 36. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% 10
  • 37. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG 10
  • 38. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard 10
  • 39. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard 10
  • 40. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG 10
  • 41. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG Power PG: +2%  Evaluate CG 10
  • 42. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG Power PG: +2%  Evaluate CG Power CG: -5%  Apply CG 10
  • 43. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG Power PG: +2%  Evaluate CG Power CG: -5%  Apply CG 10
  • 44. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG Power PG: +2%  Evaluate CG Power CG: -5%  Apply CG Area > Area_th Evaluate PG 10
  • 45. PROPOSED APPROACH LRs analysis NO YES NO YES NO YES NO YES AREA THRESHOLDING ESTIMATE PG ESTIMATE CG Select PDi to be Implement with PG Select PDi to be Implement with CG START i=0 Increment i STOP 1 2 3 ∀PDi i<N+1 Is Above? Is there Saving%? Is there Saving%? LRs LR1 LR2 LR3 Area 2% 5.8% 23% PG --- +2% -18% CG +1% -5% --- Area_th: 5% Area < Area_th Evaluate CG Power CG: +1%  Discard Area > Area_th Evaluate PG Power PG: +2%  Evaluate CG Power CG: -5%  Apply CG Area > Area_th Evaluate PG Power CG: -18%  Apply PG 10
  • 46. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 11
  • 47. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 PDs TiON #reg Area % PD1 33% 1024 62.44 PD2 67% 512 0.46 PD3 37% 256 15.84 PD4 4% 0 0.41 PD5 21% 0 0.25 PD6 42% 0 0.43 PD7 58% 128 7.93 PD8 96% 512 1.36 11
  • 48. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 PDs TiON #reg Area % PD1 33% 1024 62.44 PD2 67% 512 0.46 PD3 37% 256 15.84 PD4 4% 0 0.41 PD5 21% 0 0.25 PD6 42% 0 0.43 PD7 58% 128 7.93 PD8 96% 512 1.36 Highly active 11
  • 49. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 PDs TiON #reg Area % PD1 33% 1024 62.44 PD2 67% 512 0.46 PD3 37% 256 15.84 PD4 4% 0 0.41 PD5 21% 0 0.25 PD6 42% 0 0.43 PD7 58% 128 7.93 PD8 96% 512 1.36 CombinatorialHighly active 11
  • 50. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 PDs TiON #reg Area % PD1 33% 1024 62.44 PD2 67% 512 0.46 PD3 37% 256 15.84 PD4 4% 0 0.41 PD5 21% 0 0.25 PD6 42% 0 0.43 PD7 58% 128 7.93 PD8 96% 512 1.36 CombinatorialHighly active Best candidate 11
  • 51. METODOLOGY ASSESMENT Design Under Test: ASIC 90nm technology host processor FFT coprocessor INTERCONNECTION LAYER APPLICATION # KERNEL #LRs FFT 4 8 PDs TiON #reg Area % PD1 33% 1024 62.44 PD2 67% 512 0.46 PD3 37% 256 15.84 PD4 4% 0 0.41 PD5 21% 0 0.25 PD6 42% 0 0.43 PD7 58% 128 7.93 PD8 96% 512 1.36 Small LRsCombinatorialHighly active Best candidate 11
  • 54. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 12
  • 55. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 12
  • 56. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 Area_th: 10% PG_2 12
  • 57. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 Area_th: 10% PG_2 PG PDs: PD1, PD3 12
  • 58. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 Area_th: 10% PG_2 PG PDs: PD1, PD3 CG PDs: PD2, PD7, PD8 12
  • 59. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 Area_th: 10% PG_2 PG PDs: PD1, PD3 CG PDs: PD2, PD7, PD8 Err. 0.33% Err. 1.41% Err. 1.08% Err. 0.88% Err. 11.62% Err. 39.67% 12
  • 60. METODOLOGY ASSESMENT Estimated Saving Area_th: 5% PG_1 PG PDs: PD1, PD3, PD7 CG PDs: PD2, PD8 Area_th: 10% PG_2 PG PDs: PD1, PD3 CG PDs: PD2, PD7, PD8 Err. 0.33% Err. 1.41% Err. 1.08% Err. 0.88% Err. 11.62% Err. 39.67% 0.16% Estimated Vs 0.11% Real 12
  • 64. CONCLUSIONS Advantages of the proposed approach  Given a CGR design with n kernel and k LRs       14
  • 65. CONCLUSIONS Advantages of the proposed approach  Given a CGR design with n kernel and k LRs  The proposed approach requires uniquely the analysis of the baseline CGR design:  one synthesis  n simulations (one foreach kernel)    14
  • 66. CONCLUSIONS Advantages of the proposed approach  Given a CGR design with n kernel and k LRs  The proposed approach requires uniquely the analysis of the baseline CGR design:  one synthesis  n simulations (one foreach kernel)  Otherwise:  (2*k+1) (one PG and one CG for each LR + one of Base the design)  (2*k+1)*n simulation (n simulation for each design) 14
  • 67. ACKNOWLEDGEMENTS  The research leading to these results has received funding from:  the Region of Sardinia L.R.7/2007 under grant agreement CRP-18324 [RPCT Project].  the Region of Sardinia, Young Researchers Grant, POR Sardegna FSE 2007-2013, L.R.7/2007 “Promotion of the scientific research and technological innovation in Sardinia” 15
  • 68. Francesca Palumbo University of Sassari PolComIng Information Engineering Unit Tiziana Fanni, Carlo Sau, Paolo Meloni and Luigi Raffo University of Cagliari Dept. of Electrical and Electronics Eng. E O L A B ? ACM International Conference on Computing Frontiers 2016 tiziana.fanni@diee.unica.it
  • 69. READ FULL ARTICLE  ACM Digital Library http://dl.acm.org/citation.cfm?id=2911713  RPCT Project Website: http://sites.unica.it/rpct/references/
  • 70. REFERENCE  @inproceedings{Fanni_CF2016, author = {Fanni, Tiziana and Sau, Carlo and Meloni, Paolo and Raffo, Luigi and Palumbo, Francesca}, title = {Power and Clock Gating Modelling in Coarse Grained Reconfigurable Systems}, booktitle = {Proceedings of the ACM International Conference on Computing Frontiers}, series = {CF '16}, year = {2016}, location = {Como, Italy}, doi = {10.1145/2903150.2911713}, publisher = {ACM}, address = {New York, NY, USA}, } BibTex