SlideShare a Scribd company logo
Vinitha A Palaniveloo Jude Angelo Ambrose
Arcot Sowmya
School of Computer Science and Engineering
The University of New South Wales
Sydney, Australia
Improving GA-based NoC mapping
algorithms using a formal model
7/10/2014 The University of New South Wales 1
Outline
• Introduction
• Background and Related work
• Case study: Application of formal model to
evaluate NMAP algorithm
• NoC mapping using formal NoC model and
Genetic Algorithm (GA)
• Results
• Conclusion
7/10/2014 The University of New South Wales 2
7/10/2014 The University of New South Wales 3
Network on Chip
Application
Tasks
Application Mapping
SWITCH
ROUTING
I/P O/P
NETWORK INTERFACE
PROCESSING
ELEMENT MEMORY
Router
PE – Processing Element
MEM - Memory
Network on Chip (NoC)
NoC Mapping
7/10/2014 The University of New South Wales 4
Mapping algorithms aim at mapping
applications onto the NoC platform, while
optimizing certain metrics of interest such as
energy, performance,…
𝑛𝑃𝑟 =
𝑛!
𝑛 − 𝑟 !
Size Number of
Permutations
6 720
9 362880
16 2.09227x1013
25 1.55112x1025
Application
NoC
Infrastructure
Communication
scheme
Open Problems in Application
mapping
7/10/2014 The University of New South Wales 5
• Simulation too expensive to use in optimization
loop of mapping algorithms, so analytical
models used for performance evaluation
• Analytical models usually abstract architecture
specific details, not robust
• Instead, propose use of formal NoC model
mapping- contains details of Architecture,
Communication scheme and Application
Types of Mapping Algorithms
7/10/2014 The University of New South Wales 6
Mapping
algorithms
Static
Exact
mapping
Greedy
placement
Iterative
placement
Search based
mapping
Mapping
heuristics
Genetic
algorithms
Dynamic
Genetic Algorithms
7/10/2014 The University of New South Wales 7
• Genetic algorithms (GA) uses global
search heuristics
• Particular class of evolutionary
algorithms that use techniques
inspired by evolutionary biology-
inheritance, mutation, selection and
crossover (also called recombination)
Genetic Algorithms
7/10/2014 The University of New South Wales 8
Initialize Population
Check constraints ?
Evaluate Fitness
Select Survivors
Output Results
Vary Individuals
Yes
No
Reproduction
Crossover
Mutation
Crossover Operator
7/10/2014 The University of New South Wales 9
• Crossover
– Generating offspring from two selected parents
• Single point crossover
• Two point crossover (Multi point crossover)
• Uniform crossover
• Example:
– Parent 1: X X | X X X X X
– Parent 2: Y Y | Y Y Y Y Y
– Offspring 1: X X Y Y Y Y Y
– Offspring 2: Y Y X X X X X
Crossover is explorative, it makes a big jump to an area somewhere “in between” two
(parents)
Mutation Operator
7/10/2014 The University of New South Wales 10
Mutation
• Generating new offspring from single parent
by setting probability of mutation.
• Maintaining the diversity of the individuals
• Can “generate” new genes
• Example:
– Parent 1: X Y Y Y Y X Y
– Parent 2: X X X Y Y Y X
– Offspring 1: X Y Y X Y Y X
– Offspring 2: Y X X Y X X Y
Mutation is exploitative, it creates random small diversions, thereby staying near the parent
Fitness function
7/10/2014 The University of New South Wales 11
Fitness function
f(x)
Individuals Ranked
T1 T2 T3 T4 T5 T6
T6 T2 T3 T1 T5 T4
T1 T3 T6 T4 T5 T2
T5 T2 T3 T4 T1 T6
T4 T2 T5 T3 T1 T6
T6 T2 T3 T4 T5 T1
T1 T2 T3 T4 T5 T6
T6 T2 T3 T1 T5 T4
T1 T3 T6 T4 T5 T2
T5 T2 T3 T4 T1 T6
T4 T2 T5 T3 T1 T6
T6 T2 T3 T4 T5 T1
M1
M2
M3
M4
M5
M6
M5
M1
M2
M6
M3
M4
Fitness Function
• Analytical model
• Formal model
Analytical model for Latency Estimation
7/10/2014 The University of New South Wales 12
Latency : Time for packet to traverse network
• Latency = Hop Count+ Contention latency
– Contention latency: impacted by traffic loads and
buffers in the router, computed using probabilistic
models or queuing theory
– Hop Count: number of links traversed between
source and destination, depends on routing
algorithm, which can be computed analytically for
deterministic routing algorithms
Formal NoC model
7/10/2014 The University of New South Wales 13
P8
receive_R8?
inject_pkt_R8 ()
switch_R8 ()
arbitrate_R8 ()
sink_pkt_R8 ()
transmit_R8!
P0
receive_R0?
inject_pkt_R0 ()
switch_R0 ()
arbitrate_R0 ()
sink_pkt_R0 ()
transmit_R0!
receive_R8?
inject_pkt_R8 ()
switch_R8 ()
arbitrate_R8 ()
sink_pkt_R8 ()
transmit_R8!
P0
receive_R0?
inject_pkt_R0 ()
switch_R0 ()
arbitrate_R0 ()
sink_pkt_R0 ()
transmit_R0!
P1 P8
Pt0 = P0 || P1 || P2 || P3 || P4 || P5 || P6 || P7 || P8
Pt0 = P0 . P1 . P2 . P3 . P4 . P5 . P6 . P7 . P8
Estimating worst-case latency of NoC formally
(by model checking)
- posing latency as reachability problem
GA Algorithms Termination Criteria
7/10/2014 The University of New South Wales 14
The generational process repeated until
termination condition reached
Common terminating conditions are:
• A solution is found that satisfies minimum criteria
• Fixed number of generations reached
• Manual inspection
• Any Combinations of the above
Case study: NMAP algorithm & DSP
Filter Application
ARM Filter
FFT IFFT
Display
Memory
200
600
200200 200
Source Destination Rate
Memory ARM 200
Filter ARM 600
Filter FFT 200
Filter IFFT 200
ARM Filter 600
ARM Display 200
FFT Filter 200
IFFT Filter 200
[1] Mapping and Physical Planning of Networks-on-Chip Architectures with Quality-of-Service Guarantees
7/10/2014 The University of New South Wales 15
Peak link bandwidth as 1000MB/S
200MB/s – 20% packet injection rate
- translates to one packet every 5 clock cycles
600MB/s – 60% packet injection rate
- translates to one packet every 2 clock cycles
Solution by NMAP Algorithm [1]
0 1 2
3 4 5
Memory
FFT
Filter
ARM
IFFT
Display
7/10/2014 The University of New South Wales 16
Estimated worst-case latency using the formal model : 3.375 cycles
Exploration of mapping space by brute force
Modules
MAP
93
MAP
95
MAP
271
MAP
276
MAP
391
MAP
396
MAP
699
MAP
701
Memory R0 R0 R3 R5 R3 R5 R2 R2
Filter R5 R4 R1 R1 R1 R1 R4 R4
IFFT R3 R5 R0 R0 R2 R2 R3 R5
FFT R4 R3 R2 R2 R0 R0 R5 R3
ARM R1 R1 R4 R4 R4 R4 R1 R1
Display R2 R2 R5 R3 R5 R3 R0 R0
Mapping proposed in [1] is MAPPING 391 in out model, highlighted
above, it has minimum average worst case latency
7/10/2014 The University of New South Wales 17
Mapping with worst-case latency of 3.375 cycles
Total number of solutions = 8
• It is not possible to perform exhaustive search for
large NoCs
– 3x2 took 17min 42 sec
– 3x3 took 6.5 days (148.62 hours)
• Results of different mapping algorithms developed
for the same constraint may be different
• Mapping algorithms may find only a near optimal
solution
• Mapping algorithms may find more than one
mapping solution
Discussion
7/10/2014 The University of New South Wales 18
Evaluate possibility of using formal model to
enable mapping algorithms to find better optimal
solution
Proposal
7/10/2014 The University of New South Wales 19
Popular static
mapping
approaches
Mapping
heuristics
Genetic
Algorithms based
mapping
Using analytical
model
Using formal
model
Sample Applications
7/10/2014 The University of New South Wales 20
Application NoC Size Total traces
APP1 3x2 8
APP2 3x3 13
APP3 4x4 25
APP4 5x5 38
APP5 10x10 150
APP6 20x20 600
APP7 30x30 1349
Population size and Generation size
7/10/2014 The University of New South Wales 21
Algorithm termination condition:
• Population size = 100
• Generation size = 10
WHY?
• APP1 & APP2 => always finds at least one optimal solution
• APP3 & APP4 => output solution is not significantly improved
by increasing population and generation size
Offspring generated using two types of genetic operators:
• random mutation operator R
• single point crossover operator S
Simple mapping heuristics
7/10/2014 The University of New South Wales 22
Result
Application WCL
(Heuristic)
WCL
(R_FM)
WCL
(S_FM)
WCL
(R_AM)
WCL
(S_AM)
APP1 3.375 3.375 3.375 3.375 3.375
APP2 4.0 3.84 3.84 4.384 4.3
APP3 7.91 5.32 5.44 5.56 5.56
APP4 27.1 6.6 7.4 7.89 8.31
APP5 35.54 20.2 20.7 25.14 29.55
APP6 210.72 86.45 80.4 121.515 112.47
APP7 295.65 203.8 222.1 395.47 271.38
7/10/2014 The University of New South Wales 23
Discussion of Mapping Results
• GA based mapping algorithm generally finds better
optimal solutions than the mapping heuristic,
especially as the size of the application increases
• all the GA-based algorithms perform equally well for
smaller applications [APP1...APP4]
• GA + formal is better than GA + analytical for larger
applications [APP5...APP7]
• With random mutation R, GA + formal has WCL 20%
lower than GA + analytical
• With single point crossover S, GA + formal has WCL
13% lower than GA + analytical
7/10/2014 The University of New South Wales 24
Result
7/10/2014 The University of New South Wales 25
Execution Time
• GA + formal takes longer to compute,
compared with GA + analytical
• Random mutation R introduces more
diversity in the population, and the solution
converges to an optimal solution more quickly.
But single point crossover S is faster to
compute
7/10/2014 The University of New South Wales 26
Conclusion
7/10/2014 The University of New South Wales 27
• Formal models can be applied to solve NoC
design problem in addition to functional
verification
• Formal NoC models may be enhanced to
estimate other QoS parameters such as
energy, power and throughput, resulting in a
good generic performance model
Thank You!
7/10/2014 The University of New South Wales 28

More Related Content

Similar to Improving Genetic Algorithm (GA) based NoC mapping algorithm using a formal model

Application of formal methods for system level verification of final
Application of formal methods for system level verification of finalApplication of formal methods for system level verification of final
Application of formal methods for system level verification of final
Vinita Palaniveloo
 
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
Manon Bouly
 
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
thanhdowork
 
Application of OpenSees in Reliability-based Design Optimization of Structures
Application of OpenSees in Reliability-based Design Optimization of StructuresApplication of OpenSees in Reliability-based Design Optimization of Structures
Application of OpenSees in Reliability-based Design Optimization of Structures
openseesdays
 
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping PartyMaking Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
Kurata Takeshi
 
A reinforcement learning based routing protocol with qo s support for biomedi...
A reinforcement learning based routing protocol with qo s support for biomedi...A reinforcement learning based routing protocol with qo s support for biomedi...
A reinforcement learning based routing protocol with qo s support for biomedi...
Iffat Anjum
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Seunghyun Hwang
 
Real time traffic management - challenges and solutions
Real time traffic management - challenges and solutionsReal time traffic management - challenges and solutions
Real time traffic management - challenges and solutions
Institute for Transport Studies (ITS)
 
Unsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimationUnsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimation
Jaey Jeong
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithms
Farhan Zaki
 
Arquam_reportfinal
Arquam_reportfinalArquam_reportfinal
Arquam_reportfinal
Arquam Md
 
OPAL-RT RT14 Conference: State-Space Nodal
OPAL-RT RT14 Conference: State-Space NodalOPAL-RT RT14 Conference: State-Space Nodal
OPAL-RT RT14 Conference: State-Space Nodal
OPAL-RT TECHNOLOGIES
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 dist
ddm314
 
Assessment Model for Opportunistic Routing
Assessment Model for Opportunistic RoutingAssessment Model for Opportunistic Routing
Assessment Model for Opportunistic Routing
Waldir Moreira
 
Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...
Jaey Jeong
 
MSc Thesis Presentation
MSc Thesis PresentationMSc Thesis Presentation
MSc Thesis Presentation
Reem Sherif
 
ThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersollThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersoll
Kyle Ingersoll
 
Local modeling in regression and time series prediction
Local modeling in regression and time series predictionLocal modeling in regression and time series prediction
Local modeling in regression and time series prediction
Gianluca Bontempi
 
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
gerogepatton
 
Sustainability: Actors, Behavior, and Transparency
Sustainability: Actors, Behavior, and TransparencySustainability: Actors, Behavior, and Transparency
Sustainability: Actors, Behavior, and Transparency
Reza Farrahi Moghaddam, PhD, BEng
 

Similar to Improving Genetic Algorithm (GA) based NoC mapping algorithm using a formal model (20)

Application of formal methods for system level verification of final
Application of formal methods for system level verification of finalApplication of formal methods for system level verification of final
Application of formal methods for system level verification of final
 
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
Verolog 2019 : Multiple solving approaches applied to the Heterogeneous Vehic...
 
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
 
Application of OpenSees in Reliability-based Design Optimization of Structures
Application of OpenSees in Reliability-based Design Optimization of StructuresApplication of OpenSees in Reliability-based Design Optimization of Structures
Application of OpenSees in Reliability-based Design Optimization of Structures
 
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping PartyMaking Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
Making Pier Data Broader and Deeper: PDR Challenge and Virtual Mapping Party
 
A reinforcement learning based routing protocol with qo s support for biomedi...
A reinforcement learning based routing protocol with qo s support for biomedi...A reinforcement learning based routing protocol with qo s support for biomedi...
A reinforcement learning based routing protocol with qo s support for biomedi...
 
End-to-End Object Detection with Transformers
End-to-End Object Detection with TransformersEnd-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
 
Real time traffic management - challenges and solutions
Real time traffic management - challenges and solutionsReal time traffic management - challenges and solutions
Real time traffic management - challenges and solutions
 
Unsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimationUnsupervised representation learning for gaze estimation
Unsupervised representation learning for gaze estimation
 
accessible-streaming-algorithms
accessible-streaming-algorithmsaccessible-streaming-algorithms
accessible-streaming-algorithms
 
Arquam_reportfinal
Arquam_reportfinalArquam_reportfinal
Arquam_reportfinal
 
OPAL-RT RT14 Conference: State-Space Nodal
OPAL-RT RT14 Conference: State-Space NodalOPAL-RT RT14 Conference: State-Space Nodal
OPAL-RT RT14 Conference: State-Space Nodal
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 dist
 
Assessment Model for Opportunistic Routing
Assessment Model for Opportunistic RoutingAssessment Model for Opportunistic Routing
Assessment Model for Opportunistic Routing
 
Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...Deep learning based gaze detection system for automobile drivers using nir ca...
Deep learning based gaze detection system for automobile drivers using nir ca...
 
MSc Thesis Presentation
MSc Thesis PresentationMSc Thesis Presentation
MSc Thesis Presentation
 
ThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersollThesisDefensePresentation_KyleIngersoll
ThesisDefensePresentation_KyleIngersoll
 
Local modeling in regression and time series prediction
Local modeling in regression and time series predictionLocal modeling in regression and time series prediction
Local modeling in regression and time series prediction
 
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Dee...
 
Sustainability: Actors, Behavior, and Transparency
Sustainability: Actors, Behavior, and TransparencySustainability: Actors, Behavior, and Transparency
Sustainability: Actors, Behavior, and Transparency
 

Recently uploaded

一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
gerogepatton
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
Rahul
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
JamalHussainArman
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
MiscAnnoy1
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
mamamaam477
 
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have oneISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
Las Vegas Warehouse
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
Yasser Mahgoub
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 

Recently uploaded (20)

一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Engine Lubrication performance System.pdf
Engine Lubrication performance System.pdfEngine Lubrication performance System.pdf
Engine Lubrication performance System.pdf
 
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have oneISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 

Improving Genetic Algorithm (GA) based NoC mapping algorithm using a formal model

  • 1. Vinitha A Palaniveloo Jude Angelo Ambrose Arcot Sowmya School of Computer Science and Engineering The University of New South Wales Sydney, Australia Improving GA-based NoC mapping algorithms using a formal model 7/10/2014 The University of New South Wales 1
  • 2. Outline • Introduction • Background and Related work • Case study: Application of formal model to evaluate NMAP algorithm • NoC mapping using formal NoC model and Genetic Algorithm (GA) • Results • Conclusion 7/10/2014 The University of New South Wales 2
  • 3. 7/10/2014 The University of New South Wales 3 Network on Chip Application Tasks Application Mapping SWITCH ROUTING I/P O/P NETWORK INTERFACE PROCESSING ELEMENT MEMORY Router PE – Processing Element MEM - Memory Network on Chip (NoC)
  • 4. NoC Mapping 7/10/2014 The University of New South Wales 4 Mapping algorithms aim at mapping applications onto the NoC platform, while optimizing certain metrics of interest such as energy, performance,… 𝑛𝑃𝑟 = 𝑛! 𝑛 − 𝑟 ! Size Number of Permutations 6 720 9 362880 16 2.09227x1013 25 1.55112x1025 Application NoC Infrastructure Communication scheme
  • 5. Open Problems in Application mapping 7/10/2014 The University of New South Wales 5 • Simulation too expensive to use in optimization loop of mapping algorithms, so analytical models used for performance evaluation • Analytical models usually abstract architecture specific details, not robust • Instead, propose use of formal NoC model mapping- contains details of Architecture, Communication scheme and Application
  • 6. Types of Mapping Algorithms 7/10/2014 The University of New South Wales 6 Mapping algorithms Static Exact mapping Greedy placement Iterative placement Search based mapping Mapping heuristics Genetic algorithms Dynamic
  • 7. Genetic Algorithms 7/10/2014 The University of New South Wales 7 • Genetic algorithms (GA) uses global search heuristics • Particular class of evolutionary algorithms that use techniques inspired by evolutionary biology- inheritance, mutation, selection and crossover (also called recombination)
  • 8. Genetic Algorithms 7/10/2014 The University of New South Wales 8 Initialize Population Check constraints ? Evaluate Fitness Select Survivors Output Results Vary Individuals Yes No Reproduction Crossover Mutation
  • 9. Crossover Operator 7/10/2014 The University of New South Wales 9 • Crossover – Generating offspring from two selected parents • Single point crossover • Two point crossover (Multi point crossover) • Uniform crossover • Example: – Parent 1: X X | X X X X X – Parent 2: Y Y | Y Y Y Y Y – Offspring 1: X X Y Y Y Y Y – Offspring 2: Y Y X X X X X Crossover is explorative, it makes a big jump to an area somewhere “in between” two (parents)
  • 10. Mutation Operator 7/10/2014 The University of New South Wales 10 Mutation • Generating new offspring from single parent by setting probability of mutation. • Maintaining the diversity of the individuals • Can “generate” new genes • Example: – Parent 1: X Y Y Y Y X Y – Parent 2: X X X Y Y Y X – Offspring 1: X Y Y X Y Y X – Offspring 2: Y X X Y X X Y Mutation is exploitative, it creates random small diversions, thereby staying near the parent
  • 11. Fitness function 7/10/2014 The University of New South Wales 11 Fitness function f(x) Individuals Ranked T1 T2 T3 T4 T5 T6 T6 T2 T3 T1 T5 T4 T1 T3 T6 T4 T5 T2 T5 T2 T3 T4 T1 T6 T4 T2 T5 T3 T1 T6 T6 T2 T3 T4 T5 T1 T1 T2 T3 T4 T5 T6 T6 T2 T3 T1 T5 T4 T1 T3 T6 T4 T5 T2 T5 T2 T3 T4 T1 T6 T4 T2 T5 T3 T1 T6 T6 T2 T3 T4 T5 T1 M1 M2 M3 M4 M5 M6 M5 M1 M2 M6 M3 M4 Fitness Function • Analytical model • Formal model
  • 12. Analytical model for Latency Estimation 7/10/2014 The University of New South Wales 12 Latency : Time for packet to traverse network • Latency = Hop Count+ Contention latency – Contention latency: impacted by traffic loads and buffers in the router, computed using probabilistic models or queuing theory – Hop Count: number of links traversed between source and destination, depends on routing algorithm, which can be computed analytically for deterministic routing algorithms
  • 13. Formal NoC model 7/10/2014 The University of New South Wales 13 P8 receive_R8? inject_pkt_R8 () switch_R8 () arbitrate_R8 () sink_pkt_R8 () transmit_R8! P0 receive_R0? inject_pkt_R0 () switch_R0 () arbitrate_R0 () sink_pkt_R0 () transmit_R0! receive_R8? inject_pkt_R8 () switch_R8 () arbitrate_R8 () sink_pkt_R8 () transmit_R8! P0 receive_R0? inject_pkt_R0 () switch_R0 () arbitrate_R0 () sink_pkt_R0 () transmit_R0! P1 P8 Pt0 = P0 || P1 || P2 || P3 || P4 || P5 || P6 || P7 || P8 Pt0 = P0 . P1 . P2 . P3 . P4 . P5 . P6 . P7 . P8 Estimating worst-case latency of NoC formally (by model checking) - posing latency as reachability problem
  • 14. GA Algorithms Termination Criteria 7/10/2014 The University of New South Wales 14 The generational process repeated until termination condition reached Common terminating conditions are: • A solution is found that satisfies minimum criteria • Fixed number of generations reached • Manual inspection • Any Combinations of the above
  • 15. Case study: NMAP algorithm & DSP Filter Application ARM Filter FFT IFFT Display Memory 200 600 200200 200 Source Destination Rate Memory ARM 200 Filter ARM 600 Filter FFT 200 Filter IFFT 200 ARM Filter 600 ARM Display 200 FFT Filter 200 IFFT Filter 200 [1] Mapping and Physical Planning of Networks-on-Chip Architectures with Quality-of-Service Guarantees 7/10/2014 The University of New South Wales 15 Peak link bandwidth as 1000MB/S 200MB/s – 20% packet injection rate - translates to one packet every 5 clock cycles 600MB/s – 60% packet injection rate - translates to one packet every 2 clock cycles
  • 16. Solution by NMAP Algorithm [1] 0 1 2 3 4 5 Memory FFT Filter ARM IFFT Display 7/10/2014 The University of New South Wales 16 Estimated worst-case latency using the formal model : 3.375 cycles
  • 17. Exploration of mapping space by brute force Modules MAP 93 MAP 95 MAP 271 MAP 276 MAP 391 MAP 396 MAP 699 MAP 701 Memory R0 R0 R3 R5 R3 R5 R2 R2 Filter R5 R4 R1 R1 R1 R1 R4 R4 IFFT R3 R5 R0 R0 R2 R2 R3 R5 FFT R4 R3 R2 R2 R0 R0 R5 R3 ARM R1 R1 R4 R4 R4 R4 R1 R1 Display R2 R2 R5 R3 R5 R3 R0 R0 Mapping proposed in [1] is MAPPING 391 in out model, highlighted above, it has minimum average worst case latency 7/10/2014 The University of New South Wales 17 Mapping with worst-case latency of 3.375 cycles Total number of solutions = 8
  • 18. • It is not possible to perform exhaustive search for large NoCs – 3x2 took 17min 42 sec – 3x3 took 6.5 days (148.62 hours) • Results of different mapping algorithms developed for the same constraint may be different • Mapping algorithms may find only a near optimal solution • Mapping algorithms may find more than one mapping solution Discussion 7/10/2014 The University of New South Wales 18
  • 19. Evaluate possibility of using formal model to enable mapping algorithms to find better optimal solution Proposal 7/10/2014 The University of New South Wales 19 Popular static mapping approaches Mapping heuristics Genetic Algorithms based mapping Using analytical model Using formal model
  • 20. Sample Applications 7/10/2014 The University of New South Wales 20 Application NoC Size Total traces APP1 3x2 8 APP2 3x3 13 APP3 4x4 25 APP4 5x5 38 APP5 10x10 150 APP6 20x20 600 APP7 30x30 1349
  • 21. Population size and Generation size 7/10/2014 The University of New South Wales 21 Algorithm termination condition: • Population size = 100 • Generation size = 10 WHY? • APP1 & APP2 => always finds at least one optimal solution • APP3 & APP4 => output solution is not significantly improved by increasing population and generation size Offspring generated using two types of genetic operators: • random mutation operator R • single point crossover operator S
  • 22. Simple mapping heuristics 7/10/2014 The University of New South Wales 22
  • 23. Result Application WCL (Heuristic) WCL (R_FM) WCL (S_FM) WCL (R_AM) WCL (S_AM) APP1 3.375 3.375 3.375 3.375 3.375 APP2 4.0 3.84 3.84 4.384 4.3 APP3 7.91 5.32 5.44 5.56 5.56 APP4 27.1 6.6 7.4 7.89 8.31 APP5 35.54 20.2 20.7 25.14 29.55 APP6 210.72 86.45 80.4 121.515 112.47 APP7 295.65 203.8 222.1 395.47 271.38 7/10/2014 The University of New South Wales 23
  • 24. Discussion of Mapping Results • GA based mapping algorithm generally finds better optimal solutions than the mapping heuristic, especially as the size of the application increases • all the GA-based algorithms perform equally well for smaller applications [APP1...APP4] • GA + formal is better than GA + analytical for larger applications [APP5...APP7] • With random mutation R, GA + formal has WCL 20% lower than GA + analytical • With single point crossover S, GA + formal has WCL 13% lower than GA + analytical 7/10/2014 The University of New South Wales 24
  • 25. Result 7/10/2014 The University of New South Wales 25
  • 26. Execution Time • GA + formal takes longer to compute, compared with GA + analytical • Random mutation R introduces more diversity in the population, and the solution converges to an optimal solution more quickly. But single point crossover S is faster to compute 7/10/2014 The University of New South Wales 26
  • 27. Conclusion 7/10/2014 The University of New South Wales 27 • Formal models can be applied to solve NoC design problem in addition to functional verification • Formal NoC models may be enhanced to estimate other QoS parameters such as energy, power and throughput, resulting in a good generic performance model
  • 28. Thank You! 7/10/2014 The University of New South Wales 28