SlideShare a Scribd company logo
1 of 21
Design and validation of a multi-core
embedded platform under high
performance requirements
University of L’Aquila
Center of Excellence DEWS
Department of Information Engineering, Computer
Science and Mathematics DISIM
4th Workshop on
High-performance and Real-time
Embedded Systems (HiRES 2016)
V. Muttillo, G. Valente, F. Federici, L. Pomante, M. Faccio
Overview
 Introduction
Proposed Platform
Evaluation and Validation
 Conclusions and future developments
HiRES 2016 2
Introduction
3HiRES 2016
Multi-core Embedded
SoC
On-Chip embedded systems are characterized by
several F/NF requirements
• Response time, power consumption, time-to-market etc.
Multi-core embedded systems design
• Suffers from the lack of uniform pathways to system
realization and application deployment
Parallel programming model
• Allows to obtain a speed-up for a multi-threaded
application by splitting the workload
Run-time monitoring solutions
• Allows to monitor system behaviour during life-time
4HiRES 2016
Proposed Solution
This work presents the development of an
embedded multi-core platform on FPGA with:
• Multi-LEON3 SMP HW architecture
• Non-intrusive distributed HW profiling subsystem
• Integrated customized Linux OS distribution
• OpenMP parallel programming models
• RVS profiling tool support
Final goal of the work
• Development of high-performance multi-core
embedded platform with run-time resource monitoring
components and off-line verification tools support
5HiRES 2016
Platform in
development flow
The work is related to the Artemis-JU ASP CRAFTERS
European project
• It has led to uniform embedded system development
flow in the research and industry domains
• The platform has been proposed to execute and validate
industrial case studies
• Support to embedded system designers
6HiRES 2016
Proposed Platform
7HiRES 2016
LEON3 32-bit synthesizable soft-processors, multi-core mode, dedicated
FPU, MMU for Linux OS etc.
HW Architecture
8HiRES 2016
OS and Parallel
Programming Model
Operating System
• A Linux distribution has been customized, starting from
LEON LINUX kernel
• Cross-compiler toolchain, buildroot tool to build user space
application and RAM loader have been provided by Aeroflex
Gaisler
Parallel Programming Model
• Libraries required to implement parallel applications
using OpenMP C/C++, have been added to the
customized Linux distribution
9HiRES 2016
HW Profiling System
AIPHS (AdaptIve Profiling Hardware Subsystem)
• Event and Time monitoring functionalities
10HiRES 2016
Final Platform
4-core Leon 3 with Linux operating system,
OpenMP libraries and hardware profiling system
11HiRES 2016
ML605 (Virtex 6) Development Board
THE PLATFORM HARDWARE ARCHITECTURE
Final Platform
4-core Leon 3 with Linux operating system,
OpenMP libraries and hardware profiling system
12HiRES 2016
ML605 (Virtex 6) Development Board
THE PLATFORM HARDWARE ARCHITECTURE
Platform Functionalities
• High performance multi-processing software execution
• Run-time event and time monitoring
• Reconfigurable HW architecture
• Resource monitoring application using MW layer
13HiRES 2016
Evaluation and
Validation
14HiRES 2016
0
50000000
100000000
150000000
200000000
250000000
300000000
Reduction Parallel SPMD No false Sharing
1 2 3 4
Simulated results
VIPPE-based speed-up evaluation on selected
benchmark
• Verify if OpenMP program parallelization made sense in
a scenario with a given memory organization (i.e. single
cache, DDR3 interface for external memory etc.)
• To check if specific OpenMP library implementation works well
with the proposed memory organization
15HiRES 2016
Experimental results
AIPHS-based speed-up evaluation on selected
benchmark
• Execution time increases with number of threads
• Multi-core architecture, based on LEON3 and one level
cache, using OpenMP leads to optimal performances
• False sharing problem is quite influent in this system
16HiRES 2016
0
100000000
200000000
300000000
400000000
500000000
600000000
700000000
Reduction Parallel SPMD No false Sharing
1 2 3 4
RVS Support
Rapita Verification Suite provides a framework for
on-target verification of embedded software
The use of AIPHS enables the designer to analyze
time information offline by using Rapita tools
AIPHS allows reducing the need for code
instrumentation so providing information more
related to the real behavior of the considered
application
17HiRES 2016
Conclusions and future
developments
18HiRES 2016
Conclusions
This work has described the design and the
validation of an embedded SoC multi-core platform
• early verification and validation
• enhanced performances in execution time (OpenMP)
• on-chip run-time monitoring (AIPHS)
Support for Rapita Verification Suite (RVS) allows
designers to evaluate meaningful statistics
• WCRT
• Average time execution
• etc…
19HiRES 2016
Future developments
Improvement of the profiling system to collect
more data and events while better filtering
overhead due to OS and ISR
Improvement of multi-core monitoring support for
RVS
Preliminary simulation step with VIPPE tool
integrated in the multi-core embedded systems
specific design flow
20HiRES 2016
Thanks for the attention
Questions?
21HiRES 2016

More Related Content

More from Vittoriano Muttillo

AIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-systemAIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-systemVittoriano Muttillo
 
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...Vittoriano Muttillo
 
Progettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time loggingProgettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time loggingVittoriano Muttillo
 
Algoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessaAlgoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessaVittoriano Muttillo
 
Advances in gml for geospatial applications slide
Advances in gml for geospatial applications slideAdvances in gml for geospatial applications slide
Advances in gml for geospatial applications slideVittoriano Muttillo
 

More from Vittoriano Muttillo (7)

AIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-systemAIPHS: AdaptIve Profiling Hardware Sub-system
AIPHS: AdaptIve Profiling Hardware Sub-system
 
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
A-LOOP: AMP system: 2-cores ARM Cortex A9/Linux OS and 4-cores Leon3/Linux OS...
 
Progettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time loggingProgettazione e realizzazione di un sistema software per il time logging
Progettazione e realizzazione di un sistema software per il time logging
 
Invarianza di un politopo
Invarianza di un politopoInvarianza di un politopo
Invarianza di un politopo
 
Ieee 802.15.4e nel wirelessHART
Ieee 802.15.4e nel wirelessHARTIeee 802.15.4e nel wirelessHART
Ieee 802.15.4e nel wirelessHART
 
Algoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessaAlgoritmi per l'ottimizzazione convessa
Algoritmi per l'ottimizzazione convessa
 
Advances in gml for geospatial applications slide
Advances in gml for geospatial applications slideAdvances in gml for geospatial applications slide
Advances in gml for geospatial applications slide
 

Recently uploaded

US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionMebane Rash
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substationstephanwindworld
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsSachinPawar510423
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Piping Basic stress analysis by engineering
Piping Basic stress analysis by engineeringPiping Basic stress analysis by engineering
Piping Basic stress analysis by engineeringJuanCarlosMorales19600
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 

Recently uploaded (20)

US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of Action
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substation
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documents
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Piping Basic stress analysis by engineering
Piping Basic stress analysis by engineeringPiping Basic stress analysis by engineering
Piping Basic stress analysis by engineering
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 

Design and validation of a multi-core embedded platform under high performance requirements

  • 1. Design and validation of a multi-core embedded platform under high performance requirements University of L’Aquila Center of Excellence DEWS Department of Information Engineering, Computer Science and Mathematics DISIM 4th Workshop on High-performance and Real-time Embedded Systems (HiRES 2016) V. Muttillo, G. Valente, F. Federici, L. Pomante, M. Faccio
  • 2. Overview  Introduction Proposed Platform Evaluation and Validation  Conclusions and future developments HiRES 2016 2
  • 4. Multi-core Embedded SoC On-Chip embedded systems are characterized by several F/NF requirements • Response time, power consumption, time-to-market etc. Multi-core embedded systems design • Suffers from the lack of uniform pathways to system realization and application deployment Parallel programming model • Allows to obtain a speed-up for a multi-threaded application by splitting the workload Run-time monitoring solutions • Allows to monitor system behaviour during life-time 4HiRES 2016
  • 5. Proposed Solution This work presents the development of an embedded multi-core platform on FPGA with: • Multi-LEON3 SMP HW architecture • Non-intrusive distributed HW profiling subsystem • Integrated customized Linux OS distribution • OpenMP parallel programming models • RVS profiling tool support Final goal of the work • Development of high-performance multi-core embedded platform with run-time resource monitoring components and off-line verification tools support 5HiRES 2016
  • 6. Platform in development flow The work is related to the Artemis-JU ASP CRAFTERS European project • It has led to uniform embedded system development flow in the research and industry domains • The platform has been proposed to execute and validate industrial case studies • Support to embedded system designers 6HiRES 2016
  • 8. LEON3 32-bit synthesizable soft-processors, multi-core mode, dedicated FPU, MMU for Linux OS etc. HW Architecture 8HiRES 2016
  • 9. OS and Parallel Programming Model Operating System • A Linux distribution has been customized, starting from LEON LINUX kernel • Cross-compiler toolchain, buildroot tool to build user space application and RAM loader have been provided by Aeroflex Gaisler Parallel Programming Model • Libraries required to implement parallel applications using OpenMP C/C++, have been added to the customized Linux distribution 9HiRES 2016
  • 10. HW Profiling System AIPHS (AdaptIve Profiling Hardware Subsystem) • Event and Time monitoring functionalities 10HiRES 2016
  • 11. Final Platform 4-core Leon 3 with Linux operating system, OpenMP libraries and hardware profiling system 11HiRES 2016 ML605 (Virtex 6) Development Board THE PLATFORM HARDWARE ARCHITECTURE
  • 12. Final Platform 4-core Leon 3 with Linux operating system, OpenMP libraries and hardware profiling system 12HiRES 2016 ML605 (Virtex 6) Development Board THE PLATFORM HARDWARE ARCHITECTURE
  • 13. Platform Functionalities • High performance multi-processing software execution • Run-time event and time monitoring • Reconfigurable HW architecture • Resource monitoring application using MW layer 13HiRES 2016
  • 15. 0 50000000 100000000 150000000 200000000 250000000 300000000 Reduction Parallel SPMD No false Sharing 1 2 3 4 Simulated results VIPPE-based speed-up evaluation on selected benchmark • Verify if OpenMP program parallelization made sense in a scenario with a given memory organization (i.e. single cache, DDR3 interface for external memory etc.) • To check if specific OpenMP library implementation works well with the proposed memory organization 15HiRES 2016
  • 16. Experimental results AIPHS-based speed-up evaluation on selected benchmark • Execution time increases with number of threads • Multi-core architecture, based on LEON3 and one level cache, using OpenMP leads to optimal performances • False sharing problem is quite influent in this system 16HiRES 2016 0 100000000 200000000 300000000 400000000 500000000 600000000 700000000 Reduction Parallel SPMD No false Sharing 1 2 3 4
  • 17. RVS Support Rapita Verification Suite provides a framework for on-target verification of embedded software The use of AIPHS enables the designer to analyze time information offline by using Rapita tools AIPHS allows reducing the need for code instrumentation so providing information more related to the real behavior of the considered application 17HiRES 2016
  • 19. Conclusions This work has described the design and the validation of an embedded SoC multi-core platform • early verification and validation • enhanced performances in execution time (OpenMP) • on-chip run-time monitoring (AIPHS) Support for Rapita Verification Suite (RVS) allows designers to evaluate meaningful statistics • WCRT • Average time execution • etc… 19HiRES 2016
  • 20. Future developments Improvement of the profiling system to collect more data and events while better filtering overhead due to OS and ISR Improvement of multi-core monitoring support for RVS Preliminary simulation step with VIPPE tool integrated in the multi-core embedded systems specific design flow 20HiRES 2016
  • 21. Thanks for the attention Questions? 21HiRES 2016

Editor's Notes

  1. Good afternoon, I’m Vittoriano Muttillo, PhD student from University of L’Aquila and today I will present “Design and validation of a multi-core embedded platform under high performance requirements”. I will start with a brief introduction, then I will expose the proposed platform and the related evaluation and validation tests and, finally, conclusion and future work related.
  2. Well, the main problem on On-chip embedded systems design is considering both Functional and non-functional requirements that impact in the implementation of specific systems. Run-time analysis, power consumption, time to market and so on, this constrains drive the designer to realize their systems. Multi-core embedded design also suffers from the lack of uniform pathway to system realization and application deployment. To improve performance also parallel programming model allows to obtain a speed-up, in terms of run-time response or execution time, for a multi-threaded application by splitting the workload around the processor. Finally, run-time monitor solution can allows to monitor system behavior during life-time, in term of memory access, bus bottleneck checking and processor stalls.
  3. So this work presents the development and design step of an embedded multi-core platform on FPGA with a specific characteristics related to an industrial avionic scenario firstly and in general to general high requirement environments industry scenario. The specific characteristics are a Leon3 SMP multi-core HW architecture and a list of others specific component for different functionalities (improvements of performance, monitoring functionalities etc). So the final goal of this work is the development of high-performance multi-core embedded platform with run-time resource monitoring component (an embedded distributed monitoring system for memory, buses and so on) and off-line verification tool support, such as RVS or others profiling tools.
  4. Additional information are that the work is related to the Artemis CRAFTERS European project which had the aim of has led to uniform embedded system development flow in the research and industry domains. The platform has been proposed to execute and validate an industrial case study offers by Thales Italia.
  5. The proposed platform realized in this work is based on Leon3 soft processor with a dedicated FPU per processor, a shared memory around the AHB bus and others particular configuration (cache, MMU and so on).
  6. On this HW architecture, a linux distribution has been customized, starting from Linux ernel with using leon patch for the architectural merge. Toolchains, buildroot tool for user space application and ram loader have been provided by Aeroflex Gaisler. For using a parallel programming model we have used the GNU OpenMP library GOMP in terms of dynamic linked library implementation at runtime.
  7. The HW pofiling system implemented, the distributed AIPHS Adaptive HW profiling SubSystems, is composed of an arbitrary number of sniffer with the architecture shown in figure. In particular, the adaptability is given by two part of sniffer component, the adapter and the bus output interface. This particular implementation ensure that is possible to adapt the our profiling system to different HW architecture and to distribute it on the specific HW implementation/prototype.
  8. The final platform is shown in this two figure. This is 4-core LEON3 with linux operating system, OpenMP libraries and distributed Hardware profiling systems. The development board used in this work is a ML605 Virtex 6.
  9. The final platform is shown in this two figure. This is 4-core LEON3 with linux operating system, OpenMP libraries and distributed Hardware profiling systems. The development board used in this work is a ML605 Virtex 6.
  10. So we proposed a specific functionalities framework to allows user to access Profiling HW data from an High level point of view. This framework is a multi-tier architecture with the hw abstraction layer for access AIPHS system, the kernel space with the specific instance of Operating system and parallel programming model and the user space layer with a middleware component that allow user application and third part monitoring software to use AIPHS and profiling data for online and offline analysis at runtime. The final goals of this multi-tier framework is to get high performance multi-processing software execution, ti provide a runtime event and time monitoring with our profiling systems, to use reconfigurable HW architecture and to to monitor application resource using middleware functionalities layer.
  11. Finally we have used a series of benchmark test for evaluation and validation the proposed platform. In this case we preliminary have made simulation using a virtual platform simulation tool provided by University of Cantabria, VIPPE tools. In this manner we have evaluated the speed-up trend on selected benchmark for verify if openmp program parallelization with specific configuration (single cache, DDR3 interface for external memory and so on) and to check if specific OpenMP library implementation works well with the proposed memory organization. In the left figure there are the simulation results that shown the speed-up increase and time decrease in clock cycle number. In the right figure we can be see the vippe simulation methods in which we have used platform model and software c implementation to simulating the run-time code execution of our systems.
  12. After this simulation step, we have used the same benchmark on-target, in a real execution environment. The result has been taken using AIPHS profiling internal system. The data has shown that execution time increases with number of threads and the multi-core architecture, based on LEON3 and one level cache, using OpenMP leads to optimal performance with result similar in speed-up trend respect to simulation execution. In this case we can see that false sharing problem is quite influent in this scenario, different to simulation mode in which this problem was not be relevant.
  13. Finally, we use profiling HW data to support Rapita verification Suite that provides a framework for on-target verification of embedded systems and AIPHS enables designer to analyze time information offline by using rapita tools. The choice of RVS has been driven by the possibility to allows reducing the need for code instrumentation so providing information more related to real behavior of considered application, providing a series of specific code analysis WCET, bus bottleneck, code coverage end so on, in order to help designer in the development step.
  14. This work has described the design and validation of an embedded multi core platform on SoC, with early verification and validation test, enhanced performance in execution time using OpenMP and implementing an on-chip run-time hw monitoring profiling systems called AIPHS in order to collect low level hw data. The support implementation of AIPHS related to RVS allaws designers to evaluate meaningfull statistics such as WCRT, Average time execution, bus utilization and bottlenecks, processor stalls in multi core mode and so on.
  15. Future development involves improvement of the profiling system to collect more data and events while better filtering overhead due to operating systems and interrupt service routine, interrupt time. The proposed platform offers also an improvement of multi-core monitoring support for RVS. Another step is to define a design methodology for developing multi-core embedded system using a preliminary simulation step with VIPPE virtual platform simulation tool integrated in the specific design flow.