SlideShare a Scribd company logo
1 of 22
Download to read offline
A RTRM Proposal for Multi/Many-Core Platforms
        and Reconfigurable Applications
                            P. Bellasi, G. Massari and W. Fornaciari
                              {bellasi, massari, fornacia}@elet.polimi.it




                                              ReCoSoC 2012

                                 Dipartimento di Elettronica e Informazione
                                           Politecnico di Milano

Last revision Jul, 2 2012
Introduction
       Why Run-Time Resource Management?

    Run-Time Resources Management (RTRM) is about
            finding the optimal tradeoff between
        QoS requirements and resources availability

    Target scenario
        Shared HW resources
             upcoming many-core devices are complex systems
                 process variations and run-time issues
        Mixed SW workloads
             resources sharing and competition
                 among applications with different and time-varying requirements
    Simple solutions are required
        support for frequently changing use-cases
        suitable for both critical and best-effort applications
The BarbequeRTRM Framework                    2
Introduction
       Paper Contribution

    Methodology to support system-wide run-time
     resource management
        exploiting design-time information
        hierarchical and distributed control
                                                 http://www.2parma.eu

    BarbequeRTRM Framework
        multi-objective optimization strategy
        easily portable and modular design
        run-time tunable and scalable policies
        open source project                      http://bosp.dei.polimi.it




The BarbequeRTRM Framework         3
Introduction
       How we compare?

                                     Desirable Properties




                                          De
                                            sig
                                             Co
                                              Clu



                                                ntr


                                                n-T
                                                 Ho


                                                 ste
                                                   Mu




                                                    ol-
                                                    He




                                                    im
                                                     Mu




                                                     mo


                                                      red
                                                      Re




                                                        Th
                                                        ter




                                                        eE
                                                        lt.
                                                         lti-




                                                          g.
           Resources



                                                           co




                                                            eo
                                                            .P
                                                            Re




                                                            Re




                                                             xp
                           Po




                                                             Ob




                                                              Pla
                                                              nf.




                                                               ry
                                                               lat
                                                                so
            Managers




                                                                so




                                                                loi
                              r




                                                                 jec
                             tab



                                                                  / Ad




                                                                  tfo




                                                                   Mo
                                                                    for
                                                                    urc




                                                                    urc




                                                                    tat
           Proposals




                                                                      ti v




                                                                      rm
                                 ilit



                                                                       ap




                                                                       ion
                                                                        ms




                                                                        de
                                                                         es




                                                                         es
                                                                          e




                                                                           s
                                      y



                                                                           t




                                                                           l
                StarPU
          Binotto et al.

               Fu et al.
              ACTORS

                 SEEC

                DistRM
         BarbequeRTRM



The BarbequeRTRM Framework                        4
The BarbequeRTRM
             System-Wide RTRM: Overall Framework


user-space                                                                     Application-Specific RTM
                               Critical Apps            Best-Effort Apps             Fine grained control on application
                          a                                                b         allocated resources:
                                    A                              B                 - task ordering
                                                                                     - virtual processor assignment
                                  RTLib                          RTLib               - DVFS
  Dynamic Code                                                                       - application parameters monitoring
   Generation                       C          C
          c                                                                        System-Wide RTRM
                              Res Accounting           Res Partitioning              Coarse grained control on platform
                      G                                                              available resources:
                                          Res Abstraction                            - resource accounting, partitioning
                                                                                     and abstraction
                                                                                     - high-level HW events handling
                                 MRAPI                 Platform Proxy                   e.g., critical conditions, faults...
                                                                                     - manage applications priorities
                                    D                        E                       - power/thermal “coarse tuning”
kernel
                                        Platform DRV
                                         Platform DRV                  d
                                           Platform Driver
                                                                                    BarbequeRTRM
supported platforms                                F
                      f
  Task Mapping            I               Platform Firmware
                                                                                                      Legend
         DDM              H                                                    X     SW Interface (API)
                      e
                                                                               Y     SW/HW Meta-data

The BarbequeRTRM Framework                             5
The BarbequeRTRM
             System-Wide RTRM: Presentation Outline

                          1
user-space                                                                             Application-Specific RTM
                                   Critical Apps             Best-Effort Apps                Fine grained control on application
                              a                                                    b         allocated resources:
                                        A                              B                     - task ordering
                                                                                             - virtual processor assignment
                                      RTLib                          RTLib                   - DVFS
  Dynamic Code                                                                               - application parameters monitoring
   Generation                           C          C
          c                                             2                                  System-Wide RTRM
                                  Res Accounting            Res Partitioning                 Coarse grained control on platform
                      G                                                                      available resources:
                                              Res Abstraction                                - resource accounting, partitioning
                                                                                             and abstraction
                                                                                             - high-level HW events handling
                                     MRAPI                  Platform Proxy                      e.g., critical conditions, faults...
                                                                               3             - manage applications priorities
                                        D                        E                           - power/thermal “coarse tuning”
kernel
                                            Platform DRV
                                             Platform DRV                  d
                                               Platform Driver
                                                                                            BarbequeRTRM
supported platforms                                    F
                      f
  Task Mapping                I               Platform Firmware
                                                                                                              Legend
         DDM                  H                                                        X     SW Interface (API)
                      e
                                                                                       Y     SW/HW Meta-data

The BarbequeRTRM Framework                                  6
The BarbequeRTRM
             System-Wide RTRM: Distributed Hierarchical Control

                          1
user-space                                                                         Application-Specific RTM
                                   Critical Apps            Best-Effort Apps             Fine grained control on application
                              a                                                b         allocated resources:
                                        A                              B                 - task ordering
                                                                                         - virtual processor assignment
                                      RTLib                          RTLib               - DVFS
  Dynamic Code                                                                           - application parameters monitoring
   Generation                           C          C
          c                                                                            System-Wide RTRM
                                  Res Accounting           Res Partitioning              Coarse grained control on platform
                      G                                                                  available resources:
                                              Res Abstraction                            - resource accounting, partitioning
                                                                                         and abstraction
                                                                                         - high-level HW events handling
                                     MRAPI                 Platform Proxy                   e.g., critical conditions, faults...
                                                                                         - manage applications priorities
                                        D                        E                       - power/thermal “coarse tuning”
kernel
                                            Platform DRV
                                             Platform DRV                  d
                                               Platform Driver
                                                                                        BarbequeRTRM
supported platforms                                    F
                      f
  Task Mapping                I               Platform Firmware
                                                                                                          Legend
         DDM                  H                                                    X     SW Interface (API)
                      e
                                                                                   Y     SW/HW Meta-data

The BarbequeRTRM Framework                                 7
The Proposed Control Solution
       Distributed Hierarchical Control

    Different subsystems have their own control loop (CL)
        System-wide level (resources partitioning, system-wide optimization, ...)
        Application specific (application tuning, dynamic memory management, ...)
        Firmware/OS level (F/V control, thermal alarms, resource availability, ...)

    FF closed CL
        using OP and AWM

    Optimal
        user defined goal functions
        including overheads



    Robust
    Adaptive
The BarbequeRTRM Framework                   8
The BarbequeRTRM
             System-Wide RTRM: Resource Partitioning Strategy


user-space                                                                      Application-Specific RTRM
                               Critical Apps             Best-Effort Apps             Fine grained control on application
                          a                                                 b         allocated resources:
                                    A                              B                  - task ordering
                                                                                      - virtual processor assignment
                                  RTLib                          RTLib                - DVFS
  Dynamic Code                                                                        - application parameters monitoring
   Generation                       C          C
          c                                         2                               System-Wide RTRM
                              Res Accounting            Res Partitioning              Coarse grained control on platform
                      G                                                               available resources:
                                          Res Abstraction                             - resource accounting, partitioning
                                                                                      and abstraction
                                                                                      - high-level HW events handling
                                 MRAPI                  Platform Proxy                   e.g., critical conditions, faults...
                                                                                      - manage applications priorities
                                    D                        E                        - power/thermal “coarse tuning”
kernel
                                        Platform DRV
                                         Platform DRV                  d
                                           Platform Driver
                                                                                     BarbequeRTRM
supported platforms                                F
                      f
  Task Mapping            I               Platform Firmware
                                                                                                       Legend
         DDM              H                                                     X     SW Interface (API)
                      e
                                                                                Y     SW/HW Meta-data

The BarbequeRTRM Framework                              9
Scheduling Policy
       YaMS - A modular multi-objective scheduler

    Introduction of a new modular policy (YaMS)
        partition available resources (R) on applications (A)
                 considering A priorities and R “residual” availabilities
        multi-objective optimization
             support a set of tunable goals
                 DONE: performances, overheads,
                     congestion, fairness
                 WIP: stability, robustness,
                     thermal and power
             increase overall system value
                 considering discrete and tunable
                 improvements
    LP theory, MMKP heuristic
        promote scheduling of some AWMs
             which improve optimization goals
        demote scheduling of others AWMs
             which degrade solution metrics
                 e.g. stability and robustness
The BarbequeRTRM Framework                       10
Scheduling Policy
       YaMS - Scalability



                                  Speedup

                                   +36%

                                   +54%




The BarbequeRTRM Framework   11
Scheduling Policy
       System-Wide Controller – Overall View




                                          BBQ Validation Policy
                                          - enforce certain control properties
                                               energy budget, stability and robustness
                                          - authorize resources synchronization

The BarbequeRTRM Framework         12
Scheduling Policy
       System-Wide Controller – Inner-Loop “Scheduling”




                                          BBQ Validation Policy
                                          - enforce certain control properties
                                              energy budget, stability and robustness
                                          - authorize resources synchronization

The BarbequeRTRM Framework        13
Scheduling Policy
       System-Wide Controller – Inner-Loop Overheads

             Apps with 3 AWM, 3 Clusters => 9 configuration per application
                         BBQ running on NSJ, 4 CPUs @ 2.5GHz (max)




                                               +
                                  +




                                                         +


The BarbequeRTRM Framework                14
The BarbequeRTRM
             System-Wide RTRM: Platform Integration


user-space                                                                         Application-Specific RTRM
                               Critical Apps            Best-Effort Apps                 Fine grained control on application
                          a                                                    b         allocated resources:
                                    A                              B                     - task ordering
                                                                                         - virtual processor assignment
                                  RTLib                          RTLib                   - DVFS
  Dynamic Code                                                                           - application parameters monitoring
   Generation                       C          C
          c                                                                            System-Wide RTRM
                              Res Accounting            Res Partitioning                 Coarse grained control on platform
                      G                                                                  available resources:
                                          Res Abstraction                                - resource accounting, partitioning
                                                                                         and abstraction
                                                                                         - high-level HW events handling
                                 MRAPI                 Platform Proxy                       e.g., critical conditions, faults...
                                                                           3             - manage applications priorities
                                    D                        E                           - power/thermal “coarse tuning”
kernel
                                        Platform DRV
                                         Platform DRV                  d
                                           Platform Driver
                                                                                        BarbequeRTRM
supported platforms                                F
                      f
  Task Mapping            I               Platform Firmware
                                                                                                          Legend
         DDM              H                                                        X     SW Interface (API)
                      e
                                                                                   Y     SW/HW Meta-data

The BarbequeRTRM Framework                             15
The BarbequeRTRM
       Platform Integration Layer

    Support for generic Linux SMP/NUMA machines
        portable solution, based on Linux Control Group
             CPUs and Memory nodes assignments
             Memory amount assignment
             CPU bandwidth quota assignment (requires kernel 3.2)
    Support both resources monitoring and control
        pre-configured CGroup to define BBQ controlled resources
             at Barbeque start, by parsing a pre-configured cgroups
             … than Barbeque takes control over these resources
        tun-time generation of new CGroup to control applications
             requires RTLib integration (of course)


    Working on P2012 integration
        new genration many-core platform from STMicroelectronics
             first version: 4 cluster with 16 Processing Element each one
The BarbequeRTRM Framework             16
Synchronization Policy
       System-Wide Controller – Outer-Loop “Synchronization”




                                         BBQ Validation Policy
                                         - enforce certain control properties
                                             energy budget, stability and robustness
                                         - authorize resources synchronization

The BarbequeRTRM Framework        17
Synchronization Policy
        System-Wide Controller – Outer-Loop Overheads

min AWM 25% CPU Time, 3 Clusters x 4CPUs => max 48 syncs
BBQ running on NSJ, 4 CPUs @ 2.5GHz (max)




                                                               +

                                        Linux kernel 3.2
                                  Creation overheads: ~500ms
                                  Update overheads: ~100ms
                                                               +
                                       (1/3 on quadcore i7)
                             +




                                                          +




                                                               +
     Application dependent



                                                                   CGroups
                                                                     PIL


The BarbequeRTRM Framework                      18
The BarbequeRTRM Framework
       Power Optimizations

    Initial experiments on congested workloads
        increasing running instances of Bodytrack (PARSEC)
             3AWM: [1,2,4] Threads
        system-wide power measurements
             via the standard IPMI interface




                                                                 Power Gains
                                                                   2,3-3,7%

                              Time Gains
                               338-625%




                                            X86_64 NUMA machine: 3 Clusters x 4CPUs
                                                    BBQ running on NSJ, 4 CPUs @ 800MHz

The BarbequeRTRM Framework                 19
The BarbequeRTRM Framework
       Conclusions

    Framework for System-Wide RTRM
        flexibility and scalability of the RTRM strategy
             thanks to its hierarchical and distributed control structure
        acceptable overheads for real usage scenarios
             including those with variable workload
        tunable multi-objective optimization policies
             to cope with several design constraints and goals
                 e.g., performance, power, thermal and reliability, ...
        promising results in terms of performance improving
        and power consumption reduction
             for a highly parallel workload, on a NUMA multi-core architecture

        availability of a simple API interface
             making straightforward for the programmers to take full advantages
             from framework services

The BarbequeRTRM Framework                     20
The BarbequeRTRM Framework
       Future Works

    Optimization policies
        robustness and stability assessment
        improving power/energy optimizations
        thermal and reliability management

    Platform integration
        extended support for Android targets
        complete the integration with the P2012 many-core platform




The BarbequeRTRM Framework      21
Thanks for your attention!




        If you are interested, please check
the project website for further information
  and keep update with the developments
                                              http://bosp.dei.polimi.it

More Related Content

Viewers also liked

Bootstrap process of u boot (NDS32 RISC CPU)
Bootstrap process of u boot (NDS32 RISC CPU)Bootstrap process of u boot (NDS32 RISC CPU)
Bootstrap process of u boot (NDS32 RISC CPU)Macpaul Lin
 
USB Specification 2.0 - Chapter 9 - Device Framework
USB Specification 2.0 - Chapter 9 - Device FrameworkUSB Specification 2.0 - Chapter 9 - Device Framework
USB Specification 2.0 - Chapter 9 - Device FrameworkMacpaul Lin
 
U boot porting guide for SoC
U boot porting guide for SoCU boot porting guide for SoC
U boot porting guide for SoCMacpaul Lin
 
Linux Porting to a Custom Board
Linux Porting to a Custom BoardLinux Porting to a Custom Board
Linux Porting to a Custom BoardPatrick Bellasi
 
Exploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementExploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementPatrick Bellasi
 
Embedded Linux from Scratch to Yocto
Embedded Linux from Scratch to YoctoEmbedded Linux from Scratch to Yocto
Embedded Linux from Scratch to YoctoSherif Mousa
 
Building Mini Embedded Linux System for X86 Arch
Building Mini Embedded Linux System for X86 ArchBuilding Mini Embedded Linux System for X86 Arch
Building Mini Embedded Linux System for X86 ArchSherif Mousa
 

Viewers also liked (7)

Bootstrap process of u boot (NDS32 RISC CPU)
Bootstrap process of u boot (NDS32 RISC CPU)Bootstrap process of u boot (NDS32 RISC CPU)
Bootstrap process of u boot (NDS32 RISC CPU)
 
USB Specification 2.0 - Chapter 9 - Device Framework
USB Specification 2.0 - Chapter 9 - Device FrameworkUSB Specification 2.0 - Chapter 9 - Device Framework
USB Specification 2.0 - Chapter 9 - Device Framework
 
U boot porting guide for SoC
U boot porting guide for SoCU boot porting guide for SoC
U boot porting guide for SoC
 
Linux Porting to a Custom Board
Linux Porting to a Custom BoardLinux Porting to a Custom Board
Linux Porting to a Custom Board
 
Exploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementExploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource Management
 
Embedded Linux from Scratch to Yocto
Embedded Linux from Scratch to YoctoEmbedded Linux from Scratch to Yocto
Embedded Linux from Scratch to Yocto
 
Building Mini Embedded Linux System for X86 Arch
Building Mini Embedded Linux System for X86 ArchBuilding Mini Embedded Linux System for X86 Arch
Building Mini Embedded Linux System for X86 Arch
 

Recently uploaded

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 

Recently uploaded (20)

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 

A RTRM Proposal for Multi/Many-Core Platforms and Reconfigurable Applications

  • 1. A RTRM Proposal for Multi/Many-Core Platforms and Reconfigurable Applications P. Bellasi, G. Massari and W. Fornaciari {bellasi, massari, fornacia}@elet.polimi.it ReCoSoC 2012 Dipartimento di Elettronica e Informazione Politecnico di Milano Last revision Jul, 2 2012
  • 2. Introduction Why Run-Time Resource Management?  Run-Time Resources Management (RTRM) is about finding the optimal tradeoff between QoS requirements and resources availability  Target scenario Shared HW resources upcoming many-core devices are complex systems process variations and run-time issues Mixed SW workloads resources sharing and competition among applications with different and time-varying requirements  Simple solutions are required support for frequently changing use-cases suitable for both critical and best-effort applications The BarbequeRTRM Framework 2
  • 3. Introduction Paper Contribution  Methodology to support system-wide run-time resource management exploiting design-time information hierarchical and distributed control http://www.2parma.eu  BarbequeRTRM Framework multi-objective optimization strategy easily portable and modular design run-time tunable and scalable policies open source project http://bosp.dei.polimi.it The BarbequeRTRM Framework 3
  • 4. Introduction How we compare? Desirable Properties De sig Co Clu ntr n-T Ho ste Mu ol- He im Mu mo red Re Th ter eE lt. lti- g. Resources co eo .P Re Re xp Po Ob Pla nf. ry lat so Managers so loi r jec tab / Ad tfo Mo for urc urc tat Proposals ti v rm ilit ap ion ms de es es e s y t l StarPU Binotto et al. Fu et al. ACTORS SEEC DistRM BarbequeRTRM The BarbequeRTRM Framework 4
  • 5. The BarbequeRTRM System-Wide RTRM: Overall Framework user-space Application-Specific RTM Critical Apps Best-Effort Apps Fine grained control on application a b allocated resources: A B - task ordering - virtual processor assignment RTLib RTLib - DVFS Dynamic Code - application parameters monitoring Generation C C c System-Wide RTRM Res Accounting Res Partitioning Coarse grained control on platform G available resources: Res Abstraction - resource accounting, partitioning and abstraction - high-level HW events handling MRAPI Platform Proxy e.g., critical conditions, faults... - manage applications priorities D E - power/thermal “coarse tuning” kernel Platform DRV Platform DRV d Platform Driver BarbequeRTRM supported platforms F f Task Mapping I Platform Firmware Legend DDM H X SW Interface (API) e Y SW/HW Meta-data The BarbequeRTRM Framework 5
  • 6. The BarbequeRTRM System-Wide RTRM: Presentation Outline 1 user-space Application-Specific RTM Critical Apps Best-Effort Apps Fine grained control on application a b allocated resources: A B - task ordering - virtual processor assignment RTLib RTLib - DVFS Dynamic Code - application parameters monitoring Generation C C c 2 System-Wide RTRM Res Accounting Res Partitioning Coarse grained control on platform G available resources: Res Abstraction - resource accounting, partitioning and abstraction - high-level HW events handling MRAPI Platform Proxy e.g., critical conditions, faults... 3 - manage applications priorities D E - power/thermal “coarse tuning” kernel Platform DRV Platform DRV d Platform Driver BarbequeRTRM supported platforms F f Task Mapping I Platform Firmware Legend DDM H X SW Interface (API) e Y SW/HW Meta-data The BarbequeRTRM Framework 6
  • 7. The BarbequeRTRM System-Wide RTRM: Distributed Hierarchical Control 1 user-space Application-Specific RTM Critical Apps Best-Effort Apps Fine grained control on application a b allocated resources: A B - task ordering - virtual processor assignment RTLib RTLib - DVFS Dynamic Code - application parameters monitoring Generation C C c System-Wide RTRM Res Accounting Res Partitioning Coarse grained control on platform G available resources: Res Abstraction - resource accounting, partitioning and abstraction - high-level HW events handling MRAPI Platform Proxy e.g., critical conditions, faults... - manage applications priorities D E - power/thermal “coarse tuning” kernel Platform DRV Platform DRV d Platform Driver BarbequeRTRM supported platforms F f Task Mapping I Platform Firmware Legend DDM H X SW Interface (API) e Y SW/HW Meta-data The BarbequeRTRM Framework 7
  • 8. The Proposed Control Solution Distributed Hierarchical Control  Different subsystems have their own control loop (CL) System-wide level (resources partitioning, system-wide optimization, ...) Application specific (application tuning, dynamic memory management, ...) Firmware/OS level (F/V control, thermal alarms, resource availability, ...)  FF closed CL using OP and AWM  Optimal user defined goal functions including overheads  Robust  Adaptive The BarbequeRTRM Framework 8
  • 9. The BarbequeRTRM System-Wide RTRM: Resource Partitioning Strategy user-space Application-Specific RTRM Critical Apps Best-Effort Apps Fine grained control on application a b allocated resources: A B - task ordering - virtual processor assignment RTLib RTLib - DVFS Dynamic Code - application parameters monitoring Generation C C c 2 System-Wide RTRM Res Accounting Res Partitioning Coarse grained control on platform G available resources: Res Abstraction - resource accounting, partitioning and abstraction - high-level HW events handling MRAPI Platform Proxy e.g., critical conditions, faults... - manage applications priorities D E - power/thermal “coarse tuning” kernel Platform DRV Platform DRV d Platform Driver BarbequeRTRM supported platforms F f Task Mapping I Platform Firmware Legend DDM H X SW Interface (API) e Y SW/HW Meta-data The BarbequeRTRM Framework 9
  • 10. Scheduling Policy YaMS - A modular multi-objective scheduler  Introduction of a new modular policy (YaMS) partition available resources (R) on applications (A) considering A priorities and R “residual” availabilities multi-objective optimization support a set of tunable goals DONE: performances, overheads, congestion, fairness WIP: stability, robustness, thermal and power increase overall system value considering discrete and tunable improvements  LP theory, MMKP heuristic promote scheduling of some AWMs which improve optimization goals demote scheduling of others AWMs which degrade solution metrics e.g. stability and robustness The BarbequeRTRM Framework 10
  • 11. Scheduling Policy YaMS - Scalability Speedup +36% +54% The BarbequeRTRM Framework 11
  • 12. Scheduling Policy System-Wide Controller – Overall View BBQ Validation Policy - enforce certain control properties energy budget, stability and robustness - authorize resources synchronization The BarbequeRTRM Framework 12
  • 13. Scheduling Policy System-Wide Controller – Inner-Loop “Scheduling” BBQ Validation Policy - enforce certain control properties energy budget, stability and robustness - authorize resources synchronization The BarbequeRTRM Framework 13
  • 14. Scheduling Policy System-Wide Controller – Inner-Loop Overheads Apps with 3 AWM, 3 Clusters => 9 configuration per application BBQ running on NSJ, 4 CPUs @ 2.5GHz (max) + + + The BarbequeRTRM Framework 14
  • 15. The BarbequeRTRM System-Wide RTRM: Platform Integration user-space Application-Specific RTRM Critical Apps Best-Effort Apps Fine grained control on application a b allocated resources: A B - task ordering - virtual processor assignment RTLib RTLib - DVFS Dynamic Code - application parameters monitoring Generation C C c System-Wide RTRM Res Accounting Res Partitioning Coarse grained control on platform G available resources: Res Abstraction - resource accounting, partitioning and abstraction - high-level HW events handling MRAPI Platform Proxy e.g., critical conditions, faults... 3 - manage applications priorities D E - power/thermal “coarse tuning” kernel Platform DRV Platform DRV d Platform Driver BarbequeRTRM supported platforms F f Task Mapping I Platform Firmware Legend DDM H X SW Interface (API) e Y SW/HW Meta-data The BarbequeRTRM Framework 15
  • 16. The BarbequeRTRM Platform Integration Layer  Support for generic Linux SMP/NUMA machines portable solution, based on Linux Control Group CPUs and Memory nodes assignments Memory amount assignment CPU bandwidth quota assignment (requires kernel 3.2)  Support both resources monitoring and control pre-configured CGroup to define BBQ controlled resources at Barbeque start, by parsing a pre-configured cgroups … than Barbeque takes control over these resources tun-time generation of new CGroup to control applications requires RTLib integration (of course)  Working on P2012 integration new genration many-core platform from STMicroelectronics first version: 4 cluster with 16 Processing Element each one The BarbequeRTRM Framework 16
  • 17. Synchronization Policy System-Wide Controller – Outer-Loop “Synchronization” BBQ Validation Policy - enforce certain control properties energy budget, stability and robustness - authorize resources synchronization The BarbequeRTRM Framework 17
  • 18. Synchronization Policy System-Wide Controller – Outer-Loop Overheads min AWM 25% CPU Time, 3 Clusters x 4CPUs => max 48 syncs BBQ running on NSJ, 4 CPUs @ 2.5GHz (max) + Linux kernel 3.2 Creation overheads: ~500ms Update overheads: ~100ms + (1/3 on quadcore i7) + + + Application dependent CGroups PIL The BarbequeRTRM Framework 18
  • 19. The BarbequeRTRM Framework Power Optimizations  Initial experiments on congested workloads increasing running instances of Bodytrack (PARSEC) 3AWM: [1,2,4] Threads system-wide power measurements via the standard IPMI interface Power Gains 2,3-3,7% Time Gains 338-625% X86_64 NUMA machine: 3 Clusters x 4CPUs BBQ running on NSJ, 4 CPUs @ 800MHz The BarbequeRTRM Framework 19
  • 20. The BarbequeRTRM Framework Conclusions  Framework for System-Wide RTRM flexibility and scalability of the RTRM strategy thanks to its hierarchical and distributed control structure acceptable overheads for real usage scenarios including those with variable workload tunable multi-objective optimization policies to cope with several design constraints and goals e.g., performance, power, thermal and reliability, ... promising results in terms of performance improving and power consumption reduction for a highly parallel workload, on a NUMA multi-core architecture availability of a simple API interface making straightforward for the programmers to take full advantages from framework services The BarbequeRTRM Framework 20
  • 21. The BarbequeRTRM Framework Future Works  Optimization policies robustness and stability assessment improving power/energy optimizations thermal and reliability management  Platform integration extended support for Android targets complete the integration with the P2012 many-core platform The BarbequeRTRM Framework 21
  • 22. Thanks for your attention! If you are interested, please check the project website for further information and keep update with the developments http://bosp.dei.polimi.it