ARM TechCon Session "Virtualization as the Nexus of Multicore Power Management"
Thursday, November 11, 2010
Adoption of multicore technology for desktop, data center and embedded designs responds to comparable needs: to scale compute capacity without stepping up system clocks and to attain more MIPS-per-watt for devices and applications. Multicore for the desktop and data center enjoys mature support from deployed OSes. Even as embedded OSes become more adept at running on multicore CPUs, applications and middleware still face challenges of thread-safety, concurrency and load balancing. Mobile virtualization is a means to get maximum value from multicore ARM designs, at both architectural and application levels. This session examines multicore use cases for virtualization, and how it brings superior CPU utilization, greater security, smoother legacy migration, and smarter energy management to multicore designs.
4. > Device uses energy
• Drains battery
> Goal of energy management:
• Maximize battery life
Energy in Mobile Devices
5. Dynamic voltage and frequency scaling
> CMOS power consumption:
• P = Pdyn + Pstat
• Pdyn ∝ f V²
• Vmin ∝ f (very approximately)
> Assuming execution time T ∝ 1/f:
• Edyn = Pdyn T ∝ f V² / f = V² ∝ f²
• lower frequency ⇒ lower dynamic energy
Energy-Management Mechanisms: DVFS
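The scaling argument on this slide can be checked numerically. The following is a minimal sketch, assuming V is exactly proportional to f and using arbitrary constants (`k`, `cycles`) that are not from the talk; only the ratio between two frequencies is meaningful.

```python
def dynamic_energy(freq_ghz: float, cycles: float = 1e9, k: float = 1.0) -> float:
    """Relative dynamic energy for a fixed amount of work.

    Assumptions (illustrative only): V ∝ f with constant 1,
    P_dyn = k * f * V^2, and execution time T = cycles / f.
    """
    voltage = freq_ghz                    # V ∝ f (very approximately)
    p_dyn = k * freq_ghz * voltage ** 2   # P_dyn ∝ f V^2
    t_exec = cycles / (freq_ghz * 1e9)    # T ∝ 1/f
    return p_dyn * t_exec                 # E_dyn ∝ f^2


# Halving the frequency cuts the modeled dynamic energy to one quarter.
ratio = dynamic_energy(1.0) / dynamic_energy(0.5)
```

Under these assumptions `ratio` comes out as 4, which is the f² dependence stated above.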
6. > When CPU is idle, turn clock off
• Pdyn = 0 ⇒ P = Pstat
> Sleep states reduce power further:
• Psleep < Pstat
> Typically have multiple sleep states
• shallow sleep states save some energy but are fast to enter/exit
• deep sleep states save more energy but lose state and are expensive to enter/exit
> Complex tradeoff
Mechanisms: Sleep States
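One way to picture the tradeoff between shallow and deep sleep states is a break-even calculation. The sketch below is illustrative only: the state names, power figures, transition costs and latencies are made-up values, and the model ignores the power drawn during the transition itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SleepState:
    name: str
    power_mw: float        # power drawn while in the state
    transition_mj: float   # energy cost of entering plus exiting
    latency_ms: float      # time needed to enter plus exit


# Hypothetical platform data, not taken from the talk.
STATES = [
    SleepState("clock-gated", power_mw=50.0, transition_mj=0.0, latency_ms=0.0),
    SleepState("shallow", power_mw=20.0, transition_mj=0.5, latency_ms=1.0),
    SleepState("deep", power_mw=2.0, transition_mj=5.0, latency_ms=20.0),
]


def idle_energy_mj(state: SleepState, idle_ms: float) -> float:
    """Energy spent idling in `state` for `idle_ms` (mW * ms = µJ, hence / 1000)."""
    return state.transition_mj + state.power_mw * idle_ms / 1000.0


def best_state(idle_ms: float, states=STATES) -> SleepState:
    """Cheapest state whose enter/exit latency fits into the idle period."""
    feasible = [s for s in states if s.latency_ms <= idle_ms]
    return min(feasible, key=lambda s: idle_energy_mj(s, idle_ms))
```

With these numbers a 5 ms idle period is best spent clock-gated, a 50 ms period in the shallow state, and a 1 s period in the deep state.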
7. > Edyn ∝ f² ⇒ lowest frequency is best
> Ignores static energy!
• E = Edyn + Estat
• Edyn ∝ f²
• Estat = Pstat T ∝ 1/f
> Low f increases execution time
⇒ Estat increases at low f !
Popular Approach: Lowest Frequency
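The effect of static energy on the "lowest frequency" heuristic can be sketched with a two-term model. The coefficients `a` and `b` and the list of frequencies are arbitrary assumptions chosen for illustration.

```python
def total_energy(f: float, a: float = 1.0, b: float = 2.0) -> float:
    """E = E_dyn + E_stat with E_dyn = a * f^2 and E_stat = b / f (relative units)."""
    return a * f ** 2 + b / f


# Hypothetical set of available frequency setpoints (relative units).
FREQS = [0.25, 0.5, 0.75, 1.0, 1.25, 1.5]

best_freq = min(FREQS, key=total_energy)
```

With these coefficients the minimum is at f = 1.0, not at the lowest setpoint: below it, the growing static term b/f outweighs the savings in dynamic energy.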
8. > Run at maximum f, then go to sleep
• Tries to minimize static power, but:
• dynamic power isn’t irrelevant (yet)
– T ∝ 1/f isn’t correct either: it ignores memory!
• Effect of memory stalls
• T = TCPU + Tmem
• TCPU ∝ 1/f
• Tmem = const
• Estat ∝ T = 1/f + const
> Ignores sleep energy!
Other Approach: “Race to Halt”
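The memory-stall argument can be illustrated by comparing how much a CPU-bound and a memory-bound workload slow down when the clock is halved. This is a sketch with invented workload parameters; `cpu_gcycles` and `mem_s` are hypothetical names.

```python
def exec_time(f_ghz: float, cpu_gcycles: float, mem_s: float) -> float:
    """T = T_CPU + T_mem, with T_CPU ∝ 1/f and T_mem independent of f."""
    return cpu_gcycles / f_ghz + mem_s


def slowdown(f_low: float, f_high: float, cpu_gcycles: float, mem_s: float) -> float:
    """Factor by which execution time grows when dropping from f_high to f_low."""
    return exec_time(f_low, cpu_gcycles, mem_s) / exec_time(f_high, cpu_gcycles, mem_s)


cpu_bound = slowdown(0.5, 1.0, cpu_gcycles=1.0, mem_s=0.0)   # all time scales with 1/f
mem_bound = slowdown(0.5, 1.0, cpu_gcycles=0.2, mem_s=0.8)   # mostly waiting on memory
```

In this model the CPU-bound workload takes twice as long at half the clock, while the memory-bound one takes only about 1.2 times as long, so the static-energy penalty of a low frequency is much smaller for it.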
9. > Run at maximum f, then go to sleep
> Earlier completion ⇒ longer sleep
• E = Edyn + Estat + Esleep
• Esleep = Psleep Tsleep
• Tsleep = T0 – T
• Esleep = Psleep (T0 - T)
> Still ignores dynamic energy!
Other Approach: “Race to Halt” (2)
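Putting dynamic, static and sleep energy into one model shows that neither heuristic wins in general. The sketch below assumes a fixed period T0, active power a·f³ + Pstat (from V ∝ f), and invented constants; it is not the model used in the talk.

```python
def period_energy(f: float, p_sleep: float, t0: float = 2.0,
                  cpu: float = 1.0, mem: float = 0.0,
                  a: float = 1.0, p_stat: float = 2.0) -> float:
    """Energy over a period t0: run at frequency f, then sleep for the remainder.

    All constants are illustrative assumptions in relative units.
    """
    t = cpu / f + mem                      # T = T_CPU + T_mem
    if t > t0:
        raise ValueError("work does not finish within the period")
    p_active = a * f ** 3 + p_stat         # P_dyn ∝ f V^2 with V ∝ f, plus static power
    return p_active * t + p_sleep * (t0 - t)


# Race to halt (f = 1.0) versus lowest frequency that still meets the period (f = 0.5).
low_power_sleep = (period_energy(1.0, p_sleep=0.1), period_energy(0.5, p_sleep=0.1))
high_power_sleep = (period_energy(1.0, p_sleep=1.5), period_energy(0.5, p_sleep=1.5))
```

With a low-power sleep state, racing to halt uses less energy (3.1 versus 4.25); with a high-power sleep state the slow setting wins (4.5 versus 4.25). The ranking flips purely because of the sleep power.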
11. Real Data: Total Energy (Measured)
[Figure: measured total energy; series labeled CPU-bound, Memory-bound and Naïve model]
12. Real Data: Including Sleep Energy
[Figure: measured energy including sleep energy; series labeled High-power sleep state and Low-power sleep state]
13. > Energy management is complex!
> Optimal setting depends on:
• Workload
– memory-bound vs CPU-bound vs in-between
• Hardware platform
– static vs dynamic energy
– CPU vs memory power
– depth of sleep states and cost of entering
> Simple models don’t work!
Summary: Energy-Management Basics
14. > How to establish memory-boundedness?
> Easy way out: pre-characterization
• measure behavior off-line
• determine optimal power setting by model or trial-and-error
> Ok-ish for pre-defined workloads
> Unsuitable for open systems
• ... such as phones
> Tricky with apps which change behavior
Characterizing Workloads
15. > Need to observe app and adjust setting
• works for any app
• adjusts to changing behavior
> Solution by [Snowdon et al., EuroSys’09]
> Performance counters are your friends!
• e.g. cache misses indicate memory access
> Can systematically select best counters
• build model of platform
• Linear combination of performance-counter readings
• pre-characterize hardware
• pick counters which provide most accurate model
• using sound statistical methods
Better Way: On-Line Characterization
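The counter-selection step can be sketched in a much simplified form: fit a line from each candidate counter to the measured quantity and keep the counter with the smallest squared error. This is not the method of Snowdon et al., which builds a linear combination of several counters; the counter names and all data values below are invented.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y ≈ slope * x + intercept for one predictor."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def squared_error(xs, ys):
    """Sum of squared residuals of the fitted line."""
    slope, intercept = fit_line(xs, ys)
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Hypothetical pre-characterization data: one reading per benchmark run.
counter_readings = {
    "cache_misses": [1.0, 2.0, 3.0, 4.0],
    "branch_mispredicts": [2.0, 1.0, 4.0, 3.0],
}
measured = [3.1, 4.9, 7.1, 8.9]   # made-up measured quantity, e.g. power

best_counter = min(counter_readings,
                   key=lambda name: squared_error(counter_readings[name], measured))
```

Here `cache_misses` tracks the measurement almost linearly and is selected; a real selection would use more benchmarks and a proper multi-variable regression.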
16. > Model predicts energy consumption and relative execution speed
• at present setpoint
• at different setpoints
> Accurately predicts energy- and performance response to DVFS
• within a few %
> Can use this for informed energy-management decisions
On-Line Characterization & Modeling
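Once a model predicts energy and relative speed per setpoint, a policy can choose among them. The sketch below shows one possible policy (minimum energy subject to a performance floor); the `Prediction` type, the policy itself and all numbers are assumptions for illustration.

```python
from typing import NamedTuple


class Prediction(NamedTuple):
    freq_mhz: int
    energy_j: float     # predicted energy for the current workload
    rel_speed: float    # predicted speed relative to the fastest setpoint


def choose_setpoint(predictions, min_speed: float) -> Prediction:
    """Lowest-energy setpoint that still meets the performance floor.

    Falls back to the fastest setpoint if none meets the floor.
    """
    acceptable = [p for p in predictions if p.rel_speed >= min_speed]
    if not acceptable:
        return max(predictions, key=lambda p: p.rel_speed)
    return min(acceptable, key=lambda p: p.energy_j)


# Hypothetical model output for a partly memory-bound workload.
PREDICTIONS = [
    Prediction(300, 1.30, 0.70),
    Prediction(600, 1.00, 0.92),
    Prediction(1000, 1.20, 1.00),
]

chosen = choose_setpoint(PREDICTIONS, min_speed=0.9)
```

With a floor of 0.9 the policy settles on the 600 MHz setpoint; raising the floor to 0.95 forces it to 1000 MHz.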
23. > Implementation of power model and policies
• once for platform vs once for each guest
• no guest has global view, hypervisor does
• integration with other cores (DSPs, baseband processor)
• policy-mechanism separation
Why do it outside the OS?
24. > Controls all resources
• CPU, memory, devices
> De-privileged guest OSes
• execute in user mode
• prevents interference with hypervisor and with other guests
• ensures hypervisor retains control over resources
The Hypervisor
25. > Subsystems compete for it
> Cannot let subsystems manage it
• just as with memory, CPU
> Needs trusted, central authority
> Needs to be done in virtualization layer
Energy is a Global Resource
26. > Mechanisms in hypervisor
> Policies in isolated management module
> Keep hypervisor policy-free
• HW-like
Policy-Mechanism Separation
27. > Additional degree of freedom
• DVFS + sleep states + core shutdown
• Hypervisor supports transparent, temporary consolidation of cores
• Unneeded cores turned off to reduce power
> Different tradeoffs
• Performance vs power close to linear
> Important to manage cores globally
• On average, more cores off than with per-guest management
• Can use deeper sleep state
• Less overall energy use
Enter Multicore
[Figure: two views of the OKL4 Microvisor mapping the VCPUs of Subsystem #1 and Subsystem #2 onto physical CPUs]
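The claim that global management leaves more cores off than per-guest management can be illustrated with a simple count. The sketch assumes load is expressed in percent of one core and that load can be packed freely across cores; the guest loads are invented.

```python
def cores_needed(load_percent: int) -> int:
    """Cores required for a load given in percent of one core (ceiling division)."""
    return -(-load_percent // 100)


def cores_on_per_guest(guest_loads) -> int:
    """Each guest sizes its own core set without seeing the others."""
    return sum(cores_needed(load) for load in guest_loads)


def cores_on_global(guest_loads) -> int:
    """A hypervisor with a global view consolidates all load first."""
    return cores_needed(sum(guest_loads))


GUEST_LOADS = [30, 40, 120]   # hypothetical loads of three guests, in percent of a core

per_guest = cores_on_per_guest(GUEST_LOADS)
consolidated = cores_on_global(GUEST_LOADS)
```

For these loads, per-guest management keeps four cores powered while global consolidation needs only two.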
28. > Cache coherency couples clock frequencies of multiple cores
> OSes running on different cores cannot adjust clock independently
> Requires entity with global view
Enter Multicore: Architectural Constraints
29. > Cores have same ISA but different clock rates
> Hypervisor can determine optimal mapping of subsystems to cores
• Using same infrastructure as for DVFS
• Integrate with temporary core consolidation
Asymmetric Multicore
[Figure: OKL4 Microvisor with a CPU-bound subsystem and a memory-bound subsystem, their VCPUs, a fast CPU and a slow CPU]
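A mapping of subsystems to asymmetric cores could, for instance, place the most CPU-bound subsystem on the fastest core. The sketch below is one naive heuristic, not the mapping used by the OKL4 Microvisor; the memory-boundedness fractions, subsystem names and core speeds are hypothetical, and it assumes one subsystem per core.

```python
def map_subsystems(mem_bound_fraction: dict, core_ghz: dict) -> dict:
    """Pair subsystems with cores: least memory-bound gets the fastest core.

    Assumes as many cores as subsystems; surplus entries on either side are ignored.
    """
    subsystems = sorted(mem_bound_fraction, key=mem_bound_fraction.get)  # most CPU-bound first
    cores = sorted(core_ghz, key=core_ghz.get, reverse=True)             # fastest first
    return dict(zip(subsystems, cores))


mapping = map_subsystems(
    {"video": 0.8, "crypto": 0.1},   # fraction of time stalled on memory (made up)
    {"fast": 1.2, "slow": 0.6},      # core clock in GHz (made up)
)
```

Here the CPU-bound `crypto` subsystem lands on the fast core and the memory-bound `video` subsystem on the slow one, where the lower clock costs it little performance.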
30. > Move subsystems between cores
• including temporary consolidation of different subsystems on common core
> Architectural inter-core dependencies
• cannot manage core clocks independently
> Requires global control
• ... outside individual OSes
• indirection layer between OS and hardware
> No practical alternative to virtualization!
The Future is Multicore
[Figure: OKL4 Microvisor mapping the VCPUs of Subsystem #1 and Subsystem #2 onto physical CPUs]
31. > Virtualization is unavoidable long-term
> ... but provides other benefits short-term
> Early uptake maximises benefits
> Future-proof your designs!
Summary