SlideShare a Scribd company logo
1 of 42
Download to read offline
Krzysztof Kozlowski
k.kozlowski@samsung.com
Samsung R&D Institute Poland
 Overview and goals
 CPUfreq (with clock down)
 Run-time PM and power domains
 SoC low power states
 CPU idle drivers
 Devfreq
 Summary
 Limit the consumption of energy by mobile device
 Do not hurt performance (at least to some extend)
 The speech will focus on ARM architecture and
Samsung’s Exynos System-on-Chip family, although
ideas are not limited to Exynos
 Target devices: smartphones, tablets, wearables
 Mainline Linux kernel is changing very fast
◦ Some details about specific kernel drivers may become
obsolete soon
◦ Details as for current mainline kernel: 3.16
 All measurements were done on development devices:
◦ Custom kernels
◦ Custom operating systems
 They are not representative for market/end products
 Sometimes measurements on these devices may not
be even close to market products
 Many measurements were done in specific custom
configuration, very different from market product, e.g.:
◦ No CPU idle driver, no CPUfreq driver
◦ Booted to init=/bin/sh
 Measured on:
◦ Trats2, smartphone (Exynos 4412, 4 cores,
freq 200-1400 MHz)
◦ Gear1-like wearable (Exynos 4212, 2 cores, 200-1400 MHz)
◦ Exynos 3250 development board (2 cores, 100-1000 MHz)
 Kernel used:
◦ Exynos 4212 and 4412 : linux-next (next-20140804)
◦ Exynos 3250: internal Linux kernel tree 3.10
 Lack of full support in mainline
 Ondemand governor adjusts the frequency and voltage
to current load
 Specific conditions to embedded world – one
frequency and voltage for whole cluster
◦ On most SoCs: one clock frequency/voltage for all CPUs
◦ Except big.LITTLE (e.g. 2 clusters on quad-core Exynos Octa)
 Dual cluster SoCs have big.LITTLE
CPUfreq driver
Photo by Pauli Rautakorpi
 Separate CPUfreq drivers for each SoC
 Moving toward one generic cpufreq-cpu0 driver
 cpufreq-cpu0 requires:
◦ Clock to operate on (provided by clock driver)
◦ Optionally voltage regulator (provided by regulator driver)
◦ Table of Operating Performance Points from Device Tree
 OPP is a tuple of frequency and voltage
 Clock/regulator/OPP frameworks add necessary
abstraction layer and make cpufreq-cpu0 a generic
solution
 Reduces the frequency upon entering WFI or WFE
instruction (Wait for Interrupt/Event)
◦ All cores must be idle
 Behaves like a hardware ondemand CPUfreq governor
◦ But only for clock frequency (voltages remain untouched)
 Supported by most of Exynos SoCs
◦ As part of clock driver for Exynos 3250, Exynos 4 and Exynos
5250 [1]
 Measurements in idle mode (basic CPU idle – WFI), no load
 No benefits if CPUfreq governor is ondemand which is quite obvious
 ARMCLK cannot get below minimal frequency used in CPUfreq driver
 Ondemand CPUfreq reduces also ARM voltage
Board SoC Frequency
[MHz]
Idle [mA] Idle + clock down
[mA]
Trats2 Exynos 4412
1400 198 170
200 115 114
Gear1-like Exynos 4212
1400 102 82
200 60 59
Dev-board Exynos 3250
1000 36.2 26.7
100 19.2 18.5
0
50
100
150
200
250
current[mA]
ARMCLK clk down: energy consumption
Idle
Idle + clock down
 Putting devices into low power states when not used
◦ Optionally: automatically delayed suspends
 Needs support in device drivers
◦ Driver specifies when it is working and idle
 Usage counter (pm_runtime_get()/put())
 Time of last activities (pm_runtime_mark_last_busy())
◦ Driver implements runtime suspend and resume callbacks
 Local power control, powered independently
 Example power domains:
◦ CPU-s
◦ Multi-Format Codec (MFC)
◦ G3D (e.g. Mali)
◦ LCD
◦ Image Signal Processor
◦ Camera
 Linux offers a generic power domain framework [2]
◦ Used by SH-Mobile and Exynos
 Other vendors implement this on their own
 Multiple Linux devices can be attached to a domain
SoC
CPU0
CPU1
MFC
Camera
Camera
JPEG
TV
Mixer
Video
Processor
HDMI
Power Domain Devices
 Integrates with runtime PM
◦ If all attached devices have been suspended, power down
whole domain
 Differences in energy consumption:
Board SoC All domains
enabled
permanently [mA]
Power domain
runtime PM
[mA]
Diff [%]
Trats2 Exynos 4412 124 114 -8%
Dev-board Exynos 3250 24.7 18.5 -25%
 Measurements in idle mode (basic CPU idle - WFI), no load
 When in idle, enter low power state to save energy
◦ Responsible: CPU idle framework and drivers
 Wait for Interrupt (WFI), basic idle state
 Various SoCs support deeper low power states
◦ Msm: retention, standalone power collapse, power collapse
◦ Exynos: ARM Off TOP Running (AFTR), W-AFTR, Low Power
Audio playback (LPA)
 W-AFTR and LPA extends the AFTR low power mode
 Usually they have higher latency for enter and leave
◦ Enter deeper state only if we won’t be awaken right away
 System-level states
◦ Whole system must be prepared
1. CPU[1-n] must be powered off
2. Then CPU0 triggers entering deep low power state
 ARM Off TOP Running (AFTR)
◦ Cortex core is power gated (power supplied but internally
gated)
◦ Most of other modules are powered on (e.g. Audio, MMC,
CoreSight, USB, I2C, UART etc.)
 W-AFTR (on Exynos 3250)
◦ AFTR + power gating everything that can be power gated
◦ Except Dynamic Memory Controller, DDR, RTC
◦ Fast wake-up
 Low Power Audio playback (LPA, on other Exynos SoCs)
◦ AFTR + power gating everything that can be power gated
◦ Contents of L2 cache and TOP modules are preserved
(retention)
◦ Audio related blocks are on
 Sleep
◦ Power not supplied (regulators turned off)
◦ Only ALIVE and RTC blocks are on
◦ Entered with Suspend-to-RAM
Low power state Subsystem Requires additional drivers
WFI Scheduler (idle loop) No, built-in
AFTR, W-AFTR,
LPA
CPU idle
CPU idle driver and support
for board (arch/arm/mach-*)
Sleep Suspend
support for board
(arch/arm/mach-*)
 From CPU idle perspective: states are „coupled”
◦ Entering AFTR/W-AFTR/LPA depends on powering down all
other CPUs[1-n]
◦ If only CPU0 is left on, then CPU idle triggers low power state
 User space may hot unplug idle CPUs[1-n]
◦ echo 0 > /sys/bus/cpu/devices/cpu1/online
◦ E.g. mpdecision daemon for msm
 Kernel may synchronize entering idle states
◦ Coupled CPU idle drivers
 echo mem > /sys/power/state
Sync
Wait for CPU1 to
power down
Power
down
Enter AFTR
or WAFTR
CPU0
awake
Sync
Wait for CPU1 to
power up
Power
down
AFTR or
WAFTR
Woken
up
Woken up
by CPU0
IRQ
Wake
CPU1
 big.LITTLE
◦ Suspend whole cluster
◦ Supported Versatile TC2 (Linux 3.16) and Exynos 5420 (next)
 Exynos AFTR
◦ Currently only for Exynos 4210 and Exynos 5250
◦ Requires manual hot unplug of CPU1 to work
 Coupled CPU idle driver for Exynos (CPU1 off, CPU0
AFTR)
◦ On going work, patch by Daniel Lezcano [3]
 Due to lack of support in mainline all measurements
were done on internal Linux kernel tree 3.10
 Development board with Exynos 3250 (2 cores,
100-1000 MHz)
 The goal of measurements is to show overall
differences between CPU low power states
 Workload: simple computation tasks (work-sleep-
work), mainly used to test scheduler
 Tested configurations:
◦ CPUs enter only WFI (no CPU idle driver)
◦ Power down CPU1 from userspace permanently, CPU0 enters
WFI (no CPU idle driver)
◦ Coupled CPU idle: CPU1 power off, CPU0 AFTR
◦ Coupled CPU idle: CPU1 power off, CPU0 AFTR or W-AFTR
 W-AFTR is entered if certain conditions are met. If no, enter AFTR.
 Conditions:
 Camera, G3D, MFC power domains are off
 Certain bus clocks are gated
 MMC is idle
CPU frequency 2 tasks 3 tasks 4 tasks
WFI (CPU0 + CPU1) 200 MHz 200-600 MHz 200-600 MHz
CPU1 powered off 500 MHz 300-900 MHz 400-1000 MHz
25%
23%
35%35%
44%
46%
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
2 3 4
TotalCPUload[%]
No. tasks
Total CPU usage, during load
WFI (CPU0 + CPU1)
CPU0 WFI + CPU1 always off
22.6
26
28
23.8
26.4
20
21
22
23
24
25
26
27
28
29
2 3 4
current[mA]
No. tasks
Energy consumption, during load
WFI (CPU0 + CPU1)
CPU0 WFI + CPU1 always off
 What about coupled CPU idle drivers (CPU1 off, CPU0
AFTR or W-AFTR)?
◦ No differences because the load prevented entering deeper
idle states
 But a new driver which powers off CPU1 also when
CPU0 is busy makes sense
◦ Workload consolidation in scheduler could also help
(increased idle time of CPU1)
◦ Already discussed and some works are in progress [4]
 Power-aware scheduling [5]
 Sched packing [6]
 Workload consolidation and CPU ConCurrency [7]
CPU idle driver Idle [mA] Diff against WFI
WFI (CPU0 + CPU1) 18.5
CPU0 WFI + CPU1 always off 18.3 -1.0%
CPU idle, coupled, OFF+AFTR 17.8 -3.8%
CPU idle, coupled, OFF+AFTR/WAFTR 9.8 -47.0%
Suspend-to-RAM 0.46
 No load
 Exynos 3250 CPU @100 MHz
0
2
4
6
8
10
12
14
16
18
20
WFI (CPU0 + CPU1) CPU0 WFI + CPU1
always off
Coupled,
OFF+AFTR
Coupled,
OFF+AFTR/WAFTR
Suspend-to-RAM
current[mA]
CPU idle: energy consumption, idle
 Dynamic Voltage and Frequency Scaling for devices
◦ e.g. Memory bus, MFC, G2D, ISP, SD/MMC
 Governors similar to CPUfreq (ondemand etc.)
◦ Current load may be taken from device performance counters
 Exynos: number of read and write operations in Dynamic Memory
Controller
 Mainline: only drivers for Exynos 4 and Exynos 5250
◦ ... but only for Exynos 5250 is working
 Due to lack of support in mainline all measurements
were done on internal Linux kernel tree 3.10
 Development board with Exynos 3250 (2 cores,
100-1000 MHz)
 The goal of measurements is to show overall difference
that devfreq makes
Possible values for Exynos 3250
development board
Min Max
INT voltage 0.80 V 1.00 V
MIF voltage 0.85 V 1.00 V
DMC clock 50 MHz 400 MHz
LEFT/RIGHT buses clock 50 MHz 133 MHz
 No devfreq means that highest voltages and clock rates
are chosen for:
◦ bus clocks
◦ INT and MIF regulators supplying power to many SoC modules
CPU idle driver No devfreq [mA] With devfreq [mA] Diff [%]
WFI (CPU0 + CPU1) 28.9 18.5 -36%
CPU0 WFI + CPU1
always off
28.6 18.3 -36%
CPU idle, coupled,
OFF+AFTR
27.9 17.8 -36%
CPU idle, coupled,
OFF+AFTR/WAFTR
16.3 9.8 -40%
 No load
 Exynos 3250 CPU @100 MHz
0
5
10
15
20
25
30
35
WFI (CPU0 + CPU1) CPU0 WFI + CPU1
always off
Coupled,
OFF+AFTR
Coupled,
OFF+AFTR/WAFTR
current[mA]
Devfreq: energy consumption, idle
devfreq
no devfreq
 Linux kernel has a number of features for reducing
energy usage
◦ Unfortunately many drivers are not yet present in mainline
 Most benefits come with:
◦ Frequency and voltage scaling (CPUfreq, devfreq)
◦ Entering low power CPU states (CPU idle)
◦ Developing good drivers
 For external resources use abstraction provided by the kernel (e.g.
Common Clock Framework, regulators)
 Runtime PM (if it is applicable)
 Still a lot of work to do
◦ Drivers: CPU idle, devfreq
◦ Integration between scheduler, CPUfreq and CPU idle
◦ Mobile devices are somehow different
 Energy consumption is an important factor (How often do you
have to charge your phone?)
 Mobile device is mostly idle (more opportunities to sleep)
 Consolidating workload makes sense (to some extend)
Thank you!
Any questions?
 [1] Common Clock Framework drivers for Exynos 4, 3250 and
5250
drivers/clk/samsung/clk-exynos*.c
 [2] Generic Device Power Domains
include/linux/om_domain.h
 [3] Coupled cpuidle driver for Exynos
Exynos4: cpuidle: support dual CPUs with AFTR state
http://thread.gmane.org/gmane.linux.power-
management.general/44248
 [4] Toward a more power-efficient scheduler, Jonathan Corbet
http://lwn.net/Articles/546664/
 [5] Morten Rasmussen
 Power-aware scheduling v2
https://lkml.org/lkml/2013/10/11/547
 sched: Energy cost model for energy-aware scheduling
https://lkml.org/lkml/2014/5/23/621
 [6] Sched: packing tasks (and newer works), Vincent Guittot
https://lkml.org/lkml/2013/10/18/121
 [7] A new CPU load metric for power-efficient scheduler: CPU
ConCurrency, Yuyang Du
https://lkml.org/lkml/2014/6/25/162

More Related Content

What's hot

Uboot startup sequence
Uboot startup sequenceUboot startup sequence
Uboot startup sequence
Houcheng Lin
 
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
Opersys inc.
 
U boot porting guide for SoC
U boot porting guide for SoCU boot porting guide for SoC
U boot porting guide for SoC
Macpaul Lin
 
XPDDS17: Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
XPDDS17:  Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...XPDDS17:  Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
XPDDS17: Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
The Linux Foundation
 
Linux Porting to a Custom Board
Linux Porting to a Custom BoardLinux Porting to a Custom Board
Linux Porting to a Custom Board
Patrick Bellasi
 

What's hot (20)

LAS16-307: Benchmarking Schedutil in Android
LAS16-307: Benchmarking Schedutil in AndroidLAS16-307: Benchmarking Schedutil in Android
LAS16-307: Benchmarking Schedutil in Android
 
U-Boot - An universal bootloader
U-Boot - An universal bootloader U-Boot - An universal bootloader
U-Boot - An universal bootloader
 
Q4.11: Porting Android to new Platforms
Q4.11: Porting Android to new PlatformsQ4.11: Porting Android to new Platforms
Q4.11: Porting Android to new Platforms
 
Uboot startup sequence
Uboot startup sequenceUboot startup sequence
Uboot startup sequence
 
The Linux Kernel Scheduler (For Beginners) - SFO17-421
The Linux Kernel Scheduler (For Beginners) - SFO17-421The Linux Kernel Scheduler (For Beginners) - SFO17-421
The Linux Kernel Scheduler (For Beginners) - SFO17-421
 
SFO15-TR9: PSCI, ACPI (and UEFI to boot)
SFO15-TR9: PSCI, ACPI (and UEFI to boot)SFO15-TR9: PSCI, ACPI (and UEFI to boot)
SFO15-TR9: PSCI, ACPI (and UEFI to boot)
 
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
Native Android Userspace part of the Embedded Android Workshop at Linaro Conn...
 
Linux Kernel MMC Storage driver Overview
Linux Kernel MMC Storage driver OverviewLinux Kernel MMC Storage driver Overview
Linux Kernel MMC Storage driver Overview
 
SFO15-302: Energy Aware Scheduling: Progress Update
SFO15-302: Energy Aware Scheduling: Progress UpdateSFO15-302: Energy Aware Scheduling: Progress Update
SFO15-302: Energy Aware Scheduling: Progress Update
 
U boot porting guide for SoC
U boot porting guide for SoCU boot porting guide for SoC
U boot porting guide for SoC
 
Arm device tree and linux device drivers
Arm device tree and linux device driversArm device tree and linux device drivers
Arm device tree and linux device drivers
 
linux device driver
linux device driverlinux device driver
linux device driver
 
Embedded Linux BSP Training (Intro)
Embedded Linux BSP Training (Intro)Embedded Linux BSP Training (Intro)
Embedded Linux BSP Training (Intro)
 
LAS16-200: SCMI - System Management and Control Interface
LAS16-200:  SCMI - System Management and Control InterfaceLAS16-200:  SCMI - System Management and Control Interface
LAS16-200: SCMI - System Management and Control Interface
 
Power Management from Linux Kernel to Android
Power Management from Linux Kernel to AndroidPower Management from Linux Kernel to Android
Power Management from Linux Kernel to Android
 
LCU14-410: How to build an Energy Model for your SoC
LCU14-410: How to build an Energy Model for your SoCLCU14-410: How to build an Energy Model for your SoC
LCU14-410: How to build an Energy Model for your SoC
 
XPDDS17: Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
XPDDS17:  Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...XPDDS17:  Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
XPDDS17: Reworking the ARM GIC Emulation & Xen Challenges in the ARM ITS Emu...
 
LAS16-402: ARM Trusted Firmware – from Enterprise to Embedded
LAS16-402: ARM Trusted Firmware – from Enterprise to EmbeddedLAS16-402: ARM Trusted Firmware – from Enterprise to Embedded
LAS16-402: ARM Trusted Firmware – from Enterprise to Embedded
 
Linux Porting to a Custom Board
Linux Porting to a Custom BoardLinux Porting to a Custom Board
Linux Porting to a Custom Board
 
Qemu Introduction
Qemu IntroductionQemu Introduction
Qemu Introduction
 

Viewers also liked

Android power management
Android power managementAndroid power management
Android power management
Jerrin George
 

Viewers also liked (6)

SoC Idling for unconf COSCUP 2016
SoC Idling for unconf COSCUP 2016SoC Idling for unconf COSCUP 2016
SoC Idling for unconf COSCUP 2016
 
Q2.12: Idling ARMs in a busy world: Linux Power Management for ARM Multiclust...
Q2.12: Idling ARMs in a busy world: Linux Power Management for ARM Multiclust...Q2.12: Idling ARMs in a busy world: Linux Power Management for ARM Multiclust...
Q2.12: Idling ARMs in a busy world: Linux Power Management for ARM Multiclust...
 
Kernel Recipes 2015: Introduction to Kernel Power Management
Kernel Recipes 2015: Introduction to Kernel Power ManagementKernel Recipes 2015: Introduction to Kernel Power Management
Kernel Recipes 2015: Introduction to Kernel Power Management
 
LCA13: CPUIDLE: One driver to rule them all?
LCA13: CPUIDLE: One driver to rule them all?LCA13: CPUIDLE: One driver to rule them all?
LCA13: CPUIDLE: One driver to rule them all?
 
Android power management
Android power managementAndroid power management
Android power management
 
Q2.12: Research Update on big.LITTLE MP Scheduling
Q2.12: Research Update on big.LITTLE MP SchedulingQ2.12: Research Update on big.LITTLE MP Scheduling
Q2.12: Research Update on big.LITTLE MP Scheduling
 

Similar to Kernel Features for Reducing Power Consumption on Embedded Devices

ICTCoreCh11
ICTCoreCh11ICTCoreCh11
ICTCoreCh11
garcons0
 
Tutorial Arach N!D
Tutorial Arach N!DTutorial Arach N!D
Tutorial Arach N!D
kameshsept
 
Arm cortex-m3 by-joe_bungo_arm
Arm cortex-m3 by-joe_bungo_armArm cortex-m3 by-joe_bungo_arm
Arm cortex-m3 by-joe_bungo_arm
Prashant Ahire
 
Hardware Management Module
Hardware Management ModuleHardware Management Module
Hardware Management Module
Aero Plane
 

Similar to Kernel Features for Reducing Power Consumption on Embedded Devices (20)

Nvidia tegra K1 Presentation
Nvidia tegra K1 PresentationNvidia tegra K1 Presentation
Nvidia tegra K1 Presentation
 
embedded system introduction to microcontrollers
embedded system introduction to microcontrollersembedded system introduction to microcontrollers
embedded system introduction to microcontrollers
 
Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1
 
Power management
Power managementPower management
Power management
 
Computer Architecture and Organization
Computer Architecture and OrganizationComputer Architecture and Organization
Computer Architecture and Organization
 
Iot Workshop NITT 2015
Iot Workshop NITT 2015Iot Workshop NITT 2015
Iot Workshop NITT 2015
 
ARM - Advance RISC Machine
ARM - Advance RISC MachineARM - Advance RISC Machine
ARM - Advance RISC Machine
 
arm
armarm
arm
 
IDF'16 San Francisco - Overclocking Session
IDF'16 San Francisco - Overclocking SessionIDF'16 San Francisco - Overclocking Session
IDF'16 San Francisco - Overclocking Session
 
Microcontroller from basic_to_advanced
Microcontroller from basic_to_advancedMicrocontroller from basic_to_advanced
Microcontroller from basic_to_advanced
 
ICTCoreCh11
ICTCoreCh11ICTCoreCh11
ICTCoreCh11
 
PIC Introduction and explained in detailed
PIC Introduction and explained in detailedPIC Introduction and explained in detailed
PIC Introduction and explained in detailed
 
Tutorial Arach N!D
Tutorial Arach N!DTutorial Arach N!D
Tutorial Arach N!D
 
NFS-320
NFS-320NFS-320
NFS-320
 
Atmega8u2 mur
Atmega8u2 murAtmega8u2 mur
Atmega8u2 mur
 
Embedded systems 101 final
Embedded systems 101 finalEmbedded systems 101 final
Embedded systems 101 final
 
Arm cortex-m3 by-joe_bungo_arm
Arm cortex-m3 by-joe_bungo_armArm cortex-m3 by-joe_bungo_arm
Arm cortex-m3 by-joe_bungo_arm
 
INDUSTRIAL TRAINING REPORT EMBEDDED SYSTEM.pptx
INDUSTRIAL TRAINING REPORT EMBEDDED SYSTEM.pptxINDUSTRIAL TRAINING REPORT EMBEDDED SYSTEM.pptx
INDUSTRIAL TRAINING REPORT EMBEDDED SYSTEM.pptx
 
Hardware Management Module
Hardware Management ModuleHardware Management Module
Hardware Management Module
 
PowerDRC/LVS 2.0.1 released by POLYTEDA
PowerDRC/LVS 2.0.1 released by POLYTEDAPowerDRC/LVS 2.0.1 released by POLYTEDA
PowerDRC/LVS 2.0.1 released by POLYTEDA
 

More from Ryo Jin

More from Ryo Jin (20)

Why is EFL used on Tizen?
Why is EFL used on Tizen?Why is EFL used on Tizen?
Why is EFL used on Tizen?
 
Samsung Z4 User Manual
Samsung Z4 User ManualSamsung Z4 User Manual
Samsung Z4 User Manual
 
Samsung ARTIK 050 (ARTIK ZERO) Modules Data Sheet
Samsung ARTIK 050 (ARTIK ZERO) Modules Data SheetSamsung ARTIK 050 (ARTIK ZERO) Modules Data Sheet
Samsung ARTIK 050 (ARTIK ZERO) Modules Data Sheet
 
Introduction to Watch Face Development with Tizen Studio
Introduction to Watch Face Development with Tizen StudioIntroduction to Watch Face Development with Tizen Studio
Introduction to Watch Face Development with Tizen Studio
 
Tizen 3.0's Window System Integration Layer of OpenGLES/EGL & Vulkan Driver
Tizen 3.0's Window System Integration Layer of OpenGLES/EGL & Vulkan DriverTizen 3.0's Window System Integration Layer of OpenGLES/EGL & Vulkan Driver
Tizen 3.0's Window System Integration Layer of OpenGLES/EGL & Vulkan Driver
 
Panduan Penggunaan Perangkat Wearable Tizen
Panduan Penggunaan Perangkat Wearable TizenPanduan Penggunaan Perangkat Wearable Tizen
Panduan Penggunaan Perangkat Wearable Tizen
 
Cara Menggunakan Smartphone Tizen
Cara Menggunakan Smartphone TizenCara Menggunakan Smartphone Tizen
Cara Menggunakan Smartphone Tizen
 
Gear Fit2 Watchface Design Guide
Gear Fit2 Watchface Design GuideGear Fit2 Watchface Design Guide
Gear Fit2 Watchface Design Guide
 
Samsung Indonesia: Tizen Store
Samsung Indonesia: Tizen StoreSamsung Indonesia: Tizen Store
Samsung Indonesia: Tizen Store
 
Samsung Indonesia: Tizen Wearables
Samsung Indonesia: Tizen WearablesSamsung Indonesia: Tizen Wearables
Samsung Indonesia: Tizen Wearables
 
Samsung Indonesia: Tizen Web Apps
Samsung Indonesia: Tizen Web AppsSamsung Indonesia: Tizen Web Apps
Samsung Indonesia: Tizen Web Apps
 
Samsung Indonesia: Tizen Native App
Samsung Indonesia: Tizen Native AppSamsung Indonesia: Tizen Native App
Samsung Indonesia: Tizen Native App
 
Samsung Indonesia: Tizen Platform Overview and IoT
Samsung Indonesia: Tizen Platform Overview and IoTSamsung Indonesia: Tizen Platform Overview and IoT
Samsung Indonesia: Tizen Platform Overview and IoT
 
Russian Tizen Project
Russian Tizen ProjectRussian Tizen Project
Russian Tizen Project
 
Samsung SM-R360 Tizen User Manual
Samsung SM-R360 Tizen User ManualSamsung SM-R360 Tizen User Manual
Samsung SM-R360 Tizen User Manual
 
Tizen Micro Profile for IoT device
Tizen Micro Profile for IoT deviceTizen Micro Profile for IoT device
Tizen Micro Profile for IoT device
 
Panduan Dasar Pemrograman Tizen
Panduan Dasar Pemrograman TizenPanduan Dasar Pemrograman Tizen
Panduan Dasar Pemrograman Tizen
 
The Story of Enlightenment, EFL, Tizen and Wayland
The Story of Enlightenment, EFL, Tizen and WaylandThe Story of Enlightenment, EFL, Tizen and Wayland
The Story of Enlightenment, EFL, Tizen and Wayland
 
Tizen PASS
Tizen PASSTizen PASS
Tizen PASS
 
Tizen PASS
Tizen PASSTizen PASS
Tizen PASS
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Kernel Features for Reducing Power Consumption on Embedded Devices

  • 2.  Overview and goals  CPUfreq (with clock down)  Run-time PM and power domains  SoC low power states  CPU idle drivers  Devfreq  Summary
  • 3.  Limit the consumption of energy by mobile device  Do not hurt performance (at least to some extend)  The speech will focus on ARM architecture and Samsung’s Exynos System-on-Chip family, although ideas are not limited to Exynos  Target devices: smartphones, tablets, wearables  Mainline Linux kernel is changing very fast ◦ Some details about specific kernel drivers may become obsolete soon ◦ Details as for current mainline kernel: 3.16
  • 4.  All measurements were done on development devices: ◦ Custom kernels ◦ Custom operating systems  They are not representative for market/end products  Sometimes measurements on these devices may not be even close to market products  Many measurements were done in specific custom configuration, very different from market product, e.g.: ◦ No CPU idle driver, no CPUfreq driver ◦ Booted to init=/bin/sh
  • 5.  Measured on: ◦ Trats2, smartphone (Exynos 4412, 4 cores, freq 200-1400 MHz) ◦ Gear1-like wearable (Exynos 4212, 2 cores, 200-1400 MHz) ◦ Exynos 3250 development board (2 cores, 100-1000 MHz)  Kernel used: ◦ Exynos 4212 and 4412 : linux-next (next-20140804) ◦ Exynos 3250: internal Linux kernel tree 3.10  Lack of full support in mainline
  • 6.  Ondemand governor adjusts the frequency and voltage to current load  Specific conditions to embedded world – one frequency and voltage for whole cluster ◦ On most SoCs: one clock frequency/voltage for all CPUs ◦ Except big.LITTLE (e.g. 2 clusters on quad-core Exynos Octa)  Dual cluster SoCs have big.LITTLE CPUfreq driver Photo by Pauli Rautakorpi
  • 7.  Separate CPUfreq drivers for each SoC  Moving toward one generic cpufreq-cpu0 driver  cpufreq-cpu0 requires: ◦ Clock to operate on (provided by clock driver) ◦ Optionally voltage regulator (provided by regulator driver) ◦ Table of Operating Performance Points from Device Tree  OPP is a tuple of frequency and voltage  Clock/regulator/OPP frameworks add necessary abstraction layer and make cpufreq-cpu0 a generic solution
  • 8.  Reduces the frequency upon entering WFI or WFE instruction (Wait for Interrupt/Event) ◦ All cores must be idle  Behaves like a hardware ondemand CPUfreq governor ◦ But only for clock frequency (voltages remain untouched)  Supported by most of Exynos SoCs ◦ As part of clock driver for Exynos 3250, Exynos 4 and Exynos 5250 [1]
  • 9.  Measurements in idle mode (basic CPU idle – WFI), no load  No benefits if CPUfreq governor is ondemand which is quite obvious  ARMCLK cannot get below minimal frequency used in CPUfreq driver  Ondemand CPUfreq reduces also ARM voltage Board SoC Frequency [MHz] Idle [mA] Idle + clock down [mA] Trats2 Exynos 4412 1400 198 170 200 115 114 Gear1-like Exynos 4212 1400 102 82 200 60 59 Dev-board Exynos 3250 1000 36.2 26.7 100 19.2 18.5
  • 10. 0 50 100 150 200 250 current[mA] ARMCLK clk down: energy consumption Idle Idle + clock down
  • 11.
  • 12.  Putting devices into low power states when not used ◦ Optionally: automatically delayed suspends  Needs support in device drivers ◦ Driver specifies when it is working and idle  Usage counter (pm_runtime_get()/put())  Time of last activities (pm_runtime_mark_last_busy()) ◦ Driver implements runtime suspend and resume callbacks
  • 13.  Local power control, powered independently  Example power domains: ◦ CPU-s ◦ Multi-Format Codec (MFC) ◦ G3D (e.g. Mali) ◦ LCD ◦ Image Signal Processor ◦ Camera  Linux offers a generic power domain framework [2] ◦ Used by SH-Mobile and Exynos  Other vendors implement this on their own
  • 14.  Multiple Linux devices can be attached to a domain SoC CPU0 CPU1 MFC Camera Camera JPEG TV Mixer Video Processor HDMI Power Domain Devices
  • 15.  Integrates with runtime PM ◦ If all attached devices have been suspended, power down whole domain  Differences in energy consumption: Board SoC All domains enabled permanently [mA] Power domain runtime PM [mA] Diff [%] Trats2 Exynos 4412 124 114 -8% Dev-board Exynos 3250 24.7 18.5 -25%  Measurements in idle mode (basic CPU idle - WFI), no load
  • 16.
  • 17.  When in idle, enter low power state to save energy ◦ Responsible: CPU idle framework and drivers  Wait for Interrupt (WFI), basic idle state  Various SoCs support deeper low power states ◦ Msm: retention, standalone power collapse, power collapse ◦ Exynos: ARM Off TOP Running (AFTR), W-AFTR, Low Power Audio playback (LPA)  W-AFTR and LPA extends the AFTR low power mode  Usually they have higher latency for enter and leave ◦ Enter deeper state only if we won’t be awaken right away
  • 18.  System-level states ◦ Whole system must be prepared 1. CPU[1-n] must be powered off 2. Then CPU0 triggers entering deep low power state  ARM Off TOP Running (AFTR) ◦ Cortex core is power gated (power supplied but internally gated) ◦ Most of other modules are powered on (e.g. Audio, MMC, CoreSight, USB, I2C, UART etc.)
  • 19.  W-AFTR (on Exynos 3250) ◦ AFTR + power gating everything that can be power gated ◦ Except Dynamic Memory Controller, DDR, RTC ◦ Fast wake-up  Low Power Audio playback (LPA, on other Exynos SoCs) ◦ AFTR + power gating everything that can be power gated ◦ Contents of L2 cache and TOP modules are preserved (retention) ◦ Audio related blocks are on
  • 20.  Sleep ◦ Power not supplied (regulators turned off) ◦ Only ALIVE and RTC blocks are on ◦ Entered with Suspend-to-RAM Low power state Subsystem Requires additional drivers WFI Scheduler (idle loop) No, built-in AFTR, W-AFTR, LPA CPU idle CPU idle driver and support for board (arch/arm/mach-*) Sleep Suspend support for board (arch/arm/mach-*)
  • 21.  From CPU idle perspective: states are „coupled” ◦ Entering AFTR/W-AFTR/LPA depends on powering down all other CPUs[1-n] ◦ If only CPU0 is left on, then CPU idle triggers low power state  User space may hot unplug idle CPUs[1-n] ◦ echo 0 > /sys/bus/cpu/devices/cpu1/online ◦ E.g. mpdecision daemon for msm  Kernel may synchronize entering idle states ◦ Coupled CPU idle drivers  echo mem > /sys/power/state
  • 22. Sync Wait for CPU1 to power down Power down Enter AFTR or WAFTR CPU0 awake
  • 23. Sync Wait for CPU1 to power up Power down AFTR or WAFTR Woken up Woken up by CPU0 IRQ Wake CPU1
  • 24.  big.LITTLE ◦ Suspend whole cluster ◦ Supported Versatile TC2 (Linux 3.16) and Exynos 5420 (next)  Exynos AFTR ◦ Currently only for Exynos 4210 and Exynos 5250 ◦ Requires manual hot unplug of CPU1 to work  Coupled CPU idle driver for Exynos (CPU1 off, CPU0 AFTR) ◦ On going work, patch by Daniel Lezcano [3]
  • 25.  Due to lack of support in mainline all measurements were done on internal Linux kernel tree 3.10  Development board with Exynos 3250 (2 cores, 100-1000 MHz)  The goal of measurements is to show overall differences between CPU low power states  Workload: simple computation tasks (work-sleep- work), mainly used to test scheduler
  • 26.  Tested configurations: ◦ CPUs enter only WFI (no CPU idle driver) ◦ Power down CPU1 from userspace permanently, CPU0 enters WFI (no CPU idle driver) ◦ Coupled CPU idle: CPU1 power off, CPU0 AFTR ◦ Coupled CPU idle: CPU1 power off, CPU0 AFTR or W-AFTR  W-AFTR is entered if certain conditions are met. If no, enter AFTR.  Conditions:  Camera, G3D, MFC power domains are off  Certain bus clocks are gated  MMC is idle
  • 27. CPU frequency 2 tasks 3 tasks 4 tasks WFI (CPU0 + CPU1) 200 MHz 200-600 MHz 200-600 MHz CPU1 powered off 500 MHz 300-900 MHz 400-1000 MHz 25% 23% 35%35% 44% 46% 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% 2 3 4 TotalCPUload[%] No. tasks Total CPU usage, during load WFI (CPU0 + CPU1) CPU0 WFI + CPU1 always off
  • 28. 22.6 26 28 23.8 26.4 20 21 22 23 24 25 26 27 28 29 2 3 4 current[mA] No. tasks Energy consumption, during load WFI (CPU0 + CPU1) CPU0 WFI + CPU1 always off
  • 29.  What about coupled CPU idle drivers (CPU1 off, CPU0 AFTR or W-AFTR)? ◦ No differences because the load prevented entering deeper idle states  But a new driver which powers off CPU1 also when CPU0 is busy makes sense ◦ Workload consolidation in scheduler could also help (increased idle time of CPU1) ◦ Already discussed and some works are in progress [4]  Power-aware scheduling [5]  Sched packing [6]  Workload consolidation and CPU ConCurrency [7]
  • 30. CPU idle driver Idle [mA] Diff against WFI WFI (CPU0 + CPU1) 18.5 CPU0 WFI + CPU1 always off 18.3 -1.0% CPU idle, coupled, OFF+AFTR 17.8 -3.8% CPU idle, coupled, OFF+AFTR/WAFTR 9.8 -47.0% Suspend-to-RAM 0.46  No load  Exynos 3250 CPU @100 MHz
  • 31. 0 2 4 6 8 10 12 14 16 18 20 WFI (CPU0 + CPU1) CPU0 WFI + CPU1 always off Coupled, OFF+AFTR Coupled, OFF+AFTR/WAFTR Suspend-to-RAM current[mA] CPU idle: energy consumption, idle
  • 32.
  • 33.  Dynamic Voltage and Frequency Scaling for devices ◦ e.g. Memory bus, MFC, G2D, ISP, SD/MMC  Governors similar to CPUfreq (ondemand etc.) ◦ Current load may be taken from device performance counters  Exynos: number of read and write operations in Dynamic Memory Controller  Mainline: only drivers for Exynos 4 and Exynos 5250 ◦ ... but only for Exynos 5250 is working
  • 34.  Due to lack of support in mainline all measurements were done on internal Linux kernel tree 3.10  Development board with Exynos 3250 (2 cores, 100-1000 MHz)  The goal of measurements is to show overall difference that devfreq makes
  • 35. Possible values for Exynos 3250 development board Min Max INT voltage 0.80 V 1.00 V MIF voltage 0.85 V 1.00 V DMC clock 50 MHz 400 MHz LEFT/RIGHT buses clock 50 MHz 133 MHz  No devfreq means that highest voltages and clock rates are chosen for: ◦ bus clocks ◦ INT and MIF regulators supplying power to many SoC modules
  • 36. CPU idle driver No devfreq [mA] With devfreq [mA] Diff [%] WFI (CPU0 + CPU1) 28.9 18.5 -36% CPU0 WFI + CPU1 always off 28.6 18.3 -36% CPU idle, coupled, OFF+AFTR 27.9 17.8 -36% CPU idle, coupled, OFF+AFTR/WAFTR 16.3 9.8 -40%  No load  Exynos 3250 CPU @100 MHz
  • 37. 0 5 10 15 20 25 30 35 WFI (CPU0 + CPU1) CPU0 WFI + CPU1 always off Coupled, OFF+AFTR Coupled, OFF+AFTR/WAFTR current[mA] Devfreq: energy consumption, idle devfreq no devfreq
  • 38.  Linux kernel has a number of features for reducing energy usage ◦ Unfortunately many drivers are not yet present in mainline  Most benefits come with: ◦ Frequency and voltage scaling (CPUfreq, devfreq) ◦ Entering low power CPU states (CPU idle) ◦ Developing good drivers  For external resources use abstraction provided by the kernel (e.g. Common Clock Framework, regulators)  Runtime PM (if it is applicable)
  • 39.  Still a lot of work to do ◦ Drivers: CPU idle, devfreq ◦ Integration between scheduler, CPUfreq and CPU idle ◦ Mobile devices are somehow different  Energy consumption is an important factor (How often do you have to charge your phone?)  Mobile device is mostly idle (more opportunities to sleep)  Consolidating workload makes sense (to some extend)
  • 41.  [1] Common Clock Framework drivers for Exynos 4, 3250 and 5250 drivers/clk/samsung/clk-exynos*.c  [2] Generic Device Power Domains include/linux/om_domain.h  [3] Coupled cpuidle driver for Exynos Exynos4: cpuidle: support dual CPUs with AFTR state http://thread.gmane.org/gmane.linux.power- management.general/44248  [4] Toward a more power-efficient scheduler, Jonathan Corbet http://lwn.net/Articles/546664/
  • 42.  [5] Morten Rasmussen  Power-aware scheduling v2 https://lkml.org/lkml/2013/10/11/547  sched: Energy cost model for energy-aware scheduling https://lkml.org/lkml/2014/5/23/621  [6] Sched: packing tasks (and newer works), Vincent Guittot https://lkml.org/lkml/2013/10/18/121  [7] A new CPU load metric for power-efficient scheduler: CPU ConCurrency, Yuyang Du https://lkml.org/lkml/2014/6/25/162