SlideShare a Scribd company logo
1
ACPI-next C-States
Charles Garcia-Tobin
Oct 2013
2
ACPI-Next C-states
 Numerical non-equivalency – Types of states
 Topology awareness
 Additional Information: Version, BreakEven, S/R, Cache
 Device, Power Resource, and Interrupt dependencies
 PSCI
3
Standardise types of State for ARM
 ARM Linux community uses ACPI derived terminology:
 C-State, P-state
 But it’s not done in a very standard way. Some “general
rules”
 C1 tend to be WFI
 Larger numbers mean deeper states
 Hard to compare systems
 ACPI has strong definitions for C0-C3 states
 Allows more states, but they are of type C1,C2 or C3
 But are not defined in ARM terms
4
Types of State
Run
Hierarchy is running fully operational
Idle
Hierarchy is not executing, but has preserved all
context, will wake up through an interrupt
Sleep
Hierarchy is not executing, context is lost and must be
saved/restored, will wake up through an interrupt.
Caches in the hierarchy are off
Off
Hierarchy is not executing, context is lost and wake up
will only occur by an explicit software command or By
physical hardware reset
zzz
RIP
5
Types of State
Run
Hierarchy is running fully operational
Idle
Hierarchy is not executing, but has preserved all
context, will wake up through an interrupt
Sleep
Hierarchy is not executing, context is lost and must be
saved/restored, will wake up through an interrupt
Caches in the hierarchy are off
Off
Hierarchy is not executing, context is lost and wake up
will only occur by an explicit software command or By
physical hardware reset
zzz
RIP
C0
C3
C1
6
Types of State
 Propose updating language for C1,C2,C3 so that for ARM
based systems:
 C1 is state (WFI) and also a type of state Idle
 C2 not used on ARM systems (as type of state)
 C3 as a type of state has the semantics of Sleep
7
Topology
 ACPI uses _CSD to express coupled C-states
 _CSD relies on the notion of symmetry : Same states for all
processors
 This doesn’t bode well for heterogeneous systems that might
have different number of states per processor
 Feedback from OSVs, is that _CSD is not a very nice object
to work with in kernel
8
Topology
 ACPI has a tree like name space
 We can use that to represent topology directly
 Modules : Structure that can contain other modules or
devices
 We can declare individual CPU states
 We can declare cluster/system states
 Can represent any arbitrary topology
9
Topology
Module System {
_CST(Sleep System)
Module Cluster 0 {
_CST(Sleep Cluster)
Processor CPU0 {
_CST(Idle,
Retention1,Sleep)
}
Processor CPU1 {
_CST(Idle,Sleep)
}
}
Module Cluster 1 {
_CST(Sleep Cluster)
Processor CPU2 {
_CST(Idle,
Retention1,
Retention2,
Sleep)
}
}}
10
Additional Information
 _CST doesn’t have enough information for an OSPM to
manage an ARM system’s power states
 No representation of what context may be lost
 No representation of what caches may be lost/if they need
management or not
 Last core?
 Break Even Latency missing
 No easy way to link C-states with device states or power resource
states
 No easy way to describe wake capability of an interrupt
 Not every interrupt can wake you from every state
 No versioning
11
Additional Information
 Proposal is to replace _CST with new object method _CSX
Package {
Version, // Integer (WORD)
Count, // Integer (WORD)
CStatesX[0], // Package
…
CStatesX[N-1] // Package
}
 Version is self explanatory. It would start at 0
12
Additional Information
 CStateX
Package(9) {
Register // Buffer (Resource Descriptor)
Type // Integer (BYTE)
Latency // Integer (WORD)
Power // Integer (DWORD)
BreakEven // Integer (DWORD)
GenStateFlags // Integer (WORD)
ArchStateFlags // Integer (DWORD)
PowerDepth, // Integer (BYTE)
LastMan, // Reference
}
13
Additional Information
 BreakEven:
 How long do I have to be in this state compared to WFI to save power
 GenStateFlags:
 OSPM_MANAGE_CACHE:
 Set if OSPM has to manage caches,
 CACHES_CONTENTS_LOST
 Set if caches up to current topology level are lost with this power state
 (More on this later)
 PLATFORM_COORDINATED:
 If set, platform coordinates and does last man standing.
 If state is requested it will be entered when last CPU requests that state
or deeper
 If clear if state is requested it will be entered. OSPM has to coordinate
 ArchStateFlags:
 Architecture specific flags. For ARM: what context needs to be saved/restored
 Generic Registers, VFP, Debug context, Timer, CPU Interface, GICD
14
Additional Information
 PowerDepth:
 Identifier describing how deep a power state is
 Bigger number -> Bigger power savings -> Bigger latencies
 Defaults to 0
 Used to describe dependencies
 PowerResources: if ON States of depth X or higher are not available
 Requires augmenting PowerResource Definition
 Device state D0, D1, D2, D3hot then:
 States of depth X=FunctionOfDState or higher are not available
 Requires new methods
 IRQ X can wake up power states of PowerDepth Y or lower
 Required augmenting IRQ descriptions
15
Additional Information
 LastMan:
 Allows for cases where only one specific CPU can enter a coupled
State
 If Plaform Coordinated (PLATFORM_COORDINATED flag
set) this should be undefined
 If CPU only power state this should be undefined
 If it is a coupled C state:
 If state can be initiated on any CPU (doesn’t matter who last man is) it
must be undefined
 Otherwise reference to processor node that must initiate the power
state transition
16
Cache
 Propose that each level of topology has a Cache object,
_CLV
 Describes Caches that are private to that part of the topology
 Simplest instance could be simply a list of caches:
_CLV return package of integers
eg: Core has L1 cache = {L1}
Cluster has L2 cache = {L2}
 Combines with flags, CACHES_CONTENTS_LOST and
OSPM_MANAGE_CACHE
17
Cache
 When not all CPUs have
entered a coupled state:
 Caches in sub-hierarchies
will be lost
 CPUx: L1 Module Cluster 0 {
_CSX(…
CStateX : CACHES_CONTENTS_LOST
…)
_CLV( L2 )
Processor CPU0 {
CSX(…)
_CLV( L1)
}
Processor CPU0 {
CSX(…)
_CLV( L1)
}}
L1
CPU1
L1
CPU0
Cluster 0
L2
18
Cache
 When not all CPUs have
entered a coupled state:
 Caches in sub-hierarchies
will be lost
 CPUx: L1
 When last CPU enters
cluster state:
 All caches up to topology
level are lost
 L2, all L1s
Module Cluster 0 {
_CSX(…
CStateX : CACHES_CONTENTS_LOST
…)
_CLV( L2 )
Processor CPU0 {
CSX(…)
_CLV( L1)
}
Processor CPU0 {
CSX(…)
_CLV( L1)
}}
L1
CPU1
L1
CPU0
Cluster 0
L2
19
PSCI
 Two flags in FADT Flags field:
PSCI_PRESENT: 1 if PSCI is implemented
USE_HVC: 1 if HVC is should be used in preference to SMC as
PSCI conduit
 Note no override or function IDs
20
PSCI
 CStateX uses Register to denote how to enter a C-state
 Registers represented GAS: Generic Address Structure
 GAS is very flexible and allows encapsulating “Software as
Hardware” in very specific cirmcunstance. This is called
Function Fixed Hardware
 Power State Parameter of CPU_SUSPEND is the Registers
Address
 Equivalent to method used for Intel today
21
PSCI
 GAS Register format for a C-State
Field Description
Address Space ID 0x7f (FFH identifier)
Register Bit Width 32
Register Bit Offset 0
Access Size 3 (DWORD)
Address Upper DWORD must be zero
Lower DWORD must denote the PSCI power_state
parameter for the CPU_SUSPEND call
Address: 0xF000000_00000000 is reserved for WFI
this would not result in a PSCI call
22
Questions
???

More Related Content

Viewers also liked

Note - (EDK2) Acpi Tables Compile and Install
Note - (EDK2) Acpi Tables Compile and InstallNote - (EDK2) Acpi Tables Compile and Install
Note - (EDK2) Acpi Tables Compile and Install
boyw165
 
BIOS, Linux and Firmware Test Suite in-between
BIOS, Linux and  Firmware Test Suite in-betweenBIOS, Linux and  Firmware Test Suite in-between
BIOS, Linux and Firmware Test Suite in-between
Alex Hung
 
Las16 200 - firmware summit - ras what is it- why do we need it
Las16 200 - firmware summit - ras what is it- why do we need itLas16 200 - firmware summit - ras what is it- why do we need it
Las16 200 - firmware summit - ras what is it- why do we need it
Linaro
 
Linux Initialization Process (1)
Linux Initialization Process (1)Linux Initialization Process (1)
Linux Initialization Process (1)
shimosawa
 
Hardware Probing in the Linux Kernel
Hardware Probing in the Linux KernelHardware Probing in the Linux Kernel
Hardware Probing in the Linux Kernel
Kernel TLV
 
High Performance Storage Devices in the Linux Kernel
High Performance Storage Devices in the Linux KernelHigh Performance Storage Devices in the Linux Kernel
High Performance Storage Devices in the Linux Kernel
Kernel TLV
 
Linux Initialization Process (2)
Linux Initialization Process (2)Linux Initialization Process (2)
Linux Initialization Process (2)
shimosawa
 
UEFI presentation
UEFI presentationUEFI presentation
UEFI presentation
Bruno Cornec
 
Power aware operating system
Power aware operating systemPower aware operating system
Power aware operating system
Rajan Kumar Yadav
 
BUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open SourceBUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open Source
Linaro
 

Viewers also liked (10)

Note - (EDK2) Acpi Tables Compile and Install
Note - (EDK2) Acpi Tables Compile and InstallNote - (EDK2) Acpi Tables Compile and Install
Note - (EDK2) Acpi Tables Compile and Install
 
BIOS, Linux and Firmware Test Suite in-between
BIOS, Linux and  Firmware Test Suite in-betweenBIOS, Linux and  Firmware Test Suite in-between
BIOS, Linux and Firmware Test Suite in-between
 
Las16 200 - firmware summit - ras what is it- why do we need it
Las16 200 - firmware summit - ras what is it- why do we need itLas16 200 - firmware summit - ras what is it- why do we need it
Las16 200 - firmware summit - ras what is it- why do we need it
 
Linux Initialization Process (1)
Linux Initialization Process (1)Linux Initialization Process (1)
Linux Initialization Process (1)
 
Hardware Probing in the Linux Kernel
Hardware Probing in the Linux KernelHardware Probing in the Linux Kernel
Hardware Probing in the Linux Kernel
 
High Performance Storage Devices in the Linux Kernel
High Performance Storage Devices in the Linux KernelHigh Performance Storage Devices in the Linux Kernel
High Performance Storage Devices in the Linux Kernel
 
Linux Initialization Process (2)
Linux Initialization Process (2)Linux Initialization Process (2)
Linux Initialization Process (2)
 
UEFI presentation
UEFI presentationUEFI presentation
UEFI presentation
 
Power aware operating system
Power aware operating systemPower aware operating system
Power aware operating system
 
BUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open SourceBUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open Source
 

Similar to LCU13: ACPI power state mapping

Q2.12: Idle Power States Nomenclature
Q2.12: Idle Power States NomenclatureQ2.12: Idle Power States Nomenclature
Q2.12: Idle Power States Nomenclature
Linaro
 
LCA13: Power State Coordination Interface
LCA13: Power State Coordination InterfaceLCA13: Power State Coordination Interface
LCA13: Power State Coordination Interface
Linaro
 
Hibernation in Linux 2.6.29
Hibernation in Linux 2.6.29Hibernation in Linux 2.6.29
Hibernation in Linux 2.6.29
Varun Mahajan
 
Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1
padmanabha srinivasaiah
 
“Linux Kernel CPU Hotplug in the Multicore System”
“Linux Kernel CPU Hotplug in the Multicore System”“Linux Kernel CPU Hotplug in the Multicore System”
“Linux Kernel CPU Hotplug in the Multicore System”
GlobalLogic Ukraine
 
Smashing the stack for fun and profit
Smashing the stack for fun and profitSmashing the stack for fun and profit
Smashing the stack for fun and profit
Alexey Miasoedov
 
ARM Architecture and Meltdown/Spectre
ARM Architecture and Meltdown/SpectreARM Architecture and Meltdown/Spectre
ARM Architecture and Meltdown/Spectre
GlobalLogic Ukraine
 
SCSI Protocol
SCSI ProtocolSCSI Protocol
SCSI Protocol
Rakesh T
 
Different type of shells In Netapp Cluster mode 8.X and how to access them t...
Different type of shells In Netapp Cluster mode 8.X  and how to access them t...Different type of shells In Netapp Cluster mode 8.X  and how to access them t...
Different type of shells In Netapp Cluster mode 8.X and how to access them t...
Saroj Sahu
 
Cassandra no sql ecosystem
Cassandra no sql ecosystemCassandra no sql ecosystem
x86_1.ppt
x86_1.pptx86_1.ppt
x86_1.ppt
jeronimored
 
All you didn't know about the CAP theorem
All you didn't know about the CAP theoremAll you didn't know about the CAP theorem
All you didn't know about the CAP theorem
Kanstantsin Hontarau
 
Juniper Trouble Shooting
Juniper Trouble ShootingJuniper Trouble Shooting
Juniper Trouble Shooting
Mike(Haobin) Zheng
 
Cassandra Fundamentals - C* 2.0
Cassandra Fundamentals - C* 2.0Cassandra Fundamentals - C* 2.0
Cassandra Fundamentals - C* 2.0
Russell Spitzer
 
HA, SRX Cluster & Redundancy Groups
HA, SRX Cluster & Redundancy GroupsHA, SRX Cluster & Redundancy Groups
HA, SRX Cluster & Redundancy Groups
Kashif Latif
 
Cache performance-x86-2009
Cache performance-x86-2009Cache performance-x86-2009
Cache performance-x86-2009
Léia de Sousa
 
Chassis Cluster Configuration
Chassis Cluster ConfigurationChassis Cluster Configuration
Chassis Cluster Configuration
Kashif Latif
 
Intel x86 Architecture
Intel x86 ArchitectureIntel x86 Architecture
Intel x86 Architecture
ChangWoo Min
 
Arm11
Arm11Arm11
Arm11
Zogenk II
 
LCA13: ARMv8 Status and Updates
LCA13: ARMv8 Status and UpdatesLCA13: ARMv8 Status and Updates
LCA13: ARMv8 Status and Updates
Linaro
 

Similar to LCU13: ACPI power state mapping (20)

Q2.12: Idle Power States Nomenclature
Q2.12: Idle Power States NomenclatureQ2.12: Idle Power States Nomenclature
Q2.12: Idle Power States Nomenclature
 
LCA13: Power State Coordination Interface
LCA13: Power State Coordination InterfaceLCA13: Power State Coordination Interface
LCA13: Power State Coordination Interface
 
Hibernation in Linux 2.6.29
Hibernation in Linux 2.6.29Hibernation in Linux 2.6.29
Hibernation in Linux 2.6.29
 
Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1Linux_swspnd_v0.3_pub1
Linux_swspnd_v0.3_pub1
 
“Linux Kernel CPU Hotplug in the Multicore System”
“Linux Kernel CPU Hotplug in the Multicore System”“Linux Kernel CPU Hotplug in the Multicore System”
“Linux Kernel CPU Hotplug in the Multicore System”
 
Smashing the stack for fun and profit
Smashing the stack for fun and profitSmashing the stack for fun and profit
Smashing the stack for fun and profit
 
ARM Architecture and Meltdown/Spectre
ARM Architecture and Meltdown/SpectreARM Architecture and Meltdown/Spectre
ARM Architecture and Meltdown/Spectre
 
SCSI Protocol
SCSI ProtocolSCSI Protocol
SCSI Protocol
 
Different type of shells In Netapp Cluster mode 8.X and how to access them t...
Different type of shells In Netapp Cluster mode 8.X  and how to access them t...Different type of shells In Netapp Cluster mode 8.X  and how to access them t...
Different type of shells In Netapp Cluster mode 8.X and how to access them t...
 
Cassandra no sql ecosystem
Cassandra no sql ecosystemCassandra no sql ecosystem
Cassandra no sql ecosystem
 
x86_1.ppt
x86_1.pptx86_1.ppt
x86_1.ppt
 
All you didn't know about the CAP theorem
All you didn't know about the CAP theoremAll you didn't know about the CAP theorem
All you didn't know about the CAP theorem
 
Juniper Trouble Shooting
Juniper Trouble ShootingJuniper Trouble Shooting
Juniper Trouble Shooting
 
Cassandra Fundamentals - C* 2.0
Cassandra Fundamentals - C* 2.0Cassandra Fundamentals - C* 2.0
Cassandra Fundamentals - C* 2.0
 
HA, SRX Cluster & Redundancy Groups
HA, SRX Cluster & Redundancy GroupsHA, SRX Cluster & Redundancy Groups
HA, SRX Cluster & Redundancy Groups
 
Cache performance-x86-2009
Cache performance-x86-2009Cache performance-x86-2009
Cache performance-x86-2009
 
Chassis Cluster Configuration
Chassis Cluster ConfigurationChassis Cluster Configuration
Chassis Cluster Configuration
 
Intel x86 Architecture
Intel x86 ArchitectureIntel x86 Architecture
Intel x86 Architecture
 
Arm11
Arm11Arm11
Arm11
 
LCA13: ARMv8 Status and Updates
LCA13: ARMv8 Status and UpdatesLCA13: ARMv8 Status and Updates
LCA13: ARMv8 Status and Updates
 

More from Linaro

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Linaro
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Linaro
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Linaro
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
Linaro
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
Linaro
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
Linaro
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
Linaro
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Linaro
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Linaro
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Linaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
Linaro
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
Linaro
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
Linaro
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
Linaro
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
Linaro
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
Linaro
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
Linaro
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
Linaro
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
Linaro
 

More from Linaro (20)

Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea GalloDeep Learning Neural Network Acceleration at the Edge - Andrea Gallo
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
 
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta VekariaArm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
Arm Architecture HPC Workshop Santa Clara 2018 - Kanta Vekaria
 
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua MoraHuawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
Huawei’s requirements for the ARM based HPC solution readiness - Joshua Mora
 
Bud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qaBud17 113: distribution ci using qemu and open qa
Bud17 113: distribution ci using qemu and open qa
 
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
OpenHPC Automation with Ansible - Renato Golin - Linaro Arm HPC Workshop 2018
 
HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018HPC network stack on ARM - Linaro HPC Workshop 2018
HPC network stack on ARM - Linaro HPC Workshop 2018
 
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
It just keeps getting better - SUSE enablement for Arm - Linaro HPC Workshop ...
 
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
Intelligent Interconnect Architecture to Enable Next Generation HPC - Linaro ...
 
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
Yutaka Ishikawa - Post-K and Arm HPC Ecosystem - Linaro Arm HPC Workshop Sant...
 
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
Andrew J Younge - Vanguard Astra - Petascale Arm Platform for U.S. DOE/ASC Su...
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening KeynoteHKG18-100K1 - George Grey: Opening Keynote
HKG18-100K1 - George Grey: Opening Keynote
 
HKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP WorkshopHKG18-318 - OpenAMP Workshop
HKG18-318 - OpenAMP Workshop
 
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainlineHKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
HKG18-501 - EAS on Common Kernel 4.14 and getting (much) closer to mainline
 
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and allHKG18-315 - Why the ecosystem is a wonderful thing, warts and all
HKG18-315 - Why the ecosystem is a wonderful thing, warts and all
 
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse HypervisorHKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
HKG18- 115 - Partitioning ARM Systems with the Jailhouse Hypervisor
 
HKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMUHKG18-TR08 - Upstreaming SVE in QEMU
HKG18-TR08 - Upstreaming SVE in QEMU
 
HKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8MHKG18-113- Secure Data Path work with i.MX8M
HKG18-113- Secure Data Path work with i.MX8M
 
HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation HKG18-120 - Devicetree Schema Documentation and Validation
HKG18-120 - Devicetree Schema Documentation and Validation
 
HKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted bootHKG18-223 - Trusted FirmwareM: Trusted boot
HKG18-223 - Trusted FirmwareM: Trusted boot
 

Recently uploaded

WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 

Recently uploaded (20)

WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 

LCU13: ACPI power state mapping

  • 2. 2 ACPI-Next C-states  Numerical non-equivalency – Types of states  Topology awareness  Additional Information: Version, BreakEven, S/R, Cache  Device, Power Resource, and Interrupt dependencies  PSCI
  • 3. 3 Standardise types of State for ARM  ARM Linux community uses ACPI derived terminology:  C-State, P-state  But it’s not done in a very standard way. Some “general rules”  C1 tend to be WFI  Larger numbers mean deeper states  Hard to compare systems  ACPI has strong definitions for C0-C3 states  Allows more states, but they are of type C1,C2 or C3  But are not defined in ARM terms
  • 4. 4 Types of State Run Hierarchy is running fully operational Idle Hierarchy is not executing, but has preserved all context, will wake up through an interrupt Sleep Hierarchy is not executing, context is lost and must be saved/restored, will wake up through an interrupt. Caches in the hierarchy are off Off Hierarchy is not executing, context is lost and wake up will only occur by an explicit software command or By physical hardware reset zzz RIP
  • 5. 5 Types of State Run Hierarchy is running fully operational Idle Hierarchy is not executing, but has preserved all context, will wake up through an interrupt Sleep Hierarchy is not executing, context is lost and must be saved/restored, will wake up through an interrupt Caches in the hierarchy are off Off Hierarchy is not executing, context is lost and wake up will only occur by an explicit software command or By physical hardware reset zzz RIP C0 C3 C1
  • 6. 6 Types of State  Propose updating language for C1,C2,C3 so that for ARM based systems:  C1 is state (WFI) and also a type of state Idle  C2 not used on ARM systems (as type of state)  C3 as a type of state has the semantics of Sleep
  • 7. 7 Topology  ACPI uses _CSD to express coupled C-states  _CSD relies on the notion of symmetry : Same states for all processors  This doesn’t bode well for heterogeneous systems that might have different number of states per processor  Feedback from OSVs, is that _CSD is not a very nice object to work with in kernel
  • 8. 8 Topology  ACPI has a tree like name space  We can use that to represent topology directly  Modules : Structure that can contain other modules or devices  We can declare individual CPU states  We can declare cluster/system states  Can represent any arbitrary topology
  • 9. 9 Topology Module System { _CST(Sleep System) Module Cluster 0 { _CST(Sleep Cluster) Processor CPU0 { _CST(Idle, Retention1,Sleep) } Processor CPU1 { _CST(Idle,Sleep) } } Module Cluster 1 { _CST(Sleep Cluster) Processor CPU2 { _CST(Idle, Retention1, Retention2, Sleep) } }}
  • 10. 10 Additional Information  _CST doesn’t have enough information for an OSPM to manage an ARM system’s power states  No representation of what context may be lost  No representation of what caches may be lost/if they need management or not  Last core?  Break Even Latency missing  No easy way to link C-states with device states or power resource states  No easy way to describe wake capability of an interrupt  Not every interrupt can wake you from every state  No versioning
  • 11. 11 Additional Information  Proposal is to replace _CST with new object method _CSX Package { Version, // Integer (WORD) Count, // Integer (WORD) CStatesX[0], // Package … CStatesX[N-1] // Package }  Version is self explanatory. It would start at 0
  • 12. 12 Additional Information  CStateX Package(9) { Register // Buffer (Resource Descriptor) Type // Integer (BYTE) Latency // Integer (WORD) Power // Integer (DWORD) BreakEven // Integer (DWORD) GenStateFlags // Integer (WORD) ArchStateFlags // Integer (DWORD) PowerDepth, // Integer (BYTE) LastMan, // Reference }
  • 13. 13 Additional Information  BreakEven:  How long do I have to be in this state compared to WFI to save power  GenStateFlags:  OSPM_MANAGE_CACHE:  Set if OSPM has to manage caches,  CACHES_CONTENTS_LOST  Set if caches up to current topology level are lost with this power state  (More on this later)  PLATFORM_COORDINATED:  If set, platform coordinates and does last man standing.  If state is requested it will be entered when last CPU requests that state or deeper  If clear if state is requested it will be entered. OSPM has to coordinate  ArchStateFlags:  Architecture specific flags. For ARM: what context needs to be saved/restored  Generic Registers, VFP, Debug context, Timer, CPU Interface, GICD
  • 14. 14 Additional Information  PowerDepth:  Identifier describing how deep a power state is  Bigger number -> Bigger power savings -> Bigger latencies  Defaults to 0  Used to describe dependencies  PowerResources: if ON States of depth X or higher are not available  Requires augmenting PowerResource Definition  Device state D0, D1, D2, D3hot then:  States of depth X=FunctionOfDState or higher are not available  Requires new methods  IRQ X can wake up power states of PowerDepth Y or lower  Required augmenting IRQ descriptions
  • 15. 15 Additional Information  LastMan:  Allows for cases where only one specific CPU can enter a coupled State  If Plaform Coordinated (PLATFORM_COORDINATED flag set) this should be undefined  If CPU only power state this should be undefined  If it is a coupled C state:  If state can be initiated on any CPU (doesn’t matter who last man is) it must be undefined  Otherwise reference to processor node that must initiate the power state transition
  • 16. 16 Cache  Propose that each level of topology has a Cache object, _CLV  Describes Caches that are private to that part of the topology  Simplest instance could be simply a list of caches: _CLV return package of integers eg: Core has L1 cache = {L1} Cluster has L2 cache = {L2}  Combines with flags, CACHES_CONTENTS_LOST and OSPM_MANAGE_CACHE
  • 17. 17 Cache  When not all CPUs have entered a coupled state:  Caches in sub-hierarchies will be lost  CPUx: L1 Module Cluster 0 { _CSX(… CStateX : CACHES_CONTENTS_LOST …) _CLV( L2 ) Processor CPU0 { CSX(…) _CLV( L1) } Processor CPU0 { CSX(…) _CLV( L1) }} L1 CPU1 L1 CPU0 Cluster 0 L2
  • 18. 18 Cache  When not all CPUs have entered a coupled state:  Caches in sub-hierarchies will be lost  CPUx: L1  When last CPU enters cluster state:  All caches up to topology level are lost  L2, all L1s Module Cluster 0 { _CSX(… CStateX : CACHES_CONTENTS_LOST …) _CLV( L2 ) Processor CPU0 { CSX(…) _CLV( L1) } Processor CPU0 { CSX(…) _CLV( L1) }} L1 CPU1 L1 CPU0 Cluster 0 L2
  • 19. 19 PSCI  Two flags in FADT Flags field: PSCI_PRESENT: 1 if PSCI is implemented USE_HVC: 1 if HVC is should be used in preference to SMC as PSCI conduit  Note no override or function IDs
  • 20. 20 PSCI  CStateX uses Register to denote how to enter a C-state  Registers represented GAS: Generic Address Structure  GAS is very flexible and allows encapsulating “Software as Hardware” in very specific cirmcunstance. This is called Function Fixed Hardware  Power State Parameter of CPU_SUSPEND is the Registers Address  Equivalent to method used for Intel today
  • 21. 21 PSCI  GAS Register format for a C-State Field Description Address Space ID 0x7f (FFH identifier) Register Bit Width 32 Register Bit Offset 0 Access Size 3 (DWORD) Address Upper DWORD must be zero Lower DWORD must denote the PSCI power_state parameter for the CPU_SUSPEND call Address: 0xF000000_00000000 is reserved for WFI this would not result in a PSCI call