Isorc18 keynote

TIMING ANALYSIS TO
TESTING
PROF. ABHIK ROYCHOUDHURY
NATIONAL UNIVERSITY OF SINGAPORE
Joint works with several co-authors over many years 2001-18.

TIME-CRITICAL SOFTWARE
IEEE ISORC 2018 Keynote 2

THE TALK
Advances in
Functionality
checking
driven by
Constraint
solving
Timing
Analysis++
Symbolic Execution
Analysis of multi-
cores
Tests apart from
bounds

WORST-CASE
EXECUTION TIME

DETAILED ARCHITECTURAL
MODELING

COMPARE EST. AND OBS. WCET

CHRONOS
• OVERVIEW OF THE TOOL
• CHRONOS: A TIMING ANALYZER FOR EMBEDDED SOFTWARE
XIANFENG LI, YUN LIANG, TULIKA MITRA AND ABHIK ROYCHOUDHURY
SCIENCE OF COMPUTER PROGRAMMING, VOLUME 69, DECEMBER 2007.
• SCALABLE LIGHT-WEIGHT INFEASIBLE PATH DETECTION
• WITHIN AN ITERATION
• ACROSS LOOP ITERATIONS
• NOVEL MICRO-ARCHITECTURAL MODELING
• OUT-OF-ORDER PIPELINES
• BRANCH PREDICTION
• I-CACHE AND ITS INTERACTION WITH OTHER FEATURES
• D-CACHE WITH NOVEL MODELING
• UNIFIED MULTI-LEVEL CACHE AND CODE/DATA LAYOUT

A VIEW OF TIMING ANALYSIS
System-level
Efficient,
large designs
Program level
Bit more
expensive,
accurate
System-level and Program-level
techniques are somewhat disjoint.
Motivation:
Artifacts other than
WCET bounds

Cache
Resource
sharing in
multi-cores
Cache
Instrumenting
assertions
Cache
Test
generation

DRAMCPU
CACHE
S
Caches have a significant impact on performance
Issues such as Cache Thrashing may hamper the performance gain due to
Caches
Caches are used to bridge the performance gap between
CPU and DRAM
CACHES: WHY ARE THEY NEEDED?

CACHE THRASHING: WHY IT IS BAD?
Cache Thrashing occurs when a frequently used cache line
is replaced by another frequently used cache line
… as a result lots of cache misses
m3
m2m1
While(true){
if(x > 5){
// m1 accessed
}else{
// m2 accessed
}
// m3 accessed
}
Set
1
Set
2
Cache
m1 and m2 conflict in cache
may lead to thrashing ...
access to m3
results in
cache hit
after first
iteration

Program
Cache
analysis
Pipeline
analysis
Branch predictor
modeling
WCET
of basic
blocks
constraints
Infeasible
path
constraints
Loop
bound
Micro architectural
modeling
Path analysis
WCET ANALYSIS
IPET = Implicit Path Enumeration Technique
IPET

ARCHITECTURE
IEEE ISORC 2018 Keynote
13
Core 1 Core n
L1 cache L1 cache
Shared L2 cache
Memory
Shared bus
Resource
sharing

Static
Analysis
Program
Cache
Configuration
Classification of
Memory Block
always hit (AH)
persistent (PS)
always miss (AM)
not classified (NC)
{m1,m2} maps to Cache Set 1
{m3} maps to Cache Set 2
STATIC ANALYSIS

IMPRECISION IN ABSTRACT
INTERPRETATION
15
p1 p2
Cache state = C1 Cache state = C2
Joined Cache state = C3
a
b
b
x
Abstract
cache set
Abstract
cache set
youngyoung
b Joined cache statePath p1 or path p2?
Joined cache state loses information about path p1 and p2

MODEL CHECKING ALONE ?
• A PATH SENSITIVE SEARCH
• PATH SENSITIVE SEARCH IS EXPENSIVE – PATH
EXPLOSION
• WORSE, COMBINED WITH POSSIBLE CACHE STATES
16
p1 p2
Cache state =
C1
Cache state =
C2

MODEL CHECKING ALONE ?
• A PATH-SENSITIVE SEARCH
• PATH SENSITIVE SEARCH IS EXPENSIVE – PATH
EXPLOSION
• WORSE, COMBINED WITH POSSIBLE CACHE STATES
17
p1
p2
a
b
young b
x
Abstract LRU
cache set
young
a
b
Abstract LRU
cache set
young b
x
Abstract LRU
cache set
young
State Explosion

CACHE ANALYSIS
18
Program
Pipeline
analysis
Branch predictor
modeling
WCET
of basic
blocks
constraints
Infeasible
path
constraints
Loop
bound
IPET
Micro architectural
modeling
Path analysis
Cache
analysis by
abstract
interpretation
Analysis
outcome
Refine by
Symbolic Exec
All checked
Timeout
Refinement by model checker can be terminated at any point
Model checker refinement steps are inherently parallel
Each model checker refinement step checks light assertion property

REFINEMENT (INTER-CORE)
19
m
m
Task
Cache hit
start
exit
Conflictin
g task
Cache miss
m1
m2
m
cache
x < y
x == y
Infeasible
m1
m2
Spurious
≠m ≠m
young

REFINEMENT (INTER-CORE)
20
m
m
Task
start
exit
Conflictin
g task
m1
m2
m
cache
x < y
x == y
Infeasible
m1
m2
C_m++
Increment
conflict
C_m++
Increment
conflict
assert (C_m <= 1)
Verified
m
A Cache Hit
young

REFINEMENT (WHY IT WORKS?)
21
Path 2
Cache miss
m
m
Conflict to mm’
C_m++
Increment
conflict
assert (C_m <= 0)
Property
Does not
affect the
value of C_m
x < y
x == y
m’
m

EXTENSION USING SYMBOLIC
EXECUTION
22
Conflictin
g task
m1
m2
x < y
x == y
m1
m2
C_m++
Increment
conflict
C_m++
Increment
conflict
assert (C_m <= 1)
x < y
constraint
solver
x = y x = y
x < y x ≥ y
x < y ˄ x = y
unknown
NO
assert (C_m <= 1)
satisfied
abort

IMPROVEMENT
24
L1 cache L1 cache
Shared L2
cache
4-way associative, 8 KB
Direct-mapped, 256 bytes
Tasks
cnt
jfdctint
edn
fir
fdct
ndes

IMPROVEMENT
25

A GENERIC FRAMEWORK
• THREE DIFFERENT ARCHITECTURAL/APPLICATION
SETTINGS
26
Intra task
(WCET in single core)
High
priority
Low
priority
Inter task
(Cache Related
Preemption Delay
analysis)
cache cache L1 cache L1 cache
Shared L2
cache
Task in
Core 1
Task in
Core 2
Inter core
(WCET in multi-core)
Cache
conflict Cache
conflict
Cache
conflict

Cache
Resource
sharing in
multi-cores
Cache
Instrumenting
assertions
Cache
TEST
GENERATI
ON

THE TALK
Advances in
Functionality
checking
driven by
Constraint
solving
Timing
Analysis++
Symbolic Execution
Analysis of multi-
cores
Tests apart from
bounds

TEST GENERATION
To develop a test generation framework which aims
to report all possible cache performance issues that
may exist in some program execution.
Test generator
Program
Cache
Configuration
Unique cache
performance issues
(each issue is reported
with a symbolic formula to
reach that issue)

DIFFERENT FROM PROFILING!
Program
Profiling
Program
Cache Config.
Test Inputs
Performanc
e
Issues
Test generator
Program
Cache Config.
Test Inputs
Performanc
e
Issues
Symbolic
Formula
No guarantees
for
completeness
Vs

We reduce the problem of testing cache
performance to an equivalent functionality testing
problem
Static Analysis
Instrumentation
Dynamic
Explore
Test Generate
P P’
Non-functional properties
encoded as assertions
Reduces the search
space for exploration
Explores the reduced
search space & generate
test cases
Test
Case
s
Stage I
Stage
II
KEY IDEA

IDENTIFYING THRASHING SCENARIOS
Classification of
Memory Block
Extract memory
blocks potentially
involved
in Cache Thrashing
Set of Cache
Thrashing Scenarios
{{m1,m2}}
assume direct
mapped cache
Extract
always miss (AM)
not classified (NC)
For each cache set

Encode each thrashing scenario as an assertion at appropriate
program location
Instrumentation
Thrashing
Sets
{{m1,m2}}
INSTRUMENTATION

GENERATING ASSERTIONS
An assertion captures the property that
all memory blocks in a Thrashing Scenario are
evicted at least once between two consecutive
accesses
Unique cache conflicts between two
access (Cm )
Let , {{m1,m2}}
assert(Cm1 ≤ 0 V Cm2 ≤ 0)
Condition for staying in the cache
Cm ≤ associativity of cache - 1

Exploration is performed to verify the validity of
Instrumented assertions
Instrumente
d Program
Instrumente
d
Assertions
<Ө,ф>
Where,Ө : thrashing scenario
Ф : symbolic formula on input that leads to
Ө
Validate
Deviate
Report
Exploration
DYNAMIC EXPLORATION

EXPLORATION USING GREEDY STRATEGY
Use CDG to find a
path with
maximum
# of unchecked
assertions
Control
Dependence
Graph (CDG)
Unchecked
Assertions
New path
to explore

TEST GENERATION
Results are generated in the format < Ө , Ф >
Where, Ө : thrashing scenario
Ф : symbolic formula on input that leads to
Ө
Any input which satisfy Ф will lead to cache
thrashing scenario Ө

Cache analysis by
abstract
interpretation
Instrumentation
automatically adds
assertions to the
program
Report
violated
assertions
Explore a path
leading to
assertions
(symbolic
exec)
Test
Suite
Program
CHMC
(cache hit-
miss
classification)
Instrumente
d
Program
Assertion
violated in
Time Budget
/
All
instrumente
d assertions
violated
always hit (AH)
persistent (PS)
always miss (AM)
not classified (NC)
TEST GENERATION RECAP

TOOLS NEEDED
Chrono
s
KLEE
STP
• Timing analysis engine
• Symbolic execution engine and SMT solver

EVALUATION
Assertion Coverage
Thrashing Potential
Unique assertions checked * 100
= --------------------------------------------
Unique assertions instrumented
Unique assertions violated *
100
= ----------------------------------------
Unique assertions
instrumented
100 % coverage implies all unique assertions have been checked at least once
Gives an idea about the thrashing potential for a program, for a given cache
configuration

o PROGRAMS WITH LESSER NUMBER OF INPUT DEPENDENT
PATHS WERE EXPLORED FASTER
o FOR MOST EXPERIMENTS, ONLY A SMALL FRACTION OF
INSTRUMENTED ASSERTIONS WERE VIOLATED
o APPLICATIONS INCLUDE
o PROVIDE INPUTS TO SYSTEM LEVEL ANALYSIS?
o REWRITING THE PROGRAM
o CHOOSING CACHE CONFIGURATION FOR AN APPLICATION
o CACHE LOCKING STRATEGIES
OBSERVATION

NOT PROFILING OR TESTING
Testing
Functionality
(Symbolic
Execution)
Testing
Performance
Profiling
Not Sound or Complete
Sound & Complete
Partitioning I/P
Space
Requires manual effort
May have false positives
Automated
No False Positives

Cache
Resource
sharing in
multi-cores
Cache
Instrumenting
assertions
Cache
Test
generation

System-level
Efficient,
large designs
Program level
Bit more
expensive,
accurate
Motivation:
WCET bounds

CACHE SIDE CHANNELS
load a[key]
load a[1]
load a[2]
Cache
Key = 0
load a[2]
a[0]
a[1]
a[2]
classified input (key) — key can be 0 or 1
MISS
Side-channel Leaks
45

CACHE SIDE CHANNELS
load a[key]
load a[1]
load a[2]
Cache
Key = 1
load a[2]
a[1]
a[2]
HIT
Side-channel Leaks
46

CACHE SIDE CHANNELS
Key = 1
HIT
load a[key]
load a[1]
load a[2]
Key = 0
MISS
🐞leak leak
load a[2]
load a[key]
load a[1]
load a[2]
load a[2]

ANALYZING CACHE SIDE CHANNELS
• Symbolically track memory address
• Expose non-functional behavior (cache misses) as functionality
• Get inputs which show specific cache miss scenarios
load a[key]
load a[1]
load a[2]
a[key] ⋀ (key = 0 ⌵ key = 1)
a[1]
a[2]
load a[2] a[2]
48
👿IEEE ISORC 2018 Keynote 48

CACHE SIDE CHANNEL IN AES

System-level
Efficient,
large designs
Program level
Bit more
expensive,
accurate
Motivation:
WCET bounds
Tests, Attack scenarios

Advances in
Functionality
checking
driven by
Constraint
solving
Timing
Analysis++
Symbolic Execution
Analysis of multi-cores, Tests apart from
bounds
Attack scenarios
When WCET analysis tools were developed in real-time systems community, constraint solvers were not
mature.
Additional applications and analyses can be developed by leveraging constraint solving and symbolic
execution.

Isorc18 keynote

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Isorc18 keynote

Similar to Isorc18 keynote (20)

More from Abhik Roychoudhury

More from Abhik Roychoudhury (9)

Recently uploaded

Recently uploaded (20)

Isorc18 keynote

Editor's Notes