CarFast: Achieving Higher Statement Coverage Faster


Sangmin Park,
Ishtiaque Hussain,
Christoph Csallner,
Kunal Taneja,
B. M. Mainul Hossain,
Mark Grechanik,
Chen Fu, Qing Xie

Outline: CarFast, Implementation, Evaluation, Conclusion

Motivation - Achieving High Coverage

 Coverage
  Degree to which a program has been tested
 Measure of confidence

 Widely used in industry
 Avionics industry standard, DO-254 and DO-178B
 Automotive industry standard, IEC 61508

 Other organizations



Motivation - Achieving Coverage Fast

[Figure labels: Current approaches, Timeout]

Goal: Achieve high coverage fast
  Achieving coverage fast is difficult
  Complex programs
  Too many test inputs (e.g., Renters Insurance Program with 78M customer profiles)



High level approach

 Observation (study we performed)
 80% of statements are covered by 20% of branches
(we call those branches "profitable")

 Intuition
  Covering profitable branches first leads to high statement coverage quickly

 High level approach
 Use static analysis to find profitable branches
 Select inputs that direct program execution towards
profitable branches
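
The "profitable branches" idea can be sketched in a few lines. This is only an illustration, not the authors' analysis: it assumes each branch's statement count is already known and that branches do not share statements, and the function name `profitable_branches` and the 0.8 threshold parameter are hypothetical. The sizes are the ones used for the foo example in these slides.

```python
def profitable_branches(stmt_counts, target=0.8):
    """Return the fewest branches whose statements reach `target` of the total.

    stmt_counts maps a branch id to its statement count. The counts are
    assumed to be disjoint; a real tool would obtain them by static analysis.
    """
    total = sum(stmt_counts.values())
    chosen, covered = [], 0
    # Greedily take the largest branches first.
    for branch, count in sorted(stmt_counts.items(), key=lambda kv: -kv[1]):
        if covered >= target * total:
            break
        chosen.append(branch)
        covered += count
    return chosen, covered / total


# Branch sizes taken from the foo example in these slides.
branches, fraction = profitable_branches(
    {"branch 1": 300, "branch 2": 600, "branch 3": 100}
)
print(branches, fraction)
```

With these sizes, two of the three branches already account for 90% of the statements, which is the kind of skew the observation relies on.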


CarFast – Illustrative Example

Input: i1 = 20 and i2 = 20

void foo (int i1, int i2) {
  if (i1 == 10) {
    … // branch 1: 300 statements
  } else if (i2 == 50) {
    … // branch 2: 600 statements
  } else {
    … // branch 3: 100 statements
    if (i1 == 20) {
      if (i2 == 30) { … }
    }
  }
}

[Figure: control-flow graph of foo. i1==10: T leads to 300 stmts, F leads to i2==50; i2==50: T leads to 600 stmts, F leads to 100 stmts]


CarFast – Illustrative Example

Input: i1 = 20 and i2 = 20

[Code and control-flow graph of foo repeated from the previous slide]

DFS search from this input: up to 10% statement coverage
Covering branch 2: up to 70% statement coverage


CarFast – Algorithm

Step 1: Rank Branches
Step 2: Select Initial Input
Step 3: Select Next Input


CarFast – Algorithm

Step 1: Rank branches
• Counts (transitively) the number of statements each branch contains
• Resolves method calls
• Ranks branches by statement count

Rank  Branch  # Stmt
1     2       600
2     1       300
3     3       100
4     …       …
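
Step 1 can be sketched as a transitive count over a small program model. This is a sketch under stated assumptions rather than the tool's actual static analysis: the `BRANCHES` and `METHODS` dictionaries, the nested branch "3a" and the method name `helper` are all hypothetical, chosen so that the totals match the ranking table above, and recursive method calls are assumed not to occur.

```python
from functools import lru_cache

# Hypothetical program model: each branch has its own statements, nested
# branches, and calls to methods whose bodies are counted as well.
BRANCHES = {
    "1": {"stmts": 300, "nested": [], "calls": []},
    "2": {"stmts": 450, "nested": [], "calls": ["helper"]},
    "3": {"stmts": 98, "nested": ["3a"], "calls": []},
    "3a": {"stmts": 2, "nested": [], "calls": []},
}
METHODS = {"helper": {"stmts": 150, "calls": []}}


@lru_cache(maxsize=None)
def method_size(name):
    """Statements in a method plus everything it calls (no recursion assumed)."""
    method = METHODS[name]
    return method["stmts"] + sum(method_size(c) for c in method["calls"])


@lru_cache(maxsize=None)
def branch_size(name):
    """Statements reachable from a branch: own, nested branches, called methods."""
    branch = BRANCHES[name]
    return (
        branch["stmts"]
        + sum(branch_size(n) for n in branch["nested"])
        + sum(method_size(c) for c in branch["calls"])
    )


def rank_branches():
    """Branch ids ordered from most to fewest reachable statements."""
    return sorted(BRANCHES, key=branch_size, reverse=True)


ranking = rank_branches()
print([(b, branch_size(b)) for b in ranking])
```

Resolving the call to `helper` is what lifts branch 2 from 450 to 600 statements in this model, which illustrates why the ranking has to follow method calls.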


CarFast – Algorithm

Step 2: Select a random input
• Selects a random input from the input database

Input 1: i1 = 20 and i2 = 20

Branch ranking:
Rank  Branch  # Stmt
1     2       600
2     1       300
3     3       100
4     …       …

Input database:
i1  i2
5   50
20  20
30  30
40  40

CarFast – Algorithm

Step 3: Select next input from trace
• Executes the program with the input to collect the path condition
• Modifies the path condition to cover higher-ranked branches
• Queries the database with the modified condition
• Selects a random input if no input satisfies the condition

Input 1: i1 = 20 and i2 = 20
C: (i1!=10)&&(i2!=50)&&(i1==20)&&(i2!=30)
C’: (i1!=10)&&(i2==50)
Input 2: i1 = 5 and i2 = 50
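
Step 3 can be traced on the foo example with a small sketch. It is an illustration only: the real tool collects path conditions with a concolic engine, whereas here foo is re-implemented by hand, and the names `trace_foo`, `GUARDS`, `retarget` and `next_input` are hypothetical. Picking the first matching database row is also an assumption; the slides do not say which satisfying input is chosen.

```python
import random


def trace_foo(i1, i2):
    """Mimic foo from the slides; return the path condition as (var, op, const) triples."""
    env = {"i1": i1, "i2": i2}
    path = []

    def test(var, const):
        taken = env[var] == const
        path.append((var, "==" if taken else "!=", const))
        return taken

    if test("i1", 10):
        pass  # branch 1
    elif test("i2", 50):
        pass  # branch 2
    else:  # branch 3
        if test("i1", 20):
            test("i2", 30)
    return path


# Hypothetical table: the condition that guards each ranked branch.
GUARDS = {"1": ("i1", "==", 10), "2": ("i2", "==", 50)}


def retarget(path, guard):
    """Keep the path prefix before the guard's negation, then require the guard."""
    var, op, const = guard
    flipped = (var, "!=" if op == "==" else "==", const)
    if flipped not in path:
        return None
    return path[: path.index(flipped)] + [guard]


def satisfies(row, cond):
    return all((row[var] == const) == (op == "==") for var, op, const in cond)


def next_input(database, path, ranking, rng=random):
    """Try branches in rank order; fall back to a random input if nothing matches."""
    for branch in ranking:
        cond = retarget(path, GUARDS[branch]) if branch in GUARDS else None
        if cond is None:
            continue
        matches = [row for row in database if satisfies(row, cond)]
        if matches:
            return matches[0], cond
    return rng.choice(database), None


database = [{"i1": 5, "i2": 50}, {"i1": 20, "i2": 20},
            {"i1": 30, "i2": 30}, {"i1": 40, "i2": 40}]
path = trace_foo(20, 20)
row, cond = next_input(database, path, ranking=["2", "1"])
print(path, cond, row)
```

For Input 1 the collected path corresponds to C, the retargeted condition corresponds to C’, and the only row in the example database that satisfies it is i1 = 5, i2 = 50.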


Implementation

 Scalability challenges in large applications: up to 1MLOC
 Large constraints of size up to 5MB
 Existing tools run out of memory

 Execution Engine
 Initial tool: Concolic execution engine (Dsc)
 Solution: DSC-Dumper mode
 Uses disk instead of memory
 Removes memory overhead

 Test Input Database
 Initial tool: MSSQL server 2008
 Solution: Constraint-based selector
 Uses B+ tree based index
 Provides API to process queries
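
A constraint-based selector over an indexed input table might look roughly like the sketch below. It is not the tool's API: the class names `ColumnIndex` and `InputSelector` and the `query` method are hypothetical, a sorted list searched with `bisect` stands in for the B+ tree index, and only `==` and `!=` constraints against constants are handled.

```python
import bisect


class ColumnIndex:
    """Sorted (value, row_id) pairs; a simple stand-in for a B+ tree index."""

    def __init__(self, values):
        self.entries = sorted((v, rid) for rid, v in enumerate(values))
        self.keys = [v for v, _ in self.entries]

    def equal(self, value):
        """Row ids whose column value equals `value`, found by binary search."""
        lo = bisect.bisect_left(self.keys, value)
        hi = bisect.bisect_right(self.keys, value)
        return {rid for _, rid in self.entries[lo:hi]}


class InputSelector:
    """Answers conjunctions of (column, op, constant) constraints."""

    def __init__(self, rows):
        self.rows = rows
        self.columns = {
            name: ColumnIndex([r[name] for r in rows]) for name in rows[0]
        }
        self.all_ids = set(range(len(rows)))

    def query(self, constraints):
        ids = set(self.all_ids)
        for col, op, const in constraints:
            hits = self.columns[col].equal(const)
            ids = ids & hits if op == "==" else ids - hits
        return [self.rows[i] for i in sorted(ids)]


selector = InputSelector([
    {"i1": 5, "i2": 50}, {"i1": 20, "i2": 20},
    {"i1": 30, "i2": 30}, {"i1": 40, "i2": 40},
])
found = selector.query([("i1", "!=", 10), ("i2", "==", 50)])
none_found = selector.query([("i1", "==", 10)])
print(found, none_found)
```

An empty result, as in the second query, is the case where the algorithm would fall back to selecting a random input.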


Experiment – Approaches

Random Testing
• Random selection of inputs
• Black-box approach

Adaptive Random Testing (ART)
• Random selection of evenly distributed inputs
• Black-box approach

DART
• Concolic execution approach
• Depth-first path exploration
• White-box approach

CarFast
• Our approach
• Static ranking based path exploration
• White-box approach
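
To make "evenly distributed inputs" concrete, here is a sketch of one common ART variant, the fixed-size candidate set. Which ART variant the study used is not stated on the slide, so the variant, the squared Euclidean distance, the candidate count `k` and the function name `art_select` are all assumptions.

```python
import random


def art_select(pool, executed, k=10, rng=random):
    """From k random candidates, pick the one farthest from all executed inputs."""
    candidates = rng.sample(pool, min(k, len(pool)))
    if not executed:
        return candidates[0]  # nothing to spread away from yet

    def min_distance(candidate):
        # Squared Euclidean distance to the nearest already executed input.
        return min(
            sum((a - b) ** 2 for a, b in zip(candidate, done))
            for done in executed
        )

    return max(candidates, key=min_distance)


pool = [(0, 0), (1, 1), (100, 100)]
chosen = art_select(pool, executed=[(0, 0)], k=3)
print(chosen)
```

Plain random testing corresponds to `rng.choice(pool)` with no distance computation; ART spends extra work per selection to spread the inputs out.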



Experiment – Subject Programs

 Challenges in selecting programs
 Programs with various sizes
 Programs with complex properties
 Programs without external dependencies

 RugRat program generator [WODA 2012]
 Stochastic-parse-tree based program generation approach
 Highly configurable option parameters
 Used in generating 12 programs from 1KLOC to 1MLOC

 Test inputs
 Each program has up to 20 integer inputs
  Complete combination of inputs for 20 integers = 100^20
  Pairwise combination of inputs for 20 integers = 1M


Experiment – Setup

 Study Protocol
  For statistical significance, each experiment was run 30 times
  Total time = 4 approaches * 12 programs * 30 runs * 24 hours = 34,560 hours

 Baseline coverage = min(cov_i), where i ∈ {Random, ART, DART, CarFast}

 Measurement (to achieve baseline coverage)
 Number of iterations (1 iteration = 1 selection)
 Elapsed time
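
The baseline and the iteration measurement can be sketched as follows. The coverage traces are made-up numbers purely for illustration, not data from the study, and reading cov_i as the final coverage reached by approach i is an assumption; the function names are hypothetical.

```python
def baseline_coverage(final_coverage):
    """Lowest final coverage among the approaches, so that all of them reach it."""
    return min(final_coverage.values())


def iterations_to_reach(trace, baseline):
    """trace[i] is the cumulative coverage after iteration i + 1."""
    for iteration, coverage in enumerate(trace, start=1):
        if coverage >= baseline:
            return iteration
    return None


# Made-up traces for illustration only; NOT results from the experiment.
traces = {
    "Random": [0.10, 0.30, 0.45, 0.50],
    "ART": [0.20, 0.40, 0.55],
    "DART": [0.05, 0.10, 0.20, 0.45],
    "CarFast": [0.45, 0.70, 0.80],
}
baseline = baseline_coverage({name: t[-1] for name, t in traces.items()})
iterations = {name: iterations_to_reach(t, baseline) for name, t in traces.items()}

# Total machine time from the study protocol above.
total_hours = 4 * 12 * 30 * 24
print(baseline, iterations, total_hours)
```

Using the minimum as the baseline means every approach reaches it, so iterations and elapsed time can be compared across all four.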



Experiment – Results
Programs   Baseline Coverage  Approaches  Iterations (mean)  Elapsed Time (mean)
3 (1.2K)   45%                Random      17.1               522.2
                              ART         17.8               59.8
                              DART        693.5              1447.0
                              CarFast     5.9                571.0
5 (2.1K)   78%                Random      1023.2             3162.5
                              ART         1615.6             5157.7
                              CarFast     463.9              20040.9
7 (7.8K)   79%                Random      543.1              1736.8
                              ART         684.1              2217.6
                              CarFast     380.0              18829

• DART doesn't scale (no DART results are shown for programs 5 and 7)

* Complete results are in the paper.


Future Work

 Bottleneck
 Current: Identified modules causing bottlenecks
 Future: Improve the runtime of CarFast

 Fault-detection ability
 Current: Does not measure fault-detection ability
 Future: Investigate fault-detection ability

 Other test coverage metrics
 Current: Used static measure on statements
 Future: Use static measure on branches


Contributions

 CarFast
The first approach to select inputs for achieving
statement coverage fast

 Implementation
The tool scales up to 1MLOC

 Experiment
The study shows limitations in popular testing
techniques with statistical significance

 Tool, subjects, and experimental data are available at
www.carfast.org


Related Work

 Test-case prioritization
 Test case prioritization: empirical studies
[Elbaum, 2002]
 Dynamic symbolic execution
 DART [Godefroid, 2005]

  Hybrid concolic testing [Majumdar, 2007]
 Heuristics for dynamic test generation [Burnim, 2008]

 Search-based testing
 Fitness-guided path exploration [Xie, 2009]



CarFast – Preliminary Study

 Study
 Performed on Apache programs
 Investigated branches and statements
 Observed power law in results –
20% of branches contain
80% of statements

 Hypothesis
 Assuming the observation holds,
we can steer execution to cover
those 20% of branches

