The Curious Case of Fuzzing for Automated Software Testing

The Curious Case of Fuzzing  
for Automated Software Testing
Marcel Böhme
Software Security

MPI-SP & Monash

Keywords: Vulnerability Discovery,
Automated Software Testing,
E
ff
ectiveness, E
ffi
ciency,  
Scalability, Guarantees

• Fuzzing for Automatic Vulnerability Discovery

• Making machines attack other machines.

• Focus on scalability, e
ffi
ciency, and e
ff
ectiveness.

• Foundations of Software Security

• Assurances in Software Security

• Fundamental limitations of existing approaches

• Drawing from multiple disciplines (information theory, biostatistics)

whoami
2
* Looking for PhD students :)

Marcel Böhme, Max Planck Institute for Security and Privacy · RUB Tag der Informatik · The Curious Case of Fuzzing for Automated Software Testing
Test Input
Expected 
Output?
Program
software testing

Test Input
Expected 
Output?
Program

software testing

Test Input
Expected 
Output?
Program

The Oracle
 
checks whether
software testing

Test Input
Expected 
Output?
Program

The Fuzzer
 
auto-generates
software testing

software testing
Test Input
Expected 
Output?
Program

The Fuzzer
 
auto-generates

software testing
Test Input
Expected 
Output?
Program

The Fuzzer
 
auto-generates
fuzzing == automated

fuzzing :: properties
• e
ff
ectiveness
• e
ffi
ciency

• scalability

• e
ff
ectiveness

• e
ffi
ciency
• scalability

• e
ff
ectiveness

• e
ffi
ciency

• scalability

• You’ve been generating test cases for your program.

• No bugs found! 👍

• Is your program free of bugs?
fuzzing :: effectiveness
[ESEC/FSE’21] “Estimating Residual Risk in Greybox Fuzzing
”

Böhme, Liyanage, and Wüstholz




• Probably not. 😆
”






• Is your test case generation technique e
ff
ective?
”






ff
ective?

• Maybe? 😅
”






ff
ective?

• Maybe? 😅

•
🤔
• How do you even measure e
ff
ectiveness if there are no bugs?
”


ff

ff
Code Coverage!

ff
Code Coverage!
So, you are saying: Achieve 100% coverage.

That should be easy, right?

ff
Code Coverage!
So, you are saying: Achieve 100% coverage.

That should be easy, right?
Wrong.

ff
Code Coverage!
We cannot compute S!
 
As hard as software verification.

ff
Code Coverage!
So, you are saying: As a fuzzer achieves
 
more coverage, it also finds more bugs, right?
[ICSE’22] “On the Reliability of Coverage-based Fuzzer Benchmarking
”

Böhme, Szekeres, and Metzmann

ff
Code Coverage!
So, you are saying: As a fuzzer achieves
 
more coverage, it also finds more bugs, right?
Right!
”


ff
Very strong correlation with bug finding!
Code Coverage!

ff
Code Coverage!
So, you are saying: We can compare fuzzers in terms
 
of coverage achieved and declare the winner as the
 
most effective at bug finding, right?
”


ff
Code Coverage!
So, you are saying: We can compare fuzzers in terms
 
of coverage achieved and declare the winner as the
 
most effective at bug finding, right?
Wrong.
”


Worst in Coverage

Best in Bug Finding
”


So, it should be better to systematically
 
generate test inputs that cover most
 
of the coverage elements rather than
 
to randomly generate inputs, right?

So, it should be better to systematically
 
generate test inputs that cover most
 
of the coverage elements rather than
 
to randomly generate inputs, right?
Wrong.

fuzzing :: efficiency
[ESEC/FSE’14] “On the Ef
fi
ciency of Automated Software Testing
”

Böhme and Paul

time budget
fi
”

Böhme and Paul

time budget
As we increase the test input
 
generation time for the most effective
 
technique, it will achieve less and less
 
coverage within the time budget.
fi
”

Böhme and Paul

Even the most effective technique
 
is outperformed by random testing,
 
if it takes too much time
 
to generate the test inputs.
fi
”

Böhme and Paul

fuzzing :: scalability
[ESEC/FSE’20] “Fuzzing: On the Exponential Cost of Vulnerability Discovery
”

Böhme and Falk

”

Böhme and Falk
If we increase the #machines exponentially,
 
we find the same bugs exponentially faster,
 
right?

”

Böhme and Falk
 
we find the same bugs exponentially faster,
 
right?
Right!

”

Böhme and Falk
 
the number of new bugs found in the same time
 
increases exponentially, right?

”

Böhme and Falk
 
the number of new bugs found in the same time
 
increases exponentially, right?
Wrong.

1 2 4 8 16 32 64 128 256 512
machines
new bugs 1 2 3 4 5 6 7 8 9
24 hrs

fuzzing :: summary

fuzzing :: summary
• Maximize coverage to increase #bugs found.
• How do you even measure eﬀectiveness if there are no bugs?
Very strong correlation with bug finding!
Code Coverage!

fuzzing :: summary

• Compare fuzzers in terms of *both*, coverage and bug
fi
nding.
Worst in Coverage
Best in Bug Finding
[ICSE’22] “On the Reliability of Coverage-based Fuzzer Benchmarking”

fuzzing :: summary

fi
nding.

• Dumb and fast is better than smart and slow.
time budget
[ESEC/FSE’14] “On the Efficiency of Automated Software Testing”
Böhme and Paul
time budget
[ESEC/FSE’14] “On the Efficiency of Automated Software Testing”
Böhme and Paul

fuzzing :: summary

fi
nding.

• Dumb and fast is better than smart and slow.

• Bug
fi
nding comes at an exponential cost.
1 2 4 8 16 32 64 128 256 512
machines
new bugs 1 2 3 4 5 6 7 8 9
24 hrs
* Looking for PhD students :)

The Curious Case of Fuzzing for Automated Software Testing

More Related Content

What's hot

Similar to The Curious Case of Fuzzing for Automated Software Testing

More from mboehme

Recently uploaded

The Curious Case of Fuzzing for Automated Software Testing