Automatic for the People

Andy Zaidman
Delft University of Technology
The Netherlands
International Conference on
Automation of Software Test
(AST 2023)
May 16th, 2023
Melbourne, Australia

A story on testing… and a bit of music

Human-in-the-Loop
important in testing
⎼Sigrid Eldh, during AST 2023 on Monday May 15th

Automatic for the people?
Automatic by the people?
Automatic by the machine?
Automatic for the machine?

Automatic for the people
Automatic by the people?
Automatic by the machine?
Automatic for the machine?

What do we know about
developers’ testing activities in
the IDE?

2443 SW Engineers – 118 countries
Java/C# – Eclipse, IntelliJ, Android Studio, Visual Studio
Minimal 5 month observation period (max. 2.5 years)
161 person years of development work

Not all Java projects do unit testing (in the IDE)
3 508
1 498
(43%)
Write/look at test code
or execute tests

Estimated when installing tool
Time spent on test code engineering
test code
engineering
production code
engineering
47% - 53%

Estimated when installing tool After measuring min. 5 months
Time spent on test code engineering
25% - 75%
test code
engineering
production code
engineering
47% - 53% test code
engineering
production code
engineering

Overestimating the testing effort stems from:
(1) the tedious nature of testing
(2) developers disliking it

10. Man on the Moon
Lyrics: Now Andy did you hear about this one?
…
If you believed they put a man on the moon
…

High Coverage
&
Enormous Potential

Difficult to Read Test Cases
&
Are They Asserting Correctness?

Tools like TestDescriber
Key issue:
Understanding
the scenario
under test

Which test makes
more sense? Which
scenario is easier to
grasp?

Do test need to be
UNDERSTANDABLE?

A good test can catch a bug
and returns feedback that can
help you identify the issue.

A good developer test forms
executable documentation that
tells you how to use the methods.

A good developer test forms
executable documentation that
tells you how to use the methods.
Understandability of a test
(scenario) is important

What is the purpose of a generated test?
Throw away; find faults in current version of the software

Inspiration to write a manually written test case

To become part of a maintained test suite

To become part of a maintained test suite
A more specific generated test, e.g., a crash replicating test

Some solution spaces…
• Test amplification
• Test generation with information carving
• Documenting generated tests
• Better support for manual writing of test cases
• …

Test amplification
• Starts from existing test cases
Applies systematic “mutations” to test code to see whether more
coverage is obtained and/or different scenarios are tested

Test amplification
• Starts from existing test cases
Applies systematic “mutations” to test code to see whether more
coverage is obtained and/or different scenarios are tested
+ Easier to understand test
scenarios compared to
freshly generated tests
- Difficult to cover entire
search space, depends on
existing tests
- Inspection cost

Test generation with information
carving

Carving to explore the search space

Carving to explore the search space
vs.
Carving to improve the
understandability of tests

Better support for manual writing
of test cases

When are developers
discouraged to test?

When do developers
aspire to test or become
better testers?

1. We observe 13 developers thinking-aloud while testing methods in
open-source software
2. We challenge and augment our findings by surveying 72 software
developers

Main test case engineering strategies of
software engineers
1. intensively guided by documentation,
2. intensively guided by source code,
3. or ad-hoc.

Recommendations
1. Tool support
• Creation of test skeletons
• Quick code coverage indications
• Copy/paste support for test code
2. Developers
• Have a clear adequacy criterion to avoid uncertainty
3. Education
• Teach how to use code coverage tools to steer testing, not just as a
metric

Recommendations
1. Tool support
• Creation of test skeletons
• Quick code coverage indications
• Copy/paste support for test code
2. Developers
• Have a clear adequacy criterion to avoid uncertainty
3. Education
• Teach how to use code coverage tools to steer testing, not just as a
metric
Come see our work on SW testing education
at SEENG in the next session!

We are going to have to make our hands dirty…

Test cost Inspection cost
“You can’t test software with your hands in your pants”

We should be obsessive in improving the
user experience

user experience UX

developer experience DX

tester experience TX

by the machine

by the machine and
the people

All great
work by…

and
many
more
Team

Thank you!
International Conference on
Automation of Software Test
(AST 2023)
May 16th, 2023
Melbourne, Australia
azaidman

Some pointers
• Maurício Aniche, Christoph Treude, Andy Zaidman. How Developers Engineer Test Cases: An
Observational Study; IEEE Trans. on Software Engineering, 2022.
• Moritz Beller, Georgios Gousios, Annibale Panichella, Sebastian Proksch, Sven Amann, Andy
Zaidman. Developer Testing in The IDE: Patterns, Beliefs, And Behavior; IEEE Trans. on Software
Engineering, 2019.
• Sebastiano Panichella, Annibale Panichella, Moritz Beller, Andy Zaidman, Harald Gall. The Impact
of Test Case Summaries on Bug Fixing Performance: An Empirical Investigation; ICSE 2016.
• Carolin Brandt, Andy Zaidman. Developer-Centric Test Amplification: The Interplay Between
Automatic Generation and Human Exploration; Empirical Software Engineering, 2022.
• Mark Swillus, Andy Zaidman. Sentiment Overflow in the Testing Stack: Analysing Software Testing
Posts on Stack Overflow, Arxiv 2022.

Automatic for the People

More Related Content

Similar to Automatic for the People

More from Andy Zaidman

Recently uploaded

Automatic for the People