Evaluating the Usefulness of IR-Based Fault LocalizationTechniques

Evaluating the Usefulness
of IR-Based Fault
Localization Techniques
Qianqian Wang* Chris Parnin** Alessandro Orso*
* Georgia Institute of Technology, USA
** North Carolina State University, USA

Debugging is Difﬁcult
Let’s&see…&
Over&50&years&of&research&
on&automated&debugging.&

Let’s&see…&
1962.&Symbolic&Debugging&(UNIVAC&FLIT)&
1981.%Weiser.%Program%Slicing%
1999.$Delta$Debugging$
2001.%Sta)s)cal%Debugging%
��

Let’s&see…&
1962.&Symbolic&Debugging&(UNIVAC&FLIT)&
1981.%Weiser.%Program%Slicing%
1999.$Delta$Debugging$
2001.%Sta)s)cal%Debugging%
��
STILL

IR-Based FL Techniques
• How do they work?
• Rank source ﬁles based on their lexical similarity to bug
reports 
• How well do they work?
Top 1 Top 5 Top 10
Percentage 35% 58% 69%

Source code file: CTabFolder.java
public class CTabFolder extends Composite {
// tooltip
int [] toolTipEvents = new int[] {SWT.MouseExit,
SWT.MouseHover, SWT.MouseMove,
SWT.MouseDown, SWT.DragDetect};
Listener toolTipListener;
…
/ * Returns <code>true</code> if the CTabFolder
only displys the selected tab
* and <code>false</code> if the CTabFolder
displays multiple tabs.
*/
…void onMouseHover(Event event) {
showToolTip(event.x, event.y);
}
void onDispose() {
inDispose = true;
hideToolTip();
…
}
}
Understanding IR-based FL Techniques
Bug ID: 90018
Summary: Native tooltips left around on
CTabFolder.
Description: Hover over the PartStack
CTabFolder inside eclipse until some native
tooltip is displayed. For example, the
maximize button. When the tooltip appears,
change perspectives using the keybinding.
the CTabFolder gets hidden, but its tooltip
is permanently displayed and never goes
away. Even if that CTabFolder is disposed
(I'm assuming) when the perspective is
closed.
--------------------------------------------------------------------------

• Does the presence of technical information affect the fault
localization results?
• How often do bug reports contain such information?
• Is such information enough for developers to ﬁnd the faulty
ﬁles easily?
Assessing IR-based FL Techniques

• Q1: Does technical information affect fault
• Q2: How often do bug reports contain
technical information?
Analytical Study

Subjects
Project # Bugs #Source ﬁle
AspectJ 286 6k
SWT 98 0.5k
ZXing 20 0.4k
Jodatime 9 0.2k

• Categorize bug reports
• Stack traces/test cases/program entity names/natural
language descriptions
• Generate ranked lists
• BugLocator - IR-based fault localization tool
• Perform statistical analysis
Q1: Method

Q1: Results
Program
entity
Stack
trace
Test case
Results √ X X
• Does bug report information affect fault
√ Statistically signiﬁcant difference: p < 0.05
X No statistically signiﬁcant difference: p >= 0.05

Q1: Results
Program
entity
Stack
trace
Test case
Results √ X X
• Does bug report information affect fault
Bug report characteristics affect IR-based
fault localization results

• How often bug reports contain technical
information?
• Select 10,000 bug reports from SWT Bugzilla
• Check presence of technical information:
• Stack traces
• Test cases
• Program entity names
Q2: Method

information?
Q2: Results
Stack traces Test cases
Program
entity

information?
Q2: Results
Stack traces Test cases
Program
entity
The majority bug reports do not contain
enough information

Additional finding
“Optimistic” Evaluation Approach
• Assumption: Changed files = faulty files
• Reality:
• 40% bugs contain multiple changed files
• Not all changed files contain bugs
• Best-ranked files may not be faulty

Additional finding
“Optimistic” Evaluation Approach
• Assumption: Changed files = faulty files
• Reality:
• 40% bugs contain multiple changed files
• Not all changed files contain bugs
• Best-ranked files may not be faulty
Results of existing studies might be
worse than what reported

• Q3: Does bug report information affect
developers’ performance?
• Q4: Do IR-based techniques help developers’
performance?
User Study

Experiment Protocol: Setup
Participants:
70 developers
Graduate Students
Software subject:
• Eclipse SWT
• 2 bugs for each developer
Task: ﬁnd and ﬁx the bug
Tools:
• Eclipse plug-in
• Integrating ranked lists
• Logging
…"
1)"
2)"
3)"
4)"
✔
✔
✔

Experimental Protocol: Variables
Bug related
Tool related

Experimental Protocol: Variables
1) ———
2) ———
3) ———
4) ———
…
1) ———
2) ———
3) ———
4) ———
…
Good/bad ranked list
Good/bad bug report
With/without a ranked list
Bug related
Tool related
(i.e., with/without the tool)

Experimental Protocol: Evaluation Metrics
Time
• To ﬁnd the faulty ﬁle
• To locate the bug
Debugging score

Q3: Results
Time used to ﬁnd
the faulty ﬁle
Time used to locate
the bug
Debugging score
√ X √
Compared the performance of 2 groups: 
1. without tool, good bug reports 
2. without tool, bad bug reports

Q3: Results
Time used to find
the faulty file
Time used to locate
the bug
Debugging score
√ X √
Compared the performance of 2 groups: 
1. without tool, good bug reports 
2. without tool, bad bug reports
Good bug reports (i.e., with entity names) allow
developers to shorten the time to find the
faulty file and help them find better fixes

Q4: Results
Condition
Debugging
score
Time to ﬁnd
the ﬁle
Time to locate
the bug
X X X
X √ X
X X X
X X X
Compared the performance of 2 groups under 4 conditions: 
1. without tool, {good|bad} bug reports, {good|bad} ranked list 
2. with tool, {good|bad} bug reports, {good|bad} ranked list

Q4: Results
Condition
Debugging
score
Time to find
the file
Time to locate
the bug
X X X
X √ X
X X X
X X X
Good ranked 
list
Bad ranked 
list
Good bug 
report
Bad bug 
report
X Not statist, sign.
√ Statist. significant

Q4: Results
Condition
Debugging
score
Time to ﬁnd
the ﬁle
Time to locate
the bug
X X X
X √ X
X X X
X X X
Good ranked 
list
Bad ranked 
list
Good bug 
report
Bad bug 
report
Only perfect ranked lists help when users
can not get enough hints from bug reports

Q4: Results
Condition
Debugging
score
Time to find
the file
Time to locate
the bug
X X X
X √ X
X X X
X X X
Good ranked 
list
Bad ranked 
list
Good bug 
report
Bad bug 
report
Only perfect ranked lists help when users
can not get enough hints from bug reports
The tool only helps find the faulty file, but
developers spend much more time locating
the bug in the faulty file than finding such file

Additional Observations
• Developers used program entity names in the
bug report as search keywords.
• Ranked lists generated by IR-based techniques
affected users’ debugging behavior
• Gave a starting point
• Gave them conﬁdence

Summary
• Studied the practical usefulness of IR-based FL techniques
• Performed both an analytical study and a user study
• Main findings
• Bug report characteristics affect IR-based fault localization results
• Results of existing studies might be worse than what reported
• The majority of bug reports do not contain enough information
• “Good” bug reports allow developers to shorten the time to find the
faulty file and help them find better fixes
• Only perfect ranked lists help when users can not get enough hints from
bug reports
• The tool only helps find the faulty file, but developers spend much more
time locating the bug in the faulty file than finding such file

Implications
• Better bug reports are needed
• Automated debugging techniques should focus
on improving results for bug reports with little
information
• Automated debugging techniques should
provide ﬁner-grained information and context
• More user studies and realistic evaluations are
needed

Evaluating the Usefulness of IR-Based Fault LocalizationTechniques

Recommended

Recommended

More Related Content

What's hot

What's hot (11)

Viewers also liked

Viewers also liked (14)

Similar to Evaluating the Usefulness of IR-Based Fault LocalizationTechniques

Similar to Evaluating the Usefulness of IR-Based Fault LocalizationTechniques (20)

Recently uploaded

Recently uploaded (20)

Evaluating the Usefulness of IR-Based Fault LocalizationTechniques