Thesis+of+zéphyrin+soh.ppt

Ph.D. Defense
Improving Program Comprehension and
Recommendation Systems using Developers’
Context
Zéphyrin Soh
Supervised by:
Dr. Yann-Gaël Guéhéneuc, Dr. Giuliano Antoniol, Dr. Foutse Khomh
Polytechnique Montréal, Canada
December 8, 2015
Zéphyrin Soh (Polytechnique Montréal) Context to improve comprehension and recommendation 1/40

Outline
1 Context & Motivations
2 Problems & Thesis Statement
3 Contribution 1: Quality of ITs
4 Contribution 2: Recommendation Systems
5 Contribution 3: Developers’ Eﬀort
6 Contribution 4: Program Exploration
7 Conclusion

Context & Motivations

Maintenance cost increases [1]
*Source:http://cvmountain.com *Source:https://www.leaseweb.com
1981 1999
[1] Stephen R. Schach, Object-Oriented and Classical Software Engineering. Eighth
Edition McGraw-Hill, New York, 2011, pp. 11

Typical Scenario for maintenance using ITS and SCR
Stakeholder
(user/developer)

Stakeholder
(user/developer)
ITS
Maintenance task = change request (bug
or enhancement)

Stakeholder
(user/developer)
ITS
Triaging
Paul Mary Sylvia
Developers

Workspace
Stakeholder
(user/developer)
ITS
Triaging
Workspace
(+Mylyn)
Paul Mary Sylvia
Developers
Optionally use a monitoring tool to collect
logs
Mandatory in the policy of some projects

Workspace
Stakeholder
(user/developer)
ITS
Triaging
Workspace
(+Mylyn)
Paul Mary Sylvia
Developers
Interaction traces or context: Acyivity logs
collected when developers interacted with
program entities through the IDE
Patch: All the changes performed on the
program

Workspace
Stakeholder
(user/developer)
ITS
Triaging
Workspace
(+Mylyn)
Paul Mary Sylvia
SCR
Commiter/
reviewer
Developers
Task is resolved: “ﬁxed/closed”

Workspace
Stakeholder
(user/developer)
ITS
Triaging
Workspace
(+Mylyn)
Paul Mary Sylvia
SCR
Commiter
reviewer
Developers
The stakeholders involved in the
development need help, in particular the
one who is in charge of performing
changes to resolve the task

Workspace
(+Mylyn)
Paul

Workspace
(+Mylyn)
Paul
Barry Boehm, Software engineering. IEEE Trans. Computers, 1976

Difficult
Workspace
(+Mylyn)
Paul
Expensive
Ian Sommerville, Software engineering. Pearson, ninth edition, 2011

Difficult
Workspace
(+Mylyn)
Paul
Expensive
Search
Relate
Collect
Relevant
information
Ko , An exploratory study of how developers seek, relate, and collect relevant
information during software maintenance tasks. IEEE TSE, 2006

Difficult
Workspace
(+Mylyn)
Paul
~50% time
Expensive
Search
Relate
Collect
Relevant
information
SWEBOK V3.0, Guide of Software Engineering Body of Knowledge. IEEE,
2014, pp. 5-4

Difficult
Workspace
(+Mylyn)
Paul
~50% time
Expensive
Search
Relate
Collect
Relevant
information
Program
Exploration

Difficult
Workspace
(+Mylyn)
Paul
~50% time
Expensive
Search
Relate
Collect
Relevant
information
Program
Exploration
How developers explore program to ﬁnd relevant information to
change impacts their eﬀort (and thus their productivity).

Problems & Thesis Statement
We can collect and use accurate interaction traces to assess
developers’ effort, understand how developers’ spent their effort,
and assess their program exploration strategies during
maintenance and evolution activities.

No investigation of
the quality of ITs

No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies

No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
No empirical
evidence if
complex
change
required
more effort

No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
No empirical
evidence if
complex
change
required
more effort
No systematic
identification
of exploration
strategies

Contribution 1: Quality of ITs
No investigation of
the quality of ITs

Motivations
ITS
Workspace
(+Mylyn)
ITs are collected in real-work
environment

Motivations
ITS
Workspace
(+Mylyn)
Interlease activities Interruptions/Idle times
environment
Activities

Motivations
ITS
Workspace
(+Mylyn)
Interlease activities Interruptions/Idle times
environment
Activities
The time mined from
ITs is the time spent by the
developers performing the
maintenance task.

Motivations
Kind = “edit” ⇒ change activity
Edit with not null duration
Intent = time spent in the editor is more productive
Intent vs. actual use

Experiment
Are there noises in ITs?
Can we clean noises in ITs?

Experiment
Experiment setting
15 participants
Four systems
1 participant = 1 task

Experiment
Experiment setting
15 participants
Four systems
1 participant = 1 task
Collected data
Mylyn
Interaction
Traces (RITs)
Videos
Video
Transcription
Video-based
Interaction
Traces (VITs)
Workspace
(+Mylyn)

Time-related Noise
e1
e2
e3
ot
d2
it
d3
Time
d1
Global Time (GT) = endDate(e3) - startDate(e1)
Accumulated Time (AT) = d1 + d2 + d3

Time-related Noise
Global vs. Accumulated
VITs: Same results (by deﬁnition)
RITs: Diﬀerent results

Time-related Noise
RITs vs. VITs
RITs miss on average 6% of the time spent to perform the task.
Diﬀerence due to overlaps between events

Time-related Noise
RITs
e1
e2
e3
ot
d2
it
d3
Time
d1
Average individual idle times ≈ 30 sec.
Time (RITs) = AT - ot + d
d = it if it < 30 sec.

Edit-related Noise
#edit events (duration = 0)
VITs vs. RITs

Edit-related Noise
All edits are not real modiﬁcation of the code
ITs contain about 28% of false edit-events
Feedback from Mylyn community
“... the argument that there is noise in the edit events makes
sense to me.”
“The edit events dont have to be textual edits ...”

Cleaning ITs
Noises are prevalents in ITs
Can we clean noises in ITs?

Cleaning ITs
Threshold-based approach
Kind = « edit »
No change Code change
Time <= 24s Time > 24s
Double click
open
Static navigation
(F3)
From search view others

Cleaning ITs
Threshold-based approach
Kind = « edit »
No change Code change
Time <= 24s Time > 24s
Double click
open
Static navigation
(F3)
From search view others
CITs (Cleaned ITs) = RITs + cleaning rules
False edit: P = 93%; R = 64%
True edit: P = 18%; R = 81%

Cleaning ITs
Prediction-based approach
NaiveBayes, J48, Random Forest
Best:J48 with over sampling
False edit: P = 98.5%; R = 90%
True edit: P = 31.8%; R = 77.8%

Conclusion
No investigation of
the quality of ITs
- Method to (in)validate
assumptions
- Cleaning approaches

Conclusion
No investigation of
the quality of ITs
assumptions
- Do not make these
assumptions anymore
- Use our approach to
clean ITs

Contribution 2: Recommendation Systems
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies

Impact of noises on previous studies
Two studies [2,3]
They made the investigated assumptions
Data and code/scripts available
[2] Ying and Robillard, The inﬂuence of the task on programmer bahaviour. ICPC
2011
[3] Lee at al., The impact of view histories on edit recommendations. TSE 2015

Editing Styles
Overview
0 10.5
e1 e2
e = first half if +50% time(e) in first half
Fraction(e = first)
0 to 19% = edit-last
87 to 100% = edit-first
Otherwise = edit-throughout

Editing Styles
Approach
VITs (oracle), RITs, CITs
Evaluation
Precision & recall
RITs
Editing style
VITs Editing style
CITs Editing style
Evaluation
Evaluation
Impact
of noise

Editing Styles
Results
Experiment dataset
Edit-ﬁrst: →
Edit-last: Recall CITs 22%
Edit-throughout: Precision CITs 4%
On average prediction-based Precision CITs 14% and Recall
CITS 7%

Editing Styles
Results
Experiment dataset
Edit-ﬁrst: →
Edit-last: Recall CITs 22%
Edit-throughout: Precision CITs 4%
On average prediction-based Precision CITs 14% and Recall
CITS 7%
Bugzilla data set
1 970 ITs for 4 systems
66%-41% same categorisation
34%-59% diﬀerent categorisation

Recommendation Systems
Same dataset
Same source code
Precision 41% (Threshold-based)
Recall 57% (Prediction-based)

Conclusion
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
assumptions
- Do not make these
assumptions anymore
clean ITs
- Improve
recommendation
systems

Conclusion
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
assumptions
- Do not make these
assumptions anymore
clean ITs
- Improve
recommendation
systems
- Noises bias the
results of previous
studies

Contribution 3: Developers’ Eﬀort
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
No empirical
evidence if
complex
change
required
more effort

Eclipse bug #188083
Patch #74156
File: 2
LOC : 26
+ 18 LOC
- 8 LOC

Eclipse bug #134884
Patch #94002
File: 2
LOC : 20
+ 19 LOC
- 1 LOC

Complexity of the Changes
Which change is more complex?
vs.
Eclipse bug #134884
Patch #94002
File: 2
LOC : 20
+ 19 LOC
- 1 LOC
Eclipse bug #188083
Patch #74156
File: 2
LOC : 26
+ 18 LOC
- 8 LOC

1 How to estimate the eﬀort spend to provide a patch?
2 Does a complex patch need more eﬀort?

Metrics
Developers’ eﬀort
Time Spend: Total duration spent on all ﬁles and their
contents
Cyclomatic complexity: Cyclomatic complexity of the
exploration graph

Metrics
Developers’ effort
Time Spend: Total duration spent on all files and their
contents
Cyclomatic complexity: Cyclomatic complexity of the
exploration graph
Complexity of the changes
Entropy: How much the changes are scattered between
files [1]
Change distance: How much difference between the source
code before the changes and source code after.
[1] A. E. Hassan, Predicting faults using the complexity of code changes, ICSE 2009

Matching
How do we match interactions and patches?
2,408 Interactions histories 3,395 Patches
?

Matching
How do we match interactions and patches?
2,408 Interactions histories 3,395 Patches
?
Assumption: An interaction is matched to a patch (i.e., the patch
is the result of the corresponding interaction) if and only if both
are attached to the same bug report, by the same developer at the
same date (date/hour/minutes).

Matching
Unbalanced matchings
Developers modify ﬁles without interacting with them:
Changes not requiring much eﬀort, e.g., propagation of
refactoring
Interactions are not collected when performing the task

Matching
refactoring
F6F4
F5F3
F2
F1
F7
F3
F2
F1
F8

Matching
refactoring
F6F4
F5F3
F2
F1
F7
F3
F2
F1
F8
F7
F6
F5
F9
F2
F3
F1
F4
F8

Results
Eﬀort vs. complexity of the changes
1028 matchings and 217 unbalanced matchings

Results
Eﬀort vs. complexity of the changes
1028 matchings and 217 unbalanced matchings
Developers do not necessary spend more eﬀort on tasks
requiring more complex changes
Time (sec.)
Cyclomatic Complexity
Cyclomatic Complexity
Time (sec.)
Entropy
Entropy
Change distance
Change distance
0.16
0.27
0.31
0.33

Additional Files
Additional Files
Exploring ﬁles that should not be modiﬁed

Additional Files
Additional Files
Significantly relevant files vs. additional (useful and
accidental) files
F4
F1
F2
F5
F6
F3
F7
F9F8
62% 38%

Additional Files
Additional Files
Significantly relevant files vs. additional (useful and
accidental) files
Effort vs. number of additional files: 0.63 (time) and 0.82
(cyclomatic complexity)
F4
F1
F2
F5
F6
F3
F7
F9F8
62% 38%

Conclusion
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
No empirical
evidence if
complex
change
required
more effort
assumptions
- Do not make these
assumptions anymore
clean ITs
- Improve
recommendation
systems
- Noises bias the
results of previous
studies
- Improve
knowledge
- Do not use surrogate
effort with the result
- New feature location
methods and tools

Perfect
equality
0 1
Maximum
inequality
RE
UE RE
Use F-Measure to maximize both precision and recall
0.4

Conclusion
No investigation of
the quality of ITs
No validation
of the
impact of
inacurate ITs
on previous
studies
No empirical
evidence if
complex
change
required
more effort
No systematic
identification
of exploration
strategies
assumptions
- Do not make these
assumptions anymore
clean ITs
- Improve
recommendation
systems
- Noises bias the
results of previous
studies
- Improve
knowledge
- Do not use surrogate
effort with the result
- New feature location
methods and tools
- Improve
knowledge
- Good strategy
- Use strategy to
guide developers

Thesis+of+zéphyrin+soh.ppt

Recommended

Recommended

More Related Content

Similar to Thesis+of+zéphyrin+soh.ppt

Similar to Thesis+of+zéphyrin+soh.ppt (20)

More from Ptidej Team

More from Ptidej Team (20)

Recently uploaded

Recently uploaded (20)

Thesis+of+zéphyrin+soh.ppt