Presentation of Ticket Tagger, a tool demo paper accepted at the International Conference of Software Maintenance and Evolution. Ticket Tagger predicts and assigns labels on newly opened issues on GitHub.
Design For Accessibility: Getting it right from the start
Ticket Tagger at IEEE ICSME 2019
1. Ticket Tagger
machine learning driven ticket classification
github.com/rafaelkallis/ticket-tagger
Rafael Kallis 1
Andrea Di Sorbo 2
Gerardo Canfora 2
Sebastiano Panichella 3
1
University of Zurich, Switzerland
2
University of Sannio, Italy
3
Zurich University of Applied Sciences, Switzerland
2. Ticket Tagger
Introduction
• Issue trackers are essential tools for creating, managing and
addressing issues that occur in software systems.
• A critical aspect for handling and prioritizing issues involves the
assignment of labels to them in order to determine their
type [4].
• The labeling process has a positive impact on the effectiveness
of issue processing [6].
1
4. Ticket Tagger
Motivation
• Manually assigning labels to issues is a labor-intensive and
time-consuming task for project managers [3]
• Current labeling mechanism is scarcely used on GitHub [1, 2].
3
6. Ticket Tagger
Goal
Create a tool that:
• Automatically predicts labels to assign to issues.
• Stimulates the use of labeling mechanisms in software projects.
• Facilitates the issue management and priorization processes.
5
8. Ticket Tagger
Tool
We introduce Ticket Tagger, a tool that leverages machine learning
strategies on issue titles and descriptions for automatically labeling
GitHub issues.
Freely accessible to any developer and can be integrated painlessly
into existing repositories.
github.com/apps/ticket-tagger
7
10. Ticket Tagger
Model Selection
• Ticket Tagger uses fastText [5] for labeling tickets.
• FastText is less resource intensive but still competitive against
deep learning models.
• Model trained with 10k GitHub issues drawn at random for each
of the labels: “bug”, “enhancement”, and “question” (30k total).
9
16. Ticket Tagger
machine learning driven ticket classification
github.com/rafaelkallis/ticket-tagger
Rafael Kallis 1
Andrea Di Sorbo 2
Gerardo Canfora 2
Sebastiano Panichella 3
1
University of Zurich, Switzerland
2
University of Sannio, Italy
3
Zurich University of Applied Sciences, Switzerland
17. Ticket Tagger
Training-Set Preview
__label__bug "scala presentation compiler Scala...
__label__bug "Failed to create an external role...
__label__bug support inline image link bot...
__label__enhancement "Autoplay video on channel...
__label__enhancement Resume from backup with...
__label__enhancement "replace redux store...
__label__question How to disable the log? Your...
__label__question "How to read gradle command...
__label__question Results in Transition...
15
18. References i
T. F. Bissyandé, D. Lo, L. Jiang, L. Réveillere, J. Klein, and
Y. Le Traon.
Got issues? who cares about it? a large scale investigation of
issue trackers from github.
In 2013 IEEE 24th international symposium on software reliability
engineering (ISSRE), pages 188–197. IEEE, 2013.
J. Cabot, J. L. C. Izquierdo, V. Cosentino, and B. Rolandi.
Exploring the use of labels to categorize issues in open-source
software projects.
In 2015 IEEE 22nd International Conference on Software Analysis,
Evolution, and Reengineering (SANER), pages 550–554, 2015.
19. References ii
Q. Fan, Y. Yu, G. Yin, T. Wang, and H. Wang.
Where is the road for issue reports classification based on text
mining?
In International Symposium on Empirical Software Engineering
and Measurement, ESEM 2017, pages 121–130, 2017.
J. L. C. Izquierdo, V. Cosentino, B. Rolandi, A. Bergel, and J. Cabot.
Gila: Github label analyzer.
In 22nd IEEE International Conference on Software Analysis,
Evolution, and Reengineering, SANER 2015, Montreal, QC, Canada,
March 2-6, 2015, pages 479–483, 2015.
A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov.
Bag of tricks for efficient text classification.
arXiv preprint arXiv:1607.01759, 2016.
20. References iii
Z. Liao, D. He, Z. Chen, X. Fan, Y. Zhang, and S. Liu.
Exploring the characteristics of issue-related behaviors in
github using visualization techniques.
IEEE Access, 6:24003–24015, 2018.