3. Motivations
1.Tag right?
2. When will be answered?
~Help users to tag their questions on
stackoverflow.com more properly
DEMO:
www.stackoverflowtags.tech
6. Query #1
• Prob. of a question labeled with specific tag(such as tag A) will be
answered in 10 mins
= number of questions answered in 10 mins and tagged with A/ total number of questions tagged with A
7. question_id tags answer_time(sec) posted_at
231 Java 3010 2016_01_02_21_20_01
290 spark 7381 2016_01_02_22_09_01
341 Java 5611 2016_01_10_01_02_05
Query #1
• Prob. of a question labeled with specific tag(such as tag A) will be
answered in 10 mins
= number of questions answered in 10 mins and tagged with A/ total number of questions tagged with A
8. question_id tags answer_time(sec) posted_at
231 Java 3010 2016_01_02_21_20_01
290 spark 7381 2016_01_02_22_09_01
341 Java 5611 2016_01_10_01_02_05
Query #1
• Prob. of a question labeled with specific tag(such as tag A) will be
answered in 10 mins
= number of questions answered in 10 mins and tagged with A/ total number of questions tagged with A
14. • User entered tagA
Query #2
tag A
tag C tag B
tag D
3
2
5
5 10
3
7
15. • User entered tagA
• Search all neighbors
of tagA and compute
their similarity to A.
Query #2
tag A
tag C tag B
tag D
3
2
5
5 10
3
7
SC->A=3/5=0.6
SD->A=2/3=0.67
SB>A=5/10=0.5
16. • User entered tagA
• Search all neighbors
of tagA and compute
their similarity to A.
• Sort B,C,D by their
similarity to A
Query #2
tag A
tag C tag B
tag D
3
2
5
5 10
3
7
SC->A=3/5=0.6
SD->A=2/3=0.67
SB>A=5/10=0.5
17. • User entered tagA
• Search all neighbors of
tagA and compute their
similarity to A.
• Sort B,C,D by their
similarity to A
• Give result to user
Query #2
tag A
tag C tag B
tag D
3
2
5
5 10
3
7
SC->A=3/5=0.6
SD->A=2/3=0.67
SB>A=5/10=0.5
18. Challenges and Future Considerations
• Streaming processing to update information
• Process big data
• Scale up the performance of sorting in graph
19. About Me
• Chentao(Sam) Zhang
• MS in Electrical & Computer
Engineering from University of
Delaware
• Passionated to learn and try
new things
20. Query #1
tags
Prob. of being answered in
10 mins Avg time(sec)
Java 0.32 1200
spark 0.013 31000
21. tags
Prob. of being answered in
10 mins Avg time(sec)
Java 0.32 1200
spark 0.013 31000
Query #1