Scam2011 syer

•Download as PPT, PDF•

0 likes•102 views

SAIL_QU

Identifying Performance
Deviations in Thread Pools
Mark D. Syer, Bram Adams and Ahmed E. Hassan
mdsyer@cs.queensu.ca
Software Analysis and Intelligence Lab
Queen’s University, Canada
1

3
4
How to detect
performance
deviations?
Needle in a haystack!
Complex
architectures
Ultra-Large-Scale
Software Systems

Potential solutions must account for…
• Huge amounts of data
• Limited system knowledge
4
Potential solutions should…
• Limit manual review/analysis

5
4
How to detect
performance
deviations?
Hundreds of machines
Thousands of threads
Needle in a haystack!
Complex
architectures
Simulate user actions
Monitor thread behaviour by
collecting resource usage metrics

Simulate user actions
Monitor thread behaviour by
collecting resource usage metrics
6
43
21
Group threads depending on their machine ID

7
3
21
4
1
2
4
3
Group similar behaviour
Similar
Dissimilar
Find dissimilar behaviour

8
Abstraction
Clustering
Ranking
Group threads
Find dissimilar
behaviour
Group similar
behaviour
How to group
threads?
1
2
3

Abstraction
Ranking
Clustering
9
Abstraction
Clustering
Ranking
Machines
Threads

11
6:00 8:004:00
Abstracting by
Space
Vs.
Time
RQ1: What time period
has the most deviations?
RQ2: What threads have
deviations?

Deviations Injected into Thread Data
12
Memory Leak Injected

Deviations Injected into Thread Data
13
CPU Spike Injected

Most Deviations are Identified
Precision Recall
Top Level 100% 100%
Thread Level 100% 76.61%
14
1-recall 2-recall 3-recall
86.67% 96.67% 100%

Viewers also liked

Icst2012 zamanSAIL_QU

Scam2010 thomas presentationSAIL_QU

Icse2013 shangSAIL_QU

Ian wcre2011SAIL_QU

Bettenburg icpc2011SAIL_QU

Compsac2010 malikSAIL_QU

Icpc2010 bettenburgSAIL_QU

Icse2011 build maintenanceSAIL_QU

Bdsys icsm v3.5SAIL_QU

Msr2012 adamsSAIL_QU

Icsm2009 alamSAIL_QU

Ase2010 shangSAIL_QU

Icse2011 srcSAIL_QU

Msr2012 bettenburg presentationSAIL_QU

Icpc2011 syerSAIL_QU

Kcsd2009 emadSAIL_QU

Icse2011 thomas posterSAIL_QU

Qsic2010 shihabSAIL_QU

Esem2010 shihabSAIL_QU

Aosd2009 adamsSAIL_QU

Viewers also liked (20)

Icst2012 zaman

Scam2010 thomas presentation

Icse2013 shang

Ian wcre2011

Bettenburg icpc2011

Compsac2010 malik

Icpc2010 bettenburg

Icse2011 build maintenance

Bdsys icsm v3.5

Msr2012 adams

Icsm2009 alam

Ase2010 shang

Icse2011 src

Msr2012 bettenburg presentation

Icpc2011 syer

Kcsd2009 emad

Icse2011 thomas poster

Qsic2010 shihab

Esem2010 shihab

Aosd2009 adams

Similar to Scam2011 syer

Rapid pruning of search space through hierarchical matchinglucenerevolution

10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace

Duraspace Hot Topics Series 6: Metadata and Repository ServicesMatthew Critchlow

RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...S. Diana Hu

RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...Joaquin Delgado PhD.

AIAA Conference - Big Data Session_ Final - Jan 2016Manjula Ambur

Where Search Meets Machine Learning: Presented by Diana Hu & Joaquin Delgado,...Lucidworks

Lucene/Solr Revolution 2015: Where Search Meets Machine LearningJoaquin Delgado PhD.

Lucene/Solr Revolution 2015: Where Search Meets Machine LearningS. Diana Hu

Machine Learning Summary for Caltech2Lukas Mandrake

Query-time Nonparametric Regression with Temporally Bounded Models - Patrick ...Lucidworks

How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...Databricks

Summit EU Machine LearningMapR Technologies

Turning Information chaos into reliable dataCareer Communications Group

2023 Supervised Learning for Orange3 from scratchFEG

Presentation of Apache Cassandra Nikiforos Botis

AI Possibilities for DDIAndreas Taudte

Introduction to apache spark and machine learningAwoyemi Ezekiel

Business intelligence and data warehousingVaishnavi

Elastic search from the trenchesVinícius Carvalho

Similar to Scam2011 syer (20)

Rapid pruning of search space through hierarchical matching

10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...

Duraspace Hot Topics Series 6: Metadata and Repository Services

RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...

RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...

AIAA Conference - Big Data Session_ Final - Jan 2016

Where Search Meets Machine Learning: Presented by Diana Hu & Joaquin Delgado,...

Lucene/Solr Revolution 2015: Where Search Meets Machine Learning

Machine Learning Summary for Caltech2

Query-time Nonparametric Regression with Temporally Bounded Models - Patrick ...

How Azure Databricks helped make IoT Analytics a Reality with Janath Manohara...

Summit EU Machine Learning

Turning Information chaos into reliable data

2023 Supervised Learning for Orange3 from scratch

Presentation of Apache Cassandra

AI Possibilities for DDI

Introduction to apache spark and machine learning

Business intelligence and data warehousing

Elastic search from the trenches

More from SAIL_QU

Studying the Integration Practices and the Evolution of Ad Libraries in the G...SAIL_QU

Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU

Improving the testing efficiency of selenium-based load testsSAIL_QU

Studying User-Developer Interactions Through the Distribution and Reviewing M...SAIL_QU

Studying online distribution platforms for games through the mining of data f...SAIL_QU

Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...SAIL_QU

Investigating the Challenges in Selenium Usage and Improving the Testing Effi...SAIL_QU

Mining Development Knowledge to Understand and Support Software Logging Pract...SAIL_QU

Which Log Level Should Developers Choose For a New Logging Statement?SAIL_QU

Towards Just-in-Time Suggestions for Log ChangesSAIL_QU

The Impact of Task Granularity on Co-evolution AnalysesSAIL_QU

A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...SAIL_QU

How are Discussions Associated with Bug Reworking? An Empirical Study on Open...SAIL_QU

A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...SAIL_QU

A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...SAIL_QU

Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU

What Do Programmers Know about Software Energy Consumption?SAIL_QU

Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...SAIL_QU

Revisiting the Experimental Design Choices for Approaches for the Automated R...SAIL_QU

Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsSAIL_QU

More from SAIL_QU (20)

Studying the Integration Practices and the Evolution of Ad Libraries in the G...

Studying the Dialogue Between Users and Developers of Free Apps in the Google...

Improving the testing efficiency of selenium-based load tests

Studying User-Developer Interactions Through the Distribution and Reviewing M...

Studying online distribution platforms for games through the mining of data f...

Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...

Investigating the Challenges in Selenium Usage and Improving the Testing Effi...

Mining Development Knowledge to Understand and Support Software Logging Pract...

Which Log Level Should Developers Choose For a New Logging Statement?

Towards Just-in-Time Suggestions for Log Changes

The Impact of Task Granularity on Co-evolution Analyses

A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...

How are Discussions Associated with Bug Reworking? An Empirical Study on Open...

A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...

A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...

Studying the Dialogue Between Users and Developers of Free Apps in the Google...

What Do Programmers Know about Software Energy Consumption?

Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...

Revisiting the Experimental Design Choices for Approaches for the Automated R...

Measuring Program Comprehension: A Large-Scale Field Study with Professionals

Scam2011 syer

1. Identifying Performance Deviations in Thread Pools Mark D. Syer, Bram Adams and Ahmed E. Hassan mdsyer@cs.queensu.ca Software Analysis and Intelligence Lab Queen’s University, Canada 1

2. 2

3. 3 4 How to detect performance deviations? Needle in a haystack! Complex architectures Ultra-Large-Scale Software Systems

4. Potential solutions must account for… • Huge amounts of data • Limited system knowledge 4 Potential solutions should… • Limit manual review/analysis

5. 5 4 How to detect performance deviations? Hundreds of machines Thousands of threads Needle in a haystack! Complex architectures Simulate user actions Monitor thread behaviour by collecting resource usage metrics

6. Simulate user actions Monitor thread behaviour by collecting resource usage metrics 6 43 21 Group threads depending on their machine ID

7. 7 3 21 4 1 2 4 3 Group similar behaviour Similar Dissimilar Find dissimilar behaviour

8. 8 Abstraction Clustering Ranking Group threads Find dissimilar behaviour Group similar behaviour How to group threads? 1 2 3

9. Abstraction Ranking Clustering 9 Abstraction Clustering Ranking Machines Threads

10. 10 does it work?

11. 11 6:00 8:004:00 Abstracting by Space Vs. Time RQ1: What time period has the most deviations? RQ2: What threads have deviations?

12. Deviations Injected into Thread Data 12 Memory Leak Injected

13. Deviations Injected into Thread Data 13 CPU Spike Injected

14. Most Deviations are Identified Precision Recall Top Level 100% 100% Thread Level 100% 76.61% 14 1-recall 2-recall 3-recall 86.67% 96.67% 100%

15. 15

Editor's Notes

Title Slide
0. We increasing rely on ultra large scale software systems for e-commerce, personal and professional networking and telecommunications. 1. 2.
0. We increasing rely on ultra large scale software systems for e-commerce, personal and professional networking and telecommunications. 1. Complex system 2. How to detect performance deviations? Needle in a haystack
Collect metrics for processes 1. Load test the system by simulating user actions 2. Collect resource usage metrics (e.g., cpu usage and memory allocation) for each thread
Collect metrics for processes 0. Abstract metrics, group all the processes by which machine they are running on 1. Plot resource usage metrics
Identify and group similar behaviour 1. Group 1 and 2 2. Group 1,2 and 3 3. Group 1,2,3 and 4 4. Rank But, this still leaves us comparing machines, we want to find the processes that are deviating. We can use our ranking to look as just the processes running on machine 4.
Methodology 1. In general, you must decide how to group processes. In our previous example we grouped by machines, but how grouping is done more generally is open to analysts
Top Down 1. In the previous example we analyzed at the machine level, but still don’t know which threads are deviating 2. Use the machine level ranking to inform our selection of threads, and analyze the threads of the deviating machine
We have about 100 hours of load test data
As opposed to grouping by space (i.e., machines) we group by time 1. Group by machines 2. Group by time 2a. Top level become time (in our previous example we compared the behaviour of machines, now we compare the behaviour of threads from different times) 3. 4. RQ1 5. RQ2
Quantitative Evaluation – Defect Injection The types of deviations that were injected was based on previous research and consultations with system experts
Quantitative Evaluation – Defect Injection (CPU Spikes)
Quantitative Evaluation – Results 1. k - recall – did we correctly rank the time period with the most deviations?
Conclusion 1-3. Slides 2-4

Scam2011 syer

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (20)

Similar to Scam2011 syer

Similar to Scam2011 syer (20)

More from SAIL_QU

More from SAIL_QU (20)

Scam2011 syer

Editor's Notes