SlideShare a Scribd company logo
1 of 48
Download to read offline
Mining System Logs to Learn Error
Predictors 
A Case Study of a Telemetry System
Barbara Russo 
L.E.S.E.R.
Faculty of Computer Science, Free University of Bozen-Bolzano, Italy
Barbara.Russo@unibz.it
Universität Stuttgart - June 9th, 2015
A collaboration between 
Free University of Bozen-Bolzano, Italy 
and 
University of Alberta, Canada 
Barbara Russo, Giancarlo Succi, Witold Pedrycz (2015) Mining system logs to learn
error predictors: a case study of a telemetry system, Empirical Software Engineering:
Volume 20, Issue 4 (2015), pp. 879-927
Universität Stuttgart - June 9th, 2015 2
System events
•  Events describe the behaviour within and across
subsystems or components
–  the system changes over time
•  Logs track events
Universität Stuttgart - June 9th, 2015 3
The value of logs
•  Log events carry information on
–  the software application that generated the event and its
state,
–  the task and the user whose interaction with the system
triggered the event, and
–  the time-stamp at which the event is generated.
Universität Stuttgart - June 9th, 2015 4
Logs can be cryptic
Universität Stuttgart - June 9th, 2015 5
Errors
•  Some behaviours are desirable and some are not
•  Undesirable behaviours are referred to as system errors
–  crashes that immediately stop the system and are easily
identifiable
–  deviations from the expected output that let the system run
and reveal only at completion of system tasks
Universität Stuttgart - June 9th, 2015 6
Meaning of errors
•  Events in error state (errors) act as alerts
–  ? Manifestations of system failures
–  ? Originated from a series of preceding events
–  ? Immediate action must be taken
–  ? Indication of an underlying problem
Universität Stuttgart - June 9th, 2015 7
Goal
•  Analysing the behaviour of a (composite) system by
mining logs of events and predicting future system
misbehaviour
•  Composite: many applications or subsystems
Universität Stuttgart - June 9th, 2015 8
Method
•  Solve a classification problem with SVM
•  Build a sequence abstraction by mining logs
•  Integrate several statistical techniques to control for
data brittleness and accuracy of model selection and
validation
•  Discuss the classification problem at different degree
of defectiveness
Universität Stuttgart - June 9th, 2015 9
Sequences
•  A single event may not suffice to predict system
failures
•  An event sequence is a set of events ordered by their
timestamp occurring within a given time window
•  A sequence abstraction is a representation of identified
sequences in formal format that machines can read
Universität Stuttgart - June 9th, 2015 10
Research question
•  Is the amount and type of information carried by a
sequence enough to predict errors?
Universität Stuttgart - June 9th, 2015 11
Isolating sequences
Universität Stuttgart - June 9th, 2015 12
Different length, different types
Abstracting sequences
Universität Stuttgart - June 9th, 2015 13
µ1 … µn
s7
s30
s2
s14
s10
Same length, same types
Example – sequence type
•  sv1=[0,1,0,1]
•  sv2=[2,1,1,0]
Universität Stuttgart - June 9th, 2015 14
Sequence type
•  µi – number of the events of type i in a sequence
•  sv=[µ1, …,µn] – vector of event multiplicities
•  ρ(sv) = sum of # errors in sequences mapping into sv
Universität Stuttgart - June 9th, 2015 15
Features to feed SVM
•  v= [sv, µ(sv), ν(sv)] – feature
–  µ(sv) = # sequences mapping into sv
–  ν(sv) average # of users in sequences mapping into sv
•  v is an faulty feature if at least one event in one
sequence is in error state
Universität Stuttgart - June 9th, 2015 16
Sequence vector semantic
•  Patterns of system behaviour
–  If µ>1 and ρ>0 such sequences denote a reliability problem
that recurs
•  Distributed teams
–  If ν>1 the comparative analysis of features with ρ>0 or ρ=0
tells whether errors are originated by multi users working
for the same tasks
Universität Stuttgart - June 9th, 2015 17
Example - features
•  v1= [0,1,0,1;1,1], sv1=[0,1,0,1]
–  µ(sv1) =1, ν(sv1)=1, ρ(sv1)=0
•  v2 = [2,1,1,0;1,2], sv2=[2,1,1,0]
–  µ(sv2) =1, ν(sv2)=2, ρ(sv2)=2
Universität Stuttgart - June 9th, 2015 18
The classification problem
19
Data Sets Classifier
Different ex-ante
distributions:
(faulty, non-faulty)
G2 =Non-Faulty
G1= Faulty
Ex-post classification
differs on different
classifier’s thresholds
Features
Classification
•  False Positive = features v that are predicted faulty
but do not contain error(s), ρ(sv)=0
•  True positive = features v that are predicted faulty and
contain error(s), ρ(sv)>0
•  False negative = features v that are predicted non-
faulty but that contain error(s), ρ(sv)>0
•  True negative = features v that are predicted non-faulty
and do not contain error(s), ρ(sv)=0
Universität Stuttgart - June 9th, 2015 20
Measures of accuracy
Universität Stuttgart - June 9th, 2015 21
Build classifiers on historical data
22
Classifier
Training Set
Test
Set
1.  To tune classifier’s parameters
2.  To compute classifier’s fitting
performance
Compare prediction performance
23
Classifier1
Validation
Set
Classifier2
Classifiern
…	
  
Validating sequence abstraction
•  Did we put too much information in our features?
–  Information Gain selects features that most contribute to the
information of a given classification category:
Classification category: sequences with a given number of error events
Universität Stuttgart - June 9th, 2015 24
Control the effect of the dataset
nature
•  Does set balancing increase the quality of prediction?
–  If classification categories are not equally represented in
datasets, classifiers might have low precision even though
true positive rate is high and false positive rate is low.
–  Such imbalanced data sets are very frequent in software
engineering data
Universität Stuttgart - June 9th, 2015 25
Parametric classification
•  The problem varies depending on how many errors we
allow in the system
•  c – cut-off value, i.e., number of errors in a sequence
vector
•  Categories:
–  G1(c)={v = [sv, µ(sv),ν(sv)] | ρ(sv)≥c}
–  G2(c)={v = [sv, µ(sv),ν(sv)] | ρ(sv)<c}
Universität Stuttgart - June 9th, 2015 26
The case study
Universität Stuttgart - June 9th, 2015 27
Business Questions
•  In our case study:
–  Can we use Support Vector Machines to build suitable
predictors?
–  Is there any Support Vector Machine that performs best for
all system applications?
–  Is there any machine that does it for different levels of
reliability requested to the system?
Universität Stuttgart - June 9th, 2015 28
Descriptive analysis across apps
Universität Stuttgart - June 9th, 2015 29
54 datasets out of them
25 with some faulty
features
Across system applications
Universität Stuttgart - June 9th, 2015 30
Applications ordered by size of features set
Percentageoffaultyfeatures
Effects of Information Gain
Universität Stuttgart - June 9th, 2015 31
Splitting data
•  Three approaches to control for artificial assumptions
–  Varying the size of splitting “t-splitting”
–  Reducing features with IG and varying size “t-splitting
reduced”
–  Balancing sets “k-splitting” , i.e., manipulating sets so that
the number of instances in the two categories are balanced
Universität Stuttgart - June 9th, 2015 32
Types of SVM
•  Different kernels
–  Multilayer perceptron
–  Linear
–  Radial Basis Function
Universität Stuttgart - June 9th, 2015 33
Fitting performance ac. applications
Universität Stuttgart - June 9th, 2015 34
Number of applications for which a classifier
outperforms (with MR) the others in quality of fit
Prediction
Universität Stuttgart - June 9th, 2015 35
No filter
Filtered with IG
•  Models with high fitting performance
(bal>0.73)
•  Prediction performance averaged across t-
splitting and models
Findings
•  Better with IG filtering, MP is best across applications,
but it is not the unique (Clustering applications?)
•  Artificial balance does not help to identify a single
classifier, but it helps to increase convergence in those
classifiers that are not reduced with IG
Universität Stuttgart - June 9th, 2015 36
Findings (superior than literature)
•  Best performance at individual application (MP, c=3):
–  1% false positive rate, 94% true positive rate, and 95%
precision
•  Best performance across applications averaged over
models for c=2,
–  9% false positive rate, 78% true positive rate, and 95%
precision,
Universität Stuttgart - June 9th, 2015 37
What predictions can tell managers
•  Application the manages software tools of cars
–  Pervasive in the telemetry system
•  106 distinct features of 10 different event types, 18%
multiple sequences, and 89% with more than one user
•  c=1
•  IG reduction from 12 to 7 still including µ and ν
Universität Stuttgart - June 9th, 2015 38
Confusion matrix: prediction - MP
Universität Stuttgart - June 9th, 2015 39
Prediction - assumptions
•  Behaviour is the same in next three months
•  1000 features
•  Category balance is the one for the test set (fitting)
(39%)
–  390 faulty features and 610 non- faulty features
Universität Stuttgart - June 9th, 2015 40
In numbers
•  We have 390 faulty features and 610 non-faulty
features and 450 predicted faulty features
•  Predicted faulty features that have no error:
–  67 = 11%*610
•  Fail to predict faulty features = 70 =18%*390
Universität Stuttgart - June 9th, 2015 41
Pred pos Pred neg Total
Pos 82% 18% 100%
Neg 11% 89% 100%
Total 45% 54% 100%
Cost of prediction
•  Inspection cost. Wasting time ≥ 67 * average cost to
fix one error
–  There might be more than one error in one sequence on
average
•  Cost for undiscovered errors. Defect slippage ≥ 70
–  Measure of system unreliability
–  Cost to repair errors at late stages (inaccuracy, higher cost
due to pressure, not being able to fix)
Universität Stuttgart - June 9th, 2015 42
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
False Positive Rate
TruePositiveRate
Best prediction
models
Equal chance
Higher cost to fix
undiscovered errors
Higher inspection costs
Prediction
MP
RBF
L
Universität Stuttgart - June 9th, 2015 43
FPr=11%, TPr=82%
Recommendations
•  Select models that first accurately fit historical data
before using them for predictions
–  Best models for quality of fit are not always the best
predictors for all splitting sizes of a feature set
•  Reduce information redundancy
Universität Stuttgart - June 9th, 2015 44
Recommendations
•  Report fitting accuracy
•  Use parametric classification
–  The parameter being the number of errors a sequence must
contain in order to be classified as defective/faulty.
•  Study prediction at different cut-off values or with different
splitting size or balance to solve the prediction problem
independently from the level of reliability requested for the
system and the nature of the data.
Universität Stuttgart - June 9th, 2015 45
Thank you
Universität Stuttgart - June 9th, 2015 46
With artificial balance
•  It does not help to identify a single classifier
•  It helps to increase convergence in those classifiers
that are not reduced with IG
47
With IG filter
48
Best classifiers across
different t-splitting;
classifiers with b<0.73 are
not reported

More Related Content

What's hot

Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?
Lionel Briand
 
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender SystemsHybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
Matthias Braunhofer
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based Testing
Lionel Briand
 
A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...
Lionel Briand
 

What's hot (11)

Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?Can we predict the quality of spectrum-based fault localization?
Can we predict the quality of spectrum-based fault localization?
 
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender SystemsHybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
Hybrid Solution of the Cold-Start Problem in Context-Aware Recommender Systems
 
Real-time ranking with concept drift using expert advice
Real-time ranking with concept drift using expert adviceReal-time ranking with concept drift using expert advice
Real-time ranking with concept drift using expert advice
 
Testing of Cyber-Physical Systems: Diversity-driven Strategies
Testing of Cyber-Physical Systems: Diversity-driven StrategiesTesting of Cyber-Physical Systems: Diversity-driven Strategies
Testing of Cyber-Physical Systems: Diversity-driven Strategies
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based Testing
 
Automating System Test Case Classification and Prioritization for Use Case-Dr...
Automating System Test Case Classification and Prioritization for Use Case-Dr...Automating System Test Case Classification and Prioritization for Use Case-Dr...
Automating System Test Case Classification and Prioritization for Use Case-Dr...
 
A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...
 
South Tyrol Suggests - STS
South Tyrol Suggests - STSSouth Tyrol Suggests - STS
South Tyrol Suggests - STS
 
Synthetic Data Generation for Statistical Testing
Synthetic Data Generation for Statistical TestingSynthetic Data Generation for Statistical Testing
Synthetic Data Generation for Statistical Testing
 
Techniques for Context-Aware and Cold-Start Recommendations
Techniques for Context-Aware and Cold-Start RecommendationsTechniques for Context-Aware and Cold-Start Recommendations
Techniques for Context-Aware and Cold-Start Recommendations
 
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data ManagementContext-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
 

Viewers also liked

Wireless telemetry systems
Wireless telemetry systemsWireless telemetry systems
Wireless telemetry systems
Sneha Suluru
 
Lil' Sebastians Views on Globalization and Enviromentalism
Lil' Sebastians Views on Globalization and EnviromentalismLil' Sebastians Views on Globalization and Enviromentalism
Lil' Sebastians Views on Globalization and Enviromentalism
Kyra Eddy
 

Viewers also liked (17)

Drifty: mobile application for drifters
Drifty: mobile application for driftersDrifty: mobile application for drifters
Drifty: mobile application for drifters
 
IOT Success depends on Integration
IOT Success depends on Integration IOT Success depends on Integration
IOT Success depends on Integration
 
Network Telemetry: Pushing Boundaries
Network Telemetry: Pushing BoundariesNetwork Telemetry: Pushing Boundaries
Network Telemetry: Pushing Boundaries
 
Wireless telemetry systems
Wireless telemetry systemsWireless telemetry systems
Wireless telemetry systems
 
Data transmission and telemetry
Data transmission and telemetryData transmission and telemetry
Data transmission and telemetry
 
Easy to swallow wireless telemetry ppt
Easy to swallow wireless telemetry pptEasy to swallow wireless telemetry ppt
Easy to swallow wireless telemetry ppt
 
Found Poems
Found PoemsFound Poems
Found Poems
 
Bán, cho thuê bộ cắt băng khánh thành giá rẻ nhất tại tp.hcm
Bán, cho thuê bộ cắt băng khánh thành giá rẻ nhất tại tp.hcmBán, cho thuê bộ cắt băng khánh thành giá rẻ nhất tại tp.hcm
Bán, cho thuê bộ cắt băng khánh thành giá rẻ nhất tại tp.hcm
 
Lh firma genel tanıtım sf
Lh firma genel tanıtım sfLh firma genel tanıtım sf
Lh firma genel tanıtım sf
 
ฟอร์มนำเสนอ Computer
ฟอร์มนำเสนอ Computerฟอร์มนำเสนอ Computer
ฟอร์มนำเสนอ Computer
 
Prezentacja fffgfg
Prezentacja fffgfgPrezentacja fffgfg
Prezentacja fffgfg
 
a
aa
a
 
Dairy creamers eng
Dairy creamers engDairy creamers eng
Dairy creamers eng
 
Netiquette
NetiquetteNetiquette
Netiquette
 
Lil' Sebastians Views on Globalization and Enviromentalism
Lil' Sebastians Views on Globalization and EnviromentalismLil' Sebastians Views on Globalization and Enviromentalism
Lil' Sebastians Views on Globalization and Enviromentalism
 
Cho thuê mc dẫn chương trình tiếng hoa chuyên nghiệp tại tp.hcm
Cho thuê mc dẫn chương trình tiếng hoa chuyên nghiệp tại tp.hcmCho thuê mc dẫn chương trình tiếng hoa chuyên nghiệp tại tp.hcm
Cho thuê mc dẫn chương trình tiếng hoa chuyên nghiệp tại tp.hcm
 
Project management ณรงค์ สัพโส
Project management ณรงค์ สัพโสProject management ณรงค์ สัพโส
Project management ณรงค์ สัพโส
 

Similar to Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgart June 2015

Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Luigi Vanfretti
 

Similar to Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgart June 2015 (20)

Survey on Software Defect Prediction
Survey on Software Defect PredictionSurvey on Software Defect Prediction
Survey on Software Defect Prediction
 
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
 
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting wit...
 
Root cause analysis in complex cyber physical systems - Faults Classification
Root cause analysis in complex cyber physical systems - Faults ClassificationRoot cause analysis in complex cyber physical systems - Faults Classification
Root cause analysis in complex cyber physical systems - Faults Classification
 
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree AlgorithmWater Quality Index Calculation of River Ganga using Decision Tree Algorithm
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
 
Six sigma tools an overview
Six sigma tools  an overviewSix sigma tools  an overview
Six sigma tools an overview
 
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
 
The benefits of fine-grained synchronization in deterministic and efficient ...
The benefits of fine-grained synchronization in  deterministic and efficient ...The benefits of fine-grained synchronization in  deterministic and efficient ...
The benefits of fine-grained synchronization in deterministic and efficient ...
 
Time Series Anomaly Detection with .net and Azure
Time Series Anomaly Detection with .net and AzureTime Series Anomaly Detection with .net and Azure
Time Series Anomaly Detection with .net and Azure
 
Survey on Software Defect Prediction
Survey on Software Defect PredictionSurvey on Software Defect Prediction
Survey on Software Defect Prediction
 
presentationIDC - 14MAY2015
presentationIDC - 14MAY2015presentationIDC - 14MAY2015
presentationIDC - 14MAY2015
 
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
 
The evolution of semantic technology evaluation in my own flesh (The 15 tip...
The evolution of semantic technology evaluation in my own flesh (The 15 tip...The evolution of semantic technology evaluation in my own flesh (The 15 tip...
The evolution of semantic technology evaluation in my own flesh (The 15 tip...
 
Who cares about Software Process Modelling? A First Investigation about the P...
Who cares about Software Process Modelling? A First Investigation about the P...Who cares about Software Process Modelling? A First Investigation about the P...
Who cares about Software Process Modelling? A First Investigation about the P...
 
IRJET - Finger Vein Extraction and Authentication System for ATM
IRJET -  	  Finger Vein Extraction and Authentication System for ATMIRJET -  	  Finger Vein Extraction and Authentication System for ATM
IRJET - Finger Vein Extraction and Authentication System for ATM
 
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
 
Ch15 software reliability
Ch15 software reliabilityCh15 software reliability
Ch15 software reliability
 
Six sigma
Six sigmaSix sigma
Six sigma
 
Identifying and classifying unknown Network Disruption
Identifying and classifying unknown Network DisruptionIdentifying and classifying unknown Network Disruption
Identifying and classifying unknown Network Disruption
 
KTH SmarTS Lab - An Introduction to our Research Group and Activities
KTH SmarTS Lab - An Introduction to our Research Group and ActivitiesKTH SmarTS Lab - An Introduction to our Research Group and Activities
KTH SmarTS Lab - An Introduction to our Research Group and Activities
 

Recently uploaded

Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
amilabibi1
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
Kayode Fayemi
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
Kayode Fayemi
 

Recently uploaded (20)

lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
Causes of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCauses of poverty in France presentation.pptx
Causes of poverty in France presentation.pptx
 
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 

Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgart June 2015

  • 1. Mining System Logs to Learn Error Predictors A Case Study of a Telemetry System Barbara Russo L.E.S.E.R. Faculty of Computer Science, Free University of Bozen-Bolzano, Italy Barbara.Russo@unibz.it Universität Stuttgart - June 9th, 2015
  • 2. A collaboration between Free University of Bozen-Bolzano, Italy and University of Alberta, Canada Barbara Russo, Giancarlo Succi, Witold Pedrycz (2015) Mining system logs to learn error predictors: a case study of a telemetry system, Empirical Software Engineering: Volume 20, Issue 4 (2015), pp. 879-927 Universität Stuttgart - June 9th, 2015 2
  • 3. System events •  Events describe the behaviour within and across subsystems or components –  the system changes over time •  Logs track events Universität Stuttgart - June 9th, 2015 3
  • 4. The value of logs •  Log events carry information on –  the software application that generated the event and its state, –  the task and the user whose interaction with the system triggered the event, and –  the time-stamp at which the event is generated. Universität Stuttgart - June 9th, 2015 4
  • 5. Logs can be cryptic Universität Stuttgart - June 9th, 2015 5
  • 6. Errors •  Some behaviours are desirable and some are not •  Undesirable behaviours are referred to as system errors –  crashes that immediately stop the system and are easily identifiable –  deviations from the expected output that let the system run and reveal only at completion of system tasks Universität Stuttgart - June 9th, 2015 6
  • 7. Meaning of errors •  Events in error state (errors) act as alerts –  ? Manifestations of system failures –  ? Originated from a series of preceding events –  ? Immediate action must be taken –  ? Indication of an underlying problem Universität Stuttgart - June 9th, 2015 7
  • 8. Goal •  Analysing the behaviour of a (composite) system by mining logs of events and predicting future system misbehaviour •  Composite: many applications or subsystems Universität Stuttgart - June 9th, 2015 8
  • 9. Method •  Solve a classification problem with SVM •  Build a sequence abstraction by mining logs •  Integrate several statistical techniques to control for data brittleness and accuracy of model selection and validation •  Discuss the classification problem at different degree of defectiveness Universität Stuttgart - June 9th, 2015 9
  • 10. Sequences •  A single event may not suffice to predict system failures •  An event sequence is a set of events ordered by their timestamp occurring within a given time window •  A sequence abstraction is a representation of identified sequences in formal format that machines can read Universität Stuttgart - June 9th, 2015 10
  • 11. Research question •  Is the amount and type of information carried by a sequence enough to predict errors? Universität Stuttgart - June 9th, 2015 11
  • 12. Isolating sequences Universität Stuttgart - June 9th, 2015 12 Different length, different types
  • 13. Abstracting sequences Universität Stuttgart - June 9th, 2015 13 µ1 … µn s7 s30 s2 s14 s10 Same length, same types
  • 14. Example – sequence type •  sv1=[0,1,0,1] •  sv2=[2,1,1,0] Universität Stuttgart - June 9th, 2015 14
  • 15. Sequence type •  µi – number of the events of type i in a sequence •  sv=[µ1, …,µn] – vector of event multiplicities •  ρ(sv) = sum of # errors in sequences mapping into sv Universität Stuttgart - June 9th, 2015 15
  • 16. Features to feed SVM •  v= [sv, µ(sv), ν(sv)] – feature –  µ(sv) = # sequences mapping into sv –  ν(sv) average # of users in sequences mapping into sv •  v is an faulty feature if at least one event in one sequence is in error state Universität Stuttgart - June 9th, 2015 16
  • 17. Sequence vector semantic •  Patterns of system behaviour –  If µ>1 and ρ>0 such sequences denote a reliability problem that recurs •  Distributed teams –  If ν>1 the comparative analysis of features with ρ>0 or ρ=0 tells whether errors are originated by multi users working for the same tasks Universität Stuttgart - June 9th, 2015 17
  • 18. Example - features •  v1= [0,1,0,1;1,1], sv1=[0,1,0,1] –  µ(sv1) =1, ν(sv1)=1, ρ(sv1)=0 •  v2 = [2,1,1,0;1,2], sv2=[2,1,1,0] –  µ(sv2) =1, ν(sv2)=2, ρ(sv2)=2 Universität Stuttgart - June 9th, 2015 18
  • 19. The classification problem 19 Data Sets Classifier Different ex-ante distributions: (faulty, non-faulty) G2 =Non-Faulty G1= Faulty Ex-post classification differs on different classifier’s thresholds Features
  • 20. Classification •  False Positive = features v that are predicted faulty but do not contain error(s), ρ(sv)=0 •  True positive = features v that are predicted faulty and contain error(s), ρ(sv)>0 •  False negative = features v that are predicted non- faulty but that contain error(s), ρ(sv)>0 •  True negative = features v that are predicted non-faulty and do not contain error(s), ρ(sv)=0 Universität Stuttgart - June 9th, 2015 20
  • 21. Measures of accuracy Universität Stuttgart - June 9th, 2015 21
  • 22. Build classifiers on historical data 22 Classifier Training Set Test Set 1.  To tune classifier’s parameters 2.  To compute classifier’s fitting performance
  • 24. Validating sequence abstraction •  Did we put too much information in our features? –  Information Gain selects features that most contribute to the information of a given classification category: Classification category: sequences with a given number of error events Universität Stuttgart - June 9th, 2015 24
  • 25. Control the effect of the dataset nature •  Does set balancing increase the quality of prediction? –  If classification categories are not equally represented in datasets, classifiers might have low precision even though true positive rate is high and false positive rate is low. –  Such imbalanced data sets are very frequent in software engineering data Universität Stuttgart - June 9th, 2015 25
  • 26. Parametric classification •  The problem varies depending on how many errors we allow in the system •  c – cut-off value, i.e., number of errors in a sequence vector •  Categories: –  G1(c)={v = [sv, µ(sv),ν(sv)] | ρ(sv)≥c} –  G2(c)={v = [sv, µ(sv),ν(sv)] | ρ(sv)<c} Universität Stuttgart - June 9th, 2015 26
  • 27. The case study Universität Stuttgart - June 9th, 2015 27
  • 28. Business Questions •  In our case study: –  Can we use Support Vector Machines to build suitable predictors? –  Is there any Support Vector Machine that performs best for all system applications? –  Is there any machine that does it for different levels of reliability requested to the system? Universität Stuttgart - June 9th, 2015 28
  • 29. Descriptive analysis across apps Universität Stuttgart - June 9th, 2015 29 54 datasets out of them 25 with some faulty features
  • 30. Across system applications Universität Stuttgart - June 9th, 2015 30 Applications ordered by size of features set Percentageoffaultyfeatures
  • 31. Effects of Information Gain Universität Stuttgart - June 9th, 2015 31
  • 32. Splitting data •  Three approaches to control for artificial assumptions –  Varying the size of splitting “t-splitting” –  Reducing features with IG and varying size “t-splitting reduced” –  Balancing sets “k-splitting” , i.e., manipulating sets so that the number of instances in the two categories are balanced Universität Stuttgart - June 9th, 2015 32
  • 33. Types of SVM •  Different kernels –  Multilayer perceptron –  Linear –  Radial Basis Function Universität Stuttgart - June 9th, 2015 33
  • 34. Fitting performance ac. applications Universität Stuttgart - June 9th, 2015 34 Number of applications for which a classifier outperforms (with MR) the others in quality of fit
  • 35. Prediction Universität Stuttgart - June 9th, 2015 35 No filter Filtered with IG •  Models with high fitting performance (bal>0.73) •  Prediction performance averaged across t- splitting and models
  • 36. Findings •  Better with IG filtering, MP is best across applications, but it is not the unique (Clustering applications?) •  Artificial balance does not help to identify a single classifier, but it helps to increase convergence in those classifiers that are not reduced with IG Universität Stuttgart - June 9th, 2015 36
  • 37. Findings (superior than literature) •  Best performance at individual application (MP, c=3): –  1% false positive rate, 94% true positive rate, and 95% precision •  Best performance across applications averaged over models for c=2, –  9% false positive rate, 78% true positive rate, and 95% precision, Universität Stuttgart - June 9th, 2015 37
  • 38. What predictions can tell managers •  Application the manages software tools of cars –  Pervasive in the telemetry system •  106 distinct features of 10 different event types, 18% multiple sequences, and 89% with more than one user •  c=1 •  IG reduction from 12 to 7 still including µ and ν Universität Stuttgart - June 9th, 2015 38
  • 39. Confusion matrix: prediction - MP Universität Stuttgart - June 9th, 2015 39
  • 40. Prediction - assumptions •  Behaviour is the same in next three months •  1000 features •  Category balance is the one for the test set (fitting) (39%) –  390 faulty features and 610 non- faulty features Universität Stuttgart - June 9th, 2015 40
  • 41. In numbers •  We have 390 faulty features and 610 non-faulty features and 450 predicted faulty features •  Predicted faulty features that have no error: –  67 = 11%*610 •  Fail to predict faulty features = 70 =18%*390 Universität Stuttgart - June 9th, 2015 41 Pred pos Pred neg Total Pos 82% 18% 100% Neg 11% 89% 100% Total 45% 54% 100%
  • 42. Cost of prediction •  Inspection cost. Wasting time ≥ 67 * average cost to fix one error –  There might be more than one error in one sequence on average •  Cost for undiscovered errors. Defect slippage ≥ 70 –  Measure of system unreliability –  Cost to repair errors at late stages (inaccuracy, higher cost due to pressure, not being able to fix) Universität Stuttgart - June 9th, 2015 42
  • 43. 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 False Positive Rate TruePositiveRate Best prediction models Equal chance Higher cost to fix undiscovered errors Higher inspection costs Prediction MP RBF L Universität Stuttgart - June 9th, 2015 43 FPr=11%, TPr=82%
  • 44. Recommendations •  Select models that first accurately fit historical data before using them for predictions –  Best models for quality of fit are not always the best predictors for all splitting sizes of a feature set •  Reduce information redundancy Universität Stuttgart - June 9th, 2015 44
  • 45. Recommendations •  Report fitting accuracy •  Use parametric classification –  The parameter being the number of errors a sequence must contain in order to be classified as defective/faulty. •  Study prediction at different cut-off values or with different splitting size or balance to solve the prediction problem independently from the level of reliability requested for the system and the nature of the data. Universität Stuttgart - June 9th, 2015 45
  • 46. Thank you Universität Stuttgart - June 9th, 2015 46
  • 47. With artificial balance •  It does not help to identify a single classifier •  It helps to increase convergence in those classifiers that are not reduced with IG 47
  • 48. With IG filter 48 Best classifiers across different t-splitting; classifiers with b<0.73 are not reported