SlideShare a Scribd company logo
1 of 17
Download to read offline
Analysis of Grid log data with Affinity Propagation
G. Modena, M. van Someren
Universiteit van Amsterdam
June 20, 2013
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 1 / 17
Grid Computing
Paradigm vs. implementation
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 2 / 17
Paradigm
On-demand computational power and storage
Distributed, heterogeneous workload
Dynamic network topology
Resources: users, computing nodes, brokers, jobs
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 3 / 17
Implementation
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 4 / 17
Problem domain
Grid
Elevated (job) failure rates
Possible causes
User related failures (configuration errors, buggy code)
Network failure (firewalls blocking traffics, node disappear, errors at
site level)
Resource failure (error on local nodes)
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 5 / 17
Failure patterns
job abortion rate
resubmitted jobs rate
cross-domain traffic patterns
resource starvation
unreachable or black hole nodes
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 6 / 17
Troubleshooting
Manual analysis and correlation of log data Data is generated by
multiple sources
Time consuming sysop involvement
Job traffic is domain dependent
Hard to manage and assess problems
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 7 / 17
Goal
Detect meaningful job traffic patterns in Grid log data
Approach
1 Clustering
2 Get experts to label and accept clusters
3 Train a classifier for interesting clusters
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 8 / 17
Data
Grid Observatory provided a dataset of log files (EGEE)
In one week: 2356 jobs, 16 users, 18 brokers, 25 computing elements
No categories: unlabelled data
Average job path length of 4 hops, duration 7.5 hours, job abortion
rate close to 50%
job until time t (some jobs run very long, or disappear); tried several
values for t (1 hour -1 week)
Extracted 40 features for describing jobs: features of user, input node,
broker, computing element, path
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 9 / 17
Example of features
Duration
Number of transitions between grid nodes
Latest status code
Last grid node visited
Number of resubmissions
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 10 / 17
Step 1: clustering
Affinity Propagation
Represent data points as a cluster graph
Belief propagation
Chosen because
State of the art - proven in multiple domains
No parameter for number of clusters
Found to be fast
Found to find good clusters
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 11 / 17
Method
Affinity Propagation
Given: pairwise similarity between data points
Find: clusters, consisting of exemplar (most characteristic datapoint in
cluster) and members
At each iteration estimate
How well suited a point is to be an exemplar of a cluster
How well suited a point is to be a member of a cluster around
another exemplar
On convergence
Clusters emerge
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 12 / 17
Result of clustering step
12 clusters (longer time periods = more clusters)
Extracted characteristic features for each cluster
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 13 / 17
Step 2: Clusters validation
Presented a list of clusters and their characteristic features to human
experts (NIKHEF)
Experts selected and labeled 5 clusters that were meaningful and large
enough
”Best” clusters have been discovered for values of t between 8 and 24
hours
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 14 / 17
5 useful clusters - with features
Successfully terminated jobs (557): job length, user, resource broker num
jobs, computing units num jobs
Failure involving computing elements and brokers (189): job length, user,
resource broker num jobs, computing units num jobs
High job resubmission count and low resubmission time intervals; possible
black hole node (133): resubmission count, resubmission time interval,
resource broker average job permanence
User related problem; maybe a user submitting jobs to a broker unable to
match her requirements (11): user, resource broker aborted, resource
broker terminated, resource broker average job permanence, resource
broker num jobs
Anomaly possibly caused by resource exhaustion on computing elements
(101): computing element average job permanence, average terminated,
Computing element num jobs
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 15 / 17
Step 3: Classification
Cross-validation with same data
SVM as classifier (boolean and multi-class)
84%accuracy on cross validation for the boolean class case
60% accuracy on cross validation for the multi-class case
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 16 / 17
Conclusion
Our method
Discovered useful classes of jobs on a Grid network
Can be used as filter in monitoring (trend discovery)
Future work
Scale up the dataset
Incorporate a temporal dimension in clustering
Online anytime clustering and classification
G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 17 / 17

More Related Content

What's hot

Quantum computing and machine learning overview
Quantum computing and machine learning overviewQuantum computing and machine learning overview
Quantum computing and machine learning overviewColleen Farrelly
 
Dahlquist bosc 20160709
Dahlquist bosc 20160709Dahlquist bosc 20160709
Dahlquist bosc 20160709GRNsight
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...SAIL_QU
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...IOSR Journals
 
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. Frey
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. FreyMachine Learning in Materials Science and Chemistry, USPTO, Nathan C. Frey
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. FreyNathan Frey, PhD
 

What's hot (10)

Quantum computing and machine learning overview
Quantum computing and machine learning overviewQuantum computing and machine learning overview
Quantum computing and machine learning overview
 
Dahlquist bosc 20160709
Dahlquist bosc 20160709Dahlquist bosc 20160709
Dahlquist bosc 20160709
 
29
2929
29
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
 
Mj upjs
Mj upjsMj upjs
Mj upjs
 
Analysis of the Datasets
Analysis of the DatasetsAnalysis of the Datasets
Analysis of the Datasets
 
FDSE2015
FDSE2015FDSE2015
FDSE2015
 
15
1515
15
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
 
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. Frey
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. FreyMachine Learning in Materials Science and Chemistry, USPTO, Nathan C. Frey
Machine Learning in Materials Science and Chemistry, USPTO, Nathan C. Frey
 

Similar to Analysis of grid log data with Affinity Propagation

Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor NetworksOscar Corcho
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsIRJET Journal
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentMaribel Acosta Deibe
 
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdf
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdfBäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdf
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdfTadiyosHailemichael
 
Web Graph Clustering Using Hyperlink Structure
Web Graph Clustering Using Hyperlink StructureWeb Graph Clustering Using Hyperlink Structure
Web Graph Clustering Using Hyperlink Structureaciijournal
 
600.412.Lecture06
600.412.Lecture06600.412.Lecture06
600.412.Lecture06ragibhasan
 
Block Sampling: Efficient Accurate Online Aggregation in MapReduce
Block Sampling: Efficient Accurate Online Aggregation in MapReduceBlock Sampling: Efficient Accurate Online Aggregation in MapReduce
Block Sampling: Efficient Accurate Online Aggregation in MapReduceVasia Kalavri
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Alexander Decker
 
Multi Object Tracking Methods Based on Particle Filter and HMM
Multi Object Tracking Methods Based on Particle Filter and HMMMulti Object Tracking Methods Based on Particle Filter and HMM
Multi Object Tracking Methods Based on Particle Filter and HMMIJTET Journal
 
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Daniele Dell'Aglio
 
A Multipath Connection Model for Traffic Matrices
A Multipath Connection Model for Traffic MatricesA Multipath Connection Model for Traffic Matrices
A Multipath Connection Model for Traffic MatricesIJERA Editor
 
Term Paper Presentation
Term Paper PresentationTerm Paper Presentation
Term Paper PresentationShubham Singh
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?Anubhav Jain
 
Drsp dimension reduction for similarity matching and pruning of time series ...
Drsp  dimension reduction for similarity matching and pruning of time series ...Drsp  dimension reduction for similarity matching and pruning of time series ...
Drsp dimension reduction for similarity matching and pruning of time series ...IJDKP
 

Similar to Analysis of grid log data with Affinity Propagation (20)

Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor Networks
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather Conditions
 
Crowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality AssessmentCrowdsourcing Linked Data Quality Assessment
Crowdsourcing Linked Data Quality Assessment
 
081.ppt
081.ppt081.ppt
081.ppt
 
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdf
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdfBäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdf
Bäßler2022_Article_UnsupervisedAnomalyDetectionIn.pdf
 
Web Graph Clustering Using Hyperlink Structure
Web Graph Clustering Using Hyperlink StructureWeb Graph Clustering Using Hyperlink Structure
Web Graph Clustering Using Hyperlink Structure
 
600.412.Lecture06
600.412.Lecture06600.412.Lecture06
600.412.Lecture06
 
dc09ttp-2011-thesis
dc09ttp-2011-thesisdc09ttp-2011-thesis
dc09ttp-2011-thesis
 
Block Sampling: Efficient Accurate Online Aggregation in MapReduce
Block Sampling: Efficient Accurate Online Aggregation in MapReduceBlock Sampling: Efficient Accurate Online Aggregation in MapReduce
Block Sampling: Efficient Accurate Online Aggregation in MapReduce
 
Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)Survey on classification algorithms for data mining (comparison and evaluation)
Survey on classification algorithms for data mining (comparison and evaluation)
 
Multi Object Tracking Methods Based on Particle Filter and HMM
Multi Object Tracking Methods Based on Particle Filter and HMMMulti Object Tracking Methods Based on Particle Filter and HMM
Multi Object Tracking Methods Based on Particle Filter and HMM
 
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...Ontology based top-k query answering over massive, heterogeneous, and dynamic...
Ontology based top-k query answering over massive, heterogeneous, and dynamic...
 
A Multipath Connection Model for Traffic Matrices
A Multipath Connection Model for Traffic MatricesA Multipath Connection Model for Traffic Matrices
A Multipath Connection Model for Traffic Matrices
 
Big Data and IOT
Big Data and IOTBig Data and IOT
Big Data and IOT
 
Term Paper Presentation
Term Paper PresentationTerm Paper Presentation
Term Paper Presentation
 
Elementary Concepts of data minig
Elementary Concepts of data minigElementary Concepts of data minig
Elementary Concepts of data minig
 
Project Report (Summer 2016)
Project Report (Summer 2016)Project Report (Summer 2016)
Project Report (Summer 2016)
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?
 
Drsp dimension reduction for similarity matching and pruning of time series ...
Drsp  dimension reduction for similarity matching and pruning of time series ...Drsp  dimension reduction for similarity matching and pruning of time series ...
Drsp dimension reduction for similarity matching and pruning of time series ...
 

More from Gabriele Modena

Resilient Distributed Datasets
Resilient Distributed DatasetsResilient Distributed Datasets
Resilient Distributed DatasetsGabriele Modena
 
Estimating mental states of a depressed person with Bayesian Networks
Estimating mental states of a depressed person with Bayesian NetworksEstimating mental states of a depressed person with Bayesian Networks
Estimating mental states of a depressed person with Bayesian NetworksGabriele Modena
 
Moving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureMoving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureGabriele Modena
 
Approximation algorithms for stream and batch processing
Approximation algorithms for stream and batch processingApproximation algorithms for stream and batch processing
Approximation algorithms for stream and batch processingGabriele Modena
 
Full stack analytics with Hadoop 2
Full stack analytics with Hadoop 2Full stack analytics with Hadoop 2
Full stack analytics with Hadoop 2Gabriele Modena
 
Analysis of interrupt latencies in a real-time kernel
Analysis of interrupt latencies in a real-time kernelAnalysis of interrupt latencies in a real-time kernel
Analysis of interrupt latencies in a real-time kernelGabriele Modena
 

More from Gabriele Modena (6)

Resilient Distributed Datasets
Resilient Distributed DatasetsResilient Distributed Datasets
Resilient Distributed Datasets
 
Estimating mental states of a depressed person with Bayesian Networks
Estimating mental states of a depressed person with Bayesian NetworksEstimating mental states of a depressed person with Bayesian Networks
Estimating mental states of a depressed person with Bayesian Networks
 
Moving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureMoving Towards a Streaming Architecture
Moving Towards a Streaming Architecture
 
Approximation algorithms for stream and batch processing
Approximation algorithms for stream and batch processingApproximation algorithms for stream and batch processing
Approximation algorithms for stream and batch processing
 
Full stack analytics with Hadoop 2
Full stack analytics with Hadoop 2Full stack analytics with Hadoop 2
Full stack analytics with Hadoop 2
 
Analysis of interrupt latencies in a real-time kernel
Analysis of interrupt latencies in a real-time kernelAnalysis of interrupt latencies in a real-time kernel
Analysis of interrupt latencies in a real-time kernel
 

Recently uploaded

Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 

Recently uploaded (20)

Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 

Analysis of grid log data with Affinity Propagation

  • 1. Analysis of Grid log data with Affinity Propagation G. Modena, M. van Someren Universiteit van Amsterdam June 20, 2013 G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 1 / 17
  • 2. Grid Computing Paradigm vs. implementation G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 2 / 17
  • 3. Paradigm On-demand computational power and storage Distributed, heterogeneous workload Dynamic network topology Resources: users, computing nodes, brokers, jobs G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 3 / 17
  • 4. Implementation G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 4 / 17
  • 5. Problem domain Grid Elevated (job) failure rates Possible causes User related failures (configuration errors, buggy code) Network failure (firewalls blocking traffics, node disappear, errors at site level) Resource failure (error on local nodes) G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 5 / 17
  • 6. Failure patterns job abortion rate resubmitted jobs rate cross-domain traffic patterns resource starvation unreachable or black hole nodes G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 6 / 17
  • 7. Troubleshooting Manual analysis and correlation of log data Data is generated by multiple sources Time consuming sysop involvement Job traffic is domain dependent Hard to manage and assess problems G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 7 / 17
  • 8. Goal Detect meaningful job traffic patterns in Grid log data Approach 1 Clustering 2 Get experts to label and accept clusters 3 Train a classifier for interesting clusters G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 8 / 17
  • 9. Data Grid Observatory provided a dataset of log files (EGEE) In one week: 2356 jobs, 16 users, 18 brokers, 25 computing elements No categories: unlabelled data Average job path length of 4 hops, duration 7.5 hours, job abortion rate close to 50% job until time t (some jobs run very long, or disappear); tried several values for t (1 hour -1 week) Extracted 40 features for describing jobs: features of user, input node, broker, computing element, path G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 9 / 17
  • 10. Example of features Duration Number of transitions between grid nodes Latest status code Last grid node visited Number of resubmissions G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 10 / 17
  • 11. Step 1: clustering Affinity Propagation Represent data points as a cluster graph Belief propagation Chosen because State of the art - proven in multiple domains No parameter for number of clusters Found to be fast Found to find good clusters G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 11 / 17
  • 12. Method Affinity Propagation Given: pairwise similarity between data points Find: clusters, consisting of exemplar (most characteristic datapoint in cluster) and members At each iteration estimate How well suited a point is to be an exemplar of a cluster How well suited a point is to be a member of a cluster around another exemplar On convergence Clusters emerge G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 12 / 17
  • 13. Result of clustering step 12 clusters (longer time periods = more clusters) Extracted characteristic features for each cluster G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 13 / 17
  • 14. Step 2: Clusters validation Presented a list of clusters and their characteristic features to human experts (NIKHEF) Experts selected and labeled 5 clusters that were meaningful and large enough ”Best” clusters have been discovered for values of t between 8 and 24 hours G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 14 / 17
  • 15. 5 useful clusters - with features Successfully terminated jobs (557): job length, user, resource broker num jobs, computing units num jobs Failure involving computing elements and brokers (189): job length, user, resource broker num jobs, computing units num jobs High job resubmission count and low resubmission time intervals; possible black hole node (133): resubmission count, resubmission time interval, resource broker average job permanence User related problem; maybe a user submitting jobs to a broker unable to match her requirements (11): user, resource broker aborted, resource broker terminated, resource broker average job permanence, resource broker num jobs Anomaly possibly caused by resource exhaustion on computing elements (101): computing element average job permanence, average terminated, Computing element num jobs G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 15 / 17
  • 16. Step 3: Classification Cross-validation with same data SVM as classifier (boolean and multi-class) 84%accuracy on cross validation for the boolean class case 60% accuracy on cross validation for the multi-class case G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 16 / 17
  • 17. Conclusion Our method Discovered useful classes of jobs on a Grid network Can be used as filter in monitoring (trend discovery) Future work Scale up the dataset Incorporate a temporal dimension in clustering Online anytime clustering and classification G. Modena, M. van Someren (UvA) Analysis of Grid log data with AP June 20, 2013 17 / 17