SlideShare a Scribd company logo
FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION




                 Merging Event Logs in ProM
                                           Jan Claes
                                       Ghent University
                                http://processmining.ugent.be




Faculty of Economics and Business Administration                                             Jan Claes for TUe 2012
Department of Management Information and Operations Management                                     6 February, 2012
Merging Event Logs




                                                       ?
      Multiple event logs                       ProM plugin      Merged event log

Faculty of Economics and Business Administration                        Jan Claes for TUe 2012
Department of Management Information and Operations Management                           2 / 21
Merging Event Logs




1. Find links      2. Merge chronologically            3. Add unlinked traces   4. Put in new log file
Faculty of Economics and Business Administration                                     Jan Claes for TUe 2012
Department of Management Information and Operations Management                                        3 / 21
Approaches

Genetic Algorithm
        J. Claes, G. Poels, Integrating Computer Log Files for Process Mining: a Genetic
         Algorithm Inspired Technique, in CAiSE 2011 Workshops, LNBIP 83, 2011

Artificial Immune System
        J. Claes, G. Poels, Merging Computer Log Files for Process Mining: an Artificial
         Immune System Technique, in BPM 2011 Workshops, LNBIP 99, 2011

Rule Based
        J. Claes, G. Poels, Merging Event Logs for Process Mining: A Rule Based Merging
         Method and Rule Suggestion Algorithm, to be submitted in 2012




Faculty of Economics and Business Administration                          Jan Claes for TUe 2012
Department of Management Information and Operations Management                             4 / 21
FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION




                                 1. Genetic Algorithm




Faculty of Economics and Business Administration                                             Jan Claes for TUe 2012
Department of Management Information and Operations Management                                     6 February, 2012
1. Genetic Algorithm




                                                     SEL cross-over
                            RAND       fitness                             MUT
                                                     POP
                             POP                          mutation         POP




                                     Selection              Reproduction




Faculty of Economics and Business Administration                                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                                    6 / 21
1. Genetic Algorithm

Fitness function
        Sum of weighted factor scores per link
             •   Same trace id (STIi)
             •   Trace order (TOi) if all start events are in the first log
             •   Equal attribute values (EAVi)
             •   Number of linked traces (NLTi)
             •   Time distance (TDi)




Faculty of Economics and Business Administration                  Jan Claes for TUe 2012
Department of Management Information and Operations Management                     7 / 21
1. Genetic Algorithm

Simplification
      Population size one
      Only mutations
Improvements
      More intelligent start population (not random)
      More intelligent mutations (improve at least one
       factor of the fitness function)
Attention
      Intensification vs. diversification
Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                    8 / 21
FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION




                         2. Artificial Immune system




Faculty of Economics and Business Administration                                             Jan Claes for TUe 2012
Department of Management Information and Operations Management                                     6 February, 2012
2. Artificial Immune System


                                                                 Immune cells
                                                                  (type B-cell)




                                               Antigen
                         Antibodies
                         (receptor)




Faculty of Economics and Business Administration                                  Jan Claes for TUe 2012
Department of Management Information and Operations Management                                    10 / 21
2. Artificial Immune System

       HIGH                                  HIGH                                   HIGH


                                                                  mutations
                 INIT               sorted                CLONE               MUT             EDIT
                 POP                 POP                   POP                POP             POP

RAND
 POP                                         LOW                                    LOW
                                                                         Affinity maturation
         Initial population             Clonal selection            Hypermutation Receptor editing

                                                         SEED
       LOW



 Faculty of Economics and Business Administration                                    Jan Claes for TUe 2012
 Department of Management Information and Operations Management                                      11 / 21
2. Artificial Immune System

Clonal selection
        Clone the fittest x% solutions (I)
Hypermutation
        Randomly change each clone
        The higher the fitness score, the less changes (I)
Receptor editing
        Take the best y% solutions (I)
        Add totally random solutions to the set (D)
                               (I: Intensification, D: Diversification)
Faculty of Economics and Business Administration                          Jan Claes for TUe 2012
Department of Management Information and Operations Management                            12 / 21
2. Artificial Immune System

Hypermutation
     Choose ‘random’ indicator factor to improve
           • Higher chance to pick factors with positive previous effect
     Choose random action
           • Add link, remove link or alter link
     Choose random candidate
           • From all solutions that would improve with selected action
     Choose random improvement
           • From all possible improvements for selected candidate

Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   13 / 21
FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION




                                         3. Rule Based




Faculty of Economics and Business Administration                                             Jan Claes for TUe 2012
Department of Management Information and Operations Management                                     6 February, 2012
3. Rule Based

Automatic merging is not transparant
 (how good is the merging result?)
Previous algorithms are (too) slow
My experience
        in most cases it is about finding an attribute value
         (literally) in a trace of the other log
        you need data experts/analyst to get the right
         data, they mostly have a good idea about the link
         between two log files
Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   15 / 21
3. Rule Based

Semi-automatic solution
        Let user configure merging rule based on attribute
         values
             • More transparent
             • Faster
             • Includes expert knowledge if available
        Help user by suggesting merging rules based on
         the data in the log


Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   16 / 21
3. Rule Based

Merging rules
   Merge all traces where…
       attribute <select name> from <select container> in the 1st log
       <select operator>
       attribute <select name> from <select container> in the 2nd log

   E.g. Merge all traces where attribute Trace ID from a trace in
    the 1st log equals attribute Supplier Reference from event Send
    goods in the 2nd log



Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   17 / 21
3. Rule Based

   <select name>
         • Contains all possible attribute names available in the log
   <select container>
         •   From a trace
         •   From any event in a trace
         •   From a trace or any event in a trace
         •   From event X, From event Y, From event Z, …
   <select operator>
         • equals, is not equal, greater than, greater or equal, …
         • comes before, comes after
Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   18 / 21
3. Rule Based

Suggesting rules
          Look at all attribute values in the log
          Make a rule for every equal match in both logs
          Count the number of linked traces for every rule
          Filter rules with only one link
          Sort such that rule that is closer to 1-to-1 match is
           higher in the list
             • rules that make more or fewer links are lower in the list
             • if no 1-to-1 rule exist, the ‘best’ rule is still on top

Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   19 / 21
3. Rule Based

Some remarks
        User can configure rules or select from the
         suggestion list
        Suggestion list is currently limited to equals-rules
         but is calculated very fast (order n1 + n2 !)
        Rules can be combined with And or Or
        By explicitly selecting rules, the approach is more
         transparent
        Possible use as shortcut for merging logs from
         within one system
Faculty of Economics and Business Administration                 Jan Claes for TUe 2012
Department of Management Information and Operations Management                   20 / 21
Contact information




                                          Jan Claes
                                          jan.claes@ugent.be

                                          http://processmining.ugent.be
                                          Twitter: @janclaesbelgium
                                          Pav D8.a (until February 10)




Faculty of Economics and Business Administration                          Jan Claes for TUe 2012
Department of Management Information and Operations Management                            21 / 21

More Related Content

Similar to ProM 2012

EIS 2011
EIS 2011EIS 2011
EIS 2011
Jan Claes
 
Stad Gent 2012
Stad Gent 2012Stad Gent 2012
Stad Gent 2012
Jan Claes
 
Confenis 2012
Confenis 2012Confenis 2012
Confenis 2012
Jan Claes
 
Process Mining by Jan Claes
Process Mining by Jan ClaesProcess Mining by Jan Claes
Process Mining by Jan Claes
CONFENIS 2012
 
Confenis2012DC
Confenis2012DCConfenis2012DC
Confenis2012DC
Jan Claes
 
Confenis Conference Presentation
Confenis Conference PresentationConfenis Conference Presentation
Confenis Conference Presentation
Maxime Bernaert
 
INISET@CAiSE 2011
INISET@CAiSE 2011INISET@CAiSE 2011
INISET@CAiSE 2011
Jan Claes
 
Managing Business Processes Communication and Performance
Managing Business Processes Communication and Performance Managing Business Processes Communication and Performance
Managing Business Processes Communication and Performance
Yves Caseau
 

Similar to ProM 2012 (8)

EIS 2011
EIS 2011EIS 2011
EIS 2011
 
Stad Gent 2012
Stad Gent 2012Stad Gent 2012
Stad Gent 2012
 
Confenis 2012
Confenis 2012Confenis 2012
Confenis 2012
 
Process Mining by Jan Claes
Process Mining by Jan ClaesProcess Mining by Jan Claes
Process Mining by Jan Claes
 
Confenis2012DC
Confenis2012DCConfenis2012DC
Confenis2012DC
 
Confenis Conference Presentation
Confenis Conference PresentationConfenis Conference Presentation
Confenis Conference Presentation
 
INISET@CAiSE 2011
INISET@CAiSE 2011INISET@CAiSE 2011
INISET@CAiSE 2011
 
Managing Business Processes Communication and Performance
Managing Business Processes Communication and Performance Managing Business Processes Communication and Performance
Managing Business Processes Communication and Performance
 

More from Jan Claes

COGNISE@CAiSE 2019
COGNISE@CAiSE 2019COGNISE@CAiSE 2019
COGNISE@CAiSE 2019
Jan Claes
 
BPMS2@BPM2018
BPMS2@BPM2018BPMS2@BPM2018
BPMS2@BPM2018
Jan Claes
 
ICLTC 2018
ICLTC 2018ICLTC 2018
ICLTC 2018
Jan Claes
 
EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018
Jan Claes
 
BPM Cluster Meeting 2018
BPM Cluster Meeting 2018BPM Cluster Meeting 2018
BPM Cluster Meeting 2018
Jan Claes
 
Research: Why? What? How?
Research: Why? What? How?Research: Why? What? How?
Research: Why? What? How?
Jan Claes
 
BPMS2@BPM2016
BPMS2@BPM2016BPMS2@BPM2016
BPMS2@BPM2016
Jan Claes
 
TEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD ContestTEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD Contest
Jan Claes
 
PhD defense November 2015
PhD defense November 2015PhD defense November 2015
PhD defense November 2015
Jan Claes
 
PhD pre-defense September 2015
PhD pre-defense September 2015PhD pre-defense September 2015
PhD pre-defense September 2015
Jan Claes
 
UGent MIS research seminar June 2015
UGent MIS research seminar June 2015UGent MIS research seminar June 2015
UGent MIS research seminar June 2015
Jan Claes
 
UGent MIS research seminar December 2014
UGent MIS research seminar December 2014UGent MIS research seminar December 2014
UGent MIS research seminar December 2014
Jan Claes
 
BPM Cluster Meeting 2014
BPM Cluster Meeting 2014BPM Cluster Meeting 2014
BPM Cluster Meeting 2014
Jan Claes
 
PhD Day 2014
PhD Day 2014PhD Day 2014
PhD Day 2014
Jan Claes
 
Colloquium@TUe
Colloquium@TUeColloquium@TUe
Colloquium@TUe
Jan Claes
 
COGNISE@CAiSE2013
COGNISE@CAiSE2013COGNISE@CAiSE2013
COGNISE@CAiSE2013
Jan Claes
 
ECIS2013DC
ECIS2013DCECIS2013DC
ECIS2013DC
Jan Claes
 
PhD Day 2013
PhD Day 2013PhD Day 2013
PhD Day 2013
Jan Claes
 
EIS 2012
EIS 2012EIS 2012
EIS 2012
Jan Claes
 
BPM2012
BPM2012BPM2012
BPM2012
Jan Claes
 

More from Jan Claes (20)

COGNISE@CAiSE 2019
COGNISE@CAiSE 2019COGNISE@CAiSE 2019
COGNISE@CAiSE 2019
 
BPMS2@BPM2018
BPMS2@BPM2018BPMS2@BPM2018
BPMS2@BPM2018
 
ICLTC 2018
ICLTC 2018ICLTC 2018
ICLTC 2018
 
EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018EMMSAD++@CAiSE 2018
EMMSAD++@CAiSE 2018
 
BPM Cluster Meeting 2018
BPM Cluster Meeting 2018BPM Cluster Meeting 2018
BPM Cluster Meeting 2018
 
Research: Why? What? How?
Research: Why? What? How?Research: Why? What? How?
Research: Why? What? How?
 
BPMS2@BPM2016
BPMS2@BPM2016BPMS2@BPM2016
BPMS2@BPM2016
 
TEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD ContestTEDxGhent 2016 PhD Contest
TEDxGhent 2016 PhD Contest
 
PhD defense November 2015
PhD defense November 2015PhD defense November 2015
PhD defense November 2015
 
PhD pre-defense September 2015
PhD pre-defense September 2015PhD pre-defense September 2015
PhD pre-defense September 2015
 
UGent MIS research seminar June 2015
UGent MIS research seminar June 2015UGent MIS research seminar June 2015
UGent MIS research seminar June 2015
 
UGent MIS research seminar December 2014
UGent MIS research seminar December 2014UGent MIS research seminar December 2014
UGent MIS research seminar December 2014
 
BPM Cluster Meeting 2014
BPM Cluster Meeting 2014BPM Cluster Meeting 2014
BPM Cluster Meeting 2014
 
PhD Day 2014
PhD Day 2014PhD Day 2014
PhD Day 2014
 
Colloquium@TUe
Colloquium@TUeColloquium@TUe
Colloquium@TUe
 
COGNISE@CAiSE2013
COGNISE@CAiSE2013COGNISE@CAiSE2013
COGNISE@CAiSE2013
 
ECIS2013DC
ECIS2013DCECIS2013DC
ECIS2013DC
 
PhD Day 2013
PhD Day 2013PhD Day 2013
PhD Day 2013
 
EIS 2012
EIS 2012EIS 2012
EIS 2012
 
BPM2012
BPM2012BPM2012
BPM2012
 

Recently uploaded

Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
Lviv Startup Club
 
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n PrintAffordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Navpack & Print
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
Adam Smith
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Lviv Startup Club
 
Enterprise Excellence is Inclusive Excellence.pdf
Enterprise Excellence is Inclusive Excellence.pdfEnterprise Excellence is Inclusive Excellence.pdf
Enterprise Excellence is Inclusive Excellence.pdf
KaiNexus
 
Unveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdfUnveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdf
Sam H
 
April 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products NewsletterApril 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products Newsletter
NathanBaughman3
 
The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...
balatucanapplelovely
 
Brand Analysis for an artist named Struan
Brand Analysis for an artist named StruanBrand Analysis for an artist named Struan
Brand Analysis for an artist named Struan
sarahvanessa51503
 
3.0 Project 2_ Developing My Brand Identity Kit.pptx
3.0 Project 2_ Developing My Brand Identity Kit.pptx3.0 Project 2_ Developing My Brand Identity Kit.pptx
3.0 Project 2_ Developing My Brand Identity Kit.pptx
tanyjahb
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
Cynthia Clay
 
Exploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social DreamingExploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social Dreaming
Nicola Wreford-Howard
 
VAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and RequirementsVAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and Requirements
uae taxgpt
 
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdfikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
agatadrynko
 
BeMetals Presentation_May_22_2024 .pdf
BeMetals Presentation_May_22_2024   .pdfBeMetals Presentation_May_22_2024   .pdf
BeMetals Presentation_May_22_2024 .pdf
DerekIwanaka1
 
Sustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & EconomySustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & Economy
Operational Excellence Consulting
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
Ben Wann
 
The-McKinsey-7S-Framework. strategic management
The-McKinsey-7S-Framework. strategic managementThe-McKinsey-7S-Framework. strategic management
The-McKinsey-7S-Framework. strategic management
Bojamma2
 
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdfMeas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
dylandmeas
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
SynapseIndia
 

Recently uploaded (20)

Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
 
Affordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n PrintAffordable Stationery Printing Services in Jaipur | Navpack n Print
Affordable Stationery Printing Services in Jaipur | Navpack n Print
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
 
Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)Maksym Vyshnivetskyi: PMO Quality Management (UA)
Maksym Vyshnivetskyi: PMO Quality Management (UA)
 
Enterprise Excellence is Inclusive Excellence.pdf
Enterprise Excellence is Inclusive Excellence.pdfEnterprise Excellence is Inclusive Excellence.pdf
Enterprise Excellence is Inclusive Excellence.pdf
 
Unveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdfUnveiling the Secrets How Does Generative AI Work.pdf
Unveiling the Secrets How Does Generative AI Work.pdf
 
April 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products NewsletterApril 2024 Nostalgia Products Newsletter
April 2024 Nostalgia Products Newsletter
 
The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...The effects of customers service quality and online reviews on customer loyal...
The effects of customers service quality and online reviews on customer loyal...
 
Brand Analysis for an artist named Struan
Brand Analysis for an artist named StruanBrand Analysis for an artist named Struan
Brand Analysis for an artist named Struan
 
3.0 Project 2_ Developing My Brand Identity Kit.pptx
3.0 Project 2_ Developing My Brand Identity Kit.pptx3.0 Project 2_ Developing My Brand Identity Kit.pptx
3.0 Project 2_ Developing My Brand Identity Kit.pptx
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 
Exploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social DreamingExploring Patterns of Connection with Social Dreaming
Exploring Patterns of Connection with Social Dreaming
 
VAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and RequirementsVAT Registration Outlined In UAE: Benefits and Requirements
VAT Registration Outlined In UAE: Benefits and Requirements
 
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdfikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
 
BeMetals Presentation_May_22_2024 .pdf
BeMetals Presentation_May_22_2024   .pdfBeMetals Presentation_May_22_2024   .pdf
BeMetals Presentation_May_22_2024 .pdf
 
Sustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & EconomySustainability: Balancing the Environment, Equity & Economy
Sustainability: Balancing the Environment, Equity & Economy
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
 
The-McKinsey-7S-Framework. strategic management
The-McKinsey-7S-Framework. strategic managementThe-McKinsey-7S-Framework. strategic management
The-McKinsey-7S-Framework. strategic management
 
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdfMeas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
Meas_Dylan_DMBS_PB1_2024-05XX_Revised.pdf
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
 

ProM 2012

  • 1. FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION Merging Event Logs in ProM Jan Claes Ghent University http://processmining.ugent.be Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 6 February, 2012
  • 2. Merging Event Logs ? Multiple event logs ProM plugin Merged event log Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 2 / 21
  • 3. Merging Event Logs 1. Find links 2. Merge chronologically 3. Add unlinked traces 4. Put in new log file Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 3 / 21
  • 4. Approaches Genetic Algorithm  J. Claes, G. Poels, Integrating Computer Log Files for Process Mining: a Genetic Algorithm Inspired Technique, in CAiSE 2011 Workshops, LNBIP 83, 2011 Artificial Immune System  J. Claes, G. Poels, Merging Computer Log Files for Process Mining: an Artificial Immune System Technique, in BPM 2011 Workshops, LNBIP 99, 2011 Rule Based  J. Claes, G. Poels, Merging Event Logs for Process Mining: A Rule Based Merging Method and Rule Suggestion Algorithm, to be submitted in 2012 Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 4 / 21
  • 5. FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION 1. Genetic Algorithm Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 6 February, 2012
  • 6. 1. Genetic Algorithm SEL cross-over RAND fitness MUT POP POP mutation POP Selection Reproduction Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 6 / 21
  • 7. 1. Genetic Algorithm Fitness function  Sum of weighted factor scores per link • Same trace id (STIi) • Trace order (TOi) if all start events are in the first log • Equal attribute values (EAVi) • Number of linked traces (NLTi) • Time distance (TDi) Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 7 / 21
  • 8. 1. Genetic Algorithm Simplification  Population size one  Only mutations Improvements  More intelligent start population (not random)  More intelligent mutations (improve at least one factor of the fitness function) Attention  Intensification vs. diversification Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 8 / 21
  • 9. FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION 2. Artificial Immune system Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 6 February, 2012
  • 10. 2. Artificial Immune System Immune cells (type B-cell) Antigen Antibodies (receptor) Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 10 / 21
  • 11. 2. Artificial Immune System HIGH HIGH HIGH mutations INIT sorted CLONE MUT EDIT POP POP POP POP POP RAND POP LOW LOW Affinity maturation Initial population Clonal selection Hypermutation Receptor editing SEED LOW Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 11 / 21
  • 12. 2. Artificial Immune System Clonal selection  Clone the fittest x% solutions (I) Hypermutation  Randomly change each clone  The higher the fitness score, the less changes (I) Receptor editing  Take the best y% solutions (I)  Add totally random solutions to the set (D) (I: Intensification, D: Diversification) Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 12 / 21
  • 13. 2. Artificial Immune System Hypermutation  Choose ‘random’ indicator factor to improve • Higher chance to pick factors with positive previous effect  Choose random action • Add link, remove link or alter link  Choose random candidate • From all solutions that would improve with selected action  Choose random improvement • From all possible improvements for selected candidate Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 13 / 21
  • 14. FACULTY OF ECONOMICS AND BUSINESS ADMINISTRATION 3. Rule Based Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 6 February, 2012
  • 15. 3. Rule Based Automatic merging is not transparant (how good is the merging result?) Previous algorithms are (too) slow My experience  in most cases it is about finding an attribute value (literally) in a trace of the other log  you need data experts/analyst to get the right data, they mostly have a good idea about the link between two log files Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 15 / 21
  • 16. 3. Rule Based Semi-automatic solution  Let user configure merging rule based on attribute values • More transparent • Faster • Includes expert knowledge if available  Help user by suggesting merging rules based on the data in the log Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 16 / 21
  • 17. 3. Rule Based Merging rules  Merge all traces where… attribute <select name> from <select container> in the 1st log <select operator> attribute <select name> from <select container> in the 2nd log  E.g. Merge all traces where attribute Trace ID from a trace in the 1st log equals attribute Supplier Reference from event Send goods in the 2nd log Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 17 / 21
  • 18. 3. Rule Based  <select name> • Contains all possible attribute names available in the log  <select container> • From a trace • From any event in a trace • From a trace or any event in a trace • From event X, From event Y, From event Z, …  <select operator> • equals, is not equal, greater than, greater or equal, … • comes before, comes after Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 18 / 21
  • 19. 3. Rule Based Suggesting rules  Look at all attribute values in the log  Make a rule for every equal match in both logs  Count the number of linked traces for every rule  Filter rules with only one link  Sort such that rule that is closer to 1-to-1 match is higher in the list • rules that make more or fewer links are lower in the list • if no 1-to-1 rule exist, the ‘best’ rule is still on top Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 19 / 21
  • 20. 3. Rule Based Some remarks  User can configure rules or select from the suggestion list  Suggestion list is currently limited to equals-rules but is calculated very fast (order n1 + n2 !)  Rules can be combined with And or Or  By explicitly selecting rules, the approach is more transparent  Possible use as shortcut for merging logs from within one system Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 20 / 21
  • 21. Contact information Jan Claes jan.claes@ugent.be http://processmining.ugent.be Twitter: @janclaesbelgium Pav D8.a (until February 10) Faculty of Economics and Business Administration Jan Claes for TUe 2012 Department of Management Information and Operations Management 21 / 21