Metaheuristic Optimization
for Automated Business Process Discovery
Adriano Augusto, Marlon Dumas, Marcello La Rosa
Context
Automated discovery of (business) process models from event logs
Automated Process Discovery Approach (APDA)
a » b » c » g » e » h 10
a » b » c » f » g » h 10
a » b » d » g » e » h 10
a » b » d » e » g » h 10
a » b » e » c » g » h 10
a » b » e » d » g » h 10
a » c » b » e » g » h 10
a » c » b » f » g » h 10
a » d » b » e » g » h 10
a » d » b » f » g » h 10
Process Model Quality
How good is an automatically discovered process model?
[Figure: Event Log → APDA → Process Model; the discovered model is compared with the event log.]
Fitness, Precision (F-score)
Generalization
Simplicity
Soundness
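The F-score named in the list above combines fitness and precision as their harmonic mean. A minimal sketch, assuming both measures lie in [0, 1]; the helper name `f_score` is hypothetical and does not come from the slides:

```python
def f_score(fitness: float, precision: float) -> float:
    """Harmonic mean of fitness and precision; 0.0 when both are zero."""
    if fitness + precision == 0:
        return 0.0
    return 2 * fitness * precision / (fitness + precision)


# Illustrative numbers only, not results from the evaluation.
print(f_score(0.9, 0.6))
```

Because the harmonic mean is dominated by the smaller value, a model cannot score well by maximising only one of the two measures.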
State-of-the-art
Automated Process Discovery Approaches
APDAs based on Directly-Follows Graphs (DFG-based APDAs)
Split Miner
Inductive Miner
Fodina Miner
Heuristics Miner
…
DFG-based APDAs
[Figure: Event Log → DFG-based APDA (e.g. Split Miner) → Process Model. Inside the APDA: Discover DFG → Manipulate DFG (e.g. filtering) → Convert DFG to Model (e.g. BPMN). The event log is the ten-trace example shown under Context.]
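The "Discover DFG" step can be illustrated on the example log. This is a minimal sketch, assuming the trailing 10 on each trace is its frequency and that each activity is a single letter; the function name `discover_dfg` is hypothetical and is not Split Miner's actual API:

```python
from collections import Counter

# The example log from the slides: each trace is assumed to occur 10 times.
LOG = [
    ("abcgeh", 10), ("abcfgh", 10), ("abdgeh", 10), ("abdegh", 10),
    ("abecgh", 10), ("abedgh", 10), ("acbegh", 10), ("acbfgh", 10),
    ("adbegh", 10), ("adbfgh", 10),
]


def discover_dfg(log):
    """Count how often activity x is directly followed by activity y."""
    dfg = Counter()
    for trace, freq in log:
        for x, y in zip(trace, trace[1:]):
            dfg[(x, y)] += freq
    return dfg


dfg = discover_dfg(LOG)
for (x, y), count in sorted(dfg.items()):
    print(f"{x} -> {y}: {count}")
```

Each key of the resulting counter is one edge of the directly-follows graph, and the count is the edge frequency that a filtering step could threshold on.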
DFG-based APDAs
[Figure: the same Event Log → DFG-based APDA (e.g. Split Miner) → Process Model pipeline, now with input params feeding the APDA.]
What is the best input configuration?
[Figure: the same log is passed to the DFG-based APDA (e.g. Split Miner) under Configuration 1, Configuration 2, …, Configuration N. Each run produces Model (i), whose quality is assessed as Model Quality (i); the N quality values are then compared.]
Model (x) is the BEST!
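The try-every-configuration approach amounts to an exhaustive sweep. A minimal sketch under stated assumptions: `discover` and `assess` stand in for an APDA and a quality measure, the toy functions exist only so the code runs, and the parameter names `eta` and `epsilon` are placeholders:

```python
from itertools import product


def sweep(log, discover, assess, grid):
    """Run `discover` once per configuration and keep the best-scoring one."""
    best = None
    for values in product(*grid.values()):
        config = dict(zip(grid.keys(), values))
        model = discover(log, **config)
        score = assess(log, model)
        if best is None or score > best[0]:
            best = (score, config, model)
    return best


# Toy stand-ins (assumptions): a real APDA and quality measure go here.
def toy_discover(log, eta, epsilon):
    return (eta, epsilon)  # the "model" is just the parameter pair


def toy_assess(log, model):
    eta, epsilon = model
    return 1 - abs(eta - 0.4) - abs(epsilon - 0.1)


grid = {"eta": [0.0, 0.2, 0.4, 0.6], "epsilon": [0.0, 0.1, 0.2]}
score, config, model = sweep([], toy_discover, toy_assess, grid)
print(config, "after", len(grid["eta"]) * len(grid["epsilon"]), "runs")
```

The number of discovery runs is the product of the grid sizes, which is why the sweep becomes expensive as parameters or granularity grow.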
How to be more efficient?
Optimization Metaheuristics
Population Based
Evolutionary computation
Ant colony
Bee colony
Particle swarm
…
Single-solution Based
Repeated local search
Iterated local search
Tabu search
Simulated annealing
…
Adapting the Metaheuristics to our Context
Repeated Local Search (RLS)
Iterated Local Search (ILS)
Tabu Search (TS)
Simulated Annealing (SA)
1. Solution Space
2. Solution Neighbourhood
3. Objective Function
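The three ingredients listed above are all a single-solution metaheuristic needs. The sketch below is a plain greedy local search, the common core that RLS, ILS, TS and SA each extend in their own way; it is not any of those four. The function name and the integer toy problem are assumptions for illustration:

```python
def local_search(start, neighbours, objective, max_steps=100):
    """Greedy local search: move to the best neighbour while it improves."""
    current, current_score = start, objective(start)
    for _ in range(max_steps):
        candidates = [(objective(n), n) for n in neighbours(current)]
        if not candidates:
            break
        best_score, best = max(candidates, key=lambda c: c[0])
        if best_score <= current_score:
            break  # local optimum reached
        current, current_score = best, best_score
    return current, current_score


# Toy instance: integers as the solution space, +/-1 as the neighbourhood.
solution, score = local_search(
    start=0,
    neighbours=lambda x: [x - 1, x + 1],
    objective=lambda x: -(x - 7) ** 2,
)
print(solution, score)
```

Swapping in a different solution space only requires new `neighbours` and `objective` callables; the search loop itself stays unchanged.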
[Figure: the configuration comparison from the earlier slide (Configurations 1 to N, each producing a model whose quality is assessed and compared, with Model (x) selected as the best), labelled with Solution Space and Objective Function.]
Neighbours?
Optimizing a DFG-based APDA
[Figure: Event Log → Discover DFG → Manipulate DFG (e.g. filtering) → Convert DFG to Model (e.g. BPMN), with input params.]
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with an Assess Quality step.]
Assess Quality
• Fitness, precision, generalization, or simplicity?
• What measure to use?
Fitness and precision → F-score
Alignment, anti-alignment, PCC, entropy, Markovian accuracy
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with Assess Quality and Explore Neighbour DFGs steps.]
Explore Neighbour DFGs
• Given a DFG, its closest neighbours are the DFGs that have exactly one more or one fewer edge.
• Adding edges adds behaviour (increasing the fitness of the model).
• Removing edges removes behaviour (increasing the precision of the model).
[Figure: a DFG and its neighbour DFGs, with a Model Quality label.]
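The one-edge neighbourhood can be enumerated directly. A minimal sketch, assuming a DFG is represented as a set of (source, target) pairs; the function name is hypothetical, and the sketch ignores self-loops and any connectivity constraints a real APDA may impose on the DFG:

```python
from itertools import permutations


def neighbour_dfgs(edges, activities):
    """Yield every DFG that differs from `edges` by exactly one edge."""
    edges = frozenset(edges)
    # One edge fewer: drop each existing edge in turn.
    for e in edges:
        yield edges - {e}
    # One edge more: add each missing ordered pair of distinct activities.
    for e in permutations(activities, 2):
        if e not in edges:
            yield edges | {e}


dfg = {("a", "b"), ("b", "c")}
neighbours = list(neighbour_dfgs(dfg, "abc"))
print(len(neighbours), "neighbours")
```

With n activities and no self-loops there are n(n-1) possible edges, so every DFG has exactly n(n-1) neighbours under this definition: one per edge that can be toggled.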
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with Assess Quality, Explore Neighbour DFGs, and Convert DFGs to Models steps.]
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with Assess Quality, Explore Neighbour DFGs, Convert DFGs to Models, and a second Assess Quality step.]
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with Assess Quality, Explore Neighbour DFGs, Convert DFGs to Models, a second Assess Quality step, and Select Best DFG Candidate.]
Optimizing a DFG-based APDA
[Figure: the DFG-based pipeline (Discover DFG, Manipulate DFG, Convert DFG to Model) with input params, extended with Assess Quality, Check Termination Condition, Explore Neighbour DFGs, Convert DFGs to Models, a second Assess Quality step, and Select Best DFG Candidate.]
Termination conditions: timeout, number of iterations, objective function threshold.
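The three termination conditions can be combined into one check that stops as soon as any of them holds. A minimal sketch; the class name, field names, and default values are assumptions, not settings taken from the slides:

```python
import time
from dataclasses import dataclass, field


@dataclass
class Termination:
    """Stop on timeout, iteration budget, or objective threshold."""
    timeout_s: float = 60.0      # hypothetical default
    max_iterations: int = 50     # hypothetical default
    threshold: float = 1.0       # hypothetical default
    started: float = field(default_factory=time.monotonic)

    def fulfilled(self, iteration: int, best_score: float) -> bool:
        return (
            time.monotonic() - self.started >= self.timeout_s
            or iteration >= self.max_iterations
            or best_score >= self.threshold
        )


stop = Termination(timeout_s=5.0, max_iterations=3, threshold=0.95)
print(stop.fulfilled(iteration=1, best_score=0.80))
print(stop.fulfilled(iteration=3, best_score=0.80))
```

Using a monotonic clock keeps the timeout unaffected by system clock adjustments during a long optimization run.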
Optimizing a DFG-based APDA
[Figure: the complete optimization loop. Boxes shown: Event Log, Discover DFG, Manipulate DFG (e.g. filtering), Assess Quality, Check Termination Condition (fulfilled / not fulfilled), Explore Neighbour DFGs, Convert DFGs to Models, Assess Quality, Select Best DFG Candidate, Convert DFG to Model (e.g. BPMN), and Process Model, with input params. The iterative part is labelled Optimization Metaheuristic.]
Optimization Framework
[Figure: APDA – Metaheuristic Interface. Inputs: Event Log and Input Settings (Optimization metaheuristics ID, APDA ID, Objective Function ID). Connected components: Objective Functions, Optimization Metaheuristics, DFG-based APDAs. Output: Process Model.]
Optimization Framework Instantiation
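[Figure: APDA – Metaheuristic Interface, instantiated. Inputs: Event Log and Input Settings (Optimization metaheuristics ID, APDA ID, Objective Function ID). Objective function: Markovian F-score. Optimization metaheuristics: RLS, ILS, TS, SA. DFG-based APDA: Split Miner. Output: Process Model.]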
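One way to picture the input settings is as three identifiers checked against the components plugged into the framework. This is a sketch under stated assumptions: the registry contents mirror the instantiation above, but every identifier string, class name, and method name is hypothetical rather than the framework's real interface:

```python
from dataclasses import dataclass

# Hypothetical registries; the ID strings are placeholders.
METAHEURISTICS = {"RLS", "ILS", "TS", "SA"}
APDAS = {"SplitMiner"}
OBJECTIVES = {"MarkovianFscore"}


@dataclass(frozen=True)
class InputSettings:
    metaheuristic_id: str
    apda_id: str
    objective_id: str

    def validate(self) -> None:
        """Raise ValueError if any ID is not registered."""
        for value, known in (
            (self.metaheuristic_id, METAHEURISTICS),
            (self.apda_id, APDAS),
            (self.objective_id, OBJECTIVES),
        ):
            if value not in known:
                raise ValueError(f"unknown ID: {value!r}")


settings = InputSettings("SA", "SplitMiner", "MarkovianFscore")
settings.validate()
print(settings)
```

Keeping the three choices as independent identifiers is what lets a new APDA or objective function be added without touching the metaheuristics.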
Evaluation Setup
— 20 real-life event logs (10 BPIC logs, RTFMP, SEPSIS case, and 8 private logs)
— 3 baselines without hyper-parameter optimization:
Inductive Miner (IM), Evolutionary Tree Miner (ETM), Split Miner (SM)
— 1 baseline with hyper-parameter optimization: Split Miner (HPO)
— Markovian accuracy, Alignment accuracy, simplicity, and time performance
Markovian F-score
Event Logs
Limitations
— Slower than baselines with default input params (Inductive Miner, Split Miner)
— More complex models when optimizing fitness
— Not applicable to every APDA, only to DFG-based APDAs
Thanks for attending!
Questions?
Future Work
— Add more DFG-based APDAs to our framework
(Fodina Miner and Inductive Miner)
— Explore alternative quality measures to drive the optimization metaheuristics
— Combine accuracy and simplicity measures
Results – Markovian F-score
Results – Alignment F-score

 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 

Metaheuristic Optimization for Automated Business Process Discovery

  • 1. Metaheuristic Optimization for Automated Business Process Discovery. Adriano Augusto, Marlon Dumas, Marcello La Rosa
  • 2. Context: automated discovery of (business) process models from event logs with an Automated Process Discovery Approach (APDA). Example event log: a » b » c » g » e » h (10); a » b » c » f » g » h (10); a » b » d » g » e » h (10); a » b » d » e » g » h (10); a » b » e » c » g » h (10); a » b » e » d » g » h (10); a » c » b » e » g » h (10); a » c » b » f » g » h (10); a » d » b » e » g » h (10); a » d » b » f » g » h (10)
  • 3. Process Model Quality: How good is an automatically discovered process model? The process model produced by the APDA is compared with the event log. Quality dimensions: fitness and precision (F-score), generalization, simplicity, soundness.
  • 4. State-of-the-art: automated process discovery approaches. APDAs based on Directly-Follows Graphs (DFG-based APDAs): Split Miner, Inductive Miner, Fodina Miner, Heuristics Miner, …
  • 5. DFG-based APDAs: a DFG-based APDA (e.g. Split Miner) takes an event log (the example log of slide 2) and produces a process model in three steps: Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN).
  • 6. DFG-based APDAs: a DFG-based APDA (e.g. Split Miner) takes an event log (the example log of slide 2) together with input params, and produces a process model in three steps: Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN).
  • 7. What is the best input configuration? Run the DFG-based APDA (e.g. Split Miner) on the log with Configuration 1, Configuration 2, …, Configuration N to obtain Model (1), Model (2), …, Model (N). Assess the quality of each model to get Model Quality (1), Model Quality (2), …, Model Quality (N), then compare.
  • 8. What is the best input configuration? Run the DFG-based APDA (e.g. Split Miner) on the log with Configuration 1, Configuration 2, …, Configuration N to obtain Model (1), Model (2), …, Model (N). Assess the quality of each model to get Model Quality (1), Model Quality (2), …, Model Quality (N), then compare. Model (x) is the BEST!
  • 9. How to be more efficient? Optimization metaheuristics. Population-based: evolutionary computation, ant colony, bee colony, swarm particles, … Single-solution-based: repetitive local search, iterative local search, tabu search, simulated annealing, …
  • 10. Adapting the Metaheuristics to our Context: Repetitive Local Search (RLS), Iterative Local Search (ILS), Tabu Search (TS), Simulated Annealing (SA). Elements to define: 1. Solution Space; 2. Solution Neighbourhood; 3. Objective Function.
  • 11. Adapting the Metaheuristics to our Context: the configuration-comparison setup of slide 8 (Configurations 1 … N, Models (1) … (N), Assess Quality, Compare, Model (x) is the BEST!) annotated with the labels Solution Space, Objective Function, and Neighbours?
  • 12. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params.
  • 13. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality.
  • 14. Assess Quality: Fitness, precision, generalization, or simplicity? What measure to use? Chosen dimensions: fitness and precision, combined as F-score. Candidate measures: alignment, anti-alignment, PCC, entropy, Markovian accuracy.
  • 15. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality.
  • 16. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Explore Neighbour DFGs; Assess Quality.
  • 17. Explore Neighbour DFGs: Given a DFG, its closest neighbours are the DFGs that have one more or one fewer edge. Adding edges adds behaviour (increasing the fitness of the model). Removing edges removes behaviour (increasing the precision of the model). Diagram: a DFG surrounded by neighbour DFGs, each with its model quality.
  • 18. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality; Explore Neighbour DFGs; Convert DFGs to Models.
  • 19. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality; Explore Neighbour DFGs; Convert DFGs to Models; Assess Quality.
  • 20. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality; Explore Neighbour DFGs; Convert DFGs to Models; Assess Quality; Select Best DFG Candidate.
  • 21. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality; Explore Neighbour DFGs; Convert DFGs to Models; Assess Quality; Select Best DFG Candidate; Check Termination Condition. Termination conditions: timeout, number of iterations, objective function threshold.
  • 22. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality. Optimization Metaheuristic: Explore Neighbour DFGs; Convert DFGs to Models; Assess Quality; Select Best DFG Candidate; Check Termination Condition (fulfilled / not fulfilled). Output: Process Model.
  • 23. Optimization Framework: APDA – Metaheuristic Interface. Inputs: Event Log, Input Settings, Optimization Metaheuristic ID, APDA ID, Objective Function ID. Components: Optimization Metaheuristics, DFG-based APDAs, Objective Functions. Output: Process Model.
  • 24. Optimization Framework Instantiation: APDA – Metaheuristic Interface. Inputs: Event Log, Input Settings, Optimization Metaheuristic ID, APDA ID, Objective Function ID. Components: RLS, ILS, TS, SA (optimization metaheuristics); Split Miner (DFG-based APDA); Markovian F-score (objective function). Output: Process Model.
  • 25. Evaluation Setup: 20 real-life event logs (10 BPIC logs, RTFMP, SEPSIS case, and 8 private logs); 3 baselines without hyper-parameter optimization: Inductive Miner (IM), Evolutionary Tree Miner (ETM), Split Miner (SM); 1 baseline with hyper-parameter optimization: Split Miner (HPO); measures: Markovian accuracy, alignment accuracy, simplicity, and time performance.
  • 27. Limitations: slower than baselines run with default input params (Inductive Miner, Split Miner); more complex models when optimizing fitness; not applicable to every APDA, only to DFG-based APDAs.
  • 29. Future Work: add more DFG-based APDAs to our framework (Fodina Miner and Inductive Miner); explore alternative quality measures to drive the optimization metaheuristics; combine accuracy and simplicity measures.
  • 31. Results – Alignment F-score
  • 32. Optimizing a DFG-based APDA: Event Log; Discover DFG; Manipulate DFG (e.g. filtering); Convert DFG to Model (e.g. BPMN); input params; Assess Quality. Optimization Metaheuristic: Explore Neighbour DFGs; Convert DFGs to Models; Assess Quality; Select Best DFG Candidate; Check Termination Condition (fulfilled / not fulfilled). Output: Process Model.
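
The loop described in the slides (discover a DFG, explore neighbour DFGs that differ by one edge, assess quality, select the best candidate, check the termination condition) can be illustrated with a small Python sketch. It is not the authors' implementation, and several things in it are assumptions made for illustration only: the trailing "10" in the example log is read as a trace frequency, the DFG itself stands in for the converted process model, `toy_quality` is a crude proxy rather than the Markovian F-score, and the search is a plain greedy local search rather than RLS, ILS, TS, or SA. All function names are hypothetical.

```python
import time
from collections import Counter
from itertools import permutations

# Example log from the slides; ASSUMPTION: "10" is each trace's frequency.
LOG = {variant: 10 for variant in (
    "abcgeh", "abcfgh", "abdgeh", "abdegh", "abecgh",
    "abedgh", "acbegh", "acbfgh", "adbegh", "adbfgh",
)}

def discover_dfg(log):
    """Count how often one activity is directly followed by another."""
    dfg = Counter()
    for trace, freq in log.items():
        for pair in zip(trace, trace[1:]):
            dfg[pair] += freq
    return dfg

def f_score(fitness, precision):
    """Harmonic mean of fitness and precision."""
    total = fitness + precision
    return 2 * fitness * precision / total if total else 0.0

def toy_quality(edges, dfg):
    """ASSUMPTION: toy objective, not a real accuracy measure.
    Fitness = share of observed directly-follows occurrences covered by edges;
    precision = share of edges that were actually observed in the log."""
    if not edges:
        return 0.0
    fitness = sum(dfg[e] for e in edges) / sum(dfg.values())
    precision = sum(1 for e in edges if e in dfg) / len(edges)
    return f_score(fitness, precision)

def neighbours(edges, candidates):
    """Edge sets with exactly one edge added or removed."""
    return [edges ^ {e} for e in candidates]

def local_search(dfg, start, max_iterations=100, target=1.0, timeout_s=5.0):
    """Greedy local search with the three termination conditions of slide 21."""
    activities = sorted({a for pair in dfg for a in pair})
    candidates = list(permutations(activities, 2))  # every possible edge
    best, best_score = frozenset(start), toy_quality(start, dfg)
    deadline = time.monotonic() + timeout_s
    for _ in range(max_iterations):                      # number of iterations
        if best_score >= target:                         # objective threshold
            break
        if time.monotonic() > deadline:                  # timeout
            break
        challenger = max(neighbours(best, candidates),
                         key=lambda n: toy_quality(n, dfg))
        score = toy_quality(challenger, dfg)
        if score <= best_score:
            break  # local optimum: no neighbour improves the objective
        best, best_score = challenger, score
    return best, best_score

dfg = discover_dfg(LOG)
# "Manipulate DFG (e.g. filtering)": drop edges seen fewer than 20 times.
start = frozenset(edge for edge, count in dfg.items() if count >= 20)
best, score = local_search(dfg, start)
```

On this toy log the filter removes four infrequent edges, and the search adds them back one at a time until the proxy score reaches 1.0. With a real objective function the trade-off is less trivial, since added edges can raise fitness while lowering precision.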