SlideShare a Scribd company logo
Marlon Dumas
Professor @ University of Tartu
Co-founder @ Apromore
With contributions by Manuel Camargo and Oscar González-Rojas
Baltic DB&IS’2022, 5 July 2022
Obtain
PO
confirm.
Prepare
shipment
Schedule
delivery
Issue
invoice
Check
Invoice
Load
truck
Check &
confirm
PO
Notify
shipment
Unload
truck
Package
products
Issue
delivery
receipt
Request
PO change
Match
incoming
payment
Schedule
payment
2
Financial
Human
Resources
Technology
Organisation
Function A
(Sales)
Function B
(Manufacturing)
Function C
(Finance)
Assets &
Partners
Customers
Data
Business Process
Business Process
Business Process
Business processes
3
Materials
Obtain
PO
confirm.
Schedule
delivery
Unload
truck
Issue
delivery
receipt
Check
invoice
Schedule
payment
Check &
confirm
PO
Package
products
Load
truck
Notify
shipment
Issue
invoice
Match
payment
Payment
made
PO
received
PO
issued
Goods
arrived
4
5
Dispatch
order
Prepare
route -
perishable
goods
X
Prepare
regular
route
X
Shipment
method
Perishables
Hybrid
Verify
Order
Business Process
 Reallocate resources
 Automate tasks
 Parallelize activities
 Modify the sequence flow
 Increase de process demand
Process Managers(s)
Business Analyst(s)
“What-If” Business Process Analysis
6
Process Credit Card
Accept Cash or Check
Identify payment method
18 / 8 min
20 / 5 min
5 / 2 min
Prepare package for customer
10 / 5 min
Cycle time
Processing / Waiting times
Costs x activity x resource …
Resource utilization
1h
5 min
38 min
10 min
The Traditional Answer: Business Process Simulation
8
9
Exp(20m)
Normal(20m, 4m)
Normal(10m, 2m)
Normal(10m, 2m)
Normal(10m, 2m)
0m
0
Arrival rate = 2 applications per hour
Inter-arrival time = 0.5 hour
Negative exponential distribution
From Monday-Friday, 9am-5pm
0.3
0.7
0.3
Clerk
Officer
System
Clerk
Officer
Officer
Business Process Simulation: Assumptions
The process model is authoritative (always followed to the letter)
• No deviations
• No workarounds
The simulation parameters accurately reflect reality
• …whereas in reality, they are often guesstimates
A resource only works on one task instance at a time / a task is performed by one resource
• No multi-tasking / no multi-resource tasks (teamwork)
Resources have robotic behavior (eager resources consume work items in FIFO mode)
• No batching
• No tiredness effects, no interruptions, no distractions beyond “stochastic” ones
Undifferentiated resources
• Every resource in a pool has the same performance as others
No time-sharing outside the simulated process
• Resources fully dedicated to one process 12
End Result
Business process simulations based
on incomplete models,
guesstimates, and simplifying
assumptions are not faithful
 adoption of business process
simulation is disappointing
3
Event Log
Given
• one or more business processes, for which we
have:
• one or more process specifications and/or
• event logs generated by the execution of the
processes on top of one or more information
systems.
• one or more process performance measures of
interest (e.g. cycle time, resource cost)
• One or more changes to the process (interventions)
Predict
• Predict the values of the process performance
measures after the given interventions.
Non-Functional Requirements
16
Predictions accurate.
Accuracy may be measured e.g. via an error
between the predicted and the actual
performance measures after intervention.
Predictions should be accompanied by a
reliability estimate. In most cases, the
reliability is high.
Reliability could be captured, e.g. by
confidence intervals
Event Log
18
{T1 -> T2 -> T3}
{T1 -> T3 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T3 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
Event log
Process Model
Discovery
Model-to-trace
alignment & repair
Control Flow Discovery
Model
enhancement
Simulation parameters
extraction
BPS model assembly
Vs.
Generated
Ground truth
Accuracy assessment
Tuning
Hyperparameter
optimizer
Simulator
Simulated
Log
https://github.com/AutomatedProcessImprovement/Simod
19
Branching
probabilities
definition
Interarrival dist.
Resource pool
discovery
Task assignment
Task 1 Task 2
Task 3
Activity process
times
X
Task 4
X
Availabiity timetables
Multi-tasking behavior
Batching and prioritization
Phase Category Variable
Control flow discovery
Parallelism threshold (ε)
Filtering threshold (η)
Parameters for log repair
Simulation parameters
discovery
Thresholds for resource pool
discovery
Parameters for fitting temporal
distributions
Optimal alignment of complete bipartite graph Test Log x Simulation Log
weighted by Damerau-Levenshtein (DL) distance, with penalty for temporal mismatch
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
Test Fold of
Event-Log
Simulated Log
2
1
A1 A3 A3 A5 A6 A7
A8
A2 A3 A4 A5 A6 A7 A8
Delete(A1)
Substitute(A2)
*
Insert(A4)
Transpose
(A8↔A7)
Ꝺ1:
Ꝺ2:
Damerau-levenshtein distance (DL)
Control-Flow Similarity (CLFS)
A1 A3 A3 A5 A6 A7
A8
Delete(A1)
Substitute(A2)
*
Insert(A4)
Transpose
(A8↔A7)
Ꝺ1:
A2 A3 A4 A5 A6 A7 A8
Ꝺ2:
Business Process Trace Distance (BPTD)
Error in waiting and
processing time
No penalization if
parallel activities
Event Log Similarity (ELS)
Dataset Control-Flow
Similarity
(string-edit distance)
Temporal
Similarity
(timed-string edit distance)
Call centre 0.37 0.41
Pharmacy customer service 0.29 0.30
Purchase-to-Pay 0.55 0.57
Make-to-order manufacturing 0.65 0.69
Academic credentials recognition 0.32 0.29
Insurance claims handling 0.39 0.43
Loan Origination 0.41 0.42
Discover simulation
model
Simulate model
10 times
Evaluate Similarity
(mean string-edit
distance & timed-
string edit distance)
This Photo by Unknown Author is
licensed under CC BY-NC
We can try to fill
in the glass
• Discover and add batching behavior to simulation models
• .. prioritization
• … timers and external factors (not explicit in the data)
• etc.
Or perhaps we should look for another paradigm….
23
24
How is going to continue this case until it is finished?
How long is this case still going to take until it is finished?
What is the next activity for this case?
When is this next activity going to take place?
…
Generate a set of traces (event log)
E1 E2 E3
Running case
2
5
{T1 -> T2 -> T3}
{T1 -> T3 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T2 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
{T1 -> T3 -> T2}
{T1 -> T2 -> T3}
{T1 -> T2 -> T3}
Event log
Pre-processing
• Timestamp
relativization
• Scaling
Continuous features
• Embedded dimensions
• N-grams extraction
• Discovery of roles
Discrete features
Generation &
Assessment
Selection method
• Arg. Max
• Random choice
Accuracy assessment
Model selection &
Training
Architecture selection
Training with
examples of size 0
• Specialized
• Shared
• Concatenated
Partition 2
(20%)
Testing
Partition
1
(80%)
Training
Testing
Deep Learning
Trainer
BEST DL MODEL
Trace generator
Time
splitting
Evaluator
DDS
Parameter extraction
BEST SIM MODEL
Simulator
Training (80%)
Validation
(20%)
Event-log
1
2
3
• DDS Models (SIMOD) and DL
models have comparable
performance w.r.t. control-flow
similarity (CLFS)
• DL models sometimes clearly
outperform DDS models on
temporal metrics (MAE, ELS)
Could we combine them?
- Assumes undifferentiated resources with robotic
behavior
- Branches are selected using branching probabilities
Generative Deep Learning Methods
- Does not explicitly take into account resource
constraints
- Models resource availability via neural networks that
may capture non-linear availability functions
- Learns the case arrival process from data (univariate
or multivariate models)
- May take as input a process specification (helps with
interpretability)
- Models the case creation process via a probability
distribution
- Takes into account resource constraints
- No interpretable process specification
- Branching behavior modeled via neural networks (e.g.
LSTM) that may capture complex relations
- Models resource availability as calendars (possibly
discovered from historical data)
- May capture differentiated resources and robotic
behavior
Data-Driven (Discrete Event) Simulation
- Provides a natural mechanism for capturing the effect
of changes to the process
- Does not have a mechanism for capturing the effect
of changes to the process
Phase 1
A1 A2 A3 A5 A6
A4
Ꝺ1:
A2 A3 A4 A5
Ꝺ2:
Phase 2
A1 A2 A3 A5 A6
A4
A2 A3 A4 A5
Phase 3
e1- start e1- complete
e2- start
𝜎1 Ac1
Ac2
e2- complete
Waiting time predictive model
Features: Wait+Ac2+Cx+WIP+RO
Processing time predictive model
Features: Proc+Ac1+Cx+WIP+RO
A1 A2 A3 A5 A6
A4
A2 A3 A4 A5
Discovering a process model to
generate traces
Learning a time series generator to
determine when each trace starts
Deep-learning the processing time and
waiting time of each activity in a given trace
32
Deep Simulation generally outperforms classical DDS in temporal measures
34
Batch 1 Batch 2 Batch 3 Batch 4 Batch 5 Batch 6
• DeepSimulator can better estimate the impact of changes in the demand in settings where such
changes have been previously observed in the data.
36
• The accuracy of DeepSimulator degrades when evaluated in a previously unobserved scenario (new
task is added to the process)
SIMOD DSIM SIMOD DSIM SIMOD DSIM
Version 1
BPI17W 971151 417572 0.02222 0.03593 3185 3647
BPI12W 660211 534341 0.11295 0.04853 515 458
CVS 1489252 467572 0.03213 0.00001 3380 849
Version 2
BPI17W 895524 290980 0.06438 0.03218 4528 3431
BPT12W 550266 524995 0.25888 0.22003 726 507
CVS 540112 246159 0.15674 0.05708 2453 1967
AS-IS WHAT-IF AS-IS WHAT-IF AS-IS WHAT-IF
CFM 7155 17546 22006 33137 0.15629 0.28762
CVS 283061 1040344 357717 1052255 0.31972 1.84601
Log
MAE RMSE SMAPE
Scenario
1
Scenario
2
Log
MAE EMD DTW
This Photo by Unknown Author
is licensed under CC BY-NC-ND
Wrap-Up
• There’s a long road ahead to constructing accurate and reliable
simulation models from event logs
• Combination of deep learning techniques & simulation promising, but
need to be further researched to become practically usable for what-
if analysis
• Extensions needed to support a wide range of interventions / changes
• Extensions needed to provide reliability estimates (for what-if analysis)
• More validation in large-scale scenarios
37
References
Limitations and pitfalls of traditional BP simulation
• van der Aalst: Business Process Simulation Survival Guide. In Handbook on Business
Process Management Vol. 1, 2015, 337-370
Data-Driven Simulation (discovering simulation models from logs)
• Rozinat et al. Discovering simulation models. Inf. Syst. 34(3): 305-327 (2009)
• Martin et al. The Use of Process Mining in Business Process Simulation Model Construction
- Structuring the Field. Bus. Inf. Syst. Eng. 58(1): 73-87
• Camargo et al. Automated discovery of business process simulation models from event
logs. Decis. Support Syst. 134:113284, 2020 https://arxiv.org/abs/2009.03567
• Pourbafrani et al. Extracting Process Features from Event Logs to Learn Coarse-Grained
Simulation Models. CAiSE 2021: 125-140
Data-Driven Simulation and Deep Learning
• Camargo et al. Discovering Generative Models from Event Logs: Data-driven Simulation vs
Deep Learning, PeerJ Computer Science, 7: e577, 2021 https://peerj.com/articles/cs-577/
• Camargo et al. Learning Accurate Business Process Simulation Models from Event Logs via
Automated Process Discovery and Deep Learning. CAiSE’2022
https://arxiv.org/abs/2103.11944

More Related Content

Similar to Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?

Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Marlon Dumas
 
SAP consulting results
SAP consulting resultsSAP consulting results
SAP consulting results
Konstantin Berger
 
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
cookie1969
 
Business Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to PredictionsBusiness Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to Predictions
Marlon Dumas
 
EO notes Lecture 27 Project Management 2.ppt
EO notes Lecture 27 Project Management 2.pptEO notes Lecture 27 Project Management 2.ppt
EO notes Lecture 27 Project Management 2.ppt
yashchotaliyael21
 
Full accesspolicyconsolidation for event processing systems
Full accesspolicyconsolidation for event processing systemsFull accesspolicyconsolidation for event processing systems
Full accesspolicyconsolidation for event processing systems
viswanadhamsatish
 
Introduction to Business Process Analysis and Redesign
Introduction to Business Process Analysis and RedesignIntroduction to Business Process Analysis and Redesign
Introduction to Business Process Analysis and Redesign
Marlon Dumas
 
Metrics-Based Process Mapping
Metrics-Based Process MappingMetrics-Based Process Mapping
Metrics-Based Process Mapping
TKMG, Inc.
 
lecture1.ppt
lecture1.pptlecture1.ppt
lecture1.ppt
SagarDR5
 
C++ Notes PPT.ppt
C++ Notes PPT.pptC++ Notes PPT.ppt
C++ Notes PPT.ppt
Alpha474815
 
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
Enzo M. Tieghi
 
Stream Processing Overview
Stream Processing OverviewStream Processing Overview
Stream Processing Overview
Maycon Viana Bordin
 
Unbiased, Fine-Grained Description of Processes Performance from Event Data
Unbiased, Fine-Grained Description of Processes Performance from Event DataUnbiased, Fine-Grained Description of Processes Performance from Event Data
Unbiased, Fine-Grained Description of Processes Performance from Event Data
Vadim Denisov
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
NECST Lab @ Politecnico di Milano
 
Control phase lean six sigma tollgate template
Control phase   lean six sigma tollgate templateControl phase   lean six sigma tollgate template
Control phase lean six sigma tollgate templateSteven Bonacorsi
 
Control phase lean six sigma tollgate template
Control phase   lean six sigma tollgate templateControl phase   lean six sigma tollgate template
Control phase lean six sigma tollgate templateSteven Bonacorsi
 
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Marlon Dumas
 
Copyright © 2014 McGraw-Hill Higher Education. All rights .docx
Copyright © 2014 McGraw-Hill Higher Education. All rights .docxCopyright © 2014 McGraw-Hill Higher Education. All rights .docx
Copyright © 2014 McGraw-Hill Higher Education. All rights .docx
vanesaburnand
 

Similar to Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable? (20)

Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...Learning Accurate Business Process Simulation Models from Event Logs via Auto...
Learning Accurate Business Process Simulation Models from Event Logs via Auto...
 
SAP consulting results
SAP consulting resultsSAP consulting results
SAP consulting results
 
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
 
Business Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to PredictionsBusiness Process Analytics: From Insights to Predictions
Business Process Analytics: From Insights to Predictions
 
EO notes Lecture 27 Project Management 2.ppt
EO notes Lecture 27 Project Management 2.pptEO notes Lecture 27 Project Management 2.ppt
EO notes Lecture 27 Project Management 2.ppt
 
Full accesspolicyconsolidation for event processing systems
Full accesspolicyconsolidation for event processing systemsFull accesspolicyconsolidation for event processing systems
Full accesspolicyconsolidation for event processing systems
 
BIRTE-13-Kawashima
BIRTE-13-KawashimaBIRTE-13-Kawashima
BIRTE-13-Kawashima
 
Introduction to Business Process Analysis and Redesign
Introduction to Business Process Analysis and RedesignIntroduction to Business Process Analysis and Redesign
Introduction to Business Process Analysis and Redesign
 
Metrics-Based Process Mapping
Metrics-Based Process MappingMetrics-Based Process Mapping
Metrics-Based Process Mapping
 
lecture1.ppt
lecture1.pptlecture1.ppt
lecture1.ppt
 
C++ Notes PPT.ppt
C++ Notes PPT.pptC++ Notes PPT.ppt
C++ Notes PPT.ppt
 
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
Tieghi Anipla 20 04 2010 Come Possiamo Essere Sicuri Che Tutti Seguano Le Pro...
 
Final Report Defense 021509
Final Report Defense 021509Final Report Defense 021509
Final Report Defense 021509
 
Stream Processing Overview
Stream Processing OverviewStream Processing Overview
Stream Processing Overview
 
Unbiased, Fine-Grained Description of Processes Performance from Event Data
Unbiased, Fine-Grained Description of Processes Performance from Event DataUnbiased, Fine-Grained Description of Processes Performance from Event Data
Unbiased, Fine-Grained Description of Processes Performance from Event Data
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
 
Control phase lean six sigma tollgate template
Control phase   lean six sigma tollgate templateControl phase   lean six sigma tollgate template
Control phase lean six sigma tollgate template
 
Control phase lean six sigma tollgate template
Control phase   lean six sigma tollgate templateControl phase   lean six sigma tollgate template
Control phase lean six sigma tollgate template
 
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
Mine Your Simulation Model: Automated Discovery of Business Process Simulatio...
 
Copyright © 2014 McGraw-Hill Higher Education. All rights .docx
Copyright © 2014 McGraw-Hill Higher Education. All rights .docxCopyright © 2014 McGraw-Hill Higher Education. All rights .docx
Copyright © 2014 McGraw-Hill Higher Education. All rights .docx
 

More from Marlon Dumas

How GenAI will (not) change your business?
How GenAI will (not)  change your business?How GenAI will (not)  change your business?
How GenAI will (not) change your business?
Marlon Dumas
 
Walking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process OptimizationWalking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process Optimization
Marlon Dumas
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Marlon Dumas
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Marlon Dumas
 
Business Process Optimization: Status and Perspectives
Business Process Optimization: Status and PerspectivesBusiness Process Optimization: Status and Perspectives
Business Process Optimization: Status and Perspectives
Marlon Dumas
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Marlon Dumas
 
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business ProcessesWhy am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Marlon Dumas
 
Augmented Business Process Management
Augmented Business Process ManagementAugmented Business Process Management
Augmented Business Process Management
Marlon Dumas
 
Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
Marlon Dumas
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
Marlon Dumas
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Marlon Dumas
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Marlon Dumas
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
Marlon Dumas
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
Marlon Dumas
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
Marlon Dumas
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Marlon Dumas
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
Marlon Dumas
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
Marlon Dumas
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Marlon Dumas
 
On the Road to AI-Infused Process Execution
On the Road to AI-Infused Process ExecutionOn the Road to AI-Infused Process Execution
On the Road to AI-Infused Process Execution
Marlon Dumas
 

More from Marlon Dumas (20)

How GenAI will (not) change your business?
How GenAI will (not)  change your business?How GenAI will (not)  change your business?
How GenAI will (not) change your business?
 
Walking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process OptimizationWalking the Way from Process Mining to AI-Driven Process Optimization
Walking the Way from Process Mining to AI-Driven Process Optimization
 
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...Discovery and Simulation of Business Processes with Probabilistic Resource Av...
Discovery and Simulation of Business Processes with Probabilistic Resource Av...
 
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
Can I Trust My Simulation Model? Measuring the Quality of Business Process Si...
 
Business Process Optimization: Status and Perspectives
Business Process Optimization: Status and PerspectivesBusiness Process Optimization: Status and Perspectives
Business Process Optimization: Status and Perspectives
 
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
Learning When to Treat Business Processes: Prescriptive Process Monitoring wi...
 
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business ProcessesWhy am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
Why am I Waiting Data-Driven Analysis of Waiting Times in Business Processes
 
Augmented Business Process Management
Augmented Business Process ManagementAugmented Business Process Management
Augmented Business Process Management
 
Process Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process SimulationProcess Mining and Data-Driven Process Simulation
Process Mining and Data-Driven Process Simulation
 
Modeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process SimulationModeling Extraneous Activity Delays in Business Process Simulation
Modeling Extraneous Activity Delays in Business Process Simulation
 
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...Business Process Simulation with Differentiated Resources: Does it Make a Dif...
Business Process Simulation with Differentiated Resources: Does it Make a Dif...
 
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource ConstraintsPrescriptive Process Monitoring Under Uncertainty and Resource Constraints
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints
 
Robotic Process Mining
Robotic Process MiningRobotic Process Mining
Robotic Process Mining
 
Process Mining: A Guide for Practitioners
Process Mining: A Guide for PractitionersProcess Mining: A Guide for Practitioners
Process Mining: A Guide for Practitioners
 
Process Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptxProcess Mining for Process Improvement.pptx
Process Mining for Process Improvement.pptx
 
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
Data-Driven Analysis of  Batch Processing Inefficiencies  in Business ProcessesData-Driven Analysis of  Batch Processing Inefficiencies  in Business Processes
Data-Driven Analysis of Batch Processing Inefficiencies in Business Processes
 
Optimización de procesos basada en datos
Optimización de procesos basada en datosOptimización de procesos basada en datos
Optimización de procesos basada en datos
 
Process Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process ImprovementProcess Mining and AI for Continuous Process Improvement
Process Mining and AI for Continuous Process Improvement
 
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time ReductionPrescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction
 
On the Road to AI-Infused Process Execution
On the Road to AI-Infused Process ExecutionOn the Road to AI-Infused Process Execution
On the Road to AI-Infused Process Execution
 

Recently uploaded

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 

Recently uploaded (20)

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 

Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?

  • 1. Marlon Dumas Professor @ University of Tartu Co-founder @ Apromore With contributions by Manuel Camargo and Oscar González-Rojas Baltic DB&IS’2022, 5 July 2022
  • 3. Financial Human Resources Technology Organisation Function A (Sales) Function B (Manufacturing) Function C (Finance) Assets & Partners Customers Data Business Process Business Process Business Process Business processes 3 Materials
  • 5. 5 Dispatch order Prepare route - perishable goods X Prepare regular route X Shipment method Perishables Hybrid Verify Order Business Process  Reallocate resources  Automate tasks  Parallelize activities  Modify the sequence flow  Increase de process demand Process Managers(s) Business Analyst(s) “What-If” Business Process Analysis
  • 6. 6 Process Credit Card Accept Cash or Check Identify payment method 18 / 8 min 20 / 5 min 5 / 2 min Prepare package for customer 10 / 5 min Cycle time Processing / Waiting times Costs x activity x resource … Resource utilization 1h 5 min 38 min 10 min The Traditional Answer: Business Process Simulation
  • 7. 8
  • 9. 0 Arrival rate = 2 applications per hour Inter-arrival time = 0.5 hour Negative exponential distribution From Monday-Friday, 9am-5pm 0.3 0.7 0.3
  • 11. Business Process Simulation: Assumptions The process model is authoritative (always followed to the letter) • No deviations • No workarounds The simulation parameters accurately reflect reality • …whereas in reality, they are often guesstimates A resource only works on one task instance at a time / a task is performed by one resource • No multi-tasking / no multi-resource tasks (teamwork) Resources have robotic behavior (eager resources consume work items in FIFO mode) • No batching • No tiredness effects, no interruptions, no distractions beyond “stochastic” ones Undifferentiated resources • Every resource in a pool has the same performance as others No time-sharing outside the simulated process • Resources fully dedicated to one process 12
  • 12. End Result Business process simulations based on incomplete models, guesstimates, and simplifying assumptions are not faithful  adoption of business process simulation is disappointing 3
  • 14. Given • one or more business processes, for which we have: • one or more process specifications and/or • event logs generated by the execution of the processes on top of one or more information systems. • one or more process performance measures of interest (e.g. cycle time, resource cost) • One or more changes to the process (interventions) Predict • Predict the values of the process performance measures after the given interventions.
  • 15. Non-Functional Requirements 16 Predictions accurate. Accuracy may be measured e.g. via an error between the predicted and the actual performance measures after intervention. Predictions should be accompanied by a reliability estimate. In most cases, the reliability is high. Reliability could be captured, e.g. by confidence intervals
  • 17. 18 {T1 -> T2 -> T3} {T1 -> T3 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T3 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} Event log Process Model Discovery Model-to-trace alignment & repair Control Flow Discovery Model enhancement Simulation parameters extraction BPS model assembly Vs. Generated Ground truth Accuracy assessment Tuning Hyperparameter optimizer Simulator Simulated Log https://github.com/AutomatedProcessImprovement/Simod
  • 18. 19 Branching probabilities definition Interarrival dist. Resource pool discovery Task assignment Task 1 Task 2 Task 3 Activity process times X Task 4 X Availabiity timetables Multi-tasking behavior Batching and prioritization
  • 19. Phase Category Variable Control flow discovery Parallelism threshold (ε) Filtering threshold (η) Parameters for log repair Simulation parameters discovery Thresholds for resource pool discovery Parameters for fitting temporal distributions Optimal alignment of complete bipartite graph Test Log x Simulation Log weighted by Damerau-Levenshtein (DL) distance, with penalty for temporal mismatch {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} Test Fold of Event-Log Simulated Log
  • 20. 2 1 A1 A3 A3 A5 A6 A7 A8 A2 A3 A4 A5 A6 A7 A8 Delete(A1) Substitute(A2) * Insert(A4) Transpose (A8↔A7) Ꝺ1: Ꝺ2: Damerau-levenshtein distance (DL) Control-Flow Similarity (CLFS) A1 A3 A3 A5 A6 A7 A8 Delete(A1) Substitute(A2) * Insert(A4) Transpose (A8↔A7) Ꝺ1: A2 A3 A4 A5 A6 A7 A8 Ꝺ2: Business Process Trace Distance (BPTD) Error in waiting and processing time No penalization if parallel activities Event Log Similarity (ELS)
  • 21. Dataset Control-Flow Similarity (string-edit distance) Temporal Similarity (timed-string edit distance) Call centre 0.37 0.41 Pharmacy customer service 0.29 0.30 Purchase-to-Pay 0.55 0.57 Make-to-order manufacturing 0.65 0.69 Academic credentials recognition 0.32 0.29 Insurance claims handling 0.39 0.43 Loan Origination 0.41 0.42 Discover simulation model Simulate model 10 times Evaluate Similarity (mean string-edit distance & timed- string edit distance) This Photo by Unknown Author is licensed under CC BY-NC
  • 22. We can try to fill in the glass • Discover and add batching behavior to simulation models • .. prioritization • … timers and external factors (not explicit in the data) • etc. Or perhaps we should look for another paradigm…. 23
  • 23. 24 How is going to continue this case until it is finished? How long is this case still going to take until it is finished? What is the next activity for this case? When is this next activity going to take place? … Generate a set of traces (event log) E1 E2 E3 Running case
  • 24. 2 5 {T1 -> T2 -> T3} {T1 -> T3 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T2 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} {T1 -> T3 -> T2} {T1 -> T2 -> T3} {T1 -> T2 -> T3} Event log Pre-processing • Timestamp relativization • Scaling Continuous features • Embedded dimensions • N-grams extraction • Discovery of roles Discrete features Generation & Assessment Selection method • Arg. Max • Random choice Accuracy assessment Model selection & Training Architecture selection Training with examples of size 0 • Specialized • Shared • Concatenated
  • 25. Partition 2 (20%) Testing Partition 1 (80%) Training Testing Deep Learning Trainer BEST DL MODEL Trace generator Time splitting Evaluator DDS Parameter extraction BEST SIM MODEL Simulator Training (80%) Validation (20%) Event-log 1 2 3
  • 26. • DDS Models (SIMOD) and DL models have comparable performance w.r.t. control-flow similarity (CLFS) • DL models sometimes clearly outperform DDS models on temporal metrics (MAE, ELS) Could we combine them?
  • 27. - Assumes undifferentiated resources with robotic behavior - Branches are selected using branching probabilities Generative Deep Learning Methods - Does not explicitly take into account resource constraints - Models resource availability via neural networks that may capture non-linear availability functions - Learns the case arrival process from data (univariate or multivariate models) - May take as input a process specification (helps with interpretability) - Models the case creation process via a probability distribution - Takes into account resource constraints - No interpretable process specification - Branching behavior modeled via neural networks (e.g. LSTM) that may capture complex relations - Models resource availability as calendars (possibly discovered from historical data) - May capture differentiated resources and robotic behavior Data-Driven (Discrete Event) Simulation - Provides a natural mechanism for capturing the effect of changes to the process - Does not have a mechanism for capturing the effect of changes to the process
  • 28. Phase 1 A1 A2 A3 A5 A6 A4 Ꝺ1: A2 A3 A4 A5 Ꝺ2: Phase 2 A1 A2 A3 A5 A6 A4 A2 A3 A4 A5 Phase 3 e1- start e1- complete e2- start 𝜎1 Ac1 Ac2 e2- complete Waiting time predictive model Features: Wait+Ac2+Cx+WIP+RO Processing time predictive model Features: Proc+Ac1+Cx+WIP+RO A1 A2 A3 A5 A6 A4 A2 A3 A4 A5 Discovering a process model to generate traces Learning a time series generator to determine when each trace starts Deep-learning the processing time and waiting time of each activity in a given trace
  • 29. 32 Deep Simulation generally outperforms classical DDS in temporal measures
  • 30. 34 Batch 1 Batch 2 Batch 3 Batch 4 Batch 5 Batch 6 • DeepSimulator can better estimate the impact of changes in the demand in settings where such changes have been previously observed in the data.
  • 31. 36 • The accuracy of DeepSimulator degrades when evaluated in a previously unobserved scenario (new task is added to the process) SIMOD DSIM SIMOD DSIM SIMOD DSIM Version 1 BPI17W 971151 417572 0.02222 0.03593 3185 3647 BPI12W 660211 534341 0.11295 0.04853 515 458 CVS 1489252 467572 0.03213 0.00001 3380 849 Version 2 BPI17W 895524 290980 0.06438 0.03218 4528 3431 BPT12W 550266 524995 0.25888 0.22003 726 507 CVS 540112 246159 0.15674 0.05708 2453 1967 AS-IS WHAT-IF AS-IS WHAT-IF AS-IS WHAT-IF CFM 7155 17546 22006 33137 0.15629 0.28762 CVS 283061 1040344 357717 1052255 0.31972 1.84601 Log MAE RMSE SMAPE Scenario 1 Scenario 2 Log MAE EMD DTW This Photo by Unknown Author is licensed under CC BY-NC-ND
  • 32. Wrap-Up • There’s a long road ahead to constructing accurate and reliable simulation models from event logs • Combination of deep learning techniques & simulation promising, but need to be further researched to become practically usable for what- if analysis • Extensions needed to support a wide range of interventions / changes • Extensions needed to provide reliability estimates (for what-if analysis) • More validation in large-scale scenarios 37
  • 33. References Limitations and pitfalls of traditional BP simulation • van der Aalst: Business Process Simulation Survival Guide. In Handbook on Business Process Management Vol. 1, 2015, 337-370 Data-Driven Simulation (discovering simulation models from logs) • Rozinat et al. Discovering simulation models. Inf. Syst. 34(3): 305-327 (2009) • Martin et al. The Use of Process Mining in Business Process Simulation Model Construction - Structuring the Field. Bus. Inf. Syst. Eng. 58(1): 73-87 • Camargo et al. Automated discovery of business process simulation models from event logs. Decis. Support Syst. 134:113284, 2020 https://arxiv.org/abs/2009.03567 • Pourbafrani et al. Extracting Process Features from Event Logs to Learn Coarse-Grained Simulation Models. CAiSE 2021: 125-140 Data-Driven Simulation and Deep Learning • Camargo et al. Discovering Generative Models from Event Logs: Data-driven Simulation vs Deep Learning, PeerJ Computer Science, 7: e577, 2021 https://peerj.com/articles/cs-577/ • Camargo et al. Learning Accurate Business Process Simulation Models from Event Logs via Automated Process Discovery and Deep Learning. CAiSE’2022 https://arxiv.org/abs/2103.11944