Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models

Can I Trust My Simulation
Model? Measuring the
Quality of Business Process
Simulation Models
David Chapela-Campa1, Ismail Benchekroun2, Opher Baron2,
Marlon Dumas1, Dmitry Krass2, and Arik Senderovich3
21st International Conference on Business Process
Management (BPM 2023)
1 University of Tartu, Estonia
2 University of Toronto, Canada
3 York University, Canada

Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models 2
Introduction

Business Process Simulation (BPS)
3
BPS allows users to address “what-if” analysis questions.
What would be the cycle time of the process if the rate of arrival of new cases
doubles?
Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models

4
BPS models can be manually created by modeling experts.

5
BPS models can be manually created by modeling experts.
Use of process mining techniques to automatically discover BPS
models from business process event logs.

6
How to assess the quality of a BPS model?
Automatic assessment.
Useful to detect the sources of deviations.

Proposed Framework

Quality of a BPS model
8
Comparing an event log with a BPS model.
Variety of different BPS models formats.
Process event log

9
Generate K simulated event logs.
Compare individually and report the average and confidence interval.
9
Process event log
K simulated event logs

10
A BPS model can be very accurate in one aspect (e.g., control-flow), yet
very different in another (e.g., processing times).
Three main dimensions: control-flow, temporal, congestion.
Process event log Simulated event log

Proposed Framework
Control-flow measures

Control-flow: Control-Flow Log
Distance
12
Control-Flow Log Distance (CFLD): given two event logs L1 and L2,
(minimum) average distance to transform each case in L1 into another
case in L2, such that each case in L1 is paired to a different case in L2.
Camargo, M., Dumas, M., Rojas, O.G.: Discovering generative models from event logs:
data-driven simulation vs deep learning. PeerJ Comput. Sci. 7, e577 (2021)

Distance
13
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H

Distance
14
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
0

Distance
15
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
0
0
0.2

Distance
16
CFLD =
0+0+0.75+0+0.2
5
= 0.19

Control-flow: N-Gram Distance
17
N-Gram Distance (NGD): given two event logs L1 and L2, and a positive
integer 𝑛, difference in the frequencies of the 𝑛-grams observed in
both L1 and L2.
Leemans, S.J.J., Syring, A.F., van der Aalst, W.M.P.: Earth movers’ stochastic conformance
checking. In: BPM Forum 2019. LNBIP, vol. 360, pp. 127–143. Springer (2019)

18
both L1 and L2.
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
N = 3

19
both L1 and L2.
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
N = 3

20
both L1 and L2.
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
N = 3

21
both L1 and L2.
A B C D
A B C D
A C B D
A E F G H
A E F G I
A B C D
A C B D
A E F G
A E F G H
A E F G H
N = 3

22
both L1 and L2.
0
1
2
3
4
5
6
_ _ A _ A B _ A C _ A E A B C A C B A E F B C D C B D E F G F G H F G I C D _

Proposed Framework
Temporal measures

Temporal: Absolute Event
Distribution
24
Absolute Event Distribution (AED): given two event logs L1 and L2,
distance between the time series of the events in L1 and L2.
How different they are distributed through the event log.

Distribution
25

Distribution
26
06-10-2022 10am – 11am
07-10-2022 11am – 12pm

Distribution
27
Earth
mover's
distance

Temporal: Circadian Event
Distribution
28
Circadian Event Distribution (CED): given two event logs L1 and L2,
distance between the time series of the events in L1 and L2, for each
day of the week.
How different they are distributed through each day of the week.
Monday
Tuesday
Wednesday
Thursday 10am – 11am
Friday 11am – 12pm

Temporal: Circadian Event
Distribution
29
Circadian Event Distribution (CED): given two event logs L1 and L2,
distance between the time series of the events in L1 and L2, for each
day of the week.
How different they are distributed through each day of the week.
EMD
Monday Monday

Temporal: Relative Event
Distribution
30
Relative Event Distribution (RED): given two event logs L1 and L2,
distance between the time series of the events in L1 and L2, with
respect to the start of their case.
How different they are distributed within each process case.
00:00:00 01:01:47

Temporal: Relative Event
Distribution
Relative Event Distribution (RED): given two event logs L1 and L2,
distance between the time series of the events in L1 and L2, with
respect to the start of their case.
How different they are distributed within each process case.
EMD

Proposed Framework
Congestion measures

Congestion: Case Arrival Rate
33
Case Arrival Rate (CAR): given two event logs L1 and L2, distance
between how the case arrivals are distributed in L1 and L2.
06-10-2022 10am – 11am

Congestion: Case Arrival Rate
34
Case Arrival Rate (CAR): given two event logs L1 and L2, distance
between how the case arrivals are distributed in L1 and L2.
EMD

Congestion: Cycle Time Distribution
35
Cycle Time Distribution (CTD): given two event logs L1 and L2, distance
between the distribution of cycle times in L1 and L2.
01:07:02

Congestion: Cycle Time Distribution
36
Cycle Time Distribution (CTD): given two event logs L1 and L2, distance
between the distribution of cycle times in L1 and L2.
EMD

Evaluation

Evaluation
38
EQ1: Are the proposed measures able to discern the impact of different
known modifications to a BPS model?
EQ2: Is the N-Gram Distance’s performance significantly different from
the CFLD’s performance?
No modifications
Control-flow
Gateway probabilities
Case arrival rate
Activity durations
Resource contention
Working calendars
Extraneous delays

Evaluation
39

Evaluation
40
EQ1: Are the proposed measures able to discern the impact of different
known modifications to a BPS model?

Evaluation
41
EQ2: Is the N-Gram Distance’s performance significantly different from
the CFLD’s performance?
Kendall
rank
correlation
coefficient
1.0

Evaluation
42
EQ3: Given two BPS models discovered by existing automated BPS
model discovery techniques in real-life scenarios, are the proposed
measures able to identify the strengths and weaknesses of each
technique?

Evaluation
43
technique?
4 real-life processes: each split into disjoint training and test.

Evaluation
44
technique?
Automatically discover BPS model with SIMOD and Service Miner.

Evaluation
45
technique?

Evaluation
46
technique?

Evaluation
47
technique?

Evaluation
48
technique?

Evaluation
49
EQ4: Does the 1-WD report the same insights in real-life scenarios as
the EMD?

Conclusion

Conclusion
51
Proposed a framework to measure the quality of a BPS model:
decomposing into three perspectives (control-flow, temporal, and
congestion), and defined measures for each of these perspectives.
The measures proved their ability to detect the alterations in their
corresponding perspectives.
Beyond capturing the quality of BPS model and identifying the sources of
discrepancies, the measures can also assist in eliciting areas for
improvement in these techniques.
The presented computationally efficient alternatives led to similar
conclusions.

Future Work
52
Explore the applicability of the proposed measures to other process
mining problems, e.g., concept drift detection and variant analysis.
Studying how to assess the quality of BPS models in the context of
object-centric event logs.
Study other quality measures for BPS models adapted from the field of
generative machine learning, for example, by using a discriminative
model that attempts to distinguish between data generated by the
BPS model and real data.

Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models

Similar to Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models (20)

More from Marlon Dumas

More from Marlon Dumas (20)

Recently uploaded

Recently uploaded (20)

Can I Trust My Simulation Model? Measuring the Quality of Business Process Simulation Models

Editor's Notes