Log-Based Understanding of Business Processes through Temporal Logic Query Checking

Log-Based Understanding
of Business Processes
through Temporal Logic Query Checking
Margus Räim, Claudio Di Ciccio, Fabrizio Maria Maggi, Massimo Mecella, and Jan Mendling
22nd International Conference on Cooperative Information Systems
Amantea, Italy
claudio.di.ciccio@wu.ac.at

Outline
 Process Mining,
Log-based Understanding
SEITE 2

Outline
 Process Mining,
 LTLf query checking
 Our objective
SEITE 3

Outline
 Process Mining,
 Our objective
 Folding the future
SEITE 4

Outline
 Process Mining,
 Our objective
 Folding the future
 Evaluation
 Conclusion
SEITE 5

The event log
Process model
SEITE 6

The event log
Process instance Event log
Trace
SEITE 7

The event log
EventTask
Trace
SEITE 8

The event log
Trace
SEITE 12

Control-flow discovery
?
Objective: understanding the
temporal structure that best
describes the process behind
the event log
SEITE 17

Control-flow discovery
v. flexible processes
SEITE 18

Temporal constraints mining
?
Objective: understanding the
constraints that best define
the allowed behaviour of the
process behind the event log
SEITE 19

Log-based understanding
of a process model
1. Which activities require another one to follow?
2. Which activities require another one to
precede?
3. Which activities are mutually exclusive?
…
We want to leave the user free to specify the
rules to be discovered
SEITE 20

LTLf
 Linear Temporal Logic (LTL) was originarily a
specification language for the execution of
(endless) concurrent programs (Pnueli, 1977)
 Syntax (let A be a propositional symbol):
 Interpretation over infinite traces,
i.e., an infinite sequence of consecutive instants of time
 LTLf formulae are meant to be interpreted over
finite traces
“Until”
“Eventually”“Always”
“Next”

Log-based understanding
SEITE 22
precede?
1.
2.
3.

Log-based understanding:
An example
SEITE 23
A A B C A B C A C B C
C C C C C A A B C A A B A A B
A B B B D
B A B D
A B B D
C A B A A C C B B
B D A D B D
A B C A A B B C
D D D D D
C A A C C C A A B C B C C B D

An example
SEITE 24
A B B B D
B A B D
A B B D
C A B A A C C B B
B D A D B D
A B C A A B B C
D D D D D
C A A C C C A A B C B C C B
A always requires B to follow (10/10)
0 Counterexamples
10 Witnesses

An example
SEITE 25
A B B B D
B A B D
A B B D
C A B A A C C B B
B D A D B D
A B C A A B B C
D D D D D
B does not always require A to precede (8/10)
2 Counterexamples
8 Witnesses

An example
SEITE 26
A B B B D
B A B D
A B B D
C A B A A C C B B
B D A D B D
A B C A A B B C
D D D D D
C and D are mutually exclusive (10/10)
10 Witnesses
0 Counterexamples

LTLf query checking
SEITE 27
precede?
1.
2.
3.
Placeholder
Placeholder
Placeholders are meant to be assigned with one of the activities in the log alphabet
(in the example, either to A, B, C or D)

Understading real-life logs:
why we cannot do it by inspection
SEITE 28

Recap
 An event log is given
 The user wants to have an understanding of what
went on there, to gain knowledge about the process
behind such log
 To this extent, (s)he formulates queries, asking for
activities that satisfy given conditions about
temporal constraints
 Our technique aims at answering such queries
 We take advantage of the fact that:
1. we know the finite set of activities of which the process
consists,
2. the queries are formulated in a well-known formal
language, and
3. …
SEITE 29

The technique:
assumptions
SEITE 30
… We know the future

Traces are finite
SEITE 31
A A B C A B C A C B C ¶
C C C C C A A B C A A B A A B ¶
A B B B D ¶
B A B D ¶
A B B D ¶
C A B A A C C B B ¶
B D A D B D ¶
A B C A A B B C ¶
D D D D D ¶
C A A C C C A A B C B C C B D ¶

Folding from the future
SEITE 32
4 Witnesses, 1 Counterexample

SEITE 33

SEITE 34
4 Witnesses,
1 Counterexample

Divide et impera:
the query evaluation tree
SEITE 35
The algorithm is designed to recursively call sub-procedures

Evaluation:
performance w.r.t. query
SEITE 36
Default log:
100 traces of 10 events each, log alphabet of 10 activities
Windows 7 OS, Intel Core i7 CPU, 8GB of main memory
Prototype encoded in C (https://github.com/r2im/pickaxe)

Evaluation:
performance w.r.t. query
SEITE 37
Default log:
100 traces of 10 events each, log alphabet of 10 activities
Windows 7 OS, Intel Core i7 CPU, 8GB of main memory
Prototype encoded in C (https://github.com/r2im/pickaxe)

Evaluation:
case study
SEITE 38
BPI Challenge 2011 (Dutch hospital’s log)
1,143 cases and 150,291 events, 623 activities

Conclusions
 What we saw:
 A novel technique for the log-based understanding of a
process model
 More in the paper:
 Formal definition of the folded temporal structure
 The algorithm for answering LTLf queries
 Proof of the theorem stating the soundness of the
proposed algorithm
 Experiments in detail
 Future work:
 Improve performance
 Create a user-interaction for refining the query
formulation, iteratively
SEITE 39

Log-Based Understanding
of Business Processes
through Temporal Logic Query Checking
Margus Räim, Claudio Di Ciccio, Fabrizio Maria Maggi, Massimo Mecella, and Jan Mendling
Extra

Verifying constraints on log
(state of the art)
SEITE 42
B|C|D
A|C|D
A
B
 A A B C A B C A C B C
 C C C C C A A B C A A B A A B
 A B B B D
 B A B D
 A B B D
 C A B A A C C B B
 B D A D B D
 A B C A A B B C
 D D D D D
 C A A C C C A A B C B C C B D

(state of the art)
SEITE 43
B|C|D
A|C|D
A
B
C|D
A|B|C|D
A
B
A|B|C|D
 A B B B D
 B A B D
 A B B D
 B D A D B D
 A B C A A B B C
 D D D D D

(state of the art)
SEITE 44
B|C|D
A|C|D
A
B
C|D
A|B|C|D
A
B
A|B|C|D
C|D
A|C|D
B|C|D
A
B
A
B
A|B|C|D
A|B|C|D
Here we already know which activities
are meant to be constrained
 A B B B D
 B A B D
 A B B D
 B D A D B D
 A B C A A B B C
 D D D D D

Intuition
 Replay turns out to be the best technique to
 Maintain the history in the current state, and
 Wait for the future moves, which are unknown
 Working with logs, we have an advantage…
SEITE 45
B|C|D
A|C|D
A
B
C C C C C A A B C A A B A A B C C A C A B A C B B A B C

The technique:
assumption
SEITE 46
… We know the future

SEITE 47
[Â]
[^B]
A
B
A B B B D
B A B D
A B B D
C A B A A C C B B
B D A D B D
A B C A A B B C
D D D D D
C A A C C C A A B C B C C B D
[Â]
[^C]
A
C
[Â]
[^D]
A
D
[^B]
[Â]
B
A
[^B]
[^C]
B
C
[^B]
[^D]
B
D
[^C]
[Â]
C
A
[^C]
[^B]
C
B
[^C]
[^D]
C
D
[^D]
[Â]
D
A

SEITE 48
[Â]
[^B]
A
B
[Â]
[^C]
A
C
[Â]
[^D]
A
D
[^B]
[Â]
B
A
[^B]
[^C]
B
C
[^B]
[^D]
B
D
[^C]
[Â]
C
A
[^C]
[^B]
C
B
[^C]
[^D]
C
D
[^D]
[Â]
D
A

Recap
SEITE 49
Understand the
rules behind a log
in a more
practicable yet
customisable
way

Log-Based Understanding of Business Processes through Temporal Logic Query Checking

Recommended

Recommended

More Related Content

Similar to Log-Based Understanding of Business Processes through Temporal Logic Query Checking

Similar to Log-Based Understanding of Business Processes through Temporal Logic Query Checking (6)

More from Claudio Di Ciccio

More from Claudio Di Ciccio (12)

Recently uploaded

Recently uploaded (20)

Log-Based Understanding of Business Processes through Temporal Logic Query Checking