Predictive Business Process Monitoring with LSTM Neural Networks

Research paper presentation on deep learning for predictive monitoring of business processes. Talk delivered by Niek Tax at the CAiSE'2017 conference, 15 June 2017. Research paper available at: http://tinyurl.com/yambgtng and source code available at: https://github.com/verenich/ProcessSequencePrediction/

Predictive Business Process Monitoring with LSTM Neural Networks
Niek Tax (TU/e)
Ilya Verenich (QUT)
Marcello La Rosa (QUT)
Marlon Dumas (Tartu)
June 15th, 2017

Predictive Business Process Monitoring in a Nutshell

Predictive Monitoring: General Approach

Predictive Monitoring Example
Current situation
• What is the next activity for this case?
• When is this next activity going to take place?
• How long is this case still going to take until it is finished?
• What is the outcome of this case? Is the compensation going to be paid or rejected?
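
The questions on this slide can be read as prediction targets derived from a running case. As a minimal sketch, the code below shows how each prefix of a completed case yields a next activity, a time until the next event, and a remaining cycle time (the outcome question is not covered). The activity names, timestamps, and the helper `prefix_targets` are illustrative assumptions, not taken from the talk or its source code.

```python
from datetime import datetime

# Hypothetical completed case: (activity, timestamp) pairs in execution order.
case = [
    ("Register claim", datetime(2017, 6, 1, 9, 0)),
    ("Check documents", datetime(2017, 6, 1, 11, 30)),
    ("Assess claim", datetime(2017, 6, 3, 10, 0)),
    ("Pay compensation", datetime(2017, 6, 5, 16, 0)),
]


def prefix_targets(events):
    """Yield (prefix, next_activity, hours_to_next, hours_remaining) per prefix."""
    end_time = events[-1][1]
    for k in range(1, len(events)):
        prefix = [activity for activity, _ in events[:k]]
        current_time = events[k - 1][1]
        next_activity, next_time = events[k]
        hours_to_next = (next_time - current_time).total_seconds() / 3600
        hours_remaining = (end_time - current_time).total_seconds() / 3600
        yield prefix, next_activity, hours_to_next, hours_remaining


targets = list(prefix_targets(case))
for prefix, nxt, to_next, remaining in targets:
    print(prefix, "->", nxt, f"(next in {to_next:.1f} h, {remaining:.1f} h remaining)")
```

Each tuple pairs the events observed so far with what a predictive model would be asked to estimate at that point in the case.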

Recurrent Neural Networks
• Traditional RNNs have problems with long-term dependencies in the data!
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436-444 (2015)

Long Short-Term Memory
Figure from http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Computation 9(8), 1735-1780 (1997)
How do we represent events as neural network inputs?

From Event to Feature Vector
• Activity: one-hot encoding, e.g. <1,0,0,0,0,0,0,0>
- 8 different activities
- Each position represents one activity
• Time features:
1) Time since midnight
2) Time since week start
3) Time since last event

Neural Network Architectures

Data Sets for Evaluation
• Helpdesk log
- Ticketing management process of the helpdesk of an Italian software company
- Events: 13 710, Cases: 3 804, Activities: 9
• BPI Challenge 2012 W subprocess log
- A financial loan application process at a large financial institution in the Netherlands
- Events: 72 413, Cases: 9 658, Activities: 6
• Environmental Permit log
- Process of handling environmental permit applications at a Dutch municipality
- Events: 38 944, Cases: 937, Activities: 381

Baseline Technique for Time Prediction
van der Aalst, W.M.P., Schonenberg, M.H., & Song, M. (2011). Time prediction based on process mining. Information Systems, 36(2), 450-475.

Predicting the Next Activity and its Time
• Two layers, of which one is shared, is the best architecture on both data sets
• LSTMs outperform traditional RNNs
• LSTMs outperform the transition-system-based approach for time-of-next-event prediction
• LSTMs are more accurate in predicting the next activity on BPI’12 W than two baseline methods

Predicting the Suffix of a Case
Polato, M., Sperduti, A., Burattin, A., de Leoni, M.: Time and activity sequence prediction of business process instances. arXiv preprint arXiv:1602.07566 (2016)

Predicting the Remaining Cycle Time

Conclusions
• LSTMs can be used as a general framework for prediction tasks in the context of business processes that outperforms tailor-made approaches on a range of tasks and data sets
• Predicting time and activity in one shared model outperforms predicting both in separate models
• We identified a limitation of the technique when event logs contain many repeated events
• Code and documentation are available at: http://verenich.github.io/ProcessSequencePrediction

Neural Network Details
• Learning algorithm: Adam
• Loss functions:
- Objective "Activity": cross-entropy loss
- Objective "Time": mean absolute error loss
• Batch Normalization layer after every two layers
Ioffe, S., & Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. In Proceedings of the 32nd International Conference on Machine Learning (pp. 448-456).
Kingma, D., & Ba, J. (2015). Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations.
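
The two losses named on this slide could be computed as in the sketch below. The slide only states which loss belongs to which objective; the equal-weight sum of the two terms, the function names, and the example numbers are assumptions for illustration.

```python
import math


def cross_entropy(predicted_probs, true_index):
    """Categorical cross-entropy for one event: -log of the true activity's probability."""
    return -math.log(predicted_probs[true_index])


def mean_absolute_error(predicted, actual):
    """Mean absolute error over a batch of time predictions."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical batch of two prefixes.
activity_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]  # softmax outputs per prefix
true_activities = [0, 1]                             # indices of the actual next activities
predicted_times = [2.0, 5.0]                         # e.g. days until the next event
actual_times = [1.5, 6.0]

activity_loss = sum(
    cross_entropy(p, t) for p, t in zip(activity_probs, true_activities)
) / len(true_activities)
time_loss = mean_absolute_error(predicted_times, actual_times)

# Assumption: both objectives are summed with equal weight.
total_loss = activity_loss + time_loss
print(round(activity_loss, 4), round(time_loss, 4), round(total_loss, 4))
```

In a shared model, a single optimizer such as Adam would minimize a combined value like `total_loss`, so that both objectives influence the shared layers.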

Editor's Notes

This figure sketches out a high-level architecture of a business process monitoring system supporting predictive monitoring. Predictive monitoring produces recommendations for process workers during the execution of a case. These recommendations refer to a specific (uncompleted) case of the process and tell the user the impact of a given action on the probability that the case at hand will fail to fulfill the performance objectives or compliance rules. In particular, predictive monitoring can be used to raise alerts when certain actions are likely to lead to a violation of business constraints. In this way, rather than imposing what to do, the business process support system acts as a compliance monitoring and recommender system, raising flags whenever certain actions heighten the probability of undesirable deviations. In this context, an SLA is any condition that can be evaluated to true or false over every completed case of the process (“every simple insurance claim should be resolved at most 2 weeks after all required documents have been submitted”) or a performance objective (“every claim should be resolved within 2 months”).
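
A condition of this kind can be written as a predicate over a completed case. The sketch below checks the "resolved within 2 months" objective; the event names, the dates, the helper `resolved_within`, and the reading of "2 months" as 61 days are assumptions made for illustration.

```python
from datetime import datetime, timedelta


def resolved_within(case_events, limit):
    """True if the completed case took at most `limit` from first to last event."""
    timestamps = [ts for _, ts in case_events]
    return max(timestamps) - min(timestamps) <= limit


# Hypothetical completed insurance claim.
claim = [
    ("Claim received", datetime(2017, 1, 10)),
    ("Documents complete", datetime(2017, 1, 20)),
    ("Claim resolved", datetime(2017, 2, 28)),
]

# Assumption: "within 2 months" is approximated as 61 days.
meets_objective = resolved_within(claim, timedelta(days=61))
print(meets_objective)
```

Predictive monitoring estimates the probability that such a predicate will end up false while the case is still running, rather than evaluating it after completion.
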
This slide only at Benelearn, skip it at CAiSE
Process discovery is the task of discovering a process model from a log of events extracted from, for instance, an ERP system. Each event in an event log belongs to a case, which groups together events that belong together; here, each case represents a paper and each event represents a step in the submission process of that paper. Each case can be seen as an instance of the process. The process model generated by a process discovery algorithm can be in any process modeling notation, depending on the algorithm; Petri nets are often used, as shown here on the screen.
