SlideShare a Scribd company logo
1 of 22
Decision Mining Revisited
Discovering Overlapping Rules
Felix Mannhardt, Massimiliano de Leoni,
Hajo A. Reijers, Wil M.P. van der Aalst
Scope: Mining decision rules from event logs
PAGE 1
Apply
Amount
Grant
Extensive
Check
Reject
Eligibility
Simple
Check
Request
Information
Income
Receive
Information
Category
Activity
Data
Control-flow – Petri net defines order & possible choices
PAGE 2
Apply Grant
Extensive
Check
Reject
Simple
Check
Request
Information
Receive
Information
Exclusive
Choice
Sequence
Exclusive
Choice
Data-perspective – Data Petri Net modelling decisions
PAGE 3
Decision point
Data recording
Decision rule
PAGE 4
DMN 1.1 released on 2016
Widely adopted by tool vendors, for example:
U Eligibility Outcome
1 Yes Grant
2 No Reject
Decision Table
Grant
Reject
[Eligibility = No]
[Eligibility = Yes]
Comparing the Petri net notation to DMN
Decision Rule / Guard
Why are overlapping rules needed?
PAGE 5
Incomplete
Information
• Not recorded
• Process context
• Confidential
• ...
• Expert approval
• Deferred choice
• Randomized check
• Inconsistent human behavior
• ...
Goal: Discover rules which may overlap
PAGE 6
Process Model
Event Log
Process Model with
Overlapping Decision Rules
Overlapping Rule
Discovery
Decision point - Mutually-exclusive rule
PAGE 7
Grant
Reject
[Eligibility = No]
[Eligibility = Yes]
Count Eligibility Outcome
5x “No” Reject
20x “Yes” Grant
Observation instances from an event log
Grant
Reject
Decision point – Overlapping rule
PAGE 8
C Rating Amount Activity
1 Good - Simple Check
2 Bad - Extensive Check
3 Bad Low Simple Check
4 Bad High Request Information
5 Unknown - Request Information
Alternative Decision Table Notation
Proposed Discovery Method
PAGE 9
Process Model
Event Log
Process Model
With Overlapping Rules
Overlapping Rule
Discovery
foreach
Decision Point
Collect
Instances
1st
Classification
2nd
Classification
Collect
Misclassified
Build
Rules
1) Collect Instances
PAGE 10
Event Log collect
Rating Amount Outcome
6x Good Low Simple
6x Good High Simple
6x Bad High Extensive
4x Bad High Request
6x Bad Low Extensive
4x Bad Low Simple
6x Unknown High Request
Observation instances
• Cyclic Behavior
• Noise (Missing / Additional Events)
• Unassigned values
• Inconsistent recording
Alignment-based method
2) 1st Classification & 3) Misclassified Instances
PAGE 11
Rating Amount Outcome
6x Good Low Simple
6x Good High Simple
6x Bad High Extensive
4x Bad High Request
6x Bad Low Extensive
4x Bad Low Simple
6x Unknown High Request
Rating
Simple RequestExtensive
Good Unknown
Bad
12 OK 12 OK
8 NOK
6 OK
Instances Decision Tree
4) 2nd Classification
PAGE 12
Instances
Amount
Request Simple
High Low
2nd Decision Tree
Rating Amount Outcome
4x Bad High Request
4x Bad Low Simple
5) Build Overlapping Decision Rules
PAGE 13
Rating
Simple RequestExtensive
Good Unknown
Bad
Amount
Request Simple
High Low
Compiled to overlapping rules
If Rating = Good then Simple
If Rating = Unknown then Request
If Rating = Bad then Extensive
If Rating = Bad AND Amount = High
then Request
If Rating = Bad AND Amount = Low
then Simple
Resulting Data-aware Process Model
PAGE 14
Trade-off: Precise and fitting model
PAGE 15
Rating Amount Outcome
6x Good Low Simple
6x Good High Simple
6x Bad High Extensive
4x Bad High Request
6x Bad Low Extensive
4x Bad Low Simple
6x Unknown High Request
Unfitting
Imprecise
[Underfitting]
Good Trade-off
Evaluation – Measures
PAGE 16
Precision Fitness
How much unobserved
behavior is modelled?
How much observed
behavior is modelled?
Image source (CC BY-SA): https://en.wikipedia.org/wiki/Precision_and_recall#/media/File:Precisionrecall.svg
Evaluation – Setup
PAGE 17
Method Description Expected
Precision
Expected
Fitness
WO Without rules Poor Good
DTF Mutually-exclusive approach Good Poor
DTT Naïve overlapping approach Poor Good
DTO Presented overlapping approach Balanced Balanced
Dataset # Traces # Events # Attributes # Decisions
Road Fines 150,000 500,000 9 5
Hospital 1,000 15,000 39 11
Datasets
Compared Methods
Evaluation – Example rules in the hospital data
PAGE 18
Method Intensive Care Normal Care Skip
DTO L > 0 ∧ H = true L > 0 L ≤ 0 ∨
(L > 0 ∧ H = false)
DTT true L > 0 L ≤ 0
DTF false L > 0 L ≤ 0
Imprecise
Unfitting
Good
trade-off
Evaluation – Precision & Fitness
PAGE 19
Fitness Precision
• Fitness  how often rules are violated
• DTO improves fitness over DTF (mutually-exclusive)
• Precision  how strict are the rules
• DTO improves precision against WO
• DTO does sacrifice precision vs. DTF
Conclusion & Future Work
• Method: Discovery of overlapping rules using event logs
• Based on decision tree induction
• ProM framework: MultiPerspectiveExplorer
http://www.promtools.org
• Results: Trade-off fitness & precision
• Improves the model fitness over
standard trees
• Improves the model precision over
naïve approach
• Future work
• Better experimental validation
• Manage the complexity of discovered rules
• Imbalanced distributions
PAGE 20
Questions?
PAGE 21
@fmannhardt - f.mannhardt@tue.nl - http://promtools.org
Multi-Perspective Explorer

More Related Content

Similar to Decision Mining Revisited - Discovering Overlapping Rules

Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...emermell
 
Automated decision making with predictive applications – Big Data Frankfurt
Automated decision making with predictive applications – Big Data FrankfurtAutomated decision making with predictive applications – Big Data Frankfurt
Automated decision making with predictive applications – Big Data FrankfurtLars Trieloff
 
Ingenia consultants-9 basic steps towards TRM compliance
Ingenia consultants-9 basic steps towards TRM complianceIngenia consultants-9 basic steps towards TRM compliance
Ingenia consultants-9 basic steps towards TRM complianceSami Benafia
 
Feature selection with imbalanced data in agriculture
Feature selection with  imbalanced data in agricultureFeature selection with  imbalanced data in agriculture
Feature selection with imbalanced data in agricultureAboul Ella Hassanien
 
Building better products through Experimentation - SDForum Business Intellige...
Building better products through Experimentation - SDForum Business Intellige...Building better products through Experimentation - SDForum Business Intellige...
Building better products through Experimentation - SDForum Business Intellige...Deepak Nadig
 
Ipac 2014
Ipac 2014Ipac 2014
Ipac 2014cwhms
 
Improving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsImproving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsLouis Rosenfeld
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data Sharjeel Imtiaz
 
How FSQA Data is Driving Continuous Improvement
How FSQA Data is Driving Continuous ImprovementHow FSQA Data is Driving Continuous Improvement
How FSQA Data is Driving Continuous ImprovementSafetyChain Software
 
Anomaly detection Workshop slides
Anomaly detection Workshop slidesAnomaly detection Workshop slides
Anomaly detection Workshop slidesQuantUniversity
 
Automated decision making using Predictive Applications – Big Data Paris
Automated decision making using Predictive Applications – Big Data ParisAutomated decision making using Predictive Applications – Big Data Paris
Automated decision making using Predictive Applications – Big Data ParisLars Trieloff
 
Diseconomies of Scale webinar
Diseconomies of Scale webinarDiseconomies of Scale webinar
Diseconomies of Scale webinarLocality
 
Tailoring machine learning practices to support prescriptive analytics
Tailoring machine learning practices to support prescriptive analyticsTailoring machine learning practices to support prescriptive analytics
Tailoring machine learning practices to support prescriptive analyticsAdam Doyle
 
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...will buckley
 
IT Operation Analytic for security- MiSSconf(sp1)
IT Operation Analytic for security- MiSSconf(sp1)IT Operation Analytic for security- MiSSconf(sp1)
IT Operation Analytic for security- MiSSconf(sp1)stelligence
 
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017Muhammad Faisal Reza
 
140422DH07 - NRF Presentation
140422DH07 - NRF Presentation140422DH07 - NRF Presentation
140422DH07 - NRF PresentationDarrell Huntsman
 

Similar to Decision Mining Revisited - Discovering Overlapping Rules (20)

Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
 
Automated decision making with predictive applications – Big Data Frankfurt
Automated decision making with predictive applications – Big Data FrankfurtAutomated decision making with predictive applications – Big Data Frankfurt
Automated decision making with predictive applications – Big Data Frankfurt
 
Ingenia consultants-9 basic steps towards TRM compliance
Ingenia consultants-9 basic steps towards TRM complianceIngenia consultants-9 basic steps towards TRM compliance
Ingenia consultants-9 basic steps towards TRM compliance
 
Data analytics, a (short) tour
Data analytics, a (short) tourData analytics, a (short) tour
Data analytics, a (short) tour
 
Feature selection with imbalanced data in agriculture
Feature selection with  imbalanced data in agricultureFeature selection with  imbalanced data in agriculture
Feature selection with imbalanced data in agriculture
 
Building better products through Experimentation - SDForum Business Intellige...
Building better products through Experimentation - SDForum Business Intellige...Building better products through Experimentation - SDForum Business Intellige...
Building better products through Experimentation - SDForum Business Intellige...
 
Ipac 2014
Ipac 2014Ipac 2014
Ipac 2014
 
Data mining
Data miningData mining
Data mining
 
Improving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsImproving Findability through Site Search Analytics
Improving Findability through Site Search Analytics
 
Big Data Analytics and Open Data
Big Data Analytics and Open Data Big Data Analytics and Open Data
Big Data Analytics and Open Data
 
1120 track2 bennett
1120 track2 bennett1120 track2 bennett
1120 track2 bennett
 
How FSQA Data is Driving Continuous Improvement
How FSQA Data is Driving Continuous ImprovementHow FSQA Data is Driving Continuous Improvement
How FSQA Data is Driving Continuous Improvement
 
Anomaly detection Workshop slides
Anomaly detection Workshop slidesAnomaly detection Workshop slides
Anomaly detection Workshop slides
 
Automated decision making using Predictive Applications – Big Data Paris
Automated decision making using Predictive Applications – Big Data ParisAutomated decision making using Predictive Applications – Big Data Paris
Automated decision making using Predictive Applications – Big Data Paris
 
Diseconomies of Scale webinar
Diseconomies of Scale webinarDiseconomies of Scale webinar
Diseconomies of Scale webinar
 
Tailoring machine learning practices to support prescriptive analytics
Tailoring machine learning practices to support prescriptive analyticsTailoring machine learning practices to support prescriptive analytics
Tailoring machine learning practices to support prescriptive analytics
 
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...
Optimizing Protocol Planning, Feasibility, and Site Selection through an Inte...
 
IT Operation Analytic for security- MiSSconf(sp1)
IT Operation Analytic for security- MiSSconf(sp1)IT Operation Analytic for security- MiSSconf(sp1)
IT Operation Analytic for security- MiSSconf(sp1)
 
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017
Process Mining Data-driven Process Improvement - idBigdata Meetup 17 Oct 2017
 
140422DH07 - NRF Presentation
140422DH07 - NRF Presentation140422DH07 - NRF Presentation
140422DH07 - NRF Presentation
 

More from Felix Mannhardt

A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...
A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...
A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...Felix Mannhardt
 
Estimating the Impact of Incidents on Process Delay - ICPM 2019
Estimating the Impact of Incidents on Process Delay - ICPM 2019Estimating the Impact of Incidents on Process Delay - ICPM 2019
Estimating the Impact of Incidents on Process Delay - ICPM 2019Felix Mannhardt
 
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...Felix Mannhardt
 
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...Felix Mannhardt
 
From Low-Level Events to Activities - A Pattern-based Approach
From Low-Level Events to Activities - A Pattern-based ApproachFrom Low-Level Events to Activities - A Pattern-based Approach
From Low-Level Events to Activities - A Pattern-based ApproachFelix Mannhardt
 
Analyzing the Trajectories of Patients with Sepsis using Process Mining
Analyzing the Trajectories of Patients with Sepsis using Process MiningAnalyzing the Trajectories of Patients with Sepsis using Process Mining
Analyzing the Trajectories of Patients with Sepsis using Process MiningFelix Mannhardt
 
XESLite - Handling Event Logs in ProM
XESLite - Handling Event Logs in ProMXESLite - Handling Event Logs in ProM
XESLite - Handling Event Logs in ProMFelix Mannhardt
 

More from Felix Mannhardt (7)

A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...
A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...
A Taxonomy for Combining Activity Recognition and Process Discovery in Indust...
 
Estimating the Impact of Incidents on Process Delay - ICPM 2019
Estimating the Impact of Incidents on Process Delay - ICPM 2019Estimating the Impact of Incidents on Process Delay - ICPM 2019
Estimating the Impact of Incidents on Process Delay - ICPM 2019
 
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...
Data-driven Process Discovery - Revealing Conditional Infrequent Behavior fro...
 
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...
Unsupervised Event Abstraction using Pattern Abstraction and Local Process Mo...
 
From Low-Level Events to Activities - A Pattern-based Approach
From Low-Level Events to Activities - A Pattern-based ApproachFrom Low-Level Events to Activities - A Pattern-based Approach
From Low-Level Events to Activities - A Pattern-based Approach
 
Analyzing the Trajectories of Patients with Sepsis using Process Mining
Analyzing the Trajectories of Patients with Sepsis using Process MiningAnalyzing the Trajectories of Patients with Sepsis using Process Mining
Analyzing the Trajectories of Patients with Sepsis using Process Mining
 
XESLite - Handling Event Logs in ProM
XESLite - Handling Event Logs in ProMXESLite - Handling Event Logs in ProM
XESLite - Handling Event Logs in ProM
 

Recently uploaded

Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curveAreesha Ahmad
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Silpa
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptRakeshMohan42
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfSumit Kumar yadav
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 

Recently uploaded (20)

Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curve
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 

Decision Mining Revisited - Discovering Overlapping Rules

  • 1. Decision Mining Revisited Discovering Overlapping Rules Felix Mannhardt, Massimiliano de Leoni, Hajo A. Reijers, Wil M.P. van der Aalst
  • 2. Scope: Mining decision rules from event logs PAGE 1 Apply Amount Grant Extensive Check Reject Eligibility Simple Check Request Information Income Receive Information Category Activity Data
  • 3. Control-flow – Petri net defines order & possible choices PAGE 2 Apply Grant Extensive Check Reject Simple Check Request Information Receive Information Exclusive Choice Sequence Exclusive Choice
  • 4. Data-perspective – Data Petri Net modelling decisions PAGE 3 Decision point Data recording Decision rule
  • 5. PAGE 4 DMN 1.1 released on 2016 Widely adopted by tool vendors, for example: U Eligibility Outcome 1 Yes Grant 2 No Reject Decision Table Grant Reject [Eligibility = No] [Eligibility = Yes] Comparing the Petri net notation to DMN Decision Rule / Guard
  • 6. Why are overlapping rules needed? PAGE 5 Incomplete Information • Not recorded • Process context • Confidential • ... • Expert approval • Deferred choice • Randomized check • Inconsistent human behavior • ...
  • 7. Goal: Discover rules which may overlap PAGE 6 Process Model Event Log Process Model with Overlapping Decision Rules Overlapping Rule Discovery
  • 8. Decision point - Mutually-exclusive rule PAGE 7 Grant Reject [Eligibility = No] [Eligibility = Yes] Count Eligibility Outcome 5x “No” Reject 20x “Yes” Grant Observation instances from an event log Grant Reject
  • 9. Decision point – Overlapping rule PAGE 8 C Rating Amount Activity 1 Good - Simple Check 2 Bad - Extensive Check 3 Bad Low Simple Check 4 Bad High Request Information 5 Unknown - Request Information Alternative Decision Table Notation
  • 10. Proposed Discovery Method PAGE 9 Process Model Event Log Process Model With Overlapping Rules Overlapping Rule Discovery foreach Decision Point Collect Instances 1st Classification 2nd Classification Collect Misclassified Build Rules
  • 11. 1) Collect Instances PAGE 10 Event Log collect Rating Amount Outcome 6x Good Low Simple 6x Good High Simple 6x Bad High Extensive 4x Bad High Request 6x Bad Low Extensive 4x Bad Low Simple 6x Unknown High Request Observation instances • Cyclic Behavior • Noise (Missing / Additional Events) • Unassigned values • Inconsistent recording Alignment-based method
  • 12. 2) 1st Classification & 3) Misclassified Instances PAGE 11 Rating Amount Outcome 6x Good Low Simple 6x Good High Simple 6x Bad High Extensive 4x Bad High Request 6x Bad Low Extensive 4x Bad Low Simple 6x Unknown High Request Rating Simple RequestExtensive Good Unknown Bad 12 OK 12 OK 8 NOK 6 OK Instances Decision Tree
  • 13. 4) 2nd Classification PAGE 12 Instances Amount Request Simple High Low 2nd Decision Tree Rating Amount Outcome 4x Bad High Request 4x Bad Low Simple
  • 14. 5) Build Overlapping Decision Rules PAGE 13 Rating Simple RequestExtensive Good Unknown Bad Amount Request Simple High Low Compiled to overlapping rules If Rating = Good then Simple If Rating = Unknown then Request If Rating = Bad then Extensive If Rating = Bad AND Amount = High then Request If Rating = Bad AND Amount = Low then Simple
  • 16. Trade-off: Precise and fitting model PAGE 15 Rating Amount Outcome 6x Good Low Simple 6x Good High Simple 6x Bad High Extensive 4x Bad High Request 6x Bad Low Extensive 4x Bad Low Simple 6x Unknown High Request Unfitting Imprecise [Underfitting] Good Trade-off
  • 17. Evaluation – Measures PAGE 16 Precision Fitness How much unobserved behavior is modelled? How much observed behavior is modelled? Image source (CC BY-SA): https://en.wikipedia.org/wiki/Precision_and_recall#/media/File:Precisionrecall.svg
  • 18. Evaluation – Setup PAGE 17 Method Description Expected Precision Expected Fitness WO Without rules Poor Good DTF Mutually-exclusive approach Good Poor DTT Naïve overlapping approach Poor Good DTO Presented overlapping approach Balanced Balanced Dataset # Traces # Events # Attributes # Decisions Road Fines 150,000 500,000 9 5 Hospital 1,000 15,000 39 11 Datasets Compared Methods
  • 19. Evaluation – Example rules in the hospital data PAGE 18 Method Intensive Care Normal Care Skip DTO L > 0 ∧ H = true L > 0 L ≤ 0 ∨ (L > 0 ∧ H = false) DTT true L > 0 L ≤ 0 DTF false L > 0 L ≤ 0 Imprecise Unfitting Good trade-off
  • 20. Evaluation – Precision & Fitness PAGE 19 Fitness Precision • Fitness  how often rules are violated • DTO improves fitness over DTF (mutually-exclusive) • Precision  how strict are the rules • DTO improves precision against WO • DTO does sacrifice precision vs. DTF
  • 21. Conclusion & Future Work • Method: Discovery of overlapping rules using event logs • Based on decision tree induction • ProM framework: MultiPerspectiveExplorer http://www.promtools.org • Results: Trade-off fitness & precision • Improves the model fitness over standard trees • Improves the model precision over naïve approach • Future work • Better experimental validation • Manage the complexity of discovered rules • Imbalanced distributions PAGE 20
  • 22. Questions? PAGE 21 @fmannhardt - f.mannhardt@tue.nl - http://promtools.org Multi-Perspective Explorer

Editor's Notes

  1. I would like to present our work about “Decision Mining – Discovering Overlapping Rules”. My name is Felix Mannhardt, I’m a PhD student from the Eindhoven University of Technology. This is joint work with Massimiliano, Hajo and Wil.
  2. First to scope our work, I would like to introduce some of the assumptions/notations underlying our work. We want to analyze decisions that took place in processes. We assume that processes can be represented by process models. Notation: Activities boxes Data rounded boxes
  3. The control-flow of a process can be described with process A process model, such as a Petri net, defines the ordering and dependencies between activities We choose Petri net as notation to be independent from the actual process modelling language (such as BPMN or similar) For example: …
  4. Next to the order and dependencies between activities: decisions are at the heart of processes For example, data is recorded during the execution of activities; exclusive-choice in the process are decision points; decision rules govern which activities can be executed
  5. Decision point, exclusive choice between two activities Mutually-exclusive rule defined
  6. - DMN decision table using the Collect hit policy
  7. Public ‘Road Fines” dataset, IEEE taskforce Private hospital dataset
  8. Simplified Model of the care-path at the hospital DTO get better scores for fitness and precision compared to the DTT Lactate level are related to admission,