Business Rule Learning with Interactive Selection of Association Rules - RuleML 2014 challenge

426 views

Published on

Presentation of a RuleML2014 Challenge paper

Published in: Software
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
426
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
8
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Business Rule Learning with Interactive Selection of Association Rules - RuleML 2014 challenge

  1. 1. Business Rule Learning with Interactive Selection of Association Rules Stanislav Vojíř, Přemysl Václav Duben and Tomáš Kliegr Department of Information and Knowledge Engineering University of Economics, Prague
  2. 2. Relevant paper Learning Business Rules with Association Rule Classifiers Tomáš Kliegr, Jaroslav Kuchař, Davide Sottara, Stanislav Vojíř
  3. 3. Motivation  2 possible scenarios  Automatic model creation  Data mining of rules (without support)  Rules prunning  User-managed model creation  Selection of rules gained from data mining  Manually rules input
  4. 4. Rule base preparation 1. Data preparation 2. Association rule mining, rule selection 3. Classification model testing 4. Ruleset editing 5. Classification model testing
  5. 5. Data preparation  Data set for data mining (CSV file, MySQL source)  Import configuration (encoding, separators, primary key)  (Training and testing dataset)  Preprocessing  Columns in data set => attributes for data mining  Numerical columns => intervals, bins of values  Categorical columns => bins of values
  6. 6. Data mining of association rules  GUHA procedure ASSOC  Interactive data mining task configuration using rule pattern  Attributes with fixed values, dynamic binning wildcard…  Interest measures (not only confidence, support)  Support for disjunctions, negations, brackets  Rules selection into rule clipboard  classification model testing  export of rules into knowledge base
  7. 7. Data mining of association rules
  8. 8. Data mining of association rules
  9. 9. Classification model testing  Using training dataset or testing dataset with columns with the same names  Rules in DRL form => testing using Drools Expert  Conflict resolution  Confidence  Support  First fired rule
  10. 10. Classification model testing
  11. 11. Ruleset editing  Not only selection of rules gained from data mining results  Rule editing using interactive editor  Antecedent => Rule condition  Consequent => Rule body
  12. 12. Ruleset editing
  13. 13. Software components summary  EasyMiner  Interactive data mining system  PHP, JavaScript + Joomla! based CMS (reports support)  Based on LISp-Miner system  C++, C#.NET  GUHA procedure ASSOC  EasyMinerCenter  New component for background knowledge management  PHP  Data saved in RDF form (using ARC2 Store)
  14. 14. Software components summary  Business rules editor  JavaScript  Model tester  Java EE application based on Drools Expert component
  15. 15. Future work  New data mining backend  Support for rule prunning  Work with background knowledge base
  16. 16. Demo  Example dataset  7 columns (age, salary, district, amount, payments, duration, rating)  6181 rows  Demo screencast  http://easyminer.eu/screencasts
  17. 17. Try it yourself! EasyMiner.eu  For more information, please visit the web: http://easyminer.eu  Screencasts  Demo  Technical information and papers

×