SlideShare a Scribd company logo
1 of 22
Download to read offline
Towards Human-centric AutoML via Logic and
Argumentation
DATAPLAT 2022
Joseph Giovanelli, Giuseppe Pisano
University of Bologna - Alma Mater Studiorum
The role of Machine Learning in Data Platforms
1
Finding a solution in Machine Learning tasks
Instantiate a ML pipeline encompasses that:
• at each step, a technique must be selected;
• for each technique, a set of hyper-parameters must be set;
• each hyper-parameter has its own search space.
2
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
CRISP-DM: Cross Industry Standard Process for Data Mining
CRISP-DM enables the exploration
• domain-related;
• transformation-related;
• cross-cutting (e.g., ethical, legal).
of ML Constraints:
• data-related;
• algorithm-related;
3
AutoML
AutoML aims at automating the ML pipeline instantiation:
• it is difficult to consider all the constraints together;
• it is not transparent;
• it doesn’t allow a proper knowledge augmentation. 4
HAMLET: Human-centric AutoML via Logic and Argumentation
HAMLET leverages :
• Logic to give a structure to the knowledge;
• Argumentation to deal with inconsistencies,
and revise the results.
5
HAMLET - The LogicalKB
The LogicalKB provides to:
• the Data Scientist to structure the ML constraints ;
• the AutoML tool to encode the explored results;
6
HAMLET - The Problem Graph
The Problem Graph allows to:
• consider all the ML constraints together;
• sets up the AutoML search space;
• discuss and argument about the results.
7
HAMLET - Usage example
Tranformations:
• Discretisation (D);
• Normalisation (N).
ML algorithm:
• Decision Tree (DT ).
We have 5 possible ML pipelines:
1. DT
2. D → DT
3. N → DT
4. D →N → DT
5. N →D → DT
8
HAMLET - Usage example
Tranformations:
• Discretisation (D);
• Normalisation (N).
ML algorithm:
• Decision Tree (DT ).
ML constraints:
• algorithm-related: “require D when applying DT ”;
The algorithm-related constraints discard the 1st and 3rd ML pipelines:
1. DT 7
2. D → DT
3. N → DT 7
4. D →N → DT
5. N →D → DT
8
HAMLET - Usage example
Tranformations:
• Discretisation (D);
• Normalisation (N).
ML algorithm:
• Decision Tree (DT ).
ML constraints:
• algorithm-related: “require D when applying DT ”;
• transformation-related: “no N in pipelines with D”.
The transformation-related constraints discard the 4th and 5th ML pipelines:
1. DT 7
2. D → DT
3. N → DT 7
4. D →N → DT 7
5. N →D → DT 7
8
HAMLET - Usage example
Tranformations:
• Discretisation (D);
• Normalisation (N).
ML algorithm:
• Decision Tree (DT ).
ML constraints:
• algorithm-related: “require D when applying DT ”;
• transformation-related: “no N in pipelines with D”.
The only valid ML pipeline is the 2nd:
1. DT 7
2. D → DT 3
3. N → DT 7
4. D →N → DT 7
5. N →D → DT 7
8
HAMLET - Usage example1
Tranformations:
• Discretisation (D);
• Normalisation (N).
ML algorithm:
• Decision Tree (DT ).
ML constraints:
• algorithm-related: “require D when applying DT ”;
• transformation-related: “no N in pipelines with D”.
LogicalKB: Problem Graph:
1https://queueinc.github.io/HAMLET-DATAPLAT2022/ 9
Conclusions and future works
Contributions:
• structure the ML constraints and the AutoML solutions in a LogicalKB;
• parse the structured LogicalKB into a human- and machine-readable
medium called Problem Graph;
• leverage the Problem Graph to set up an AutoML search space;
• leverage the Problem Graph to allow both the DS and an AutoML tool to
revise the current knowledge.
Future works
• sound formalisation;
• implementation;
• test efficacy and benefits on real-case problems.
10
Thanks for the attention.
10

More Related Content

Similar to Towards Human-centric AutoML via Logic and Argumentation

(Faiz) MachineLearning(ppt).pptx
(Faiz) MachineLearning(ppt).pptx(Faiz) MachineLearning(ppt).pptx
(Faiz) MachineLearning(ppt).pptxFaiz430036
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniquesPoonam Kshirsagar
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptPerumalPitchandi
 
DutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveDutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveBigML, Inc
 
Overcoming the limits of Machine Learning in business
Overcoming the limits of Machine Learning in businessOvercoming the limits of Machine Learning in business
Overcoming the limits of Machine Learning in businessAhmed Fattah
 
DMDW Lesson 05 + 06 + 07 - Data Mining Applied
DMDW Lesson 05 + 06 + 07 - Data Mining AppliedDMDW Lesson 05 + 06 + 07 - Data Mining Applied
DMDW Lesson 05 + 06 + 07 - Data Mining AppliedJohannes Hoppe
 
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...PyData
 
Some "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontSome "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontGreg Landrum
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Ali Alkan
 
Kaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML InterpretabilityKaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML InterpretabilityAlberto Danese
 
Fms invited talk_2018 v5
Fms invited talk_2018 v5Fms invited talk_2018 v5
Fms invited talk_2018 v5Nisha Talagala
 
The Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkThe Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkIvo Andreev
 
Drifting Away: Testing ML Models in Production
Drifting Away: Testing ML Models in ProductionDrifting Away: Testing ML Models in Production
Drifting Away: Testing ML Models in ProductionDatabricks
 
Machine Learning: Past, Present and Future - by Tom Dietterich
Machine Learning: Past, Present and Future - by Tom DietterichMachine Learning: Past, Present and Future - by Tom Dietterich
Machine Learning: Past, Present and Future - by Tom DietterichBigML, Inc
 
Pathways Overview For Open House 19 Sep2010
Pathways Overview For Open House   19 Sep2010Pathways Overview For Open House   19 Sep2010
Pathways Overview For Open House 19 Sep2010jmorriso
 
BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6Rod Soto
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONIJDKP
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONIJDKP
 

Similar to Towards Human-centric AutoML via Logic and Argumentation (20)

(Faiz) MachineLearning(ppt).pptx
(Faiz) MachineLearning(ppt).pptx(Faiz) MachineLearning(ppt).pptx
(Faiz) MachineLearning(ppt).pptx
 
Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniques
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.ppt
 
DutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical PerspectiveDutchMLSchool. ML: A Technical Perspective
DutchMLSchool. ML: A Technical Perspective
 
Overcoming the limits of Machine Learning in business
Overcoming the limits of Machine Learning in businessOvercoming the limits of Machine Learning in business
Overcoming the limits of Machine Learning in business
 
DMDW Lesson 05 + 06 + 07 - Data Mining Applied
DMDW Lesson 05 + 06 + 07 - Data Mining AppliedDMDW Lesson 05 + 06 + 07 - Data Mining Applied
DMDW Lesson 05 + 06 + 07 - Data Mining Applied
 
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...
Driving Moore's Law with Python-Powered Machine Learning: An Insider's Perspe...
 
C2_W1---.pdf
C2_W1---.pdfC2_W1---.pdf
C2_W1---.pdf
 
Some "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data frontSome "challenges" on the open-source/open-data front
Some "challenges" on the open-source/open-data front
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
 
Kaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML InterpretabilityKaggle Days Paris - Alberto Danese - ML Interpretability
Kaggle Days Paris - Alberto Danese - ML Interpretability
 
Fms invited talk_2018 v5
Fms invited talk_2018 v5Fms invited talk_2018 v5
Fms invited talk_2018 v5
 
The Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it WorkThe Power of Auto ML and How Does it Work
The Power of Auto ML and How Does it Work
 
Drifting Away: Testing ML Models in Production
Drifting Away: Testing ML Models in ProductionDrifting Away: Testing ML Models in Production
Drifting Away: Testing ML Models in Production
 
Emerging Technologies in IT
Emerging Technologies in ITEmerging Technologies in IT
Emerging Technologies in IT
 
Machine Learning: Past, Present and Future - by Tom Dietterich
Machine Learning: Past, Present and Future - by Tom DietterichMachine Learning: Past, Present and Future - by Tom Dietterich
Machine Learning: Past, Present and Future - by Tom Dietterich
 
Pathways Overview For Open House 19 Sep2010
Pathways Overview For Open House   19 Sep2010Pathways Overview For Open House   19 Sep2010
Pathways Overview For Open House 19 Sep2010
 
BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6BsidesLVPresso2016_JZeditsv6
BsidesLVPresso2016_JZeditsv6
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
 

Recently uploaded

ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...Nguyen Thanh Tu Collection
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSean M. Fox
 
How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17Celine George
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital ManagementMBA Assignment Experts
 
diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....Ritu480198
 
The Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDFThe Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDFVivekanand Anglo Vedic Academy
 
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptxAnalyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptxLimon Prince
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽中 央社
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Mohamed Rizk Khodair
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code ExamplesPeter Brusilovsky
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/siemaillard
 
How to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryHow to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryCeline George
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024Borja Sotomayor
 
An overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismAn overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismDabee Kamal
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxheathfieldcps1
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45MysoreMuleSoftMeetup
 
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...Krashi Coaching
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppCeline George
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...Nguyen Thanh Tu Collection
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxneillewis46
 

Recently uploaded (20)

ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH FORM 50 CÂU TRẮC NGHI...
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
 
How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management
 
diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....
 
The Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDFThe Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDF
 
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptxAnalyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
Analyzing and resolving a communication crisis in Dhaka textiles LTD.pptx
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
 
Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).Dementia (Alzheimer & vasular dementia).
Dementia (Alzheimer & vasular dementia).
 
SPLICE Working Group: Reusable Code Examples
SPLICE Working Group:Reusable Code ExamplesSPLICE Working Group:Reusable Code Examples
SPLICE Working Group: Reusable Code Examples
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/
 
How to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 InventoryHow to Manage Closest Location in Odoo 17 Inventory
How to Manage Closest Location in Odoo 17 Inventory
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 
An overview of the various scriptures in Hinduism
An overview of the various scriptures in HinduismAn overview of the various scriptures in Hinduism
An overview of the various scriptures in Hinduism
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
 
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
MSc Ag Genetics & Plant Breeding: Insights from Previous Year JNKVV Entrance ...
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio App
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 

Towards Human-centric AutoML via Logic and Argumentation

  • 1. Towards Human-centric AutoML via Logic and Argumentation DATAPLAT 2022 Joseph Giovanelli, Giuseppe Pisano University of Bologna - Alma Mater Studiorum
  • 2. The role of Machine Learning in Data Platforms 1
  • 3. Finding a solution in Machine Learning tasks Instantiate a ML pipeline encompasses that: • at each step, a technique must be selected; • for each technique, a set of hyper-parameters must be set; • each hyper-parameter has its own search space. 2
  • 4. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 5. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 6. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 7. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 8. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 9. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 10. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 11. CRISP-DM: Cross Industry Standard Process for Data Mining CRISP-DM enables the exploration • domain-related; • transformation-related; • cross-cutting (e.g., ethical, legal). of ML Constraints: • data-related; • algorithm-related; 3
  • 12. AutoML AutoML aims at automating the ML pipeline instantiation: • it is difficult to consider all the constraints together; • it is not transparent; • it doesn’t allow a proper knowledge augmentation. 4
  • 13. HAMLET: Human-centric AutoML via Logic and Argumentation HAMLET leverages : • Logic to give a structure to the knowledge; • Argumentation to deal with inconsistencies, and revise the results. 5
  • 14. HAMLET - The LogicalKB The LogicalKB provides to: • the Data Scientist to structure the ML constraints ; • the AutoML tool to encode the explored results; 6
  • 15. HAMLET - The Problem Graph The Problem Graph allows to: • consider all the ML constraints together; • sets up the AutoML search space; • discuss and argument about the results. 7
  • 16. HAMLET - Usage example Tranformations: • Discretisation (D); • Normalisation (N). ML algorithm: • Decision Tree (DT ). We have 5 possible ML pipelines: 1. DT 2. D → DT 3. N → DT 4. D →N → DT 5. N →D → DT 8
  • 17. HAMLET - Usage example Tranformations: • Discretisation (D); • Normalisation (N). ML algorithm: • Decision Tree (DT ). ML constraints: • algorithm-related: “require D when applying DT ”; The algorithm-related constraints discard the 1st and 3rd ML pipelines: 1. DT 7 2. D → DT 3. N → DT 7 4. D →N → DT 5. N →D → DT 8
  • 18. HAMLET - Usage example Tranformations: • Discretisation (D); • Normalisation (N). ML algorithm: • Decision Tree (DT ). ML constraints: • algorithm-related: “require D when applying DT ”; • transformation-related: “no N in pipelines with D”. The transformation-related constraints discard the 4th and 5th ML pipelines: 1. DT 7 2. D → DT 3. N → DT 7 4. D →N → DT 7 5. N →D → DT 7 8
  • 19. HAMLET - Usage example Tranformations: • Discretisation (D); • Normalisation (N). ML algorithm: • Decision Tree (DT ). ML constraints: • algorithm-related: “require D when applying DT ”; • transformation-related: “no N in pipelines with D”. The only valid ML pipeline is the 2nd: 1. DT 7 2. D → DT 3 3. N → DT 7 4. D →N → DT 7 5. N →D → DT 7 8
  • 20. HAMLET - Usage example1 Tranformations: • Discretisation (D); • Normalisation (N). ML algorithm: • Decision Tree (DT ). ML constraints: • algorithm-related: “require D when applying DT ”; • transformation-related: “no N in pipelines with D”. LogicalKB: Problem Graph: 1https://queueinc.github.io/HAMLET-DATAPLAT2022/ 9
  • 21. Conclusions and future works Contributions: • structure the ML constraints and the AutoML solutions in a LogicalKB; • parse the structured LogicalKB into a human- and machine-readable medium called Problem Graph; • leverage the Problem Graph to set up an AutoML search space; • leverage the Problem Graph to allow both the DS and an AutoML tool to revise the current knowledge. Future works • sound formalisation; • implementation; • test efficacy and benefits on real-case problems. 10
  • 22. Thanks for the attention. 10