SlideShare a Scribd company logo
1 of 18
Download to read offline
Weiwei Cheng & Eyke Hüllermeier
  Knowledge Engineering & Bioinformatics Lab
Department of Mathematics and Computer Science
        University of Marburg, Germany
Multilabel Classification

           sky
 cloud



   tree




                            1/16
What is Multilabel Classification?

    Conventional classification
        Instances are associated with a single label from a
         set of finite labels
               if          binary classification;
               if          multi-class classification.

    Multilabel classification
        Instances are associated with a set of labels    .




                                                               2/16
Existing Methods
   Quite a number of methods for multilabel classification have been
    proposed, most of them being model-based approaches (training a
    global model for prediction).

   Our work is especially motivated by MLKNN:
    Zhang & Zhou. ML-kNN: A lazy learning approach to multi-label learning.
    Pattern Recognition, 2007, 40(7): 2038-2048.
    In a number of practical problems, MLKNN shows very strong
    performance and even outperforms RankSVM and AdaBoost.MH.

   Still, many methods ignore the correlation between labels.
    A paper with label CS is more likely having label Math, than Law.

                                                                              3/16
Our Contributions

 A new multilabel learning method,


 which is based on a formalization of instance-based
  classification as logistic regression (combination of
  model-based and instance-based learning),

 takes the correlation between labels into account and
  represents it in an easily interpretable way.


                                                          4/16
IBL as Logistic Regression

     Key idea:
     Consider the labels of neighbors as “extra features” of an instance

                age   weight   height    sex     w.child
                26      62      1.83     male      no        1
nearest          16     45      1.65    female     no        0
neighbors
                28      85      1.90     male      yes       1
                                          ...       ...
test instance   27      50      1.63     male      yes       ?



                                                 Does he like basketball?
                                                                            5/16
IBL as Logistic Regression
 Extended representation:


        age     weight      height    sex     w.child   #
         26       62         1.83    male       no        1/3       1
         16       45         1.65    female     no          0      0
         28       85         1.90    male       yes      2/3        1
                                       ...       ...
         27       50         1.63    male       yes      2/3        ?




                                              Extra feature: 2 among 3
                                              neighbors like basketball
                                                                          6/16
IBL as Logistic Regression (binary case)
Consider query instance      , distance                          ,
posterior probability                              :




For example, we can define                                       .


Now consider the whole neighborhood of                 :



                   bias term (prior probability)           evidence for positive class   7/16
IBL as Logistic Regression (binary case)


From distance to similarity




The standard KNN classifier is recovered as a special case:
        • Set       , and

        •


                                                              8/16
IBL as Logistic Regression

             Same idea for multilabel case:
             Consider the labels of neighbors as “extra features” of an instance

             age     weight   height     sex    w.child
              26       62       1.83    male      no        1        0        1
  NN          16       45       1.65   female     no        0        1        0
              28       85       1.90    male      yes       1        0        1
                                          ...      ...
test inst.    27       50       1.63    male      yes       ?        ?        ?



                                                Does he like basketball?
                                                                                   9/16
IBL as Logistic Regression
     Extended representation:


age     weight height       sex    w.child    #       #       #
26        62       1.83    male      no        1/3        0       1      1   0     1
16        45      1.65    female     no           0       1    1/3       0   1     0
28        85      1.90     male      yes       2/3        0       1      1   0     0
                            ...      ...
27        50       1.63    male      yes       2/3     1/3     1/3       ?   ?     ?




                                           Extra feature: 1 among 3
                                           neighbors like table tennis
                                                                                 10/16
IBL as Logistic Regression (multilabel case)
 We solve one logistic regression problem for each label!


 Example:




                     To what extent does the presence of
                     label basektball in the neighborhood
                     increase the probability that football is
                     relevant for the query?

                                                                 11/16
IBL as Logistic Regression (multilabel case)

 Multilabel prediction rule




 Ranking rule




                                               12/16
Experiments
   dataset          domain           #inst.      #attr.   #labels     card.
    emotions          music              593          72         6      1,87
    image             vision           2000          135         5      1,24
    genbase          biology             662    1186(n)         27       1,25
    mediamill       multimedia         5000         120        101      4,27
    reuters            text             7119        243          7      1,24
    scene             vision           2407         294          6      1,07
    yeast            biology            2417         103        14      4,24

   Tested methods:
     MLKNN

     Binary relevance learning (BR) with logistic regression, C4.5 and KNN

     Label powerset (LP) with C4.5

     Our method: IBLR-ML


                                                                                13/16
Evaluation metrics
 Hamming loss


 one error


 coverage


 rank loss


 average precision


                      14/16
critical distance




                    Nemenyi test with p=0.05
                                          15/16
Contributions of Our Work
   Novel approach to IBL, applicable to classification in general and
    multilabel classification in particular.

   Key idea: Consider label information in the neighborhood of a
    query as “extra features” of that query.

   Balance between global and local inference automatically optimized
    via fitting a logistic regression function.

   Interdependencies between labels estimated by regression
    coefficients.

   Extension: Logistic regression combining “normal features” with
    “extra features”.

                                                                         16/16
IBLR-ML is available in the MULAN Java library,
maintained by the
Machine Learning & Knowledge Discovery Group,
University of Thessaloniki.

http://mlkd.csd.auth.gr/multilabel.html

More Related Content

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Recently uploaded (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Featured

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 

Featured (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Combining Instance-Based Learning and Logistic Regression for Multilabel Classi cation

  • 1. Weiwei Cheng & Eyke Hüllermeier Knowledge Engineering & Bioinformatics Lab Department of Mathematics and Computer Science University of Marburg, Germany
  • 2. Multilabel Classification sky cloud tree 1/16
  • 3. What is Multilabel Classification?  Conventional classification  Instances are associated with a single label from a set of finite labels  if binary classification;  if multi-class classification.  Multilabel classification  Instances are associated with a set of labels . 2/16
  • 4. Existing Methods  Quite a number of methods for multilabel classification have been proposed, most of them being model-based approaches (training a global model for prediction).  Our work is especially motivated by MLKNN: Zhang & Zhou. ML-kNN: A lazy learning approach to multi-label learning. Pattern Recognition, 2007, 40(7): 2038-2048. In a number of practical problems, MLKNN shows very strong performance and even outperforms RankSVM and AdaBoost.MH.  Still, many methods ignore the correlation between labels. A paper with label CS is more likely having label Math, than Law. 3/16
  • 5. Our Contributions  A new multilabel learning method,  which is based on a formalization of instance-based classification as logistic regression (combination of model-based and instance-based learning),  takes the correlation between labels into account and represents it in an easily interpretable way. 4/16
  • 6. IBL as Logistic Regression Key idea: Consider the labels of neighbors as “extra features” of an instance age weight height sex w.child 26 62 1.83 male no 1 nearest 16 45 1.65 female no 0 neighbors 28 85 1.90 male yes 1 ... ... test instance 27 50 1.63 male yes ? Does he like basketball? 5/16
  • 7. IBL as Logistic Regression Extended representation: age weight height sex w.child # 26 62 1.83 male no 1/3 1 16 45 1.65 female no 0 0 28 85 1.90 male yes 2/3 1 ... ... 27 50 1.63 male yes 2/3 ? Extra feature: 2 among 3 neighbors like basketball 6/16
  • 8. IBL as Logistic Regression (binary case) Consider query instance , distance , posterior probability : For example, we can define . Now consider the whole neighborhood of : bias term (prior probability) evidence for positive class 7/16
  • 9. IBL as Logistic Regression (binary case) From distance to similarity The standard KNN classifier is recovered as a special case: • Set , and • 8/16
  • 10. IBL as Logistic Regression Same idea for multilabel case: Consider the labels of neighbors as “extra features” of an instance age weight height sex w.child 26 62 1.83 male no 1 0 1 NN 16 45 1.65 female no 0 1 0 28 85 1.90 male yes 1 0 1 ... ... test inst. 27 50 1.63 male yes ? ? ? Does he like basketball? 9/16
  • 11. IBL as Logistic Regression Extended representation: age weight height sex w.child # # # 26 62 1.83 male no 1/3 0 1 1 0 1 16 45 1.65 female no 0 1 1/3 0 1 0 28 85 1.90 male yes 2/3 0 1 1 0 0 ... ... 27 50 1.63 male yes 2/3 1/3 1/3 ? ? ? Extra feature: 1 among 3 neighbors like table tennis 10/16
  • 12. IBL as Logistic Regression (multilabel case) We solve one logistic regression problem for each label! Example: To what extent does the presence of label basektball in the neighborhood increase the probability that football is relevant for the query? 11/16
  • 13. IBL as Logistic Regression (multilabel case) Multilabel prediction rule Ranking rule 12/16
  • 14. Experiments  dataset domain #inst. #attr. #labels card. emotions music 593 72 6 1,87 image vision 2000 135 5 1,24 genbase biology 662 1186(n) 27 1,25 mediamill multimedia 5000 120 101 4,27 reuters text 7119 243 7 1,24 scene vision 2407 294 6 1,07 yeast biology 2417 103 14 4,24  Tested methods:  MLKNN  Binary relevance learning (BR) with logistic regression, C4.5 and KNN  Label powerset (LP) with C4.5  Our method: IBLR-ML 13/16
  • 15. Evaluation metrics  Hamming loss  one error  coverage  rank loss  average precision 14/16
  • 16. critical distance Nemenyi test with p=0.05 15/16
  • 17. Contributions of Our Work  Novel approach to IBL, applicable to classification in general and multilabel classification in particular.  Key idea: Consider label information in the neighborhood of a query as “extra features” of that query.  Balance between global and local inference automatically optimized via fitting a logistic regression function.  Interdependencies between labels estimated by regression coefficients.  Extension: Logistic regression combining “normal features” with “extra features”. 16/16
  • 18. IBLR-ML is available in the MULAN Java library, maintained by the Machine Learning & Knowledge Discovery Group, University of Thessaloniki. http://mlkd.csd.auth.gr/multilabel.html