SlideShare a Scribd company logo
CS 678 - Ensembles and Bayes 1
Semi-Supervised Learning
 Can we improve the quality of our learning by combining
labeled and unlabeled data
 Usually a lot more unlabeled data available than labeled
 Assume a set L of labeled data and U of unlabeled data
(from the same distribution)
 Focus on Semi-Supervised Classification though there are
many other variations
– Aiding clustering with some labeled data
– Regression
– Model selection with unlabeled data (COD)
 Transduction vs Induction
How Semi-Supervised Works
 Most approaches make strong model assumptions
(guesses). If wrong can make things worse.
 Some commonly used assumptions:
– Clusters of data are from the same class
– Data can be represented as a mixture of parameterized distributions
– Decision boundaries should go through non-dense areas of the data
– Model should be as simple as possible (Occam)
CS 678 - Ensembles and Bayes 2
Unsupervised Learning of Domain
Features
 PCA, SVD
 NLDR – Non-Linear Dimensionality Reduction
 Many Deep Learning Models
– Deep Belief Nets
– Sparse Auto-encoders
– Self-Taught Learning
CS 678 - Ensembles and Bayes 3
Deep Net with Greedy Layer Wise Training
Adobe – Deep Learning and Active Learning 4
ML Model
New Feature Space
Original Inputs
Supervised
Learning
Unsupervised
Learning
Self-Training (Bootstrap)
 Self-Training
– Train supervised model on labeled data L
– Test on unlabeled data U
– Add the most confidently classified members of U to L
– Repeat
 Multi-Model
– Uses multiple models to label/move instances of U to L
– Co-Training
 Train two models with different independent features sets
 Add most confident instances from U of one model into L of the other (i.e.
they “teach” each other)
 Repeat
– Multi-View Learning
 Train multiple diverse models on L. Those instances in U which most
models agree on are placed in L.
CS 678 - Ensembles and Bayes 5
Generative Models
 Generative – Assume data can be represented by some
mixture of parameterized models (e.g. Gaussian) and use
EM to learn parameters (ala Baum-Welch)
CS 678 - Ensembles and Bayes 6
Graph Models
 Graph Models
– Neighbor nodes assumed to be similar with larger edge weights
– Force same class member in L to be close, while maintaining
smoothness with respect to the graph for U.
– Add in members of U as neighbors based on some similarity
– Iteratively label U (breadth first)
CS 678 - Ensembles and Bayes 7
TSVM
 Transductive SVM (TSVM) or Semi-Supervised SVM
(S3VM)
 Maximize margin of both L and U. Decision surface
placed in non-dense spaces
– Assumes classes are "well-separated"
– Can also try to simultaneously maintain class proportion on both
sides similar to labeled proportion
CS 678 - Ensembles and Bayes 8
Summary
 Oracle Learning
 Becoming a more critical area as more unlabeled data
becomes cheaply available
CS 678 - Ensembles and Bayes 9
Active Learning
 Obtaining labeled data can be the most expensive part of a
machine learning task
 Supervised, Unsupervised, and Semi-Supervised Learning
 In Active Learning can query an oracle (e.g. a human
expert, test, etc.) to obtain the label for a specific input
 In active learning we try to learn the most accurate model
while having to query the least amount of data for labels
Adobe – Deep Learning and Active Learning 10
Active Learning
Adobe – Deep Learning and Active Learning 11
Often query:
1) A low confidence instance (i.e. near a decision boundary)
2) An instance which is in a relatively dense neighborhood
Active Learning
Adobe – Deep Learning and Active Learning 12
Often query:
1) A low confidence instance (i.e. near a decision boundary)
2) An instance which is in a relatively dense neighborhood
Active Learning
Adobe – Deep Learning and Active Learning 13
Often query:
1) A low confidence instance (i.e. near a decision boundary)
2) An instance which is in a relatively dense neighborhood
Active Learning
Adobe – Deep Learning and Active Learning 14
Often query:
1) A low confidence instance (i.e. near a decision boundary)
2) An instance which is in a relatively dense neighborhood
Active Clustering
Images (Objects, Words, etc.)
 First do unsupervised clustering
 Which points to show an expert in order to get feedback on
the clustering to allow adjustment?
Adobe – Deep Learning and Active Learning 15
Active Clustering
Images (Objects, Words, etc.)
 First do unsupervised clustering
 Which points to show an expert in order to get feedback on
the clustering to allow adjustment?
Adobe – Deep Learning and Active Learning 16
Active Clustering
Images (Objects, Words, etc.)
 First do unsupervised clustering
 Which points to show an expert in order to get feedback on
the clustering to allow adjustment?
Adobe – Deep Learning and Active Learning 17

More Related Content

Similar to Semi-Supervised.pptx

Machine Learning
Machine LearningMachine Learning
Machine Learningbutest
 
ML crash course
ML crash courseML crash course
ML crash course
mikaelhuss
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
Prior On Model Space
Prior On Model SpacePrior On Model Space
Prior On Model Space
Meir Maor
 
learning.ppt
learning.pptlearning.ppt
learning.ppt
BadWork
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...butest
 
Automated Software Requirements Labeling
Automated Software Requirements LabelingAutomated Software Requirements Labeling
Automated Software Requirements Labeling
Data Works MD
 
Kdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar DasKdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar Das
Mr.Sameer Kumar Das
 
NoSQL Basics - A Quick Tour
NoSQL Basics - A Quick TourNoSQL Basics - A Quick Tour
NoSQL Basics - A Quick Tour
Bikram Sinha. MBA, PMP
 
17- Kernels and Clustering.pptx
17- Kernels and Clustering.pptx17- Kernels and Clustering.pptx
17- Kernels and Clustering.pptx
ssuser2023c6
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.butest
 
LECTURE8.PPT
LECTURE8.PPTLECTURE8.PPT
LECTURE8.PPTbutest
 
Kaggle presentation
Kaggle presentationKaggle presentation
Kaggle presentation
HJ van Veen
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
yang947066
 
Warehousing
WarehousingWarehousing
Introduction to Machine Learning Aristotelis Tsirigos
Introduction to Machine Learning Aristotelis Tsirigos Introduction to Machine Learning Aristotelis Tsirigos
Introduction to Machine Learning Aristotelis Tsirigos butest
 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learningbutest
 
November, 2006 CCKM'06 1
November, 2006 CCKM'06 1 November, 2006 CCKM'06 1
November, 2006 CCKM'06 1 butest
 

Similar to Semi-Supervised.pptx (20)

Machine Learning
Machine LearningMachine Learning
Machine Learning
 
ML crash course
ML crash courseML crash course
ML crash course
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Prior On Model Space
Prior On Model SpacePrior On Model Space
Prior On Model Space
 
learning.ppt
learning.pptlearning.ppt
learning.ppt
 
Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...Ensemble Learning Featuring the Netflix Prize Competition and ...
Ensemble Learning Featuring the Netflix Prize Competition and ...
 
Automated Software Requirements Labeling
Automated Software Requirements LabelingAutomated Software Requirements Labeling
Automated Software Requirements Labeling
 
Kdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar DasKdd by Mr.Sameer Kumar Das
Kdd by Mr.Sameer Kumar Das
 
Presentation
PresentationPresentation
Presentation
 
NoSQL Basics - A Quick Tour
NoSQL Basics - A Quick TourNoSQL Basics - A Quick Tour
NoSQL Basics - A Quick Tour
 
17- Kernels and Clustering.pptx
17- Kernels and Clustering.pptx17- Kernels and Clustering.pptx
17- Kernels and Clustering.pptx
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.
 
LECTURE8.PPT
LECTURE8.PPTLECTURE8.PPT
LECTURE8.PPT
 
Kaggle presentation
Kaggle presentationKaggle presentation
Kaggle presentation
 
Overfitting and-tbl
Overfitting and-tblOverfitting and-tbl
Overfitting and-tbl
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
 
Warehousing
WarehousingWarehousing
Warehousing
 
Introduction to Machine Learning Aristotelis Tsirigos
Introduction to Machine Learning Aristotelis Tsirigos Introduction to Machine Learning Aristotelis Tsirigos
Introduction to Machine Learning Aristotelis Tsirigos
 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learning
 
November, 2006 CCKM'06 1
November, 2006 CCKM'06 1 November, 2006 CCKM'06 1
November, 2006 CCKM'06 1
 

More from Tamer Nadeem

PresentationDNN.pptx
PresentationDNN.pptxPresentationDNN.pptx
PresentationDNN.pptx
Tamer Nadeem
 
Chapter 1.ppt
Chapter 1.pptChapter 1.ppt
Chapter 1.ppt
Tamer Nadeem
 
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
Tamer Nadeem
 
qos-f05.pdf
qos-f05.pdfqos-f05.pdf
qos-f05.pdf
Tamer Nadeem
 
lecture1.pdf
lecture1.pdflecture1.pdf
lecture1.pdf
Tamer Nadeem
 
cs229-probability_review_slides.pdf
cs229-probability_review_slides.pdfcs229-probability_review_slides.pdf
cs229-probability_review_slides.pdf
Tamer Nadeem
 
N00014 21-s-f003
N00014 21-s-f003N00014 21-s-f003
N00014 21-s-f003
Tamer Nadeem
 
Ci carplay-cic
Ci carplay-cicCi carplay-cic
Ci carplay-cic
Tamer Nadeem
 

More from Tamer Nadeem (8)

PresentationDNN.pptx
PresentationDNN.pptxPresentationDNN.pptx
PresentationDNN.pptx
 
Chapter 1.ppt
Chapter 1.pptChapter 1.ppt
Chapter 1.ppt
 
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
4_1_indoorLocalization_1_fingerprint_deadreckoning.pptx
 
qos-f05.pdf
qos-f05.pdfqos-f05.pdf
qos-f05.pdf
 
lecture1.pdf
lecture1.pdflecture1.pdf
lecture1.pdf
 
cs229-probability_review_slides.pdf
cs229-probability_review_slides.pdfcs229-probability_review_slides.pdf
cs229-probability_review_slides.pdf
 
N00014 21-s-f003
N00014 21-s-f003N00014 21-s-f003
N00014 21-s-f003
 
Ci carplay-cic
Ci carplay-cicCi carplay-cic
Ci carplay-cic
 

Recently uploaded

Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
ArianaBusciglio
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
chanes7
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 

Recently uploaded (20)

Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Group Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana BuscigliopptxGroup Presentation 2 Economics.Ariana Buscigliopptx
Group Presentation 2 Economics.Ariana Buscigliopptx
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 

Semi-Supervised.pptx

  • 1. CS 678 - Ensembles and Bayes 1 Semi-Supervised Learning  Can we improve the quality of our learning by combining labeled and unlabeled data  Usually a lot more unlabeled data available than labeled  Assume a set L of labeled data and U of unlabeled data (from the same distribution)  Focus on Semi-Supervised Classification though there are many other variations – Aiding clustering with some labeled data – Regression – Model selection with unlabeled data (COD)  Transduction vs Induction
  • 2. How Semi-Supervised Works  Most approaches make strong model assumptions (guesses). If wrong can make things worse.  Some commonly used assumptions: – Clusters of data are from the same class – Data can be represented as a mixture of parameterized distributions – Decision boundaries should go through non-dense areas of the data – Model should be as simple as possible (Occam) CS 678 - Ensembles and Bayes 2
  • 3. Unsupervised Learning of Domain Features  PCA, SVD  NLDR – Non-Linear Dimensionality Reduction  Many Deep Learning Models – Deep Belief Nets – Sparse Auto-encoders – Self-Taught Learning CS 678 - Ensembles and Bayes 3
  • 4. Deep Net with Greedy Layer Wise Training Adobe – Deep Learning and Active Learning 4 ML Model New Feature Space Original Inputs Supervised Learning Unsupervised Learning
  • 5. Self-Training (Bootstrap)  Self-Training – Train supervised model on labeled data L – Test on unlabeled data U – Add the most confidently classified members of U to L – Repeat  Multi-Model – Uses multiple models to label/move instances of U to L – Co-Training  Train two models with different independent features sets  Add most confident instances from U of one model into L of the other (i.e. they “teach” each other)  Repeat – Multi-View Learning  Train multiple diverse models on L. Those instances in U which most models agree on are placed in L. CS 678 - Ensembles and Bayes 5
  • 6. Generative Models  Generative – Assume data can be represented by some mixture of parameterized models (e.g. Gaussian) and use EM to learn parameters (ala Baum-Welch) CS 678 - Ensembles and Bayes 6
  • 7. Graph Models  Graph Models – Neighbor nodes assumed to be similar with larger edge weights – Force same class member in L to be close, while maintaining smoothness with respect to the graph for U. – Add in members of U as neighbors based on some similarity – Iteratively label U (breadth first) CS 678 - Ensembles and Bayes 7
  • 8. TSVM  Transductive SVM (TSVM) or Semi-Supervised SVM (S3VM)  Maximize margin of both L and U. Decision surface placed in non-dense spaces – Assumes classes are "well-separated" – Can also try to simultaneously maintain class proportion on both sides similar to labeled proportion CS 678 - Ensembles and Bayes 8
  • 9. Summary  Oracle Learning  Becoming a more critical area as more unlabeled data becomes cheaply available CS 678 - Ensembles and Bayes 9
  • 10. Active Learning  Obtaining labeled data can be the most expensive part of a machine learning task  Supervised, Unsupervised, and Semi-Supervised Learning  In Active Learning can query an oracle (e.g. a human expert, test, etc.) to obtain the label for a specific input  In active learning we try to learn the most accurate model while having to query the least amount of data for labels Adobe – Deep Learning and Active Learning 10
  • 11. Active Learning Adobe – Deep Learning and Active Learning 11 Often query: 1) A low confidence instance (i.e. near a decision boundary) 2) An instance which is in a relatively dense neighborhood
  • 12. Active Learning Adobe – Deep Learning and Active Learning 12 Often query: 1) A low confidence instance (i.e. near a decision boundary) 2) An instance which is in a relatively dense neighborhood
  • 13. Active Learning Adobe – Deep Learning and Active Learning 13 Often query: 1) A low confidence instance (i.e. near a decision boundary) 2) An instance which is in a relatively dense neighborhood
  • 14. Active Learning Adobe – Deep Learning and Active Learning 14 Often query: 1) A low confidence instance (i.e. near a decision boundary) 2) An instance which is in a relatively dense neighborhood
  • 15. Active Clustering Images (Objects, Words, etc.)  First do unsupervised clustering  Which points to show an expert in order to get feedback on the clustering to allow adjustment? Adobe – Deep Learning and Active Learning 15
  • 16. Active Clustering Images (Objects, Words, etc.)  First do unsupervised clustering  Which points to show an expert in order to get feedback on the clustering to allow adjustment? Adobe – Deep Learning and Active Learning 16
  • 17. Active Clustering Images (Objects, Words, etc.)  First do unsupervised clustering  Which points to show an expert in order to get feedback on the clustering to allow adjustment? Adobe – Deep Learning and Active Learning 17