UNSUPERVISED LEARNING
Supervised and Unsupervised Learning
ID3 and version space search are supervised learning algorithms
Unsupervised learning eliminates the teacher and requires
that the learner form concepts (categories) on its own
Conceptual clustering is the problem of discovering useful
categories in unclassified data (data whose categories are not
pre-determined)
CONCEPTUAL CLUSTERING
Unsupervised Learning and Numeric taxonomy
The clustering problem begins with a collection of
unclassified objects and a means for measuring the similarity
of objects
The goal is to organize the objects into classes so that similar
objects are in one class
Numeric taxonomy is one of the oldest approaches to the
clustering problem
In numeric taxonomy, objects are represented as a
collection of features, each of which has a
numeric value
An object is thus a vector of n feature values and can be
considered as a point in n-dimensional space
The similarity of any two objects can be measured by the
Euclidean distance between them in this space: the smaller
the distance, the more similar the objects
Using this similarity metric, clustering algorithms build
clusters in a bottom-up fashion (agglomerative clustering
strategy)
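As a small illustration of the distance measure described above, the following sketch computes the Euclidean distance between two feature vectors. The function name `euclidean_distance` and the sample points are illustrative assumptions, not part of the original slides.

```python
import math


def euclidean_distance(a, b):
    """Distance between two equal-length numeric feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Two hypothetical objects, each described by n = 2 numeric features.
p = (1.0, 2.0)
q = (4.0, 6.0)
print(euclidean_distance(p, q))  # 5.0
```

A smaller value means the two objects are more similar under this metric.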
The categories are formed by the following approach:
1. Examine all pairs of objects, select the pair with the
highest degree of similarity, and make that pair a cluster
2. Define the features of the cluster as some function, such
as the average, of the features of the component members
and then replace the component objects with this cluster
definition
3. Repeat this process on the collection of objects until all
objects have been reduced to a single cluster
The result of this algorithm is a binary tree whose leaf nodes
are instances and whose internal nodes are clusters of
increasing size
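The three steps above can be sketched roughly as follows. This is a minimal illustration, not a reference implementation: it assumes Euclidean distance as the similarity measure, uses the unweighted average of the two merged feature vectors as the cluster definition, and represents the tree as nested tuples of leaf indices. All names are hypothetical.

```python
import math
from itertools import combinations


def euclidean_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average(a, b):
    """Cluster features as the plain average of the two components (an assumption)."""
    return tuple((x + y) / 2 for x, y in zip(a, b))


def agglomerate(points):
    """Merge the closest pair repeatedly; return a binary tree of leaf indices."""
    if not points:
        raise ValueError("need at least one object")
    # Each entry pairs a feature vector with the subtree it summarizes.
    clusters = [(tuple(p), index) for index, p in enumerate(points)]
    while len(clusters) > 1:
        # Step 1: find the most similar (closest) pair.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: euclidean_distance(clusters[ij[0]][0], clusters[ij[1]][0]),
        )
        (features_a, tree_a), (features_b, tree_b) = clusters[i], clusters[j]
        # Step 2: replace the pair with a single cluster definition.
        merged = (average(features_a, features_b), (tree_a, tree_b))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    # Step 3 ends when a single cluster remains.
    return clusters[0][1]


tree = agglomerate([(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (6.0, 5.0)])
print(tree)  # ((0, 1), (2, 3))
```

In the printed tree, integers are the original objects (leaves) and each nested pair is an internal node, so the two nearby pairs are merged first and joined last.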
We may extend this algorithm to objects represented as sets
of symbolic features. The only difficulty lies in measuring
the similarity of objects
One possible similarity metric is the proportion of features
that two objects have in common
object 1 = {small, red, rubber, ball}
object 2 = {small, blue, rubber, ball}
object 3 = {large, black, wooden, ball}
similarity (object 1, object 2) = ¾
similarity (object 1, object 3) = ¼
similarity (object 2, object 3) = ¼
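The figures above can be reproduced with a short sketch. It assumes each object is a set with the same number of features, and the function name `similarity` is chosen here only for illustration.

```python
from fractions import Fraction


def similarity(a, b):
    """Proportion of features two equally sized feature sets have in common."""
    if not a or len(a) != len(b):
        raise ValueError("objects must be non-empty and have the same number of features")
    return Fraction(len(a & b), len(a))


object1 = {"small", "red", "rubber", "ball"}
object2 = {"small", "blue", "rubber", "ball"}
object3 = {"large", "black", "wooden", "ball"}

print(similarity(object1, object2))  # 3/4
print(similarity(object1, object3))  # 1/4
print(similarity(object2, object3))  # 1/4
```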
In defining categories, we cannot give all features equal
weight
In any given context, certain of an object’s features are more
important than others; simple similarity metrics treat all
features equally
The feature weights should be set according to the goals of the
categorization
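One way to picture feature weighting is the sketch below. It is an assumption-laden illustration: objects are written as attribute-to-value dictionaries, the attribute names (`size`, `color`, `material`, `shape`) and the weight values are invented for the example, and the similarity is the weighted share of matching attributes.

```python
def weighted_similarity(a, b, weights):
    """Weighted proportion of attributes on which two objects agree."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    shared = sum(
        weight
        for attr, weight in weights.items()
        if attr in a and attr in b and a[attr] == b[attr]
    )
    return shared / total


object1 = {"size": "small", "color": "red", "material": "rubber", "shape": "ball"}
object2 = {"size": "small", "color": "blue", "material": "rubber", "shape": "ball"}

equal_weights = {"size": 1, "color": 1, "material": 1, "shape": 1}
color_matters = {"size": 1, "color": 5, "material": 1, "shape": 1}

print(weighted_similarity(object1, object2, equal_weights))  # 0.75
print(weighted_similarity(object1, object2, color_matters))  # 0.375
```

With equal weights the result matches the simple proportion metric; when color is weighted heavily, the same two objects look much less alike.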
CLUSTER/2
CLUSTER/2 forms k categories by constructing individuals
around k seed objects
The parameter k is user-adjustable
CLUSTER/2 evaluates the resulting clusters, selecting new
seeds and repeating the process until its quality criteria are
met
The algorithm
• Select k seeds from the set of observed objects. This may
be done randomly or according to some selection function
• For each seed, using that seed as a positive instance and
all other seeds as negative instances, produce a
maximally general definition that tries to cover all of the
non-seed instances, until stopped by the negative
instances (other seeds)
• Classify all objects in the sample according to these
descriptions. Note that this may lead to multiple
classifications of the other, non-seed objects
• Replace each maximally general description with a
maximally specific description that covers all objects in
the category. This decreases the likelihood that classes
overlap on unseen objects
• Classes may still overlap on given objects.
• Using a distance metric, select an element closest to the
center of each class. Using these central elements as new
seeds, repeat the above steps.
• Stop when clusters are satisfactory. A typical quality
metric is the complexity of the general descriptions of
classes. For instance, we might prefer clusters that yield
syntactically simple definitions, such as those with a small
number of conjuncts
• If clusters are unsatisfactory and no improvement occurs
over several iterations, select the new seeds closest to the
edge of the cluster, rather than those at the center. This
favors creation of totally new clusters
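Two of the steps above, building a maximally specific description and choosing a central element as the next seed, can be sketched in simplified form. This is not CLUSTER/2 itself: the sketch assumes objects are attribute-to-value dictionaries, that a description is a conjunction of allowed value sets per attribute, and that distance is the number of differing attributes. All names and sample data are hypothetical.

```python
def most_specific_description(objects):
    """For each attribute, allow only the values actually seen in the class."""
    description = {}
    for obj in objects:
        for attr, value in obj.items():
            description.setdefault(attr, set()).add(value)
    return description


def covers(description, obj):
    """True if the object satisfies every conjunct of the description."""
    return all(obj.get(attr) in values for attr, values in description.items())


def distance(a, b):
    """Number of attributes on which two objects differ (an assumed metric)."""
    return sum(1 for attr in a if a[attr] != b.get(attr))


def central_element(objects):
    """Object with the smallest total distance to the members of its class."""
    return min(objects, key=lambda o: sum(distance(o, other) for other in objects))


category = [
    {"size": "small", "color": "red", "shape": "ball"},
    {"size": "small", "color": "blue", "shape": "ball"},
    {"size": "small", "color": "red", "shape": "cube"},
]

description = most_specific_description(category)
print(covers(description, {"size": "large", "color": "red", "shape": "ball"}))  # False
print(central_element(category) == category[0])  # True
```

Under these assumptions, the number of conjuncts in a description (here, `len(description)`) could serve as a rough stand-in for the syntactic simplicity criterion mentioned in the stopping step.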
Assignment
Read Sections 9.6.1 and 9.6.2 of Luger

Clustering in artificial intelligence

  • 1. UNSUPERVISED LEARNING: Supervised and Unsupervised Learning. ID3 and version space are supervised learning algorithms. Unsupervised learning eliminates the teacher and requires that learners form concepts (categories) on their own. Conceptual clustering is the problem of discovering useful categories in unclassified data (data whose categories are not predetermined).
  • 2. CONCEPTUAL CLUSTERING: Unsupervised Learning and Numeric Taxonomy. The clustering problem begins with a collection of unclassified objects and a means for measuring the similarity of objects. The goal is to organize the objects into classes so that similar objects are in one class. Numeric taxonomy is one of the oldest approaches to the clustering problem.
  • 3. CONCEPTUAL CLUSTERING: In numeric taxonomy, objects are represented as a collection of features, and each feature has a numeric value. An object is thus a vector of n feature values and can be considered a point in n-dimensional space. The similarity of any two objects can be measured by the Euclidean distance between them in this space. Using this similarity metric, clustering algorithms build clusters in a bottom-up fashion (the agglomerative clustering strategy).
  • 4. CONCEPTUAL CLUSTERING: The categories are formed by the following approach. (1) Examine all pairs of objects, select the pair with the highest degree of similarity, and make that pair a cluster. (2) Define the features of the cluster as some function, such as the average, of the features of the component members, and then replace the component objects with this cluster definition. (3) Repeat this process on the collection of objects until all objects have been reduced to a single cluster. The result of this algorithm is a binary tree whose leaf nodes are instances and whose internal nodes are clusters of increasing size.
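
The three steps on slide 4 can be sketched in a few lines of Python. This is a minimal, unoptimized illustration and not code from the slides: the function names `euclidean` and `agglomerate`, the choice of a plain average of the two merged vectors, and the tree encoding (tuples for leaves, two-element lists for clusters) are all assumptions made for the example.

```python
import math
from itertools import combinations

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def agglomerate(points):
    """Bottom-up clustering; returns a binary tree.

    Leaves are the original points (tuples); internal nodes are
    two-element lists [left, right]. Assumes at least one point.
    """
    # Each entry pairs a feature vector with the subtree it stands for.
    clusters = [(tuple(p), tuple(p)) for p in points]
    while len(clusters) > 1:
        # Step 1: find the most similar pair (smallest distance).
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: euclidean(clusters[ij[0]][0], clusters[ij[1]][0]),
        )
        (fa, ta), (fb, tb) = clusters[i], clusters[j]
        # Step 2: describe the new cluster by the average of its two parts
        # and replace the parts with it.
        merged = (tuple((x + y) / 2 for x, y in zip(fa, fb)), [ta, tb])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
        # Step 3: the loop repeats until a single cluster remains.
    return clusters[0][1]
```

For example, `agglomerate([(0, 0), (0, 1), (10, 10), (10, 11)])` first pairs the two points near the origin, then the two points near (10, 10), and finally joins the two pairs at the root. Scanning all pairs on every pass is quadratic per merge, which is fine for a sketch but slow for large collections.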
  • 6. CONCEPTUAL CLUSTERING: We may extend this algorithm to objects represented as sets of symbolic features. The only problem is in measuring the similarity of objects. A similarity metric can be the proportion of features that any two objects have in common. object 1 = {small, red, rubber, ball}; object 2 = {small, blue, rubber, ball}; object 3 = {large, black, wooden, ball}. similarity(object 1, object 2) = ¾; similarity(object 1, object 3) = ¼; similarity(object 2, object 3) = ¼.
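
The similarity values on slide 6 can be reproduced with a short Python sketch. It assumes each object is stored as a tuple of four attribute values in a fixed order (size, color, material, shape); that ordering and the function name `similarity` are choices made for this example, not part of the slides.

```python
from fractions import Fraction

def similarity(a, b):
    """Proportion of attribute positions on which two objects agree."""
    shared = sum(1 for x, y in zip(a, b) if x == y)
    return Fraction(shared, len(a))

# Objects from the slide, as (size, color, material, shape).
object1 = ("small", "red", "rubber", "ball")
object2 = ("small", "blue", "rubber", "ball")
object3 = ("large", "black", "wooden", "ball")

print(similarity(object1, object2))  # 3/4
print(similarity(object1, object3))  # 1/4
print(similarity(object2, object3))  # 1/4
```

`Fraction` keeps the results exact so they match the slide's ¾ and ¼ directly.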
  • 7. CONCEPTUAL CLUSTERING: In defining categories we cannot give all features equal weight. In any given context, certain of an object's features are more important than others; simple similarity metrics treat all features equally. The feature weights are to be set according to the goals of the categorization.
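
One way to let the goals of the categorization influence similarity is to weight each feature. The sketch below is only an illustration of that idea: the slides do not specify a weighting scheme, so the function `weighted_similarity`, the dictionary representation, and the example weights are all hypothetical.

```python
from fractions import Fraction

def weighted_similarity(a, b, weights):
    """Share of the total feature weight carried by features that match."""
    total = sum(weights.values())
    shared = sum(w for feature, w in weights.items() if a[feature] == b[feature])
    return Fraction(shared, total)

ball1 = {"size": "small", "color": "red", "material": "rubber", "shape": "ball"}
ball2 = {"size": "small", "color": "blue", "material": "rubber", "shape": "ball"}

# Hypothetical goal: sorting by color, so color counts three times as much.
color_weights = {"size": 1, "color": 3, "material": 1, "shape": 1}
equal_weights = {"size": 1, "color": 1, "material": 1, "shape": 1}

print(weighted_similarity(ball1, ball2, equal_weights))  # 3/4
print(weighted_similarity(ball1, ball2, color_weights))  # 1/2
```

With equal weights the measure reduces to the simple proportion of shared features; raising the weight of color makes the two balls look less alike because color is the one feature on which they differ.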
  • 8. CLUSTER/2: CLUSTER/2 forms k categories by constructing individuals around k seed objects. The parameter k is user adjustable. CLUSTER/2 evaluates the resulting clusters, selecting new seeds and repeating the process until its quality criteria are met.
  • 9. CLUSTER/2: The algorithm. Select k seeds from the set of observed objects. This may be done randomly or according to some selection function. For each seed, using that seed as a positive instance and all other seeds as negative instances, produce a maximally general definition that tries to cover all of the non-seed instances, until stopped by the negative instances (the other seeds).
  • 10. CLUSTER/2: The algorithm. Classify all objects in the sample according to these descriptions. Note that this may lead to multiple classifications of other, non-seed, objects.
  • 11. CLUSTER/2: The algorithm. Replace each maximally general description with a maximally specific description that covers all objects in the category. This decreases the likelihood that classes overlap on unseen objects.
  • 12. CLUSTER/2: The algorithm. Classes may still overlap on given objects. Using a distance metric, select an element closest to the center of each class. Using these central elements as new seeds, repeat the above steps.
  • 13. CLUSTER/2: The algorithm. Stop when clusters are satisfactory. A typical quality metric is the complexity of the general descriptions of classes. For instance, we might prefer clusters that yield syntactically simple definitions, such as those with a small number of conjuncts.
  • 14. CLUSTER/2: The algorithm. If clusters are unsatisfactory and no improvement occurs over several iterations, select the new seeds closest to the edge of the cluster, rather than those at the center. This favors creation of totally new clusters.
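
The seed-and-refine loop of slides 9 to 12 can be illustrated with a heavily simplified Python sketch. It is not the CLUSTER/2 implementation, and every name in it is hypothetical. The simplifying assumptions are: objects are dictionaries of symbolic attributes; the first k objects serve as initial seeds; a "general" description just forbids one distinguishing value per rival seed; overlaps are resolved by assigning an object to the nearest covering seed; and the loop stops when the seeds no longer change. The quality criterion of slide 13 and the edge-seed fallback of slide 14 are left out.

```python
def mismatches(a, b):
    """Distance metric: number of attributes on which two objects differ."""
    return sum(a[f] != b[f] for f in a)

def general_description(seed, other_seeds):
    """Broad description of a seed, stored as forbidden (attribute, value) pairs.

    For each rival seed, forbid its value on the first attribute where it
    differs from this seed, so the description excludes every rival.
    """
    forbidden = set()
    for rival in other_seeds:
        for f in seed:
            if rival[f] != seed[f]:
                forbidden.add((f, rival[f]))
                break
    return forbidden

def covers(forbidden, obj):
    """True if the object has none of the forbidden attribute values."""
    return all((f, v) not in forbidden for f, v in obj.items())

def specific_description(members):
    """Most specific description: per attribute, the values seen in the class."""
    return {f: {m[f] for m in members} for f in members[0]}

def central(members):
    """Member with the smallest total distance to the rest of its class."""
    return min(members, key=lambda m: sum(mismatches(m, o) for o in members))

def cluster2_sketch(objects, k, max_rounds=10):
    """Simplified seed-based clustering; assumes len(objects) >= k >= 1."""
    seeds = objects[:k]  # assumption: take the first k objects as seeds
    classes = []
    for _ in range(max_rounds):
        descs = [general_description(s, [t for t in seeds if t is not s])
                 for s in seeds]
        classes = [[] for _ in seeds]
        for obj in objects:
            # Several descriptions may cover an object; pick the nearest seed.
            # If no description covers it, fall back to all seeds.
            covering = [i for i, d in enumerate(descs) if covers(d, obj)]
            candidates = covering or range(len(seeds))
            best = min(candidates, key=lambda i: mismatches(obj, seeds[i]))
            classes[best].append(obj)
        new_seeds = [central(c) if c else s for c, s in zip(classes, seeds)]
        if all(n is s for n, s in zip(new_seeds, seeds)):
            break  # seeds are stable
        seeds = new_seeds
    return [specific_description(c) for c in classes if c]
```

Given two small rubber balls and two large wooden balls, with one of each kind as the initial seeds, the sketch returns two descriptions: one with size {small} and material {rubber}, the other with size {large} and material {wooden}, each listing the colors seen in its class.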