SlideShare a Scribd company logo
1 of 4
Download to read offline
United International University
​​Assignment-01
​​Title: Classification by Clustering.
Course : Pattern Recognition Lab
Course Code : CSI 416
Submitted To :
Dr.Dewan Md.Farid
Associate Professor,CSE
United International University.
Submitted By :
Nazmul Hyder
Id : 011 131 085
Section : SB
Date : 06.08.2017
Dataset 1: Mushroom
Data set
characteristics:
Multivariat
e
Number of
instances :
8124 Area : Life
Attribute
characteristics:
Categorical Number of
attribute :
22 Data
denoted:
1987-09-01
Associate Task: Classificat
ion
Missing values: Yes Number of
web hits :
261721
Using Decision Tree (j4.8) :
Accuracy : 66.9621%
Error : 33.0379%
After simple k means clustering :
Number of cluster:2
Using DT(j4.8):
Accuracy : 99.4953%
Error : 0.5047%
Number of cluster:4
Using DT(j4.8):
Accuracy : 99.8523%
Error : 0.1477%
Number of cluster:6
Using DT(j4.8):
Accuracy : 99.483%
Error : 0.517%
Observation : Accuracy increased after the clustering compared to the initial accuracy .
Dataset 2: Wine-Quality-Red
Data set
characteristics:
Multivariat
e
Number of
instances :
1599 Area : Business
Attribute
characteristics:
Real Number of
attribute :
12 Data
denoted:
2009-10-07
Associate Task: Classificat
ion,
Regression
Missing values: No Number of
web hits :
475478
Using Decision Tree (j4.8) :
Accuracy : 90.9944%
Error : 9.0056%
After simple k means clustering :
Number of cluster:2
Using DT(j4.8):
Accuracy : 99.1245%
Error : 0.8755%
Number of cluster:4
Using DT(j4.8):
Accuracy : 97.8737%
Error : 2.1263%
Number of cluster:6
Using DT(j4.8):
Accuracy : 97.8737%
Error : 2.1263%
Observation : Accuracy increased after the clustering compared to the initial accuracy .
Dataset 3 : ZOO
Data set
characteristics:
Multivariat
e
Number of
instances :
101 Area : Life
Attribute
characteristics:
Categorical
, Integer
Number of
attribute :
17 Data
denoted:
1990-05-15
Associate Task: Classificat
ion
Missing values: No Number of
web hits :
172527
Using Decision Tree (j4.8) :
Accuracy : 99.0196%
Error : 0.9804%
After simple k means clustering :
Number of cluster : 2
Using DT(j4.8):
Accuracy : 100%
Error : 0%
Number of cluster : 4
Using DT(j4.8):
Accuracy : 97.0588%
Error : 2.9412%
Number of cluster : 6
Using DT(j4.8):
Accuracy : 96.0784%
Error :3.9216%
Observation : Accuracy increased after the clustering compared to the initial accuracy .
Dataset 3 : flags
Data set
characteristics:
Multivariate Number of
instances :
194 Area : N/A
Attribute
characteristics:
Categorical,
Integer
Number of
attribute :
30 Data
denoted:
1990-05-15
Associate Task: Classificati
on
Missing values: No Number of
web hits :
159411
Using Decision Tree (j4.8) :
Accuracy : 81.9588%
Error : 18.0412%
After simple k means clustering :
Number of cluster : 2
Using DT(j4.8):
Accuracy : 92.7835%
Error : 7.2165%
Number of cluster : 4
Using DT(j4.8):
Accuracy :88.6598%
Error :11.3402%
Number of cluster : 6
Using DT(j4.8):
Accuracy : 86.5979%
Error :13.4021%
Observation : Accuracy increased after the clustering compared to the initial accuracy .

More Related Content

Similar to Classification by clustering

Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
Zabir Al Nazi Nabil
 
Week 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdfWeek 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdf
meghana092
 
Week_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdfWeek_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdf
PrabhaK22
 
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
Priyanka Aash
 

Similar to Classification by clustering (20)

Data analysis in artificial intelligence
Data analysis in artificial intelligenceData analysis in artificial intelligence
Data analysis in artificial intelligence
 
Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL ...
Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL ...Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL ...
Deep Learning based Frameworks for Handling Imbalance in DGA, Email, and URL ...
 
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
NSL KDD Cup 99 dataset Anomaly Detection using Machine Learning Technique
 
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning ...
 
Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2Bioinfo ngs data format visualization v2
Bioinfo ngs data format visualization v2
 
BlueHat v18 || Protecting the protector, hardening machine learning defenses ...
BlueHat v18 || Protecting the protector, hardening machine learning defenses ...BlueHat v18 || Protecting the protector, hardening machine learning defenses ...
BlueHat v18 || Protecting the protector, hardening machine learning defenses ...
 
Finding suitable classifier for pattern recognition
Finding suitable classifier for pattern recognitionFinding suitable classifier for pattern recognition
Finding suitable classifier for pattern recognition
 
How we've made a global search engine for genetic data
How we've made a global search engine for genetic dataHow we've made a global search engine for genetic data
How we've made a global search engine for genetic data
 
Towards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD DatasetsTowards Automatic Classification of LOD Datasets
Towards Automatic Classification of LOD Datasets
 
Injection Attack detection using ML for
Injection Attack detection using ML  forInjection Attack detection using ML  for
Injection Attack detection using ML for
 
Nighthawk: A Two-Level Genetic-Random Unit Test Data Generator
Nighthawk: A Two-Level Genetic-Random Unit Test Data GeneratorNighthawk: A Two-Level Genetic-Random Unit Test Data Generator
Nighthawk: A Two-Level Genetic-Random Unit Test Data Generator
 
Understand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analyticsUnderstand students’ self-reflections through learning analytics
Understand students’ self-reflections through learning analytics
 
Protocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkProtocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural Network
 
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
Deep Multi-task Learning with Label Correlation Constraint for Video Concept ...
 
Toxic Comment Classification using Neural Network and Machine Learning
Toxic Comment Classification using Neural Network and Machine LearningToxic Comment Classification using Neural Network and Machine Learning
Toxic Comment Classification using Neural Network and Machine Learning
 
Week 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdfWeek 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdf
 
Week_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdfWeek_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdf
 
Using field-based DNA sequencing to accelerate phylogenomics
Using field-based DNA sequencing to accelerate phylogenomicsUsing field-based DNA sequencing to accelerate phylogenomics
Using field-based DNA sequencing to accelerate phylogenomics
 
Intrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross ValidationIntrusion Detection System for Classification of Attacks with Cross Validation
Intrusion Detection System for Classification of Attacks with Cross Validation
 
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
Protecting the Protector, Hardening Machine Learning Defenses Against Adversa...
 

More from Nazmul Hyder

More from Nazmul Hyder (9)

Analysis of Tree in Computer Based Application
Analysis of Tree in Computer Based ApplicationAnalysis of Tree in Computer Based Application
Analysis of Tree in Computer Based Application
 
Language Translator ( Compiler)
Language Translator ( Compiler)Language Translator ( Compiler)
Language Translator ( Compiler)
 
Linux Shell Scripts and Shell Commands✌️
Linux Shell Scripts and Shell Commands✌️Linux Shell Scripts and Shell Commands✌️
Linux Shell Scripts and Shell Commands✌️
 
Huffman coding
Huffman coding Huffman coding
Huffman coding
 
Dataset Analysis using weka tools (pattern recognition)
Dataset Analysis using weka tools (pattern recognition)Dataset Analysis using weka tools (pattern recognition)
Dataset Analysis using weka tools (pattern recognition)
 
ODOO documentation(e-commerce +accounting+purchase+inventory+invoice+HR+ POS)
ODOO documentation(e-commerce +accounting+purchase+inventory+invoice+HR+ POS)ODOO documentation(e-commerce +accounting+purchase+inventory+invoice+HR+ POS)
ODOO documentation(e-commerce +accounting+purchase+inventory+invoice+HR+ POS)
 
E-commerce (System Analysis and Design)
E-commerce (System Analysis and Design)E-commerce (System Analysis and Design)
E-commerce (System Analysis and Design)
 
Benchmark analysis (Online Shopping System)
Benchmark analysis (Online Shopping System)Benchmark analysis (Online Shopping System)
Benchmark analysis (Online Shopping System)
 
Online medicine store (using ODOO)
Online medicine store (using ODOO)Online medicine store (using ODOO)
Online medicine store (using ODOO)
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Recently uploaded (20)

Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 

Classification by clustering

  • 1. United International University ​​Assignment-01 ​​Title: Classification by Clustering. Course : Pattern Recognition Lab Course Code : CSI 416 Submitted To : Dr.Dewan Md.Farid Associate Professor,CSE United International University. Submitted By : Nazmul Hyder Id : 011 131 085 Section : SB Date : 06.08.2017
  • 2. Dataset 1: Mushroom Data set characteristics: Multivariat e Number of instances : 8124 Area : Life Attribute characteristics: Categorical Number of attribute : 22 Data denoted: 1987-09-01 Associate Task: Classificat ion Missing values: Yes Number of web hits : 261721 Using Decision Tree (j4.8) : Accuracy : 66.9621% Error : 33.0379% After simple k means clustering : Number of cluster:2 Using DT(j4.8): Accuracy : 99.4953% Error : 0.5047% Number of cluster:4 Using DT(j4.8): Accuracy : 99.8523% Error : 0.1477% Number of cluster:6 Using DT(j4.8): Accuracy : 99.483% Error : 0.517% Observation : Accuracy increased after the clustering compared to the initial accuracy . Dataset 2: Wine-Quality-Red Data set characteristics: Multivariat e Number of instances : 1599 Area : Business Attribute characteristics: Real Number of attribute : 12 Data denoted: 2009-10-07 Associate Task: Classificat ion, Regression Missing values: No Number of web hits : 475478 Using Decision Tree (j4.8) : Accuracy : 90.9944% Error : 9.0056%
  • 3. After simple k means clustering : Number of cluster:2 Using DT(j4.8): Accuracy : 99.1245% Error : 0.8755% Number of cluster:4 Using DT(j4.8): Accuracy : 97.8737% Error : 2.1263% Number of cluster:6 Using DT(j4.8): Accuracy : 97.8737% Error : 2.1263% Observation : Accuracy increased after the clustering compared to the initial accuracy . Dataset 3 : ZOO Data set characteristics: Multivariat e Number of instances : 101 Area : Life Attribute characteristics: Categorical , Integer Number of attribute : 17 Data denoted: 1990-05-15 Associate Task: Classificat ion Missing values: No Number of web hits : 172527 Using Decision Tree (j4.8) : Accuracy : 99.0196% Error : 0.9804% After simple k means clustering : Number of cluster : 2 Using DT(j4.8): Accuracy : 100% Error : 0% Number of cluster : 4 Using DT(j4.8): Accuracy : 97.0588% Error : 2.9412% Number of cluster : 6 Using DT(j4.8): Accuracy : 96.0784% Error :3.9216% Observation : Accuracy increased after the clustering compared to the initial accuracy .
  • 4. Dataset 3 : flags Data set characteristics: Multivariate Number of instances : 194 Area : N/A Attribute characteristics: Categorical, Integer Number of attribute : 30 Data denoted: 1990-05-15 Associate Task: Classificati on Missing values: No Number of web hits : 159411 Using Decision Tree (j4.8) : Accuracy : 81.9588% Error : 18.0412% After simple k means clustering : Number of cluster : 2 Using DT(j4.8): Accuracy : 92.7835% Error : 7.2165% Number of cluster : 4 Using DT(j4.8): Accuracy :88.6598% Error :11.3402% Number of cluster : 6 Using DT(j4.8): Accuracy : 86.5979% Error :13.4021% Observation : Accuracy increased after the clustering compared to the initial accuracy .