SlideShare a Scribd company logo
1 of 12
Dataset Analysis
Presented By
Nazmul Hyder
ID : 011 131 085
Section : SB
Contents
❑ Dataset Name
❑ Classifiers
❑ Dataset Description
❑ Dataset Analysis
❑ Graphical representation.
❑ References
Datasets Name
❏ Mushroom.
❏ Wine-Quality.
❏ Flags.
❏ ZOO.
Classifiers
❏kNN
❏NBC
❏Decision Tree (J48)
❏oneR
❏Random Forest
Dataset Description
Dataset name No of
instances
No of
attributes
Attribute
type
Class
value
Data
denoted
Donor
Mushroom 8124 22 nominal 2 1987 Jeff Schlimmer
Wine-Quality 1599 12 numeric 6
(nominal)
2009 Paulo Cortez,
Antonio Cerdeira,
Fernando Almeida
Flags 194 30 nominal 194
(nominal)
1990 Richard S. Forsyth
ZOO 101 17 nominal 8
(nominal)
1990 Richard S. Forsyth
Dataset Analysis:
Mushroom-Cross validation(10 folds)
Classifier Accuracy Error Rate Recall Precision F-score
kNN (k=3%) 59.6135% 40.3865% 0.596 0.576 0.583
NBC 64.5126% 35.4874% 0.645 0.769 0.665
j4.8 61.9645% 38.0355% 0.620 0.629 0.623
oneR 57.9025% 42.0975% 0.579 0.411 0.469
Random Forest 47.3043% 52.6957% 0.473 0.476 0.474
Dataset Analysis (con.)
Wine-Quality-Cross validation(10 folds)
Classifier Accuracy Error Rate Recall Precision F-score
kNN (k=3%) 57.7236% 42.2764% 0.577 0.542 0.553
NBC 55.0344% 44.9656% 0.550 0.554 0.550
j4.8 61.4759% 38.5241% 0.615 0.612 0.613
oneR 54.6592% 45.3408% 0.547 0.496 0.511
Random Forest 70.1063% 29.8337% 0.701 0.679 0.684
Flags - Cross validation(10 folds)
Classifier Accuracy Error Rate Recall Precision F-score
kNN (k=3%) 59.2789% 40.7216% 0.593 0.553 0.550
NBC 55.1546% 44.8454% 0.552 0.571 0.542
j4.8 59.2784% 40.7216% 0.593 0.570 0.576
oneR 4.6392% 95.3608% 0.046 0.002 0.004
Random Forest 61.3402% 38.6598% 0.613 0.545 0.572
Dataset Analysis (con.)
ZOO - Cross validation(10 folds)
Classifier Accuracy Error Rate Recall Precision F-score
kNN (k=3%) 94.1176% 5.8824% 0.941 0.935 0.931
NBC 95.098% 4.902% 0.951 0.953 0.950
j4.8 92.1569% 7.8431% 0.922 0.916 0.915
oneR 2.9412% 97.0588% 0.029 0.039 0.026
Random Forest 92.1569% 7.8431% 0.922 0.874 0.896
Dataset Analysis (con.)
Classifier result comparison :
References :
Quick Links :
Mushroom:https://archive.ics.uci.edu/ml/datasets/mushroom
Wine Quality:https://archive.ics.uci.edu/ml/datasets/wine+quality
Flags : https://archive.ics.uci.edu/ml/datasets/Flags
ZOO: http://archive.ics.uci.edu/ml/datasets/Zoo
URL : http://archive.ics.uci.edu/ml/datasets.html
Thank You

More Related Content

Recently uploaded

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Recently uploaded (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Dataset Analysis using weka tools (pattern recognition)

  • 1. Dataset Analysis Presented By Nazmul Hyder ID : 011 131 085 Section : SB
  • 2. Contents ❑ Dataset Name ❑ Classifiers ❑ Dataset Description ❑ Dataset Analysis ❑ Graphical representation. ❑ References
  • 3. Datasets Name ❏ Mushroom. ❏ Wine-Quality. ❏ Flags. ❏ ZOO.
  • 5. Dataset Description Dataset name No of instances No of attributes Attribute type Class value Data denoted Donor Mushroom 8124 22 nominal 2 1987 Jeff Schlimmer Wine-Quality 1599 12 numeric 6 (nominal) 2009 Paulo Cortez, Antonio Cerdeira, Fernando Almeida Flags 194 30 nominal 194 (nominal) 1990 Richard S. Forsyth ZOO 101 17 nominal 8 (nominal) 1990 Richard S. Forsyth
  • 6. Dataset Analysis: Mushroom-Cross validation(10 folds) Classifier Accuracy Error Rate Recall Precision F-score kNN (k=3%) 59.6135% 40.3865% 0.596 0.576 0.583 NBC 64.5126% 35.4874% 0.645 0.769 0.665 j4.8 61.9645% 38.0355% 0.620 0.629 0.623 oneR 57.9025% 42.0975% 0.579 0.411 0.469 Random Forest 47.3043% 52.6957% 0.473 0.476 0.474
  • 7. Dataset Analysis (con.) Wine-Quality-Cross validation(10 folds) Classifier Accuracy Error Rate Recall Precision F-score kNN (k=3%) 57.7236% 42.2764% 0.577 0.542 0.553 NBC 55.0344% 44.9656% 0.550 0.554 0.550 j4.8 61.4759% 38.5241% 0.615 0.612 0.613 oneR 54.6592% 45.3408% 0.547 0.496 0.511 Random Forest 70.1063% 29.8337% 0.701 0.679 0.684
  • 8. Flags - Cross validation(10 folds) Classifier Accuracy Error Rate Recall Precision F-score kNN (k=3%) 59.2789% 40.7216% 0.593 0.553 0.550 NBC 55.1546% 44.8454% 0.552 0.571 0.542 j4.8 59.2784% 40.7216% 0.593 0.570 0.576 oneR 4.6392% 95.3608% 0.046 0.002 0.004 Random Forest 61.3402% 38.6598% 0.613 0.545 0.572 Dataset Analysis (con.)
  • 9. ZOO - Cross validation(10 folds) Classifier Accuracy Error Rate Recall Precision F-score kNN (k=3%) 94.1176% 5.8824% 0.941 0.935 0.931 NBC 95.098% 4.902% 0.951 0.953 0.950 j4.8 92.1569% 7.8431% 0.922 0.916 0.915 oneR 2.9412% 97.0588% 0.029 0.039 0.026 Random Forest 92.1569% 7.8431% 0.922 0.874 0.896 Dataset Analysis (con.)
  • 11. References : Quick Links : Mushroom:https://archive.ics.uci.edu/ml/datasets/mushroom Wine Quality:https://archive.ics.uci.edu/ml/datasets/wine+quality Flags : https://archive.ics.uci.edu/ml/datasets/Flags ZOO: http://archive.ics.uci.edu/ml/datasets/Zoo URL : http://archive.ics.uci.edu/ml/datasets.html