SlideShare a Scribd company logo
Presented by:-
Gabani Juli
Gamit Manish
Gayatri Rathod
Gohel Harsh
Gohil Krishna
 Waikato Environment for knowledge Analysis
 Weka is a collection of machine learning algorithms for
data mining tasks.
 The algorithms can either be applied directly to a
dataset or called from your own java code.
 Weka contains tools for data Pre-Processing,
Classification, Regression, Clustering, Association
rules and Visualization.
 It is also well-suited for developing new machine
learning schemes.
 49 data preprocessing tools
 76 classification/ regression algorithms
 8 clustering algorithms
 3 algorithms for finding association rules
 15 attribute/ subset evaluators + 10 search algorithms
for feature selection
 When you start Weka,the GUI chooser pops up and
lets you choose four ways to work with Weka and your
data.
 Simple CLI provides a simple command-line interface
and allows direct execution of Weka commands.
 Explorer is an environment for exploring data.
 Experimenter is an environment for performing
experiments and conducting statistical tests between
learning schemes.
 KnowledgeFlow is a Java-Beans-based interface for
setting up and running machine learning experiments.
 “Weka Explorer” window appears on a screen.
 Preprocessing Data
 At the very top of the window, just below the title bar
there is a row of tabs. Only the first tab ‘preprocessor’
is active at the moment because there is no dataset
open.
 The first three buttons at the top of the preprocess
section enable you to load data into Weka.
 Data can be imported from a file in various
formats:ARFF,CSV,C4.5,binry,it can also be read from
a URL or from an SQL database(using JDBC).
 The most common way of getting the data into Weka is
to store it as Attribute-Relation File Format(ARFF)file.
 To open file from URL,click on ‘Open URL…’ button,
it brings up a dialog box requesting to enter source
URL.
 Data can also be read from an SQL database using
JDBC. Click on ‘Open DB..’ button.
 Classifiers
 Classifiers in Weka are the models for predicting
nominal or numeric quantities.
 The learning schemes available in Weka include
decision trees and lists, instance-based classifiers,
support vector machines, multi-layer perceptrons,
logistic regression and bayes nets.
 “Meta” classifiers include bagging, stacking, error-
correcting output codes and locally weighted learning.
 Once you have your data set loaded , all the tabs are
available to you. Click on the ‘classify’ tab.
 Clustering Data
 Weka contains ‘clusterers’ for finding groups of
similar instances in a dataset.
 The clustering schemes available in Weka are k-means,
EM , Cobweb ,X-means , Farthest First .
 Cluster can be visualized and compared to ‘true’
clusters(if given).
 Evaluation is based on log likelihood if clustering
scheme produces a portability distribution.
 Finding Associations:
 Weka contains an implementation of the Apriori algorithm
for learning association rules.
 This is the only currently available scheme for learning
associations in Weka.
 It works only with discrete data and will identify
statistical dependencies between groups of attributes,
milk, peanut butter and bread, jelly, beer and diapers,
with confidence 40% and support 30%.
 Apriori can compute all rules that have a given
minimum support and exceed a given confidence.
 Attribute Selection:
 Attribute selection searches through all possible
combinations of attributes in the data and finds which
subset of attributes works best for prediction .
 Attribute selection methods contain two parts: a search
method such as best-first, forward selection, random,
exhaustive, genetic algorithm, ranking, and an
evaluation method such as correlation-based, wrapper ,
information gain, chi-squared. Attribute selection
mechanism is very flexible.
 Weka allows(almost) arbitrary combinations of the two
methods.
 Data Visualization:
 Weka’s visualization allows you to visualize a 2-D plot
of the current working relation.
 Visualization is very useful in practice, it helps to
determine difficulty of the learning problem.
 Weka can visualize single attributes (1-d) and pairs of
attributes (2-d), rotate 3-d visualizations.
 Weka has ‘Jitter’ option to deal with nominal attributes
and to detect “hidden” data points

More Related Content

What's hot

Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introductionbutest
 
Weka library, JAVA
Weka library, JAVAWeka library, JAVA
Weka library, JAVA
Kamthorn Puntumapon
 
weka data mining
weka data mining weka data mining
weka data mining
kalthoom almaqbali
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorialbutest
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using wekarathorenitin87
 
Weka tutorial
Weka tutorialWeka tutorial
Weka tutorial
Mohammed Aejazuddin
 
An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Weka
DataminingTools Inc
 
A simple introduction to weka
A simple introduction to wekaA simple introduction to weka
A simple introduction to weka
Pamoda Vajiramali
 
Weka presentation
Weka presentationWeka presentation
Weka presentationSaeed Iqbal
 
WEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And AttributesWEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And Attributes
DataminingTools Inc
 
WEKA: The Explorer
WEKA: The ExplorerWEKA: The Explorer
WEKA: The Explorer
DataminingTools Inc
 
WEKA: Introduction To Weka
WEKA: Introduction To WekaWEKA: Introduction To Weka
WEKA: Introduction To Weka
DataminingTools Inc
 
Classification and Clustering Analysis using Weka
Classification and Clustering Analysis using Weka Classification and Clustering Analysis using Weka
Classification and Clustering Analysis using Weka
Ishan Awadhesh
 
A random decision tree frameworkfor privacy preserving data mining
A random decision tree frameworkfor privacy preserving data miningA random decision tree frameworkfor privacy preserving data mining
A random decision tree frameworkfor privacy preserving data mining
Venkat Projects
 
1.5 weka an intoduction
1.5 weka an intoduction1.5 weka an intoduction
1.5 weka an intoduction
Krish_ver2
 
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
Ankit Pandey
 

What's hot (16)

Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introduction
 
Weka library, JAVA
Weka library, JAVAWeka library, JAVA
Weka library, JAVA
 
weka data mining
weka data mining weka data mining
weka data mining
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorial
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using weka
 
Weka tutorial
Weka tutorialWeka tutorial
Weka tutorial
 
An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Weka
 
A simple introduction to weka
A simple introduction to wekaA simple introduction to weka
A simple introduction to weka
 
Weka presentation
Weka presentationWeka presentation
Weka presentation
 
WEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And AttributesWEKA: Data Mining Input Concepts Instances And Attributes
WEKA: Data Mining Input Concepts Instances And Attributes
 
WEKA: The Explorer
WEKA: The ExplorerWEKA: The Explorer
WEKA: The Explorer
 
WEKA: Introduction To Weka
WEKA: Introduction To WekaWEKA: Introduction To Weka
WEKA: Introduction To Weka
 
Classification and Clustering Analysis using Weka
Classification and Clustering Analysis using Weka Classification and Clustering Analysis using Weka
Classification and Clustering Analysis using Weka
 
A random decision tree frameworkfor privacy preserving data mining
A random decision tree frameworkfor privacy preserving data miningA random decision tree frameworkfor privacy preserving data mining
A random decision tree frameworkfor privacy preserving data mining
 
1.5 weka an intoduction
1.5 weka an intoduction1.5 weka an intoduction
1.5 weka an intoduction
 
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
Data Mining Techniques using WEKA (Ankit Pandey-10BM60012)
 

Similar to Weka

Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introductionbutest
 
Introduction to weka
Introduction to wekaIntroduction to weka
Introduction to weka
JK Knowledge
 
Weka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data miningWeka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data mining
Keshab Kumar Gaurav
 
wekapresentation-130107115704-phpapp02.pdf
wekapresentation-130107115704-phpapp02.pdfwekapresentation-130107115704-phpapp02.pdf
wekapresentation-130107115704-phpapp02.pdf
Dr. Rajesh P Barnwal
 
Introduction to Weka and Preprocessing.ppt
Introduction to Weka and Preprocessing.pptIntroduction to Weka and Preprocessing.ppt
Introduction to Weka and Preprocessing.ppt
radhikadsu
 
Data Mining using Weka
Data Mining using WekaData Mining using Weka
Data Mining using Weka
Shashidhar Shenoy
 
Data mining weka
Data mining wekaData mining weka
Data mining weka
prashant 100702007
 
Knowledge Discovery Using Data Mining
Knowledge Discovery Using Data MiningKnowledge Discovery Using Data Mining
Knowledge Discovery Using Data Mining
parthvora18
 
Weka_Manual_Sagar
Weka_Manual_SagarWeka_Manual_Sagar
Weka_Manual_SagarSagar Kumar
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning Algorithms
Walaa Hamdy Assy
 
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
cscpconf
 
An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Weka
weka Content
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
Editor IJMTER
 
IJET-V3I2P2
IJET-V3I2P2IJET-V3I2P2
Weka term paper(siddharth 10 bm60086)
Weka term paper(siddharth 10 bm60086)Weka term paper(siddharth 10 bm60086)
Weka term paper(siddharth 10 bm60086)
Siddharth Verma
 
lab #6
lab #6lab #6
lab #6butest
 
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
International Educational Applied Scientific Research Journal (IEASRJ)
 
Caqdas
CaqdasCaqdas

Similar to Weka (20)

Weka toolkit introduction
Weka toolkit introductionWeka toolkit introduction
Weka toolkit introduction
 
Introduction to weka
Introduction to wekaIntroduction to weka
Introduction to weka
 
Weka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data miningWeka : A machine learning algorithms for data mining
Weka : A machine learning algorithms for data mining
 
wekapresentation-130107115704-phpapp02.pdf
wekapresentation-130107115704-phpapp02.pdfwekapresentation-130107115704-phpapp02.pdf
wekapresentation-130107115704-phpapp02.pdf
 
Introduction to Weka and Preprocessing.ppt
Introduction to Weka and Preprocessing.pptIntroduction to Weka and Preprocessing.ppt
Introduction to Weka and Preprocessing.ppt
 
Data Mining using Weka
Data Mining using WekaData Mining using Weka
Data Mining using Weka
 
Itb weka nikhil
Itb weka nikhilItb weka nikhil
Itb weka nikhil
 
Data mining weka
Data mining wekaData mining weka
Data mining weka
 
Knowledge Discovery Using Data Mining
Knowledge Discovery Using Data MiningKnowledge Discovery Using Data Mining
Knowledge Discovery Using Data Mining
 
Weka_Manual_Sagar
Weka_Manual_SagarWeka_Manual_Sagar
Weka_Manual_Sagar
 
Machine learning Algorithms
Machine learning AlgorithmsMachine learning Algorithms
Machine learning Algorithms
 
Spe165 t
Spe165 tSpe165 t
Spe165 t
 
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
 
An Introduction To Weka
An Introduction To WekaAn Introduction To Weka
An Introduction To Weka
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 
IJET-V3I2P2
IJET-V3I2P2IJET-V3I2P2
IJET-V3I2P2
 
Weka term paper(siddharth 10 bm60086)
Weka term paper(siddharth 10 bm60086)Weka term paper(siddharth 10 bm60086)
Weka term paper(siddharth 10 bm60086)
 
lab #6
lab #6lab #6
lab #6
 
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
 
Caqdas
CaqdasCaqdas
Caqdas
 

Recently uploaded

一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
dxobcob
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
ChristineTorrepenida1
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
zwunae
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
ssuser7dcef0
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
Mukeshwaran Balu
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
anoopmanoharan2
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
Rahul
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
Divyam548318
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
awadeshbabu
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
NidhalKahouli2
 
AIR POLLUTION lecture EnE203 updated.pdf
AIR POLLUTION lecture EnE203 updated.pdfAIR POLLUTION lecture EnE203 updated.pdf
AIR POLLUTION lecture EnE203 updated.pdf
RicletoEspinosa1
 
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptxTOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
nikitacareer3
 
Online aptitude test management system project report.pdf
Online aptitude test management system project report.pdfOnline aptitude test management system project report.pdf
Online aptitude test management system project report.pdf
Kamal Acharya
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
SyedAbiiAzazi1
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Soumen Santra
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 

Recently uploaded (20)

一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
一比一原版(Otago毕业证)奥塔哥大学毕业证成绩单如何办理
 
Unbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptxUnbalanced Three Phase Systems and circuits.pptx
Unbalanced Three Phase Systems and circuits.pptx
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单专业办理
 
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
NUMERICAL SIMULATIONS OF HEAT AND MASS TRANSFER IN CONDENSING HEAT EXCHANGERS...
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
ACRP 4-09 Risk Assessment Method to Support Modification of Airfield Separat...
 
PPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testingPPT on GRP pipes manufacturing and testing
PPT on GRP pipes manufacturing and testing
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
 
bank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdfbank management system in java and mysql report1.pdf
bank management system in java and mysql report1.pdf
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
[JPP-1] - (JEE 3.0) - Kinematics 1D - 14th May..pdf
 
basic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdfbasic-wireline-operations-course-mahmoud-f-radwan.pdf
basic-wireline-operations-course-mahmoud-f-radwan.pdf
 
AIR POLLUTION lecture EnE203 updated.pdf
AIR POLLUTION lecture EnE203 updated.pdfAIR POLLUTION lecture EnE203 updated.pdf
AIR POLLUTION lecture EnE203 updated.pdf
 
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptxTOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
TOP 10 B TECH COLLEGES IN JAIPUR 2024.pptx
 
Online aptitude test management system project report.pdf
Online aptitude test management system project report.pdfOnline aptitude test management system project report.pdf
Online aptitude test management system project report.pdf
 
14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application14 Template Contractual Notice - EOT Application
14 Template Contractual Notice - EOT Application
 
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTSHeap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
Heap Sort (SS).ppt FOR ENGINEERING GRADUATES, BCA, MCA, MTECH, BSC STUDENTS
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 

Weka

  • 1. Presented by:- Gabani Juli Gamit Manish Gayatri Rathod Gohel Harsh Gohil Krishna
  • 2.  Waikato Environment for knowledge Analysis  Weka is a collection of machine learning algorithms for data mining tasks.  The algorithms can either be applied directly to a dataset or called from your own java code.  Weka contains tools for data Pre-Processing, Classification, Regression, Clustering, Association rules and Visualization.  It is also well-suited for developing new machine learning schemes.
  • 3.  49 data preprocessing tools  76 classification/ regression algorithms  8 clustering algorithms  3 algorithms for finding association rules  15 attribute/ subset evaluators + 10 search algorithms for feature selection
  • 4.  When you start Weka,the GUI chooser pops up and lets you choose four ways to work with Weka and your data.
  • 5.  Simple CLI provides a simple command-line interface and allows direct execution of Weka commands.  Explorer is an environment for exploring data.  Experimenter is an environment for performing experiments and conducting statistical tests between learning schemes.  KnowledgeFlow is a Java-Beans-based interface for setting up and running machine learning experiments.
  • 6.  “Weka Explorer” window appears on a screen.
  • 7.  Preprocessing Data  At the very top of the window, just below the title bar there is a row of tabs. Only the first tab ‘preprocessor’ is active at the moment because there is no dataset open.  The first three buttons at the top of the preprocess section enable you to load data into Weka.  Data can be imported from a file in various formats:ARFF,CSV,C4.5,binry,it can also be read from a URL or from an SQL database(using JDBC).
  • 8.  The most common way of getting the data into Weka is to store it as Attribute-Relation File Format(ARFF)file.
  • 9.  To open file from URL,click on ‘Open URL…’ button, it brings up a dialog box requesting to enter source URL.
  • 10.  Data can also be read from an SQL database using JDBC. Click on ‘Open DB..’ button.
  • 11.  Classifiers  Classifiers in Weka are the models for predicting nominal or numeric quantities.  The learning schemes available in Weka include decision trees and lists, instance-based classifiers, support vector machines, multi-layer perceptrons, logistic regression and bayes nets.  “Meta” classifiers include bagging, stacking, error- correcting output codes and locally weighted learning.
  • 12.  Once you have your data set loaded , all the tabs are available to you. Click on the ‘classify’ tab.
  • 13.  Clustering Data  Weka contains ‘clusterers’ for finding groups of similar instances in a dataset.  The clustering schemes available in Weka are k-means, EM , Cobweb ,X-means , Farthest First .  Cluster can be visualized and compared to ‘true’ clusters(if given).  Evaluation is based on log likelihood if clustering scheme produces a portability distribution.
  • 14.  Finding Associations:  Weka contains an implementation of the Apriori algorithm for learning association rules.  This is the only currently available scheme for learning associations in Weka.  It works only with discrete data and will identify statistical dependencies between groups of attributes, milk, peanut butter and bread, jelly, beer and diapers, with confidence 40% and support 30%.  Apriori can compute all rules that have a given minimum support and exceed a given confidence.
  • 15.  Attribute Selection:  Attribute selection searches through all possible combinations of attributes in the data and finds which subset of attributes works best for prediction .  Attribute selection methods contain two parts: a search method such as best-first, forward selection, random, exhaustive, genetic algorithm, ranking, and an evaluation method such as correlation-based, wrapper , information gain, chi-squared. Attribute selection mechanism is very flexible.  Weka allows(almost) arbitrary combinations of the two methods.
  • 16.  Data Visualization:  Weka’s visualization allows you to visualize a 2-D plot of the current working relation.  Visualization is very useful in practice, it helps to determine difficulty of the learning problem.  Weka can visualize single attributes (1-d) and pairs of attributes (2-d), rotate 3-d visualizations.  Weka has ‘Jitter’ option to deal with nominal attributes and to detect “hidden” data points