SlideShare a Scribd company logo
1 of 11
Download to read offline
DATAWAREHOUSINGDATAWAREHOUSING
CS-543CS-543
Muhammad Adil RajaMuhammad Adil Raja
2001-03-00172001-03-0017
Data MiningData Mining
Steps in Data MiningSteps in Data Mining
a.a. Exploration.Exploration.
b.b. Model Building and Validation.Model Building and Validation.
c.c. Deployment.Deployment.
Techniques Used In DataTechniques Used In Data
MiningMining
 Association analysis.Association analysis.
 Decision trees.Decision trees.
 Neural networks.Neural networks.
 Statistical methods in general.Statistical methods in general.
Decision TreesDecision Trees
A decision tree takes as input an objectA decision tree takes as input an object
or situation described by a set ofor situation described by a set of
properties, and outputs a yes/noproperties, and outputs a yes/no
decision. Decision trees thereforedecision. Decision trees therefore
represent Boolean functions. Functionsrepresent Boolean functions. Functions
with a larger range of outputs can also bewith a larger range of outputs can also be
represented...."represented...."
Decision TreesDecision Trees
 Are Also Known As Classification TreesAre Also Known As Classification Trees
 Regression Trees: A Variant of DecisionRegression Trees: A Variant of Decision
Trees.Trees.
Classification versusClassification versus
Regression TreesRegression Trees
As with all regression techniques weAs with all regression techniques we
assume the existence of a singleassume the existence of a single
response (target) variable and one orresponse (target) variable and one or
more predictor variables. If the responsemore predictor variables. If the response
variable is categorical then classificationvariable is categorical then classification
or decision trees are created and if theor decision trees are created and if the
response variable is continuous thenresponse variable is continuous then
regression trees are produced.regression trees are produced.
Target variable is Group (G) with a binary Response.
A, C & D are Continuous Predictors and B is Categorical
Flexibility ofFlexibility of
Classification TreesClassification Trees
 The ability ofThe ability of classification treesclassification trees is tois to
examine the effects of the predictorexamine the effects of the predictor
variables one at a time.variables one at a time.
How To Split ?How To Split ?
 With a Categorical Predictor Having N LevelsWith a Categorical Predictor Having N Levels
There Can Be 2There Can Be 2k-1k-1 -1 Candidate Splits.-1 Candidate Splits.
 With a Continuous Predictor Having N DistinctWith a Continuous Predictor Having N Distinct
Values There Can Be 2Values There Can Be 2N-1N-1 -1 Candidate Splits.-1 Candidate Splits.
 All Levels of All Predictors Can be Equally LikelyAll Levels of All Predictors Can be Equally Likely
Candidates For Splitting.Candidates For Splitting.
 We Have To Choose a Value Which DecreasesWe Have To Choose a Value Which Decreases
The Misclassification.The Misclassification.
Splitting ContinuedSplitting Continued
 Split till the time misclassification in theSplit till the time misclassification in the
terminal nodes keeps on decreasing.terminal nodes keeps on decreasing.
 Splitting beyond a certain depth does notSplitting beyond a certain depth does not
decrease the misclassification.decrease the misclassification.
 In certain cases splitting beyond a certainIn certain cases splitting beyond a certain
depth may increase the misclassificationdepth may increase the misclassification
as well.as well.

More Related Content

Viewers also liked

Signed, Sealed... Delivered? Behind Certifications and Beyond Labels
Signed, Sealed... Delivered? Behind Certifications and Beyond LabelsSigned, Sealed... Delivered? Behind Certifications and Beyond Labels
Signed, Sealed... Delivered? Behind Certifications and Beyond LabelsSustainable Brands
 
Vozdaverdade escudo-110625110719-phpapp01
Vozdaverdade escudo-110625110719-phpapp01Vozdaverdade escudo-110625110719-phpapp01
Vozdaverdade escudo-110625110719-phpapp01Danielly Carvalho
 
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...Cristhiane Assenhaimer Takahashi
 
Augusta National Golf Club - A Master Course
Augusta National Golf Club - A Master CourseAugusta National Golf Club - A Master Course
Augusta National Golf Club - A Master Coursereflectiveworke16
 
Nagoya.R #14 入門者講習
Nagoya.R #14 入門者講習Nagoya.R #14 入門者講習
Nagoya.R #14 入門者講習Yusaku Kawaguchi
 
The Knock Knock Protocol
The Knock Knock ProtocolThe Knock Knock Protocol
The Knock Knock Protocoladil raja
 
How Transparency Drives Performance
How Transparency Drives PerformanceHow Transparency Drives Performance
How Transparency Drives PerformanceSustainable Brands
 
統計環境R_データ分析編2016
統計環境R_データ分析編2016統計環境R_データ分析編2016
統計環境R_データ分析編2016wada, kazumi
 
Writing effective buisness proposals
Writing effective buisness proposalsWriting effective buisness proposals
Writing effective buisness proposalsKinverg
 
Attitude and Soft Skills
Attitude and Soft SkillsAttitude and Soft Skills
Attitude and Soft SkillsAbdalis Toro
 
On Research (And Development)
On Research (And Development)On Research (And Development)
On Research (And Development)adil raja
 

Viewers also liked (13)

Signed, Sealed... Delivered? Behind Certifications and Beyond Labels
Signed, Sealed... Delivered? Behind Certifications and Beyond LabelsSigned, Sealed... Delivered? Behind Certifications and Beyond Labels
Signed, Sealed... Delivered? Behind Certifications and Beyond Labels
 
Vozdaverdade escudo-110625110719-phpapp01
Vozdaverdade escudo-110625110719-phpapp01Vozdaverdade escudo-110625110719-phpapp01
Vozdaverdade escudo-110625110719-phpapp01
 
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...
Turbidimetry for the Stability Evaluation of Emulsions Used in machining indu...
 
Augusta National Golf Club - A Master Course
Augusta National Golf Club - A Master CourseAugusta National Golf Club - A Master Course
Augusta National Golf Club - A Master Course
 
Resume Leyla Clothier
Resume Leyla ClothierResume Leyla Clothier
Resume Leyla Clothier
 
Thompson C.V.
Thompson C.V.Thompson C.V.
Thompson C.V.
 
Nagoya.R #14 入門者講習
Nagoya.R #14 入門者講習Nagoya.R #14 入門者講習
Nagoya.R #14 入門者講習
 
The Knock Knock Protocol
The Knock Knock ProtocolThe Knock Knock Protocol
The Knock Knock Protocol
 
How Transparency Drives Performance
How Transparency Drives PerformanceHow Transparency Drives Performance
How Transparency Drives Performance
 
統計環境R_データ分析編2016
統計環境R_データ分析編2016統計環境R_データ分析編2016
統計環境R_データ分析編2016
 
Writing effective buisness proposals
Writing effective buisness proposalsWriting effective buisness proposals
Writing effective buisness proposals
 
Attitude and Soft Skills
Attitude and Soft SkillsAttitude and Soft Skills
Attitude and Soft Skills
 
On Research (And Development)
On Research (And Development)On Research (And Development)
On Research (And Development)
 

Similar to Data Warehousing

Cluster analysis
Cluster analysisCluster analysis
Cluster analysisAcad
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4Roger Barga
 
A Decision Tree Based Classifier for Classification & Prediction of Diseases
A Decision Tree Based Classifier for Classification & Prediction of DiseasesA Decision Tree Based Classifier for Classification & Prediction of Diseases
A Decision Tree Based Classifier for Classification & Prediction of Diseasesijsrd.com
 
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...cscpconf
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data MiningKai Koenig
 
Barga Data Science lecture 5
Barga Data Science lecture 5Barga Data Science lecture 5
Barga Data Science lecture 5Roger Barga
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationMario Sangiorgio
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Edureka!
 
Clinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's diseaseClinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's diseaseGeorge Kalangi
 
Using Diversity for Automated Boundary Value Testing
Using Diversity for Automated Boundary Value TestingUsing Diversity for Automated Boundary Value Testing
Using Diversity for Automated Boundary Value TestingFelix Dobslaw
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - ReportAkanksha Gohil
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniquesijsrd.com
 
dataminingclassificationprediction123 .pptx
dataminingclassificationprediction123 .pptxdataminingclassificationprediction123 .pptx
dataminingclassificationprediction123 .pptxAsrithaKorupolu
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and InconsistencyNeil Rubens
 

Similar to Data Warehousing (20)

Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Barga Data Science lecture 4
Barga Data Science lecture 4Barga Data Science lecture 4
Barga Data Science lecture 4
 
Dbm630 lecture06
Dbm630 lecture06Dbm630 lecture06
Dbm630 lecture06
 
A Decision Tree Based Classifier for Classification & Prediction of Diseases
A Decision Tree Based Classifier for Classification & Prediction of DiseasesA Decision Tree Based Classifier for Classification & Prediction of Diseases
A Decision Tree Based Classifier for Classification & Prediction of Diseases
 
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
Analysis of Bayes, Neural Network and Tree Classifier of Classification Techn...
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Barga Data Science lecture 5
Barga Data Science lecture 5Barga Data Science lecture 5
Barga Data Science lecture 5
 
Decision tree
Decision treeDecision tree
Decision tree
 
Hx3115011506
Hx3115011506Hx3115011506
Hx3115011506
 
Current Approaches in Search Result Diversification
Current Approaches in Search Result DiversificationCurrent Approaches in Search Result Diversification
Current Approaches in Search Result Diversification
 
Machine Learning (Decisoion Trees)
Machine Learning (Decisoion Trees)Machine Learning (Decisoion Trees)
Machine Learning (Decisoion Trees)
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
 
Clinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's diseaseClinical Data Classification of alzheimer's disease
Clinical Data Classification of alzheimer's disease
 
Using Diversity for Automated Boundary Value Testing
Using Diversity for Automated Boundary Value TestingUsing Diversity for Automated Boundary Value Testing
Using Diversity for Automated Boundary Value Testing
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
Decision tree
Decision treeDecision tree
Decision tree
 
data mining.pptx
data mining.pptxdata mining.pptx
data mining.pptx
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniques
 
dataminingclassificationprediction123 .pptx
dataminingclassificationprediction123 .pptxdataminingclassificationprediction123 .pptx
dataminingclassificationprediction123 .pptx
 
Outliers and Inconsistency
Outliers and InconsistencyOutliers and Inconsistency
Outliers and Inconsistency
 

More from adil raja

A Software Requirements Specification
A Software Requirements SpecificationA Software Requirements Specification
A Software Requirements Specificationadil raja
 
NUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehicles
NUAV - A Testbed for Development of Autonomous Unmanned Aerial VehiclesNUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehicles
NUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehiclesadil raja
 
DevOps Demystified
DevOps DemystifiedDevOps Demystified
DevOps Demystifiedadil raja
 
Simulators as Drivers of Cutting Edge Research
Simulators as Drivers of Cutting Edge ResearchSimulators as Drivers of Cutting Edge Research
Simulators as Drivers of Cutting Edge Researchadil raja
 
File Transfer Through Sockets
File Transfer Through SocketsFile Transfer Through Sockets
File Transfer Through Socketsadil raja
 
Remote Command Execution
Remote Command ExecutionRemote Command Execution
Remote Command Executionadil raja
 
CMM Level 3 Assessment of Xavor Pakistan
CMM Level 3 Assessment of Xavor PakistanCMM Level 3 Assessment of Xavor Pakistan
CMM Level 3 Assessment of Xavor Pakistanadil raja
 
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...adil raja
 
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...adil raja
 
Real-Time Non-Intrusive Speech Quality Estimation for VoIP
Real-Time Non-Intrusive Speech Quality Estimation for VoIPReal-Time Non-Intrusive Speech Quality Estimation for VoIP
Real-Time Non-Intrusive Speech Quality Estimation for VoIPadil raja
 
ULMAN GUI Specifications
ULMAN GUI SpecificationsULMAN GUI Specifications
ULMAN GUI Specificationsadil raja
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...adil raja
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...adil raja
 
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...adil raja
 
Modelling the Effect of Packet Loss on Speech Quality
Modelling the Effect of Packet Loss on Speech QualityModelling the Effect of Packet Loss on Speech Quality
Modelling the Effect of Packet Loss on Speech Qualityadil raja
 
A Random Presentation
A Random PresentationA Random Presentation
A Random Presentationadil raja
 

More from adil raja (20)

ANNs.pdf
ANNs.pdfANNs.pdf
ANNs.pdf
 
A Software Requirements Specification
A Software Requirements SpecificationA Software Requirements Specification
A Software Requirements Specification
 
NUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehicles
NUAV - A Testbed for Development of Autonomous Unmanned Aerial VehiclesNUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehicles
NUAV - A Testbed for Development of Autonomous Unmanned Aerial Vehicles
 
DevOps Demystified
DevOps DemystifiedDevOps Demystified
DevOps Demystified
 
Simulators as Drivers of Cutting Edge Research
Simulators as Drivers of Cutting Edge ResearchSimulators as Drivers of Cutting Edge Research
Simulators as Drivers of Cutting Edge Research
 
File Transfer Through Sockets
File Transfer Through SocketsFile Transfer Through Sockets
File Transfer Through Sockets
 
Remote Command Execution
Remote Command ExecutionRemote Command Execution
Remote Command Execution
 
Thesis
ThesisThesis
Thesis
 
CMM Level 3 Assessment of Xavor Pakistan
CMM Level 3 Assessment of Xavor PakistanCMM Level 3 Assessment of Xavor Pakistan
CMM Level 3 Assessment of Xavor Pakistan
 
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
 
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
Implementation of a Non-Intrusive Speech Quality Assessment Tool on a Mid-Net...
 
Real-Time Non-Intrusive Speech Quality Estimation for VoIP
Real-Time Non-Intrusive Speech Quality Estimation for VoIPReal-Time Non-Intrusive Speech Quality Estimation for VoIP
Real-Time Non-Intrusive Speech Quality Estimation for VoIP
 
VoIP
VoIPVoIP
VoIP
 
ULMAN GUI Specifications
ULMAN GUI SpecificationsULMAN GUI Specifications
ULMAN GUI Specifications
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
ULMAN-GUI
ULMAN-GUIULMAN-GUI
ULMAN-GUI
 
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
Modeling the Effect of Packet Loss on Speech Quality: Genetic Programming Bas...
 
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...
Modeling the Effect of packet Loss on Speech Quality: GP Based Symbolic Regre...
 
Modelling the Effect of Packet Loss on Speech Quality
Modelling the Effect of Packet Loss on Speech QualityModelling the Effect of Packet Loss on Speech Quality
Modelling the Effect of Packet Loss on Speech Quality
 
A Random Presentation
A Random PresentationA Random Presentation
A Random Presentation
 

Recently uploaded

HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 

Data Warehousing

  • 3. Steps in Data MiningSteps in Data Mining a.a. Exploration.Exploration. b.b. Model Building and Validation.Model Building and Validation. c.c. Deployment.Deployment.
  • 4. Techniques Used In DataTechniques Used In Data MiningMining  Association analysis.Association analysis.  Decision trees.Decision trees.  Neural networks.Neural networks.  Statistical methods in general.Statistical methods in general.
  • 5. Decision TreesDecision Trees A decision tree takes as input an objectA decision tree takes as input an object or situation described by a set ofor situation described by a set of properties, and outputs a yes/noproperties, and outputs a yes/no decision. Decision trees thereforedecision. Decision trees therefore represent Boolean functions. Functionsrepresent Boolean functions. Functions with a larger range of outputs can also bewith a larger range of outputs can also be represented...."represented...."
  • 6. Decision TreesDecision Trees  Are Also Known As Classification TreesAre Also Known As Classification Trees  Regression Trees: A Variant of DecisionRegression Trees: A Variant of Decision Trees.Trees.
  • 7. Classification versusClassification versus Regression TreesRegression Trees As with all regression techniques weAs with all regression techniques we assume the existence of a singleassume the existence of a single response (target) variable and one orresponse (target) variable and one or more predictor variables. If the responsemore predictor variables. If the response variable is categorical then classificationvariable is categorical then classification or decision trees are created and if theor decision trees are created and if the response variable is continuous thenresponse variable is continuous then regression trees are produced.regression trees are produced.
  • 8. Target variable is Group (G) with a binary Response. A, C & D are Continuous Predictors and B is Categorical
  • 9. Flexibility ofFlexibility of Classification TreesClassification Trees  The ability ofThe ability of classification treesclassification trees is tois to examine the effects of the predictorexamine the effects of the predictor variables one at a time.variables one at a time.
  • 10. How To Split ?How To Split ?  With a Categorical Predictor Having N LevelsWith a Categorical Predictor Having N Levels There Can Be 2There Can Be 2k-1k-1 -1 Candidate Splits.-1 Candidate Splits.  With a Continuous Predictor Having N DistinctWith a Continuous Predictor Having N Distinct Values There Can Be 2Values There Can Be 2N-1N-1 -1 Candidate Splits.-1 Candidate Splits.  All Levels of All Predictors Can be Equally LikelyAll Levels of All Predictors Can be Equally Likely Candidates For Splitting.Candidates For Splitting.  We Have To Choose a Value Which DecreasesWe Have To Choose a Value Which Decreases The Misclassification.The Misclassification.
  • 11. Splitting ContinuedSplitting Continued  Split till the time misclassification in theSplit till the time misclassification in the terminal nodes keeps on decreasing.terminal nodes keeps on decreasing.  Splitting beyond a certain depth does notSplitting beyond a certain depth does not decrease the misclassification.decrease the misclassification.  In certain cases splitting beyond a certainIn certain cases splitting beyond a certain depth may increase the misclassificationdepth may increase the misclassification as well.as well.