Data mining
Data mining (the analysis step of the "Knowledge Discovery in
Databases" process, or KDD) is an interdisciplinary subfield of computer
science. It is the computational process of discovering patterns in large
data sets, using methods at the intersection of artificial intelligence,
machine learning, statistics, and database systems.
Theoretical Foundations of Data Mining
Data Reduction - The basic idea of this theory is to reduce the
data representation, trading accuracy for speed, in response
to the need to obtain quick approximate answers to queries on
very large databases. Some of the data reduction techniques are
as follows:
•Singular Value Decomposition
•Wavelets 
•Regression 
•Log-linear models 
•Histograms 
•Clustering 
•Sampling 
•Construction of Index Trees
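As a rough illustration of the accuracy-for-speed trade-off, the sketch below uses sampling, one of the techniques listed above, to approximate an average over a large column. The function name approximate_mean, the synthetic data, and the sample size are assumptions made for this example only.

```python
import random


def approximate_mean(values, sample_size, seed=0):
    """Estimate the mean of `values` from a simple random sample."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    sample = rng.sample(values, sample_size)
    return sum(sample) / len(sample)


# Hypothetical "large" column: the values 0..99 repeated 1,000 times.
data = [float(i % 100) for i in range(100_000)]

exact = sum(data) / len(data)                        # scans every value
estimate = approximate_mean(data, sample_size=1_000) # scans 1% of the values

print("exact mean:", exact)
print("sampled estimate:", estimate)
```

The estimate reads only a small fraction of the data, so it is cheaper to compute but is only close to the exact answer, not equal to it.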
Data Compression - The basic idea of this theory is to 
compress the given data by encoding in terms of the following: 
•Bits 
•Association Rules 
•Decision Trees 
•Clusters
Pattern Discovery - The basic idea of this theory is to discover
patterns occurring in the database. The following areas
contribute to this theory:
• Machine Learning 
• Neural Network 
• Association Mining 
• Sequential Pattern Matching 
• Clustering
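To make pattern discovery concrete, here is a minimal sketch of one of the listed areas, association mining, that counts how often pairs of items occur together. The function name frequent_pairs, the toy transactions, and the support threshold are hypothetical, and this is a brute-force count rather than any specific published algorithm.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs whose support (fraction of transactions
    containing both items) is at least `min_support`."""
    counts = Counter()
    for items in transactions:
        # Count each unordered pair once per transaction.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    n = len(transactions)
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}


# Hypothetical shopping baskets.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
    ["bread", "milk"],
]

result = frequent_pairs(transactions, min_support=0.5)
print(result)
```

Only the pair bought together in at least half of the baskets survives the threshold; raising or lowering min_support changes how many patterns are reported.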
Probability Theory - This theory is based on statistical theory. 
The basic idea behind this theory is to discover joint probability 
distributions of random variables. 
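A small sketch of this idea, assuming two discrete variables and a handful of made-up observations: the joint distribution is estimated from relative frequencies, and a marginal distribution is recovered by summing over the other variable. The names joint_distribution and marginal are illustrative.

```python
from collections import Counter


def joint_distribution(pairs):
    """Estimate P(X, Y) from observed (x, y) pairs by relative frequency."""
    counts = Counter(pairs)
    total = len(pairs)
    return {outcome: c / total for outcome, c in counts.items()}


def marginal(joint, index):
    """Sum the joint distribution over the other variable."""
    out = {}
    for outcome, p in joint.items():
        key = outcome[index]
        out[key] = out.get(key, 0.0) + p
    return out


# Hypothetical observations of (weather, arrival).
observations = [
    ("rain", "late"), ("rain", "late"), ("rain", "on_time"),
    ("dry", "on_time"), ("dry", "on_time"), ("dry", "on_time"),
    ("dry", "late"), ("dry", "on_time"),
]

joint = joint_distribution(observations)
weather = marginal(joint, 0)
print(joint)
print(weather)
```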
Microeconomic View - This theory views data mining as the task of
finding patterns that are interesting only to the extent that they
can be used in the decision-making process of an enterprise.
Inductive Databases - As per this theory, a database schema
consists of data and patterns that are stored in the database.
Therefore, according to this theory, data mining is the task of
performing induction on databases.
Apart from these database-oriented techniques, statistical
techniques are also available for data analysis. These techniques
can be applied to scientific data and to data from the economic
and social sciences as well.
Statistical Data Mining 
Statistics is the traditional field that deals with the quantification, collection, analysis, and interpretation of data, and with
drawing conclusions from it.
Some of the Statistical Data Mining Techniques are as follows:
•Regression - Regression methods are used to predict the value of a
response variable from one or more predictor variables, where the variables
are numeric. Regression takes several forms:
• Linear 
• Multiple 
• Weighted 
• Polynomial 
• Nonparametric 
• Robust
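The simplest of these forms, linear regression with one predictor, can be sketched with the ordinary least-squares formulas. The function name fit_line and the sample points are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (unnormalised).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical points lying exactly on y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predict the response at x = 5
print(slope, intercept, prediction)
```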
•Generalized Linear Models - Generalized linear models include:
• Logistic Regression
• Poisson Regression
These models allow a categorical response variable to be
related to a set of predictor variables in a manner similar to the modelling of
a numeric response variable using linear regression.
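As a hedged sketch of logistic regression, the example below fits a single-predictor model to a binary response with plain gradient descent on the log-loss. The names sigmoid and fit_logistic, the learning rate, the step count, and the data are all assumptions; practical work would normally rely on a statistics library instead.

```python
import math


def sigmoid(z):
    """Logistic function mapping a linear score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, steps=5000):
    """Fit P(y=1 | x) = sigmoid(w * x + b) by gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradient of the average log-loss with respect to w and b.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical data: larger x makes the outcome 1 more likely.
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]

w, b = fit_logistic(xs, ys)
print("P(y=1 | x=4.0) =", sigmoid(w * 4.0 + b))
```

The linear part w * x + b plays the same role as in linear regression; the sigmoid link is what lets the model output a probability for a two-category response.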
•Analysis of Variance - This technique analyzes experimental data for
two or more populations described by a numeric response variable and
one or more categorical variables (factors).
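For a single factor, the core quantity in analysis of variance is the F statistic, the ratio of between-group to within-group variability. The sketch below computes it directly; the function name one_way_anova_f and the three groups are made up for illustration, and no p-value is computed.

```python
def one_way_anova_f(groups):
    """F statistic for a one-way ANOVA over a list of numeric groups."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    k = len(groups)      # number of groups (levels of the factor)
    n = len(all_values)  # total number of observations
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical response values for three levels of one factor.
groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]

f_stat = one_way_anova_f(groups)
print(f_stat)
```

A large F value suggests the group means differ by more than the spread within the groups would explain.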
•Mixed-effect Models - These models are used for analyzing grouped
data. They describe the relationship between a response variable
and some covariates in data grouped according to one or more factors.
•Factor Analysis - This method is used to determine which variables
combine to generate a given factor.
•Discriminant Analysis - This method is used to predict a categorical
response variable. It assumes that the independent variables follow a
multivariate normal distribution.
•Time Series Analysis - The following are methods for analyzing time-series
data:
•Autoregression Methods 
•Univariate ARIMA (AutoRegressive Integrated Moving Average) Modeling 
•Long-memory time-series modeling
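The first of these methods can be sketched with a first-order autoregression, in which each value is modelled as a multiple of the previous one. The function name fit_ar1, the zero-mean assumption, and the toy series are all assumptions for this example.

```python
def fit_ar1(series):
    """Least-squares estimate of phi in x[t] = phi * x[t-1] + noise.

    Assumes the series is already centred around zero.
    """
    num = sum(series[t] * series[t - 1] for t in range(1, len(series)))
    den = sum(series[t - 1] ** 2 for t in range(1, len(series)))
    return num / den


# Hypothetical noise-free series in which each value is half the previous one.
series = [8.0, 4.0, 2.0, 1.0, 0.5]

phi = fit_ar1(series)
forecast = phi * series[-1]  # one-step-ahead forecast
print(phi, forecast)
```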
Visual Data Mining
Visual Data Mining uses data and/or knowledge visualization
techniques to discover implicit knowledge from large data
sets. Visual Data Mining can be viewed as an integration of the
following disciplines:
•Data Visualization 
•Data Mining
Visual Data Mining is closely related to the following: 
• Computer Graphics 
• Multimedia Systems 
• Human Computer Interaction 
• Pattern Recognition 
• High performance computing
Generally, data visualization and data mining can be integrated in the following ways:
•Data Visualization - The data in the databases or the data 
warehouses can be viewed in several visual forms that are listed 
below: 
• Boxplots 
• 3-D Cubes 
• Data distribution charts 
• Curves 
• Surfaces 
• Link graphs etc.
•Data Mining Result Visualization - Data Mining Result
Visualization is the presentation of the results of data mining in
visual forms, such as scatter plots and
boxplots.
•Data Mining Process Visualization - Data Mining Process
Visualization presents the several processes of data mining. This
allows users to see how the data are extracted. It also
allows users to see from which database or data warehouse
the data are cleaned, integrated, preprocessed, and mined.
[Figure: Rate of cloud in an area]
Audio Data Mining
Audio Data Mining makes use of audio signals to indicate the patterns
of data or the features of data mining results. By
transforming patterns into sound and music, instead of watching
pictures, we can listen to pitches and tunes in order to identify
anything interesting.
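One way to picture this is to map each data value to a pitch, so that rising values are heard as rising notes. The sketch below only computes frequencies and does not play any sound; the function names, the chosen note range, and the sample values are assumptions, and the conversion uses the standard convention that MIDI note 69 is 440 Hz.

```python
def value_to_frequency(value, lo, hi, min_midi=48, max_midi=84):
    """Map a value in [lo, hi] linearly onto a MIDI note, then to Hz."""
    fraction = (value - lo) / (hi - lo) if hi > lo else 0.0
    midi_note = round(min_midi + fraction * (max_midi - min_midi))
    # Equal-temperament conversion: MIDI note 69 corresponds to 440 Hz.
    return 440.0 * 2 ** ((midi_note - 69) / 12)


def sonify(values):
    """Return one frequency per data value, scaled to the data range."""
    lo, hi = min(values), max(values)
    return [value_to_frequency(v, lo, hi) for v in values]


# Hypothetical measurements with an upward trend.
values = [0.0, 21.0, 36.0]

frequencies = sonify(values)
print(frequencies)
```

Feeding such frequencies to any tone generator would let a listener hear the trend or an outlier as a change in pitch.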
Data Mining and Collaborative Filtering
Today, consumers are faced with a large variety of goods and
services while shopping. During live customer transactions, a
Recommender System helps the consumer by making product
recommendations. The Collaborative Filtering Approach is
generally used for recommending products to customers. These
recommendations are based on the opinions of other customers.
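A minimal sketch of user-based collaborative filtering, assuming a small dictionary of made-up ratings: customers who rated items similarly to the target customer get more weight when predicting ratings for items the target has not yet rated. The names cosine_similarity and recommend, and the rating data, are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two {item: rating} dictionaries."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend(target, ratings, top_n=1):
    """Rank items the target has not rated by similarity-weighted ratings."""
    scores, weights = {}, {}
    for user, user_ratings in ratings.items():
        if user == target:
            continue
        sim = cosine_similarity(ratings[target], user_ratings)
        if sim <= 0:
            continue
        for item, rating in user_ratings.items():
            if item in ratings[target]:
                continue  # skip items the target already rated
            scores[item] = scores.get(item, 0.0) + sim * rating
            weights[item] = weights.get(item, 0.0) + sim
    predicted = {item: scores[item] / weights[item] for item in scores}
    return sorted(predicted, key=predicted.get, reverse=True)[:top_n]


# Hypothetical ratings on a 1-5 scale.
ratings = {
    "alice": {"book": 5, "lamp": 1},
    "bob": {"book": 5, "lamp": 1, "pen": 5, "mug": 2},
    "carol": {"book": 1, "lamp": 5, "pen": 1, "mug": 5},
}

print(recommend("alice", ratings))
```

Because bob's opinions match alice's more closely than carol's do, bob's high rating dominates the prediction for alice.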
Openness
Individuals have the right to know what information is collected about
them, who has access to the data, and how the data are being used. One
social concern of data mining is the issue of privacy and information
security. Opt-out policies, which allow consumers to specify limitations
on the use of their personal data, are one approach toward data privacy
protection, while data security-enhancing techniques can
anonymize information for security and privacy.
thanvi

Additional themes of data mining for Msc CS
