A little about data mining

Data mining and machine learning used to be two cousins with different parents. Now they are growing increasingly alike, almost like twins, and people often refer to data mining as machine learning. The field of machine learning grew out of the effort to build artificial intelligence; its major concern is making a machine learn and adapt to new information. The field of data mining grew out of knowledge discovery in databases. Data mining focuses on better understanding the characteristics and patterns among variables in large databases, using a variety of statistical and analytical tools. It is used to identify relationships among variables in large data sets and to understand the hidden patterns they may contain. http://nguyenngocbinhphuong.com/supervised-learning-vs-unsupervised-learning/

• Data mining focuses on better understanding the characteristics and patterns among variables in large databases, using a variety of statistical and analytical tools.
◦ It is used to identify relationships among variables in large data sets and to understand the hidden patterns they may contain.
◦ XLMiner software implements many basic data mining procedures in a spreadsheet environment.

• Supervised data mining techniques have a dependent variable that the method tries to predict.
◦ Classification and prediction/forecasting methods are supervised data mining techniques.
• Unsupervised data mining techniques have no dependent variable. Instead, they search for patterns and structure among all of the variables.
◦ One popular unsupervised method is association analysis (known in marketing as market basket analysis).
◦ The most common unsupervised method is clustering (known in marketing as segmentation).
[Figure labels: Descriptive analytics, Predictive analytics]
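
To make the idea of association analysis a little more concrete, here is a minimal sketch in Python. It only counts how often pairs of items appear together in the same basket; real market basket tools go further than this. The baskets and the `pair_support` helper are made up for illustration and do not come from the slides. Note that there is no dependent variable anywhere: the method just looks for structure among the items.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets (illustrative data, not from the slides).
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "eggs", "milk"},
    {"butter", "eggs"},
]


def pair_support(baskets):
    """Return the fraction of baskets that contain each pair of items."""
    counts = Counter()
    for basket in baskets:
        # Sort the items so each pair is always counted under the same key.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n / len(baskets) for pair, n in counts.items()}


support = pair_support(baskets)

# List the pairs from most to least frequently bought together.
for pair, value in sorted(support.items(), key=lambda kv: -kv[1]):
    print(pair, value)
```

In this toy data, bread and milk appear together in three of the four baskets, so that pair has the highest support.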

Suppose you had a basket and filled it with different kinds of fruit.

• We have four types of fruit:
◦ APPLE
◦ BANANA
◦ GRAPE
◦ CHERRY

• From your previous work, you have already learned the physical characteristics of each fruit, so arranging fruits of the same type in one place is now easy.
• In data mining terminology, that earlier work is called training data. You were able to learn from the training data because it includes a response variable: the name of each fruit.

• Suppose you take a new fruit from the basket. You will look at the size, color, and shape of that particular fruit.
◦ If the size is big, the color is red, and the shape is round with a depression at the top, you will identify the fruit as an apple and put it in the apple group.
• Learning from training data first and then applying that knowledge to test data (the new fruit) is called supervised learning.
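
The fruit example can be sketched in a few lines of Python. This is only an illustration under some assumptions: the `classify` helper and the shape descriptions for banana, grape, and cherry are made up here, and the matching rule (pick the training example that shares the most features) is one simple choice among many possible classifiers.

```python
# Hypothetical training data: (size, color, shape) features paired with a
# known fruit name, which plays the role of the response variable.
# Shapes other than the apple's are assumptions made for this sketch.
training_data = [
    (("big", "red", "round with a depression at the top"), "apple"),
    (("big", "green", "long and curved"), "banana"),
    (("small", "green", "round"), "grape"),
    (("small", "red", "round"), "cherry"),
]


def classify(features, training_data):
    """Return the label of the training example sharing the most features."""

    def matches(example):
        known_features, _label = example
        return sum(a == b for a, b in zip(features, known_features))

    _best_features, best_label = max(training_data, key=matches)
    return best_label


# Test data: a new fruit taken from the basket, with no name attached.
new_fruit = ("big", "red", "round with a depression at the top")
print(classify(new_fruit, training_data))  # prints "apple"
```

The important point is that the fruit names are available during training; the new fruit is labeled by reusing what was learned from them.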

• This time, you don't know anything about the fruits; this is the first time you have seen them.
• So how will you arrange them? What will you do first?
You will take each fruit and arrange it according to its physical characteristics.

• Suppose you consider color first. Arranging the fruits by color gives groups like these:
◦ RED COLOR GROUP: apples & cherries.
◦ GREEN COLOR GROUP: bananas & grapes.
• Now you take another physical characteristic, such as size:
◦ RED COLOR AND BIG SIZE: apples.
◦ RED COLOR AND SMALL SIZE: cherries.
◦ GREEN COLOR AND BIG SIZE: bananas.
◦ GREEN COLOR AND SMALL SIZE: grapes.
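
The two grouping steps above can be sketched in Python as follows. The observations and the `group_by` helper are hypothetical, and grouping on hand-picked attributes is a simplification: clustering algorithms such as k-means find groups from distances between observations rather than from exact matches. What the sketch does share with clustering is that no fruit names are used at any point.

```python
from collections import defaultdict

# Unlabeled observations: only (color, size) are recorded, no fruit names.
# The attribute values follow the slides; the number of fruits is made up.
fruits = [
    ("red", "big"), ("green", "small"), ("red", "small"),
    ("green", "big"), ("red", "big"), ("green", "small"),
]


def group_by(items, key):
    """Collect items into lists keyed by the value of key(item)."""
    groups = defaultdict(list)
    for item in items:
        groups[key(item)].append(item)
    return dict(groups)


# Step 1: group on color only.
by_color = group_by(fruits, key=lambda fruit: fruit[0])

# Step 2: refine the groups by using color and size together.
by_color_and_size = group_by(fruits, key=lambda fruit: fruit)

for group, members in by_color_and_size.items():
    print(group, len(members))
```

The first step yields two groups and the second yields four, matching the red/green and big/small groups described above.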

• Here you did not learn anything beforehand: there is no training data and no response variable.
• In data mining, this kind of learning is known as unsupervised learning.
