SlideShare a Scribd company logo
1 of 13
DATA MINING
PRESENTED BY:
PROFESSOR VIJAY KHATA
COMPUTER DEPARTMENT
DATA MINING TASK PRIMITIVES
• Data mining task is represented in form of data mining query.
• This data mining query is defined in terms of data mining task primitives.
• Will allow the user to interactively communicate with the data mining system.
• Basic building block or components that are used to construct a data mining process.
• The use of data mining task primitives can provide a modular and reusable approach,
which can improve performance, efficiency, and understandability of the data mining
process.
DATA MINING TASK PRIMITIVES
• The data mining task primitives are as follows:
1. The set of task relevant data to be mined
2. Kind of knowledge to be mined
3. Background knowledge to be used in the discovery process
4. Interestingness measures and thresholds for pattern evaluation
5. Representation for visualizing the discovered pattern
1. TASK RELEVANT DATA
 this specifies the portions of the database or the set of data in which user is interested.
 Database name
Database tables
Relevant attributes
Data grouping criteria
2. THE KIND OF KNOWELDGE TO BE MINED.
This specifies the data mining functions can be performed, such as:
1. Characterization
2. Discrimination
3. Association or correlation analysis
4. Classification
5. Prediction
6. clustering
3. BACKGROUNG KNOWLEDGE TO BE USED IN THE
DISCOVERY PROCESS.
It refers to any prior information or understanding that is used to guide the data mining
process.
This can include domain-specific knowledge, such as industry-specific terminology,
trends or best practices, as well as knowledge about data itself.
The use of background knowledge can help to improve the accuracy and relevance of
the insights obtained from the data mining process.
Concept hierarchies are a popular form of background knowledge, which allows data to
be mined at multiple level of abstraction.
4. Interestingness measures and thresholds for pattern evaluation
• This is used to evaluate the patterns that are discovered by the process of knowledge
discovery.
• These measures are used to identify patterns that are meaningful or relevant to the task.
5. Representation for visualizing the discovered pattern
It refers to the methods used to represent the patterns or insights discovered through
data mining in a way that is easy to understand and interpret.
Visualization techniques such as charts, graphs, and maps are commonly used to
represent the data and can help to highlight important trends, patterns, or relationships
within the data.
INTEGRATION OF DATA MINING SYSTEM WITH
DATABASE
The data mining system is integrated with a database or data warehouse system so that it
can do its task in an effective presence.
A data mining system operates in an environment that needed it to communicate with
other data systems like database system.
There are possible integration schemes that can integrate these systems.
1. NO COUPLING
No coupling defines that a data mining system will not use any function of a database
or data warehouse system.
It can retrieve data from specific source (including file system), process data using some
data mining algorithms, and therefore save the mining results in a different file.
2. LOOSE COUPLING
Loose coupling means that a data mining system will use some facilities of a database or
data warehouse system, fetching data from repository managed by these system,
performing data mining, and then storing the mining results either in a file or in a
designated place in a database or data warehouse.
3. SEMI-TIGHT COUPLING-ENHANCED DATA
MINING PERFORMANCE
The semi-tight coupling means that besides linking a data mining system to a
Database/Data warehouse system, efficient implementations of a few essential data
mining primitives (sorting, indexing, aggregation) can be provided in the Database/Data
Warehouse system.
4. TIGHT COUPLING-A UNIFORM INFORMATION
PROCESSING ENVIRONMENT
Tight coupling means that a Data Mining system is smoothly integrated into
the database/Data warehouse system.

More Related Content

Similar to Subhaschamdrabhosesubhqschndrachose.pptx

Data Mining Module 1 Business Analytics.
Data Mining Module 1 Business Analytics.Data Mining Module 1 Business Analytics.
Data Mining Module 1 Business Analytics.Jayanti Pande
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesFellowBuddy.com
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousingSunny Gandhi
 
UNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningUNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningNandakumar P
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptPalaniKumarR2
 
Dwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousingDwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousingDhilsath Fathima
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining systemramya marichamy
 
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxDATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxrandyburney60861
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.pptSamPrem3
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data FusionIRJET Journal
 
Implementing sorting in database systems
Implementing sorting in database systemsImplementing sorting in database systems
Implementing sorting in database systemsunyil96
 

Similar to Subhaschamdrabhosesubhqschndrachose.pptx (20)

Unit i
Unit iUnit i
Unit i
 
Data Mining Module 1 Business Analytics.
Data Mining Module 1 Business Analytics.Data Mining Module 1 Business Analytics.
Data Mining Module 1 Business Analytics.
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
UNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data MiningUNIT 2: Part 2: Data Warehousing and Data Mining
UNIT 2: Part 2: Data Warehousing and Data Mining
 
G045033841
G045033841G045033841
G045033841
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
Dwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousingDwdm unit 1-2016-Data ingarehousing
Dwdm unit 1-2016-Data ingarehousing
 
Architecture of data mining system
Architecture of data mining systemArchitecture of data mining system
Architecture of data mining system
 
Unit II.pdf
Unit II.pdfUnit II.pdf
Unit II.pdf
 
Datawarehousing Terminology
Datawarehousing TerminologyDatawarehousing Terminology
Datawarehousing Terminology
 
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docxDATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
DATA SCIENCE AND BIG DATA ANALYTICSCHAPTER 2 DATA ANA.docx
 
20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt20IT501_DWDM_PPT_Unit_II.ppt
20IT501_DWDM_PPT_Unit_II.ppt
 
Cross Domain Data Fusion
Cross Domain Data FusionCross Domain Data Fusion
Cross Domain Data Fusion
 
Implementing sorting in database systems
Implementing sorting in database systemsImplementing sorting in database systems
Implementing sorting in database systems
 
Data Mining
Data MiningData Mining
Data Mining
 
Advanced Database System
Advanced Database SystemAdvanced Database System
Advanced Database System
 
P341
P341P341
P341
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 

Recently uploaded

9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 

Recently uploaded (20)

9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 

Subhaschamdrabhosesubhqschndrachose.pptx

  • 1. DATA MINING PRESENTED BY: PROFESSOR VIJAY KHATA COMPUTER DEPARTMENT
  • 2. DATA MINING TASK PRIMITIVES • Data mining task is represented in form of data mining query. • This data mining query is defined in terms of data mining task primitives. • Will allow the user to interactively communicate with the data mining system. • Basic building block or components that are used to construct a data mining process. • The use of data mining task primitives can provide a modular and reusable approach, which can improve performance, efficiency, and understandability of the data mining process.
  • 3. DATA MINING TASK PRIMITIVES • The data mining task primitives are as follows: 1. The set of task relevant data to be mined 2. Kind of knowledge to be mined 3. Background knowledge to be used in the discovery process 4. Interestingness measures and thresholds for pattern evaluation 5. Representation for visualizing the discovered pattern
  • 4. 1. TASK RELEVANT DATA  this specifies the portions of the database or the set of data in which user is interested.  Database name Database tables Relevant attributes Data grouping criteria
  • 5. 2. THE KIND OF KNOWELDGE TO BE MINED. This specifies the data mining functions can be performed, such as: 1. Characterization 2. Discrimination 3. Association or correlation analysis 4. Classification 5. Prediction 6. clustering
  • 6. 3. BACKGROUNG KNOWLEDGE TO BE USED IN THE DISCOVERY PROCESS. It refers to any prior information or understanding that is used to guide the data mining process. This can include domain-specific knowledge, such as industry-specific terminology, trends or best practices, as well as knowledge about data itself. The use of background knowledge can help to improve the accuracy and relevance of the insights obtained from the data mining process. Concept hierarchies are a popular form of background knowledge, which allows data to be mined at multiple level of abstraction.
  • 7. 4. Interestingness measures and thresholds for pattern evaluation • This is used to evaluate the patterns that are discovered by the process of knowledge discovery. • These measures are used to identify patterns that are meaningful or relevant to the task.
  • 8. 5. Representation for visualizing the discovered pattern It refers to the methods used to represent the patterns or insights discovered through data mining in a way that is easy to understand and interpret. Visualization techniques such as charts, graphs, and maps are commonly used to represent the data and can help to highlight important trends, patterns, or relationships within the data.
  • 9. INTEGRATION OF DATA MINING SYSTEM WITH DATABASE The data mining system is integrated with a database or data warehouse system so that it can do its task in an effective presence. A data mining system operates in an environment that needed it to communicate with other data systems like database system. There are possible integration schemes that can integrate these systems.
  • 10. 1. NO COUPLING No coupling defines that a data mining system will not use any function of a database or data warehouse system. It can retrieve data from specific source (including file system), process data using some data mining algorithms, and therefore save the mining results in a different file.
  • 11. 2. LOOSE COUPLING Loose coupling means that a data mining system will use some facilities of a database or data warehouse system, fetching data from repository managed by these system, performing data mining, and then storing the mining results either in a file or in a designated place in a database or data warehouse.
  • 12. 3. SEMI-TIGHT COUPLING-ENHANCED DATA MINING PERFORMANCE The semi-tight coupling means that besides linking a data mining system to a Database/Data warehouse system, efficient implementations of a few essential data mining primitives (sorting, indexing, aggregation) can be provided in the Database/Data Warehouse system.
  • 13. 4. TIGHT COUPLING-A UNIFORM INFORMATION PROCESSING ENVIRONMENT Tight coupling means that a Data Mining system is smoothly integrated into the database/Data warehouse system.