SlideShare a Scribd company logo
Group 7
What is Data
                  Mining ?



                                                Mining and discovery of new
                                                information in terms of
                                                patterns or
                                                rules from vast amounts of
                                                data.



The process of discovering meaningful new correlations, patterns and trends by sifting
through large amounts of data stored in repositoties, using pattern recognition
technologies as well as statical and methematics techniques.
Why we mine
  Data ?




  Commercial View Point :-
  Lots of data is being collected and warehoused .
  Computers have become cheaper and more powerful.
  Competitive Pressure is Strong .


  Scientific View Point :-
  Data collected and stored at enormous speeds (GB/hour).
  Traditional techniques infeasible for raw data.
  Data mining may help scientists.
On what kind of
   Data...?



          •   Relational databases
          •   Data warehouses
          •   Transactional databases
          •   Advanced database systems:
                   Object-relational
                   Spacial and Temporal
                   Time-series
                   Multimedia, text
                   WWW
What are the goals
 of Data mining?



    • Prediction  e.g. sales volume, earthquakes
    • Identification e.g. existence of genes, system
    intrusions
    • Classification of different categories e.g. discount
    seeking shoppers or loyal regular shoppers in a
    supermarket
    • Optimization of limited resources such as time,
    space, money or materials and maximization of
    outputs such as sales or profits
What are the
      applications of Data-
            Mining ?


● Marketing
                                     ● Finance
 Analysis of consumer behavior
                                      Creditworthiness of clients
 Advertising campaigns
                                      Performance analysis of finance
 Targeted mailings
                                        investments
 Segmentation of
                                      Fraud detection
  customers, stores, or products

● Manufacturing
                                     ● Health Care
 Optimization of resources
                                      Discovering patterns in X-ray
 Optimization of manufacturing
                                        images
  processes
                                      Analyzing side effects of drugs
 Product design based on customer
                                      Effectiveness of treatments
  requirements
What are the present
commercial tools for
   Data Mining ?




                     Data to knowledge
 SAS                                            Oracle data-miner




 Intelligent miner                 Clementine
How to build a data
  mining model?       An important concept is
                      that building a mining
                      model is part of a larger
                      process.
1. Defining
    the
 problem.     Clearly define the business
                       problem.
2. Preparing
    Data       consolidate and clean the data that
               was identified in the Defining the
               Problem step.
3.Exploring
   Data
              Explore the prepared data



       .
4.Building
 Models      Before you build a model, you must
             randomly separate the prepared data into
             separate training and testing datasets.
             You use the training dataset to build the
             model, and the testing dataset to test the
             accuracy of the model by creating
             prediction queries.
5. Exploring
and validating
models           Explore the models that you
                 have built and test their
                 effectiveness.
6. Deploying
and updating
               Deploy to a production
models         environment the models
               that performed the best.
What are the major
issues in Data-Mining
      concept ?

    Mining different kinds of knowledge in databases
    Interactive mining of knowledge at multiple levels of
     abstraction
    Incorporation of background knowledge
    Data mining query languages and ad-hoc data mining
    Expression and visualization of data mining results
    Handling noise and incomplete data
    Pattern evaluation: the interestingness problem
    Integration of the discovered knowledge with existing
     knowledge: A knowledge fusion problem
    Protection of data security, integrity, and privacy
How will be the future of
 Data-Mining concept?




      ● Active research is ongoing
       Neural Networks
       Regression Analysis
       Genetic Algorithms
      ● Data mining is used in many areas today. We
      cannot even begin to imagine what the future
      holds in its womb!
Thank You !

More Related Content

What's hot

Data mining
Data mining Data mining
Data mining
sayalipatil528
 
Application of KDD & its future scope
Application of KDD & its future scopeApplication of KDD & its future scope
Application of KDD & its future scope
Tanmay Sethi
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
Hoang Nguyen
 
Key Principles Of Data Mining
Key Principles Of Data MiningKey Principles Of Data Mining
Key Principles Of Data Mining
tobiemuir
 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
Sanzid Kawsar
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
Seerat Malik
 
Data Mining and Data Warehouse
Data Mining and Data WarehouseData Mining and Data Warehouse
Data Mining and Data Warehouse
Anupam Sharma
 
Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and Applications
IJMER
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 
Data mining
Data miningData mining
Data mining
pradeepa n
 
Data mining
Data miningData mining
Data mining
jadhav_priti
 
Application areas of data mining
Application areas of data miningApplication areas of data mining
Application areas of data mining
priya jain
 
Group7_Datamining_Project_Report_Final
Group7_Datamining_Project_Report_FinalGroup7_Datamining_Project_Report_Final
Group7_Datamining_Project_Report_Final
Manikandan Sundarapandian
 
Data Mining
Data MiningData Mining
Data Mining
Data MiningData Mining
Data Mining
Mîrză MuNib
 
Data Mining in telecommunication industry
Data Mining in telecommunication industryData Mining in telecommunication industry
Data Mining in telecommunication industry
pragya ratan
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
Si Krishan
 
Data mining
Data miningData mining
Data mining
heba_ahmad
 
Case study for DWDM
Case study for DWDMCase study for DWDM
Case study for DWDM
Aniruddha Achar B P
 
Data Mining
Data MiningData Mining
Data Mining
Megha Sharma
 

What's hot (20)

Data mining
Data mining Data mining
Data mining
 
Application of KDD & its future scope
Application of KDD & its future scopeApplication of KDD & its future scope
Application of KDD & its future scope
 
Data mining and knowledge discovery
Data mining and knowledge discoveryData mining and knowledge discovery
Data mining and knowledge discovery
 
Key Principles Of Data Mining
Key Principles Of Data MiningKey Principles Of Data Mining
Key Principles Of Data Mining
 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Data Mining and Data Warehouse
Data Mining and Data WarehouseData Mining and Data Warehouse
Data Mining and Data Warehouse
 
Data Mining: Future Trends and Applications
Data Mining: Future Trends and ApplicationsData Mining: Future Trends and Applications
Data Mining: Future Trends and Applications
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
 
Data mining
Data miningData mining
Data mining
 
Data mining
Data miningData mining
Data mining
 
Application areas of data mining
Application areas of data miningApplication areas of data mining
Application areas of data mining
 
Group7_Datamining_Project_Report_Final
Group7_Datamining_Project_Report_FinalGroup7_Datamining_Project_Report_Final
Group7_Datamining_Project_Report_Final
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Mining in telecommunication industry
Data Mining in telecommunication industryData Mining in telecommunication industry
Data Mining in telecommunication industry
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Data mining
Data miningData mining
Data mining
 
Case study for DWDM
Case study for DWDMCase study for DWDM
Case study for DWDM
 
Data Mining
Data MiningData Mining
Data Mining
 

Viewers also liked

Plán školení technik Haier
Plán školení   technik HaierPlán školení   technik Haier
Plán školení technik Haier
Michal Kadlec
 
The european union tr
The european union trThe european union tr
The european union tr
comeniusipb
 
Pp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comeniusPp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comenius
comeniusipb
 
AGEL - cesta k rovnováze
AGEL - cesta k rovnovázeAGEL - cesta k rovnováze
AGEL - cesta k rovnováze
Michal Kadlec
 
Gastronomy 2
Gastronomy 2Gastronomy 2
Gastronomy 2
comeniusipb
 
Re engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport serviceRe engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport service
Udara Seneviratne
 
Gun industry
Gun industryGun industry
Gun industry
Udara Seneviratne
 

Viewers also liked (9)

Plán školení technik Haier
Plán školení   technik HaierPlán školení   technik Haier
Plán školení technik Haier
 
The european union tr
The european union trThe european union tr
The european union tr
 
Pp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comeniusPp origens catalunya (anglès) comenius
Pp origens catalunya (anglès) comenius
 
AGEL - cesta k rovnováze
AGEL - cesta k rovnovázeAGEL - cesta k rovnováze
AGEL - cesta k rovnováze
 
Prezentacja EU
Prezentacja EUPrezentacja EU
Prezentacja EU
 
Gastronomy 2
Gastronomy 2Gastronomy 2
Gastronomy 2
 
Agel intro
Agel   introAgel   intro
Agel intro
 
Re engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport serviceRe engineering process of sri lankan national transport service
Re engineering process of sri lankan national transport service
 
Gun industry
Gun industryGun industry
Gun industry
 

Similar to Data mining concepts

Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptx
YashwanthKumar306128
 
What is data mining ?
What is data mining ?What is data mining ?
What is data mining ?
Johan Blomme
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
Amritanshu Mehra
 
data minig for eng with all topics and history
data minig for eng with all topics and historydata minig for eng with all topics and history
data minig for eng with all topics and history
nbaisane16
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
Vaibhav Dhattarwal
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
dataminers.ir
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
Phi Jack
 
3 marketing research
3 marketing research3 marketing research
3 marketing research
saadii410
 
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
Mark Heid
 
Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)
sadam33146
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
rameshkbudhani
 
Data mining
Data miningData mining
Data mining
Ujjwal Kumar
 
17 cs002
17 cs00217 cs002
17 cs002
TPLatchoumi
 
2011 Shopper Insights Brochure
2011 Shopper Insights Brochure2011 Shopper Insights Brochure
2011 Shopper Insights Brochure
fglick
 
Data Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large DatabasesData Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large Databases
SSA KPI
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
NIET Journal of Engineering & Technology (NIETJET)
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
IRJET Journal
 
Ch35
Ch35Ch35
Today's BI and Data Mining ecosystem
Today's BI and Data Mining ecosystemToday's BI and Data Mining ecosystem
Today's BI and Data Mining ecosystem
Josep Arroyo
 

Similar to Data mining concepts (20)

Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptx
 
What is data mining ?
What is data mining ?What is data mining ?
What is data mining ?
 
Knowledge Discovery and Data Mining
Knowledge Discovery and Data MiningKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining
 
data minig for eng with all topics and history
data minig for eng with all topics and historydata minig for eng with all topics and history
data minig for eng with all topics and history
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
Introduction To Data Mining
Introduction To Data MiningIntroduction To Data Mining
Introduction To Data Mining
 
Introduction To Data Mining
Introduction To Data Mining   Introduction To Data Mining
Introduction To Data Mining
 
3 marketing research
3 marketing research3 marketing research
3 marketing research
 
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
BSC 3362 - Big Data and Social Analytics - IOD Conference (IBM)
 
Data mining
Data miningData mining
Data mining
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
 
Data mining
Data miningData mining
Data mining
 
17 cs002
17 cs00217 cs002
17 cs002
 
2011 Shopper Insights Brochure
2011 Shopper Insights Brochure2011 Shopper Insights Brochure
2011 Shopper Insights Brochure
 
Data Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large DatabasesData Mining and Knowledge Discovery in Large Databases
Data Mining and Knowledge Discovery in Large Databases
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
Ch35
Ch35Ch35
Ch35
 
Today's BI and Data Mining ecosystem
Today's BI and Data Mining ecosystemToday's BI and Data Mining ecosystem
Today's BI and Data Mining ecosystem
 

More from Udara Seneviratne

Industrial presentation
Industrial presentationIndustrial presentation
Industrial presentation
Udara Seneviratne
 
Expert Food Analysis System
Expert Food Analysis SystemExpert Food Analysis System
Expert Food Analysis System
Udara Seneviratne
 
Eye disease expert system
Eye disease expert systemEye disease expert system
Eye disease expert system
Udara Seneviratne
 
Ayurvedic diet management system
Ayurvedic diet management systemAyurvedic diet management system
Ayurvedic diet management system
Udara Seneviratne
 
Media streaming
Media streamingMedia streaming
Media streaming
Udara Seneviratne
 
Automated Traval Ticketing System
Automated Traval Ticketing SystemAutomated Traval Ticketing System
Automated Traval Ticketing System
Udara Seneviratne
 
Business Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAirBusiness Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAir
Udara Seneviratne
 
Pros and cons of facebook
Pros and cons of facebookPros and cons of facebook
Pros and cons of facebook
Udara Seneviratne
 
Did you know....
Did you know....Did you know....
Did you know....
Udara Seneviratne
 
Add ons
Add onsAdd ons
Brain damaging habits
Brain damaging habitsBrain damaging habits
Brain damaging habits
Udara Seneviratne
 
Mobile computing
Mobile computingMobile computing
Mobile computing
Udara Seneviratne
 
Scedule feasibility
Scedule feasibilityScedule feasibility
Scedule feasibility
Udara Seneviratne
 
Environmental issues
Environmental issuesEnvironmental issues
Environmental issues
Udara Seneviratne
 
Survey report of life style of young people in badulla area
Survey report of life style of young people in badulla areaSurvey report of life style of young people in badulla area
Survey report of life style of young people in badulla area
Udara Seneviratne
 
How to succeed
How to succeedHow to succeed
How to succeed
Udara Seneviratne
 
Parents wish1
Parents wish1Parents wish1
Parents wish1
Udara Seneviratne
 
The poor man
The poor manThe poor man
The poor man
Udara Seneviratne
 
Identity styles of communication
Identity styles of communicationIdentity styles of communication
Identity styles of communication
Udara Seneviratne
 

More from Udara Seneviratne (19)

Industrial presentation
Industrial presentationIndustrial presentation
Industrial presentation
 
Expert Food Analysis System
Expert Food Analysis SystemExpert Food Analysis System
Expert Food Analysis System
 
Eye disease expert system
Eye disease expert systemEye disease expert system
Eye disease expert system
 
Ayurvedic diet management system
Ayurvedic diet management systemAyurvedic diet management system
Ayurvedic diet management system
 
Media streaming
Media streamingMedia streaming
Media streaming
 
Automated Traval Ticketing System
Automated Traval Ticketing SystemAutomated Traval Ticketing System
Automated Traval Ticketing System
 
Business Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAirBusiness Strategic Analysis of RyanAir
Business Strategic Analysis of RyanAir
 
Pros and cons of facebook
Pros and cons of facebookPros and cons of facebook
Pros and cons of facebook
 
Did you know....
Did you know....Did you know....
Did you know....
 
Add ons
Add onsAdd ons
Add ons
 
Brain damaging habits
Brain damaging habitsBrain damaging habits
Brain damaging habits
 
Mobile computing
Mobile computingMobile computing
Mobile computing
 
Scedule feasibility
Scedule feasibilityScedule feasibility
Scedule feasibility
 
Environmental issues
Environmental issuesEnvironmental issues
Environmental issues
 
Survey report of life style of young people in badulla area
Survey report of life style of young people in badulla areaSurvey report of life style of young people in badulla area
Survey report of life style of young people in badulla area
 
How to succeed
How to succeedHow to succeed
How to succeed
 
Parents wish1
Parents wish1Parents wish1
Parents wish1
 
The poor man
The poor manThe poor man
The poor man
 
Identity styles of communication
Identity styles of communicationIdentity styles of communication
Identity styles of communication
 

Recently uploaded

Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Pitangent Analytics & Technology Solutions Pvt. Ltd
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
Mydbops
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
Neo4j
 

Recently uploaded (20)

Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge GraphGraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
GraphRAG for LifeSciences Hands-On with the Clinical Knowledge Graph
 

Data mining concepts

  • 2. What is Data Mining ? Mining and discovery of new information in terms of patterns or rules from vast amounts of data. The process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositoties, using pattern recognition technologies as well as statical and methematics techniques.
  • 3. Why we mine Data ? Commercial View Point :- Lots of data is being collected and warehoused . Computers have become cheaper and more powerful. Competitive Pressure is Strong . Scientific View Point :- Data collected and stored at enormous speeds (GB/hour). Traditional techniques infeasible for raw data. Data mining may help scientists.
  • 4. On what kind of Data...? • Relational databases • Data warehouses • Transactional databases • Advanced database systems: Object-relational Spacial and Temporal Time-series Multimedia, text WWW
  • 5. What are the goals of Data mining? • Prediction e.g. sales volume, earthquakes • Identification e.g. existence of genes, system intrusions • Classification of different categories e.g. discount seeking shoppers or loyal regular shoppers in a supermarket • Optimization of limited resources such as time, space, money or materials and maximization of outputs such as sales or profits
  • 6. What are the applications of Data- Mining ? ● Marketing ● Finance  Analysis of consumer behavior  Creditworthiness of clients  Advertising campaigns  Performance analysis of finance  Targeted mailings investments  Segmentation of  Fraud detection customers, stores, or products ● Manufacturing ● Health Care  Optimization of resources  Discovering patterns in X-ray  Optimization of manufacturing images processes  Analyzing side effects of drugs  Product design based on customer  Effectiveness of treatments requirements
  • 7. What are the present commercial tools for Data Mining ? Data to knowledge SAS Oracle data-miner Intelligent miner Clementine
  • 8. How to build a data mining model? An important concept is that building a mining model is part of a larger process.
  • 9. 1. Defining the problem. Clearly define the business problem.
  • 10. 2. Preparing Data consolidate and clean the data that was identified in the Defining the Problem step.
  • 11. 3.Exploring Data Explore the prepared data .
  • 12. 4.Building Models Before you build a model, you must randomly separate the prepared data into separate training and testing datasets. You use the training dataset to build the model, and the testing dataset to test the accuracy of the model by creating prediction queries.
  • 13. 5. Exploring and validating models Explore the models that you have built and test their effectiveness.
  • 14. 6. Deploying and updating Deploy to a production models environment the models that performed the best.
  • 15. What are the major issues in Data-Mining concept ?  Mining different kinds of knowledge in databases  Interactive mining of knowledge at multiple levels of abstraction  Incorporation of background knowledge  Data mining query languages and ad-hoc data mining  Expression and visualization of data mining results  Handling noise and incomplete data  Pattern evaluation: the interestingness problem  Integration of the discovered knowledge with existing knowledge: A knowledge fusion problem  Protection of data security, integrity, and privacy
  • 16. How will be the future of Data-Mining concept? ● Active research is ongoing  Neural Networks  Regression Analysis  Genetic Algorithms ● Data mining is used in many areas today. We cannot even begin to imagine what the future holds in its womb!