SlideShare a Scribd company logo
1 of 19
GROUP MEMBERS
NAMES ROLL-NO
WASIM RAZA BSCS-13-F-293
SHAHRUKH IDREES BSCS-13-F-281
KALSOOM AKHLAQ BSCS-13-F-252
SANILA SAJID BSCS-13-F-274
RABIYA FATIMA BSCS-13-F-255
HAFSA QURESHI BSCS-13-F-248
What is data mining?
 Data mining is also called knowledge
discovery and data mining (KDD)
 extraction of useful patterns from data
sources, e.g., databases, texts, web,
image.
 Patterns must be:
 valid, novel, potentially useful,
understandable
Why data mining now?
 The data is abundant.
 The data is being warehoused.
 The computing power is affordable.
 The competitive pressure is strong.
 Data mining tools have become
available
DATAWARE HOUSING
The electronic storage of a large
amount of information by a business.
Warehoused data must be
stored in a manner that is
secure, reliable, easy to
retrieve and easy to manage.
DATABASE
A database is a collection of information
that is organized so that it can
easily be accessed, managed,
and updated.
DIFF B/W DWH & DATA MINING
 Process to compiling
& organizing data into
same database.
 Engineering phase
but no business users
are involve.
 Process of designing,
how to data stored to
improve reporting.
 Process to extracting
meaningful data from
database.
 Business users but
with the assistance of
engineer.
 Statical analysis,
analyst use technical
to query and sort
through Terabytes of
data looking for
patterns.
DIFF B/W DB & DATA MINING
 It involves day-to-day
processing.
 It is used to run the
business.
 It provides primitive
and highly detailed
data.
 It is based on Entity
Relationship Model.
 It contains current
data.
 extraction of
previously unknown
and interesting
information from raw
data.
 Due to the
exponential growth of
data, especially in
areas such as
business.
 convert this large
wealth of data in to
business intelligence.
Data mining applications
 Marketing, customer profiling and
retention, identifying potential
customers, market segmentation.
 Fraud detection
identifying credit card fraud, intrusion
detection
 Scientific data analysis
 Text and web mining
 Any application that involves a large
amount of data …
ADVANTAGES OF DATA
MINING
 Marking/Retailing: Data mining can aid
direct marketers by providing them with useful and
accurate trends about their customers’ purchasing
behavior.
 Banking/Crediting: Data mining can assist
financial institutions in areas such as credit
reporting and loan information.
ADVANTAGEOUS OF DATA
MINING (cont…)
 Law enforcement: Data mining can aid
law enforcers in identifying criminal suspects
as well as apprehending these criminals by
examining trends in location, crime type, habit,
and other patterns of behaviors.
 Researchers: Data mining can assist
researchers by speeding up their data analyzing
process; thus, allowing them more time to work on
other projects.
DIS-ADVANTAGEOUS OF DATA
MINING
 Privacy Issues: For example, according to
Washing Post, in 1998, CVS had sold their
patient’s prescription purchases to a different
company ….
 Security issues: Although companies have
a lot of personal information about us available
online, they do not have sufficient security systems
in place to protect that information.
DIS-ADVANTAGEOUS OF DATA
MINING (cont….)
 Misuse of information: Some of the
company will answer your phone based on your
purchase history. If you have spent a lot of money
or buying
a lot of product from one company, your call will be
answered really soon. So you should not think that
your call is really being answer in the order in
which it was receive.
DATA MINING TECHNIQUES
 Classification:
Classification is learning rules that can be applied to
new data and will typically include following steps:
preprocessing of data, designing modeling,
learning/feature selection and Evaluation/validation.
 Association:
Association is looking for relationships between
variables.
 Clustering:
Clustering is identifying similar groups from
unstructured data.
DATA MINING
TECHNIQUES(CONT…)
 Sequential pattern mining:
A sequential rule: A B, says that event A will be
immediately followed by event B with a certain
confidence.
 Deviation detection:
discovering the most significant changes in data.
 Regressions:
Regression is finding functions with minimal error
to model data .
REFERENCES
 http://www.differencebetween.com/differ
ence-between-dbms-and-vs-data-
mining/
 www.sims.berkeley.edu/how-much-info
 http://www.programmerinterview.com/ind
ex.php/database-sql/data-mining-vs-
warehousing/
ANY QUESTION…..?

More Related Content

Similar to Data mining

Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
Ashok Kumar
 

Similar to Data mining (20)

IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
notes_dmdw_chap1.docx
notes_dmdw_chap1.docxnotes_dmdw_chap1.docx
notes_dmdw_chap1.docx
 
Abstract
AbstractAbstract
Abstract
 
A Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining PresentationA Practical Approach To Data Mining Presentation
A Practical Approach To Data Mining Presentation
 
Big Data
Big DataBig Data
Big Data
 
DWM
DWMDWM
DWM
 
Data mining
Data miningData mining
Data mining
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Modern trends in information systems
Modern trends in information systemsModern trends in information systems
Modern trends in information systems
 
Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?Big Data: How does it fit in your data strategy?
Big Data: How does it fit in your data strategy?
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
 
Big data
Big dataBig data
Big data
 
Big data vs datawarehousing
Big data vs datawarehousingBig data vs datawarehousing
Big data vs datawarehousing
 
Big data vs datawarehousing
Big data vs datawarehousingBig data vs datawarehousing
Big data vs datawarehousing
 
Lec 1 introduction
Lec 1 introductionLec 1 introduction
Lec 1 introduction
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 
krithi-talk-impact.ppt
krithi-talk-impact.pptkrithi-talk-impact.ppt
krithi-talk-impact.ppt
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Data mining

  • 1.
  • 2. GROUP MEMBERS NAMES ROLL-NO WASIM RAZA BSCS-13-F-293 SHAHRUKH IDREES BSCS-13-F-281 KALSOOM AKHLAQ BSCS-13-F-252 SANILA SAJID BSCS-13-F-274 RABIYA FATIMA BSCS-13-F-255 HAFSA QURESHI BSCS-13-F-248
  • 3.
  • 4. What is data mining?  Data mining is also called knowledge discovery and data mining (KDD)  extraction of useful patterns from data sources, e.g., databases, texts, web, image.  Patterns must be:  valid, novel, potentially useful, understandable
  • 5. Why data mining now?  The data is abundant.  The data is being warehoused.  The computing power is affordable.  The competitive pressure is strong.  Data mining tools have become available
  • 6. DATAWARE HOUSING The electronic storage of a large amount of information by a business. Warehoused data must be stored in a manner that is secure, reliable, easy to retrieve and easy to manage.
  • 7. DATABASE A database is a collection of information that is organized so that it can easily be accessed, managed, and updated.
  • 8. DIFF B/W DWH & DATA MINING  Process to compiling & organizing data into same database.  Engineering phase but no business users are involve.  Process of designing, how to data stored to improve reporting.  Process to extracting meaningful data from database.  Business users but with the assistance of engineer.  Statical analysis, analyst use technical to query and sort through Terabytes of data looking for patterns.
  • 9. DIFF B/W DB & DATA MINING  It involves day-to-day processing.  It is used to run the business.  It provides primitive and highly detailed data.  It is based on Entity Relationship Model.  It contains current data.  extraction of previously unknown and interesting information from raw data.  Due to the exponential growth of data, especially in areas such as business.  convert this large wealth of data in to business intelligence.
  • 10.
  • 11. Data mining applications  Marketing, customer profiling and retention, identifying potential customers, market segmentation.  Fraud detection identifying credit card fraud, intrusion detection  Scientific data analysis  Text and web mining  Any application that involves a large amount of data …
  • 12. ADVANTAGES OF DATA MINING  Marking/Retailing: Data mining can aid direct marketers by providing them with useful and accurate trends about their customers’ purchasing behavior.  Banking/Crediting: Data mining can assist financial institutions in areas such as credit reporting and loan information.
  • 13. ADVANTAGEOUS OF DATA MINING (cont…)  Law enforcement: Data mining can aid law enforcers in identifying criminal suspects as well as apprehending these criminals by examining trends in location, crime type, habit, and other patterns of behaviors.  Researchers: Data mining can assist researchers by speeding up their data analyzing process; thus, allowing them more time to work on other projects.
  • 14. DIS-ADVANTAGEOUS OF DATA MINING  Privacy Issues: For example, according to Washing Post, in 1998, CVS had sold their patient’s prescription purchases to a different company ….  Security issues: Although companies have a lot of personal information about us available online, they do not have sufficient security systems in place to protect that information.
  • 15. DIS-ADVANTAGEOUS OF DATA MINING (cont….)  Misuse of information: Some of the company will answer your phone based on your purchase history. If you have spent a lot of money or buying a lot of product from one company, your call will be answered really soon. So you should not think that your call is really being answer in the order in which it was receive.
  • 16. DATA MINING TECHNIQUES  Classification: Classification is learning rules that can be applied to new data and will typically include following steps: preprocessing of data, designing modeling, learning/feature selection and Evaluation/validation.  Association: Association is looking for relationships between variables.  Clustering: Clustering is identifying similar groups from unstructured data.
  • 17. DATA MINING TECHNIQUES(CONT…)  Sequential pattern mining: A sequential rule: A B, says that event A will be immediately followed by event B with a certain confidence.  Deviation detection: discovering the most significant changes in data.  Regressions: Regression is finding functions with minimal error to model data .
  • 18. REFERENCES  http://www.differencebetween.com/differ ence-between-dbms-and-vs-data- mining/  www.sims.berkeley.edu/how-much-info  http://www.programmerinterview.com/ind ex.php/database-sql/data-mining-vs- warehousing/