SlideShare a Scribd company logo
Data Mining is the mining, or discovery, of new information in terms of patterns or rules
from vast amounts of data. T1o be useful, data mining must be carried out efficiently on large
files and databases. Eg: using neural network , some mathematical algorithm to mine on data
and analyzing data. That result extracting of data increasing productivity and efficiency..
eg: socail network: facebook, linked in, twitter. People as a data . Extracting data for valuable
busineess resource
Goals of Data Mining
 Prediction: Determine how certain attributes will behave in the future. For example,
how much sales volume a store will generate in a given period.
 Identification: Identify patterns in data. For example, newly wed couples tend to
spend more money buying furniture.
 Classification: Partition data into classes. For example, customers can be classified
into different categories with different behavior in shopping.
Eg:customer in supermarket can be categorized into discount seeking, shoppers,
shopper in rush, loyal regular shopper, infrequent shopper.
 Optimization: Optimize the use of limited resources such as time, space, money or
materials. For example, how to best use advertising to maximize profits (sales).
Types of Knowledge Discovered during Data Mining
 Association rules: For example, when a male shopper buys a new car, he is likely to
buy a car CD.
 Classification hierarchies: For example, mutual funds may be classified into three
categories: growth, income and stable. In banking application, customer applying for
credit card can be classified as risk,fail risk and good risk.
 Sequence patterns: Sequence patterns are temporal associations. For example, if
mortgage interest rate drops, within six months period the sales of houses will
increase by certain percentage.
 Patterns within time series: such as stock price data behavior in time.
 Detection of Similarity, or segmentation (Clustering): A population of events or
item can be partitioned into similar set of elements .For example, health data may
indicate similarity among subgroups of people.
1 http://sumanastani.com.np
s
Applications of Data Mining
 Marketing
 Finance
 Manufacturing
 Health Care
Commercial Data Mining Tools
Intelligent Miner from IBM applies classification and association rules to detect rules and
patterns and make predictions.
Enterprise Miner from SAS applies decision trees, neural nets, clustering techniques, statistics,
association rules.
Many new tools are coming out on the market in recent years, making data mining a very
active research and development area.
What is 'Data Warehousing'
Data warehousing is the electronic storage of a large amount of information by a business that
help in future decision making. Warehoused data must be stored in a manner that is secure,
reliable, easy to retrieve and easy to manage
A data warehouse is a:
 subject-oriented
 integrated
 timevarying
 non-volatilecollection of data in support of the management's decision-making
process.
A data warehouse is a centralized repository that stores data from multiple
information sources and transforms them into a common, multidimensional data
model for efficient querying and analysis.
DATAWARE HOUSE VS DATABASE
Database
1.Database are collection of data organized in some way.
2.Used for Online Transactional Processing (OLTP) include insert, delete, update and
other queries. but can be used for other purposes such as Data Warehousing. This records
the data from the user for history.
3.The tables and joins are complex since they are normalized (for RDMS). This is done to
reduce redundant data and to save storage space.
4. Database Desigh :Entity – Relational modeling techniques are used for RDMS database
design.
5.Optimized for write operation.
6.Performance is low for analysis queries.
7.Data are volatile: changes frequently
Data Warehouse
1.DataWare house is an effective collection of data that facilitates reporting and analysis
for future decision.
2.Used for Online Analytical Processing (OLAP). This reads the historical data for the
Users for business decisions.
3.The Tables and joins are simple since they are de-normalized. This is done to reduce the
response time for analytical queries.
4.Database Design : Data – Modeling techniques are used for the Data Warehouse design.
5.Optimized for read operations.
6.High performance for analytical queries.
7.Data are non-volatile: changes less often.
Characterstics
subject-oriented : A data warehouse can be used to analyze a particular subject area. For
example, “sales” can be a particular subject.
integrated : A data warehouse integrates data from multiple data sources. For example,
source A and source B may have different ways of identifying a product, but in a data
warehouse, there will be only a single way of identifying a product.
It is consistent in the way that data from several sources is extracted and transformed. For
example, coding conventions are standardized: M _ male, F _ female.
Timevarying : Historical data is kept in a data warehouse. For example, one can retrieve data
from 3 months, 6 months, 12 months, or even older data from a data warehouse. This
contrasts with a transactions system, where often only the most recent data is kept. For
example, a transaction system may hold the most recent address of a customer, where a data
warehouse can hold all addresses associated with a customer.
Data are organized by various time-periods (e.g. months).
Non-volatile : Once data is in the data warehouse, it will not change. So, historical data in a
data warehouse should never be altered.
collection of data in support of the management's decision-making process.
A data warehouse is a centralized repository that stores data from multiple information
sources and transforms them into a common, multidimensional data model for efficient
querying and analysis.
Other extra charcter:
1.Client Server Architecture
2.Transperency
3.Flexible reporting
4.Multi user support
Function of Data Ware house.(RDSSSD)
1. Roll Up: Data are summarized with generalization like weekly=>monthly=>annualy
2. Drill Down: Complement of roll up. Opposite
3. Pivot : cross tabulation(roatation) can be performed
4. Slice and Dice : projection operation is performed on the dimension
5. Sorting : data is sorted in some order(ascend/descend)
6. Selection: data is available by value or range
7. Derived computed attributes: Attributes are composed by operation on stored derived
value.

More Related Content

What's hot

Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
Shubha Brota Raha
 
introduction to data warehousing and mining
 introduction to data warehousing and mining introduction to data warehousing and mining
introduction to data warehousing and mining
Rajesh Chandra
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
Snehali Chake
 
Data warehouse
Data warehouseData warehouse
Data warehouse
safaataamsah
 
Data mining
Data miningData mining
Data mining
Cloudbells.com
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)
yesheeka
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
dataware
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Pradnya Saval
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data WarehousingAswathy S Nair
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data mining
maxonlinetr
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation
Pralhad Rijal
 
Data mining
Data mining Data mining
Data mining
AthiraR23
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
Dung Nguyen
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
Fazle Rabbi Ador
 
DMML1_overview.ppt
DMML1_overview.pptDMML1_overview.ppt
DMML1_overview.pptbutest
 
Databases
DatabasesDatabases
DatabasesUMaine
 
Data mining introduction
Data mining introductionData mining introduction
Data mining introduction
Basma Gamal
 

What's hot (19)

Data mining & data warehousing
Data mining & data warehousingData mining & data warehousing
Data mining & data warehousing
 
introduction to data warehousing and mining
 introduction to data warehousing and mining introduction to data warehousing and mining
introduction to data warehousing and mining
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data mining
Data miningData mining
Data mining
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)Data mining 2 - Data warehouse (cheat sheet - printable)
Data mining 2 - Data warehouse (cheat sheet - printable)
 
Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
Difference between data warehouse and data mining
Difference between data warehouse and data miningDifference between data warehouse and data mining
Difference between data warehouse and data mining
 
What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation What is Data mining? Data mining Presentation
What is Data mining? Data mining Presentation
 
Data mining
Data mining Data mining
Data mining
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
DMML1_overview.ppt
DMML1_overview.pptDMML1_overview.ppt
DMML1_overview.ppt
 
Databases
DatabasesDatabases
Databases
 
Data mining introduction
Data mining introductionData mining introduction
Data mining introduction
 

Similar to Data miningvs datawarehouse

Data warehouse
Data warehouseData warehouse
Data warehouse
RajThakuri
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
Modern trends in information systems
Modern trends in information systemsModern trends in information systems
Modern trends in information systems
Preeti Sontakke
 
TOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdfTOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdf
SCITprojects2022
 
data warehousing and data mining (1).pdf
data warehousing and data mining (1).pdfdata warehousing and data mining (1).pdf
data warehousing and data mining (1).pdf
SCITprojects2022
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
Satyam Jaiswal
 
notes_dmdw_chap1.docx
notes_dmdw_chap1.docxnotes_dmdw_chap1.docx
notes_dmdw_chap1.docx
Abshar Fatima
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
SOMASUNDARAM T
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
SHIKHA GAUTAM
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Unit 1
Unit 1Unit 1
Unit 1
DrPrabu M
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Shruti Dalela
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
AyushMeraki1
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
MadhuriNigam1
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Sukirti Garg
 
Mining internal sources of data
Mining internal sources of dataMining internal sources of data
Mining internal sources of data
nomanbhutta
 
Data Mining
Data MiningData Mining
Data Mining
SOMASUNDARAM T
 

Similar to Data miningvs datawarehouse (20)

Abstract
AbstractAbstract
Abstract
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data mining notes
Data mining notesData mining notes
Data mining notes
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Modern trends in information systems
Modern trends in information systemsModern trends in information systems
Modern trends in information systems
 
TOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdfTOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdf
 
data warehousing and data mining (1).pdf
data warehousing and data mining (1).pdfdata warehousing and data mining (1).pdf
data warehousing and data mining (1).pdf
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 
notes_dmdw_chap1.docx
notes_dmdw_chap1.docxnotes_dmdw_chap1.docx
notes_dmdw_chap1.docx
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Unit 1
Unit 1Unit 1
Unit 1
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Mining internal sources of data
Mining internal sources of dataMining internal sources of data
Mining internal sources of data
 
Data Mining
Data MiningData Mining
Data Mining
 

Recently uploaded

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 

Recently uploaded (20)

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 

Data miningvs datawarehouse

  • 1. Data Mining is the mining, or discovery, of new information in terms of patterns or rules from vast amounts of data. T1o be useful, data mining must be carried out efficiently on large files and databases. Eg: using neural network , some mathematical algorithm to mine on data and analyzing data. That result extracting of data increasing productivity and efficiency.. eg: socail network: facebook, linked in, twitter. People as a data . Extracting data for valuable busineess resource Goals of Data Mining  Prediction: Determine how certain attributes will behave in the future. For example, how much sales volume a store will generate in a given period.  Identification: Identify patterns in data. For example, newly wed couples tend to spend more money buying furniture.  Classification: Partition data into classes. For example, customers can be classified into different categories with different behavior in shopping. Eg:customer in supermarket can be categorized into discount seeking, shoppers, shopper in rush, loyal regular shopper, infrequent shopper.  Optimization: Optimize the use of limited resources such as time, space, money or materials. For example, how to best use advertising to maximize profits (sales). Types of Knowledge Discovered during Data Mining  Association rules: For example, when a male shopper buys a new car, he is likely to buy a car CD.  Classification hierarchies: For example, mutual funds may be classified into three categories: growth, income and stable. In banking application, customer applying for credit card can be classified as risk,fail risk and good risk.  Sequence patterns: Sequence patterns are temporal associations. For example, if mortgage interest rate drops, within six months period the sales of houses will increase by certain percentage.  Patterns within time series: such as stock price data behavior in time.  Detection of Similarity, or segmentation (Clustering): A population of events or item can be partitioned into similar set of elements .For example, health data may indicate similarity among subgroups of people. 1 http://sumanastani.com.np s
  • 2. Applications of Data Mining  Marketing  Finance  Manufacturing  Health Care Commercial Data Mining Tools Intelligent Miner from IBM applies classification and association rules to detect rules and patterns and make predictions. Enterprise Miner from SAS applies decision trees, neural nets, clustering techniques, statistics, association rules. Many new tools are coming out on the market in recent years, making data mining a very active research and development area. What is 'Data Warehousing' Data warehousing is the electronic storage of a large amount of information by a business that help in future decision making. Warehoused data must be stored in a manner that is secure, reliable, easy to retrieve and easy to manage A data warehouse is a:  subject-oriented  integrated  timevarying  non-volatilecollection of data in support of the management's decision-making process. A data warehouse is a centralized repository that stores data from multiple information sources and transforms them into a common, multidimensional data model for efficient querying and analysis.
  • 3. DATAWARE HOUSE VS DATABASE Database 1.Database are collection of data organized in some way. 2.Used for Online Transactional Processing (OLTP) include insert, delete, update and other queries. but can be used for other purposes such as Data Warehousing. This records the data from the user for history. 3.The tables and joins are complex since they are normalized (for RDMS). This is done to reduce redundant data and to save storage space. 4. Database Desigh :Entity – Relational modeling techniques are used for RDMS database design. 5.Optimized for write operation. 6.Performance is low for analysis queries. 7.Data are volatile: changes frequently Data Warehouse 1.DataWare house is an effective collection of data that facilitates reporting and analysis for future decision. 2.Used for Online Analytical Processing (OLAP). This reads the historical data for the Users for business decisions. 3.The Tables and joins are simple since they are de-normalized. This is done to reduce the response time for analytical queries. 4.Database Design : Data – Modeling techniques are used for the Data Warehouse design. 5.Optimized for read operations. 6.High performance for analytical queries. 7.Data are non-volatile: changes less often. Characterstics subject-oriented : A data warehouse can be used to analyze a particular subject area. For example, “sales” can be a particular subject. integrated : A data warehouse integrates data from multiple data sources. For example, source A and source B may have different ways of identifying a product, but in a data warehouse, there will be only a single way of identifying a product. It is consistent in the way that data from several sources is extracted and transformed. For example, coding conventions are standardized: M _ male, F _ female. Timevarying : Historical data is kept in a data warehouse. For example, one can retrieve data from 3 months, 6 months, 12 months, or even older data from a data warehouse. This contrasts with a transactions system, where often only the most recent data is kept. For example, a transaction system may hold the most recent address of a customer, where a data warehouse can hold all addresses associated with a customer.
  • 4. Data are organized by various time-periods (e.g. months). Non-volatile : Once data is in the data warehouse, it will not change. So, historical data in a data warehouse should never be altered. collection of data in support of the management's decision-making process. A data warehouse is a centralized repository that stores data from multiple information sources and transforms them into a common, multidimensional data model for efficient querying and analysis. Other extra charcter: 1.Client Server Architecture 2.Transperency 3.Flexible reporting 4.Multi user support Function of Data Ware house.(RDSSSD) 1. Roll Up: Data are summarized with generalization like weekly=>monthly=>annualy 2. Drill Down: Complement of roll up. Opposite 3. Pivot : cross tabulation(roatation) can be performed 4. Slice and Dice : projection operation is performed on the dimension 5. Sorting : data is sorted in some order(ascend/descend) 6. Selection: data is available by value or range 7. Derived computed attributes: Attributes are composed by operation on stored derived value.