SlideShare a Scribd company logo
PRESENTATION
              ON
       DATA WAREHOUSING
             AND
          DATA MINING
SUBMITTED TO:                        SUBMITTED BY:-
MRS.MANISHA BHATNAGAR
(HOD OF COMP. SCI DEPT)              MCA-III
MRS.HARKAWNALJEET KAUR               ROLL NO:9
(ASST. PROF OF COMP. SCIENCE DEPT)
CONTENTS

DATA WAREHOUSE
CHARACTERSTICS OF DATA WAREHOUSE
ARCHITECTURE OF DATA WAREHOUSE
DATA STORING IN DATA WAREHOUSE
DATA WAREHOUSE DIMESIONAL MODELLING
INSTALLING THE SERVICE MANAGER DATA
WAREHOUSE SERVER
EXAMPLE OF DATA WAREHOUSE
ADVANTAGE OF DATA WAREHOUSE
DISADVANTAGE OF DATA WAREHOUSE
CONTENTS

DATA MINING
ELEMENTS OF DATA MINING
DATA MINING PROCESS
ARCHITECTURE OF DATA MINING
ADDING THE OPTION OF DATA MINING TO A
DATABASE
ADVANTAGES OF DATA MINING
DISADVANTAGES OF DATA MINING
DATA WAREHOUSE

A data warehouse is a relational database that is designed for query
and analysis rather than for transaction processing. It usually
contains historical data derived from transaction data, but can
include data from other sources.

DEFINITION OF DATA WAREHOUSE

     “ A data warehouse is simply a single, complete, and consistent
store of data obtained from a variety of different sources and made
available to end users in a way they can understand and use it in a
business context.”

                                   BARLIEN DEVLIN, IBM CONSULTANT
CHARACTERSTICS OF DATA
              WAREHOUSING

Subject Oriented: -Data are organized according to subject instead of
application. Data warehouses are designed to help you analyze data

Integrated: -Integration is closely related to subject orientation. Data
warehouses must put data from disparate sources into a consistent
format.

Nonvolatile: - Nonvolatile means that, once entered into the data
warehouse, data should not change.

Time Variant: - Data warehouse maintains historical data which are
used to analyze the business or market trends and facilitate future
predictions.
Data Warehouse Architectures

Data warehouses and their architectures vary depending upon the
specifics of an organization's situation. Three common architectures are:

■ Data Warehouse Architecture (Basic)

■ Data Warehouse Architecture (with a Staging Area)

■ Data Warehouse Architecture (with a Staging Area and Data Marts)
Data Warehouse Architecture (Basic)

Figure shows a simple architecture for a data warehouse.
End users directly access data derived from several source
systems through the data warehouse.
Data Warehouse Architecture
                (with a Staging Area)
you need to clean and process your operational data before putting it into
the warehouse.
You can do this programmatically, although most data warehouses use
a staging area instead.
A staging area simplifies building summaries and general warehouse
management.
Figure illustrates this typical architecture.
Data Warehouse Architecture
    (with a Staging Area)
Data Warehouse Architecture
      (with a Staging Area and Data Marts)
Although the architecture in Figure is quite common, you may want to
customize your warehouse's architecture for different groups within your
organization.
You can do this by adding data marts, which are systems designed for a
particular line of business.
Figure illustrates an example where purchasing, sales, and inventories
are separated. In this example, a financial analyst might want to analyze
historical data for purchases and sales.
Data Warehouse Architecture
(with a Staging Area and Data Marts)
DATA STORING IN DATA
                WAREHOUSE
FACT TABLE: -
 The central table that contains the fact data. Fact tables represent data
usually numeric that are analyzed and examined.

DIMENSION TABLE:-
Dimension tables store the information you normally use to contain
queries.
Data Warehouse Dimensional Modelling
            (Types of Schemas)
There are four types of schemas are available in data warehouse.

                              SCHEMA




                                                      FACT
     STAR
                                                  CONSTELLATION
    SCHEMA
                                                     SCHEMA


             SNOWFLAKE                GALAXY
               SCHEMA                 SCHEMA
Star Schema

A star schema is the one in which a central fact table is sourrounded by
denormalized dimensional tables.
Snowflake schema

A snow flake schema is an enhancement of star schema by adding
additional dimensions.
Snowflake Schema



                                        Sale fact table
 geography                              Number
                                         Number
              Store
                                        Prod_id
                                         Prod_id
Id                                      store_id
             id                          store_id
State                                   quantity
             Name                        quantity
country
             Geography_id



                              product                       Brand
                            Prod_id
                            Brand_id                      Id
                            cost                          Brand
Galaxy Schema

Galaxy schema contains many fact tables with some common
dimensions (conformed dimensions). It is also known as Fact
Constellation Schema
Galaxy Schema

Retailer                                                         supplier
                                                                Supplier_id
Retail_id                                                       Name
Name                                                            country
city

            Sale fact table               Purchase fact table
            Number
             Number                       Number
                                           Number
            Prod_id
             Prod_id                      Prod_id
                                           Prod_id
            Retail_id
             Retail_id                    supplier_id
                                           supplier_id
            quantity
             quantity                     quantity
                                           quantity


                                product
                              Prod_id
                              Type
                              cost
Installing the Service Manager Data
               Warehouse Server
By using Service Manager Setup .
EXAMPLE OF DATA WAREHOUSE

McMaster’s Data Warehouse design
EXAMPLE OF DATA WAREHOUSE

ARCHITECTURE (PRODUCTION ENVIRONMENT)
We are considering implementing the following three-tier platform
which will allow us to scale horizontally in the future:
Our development environment consists of a server with 2 x Intel Xeon
2.8GHz Processors, 2GB of RAM and is running Windows 2000 –
Service Pack 4.
We are considering the following for the scaled roll-out of our
production environment.
A. Hardware
1. Server 1 - SAS® Data Server
- 4 way 64 bit 1.5Ghz Itanium2 server
- 16 Gb RAM
- 2 73 Gb Drives (RAID 1) for the OS
- 1 10/100/1Gb Cu Ethernet card
EXAMPLE OF DATA WAREHOUSE

ARCHITECTURE (PRODUCTION ENVIRONMENT)
- 1 Windows 2003 Enterprise Edition for Itanium
2 Mid-Tier (Web) Server
- 2 way 32 bit 3Ghz Xeon Server
- 4 Gb RAM
- 1 10/100/1Gb Cu Ethernet card
- 1 Windows 2003 Enterprise Edition for x86
3. SAN Drive Array (modular and can grow with the warehouse)
- 6 – 72GB Drives (RAID 5) total 360GB for SAS® and Data
EXAMPLE OF DATA WAREHOUSE

ARCHITECTURE (PRODUCTION ENVIRONMENT) 
B. Software
1. Server 1 - SAS® Data Server
- SAS® 9.1.3 
- SAS® Metadata Server 
- SAS® WorkSpace Server 
- SAS® Stored Process Server 
- Platform JobScheduler 
2. Mid -Tier Server
- SAS® Web Report Studio 
- SAS® Information Delivery Portal 
- BEA Web Logic for future SAS® SPM Platform 
- Xythos Web File System (WFS) 
 
EXAMPLE OF DATA WAREHOUSE

ARCHITECTURE (PRODUCTION ENVIRONMENT) 


3. Client –Tier Server
- SAS® Enterprise Guide 
- SAS® Add-In for Microsoft Office 
BENFITS OF DATA WAREHOUSE

1. A Data Warehouse Delivers Enhanced Business Intelligence: -
Insights  will  be  gained  through  improved  information  access. 
     Managers  and  executives  will  be  freed  from  making  their 
    decisions  based  on  limited  data  and  their  own  “gut  feelings”. 
     Decisions that affect the strategy and operations of organizations 
    will  be  based  upon  credible  facts  and  will  be  backed  up  with 
    evidence and actual organizational data.

2. A Data Warehouse Saves Time
Since business users can quickly access critical data from a 
   number of sources—all in one place—they can rapidly make 
   informed decisions on key initiatives. They won’t waste 
   precious time retrieving data from multiple sources.
BENFITS OF DATA WAREHOUSE

3. A Data Warehouse Enhances Data Quality and
    Consistency
A  data  warehouse implementation  includes  the  conversion  of 
     data  from numerous  source  systems   into  a  common 
    format.   So  you  can  have  more  confidence  in  the  accuracy 
    of  your  data.  And  accurate  data  is  the  basis  for  strong 
    business decisions.

4. A Data Warehouse Provides Historical Intelligence
A  data  warehouse  stores  large  amounts  of  historical  data  so 
    you can analyze different time periods and trends in order to 
    make  future  predictions.  Such  data  typically  cannot  be 
    stored  in  a  transactional  database  or  used  to  generate 
    reports from a transactional system.
BENFITS OF DATA WAREHOUSE

5. A Data Warehouse Generates a High ROI
Finally,  the  piece  de  resistance—return  on  investment. 
   Companies  that  have  implemented  data  warehouses  and 
   complementary  BI  systems  have generated  more 
   revenue  and  saved  more  money  than  companies  that 
   haven’t invested in BI systems and data warehouses.
DISADVANTAGES OF DATA
                WAREHOUSE
•Long initial implementation time and associated high cost

•Adding new data sources takes time and associated high cost

•Limited flexibility of use and types of users - requires multiple 
separate data marts for multiple uses and types of users

•Typically, data is static and dated

•Difficult to accommodate changes in data types and ranges, 
data source schema.
 
DATA MINING

Data  mining  is  process of  discovering  hidden, previously  unknown 
and  usable  information  from  a  large  amount  of  data.  It  is  often 
defined as finding hidden information in a database.

DEFINITION OF DATA MINING: -

“The efficient discovery of valuable non-obvious information from a 
large collection of data.”
                                                                                         [BIGUS 96]
Elements of Data mining

•Extract,  transform,  and  load  transaction  data  onto  the  data  warehouse 
system.

•Store and manage the data in a multidimensional database system.

•Provide  data  access  to  business  analysts  and  information  technology 
professionals.

•Analyze the data by application software.

•Present the data in a useful format, such as a graph or table.
 
DATA MINING PROCESS




               Slide 1 and slide 2
DATA MINING PROCESS

SELECTION:-
                            Selecting the data according to some criteria.
PREPROCESSING:-
                                  This is  the data cleansing stage where certain 
information  is  removed  which  is  unnecessary  and  may  slow  down 
queries.
TRANSFORMATION:-
                                          The data is not merely transferred across but 
transformed in that overlays may be added such as demographic overlays 
commonly used in market research.
DATA MINING PROCESS

DATA MINING:-
                                the stage is concerned with the extraction of patterns 
from the data. A pattern can be defined as given a set of facts(data) F, a 
language L, and some measure of certainty C, a pattern is a statement S 
in L that describes relationships among a subset F(s) of F with a certainty 
C.

INTERPRETATION AND EVALUTION:-
                                                                           The Patterns identified by 
the  system  are  interpreted  into  knowledge  which  can  then  be  used  to 
support human decision making.
ARCHTITECTURE OF DATA MINING

There are three tiers in the tight-coupling data mining architecture:

Data layer: data layer can be database and/or data warehouse systems.
This layer is an interface for all data sources. Data mining results are
stored in data layer so it can be presented to end-user in form of reports
or other kind of visualization.
Application layer: -Data mining application layer is used to retrieve
data from database. Some transformation routine can be performed here
to transform data into desired format.
Front-end layer: -Front-end layer provides intuitive and friendly user
interface for end-user to interact with data mining system. Data mining
result presented in visualization form to the user in the front-end layer.

 
ARCHTITECTURE OF DATA MINING
Adding the Data Mining Option to a
                 Database
Once you have installed the Oracle Database software, you can build
databases as needed. You might build a database without the Data
Mining option but later decide to add it.
Advantages of Data
                        Mining
Marketing / Retail
Data mining helps marketing companies to build models based on
historical data to predict who will respond to new marketing campaign
such as direct mail, online marketing campaign and etc.
Data mining brings a lot of benefit s to retail company in the same way
as marketing. Through market basket analysis, the store can have an
appropriate production arrangement in the way that customers can buy
frequent buying products together with pleasant.
Advantages of Data
                        Mining
Finance / Banking
Data mining gives financial institutions information about loan
information and credit reporting. By building a model from previous
customer’s data with common characteristics, the bank and financial can
estimate what are the good and/or bad loans and its risk level. In
addition, data mining can help banks to detect fraudulent credit card
transaction to help credit card’s owner prevent their losses.
Advantages of Data
                        Mining
Manufacturing
By applying data mining in operational engineering data, manufacturers
can detect faulty equipments and determine optimal control parameters.

Governments
Data mining helps government agency by digging and analyzing records
of financial transaction to build patterns that can detect money
laundering or criminal activity.
Disadvantages of data mining

Privacy Issues
The concerns about the personal privacy have been increasing
enormously recently especially when internet is booming with social
networks.
Security issues
Security is a big issue. Businesses owns information about their
employee and customers including social security number, birthday,
payroll and etc. However how properly this information is taken is still
in questions. There have been a lot of cases that hackers were accesses
and stole big data of customers from big corporation such as Ford Motor
Credit Company, Sony… with so much personal and financial
information available, the credit card stolen and identity theft become a
big problem.
 
Disadvantages of data mining

Misuse of information/inaccurate information
Information collected through data mining intended for marketing or
ethical purposes can be misused. This information is exploited by
unethical people or business to take benefit of vulnerable people or
discriminate against a group of people.
QUERIES?
   ?
THANK
 YOU

More Related Content

What's hot

Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
Empowered Holdings, LLC
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecturepcherukumalla
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Ramkrishna bhagat
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Richard Bányi
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
janani thirupathi
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Sonali Chawla
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse ArchitecturesTheju Paul
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
adivasoft
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
Kent Graziano
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
Empowered Holdings, LLC
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
VijayasankariS
 
Big data storage
Big data storageBig data storage
Big data storage
Vikram Nandini
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
James Serra
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
aksrauf
 
Data warehouse
Data warehouseData warehouse
Data warehouse
Rishabh Dogra
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 
Data warehouse
Data warehouseData warehouse
Data warehouse
shachibattar
 

What's hot (20)

Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data Warehouse Architectures
Data Warehouse ArchitecturesData Warehouse Architectures
Data Warehouse Architectures
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 
Big data storage
Big data storageBig data storage
Big data storage
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Dimensional Modeling
Dimensional ModelingDimensional Modeling
Dimensional Modeling
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Ppt
PptPpt
Ppt
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 

Viewers also liked

Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Abu Saleh
 
Vbmca204821311240
Vbmca204821311240Vbmca204821311240
Vbmca204821311240Ayushi Jain
 
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Abu Saleh
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
Karthik Srini B R
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
shaikh2016
 
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystemCh12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
Norhisham Mohamad Nordin
 
Multimedia db system
Multimedia db systemMultimedia db system
Multimedia db system
Yojana Nanaware
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Shruti Dalela
 
Data mining in agriculture
Data mining in agricultureData mining in agriculture
Data mining in agriculture
Sibananda Khatai
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
Syamsul Bahrin Zaibon
 
Multimedia database
Multimedia databaseMultimedia database
Multimedia database
Faizal Basheer
 
Lecture 3 multimedia databases
Lecture 3   multimedia databasesLecture 3   multimedia databases
Lecture 3 multimedia databasesRanjana N Jinde
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
Sunderland City Council
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
Syamsul Bahrin Zaibon
 
Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel systemManish Singh
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Salah Amean
 
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic ConceptsData Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Salah Amean
 
Multimedia Database
Multimedia Database Multimedia Database
Multimedia Database
Avnish Patel
 

Viewers also liked (20)

Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12Lecture 2, c++(complete reference,herbet sheidt)chapter-12
Lecture 2, c++(complete reference,herbet sheidt)chapter-12
 
Vbmca204821311240
Vbmca204821311240Vbmca204821311240
Vbmca204821311240
 
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15Lecture 5, c++(complete reference,herbet sheidt)chapter-15
Lecture 5, c++(complete reference,herbet sheidt)chapter-15
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
 
Distributed D B
Distributed  D BDistributed  D B
Distributed D B
 
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystemCh12.ed wk9businessintelligenceanddecisionsupportsystem
Ch12.ed wk9businessintelligenceanddecisionsupportsystem
 
Multimedia db system
Multimedia db systemMultimedia db system
Multimedia db system
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data mining in agriculture
Data mining in agricultureData mining in agriculture
Data mining in agriculture
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
 
Data mining
Data miningData mining
Data mining
 
Multimedia database
Multimedia databaseMultimedia database
Multimedia database
 
Lecture 3 multimedia databases
Lecture 3   multimedia databasesLecture 3   multimedia databases
Lecture 3 multimedia databases
 
Data Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data VisualisationData Warehousing, Data Mining & Data Visualisation
Data Warehousing, Data Mining & Data Visualisation
 
Multimedia Database
Multimedia DatabaseMultimedia Database
Multimedia Database
 
Distributed & parallel system
Distributed & parallel systemDistributed & parallel system
Distributed & parallel system
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
 
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic ConceptsData Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
Data Mining:Concepts and Techniques, Chapter 8. Classification: Basic Concepts
 
Multimedia Database
Multimedia Database Multimedia Database
Multimedia Database
 

Similar to Data warehousing

professional informatica trainer
professional informatica trainerprofessional informatica trainer
professional informatica trainer
vibrantuser
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
Prithwis Mukerjee
 
Data Warehosing -Security
Data Warehosing -SecurityData Warehosing -Security
Data Warehosing -Security
Akhi Balakrishnan
 
BW Multi-Dimensional Model
BW Multi-Dimensional ModelBW Multi-Dimensional Model
BW Multi-Dimensional Modelyujesh
 
Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008
Eduardo Castro
 
142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500
sumit621
 
Multidimentional data model
Multidimentional data modelMultidimentional data model
Multidimentional data modeljagdish_93
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
abclearnn
 
Data warehousing
Data warehousingData warehousing
Data warehousingVarun Jain
 
Oracle: Dw Design
Oracle: Dw DesignOracle: Dw Design
Oracle: Dw Design
oracle content
 
Oracle: DW Design
Oracle: DW DesignOracle: DW Design
Oracle: DW Design
DataminingTools Inc
 
Oracle data integrator project
Oracle data integrator projectOracle data integrator project
Oracle data integrator projectAmit Sharma
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouseganblues
 
Data Access Tech Ed India
Data Access   Tech Ed IndiaData Access   Tech Ed India
Data Access Tech Ed Indiarsnarayanan
 
SSRS Report with Parameters and Data Filtration
SSRS Report with Parameters and Data FiltrationSSRS Report with Parameters and Data Filtration
SSRS Report with Parameters and Data Filtration
Naji El Kotob
 
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
Amazon Web Services
 
Informix warehouse and accelerator overview
Informix warehouse and accelerator overviewInformix warehouse and accelerator overview
Informix warehouse and accelerator overview
Keshav Murthy
 
Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.ppt
nishant523869
 
introduction to datawarehouse
introduction to datawarehouseintroduction to datawarehouse
introduction to datawarehousekiran14360
 

Similar to Data warehousing (20)

professional informatica trainer
professional informatica trainerprofessional informatica trainer
professional informatica trainer
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Data Warehosing -Security
Data Warehosing -SecurityData Warehosing -Security
Data Warehosing -Security
 
BW Multi-Dimensional Model
BW Multi-Dimensional ModelBW Multi-Dimensional Model
BW Multi-Dimensional Model
 
Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008Analysis Services en SQL Server 2008
Analysis Services en SQL Server 2008
 
142230 633685297550892500
142230 633685297550892500142230 633685297550892500
142230 633685297550892500
 
Business Intelligence: A Review
Business Intelligence: A ReviewBusiness Intelligence: A Review
Business Intelligence: A Review
 
Multidimentional data model
Multidimentional data modelMultidimentional data model
Multidimentional data model
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Oracle: Dw Design
Oracle: Dw DesignOracle: Dw Design
Oracle: Dw Design
 
Oracle: DW Design
Oracle: DW DesignOracle: DW Design
Oracle: DW Design
 
Oracle data integrator project
Oracle data integrator projectOracle data integrator project
Oracle data integrator project
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data Access Tech Ed India
Data Access   Tech Ed IndiaData Access   Tech Ed India
Data Access Tech Ed India
 
SSRS Report with Parameters and Data Filtration
SSRS Report with Parameters and Data FiltrationSSRS Report with Parameters and Data Filtration
SSRS Report with Parameters and Data Filtration
 
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
AWS November Webinar Series - Advanced Analytics with Amazon Redshift and the...
 
Informix warehouse and accelerator overview
Informix warehouse and accelerator overviewInformix warehouse and accelerator overview
Informix warehouse and accelerator overview
 
Dimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.pptDimensional Modeling Concepts_Nishant.ppt
Dimensional Modeling Concepts_Nishant.ppt
 
introduction to datawarehouse
introduction to datawarehouseintroduction to datawarehouse
introduction to datawarehouse
 

Recently uploaded

Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 

Recently uploaded (20)

Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 

Data warehousing

  • 1. PRESENTATION ON DATA WAREHOUSING AND DATA MINING SUBMITTED TO: SUBMITTED BY:- MRS.MANISHA BHATNAGAR (HOD OF COMP. SCI DEPT) MCA-III MRS.HARKAWNALJEET KAUR ROLL NO:9 (ASST. PROF OF COMP. SCIENCE DEPT)
  • 2. CONTENTS DATA WAREHOUSE CHARACTERSTICS OF DATA WAREHOUSE ARCHITECTURE OF DATA WAREHOUSE DATA STORING IN DATA WAREHOUSE DATA WAREHOUSE DIMESIONAL MODELLING INSTALLING THE SERVICE MANAGER DATA WAREHOUSE SERVER EXAMPLE OF DATA WAREHOUSE ADVANTAGE OF DATA WAREHOUSE DISADVANTAGE OF DATA WAREHOUSE
  • 3. CONTENTS DATA MINING ELEMENTS OF DATA MINING DATA MINING PROCESS ARCHITECTURE OF DATA MINING ADDING THE OPTION OF DATA MINING TO A DATABASE ADVANTAGES OF DATA MINING DISADVANTAGES OF DATA MINING
  • 4. DATA WAREHOUSE A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but can include data from other sources. DEFINITION OF DATA WAREHOUSE “ A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of different sources and made available to end users in a way they can understand and use it in a business context.” BARLIEN DEVLIN, IBM CONSULTANT
  • 5. CHARACTERSTICS OF DATA WAREHOUSING Subject Oriented: -Data are organized according to subject instead of application. Data warehouses are designed to help you analyze data Integrated: -Integration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. Nonvolatile: - Nonvolatile means that, once entered into the data warehouse, data should not change. Time Variant: - Data warehouse maintains historical data which are used to analyze the business or market trends and facilitate future predictions.
  • 6. Data Warehouse Architectures Data warehouses and their architectures vary depending upon the specifics of an organization's situation. Three common architectures are: ■ Data Warehouse Architecture (Basic) ■ Data Warehouse Architecture (with a Staging Area) ■ Data Warehouse Architecture (with a Staging Area and Data Marts)
  • 7. Data Warehouse Architecture (Basic) Figure shows a simple architecture for a data warehouse. End users directly access data derived from several source systems through the data warehouse.
  • 8. Data Warehouse Architecture (with a Staging Area) you need to clean and process your operational data before putting it into the warehouse. You can do this programmatically, although most data warehouses use a staging area instead. A staging area simplifies building summaries and general warehouse management. Figure illustrates this typical architecture.
  • 9. Data Warehouse Architecture (with a Staging Area)
  • 10. Data Warehouse Architecture (with a Staging Area and Data Marts) Although the architecture in Figure is quite common, you may want to customize your warehouse's architecture for different groups within your organization. You can do this by adding data marts, which are systems designed for a particular line of business. Figure illustrates an example where purchasing, sales, and inventories are separated. In this example, a financial analyst might want to analyze historical data for purchases and sales.
  • 11. Data Warehouse Architecture (with a Staging Area and Data Marts)
  • 12. DATA STORING IN DATA WAREHOUSE FACT TABLE: - The central table that contains the fact data. Fact tables represent data usually numeric that are analyzed and examined. DIMENSION TABLE:- Dimension tables store the information you normally use to contain queries.
  • 13. Data Warehouse Dimensional Modelling (Types of Schemas) There are four types of schemas are available in data warehouse. SCHEMA FACT STAR CONSTELLATION SCHEMA SCHEMA SNOWFLAKE GALAXY SCHEMA SCHEMA
  • 14. Star Schema A star schema is the one in which a central fact table is sourrounded by denormalized dimensional tables.
  • 15. Snowflake schema A snow flake schema is an enhancement of star schema by adding additional dimensions.
  • 16. Snowflake Schema Sale fact table geography Number Number Store Prod_id Prod_id Id store_id id store_id State quantity Name quantity country Geography_id product Brand Prod_id Brand_id Id cost Brand
  • 17. Galaxy Schema Galaxy schema contains many fact tables with some common dimensions (conformed dimensions). It is also known as Fact Constellation Schema
  • 18. Galaxy Schema Retailer supplier Supplier_id Retail_id Name Name country city Sale fact table Purchase fact table Number Number Number Number Prod_id Prod_id Prod_id Prod_id Retail_id Retail_id supplier_id supplier_id quantity quantity quantity quantity product Prod_id Type cost
  • 19. Installing the Service Manager Data Warehouse Server By using Service Manager Setup .
  • 20. EXAMPLE OF DATA WAREHOUSE McMaster’s Data Warehouse design
  • 21. EXAMPLE OF DATA WAREHOUSE ARCHITECTURE (PRODUCTION ENVIRONMENT) We are considering implementing the following three-tier platform which will allow us to scale horizontally in the future: Our development environment consists of a server with 2 x Intel Xeon 2.8GHz Processors, 2GB of RAM and is running Windows 2000 – Service Pack 4. We are considering the following for the scaled roll-out of our production environment. A. Hardware 1. Server 1 - SAS® Data Server - 4 way 64 bit 1.5Ghz Itanium2 server - 16 Gb RAM - 2 73 Gb Drives (RAID 1) for the OS - 1 10/100/1Gb Cu Ethernet card
  • 22. EXAMPLE OF DATA WAREHOUSE ARCHITECTURE (PRODUCTION ENVIRONMENT) - 1 Windows 2003 Enterprise Edition for Itanium 2 Mid-Tier (Web) Server - 2 way 32 bit 3Ghz Xeon Server - 4 Gb RAM - 1 10/100/1Gb Cu Ethernet card - 1 Windows 2003 Enterprise Edition for x86 3. SAN Drive Array (modular and can grow with the warehouse) - 6 – 72GB Drives (RAID 5) total 360GB for SAS® and Data
  • 23. EXAMPLE OF DATA WAREHOUSE ARCHITECTURE (PRODUCTION ENVIRONMENT)  B. Software 1. Server 1 - SAS® Data Server - SAS® 9.1.3  - SAS® Metadata Server  - SAS® WorkSpace Server  - SAS® Stored Process Server  - Platform JobScheduler  2. Mid -Tier Server - SAS® Web Report Studio  - SAS® Information Delivery Portal  - BEA Web Logic for future SAS® SPM Platform  - Xythos Web File System (WFS)   
  • 24. EXAMPLE OF DATA WAREHOUSE ARCHITECTURE (PRODUCTION ENVIRONMENT)  3. Client –Tier Server - SAS® Enterprise Guide  - SAS® Add-In for Microsoft Office 
  • 25. BENFITS OF DATA WAREHOUSE 1. A Data Warehouse Delivers Enhanced Business Intelligence: - Insights  will  be  gained  through  improved  information  access.   Managers  and  executives  will  be  freed  from  making  their  decisions  based  on  limited  data  and  their  own  “gut  feelings”.   Decisions that affect the strategy and operations of organizations  will  be  based  upon  credible  facts  and  will  be  backed  up  with  evidence and actual organizational data. 2. A Data Warehouse Saves Time Since business users can quickly access critical data from a  number of sources—all in one place—they can rapidly make  informed decisions on key initiatives. They won’t waste  precious time retrieving data from multiple sources.
  • 26. BENFITS OF DATA WAREHOUSE 3. A Data Warehouse Enhances Data Quality and Consistency A  data  warehouse implementation  includes  the  conversion  of   data  from numerous  source  systems   into  a  common  format.   So  you  can  have  more  confidence  in  the  accuracy  of  your  data.  And  accurate  data  is  the  basis  for  strong  business decisions. 4. A Data Warehouse Provides Historical Intelligence A  data  warehouse  stores  large  amounts  of  historical  data  so  you can analyze different time periods and trends in order to  make  future  predictions.  Such  data  typically  cannot  be  stored  in  a  transactional  database  or  used  to  generate  reports from a transactional system.
  • 27. BENFITS OF DATA WAREHOUSE 5. A Data Warehouse Generates a High ROI Finally,  the  piece  de  resistance—return  on  investment.  Companies  that  have  implemented  data  warehouses  and  complementary  BI  systems  have generated  more  revenue  and  saved  more  money  than  companies  that  haven’t invested in BI systems and data warehouses.
  • 28. DISADVANTAGES OF DATA WAREHOUSE •Long initial implementation time and associated high cost •Adding new data sources takes time and associated high cost •Limited flexibility of use and types of users - requires multiple  separate data marts for multiple uses and types of users •Typically, data is static and dated •Difficult to accommodate changes in data types and ranges,  data source schema.  
  • 29. DATA MINING Data  mining  is  process of  discovering  hidden, previously  unknown  and  usable  information  from  a  large  amount  of  data.  It  is  often  defined as finding hidden information in a database. DEFINITION OF DATA MINING: - “The efficient discovery of valuable non-obvious information from a  large collection of data.”                                                                                          [BIGUS 96]
  • 30. Elements of Data mining •Extract,  transform,  and  load  transaction  data  onto  the  data  warehouse  system. •Store and manage the data in a multidimensional database system. •Provide  data  access  to  business  analysts  and  information  technology  professionals. •Analyze the data by application software. •Present the data in a useful format, such as a graph or table.  
  • 31. DATA MINING PROCESS Slide 1 and slide 2
  • 32. DATA MINING PROCESS SELECTION:-                             Selecting the data according to some criteria. PREPROCESSING:-                                   This is  the data cleansing stage where certain  information  is  removed  which  is  unnecessary  and  may  slow  down  queries. TRANSFORMATION:-                                          The data is not merely transferred across but  transformed in that overlays may be added such as demographic overlays  commonly used in market research.
  • 33. DATA MINING PROCESS DATA MINING:-                                 the stage is concerned with the extraction of patterns  from the data. A pattern can be defined as given a set of facts(data) F, a  language L, and some measure of certainty C, a pattern is a statement S  in L that describes relationships among a subset F(s) of F with a certainty  C. INTERPRETATION AND EVALUTION:-                                                                            The Patterns identified by  the  system  are  interpreted  into  knowledge  which  can  then  be  used  to  support human decision making.
  • 34. ARCHTITECTURE OF DATA MINING There are three tiers in the tight-coupling data mining architecture: Data layer: data layer can be database and/or data warehouse systems. This layer is an interface for all data sources. Data mining results are stored in data layer so it can be presented to end-user in form of reports or other kind of visualization. Application layer: -Data mining application layer is used to retrieve data from database. Some transformation routine can be performed here to transform data into desired format. Front-end layer: -Front-end layer provides intuitive and friendly user interface for end-user to interact with data mining system. Data mining result presented in visualization form to the user in the front-end layer.  
  • 36. Adding the Data Mining Option to a Database Once you have installed the Oracle Database software, you can build databases as needed. You might build a database without the Data Mining option but later decide to add it.
  • 37. Advantages of Data Mining Marketing / Retail Data mining helps marketing companies to build models based on historical data to predict who will respond to new marketing campaign such as direct mail, online marketing campaign and etc. Data mining brings a lot of benefit s to retail company in the same way as marketing. Through market basket analysis, the store can have an appropriate production arrangement in the way that customers can buy frequent buying products together with pleasant.
  • 38. Advantages of Data Mining Finance / Banking Data mining gives financial institutions information about loan information and credit reporting. By building a model from previous customer’s data with common characteristics, the bank and financial can estimate what are the good and/or bad loans and its risk level. In addition, data mining can help banks to detect fraudulent credit card transaction to help credit card’s owner prevent their losses.
  • 39. Advantages of Data Mining Manufacturing By applying data mining in operational engineering data, manufacturers can detect faulty equipments and determine optimal control parameters. Governments Data mining helps government agency by digging and analyzing records of financial transaction to build patterns that can detect money laundering or criminal activity.
  • 40. Disadvantages of data mining Privacy Issues The concerns about the personal privacy have been increasing enormously recently especially when internet is booming with social networks. Security issues Security is a big issue. Businesses owns information about their employee and customers including social security number, birthday, payroll and etc. However how properly this information is taken is still in questions. There have been a lot of cases that hackers were accesses and stole big data of customers from big corporation such as Ford Motor Credit Company, Sony… with so much personal and financial information available, the credit card stolen and identity theft become a big problem.  
  • 41. Disadvantages of data mining Misuse of information/inaccurate information Information collected through data mining intended for marketing or ethical purposes can be misused. This information is exploited by unethical people or business to take benefit of vulnerable people or discriminate against a group of people.
  • 42. QUERIES? ?