Data Mining and Warehousing
(UCA15E04)
Unit 5 – Tuning the Data Warehouse
Prepared by
Dr. K. Puspalatha, Mrs. K. Ponveni, Mrs. J. Shyamala Devi
Difficulties in Data Warehouse Tuning
Tuning a data warehouse is difficult for the following reasons:
• A data warehouse is dynamic; it never remains constant.
• It is very difficult to predict what queries users will run in the future.
• Business requirements change with time.
• Users and their profiles keep changing.
• A user can switch from one group to another.
• The data load on the warehouse also changes with time.
Performance Assessment
Objective measures of performance:
• Average query response time
• Scan rates
• I/O throughput rates
• Time used per query per day
• Memory usage per process
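A minimal sketch of how the first two measures could be computed from a query log. The log layout (query id, elapsed seconds, rows scanned) and the sample values are assumptions made for illustration; they are not from the slides.

```python
from statistics import mean

# Hypothetical query log: (query id, elapsed seconds, rows scanned).
query_log = [
    ("q1", 2.0, 1_000_000),
    ("q2", 4.0, 3_000_000),
    ("q3", 6.0, 2_000_000),
]

def average_response_time(log):
    """Mean elapsed time per query, in seconds."""
    return mean(elapsed for _, elapsed, _ in log)

def scan_rate(log):
    """Rows scanned per second over the whole log."""
    total_rows = sum(rows for _, _, rows in log)
    total_time = sum(elapsed for _, elapsed, _ in log)
    return total_rows / total_time

avg_seconds = average_response_time(query_log)
rows_per_second = scan_rate(query_log)
print(avg_seconds, rows_per_second)
```

Measuring from a log rather than from single runs keeps the numbers comparable from day to day.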
Performance Assessment (contd.)
• Specify the measures in the service level agreement (SLA).
• There is no point in tuning response times that are already better than those required.
• Keep expectations realistic when assessing performance.
• Make sure the expectations are also feasible.
• Use aggregations and views, so that the user need not know the complexity of the system.
• Remember that a user can always write a query you have not tuned for.
TUNING DATA LOAD
Why tune the data load?
• Speed up ad hoc and fixed queries
• Optimize hardware performance
• Increase the efficiency of the loading process
• Ensure data is consistent
• Avoid duplication of data
• Reduce operational cost
• Avoid bottlenecks
Data flow through the data warehouse
[Figure: data flow diagram. Labels: data sources (Oracle, MS Access, DB2), extraction utilities, metadata, detail records, data warehouse, warehouse server.]
Steps in Tuning
• Preallocate space for the table
• Allocate sufficient memory
• Create DBWR (database writer) processes
• Remove any unnecessary:
  • Triggers
  • Constraints
• Remove any indexes on the tables
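A rough sketch of the index step, using Python's built-in sqlite3 module as a stand-in for the warehouse database. The table, index, and data are hypothetical. Space preallocation and DBWR processes are specific to the database server and have no equivalent in this toy setup.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (sale_id INTEGER, store_id INTEGER, amount REAL)")
conn.execute("CREATE INDEX idx_sales_store ON sales (store_id)")

# Hypothetical batch of rows to load.
rows = [(i, i % 10, float(i)) for i in range(10_000)]

# 1. Drop the index so it is not maintained row by row during the load.
conn.execute("DROP INDEX idx_sales_store")

# 2. Load the whole batch in one transaction.
with conn:
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

# 3. Rebuild the index once, after the data is in place.
conn.execute("CREATE INDEX idx_sales_store ON sales (store_id)")

loaded = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
index_names = [r[1] for r in conn.execute("PRAGMA index_list('sales')")]
print(loaded, index_names)
```

Whether dropping and rebuilding is faster than loading with the index in place depends on the batch size and the database, so it is worth timing both.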
Tuning Data Load involves
• Performing consistency and integrity checks
• Creating indexes and partitions
• Creating business views
• Denormalizing where appropriate
• Building aggregation and summary tables
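Three of these tasks (an integrity check, a summary table, and a business view) can be sketched with Python's built-in sqlite3 module. The schema and rows are hypothetical and only illustrate the idea.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE store (store_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE sales (sale_id INTEGER, store_id INTEGER, amount REAL);
    INSERT INTO store VALUES (1, 'North'), (2, 'South');
    INSERT INTO sales VALUES (1, 1, 100.0), (2, 1, 50.0), (3, 2, 70.0), (4, 9, 5.0);
""")

# Integrity check: sales rows that reference a store that does not exist.
orphans = conn.execute("""
    SELECT COUNT(*) FROM sales s
    LEFT JOIN store st ON st.store_id = s.store_id
    WHERE st.store_id IS NULL
""").fetchone()[0]

# Summary table: total sales per region, built once after the load.
conn.execute("""
    CREATE TABLE sales_by_region AS
    SELECT st.region AS region, SUM(s.amount) AS total_amount
    FROM sales s JOIN store st ON st.store_id = s.store_id
    GROUP BY st.region
""")

# Business view: users query this name and never see the join.
conn.execute(
    "CREATE VIEW v_regional_sales AS "
    "SELECT region, total_amount FROM sales_by_region"
)

totals = conn.execute(
    "SELECT region, total_amount FROM v_regional_sales ORDER BY region"
).fetchall()
print(orphans, totals)
```

The orphaned row (store 9) is reported by the check and is left out of the summary because the inner join drops it.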
Tuning Queries
• Fixed queries: clearly defined and well understood
• Ad hoc queries: unpredictable in quantity and frequency
QUERY PERFORMANCE
Unexpectedly long-running queries can be caused by:
• Slow network connection
• Slow-running queries
• Lack of useful statistics
• Out-of-date statistics
• Lack of useful indexes
• Lack of useful data striping
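To illustrate the index and statistics causes, the sketch below asks SQLite for the query plan before and after an index is created and statistics are refreshed. SQLite stands in for the warehouse database here, the table and data are hypothetical, and other databases expose plans through their own commands.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (sale_id INTEGER, store_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [(i, i % 100, float(i)) for i in range(5_000)],
)

QUERY = "SELECT SUM(amount) FROM sales WHERE store_id = 7"

def plan(connection, sql):
    """Return the plan steps SQLite reports for a statement."""
    return [row[3] for row in connection.execute("EXPLAIN QUERY PLAN " + sql)]

plan_without_index = plan(conn, QUERY)

conn.execute("CREATE INDEX idx_sales_store ON sales (store_id)")
conn.execute("ANALYZE")  # refresh the optimizer statistics

plan_with_index = plan(conn, QUERY)
print(plan_without_index, plan_with_index)
```

The exact plan text varies between SQLite versions, but the second plan should mention `idx_sales_store` while the first cannot.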
Fixed Queries
• Fixed queries are well defined. Examples of fixed queries:
  • Regular reports
  • Canned queries
  • Common aggregations
• Tuning fixed queries in a data warehouse is the same as in an RDBMS.
• The difference is that the amount of data to be queried may be different.
• It is good to store the most successful execution plan.
• Storing it helps you spot changes in data size and data skew, since these cause the execution plan to change.
Little more can be done on the fact table, but when dealing with dimension tables or aggregations, the usual collection of SQL tweaks, storage mechanisms, and access methods can be used to tune these queries.
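Data skew can also be watched directly. The sketch below uses one possible measure (the size of the largest group relative to an even spread); the measure and the sample columns are assumptions for illustration, not something the slides prescribe.

```python
from collections import Counter

def skew_ratio(values):
    """Size of the largest group relative to its size under an even spread.

    1.0 means the values are evenly spread; larger means more skew.
    """
    counts = Counter(values)
    even_share = len(values) / len(counts)
    return max(counts.values()) / even_share

# Hypothetical store_id columns from two loads of the same table.
last_month = [1, 2, 3, 4] * 25            # 100 rows, evenly spread
this_month = [1] * 70 + [2, 3, 4] * 10    # 100 rows, store 1 dominates

before = skew_ratio(last_month)
after = skew_ratio(this_month)
print(before, after)
```

A jump in the ratio between loads is a hint that a stored execution plan may no longer be the best one.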
Ad Hoc Queries
To understand ad hoc queries, it is important to know the ad hoc users of the data warehouse. For each user or group of users, you need to know the following:
• The number of users in the group
• Whether they use ad hoc queries at regular intervals of time
• Whether they use ad hoc queries frequently
• Whether they use ad hoc queries occasionally at unknown intervals
• The maximum size of query they tend to run
• The average size of query they tend to run
• Whether they require drill-down access to the base data
• The elapsed login time per day
• The peak time of daily usage
• The number of queries they run per peak hour
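Several of these figures can be derived from a query log. The following is a minimal sketch; the log layout (user, hour of day, rows scanned), the `GroupProfile` class, and the sample entries are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass

@dataclass
class GroupProfile:
    users: int
    max_rows: int
    avg_rows: float
    peak_hour: int
    queries_in_peak_hour: int

def build_profile(log):
    """Summarise a log of (user, hour of day, rows scanned) entries."""
    per_hour = Counter(hour for _, hour, _ in log)
    peak_hour, peak_count = per_hour.most_common(1)[0]
    sizes = [rows for _, _, rows in log]
    return GroupProfile(
        users=len({user for user, _, _ in log}),
        max_rows=max(sizes),
        avg_rows=sum(sizes) / len(sizes),
        peak_hour=peak_hour,
        queries_in_peak_hour=peak_count,
    )

# Hypothetical log for one group of ad hoc users.
log = [
    ("asha", 10, 1_000),
    ("ravi", 10, 5_000),
    ("asha", 10, 3_000),
    ("meena", 15, 7_000),
]
profile = build_profile(log)
print(profile)
```

Rows scanned is used here as a stand-in for "size of query"; elapsed time or bytes read would work equally well.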
How to Tune Ad Hoc Queries?
• Track the frequency and quantity of ad hoc queries
• Understand the user profiles
• Record which queries run against each aggregation table, and how often
• Record which indexes are used frequently
This will help in:
• Growth prediction
• Capacity planning
• Deciding which indexes and aggregations should be kept or deleted
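The keep-or-delete decision can be sketched as a usage count per object. The object names, the usage log, and the `min_uses` threshold are hypothetical.

```python
from collections import Counter

def review_objects(existing, usage_log, min_uses=1):
    """Split indexes/aggregations into those to keep and deletion candidates."""
    uses = Counter(usage_log)
    keep = sorted(name for name in existing if uses[name] >= min_uses)
    drop_candidates = sorted(name for name in existing if uses[name] < min_uses)
    return keep, drop_candidates

# Hypothetical objects and the ones that queries actually touched.
existing = ["agg_sales_by_month", "agg_sales_by_region", "idx_sales_store"]
usage_log = ["agg_sales_by_month", "idx_sales_store", "agg_sales_by_month"]

keep, drop_candidates = review_objects(existing, usage_log)
print(keep, drop_candidates)
```

An unused object is only a candidate for deletion; a rarely run but important report may still depend on it.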
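Query for You
SELECT Name, roll_no FROM BCA_rank WHERE cgpa > 8 AND cgpa <= 10;
Tune it!
It's Simple
SELECT Name, roll_no FROM BCA_rank WHERE cgpa > 8;
The cgpa <= 10 test is redundant if CGPA is on a 10-point scale, because it is then true for every row.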
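A quick check that the two queries return the same rows, using Python's built-in sqlite3 module. It assumes CGPA is on a 10-point scale, which the CHECK constraint encodes. The column is written `roll_no` because an unquoted column name cannot contain a space, and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Assumption: the CHECK constraint encodes the 10-point CGPA scale.
conn.execute("""
    CREATE TABLE BCA_rank (
        Name TEXT,
        roll_no INTEGER,
        cgpa REAL CHECK (cgpa BETWEEN 0 AND 10)
    )
""")
conn.executemany(
    "INSERT INTO BCA_rank VALUES (?, ?, ?)",
    [("Anu", 1, 9.1), ("Bala", 2, 7.5), ("Chitra", 3, 10.0), ("Dev", 4, 8.0)],
)

original = conn.execute(
    "SELECT Name, roll_no FROM BCA_rank "
    "WHERE cgpa > 8 AND cgpa <= 10 ORDER BY roll_no"
).fetchall()
tuned = conn.execute(
    "SELECT Name, roll_no FROM BCA_rank WHERE cgpa > 8 ORDER BY roll_no"
).fetchall()
print(original == tuned, tuned)
```

Without such a constraint on the data, the upper bound is not redundant and must stay in the query.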
Thank You