SlideShare a Scribd company logo
1 of 12
Hierarchical
De-normalization
By Kamal Panhwar (A1KW-410-435)

In instructions of Sir Muhammad Hussain Mughal
Contents
      Introduction and writers intro


  Normalization and De-normalization


             Methodology


     Single Dimensional Queries

       Multi Dimensional Quires


               Conclusion
Introduction to Topic
 Two most common processes in database design
  are data Normalization and De-normalization, this
  research discuss opimization using Hierarchical
  Denormalization

         Mortez Zaker is a research student in Advance
         Databases and data warehouse, he is System
         Analyst and specialized in database.



         Somnuk Phon-Amnuaisuk done B.Eng and have done
         Ph.D in Artificial Intelligence.


         Dr.Su-Cheng Haw’s has research in XML Database.
         Her favorite fields Data Modeling, Design, Data
         Management, Data Sestamatic etc.
Introduction
 Datawarehouse is basically foundation for Decision
  Spport Systems, which involve huge collection of
  information, extractions of data from different sources
  which is available for On-line Analytical processing
  (OLAP)
 DW involve number of join operations, incurring
  computational overhead, multi-dimensional grouping and
  aggregation Operations
 Due to nature of DW Performance and efficiency are
  most required qualities.
Researcher compared the results of Hierarchical
 De-normalization and how much it effect
 performance.
Normalization
                            1. Normalization
                                Normalization is technique in which we
                     120    reformat and design database and break in in
                            such way that it can give us more speed and
                            integrity constraints and restriction of business
                            rules.

              70
                            2. De-normalization
       50
                                Database is transferred from Normalized to
                            De-normalized to create data ware house.
30



2003   2004   2005   2006                Data Warehouse
Pakistan
                                 Hierachy
                                                  Each member in a hierarchy is
                                                  Known as a “node”. The top node is
                                                   called root
                                                  And bottom nodes are leafs notes.
                                  Punjab           A parent
         Frontier
                                                  Node is a node that has children
                                                  And a child node is a node which
                                                             Belongs to a
                                             Lahore
                                                               A parent
                Peshwar          Multan
Mardan




                 Qasa
                                        Gulberg   Defence
                Khawani


                               Street
Methodology
Researcher used to compare efficiency of De-Normilzation and normalization
processes and analysis the performance of data models, used series of queries on
some column for evaluation.

Normalized data set
                                                  De-Normalized data set
Query Set

             A set of query Benchmark has been                   Query
Selection    used for frequent query application like
             Star-Schema in the data warehouse


             Queries designed on based business
Business     analysis missions.

             Evaluation was performed by using
Evaluation   calculating time using all queries in      Record           Results
             benchmark
One dimensional Queries
Multi dimensional Queries
Conclusion

Researcher used their methodology and research to know exactly what is effect of
using hierarchical de-normalization. The come to following conclusion

The findings confirm that most probably hierarchical denormalization have the capability
of improving query perfromance since they can reduce the query response times when
the data structure in Data Warehouse is engaged in several joins operations. The result
can help researcher in the future to develop general guidelines which can be applicable to
majority of database designs.
Hierarchical Denormalization

More Related Content

What's hot

Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Shani729
 
Dwh lecture 07-denormalization
Dwh   lecture 07-denormalizationDwh   lecture 07-denormalization
Dwh lecture 07-denormalizationSulman Ahmed
 
Distributed database management systems
Distributed database management systemsDistributed database management systems
Distributed database management systemsUsman Tariq
 
Dwh lecture 08-denormalization tech
Dwh   lecture 08-denormalization techDwh   lecture 08-denormalization tech
Dwh lecture 08-denormalization techSulman Ahmed
 
Advance database system (part 3)
Advance database system (part 3)Advance database system (part 3)
Advance database system (part 3)Abdullah Khosa
 
Intro to Data warehousing Lecture 04
Intro to Data warehousing   Lecture 04Intro to Data warehousing   Lecture 04
Intro to Data warehousing Lecture 04AnwarrChaudary
 
Advance database system (part 2)
Advance database system (part 2)Advance database system (part 2)
Advance database system (part 2)Abdullah Khosa
 
Mi0034 –database management systems
Mi0034 –database management systemsMi0034 –database management systems
Mi0034 –database management systemssmumbahelp
 
Introduction to Teradata And How Teradata Works
Introduction to Teradata And How Teradata WorksIntroduction to Teradata And How Teradata Works
Introduction to Teradata And How Teradata WorksBigClasses Com
 
Advance database system(part 4)
Advance database system(part 4)Advance database system(part 4)
Advance database system(part 4)Abdullah Khosa
 
The Database Environment Chapter 6
The Database Environment Chapter 6The Database Environment Chapter 6
The Database Environment Chapter 6Jeanie Arnoco
 
Implementing sorting in database systems
Implementing sorting in database systemsImplementing sorting in database systems
Implementing sorting in database systemsunyil96
 
Mi0034 database management systems
Mi0034  database management systemsMi0034  database management systems
Mi0034 database management systemssmumbahelp
 
Teradata 13.10
Teradata 13.10Teradata 13.10
Teradata 13.10Teradata
 

What's hot (20)

Database design
Database designDatabase design
Database design
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6
 
Teradata a z
Teradata a zTeradata a z
Teradata a z
 
Dwh lecture 07-denormalization
Dwh   lecture 07-denormalizationDwh   lecture 07-denormalization
Dwh lecture 07-denormalization
 
Distributed database management systems
Distributed database management systemsDistributed database management systems
Distributed database management systems
 
Dwh lecture 08-denormalization tech
Dwh   lecture 08-denormalization techDwh   lecture 08-denormalization tech
Dwh lecture 08-denormalization tech
 
2 normalization
2 normalization2 normalization
2 normalization
 
Advance database system (part 3)
Advance database system (part 3)Advance database system (part 3)
Advance database system (part 3)
 
Intro to Data warehousing Lecture 04
Intro to Data warehousing   Lecture 04Intro to Data warehousing   Lecture 04
Intro to Data warehousing Lecture 04
 
Advance database system (part 2)
Advance database system (part 2)Advance database system (part 2)
Advance database system (part 2)
 
Veena
VeenaVeena
Veena
 
Mi0034 –database management systems
Mi0034 –database management systemsMi0034 –database management systems
Mi0034 –database management systems
 
Introduction to Teradata And How Teradata Works
Introduction to Teradata And How Teradata WorksIntroduction to Teradata And How Teradata Works
Introduction to Teradata And How Teradata Works
 
Advance database system(part 4)
Advance database system(part 4)Advance database system(part 4)
Advance database system(part 4)
 
The Database Environment Chapter 6
The Database Environment Chapter 6The Database Environment Chapter 6
The Database Environment Chapter 6
 
Implementing sorting in database systems
Implementing sorting in database systemsImplementing sorting in database systems
Implementing sorting in database systems
 
Normalization,ddl,dml,dcl
Normalization,ddl,dml,dclNormalization,ddl,dml,dcl
Normalization,ddl,dml,dcl
 
Databse management system
Databse management systemDatabse management system
Databse management system
 
Mi0034 database management systems
Mi0034  database management systemsMi0034  database management systems
Mi0034 database management systems
 
Teradata 13.10
Teradata 13.10Teradata 13.10
Teradata 13.10
 

Similar to Hierarchical Denormalization

Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Calpont Corporation
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingPeter Haase
 
Data Governance for Data Lakes
Data Governance for Data LakesData Governance for Data Lakes
Data Governance for Data LakesKiran Kamreddy
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...Cloudera, Inc.
 
Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16GenomeInABottle
 
Why ODS? The Role Of The ODS In Today’s BI World And How Oracle Technology H...
Why ODS?  The Role Of The ODS In Today’s BI World And How Oracle Technology H...Why ODS?  The Role Of The ODS In Today’s BI World And How Oracle Technology H...
Why ODS? The Role Of The ODS In Today’s BI World And How Oracle Technology H...C. Scyphers
 
Parallel processing in data warehousing and big data
Parallel processing in data warehousing and big dataParallel processing in data warehousing and big data
Parallel processing in data warehousing and big dataAbhishek Sharma
 
No sql and data scalability
No sql and data scalabilityNo sql and data scalability
No sql and data scalabilityRoger Xia
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data WarehouseZalpa Rathod
 
Collaborate 2012-business data transformation and consolidation for a global ...
Collaborate 2012-business data transformation and consolidation for a global ...Collaborate 2012-business data transformation and consolidation for a global ...
Collaborate 2012-business data transformation and consolidation for a global ...Chain Sys Corporation
 
Distributed_Database_System
Distributed_Database_SystemDistributed_Database_System
Distributed_Database_SystemPhilip Zhong
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEZalpa Rathod
 
Deep neural networks and tabular data
Deep neural networks and tabular dataDeep neural networks and tabular data
Deep neural networks and tabular dataJimmyLiang20
 
25.ranking on data manifold with sink points
25.ranking on data manifold with sink points25.ranking on data manifold with sink points
25.ranking on data manifold with sink pointsVenkatesh Neerukonda
 
History of database processing module 1 (2)
History of database processing module 1 (2)History of database processing module 1 (2)
History of database processing module 1 (2)chottu89
 

Similar to Hierarchical Denormalization (20)

Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
 
Survey on NoSQL integration
Survey on NoSQL integrationSurvey on NoSQL integration
Survey on NoSQL integration
 
Dremel Paper Review
Dremel Paper ReviewDremel Paper Review
Dremel Paper Review
 
Lecture1
Lecture1Lecture1
Lecture1
 
Data Governance for Data Lakes
Data Governance for Data LakesData Governance for Data Lakes
Data Governance for Data Lakes
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
 
Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16Genome in a Bottle Consortium Workshop Welcome Aug. 16
Genome in a Bottle Consortium Workshop Welcome Aug. 16
 
Why ODS? The Role Of The ODS In Today’s BI World And How Oracle Technology H...
Why ODS?  The Role Of The ODS In Today’s BI World And How Oracle Technology H...Why ODS?  The Role Of The ODS In Today’s BI World And How Oracle Technology H...
Why ODS? The Role Of The ODS In Today’s BI World And How Oracle Technology H...
 
Parallel processing in data warehousing and big data
Parallel processing in data warehousing and big dataParallel processing in data warehousing and big data
Parallel processing in data warehousing and big data
 
No sql and data scalability
No sql and data scalabilityNo sql and data scalability
No sql and data scalability
 
OLAP & Data Warehouse
OLAP & Data WarehouseOLAP & Data Warehouse
OLAP & Data Warehouse
 
15 19
15 1915 19
15 19
 
Drill njhug -19 feb2013
Drill njhug -19 feb2013Drill njhug -19 feb2013
Drill njhug -19 feb2013
 
Collaborate 2012-business data transformation and consolidation for a global ...
Collaborate 2012-business data transformation and consolidation for a global ...Collaborate 2012-business data transformation and consolidation for a global ...
Collaborate 2012-business data transformation and consolidation for a global ...
 
Distributed_Database_System
Distributed_Database_SystemDistributed_Database_System
Distributed_Database_System
 
OLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSEOLAP & DATA WAREHOUSE
OLAP & DATA WAREHOUSE
 
Deep neural networks and tabular data
Deep neural networks and tabular dataDeep neural networks and tabular data
Deep neural networks and tabular data
 
25.ranking on data manifold with sink points
25.ranking on data manifold with sink points25.ranking on data manifold with sink points
25.ranking on data manifold with sink points
 
History of database processing module 1 (2)
History of database processing module 1 (2)History of database processing module 1 (2)
History of database processing module 1 (2)
 

More from Kamaluddin Panhwar

More from Kamaluddin Panhwar (7)

Software Development
Software DevelopmentSoftware Development
Software Development
 
Communication
CommunicationCommunication
Communication
 
Business Process Engineering
Business Process EngineeringBusiness Process Engineering
Business Process Engineering
 
Bar Shampoo
Bar ShampooBar Shampoo
Bar Shampoo
 
Introduction To It Voice Recognize
Introduction To It Voice RecognizeIntroduction To It Voice Recognize
Introduction To It Voice Recognize
 
Shipping Management Expert System
Shipping Management Expert SystemShipping Management Expert System
Shipping Management Expert System
 
Principle of Management
Principle of ManagementPrinciple of Management
Principle of Management
 

Hierarchical Denormalization

  • 1. Hierarchical De-normalization By Kamal Panhwar (A1KW-410-435) In instructions of Sir Muhammad Hussain Mughal
  • 2. Contents Introduction and writers intro Normalization and De-normalization Methodology Single Dimensional Queries Multi Dimensional Quires Conclusion
  • 3. Introduction to Topic  Two most common processes in database design are data Normalization and De-normalization, this research discuss opimization using Hierarchical Denormalization Mortez Zaker is a research student in Advance Databases and data warehouse, he is System Analyst and specialized in database. Somnuk Phon-Amnuaisuk done B.Eng and have done Ph.D in Artificial Intelligence. Dr.Su-Cheng Haw’s has research in XML Database. Her favorite fields Data Modeling, Design, Data Management, Data Sestamatic etc.
  • 4. Introduction  Datawarehouse is basically foundation for Decision Spport Systems, which involve huge collection of information, extractions of data from different sources which is available for On-line Analytical processing (OLAP)  DW involve number of join operations, incurring computational overhead, multi-dimensional grouping and aggregation Operations  Due to nature of DW Performance and efficiency are most required qualities. Researcher compared the results of Hierarchical De-normalization and how much it effect performance.
  • 5. Normalization 1. Normalization Normalization is technique in which we 120 reformat and design database and break in in such way that it can give us more speed and integrity constraints and restriction of business rules. 70 2. De-normalization 50 Database is transferred from Normalized to De-normalized to create data ware house. 30 2003 2004 2005 2006 Data Warehouse
  • 6. Pakistan Hierachy Each member in a hierarchy is Known as a “node”. The top node is called root And bottom nodes are leafs notes. Punjab A parent Frontier Node is a node that has children And a child node is a node which Belongs to a Lahore A parent Peshwar Multan Mardan Qasa Gulberg Defence Khawani Street
  • 7. Methodology Researcher used to compare efficiency of De-Normilzation and normalization processes and analysis the performance of data models, used series of queries on some column for evaluation. Normalized data set De-Normalized data set
  • 8. Query Set A set of query Benchmark has been Query Selection used for frequent query application like Star-Schema in the data warehouse Queries designed on based business Business analysis missions. Evaluation was performed by using Evaluation calculating time using all queries in Record Results benchmark
  • 11. Conclusion Researcher used their methodology and research to know exactly what is effect of using hierarchical de-normalization. The come to following conclusion The findings confirm that most probably hierarchical denormalization have the capability of improving query perfromance since they can reduce the query response times when the data structure in Data Warehouse is engaged in several joins operations. The result can help researcher in the future to develop general guidelines which can be applicable to majority of database designs.