The document discusses multidimensional databases. It defines multidimensional databases as systems designed to efficiently store and retrieve large volumes of related data that can be viewed from different perspectives or dimensions. It provides an example using automobile sales data that can be analyzed based on dimensions like model, color, dealership, and time. Multidimensional databases allow for interactive analysis of data across multiple dimensions, unlike relational databases which are less flexible and have slower response times for similar analyses.
1. Submitted by:
Pankaj Thakur
ME CSE (Modular)
Roll No. 171408
Submitted to:
Dr. Rakesh Kumar
Assistant Professor
Computer Science &
Engineering
2. Definition & Need of Multidimensional
Database.
Relational Vs Multidimensional Database
Features of Multidimensional Database
Ideal and Real World Applications
Where not to use
Research work & References.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
3. A multidimensional database is a computer
software system designed to allow for the
efficient and convenient storage and retrieval
of large volumes of data that is
1) intimately related and
2) stored, viewed and analyzed from
different perspectives.
These perspectives are called dimension.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
4. To answer some of the typical business questions from raw data often
require looking that data from various angles, perspectives or
dimensions.
For example, in order to improve and evaluate business activity, an
automobile marketer might expect viewing historical sales volume
figures from multiple perspectives such as:
Sales volumes by model
Sales volumes by color
Sales volumes by dealership
Sales volumes over time
Analyzing the Sales Volumes data from any one or more of the above
perspectives can result answers to important questions such as:
What is the trend in sales volumes over a period of time for a
specific model and color across a specific group of dealerships?
Having the ability to respond to these types of inquiries in a timely
fashion allows managers to formulate effective rules, identify trends and
improve their overall ability to make important business decisions.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
5. Certainly, relational databases could answer
these question, but query results must also
come to the manager in a meaningful and
timely way.
End users needing interactive access to large
volumes of data stored in a relational
environment are often frustrated by poor
response times and lack of flexibility offered
by relational database technology and their
SQL query building tools.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
6. The SALES VOLUMES field contains the data we wish to analyze.
The COLOR and MODEL fields contain the perspectives we will analyze the data from.
MODEL, ranges across only three possible values: MINI VAN, SPORTS COUPE ,SEDAN.
COLOR, also ranges across three possible values: BLUE, RED and WHITE.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
7. There is an alternative way of representing this data - “cross tab"
view, or data matrix.
In this representation the SALES VOLUMES figures are located at the
x-axis/y-axis intersections of a 3x3 matrix.
This matrix is an example of a two-dimensional “array.”
Pankaj Thakur, ME (CSE, Modular), RN 171408.
8. In this example, the first dimension
is MODEL; it has three positions:
MINI VAN, SEDAN and COUPE.
The second dimension is COLOR; it
also has three positions: BLUE,
WHITE and RED.
The Sales Volume figures are
located at the intersections of the
dimension positions.
This multidimensional array
structure represents a higher level
of organization than the relational
table.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
10. Pankaj Thakur, ME (CSE, Modular), RN 171408.
In relational table format, this array translates into
a table of 1000 records! In contrast the
multidimensional structure has more "knowledge"
about where a particular piece of data lies.
11. A relational system might have to search through
all 1000 records just to find the right record
whereas the multidimensional system only has to
search along three dimensions of 10 positions
each to find the matching "record" or cell.
This is a maximum of 30 position searches for
the array versus a maximum of 1000 record
searches for the table.
If we assume that the average search equals half
the maximum search, we are looking at 15
versus 500 searches– a 3300% improvement in
performance!
Pankaj Thakur, ME (CSE, Modular), RN 171408.
12. Pankaj Thakur, ME (CSE, Modular), RN 171408.
Our three dimensional example can be extended to four dimensions
by adding a time dimension to indicate the month of the year in
which a sale was made.
This paradigm can be extended to five or more dimensions
17. Pankaj Thakur, ME (CSE, Modular), RN 171408.
The end user can
select the desired
positions along
each dimension.
The reduced array
can now be rotated
and used in
computations.
Relatively few
resource
consuming
searches are
required
18. Pankaj Thakur, ME (CSE, Modular), RN 171408.
Moving up and moving down levels in a hierarchy is referred
to as "roll-up" and "drill-down."
19. Pankaj Thakur, ME (CSE, Modular), RN 171408.
The high degree of structure in a
multidimensional array translates into a very
simple and efficient query language.
Query output is more immediately useful to the
end user.
20. Pankaj Thakur, ME (CSE, Modular), RN 171408.
Multiple users simultaneously
access and view the same data,
without experiencing a degradation
in performance.
Users can define their own unique
and independent views of the same
data structure.
Each end user is able to rotate,
range, drill-down and roll-up
through the database without
impacting the view of any other
user.
21. Pankaj Thakur, ME (CSE, Modular), RN 171408.
Data stored in a multidimensional array offers a
high degree inherent organization that greatly
facilitates analysis.
For example, in our two-dimensional example, all
of the Sales Volume figures for COLOR=BLUE are in
a neat row.
Similarly, all of the Sales Volume figures for
CAR=SEDAN are conveniently lined up.
22. Financial Analysis and Reporting
Budgeting
Promotion Tracking
Quality Assurance and Quality Control
Product Profitability
Survey Analysis
Pankaj Thakur, ME (CSE, Modular), RN 171408.
23. Ease of Data Presentation and Navigation
Ease of Maintenance
Performance
Pankaj Thakur, ME (CSE, Modular), RN 171408.
24. Multidimensional data integration
OLAP workload pattern
OLTP workload pattern
OLAP and OLTP integration
Pankaj Thakur, ME (CSE, Modular), RN 171408.
25. There are some performance related reasons
for not placing certain types of datasets in
multidimensional arrays. For example, the
relational table below.
Pankaj Thakur, ME (CSE, Modular), RN 171408.
27. 1. Multidimensional Database for Crime Prevention. (Horák JiĜí, Ivan Igor, Drozdová
Michala, Horáková Bronislava, Bala Petr(2016 17th International Carpathian Control
Conference (ICCC))
2. Application of Multidimensional databases of Rainfall and Low Pressure Systems on
OLAP-based model, Kavita Pabreja, Research Scholar, Birla Institute of Technology and
Science, Pilani.
3. OLAP Umbrella: Visualization Model for Multidimensional Databases by Tanmoy
Bhowmik, 1 Anirban Sarkar, 2 Narayan C Debnath, Department of Computer
Applications, National Institute of Technology, Durgapur, WB – 713209, India,
Department of Computer Science, Winona State University, Winona, MN – 55987, USA.
4. Normal Forms for Multidimensional Databases, W. Lehner, J. Albrecht, H. Wedekind,
University of Erlangen-Nuremberg, Dept. of Database Systems Martensstr. 3, D-91058
Erlangen, Germany
Pankaj Thakur, ME (CSE, Modular), RN 171408.