SlideShare a Scribd company logo
1 of 3
Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and
Applications (IJERA) ISSN: 2248-9622 www.ijera.com
Vol. 3, Issue 3, May-Jun 2013, pp.950-952
950 | P a g e
Design of an Algorithm For Materialized View Selection
Hema S. Botre Prof.M.S.Chaudhari
Smt.Bhagwati Chaturvedi Smt.Bhagwati Chaturvedi
College of Engg,Nagpur College of Engg,Nagpur
Abstract:
Quick responds and accurate data are
important factors in the data warehouse. In
distributed database, large amount of database
required quick response to the user which can
enhance the query processing. A data warehouse
is the collection of materialized views which are
used to process a given set of queries.
Materialized views are found useful for fast
query processing. It can be the result of a query,
subset of the rows or column or can be the join
result. Instead of accessing the original database ,
some immediate result can store in the form of
materialized view which can give the fast access
to the data .There are two issues should be
consider while developing materialized views i.e.
space constraint & maintenance cost
constraint .Because materialized view required
large amount of space on to the disk,
materialization of all queries or views are not
possible. Therefore Materialized views selection
is one of the important decisions in designing a
data warehouse. This work will implement an
algorithm which can select the queries based on
frequently accessing by the user & materialized
that query so that user can get fast access to
there useful data & automatically query
processing will be faster and maintenance of
data in order to get up-to-date information..
Keywords: Data warehouse, materialized views,
maintenance cost, query processing cost.
Introduction:
Data warehouse contains huge amount of
data from various distributed, heterogeneous and
operational data sources. Initially traditional data
sources never used to perform complex query
therefore to overcome the weakness of traditional
databases, data warehouse have been developed.
Data warehouse is the collection of many
materialized view which are used to answer some
aggregate queries. Materialized view can give fast
query response rather than accessing the data from
base table. Therefore problem generally occur which
views or queries should materialized in order to get
low processing cost & low maintenance cost.
Materialization of all views/queries
consumes larger space on to the disk therefore we
need to select some of the views/queries which are
beneficial to us. This work focus on two main things
while selecting the views/queries for materialization
i. e. frequently accessing queries and space occupied
by the data that is fetched after applying a query.
After considering this two main constraint an
Algorithm will select those views/queries which
contain maximum benefit per unit space. It means
it will select those views/queries which can give the
fast access to their useful data of the user and by
which query processing will be faster.
Another part of this work will maintain the
data consistently. It means as the base table will
change, respective materialized view will also be
changed so that user can get accurate data.
Related Work:
Harinarayan [1] presented a greedy
algorithm for the selection of materialized views in
which data cubes are used for query evaluation cost
but in this work, storage cost and maintenance cost
are not given. T.Nalini, Dr. A.Kumaravel,
Dr.K.Rangarajan [2] have been proposed I-mine
mining techniques by which frequently accessing
query can consider after that processing cost can be
calculated & space occupied by that data can also be
calculated .After considering all these three
parameter one threshold value will select the views
for materialization. Mahip Bartere and Dr. Prashant
Deshmukh [3] proposed sample views which are the
index materialized view used for fast accessing data
form the databases. But it is useful only those
application which contain random sample form the
database. Himanshu Gupta and Inderpal Singh
Mumick [4] developed an algorithms to select a set
of views to materialize in a data warehouse in order
to minimize the total query response time under the
constraint of a given total view maintenance time.
Chuan Zhang and Jian Yang [5] proposed a
completely different approach, Genetic Algorithm,
to choose materialized views and demonstrate that it
is practical and effective compared with heuristic
approaches. B.Ashadevi,Dr.R.Balasubramanian [6]
implemented an algorithm in which query which has
highest weight than the other are used for
materialization and it can delete an existing
materialized view which are no longer useful as well
as maintenance is also well defined in this work.
Sanket Patel and Deepak Dembla [7] have proposed
the algorithm which select the views for
materialization and it select the best node to
distributed materialized view. Imene Mami and
Zohra Bellahsene [8] have proposed work in which,
it select views based on workload and processing
Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and
Applications (IJERA) ISSN: 2248-9622 www.ijera.com
Vol. 3, Issue 3, May-Jun 2013, pp.950-952
951 | P a g e
cost and it implemented an algorithm which
calculated updating of a materialized view, from
which those views which are updated frequently will
be deleted.
Proposed Work:
This work focus on two main things while
selecting views for materialization that are
frequently accessing queries and total disk space.
The problem of utilizing the limited resources disk
space or maintenance time to minimize the total
query processing cost comes under the materialized
view selection with resource constraints.
Following algorithm will implement the above
Priority Based Algorithm:
Materialized view consumes large amounts
of disk-space due to their large sizes that usually
reach hundreds of terabytes. Therefore it is not
possible to materialize all the views. This work will
implement an algorithm which materialized views
with highest frequency and continue it till the total
disk space will be fulfilled.
Algorithm:
1. Initialize disk size d.
2. Cerate array y for frequently accessing
views.
3. Create a target array z for materialized
views.
4. Repeat
5. Retrive x=[Current_view,Goal_view] from
y.
6. if i==queue_size.
7. then return(Goal_view)
8. Add Current_View into z.
9. If
union(Current_viewsize,Next_viewsize)<=
d
10. Add Next_View into z.
11.Until queue is empty.
In the above algorithm the work will
consider both constraint: the total disk space and
frequently accessing queries by which it can
improve query performance . Therefore it will be
useful in huge amount of database.
Results:
Initially this work counts the frequency of
each query and creates the materialized views of
those queries which frequency goes beyond the
threshold value.
This gives the following result of an algorithm:
After this it created the materialized views of those
query which frequencies are more than the threshold
value.
Following are the materialized views:
This materialization based on highest frequencies of
an queries.
After creation of materialized view whenever user
will fire the same query, data will be fetched
directly from the materialized view which will take
less processing time.
This can be shown as follows:
Conclusion:
The selection of views to materialize is one
of the most important issues in designing a data
warehouse. So as to achieve the best combination of
good query response where query processing cost
should be minimized in a given storage space
constraints. The space constraint is the most
important factor while selecting the views to be
materialized. The Proposed approach will
implement such an algorithm that will select the
Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and
Applications (IJERA) ISSN: 2248-9622 www.ijera.com
Vol. 3, Issue 3, May-Jun 2013, pp.950-952
952 | P a g e
materialized views to minimize the space & storage
constraint. The materialized views are selected to
give minimum query response time.
References:
[1] V. Harinarayan, A. Rajaraman, and J.
Ullman. “Implementing data cubes
efficiently”. Proceedings of ACM
SIGMOD 1996 International Conference
on Management of Data, Montreal, Canada,
pages 205- 216, 1996.
[2] T.Nalini, Dr. A.Kumaravel,
Dr.K.Rangarajan , “An Efficient I-MINE
Algorithm for Materialized Views in a Data
Warehouse Environment” , International
Conference on Computer Science, Vol.8,
Issue 5, No 1, September 2011.
[3] Mahip Bartere and Dr. Prashant
Deshmukh, “Optimization of Query
Processing Time Base on Materialized
Sample View ”, (IJCSIT) International
Journal of Computer Science and
Information Technologies, Vol. 2 (3) ,
2011, 1222-1228
[4] H. Gupta, I.S. Mumick, Selection of views
to materialize under a maintenance cost
constraint. In Proc. 7th International
Conference on Database Theory (ICDT'99),
Jerusalem, Israel, pp. 453–470,1999.
[5] C. Zhang and J. Yang, “Genetic algorithm
for materialized view selection in data
warehouse environments,” Proceedings of
the International Conference on Data
Warehousing and Knowledge Discovery ,
LNCS, vol. 1676, pp. 116–125, 999.
[6] B.Ashadevi and Dr.R.Balasubramanian,
“Optimized Cost Effective Approach for
Selection of Materialized Views in Data
Warehousing”, JCS&T Vol. 9 No. 1,April
2009.
[7] Sanket Patel and Deepak Dembla, “An
Approach for Selection and Maintenance of
Materialized View in Data Warehousing”,
International Journal of Information
Sciences and Application.ISSN 0974-2255
Volume 4, Number 1 (2012).
[8] Imene Mami and Zohra Bellahsene, “A
Survey of View Selection Methods”,
SIGMOD Record, March 2012 (Vol. 41,
No. 1).
[9] MR. P. P. Karde, DR. V. M. Thakare,
“Materialized View Selection Approach
Using Tree Based
Methodology”,ISSN:0975-6698-NOV 09
to Oct 10-vol 1.

More Related Content

What's hot

Applying Genetic Algorithms to Information Retrieval Using Vector Space Model
Applying Genetic Algorithms to Information Retrieval Using Vector Space ModelApplying Genetic Algorithms to Information Retrieval Using Vector Space Model
Applying Genetic Algorithms to Information Retrieval Using Vector Space Model
IJCSEA Journal
 
Applying genetic algorithms to information retrieval using vector space model
Applying genetic algorithms to information retrieval using vector space modelApplying genetic algorithms to information retrieval using vector space model
Applying genetic algorithms to information retrieval using vector space model
IJCSEA Journal
 
An exploratory analysis on half hourly electricity load patterns leading to h...
An exploratory analysis on half hourly electricity load patterns leading to h...An exploratory analysis on half hourly electricity load patterns leading to h...
An exploratory analysis on half hourly electricity load patterns leading to h...
acijjournal
 
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
csandit
 
Optimal approach for text summarization
Optimal approach for text summarizationOptimal approach for text summarization
Optimal approach for text summarization
IAEME Publication
 

What's hot (18)

B018110610
B018110610B018110610
B018110610
 
Applying Genetic Algorithms to Information Retrieval Using Vector Space Model
Applying Genetic Algorithms to Information Retrieval Using Vector Space ModelApplying Genetic Algorithms to Information Retrieval Using Vector Space Model
Applying Genetic Algorithms to Information Retrieval Using Vector Space Model
 
Data mining technique for opinion
Data mining technique for opinionData mining technique for opinion
Data mining technique for opinion
 
Applying genetic algorithms to information retrieval using vector space model
Applying genetic algorithms to information retrieval using vector space modelApplying genetic algorithms to information retrieval using vector space model
Applying genetic algorithms to information retrieval using vector space model
 
An exploratory analysis on half hourly electricity load patterns leading to h...
An exploratory analysis on half hourly electricity load patterns leading to h...An exploratory analysis on half hourly electricity load patterns leading to h...
An exploratory analysis on half hourly electricity load patterns leading to h...
 
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
IMPROVED NEURAL NETWORK PREDICTION PERFORMANCES OF ELECTRICITY DEMAND: MODIFY...
 
Early Identification of Diseases Based on Responsible Attribute using Data Mi...
Early Identification of Diseases Based on Responsible Attribute using Data Mi...Early Identification of Diseases Based on Responsible Attribute using Data Mi...
Early Identification of Diseases Based on Responsible Attribute using Data Mi...
 
T OP K-O PINION D ECISIONS R ETRIEVAL IN H EALTHCARE S YSTEM
T OP  K-O PINION  D ECISIONS  R ETRIEVAL IN  H EALTHCARE  S YSTEM T OP  K-O PINION  D ECISIONS  R ETRIEVAL IN  H EALTHCARE  S YSTEM
T OP K-O PINION D ECISIONS R ETRIEVAL IN H EALTHCARE S YSTEM
 
A CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSE
A CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSEA CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSE
A CONCEPTUAL METADATA FRAMEWORK FOR SPATIAL DATA WAREHOUSE
 
Ijetcas14 338
Ijetcas14 338Ijetcas14 338
Ijetcas14 338
 
50120130406036
5012013040603650120130406036
50120130406036
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
IJET-V2I6P32
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 
Assess data reliability from a set of criteria using the theory of belief fun...
Assess data reliability from a set of criteria using the theory of belief fun...Assess data reliability from a set of criteria using the theory of belief fun...
Assess data reliability from a set of criteria using the theory of belief fun...
 
Privacy Preservation and Restoration of Data Using Unrealized Data Sets
Privacy Preservation and Restoration of Data Using Unrealized Data SetsPrivacy Preservation and Restoration of Data Using Unrealized Data Sets
Privacy Preservation and Restoration of Data Using Unrealized Data Sets
 
Assessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s RecitalAssessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s Recital
 
50120140504015
5012014050401550120140504015
50120140504015
 
Optimal approach for text summarization
Optimal approach for text summarizationOptimal approach for text summarization
Optimal approach for text summarization
 

Viewers also liked

Viewers also liked (20)

Ey33907916
Ey33907916Ey33907916
Ey33907916
 
Ez33917920
Ez33917920Ez33917920
Ez33917920
 
Hl3312951297
Hl3312951297Hl3312951297
Hl3312951297
 
Eo33845851
Eo33845851Eo33845851
Eo33845851
 
Ct33572581
Ct33572581Ct33572581
Ct33572581
 
Cx33596601
Cx33596601Cx33596601
Cx33596601
 
V33119122
V33119122V33119122
V33119122
 
K33050053
K33050053K33050053
K33050053
 
P34104107
P34104107P34104107
P34104107
 
Cn34539542
Cn34539542Cn34539542
Cn34539542
 
X34143146
X34143146X34143146
X34143146
 
E342331
E342331E342331
E342331
 
Bh35333338
Bh35333338Bh35333338
Bh35333338
 
Ig3115521556
Ig3115521556Ig3115521556
Ig3115521556
 
Ch32524527
Ch32524527Ch32524527
Ch32524527
 
Cq32579584
Cq32579584Cq32579584
Cq32579584
 
Ci32528533
Ci32528533Ci32528533
Ci32528533
 
Cu32604607
Cu32604607Cu32604607
Cu32604607
 
Retos y Logros 2001-2010
Retos y Logros 2001-2010Retos y Logros 2001-2010
Retos y Logros 2001-2010
 
Erick tenemaza
Erick tenemazaErick tenemaza
Erick tenemaza
 

Similar to Fg33950952

Open domain question answering system using semantic role labeling
Open domain question answering system using semantic role labelingOpen domain question answering system using semantic role labeling
Open domain question answering system using semantic role labeling
eSAT Publishing House
 

Similar to Fg33950952 (20)

International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI) International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
Query optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query managementQuery optimization in oodbms identifying subquery for query management
Query optimization in oodbms identifying subquery for query management
 
Ax34298305
Ax34298305Ax34298305
Ax34298305
 
Novel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data StreamsNovel Ensemble Tree for Fast Prediction on Data Streams
Novel Ensemble Tree for Fast Prediction on Data Streams
 
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENTQUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
QUERY OPTIMIZATION IN OODBMS: IDENTIFYING SUBQUERY FOR COMPLEX QUERY MANAGEMENT
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databases
 
Evaluation of a New Incremental Classification Tree Algorithm for Mining High...
Evaluation of a New Incremental Classification Tree Algorithm for Mining High...Evaluation of a New Incremental Classification Tree Algorithm for Mining High...
Evaluation of a New Incremental Classification Tree Algorithm for Mining High...
 
EVALUATION OF A NEW INCREMENTAL CLASSIFICATION TREE ALGORITHM FOR MINING HIGH...
EVALUATION OF A NEW INCREMENTAL CLASSIFICATION TREE ALGORITHM FOR MINING HIGH...EVALUATION OF A NEW INCREMENTAL CLASSIFICATION TREE ALGORITHM FOR MINING HIGH...
EVALUATION OF A NEW INCREMENTAL CLASSIFICATION TREE ALGORITHM FOR MINING HIGH...
 
An effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentAn effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded content
 
Open domain question answering system using semantic role labeling
Open domain question answering system using semantic role labelingOpen domain question answering system using semantic role labeling
Open domain question answering system using semantic role labeling
 
A novel scheduling algorithm for cloud computing environment
A novel scheduling algorithm for cloud computing environmentA novel scheduling algorithm for cloud computing environment
A novel scheduling algorithm for cloud computing environment
 
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkMining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
 
An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...
 
A Brief Survey on Recommendation System for a Gradient Classifier based Inade...
A Brief Survey on Recommendation System for a Gradient Classifier based Inade...A Brief Survey on Recommendation System for a Gradient Classifier based Inade...
A Brief Survey on Recommendation System for a Gradient Classifier based Inade...
 
50120130405014 2-3
50120130405014 2-350120130405014 2-3
50120130405014 2-3
 
K355662
K355662K355662
K355662
 
K355662
K355662K355662
K355662
 
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set MiningAn Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
An Efficient Compressed Data Structure Based Method for Frequent Item Set Mining
 

Recently uploaded

“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
Wonjun Hwang
 

Recently uploaded (20)

ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 

Fg33950952

  • 1. Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 3, May-Jun 2013, pp.950-952 950 | P a g e Design of an Algorithm For Materialized View Selection Hema S. Botre Prof.M.S.Chaudhari Smt.Bhagwati Chaturvedi Smt.Bhagwati Chaturvedi College of Engg,Nagpur College of Engg,Nagpur Abstract: Quick responds and accurate data are important factors in the data warehouse. In distributed database, large amount of database required quick response to the user which can enhance the query processing. A data warehouse is the collection of materialized views which are used to process a given set of queries. Materialized views are found useful for fast query processing. It can be the result of a query, subset of the rows or column or can be the join result. Instead of accessing the original database , some immediate result can store in the form of materialized view which can give the fast access to the data .There are two issues should be consider while developing materialized views i.e. space constraint & maintenance cost constraint .Because materialized view required large amount of space on to the disk, materialization of all queries or views are not possible. Therefore Materialized views selection is one of the important decisions in designing a data warehouse. This work will implement an algorithm which can select the queries based on frequently accessing by the user & materialized that query so that user can get fast access to there useful data & automatically query processing will be faster and maintenance of data in order to get up-to-date information.. Keywords: Data warehouse, materialized views, maintenance cost, query processing cost. Introduction: Data warehouse contains huge amount of data from various distributed, heterogeneous and operational data sources. Initially traditional data sources never used to perform complex query therefore to overcome the weakness of traditional databases, data warehouse have been developed. Data warehouse is the collection of many materialized view which are used to answer some aggregate queries. Materialized view can give fast query response rather than accessing the data from base table. Therefore problem generally occur which views or queries should materialized in order to get low processing cost & low maintenance cost. Materialization of all views/queries consumes larger space on to the disk therefore we need to select some of the views/queries which are beneficial to us. This work focus on two main things while selecting the views/queries for materialization i. e. frequently accessing queries and space occupied by the data that is fetched after applying a query. After considering this two main constraint an Algorithm will select those views/queries which contain maximum benefit per unit space. It means it will select those views/queries which can give the fast access to their useful data of the user and by which query processing will be faster. Another part of this work will maintain the data consistently. It means as the base table will change, respective materialized view will also be changed so that user can get accurate data. Related Work: Harinarayan [1] presented a greedy algorithm for the selection of materialized views in which data cubes are used for query evaluation cost but in this work, storage cost and maintenance cost are not given. T.Nalini, Dr. A.Kumaravel, Dr.K.Rangarajan [2] have been proposed I-mine mining techniques by which frequently accessing query can consider after that processing cost can be calculated & space occupied by that data can also be calculated .After considering all these three parameter one threshold value will select the views for materialization. Mahip Bartere and Dr. Prashant Deshmukh [3] proposed sample views which are the index materialized view used for fast accessing data form the databases. But it is useful only those application which contain random sample form the database. Himanshu Gupta and Inderpal Singh Mumick [4] developed an algorithms to select a set of views to materialize in a data warehouse in order to minimize the total query response time under the constraint of a given total view maintenance time. Chuan Zhang and Jian Yang [5] proposed a completely different approach, Genetic Algorithm, to choose materialized views and demonstrate that it is practical and effective compared with heuristic approaches. B.Ashadevi,Dr.R.Balasubramanian [6] implemented an algorithm in which query which has highest weight than the other are used for materialization and it can delete an existing materialized view which are no longer useful as well as maintenance is also well defined in this work. Sanket Patel and Deepak Dembla [7] have proposed the algorithm which select the views for materialization and it select the best node to distributed materialized view. Imene Mami and Zohra Bellahsene [8] have proposed work in which, it select views based on workload and processing
  • 2. Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 3, May-Jun 2013, pp.950-952 951 | P a g e cost and it implemented an algorithm which calculated updating of a materialized view, from which those views which are updated frequently will be deleted. Proposed Work: This work focus on two main things while selecting views for materialization that are frequently accessing queries and total disk space. The problem of utilizing the limited resources disk space or maintenance time to minimize the total query processing cost comes under the materialized view selection with resource constraints. Following algorithm will implement the above Priority Based Algorithm: Materialized view consumes large amounts of disk-space due to their large sizes that usually reach hundreds of terabytes. Therefore it is not possible to materialize all the views. This work will implement an algorithm which materialized views with highest frequency and continue it till the total disk space will be fulfilled. Algorithm: 1. Initialize disk size d. 2. Cerate array y for frequently accessing views. 3. Create a target array z for materialized views. 4. Repeat 5. Retrive x=[Current_view,Goal_view] from y. 6. if i==queue_size. 7. then return(Goal_view) 8. Add Current_View into z. 9. If union(Current_viewsize,Next_viewsize)<= d 10. Add Next_View into z. 11.Until queue is empty. In the above algorithm the work will consider both constraint: the total disk space and frequently accessing queries by which it can improve query performance . Therefore it will be useful in huge amount of database. Results: Initially this work counts the frequency of each query and creates the materialized views of those queries which frequency goes beyond the threshold value. This gives the following result of an algorithm: After this it created the materialized views of those query which frequencies are more than the threshold value. Following are the materialized views: This materialization based on highest frequencies of an queries. After creation of materialized view whenever user will fire the same query, data will be fetched directly from the materialized view which will take less processing time. This can be shown as follows: Conclusion: The selection of views to materialize is one of the most important issues in designing a data warehouse. So as to achieve the best combination of good query response where query processing cost should be minimized in a given storage space constraints. The space constraint is the most important factor while selecting the views to be materialized. The Proposed approach will implement such an algorithm that will select the
  • 3. Hema S. Botre, Prof.M.S.Chaudhari / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 3, May-Jun 2013, pp.950-952 952 | P a g e materialized views to minimize the space & storage constraint. The materialized views are selected to give minimum query response time. References: [1] V. Harinarayan, A. Rajaraman, and J. Ullman. “Implementing data cubes efficiently”. Proceedings of ACM SIGMOD 1996 International Conference on Management of Data, Montreal, Canada, pages 205- 216, 1996. [2] T.Nalini, Dr. A.Kumaravel, Dr.K.Rangarajan , “An Efficient I-MINE Algorithm for Materialized Views in a Data Warehouse Environment” , International Conference on Computer Science, Vol.8, Issue 5, No 1, September 2011. [3] Mahip Bartere and Dr. Prashant Deshmukh, “Optimization of Query Processing Time Base on Materialized Sample View ”, (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (3) , 2011, 1222-1228 [4] H. Gupta, I.S. Mumick, Selection of views to materialize under a maintenance cost constraint. In Proc. 7th International Conference on Database Theory (ICDT'99), Jerusalem, Israel, pp. 453–470,1999. [5] C. Zhang and J. Yang, “Genetic algorithm for materialized view selection in data warehouse environments,” Proceedings of the International Conference on Data Warehousing and Knowledge Discovery , LNCS, vol. 1676, pp. 116–125, 999. [6] B.Ashadevi and Dr.R.Balasubramanian, “Optimized Cost Effective Approach for Selection of Materialized Views in Data Warehousing”, JCS&T Vol. 9 No. 1,April 2009. [7] Sanket Patel and Deepak Dembla, “An Approach for Selection and Maintenance of Materialized View in Data Warehousing”, International Journal of Information Sciences and Application.ISSN 0974-2255 Volume 4, Number 1 (2012). [8] Imene Mami and Zohra Bellahsene, “A Survey of View Selection Methods”, SIGMOD Record, March 2012 (Vol. 41, No. 1). [9] MR. P. P. Karde, DR. V. M. Thakare, “Materialized View Selection Approach Using Tree Based Methodology”,ISSN:0975-6698-NOV 09 to Oct 10-vol 1.