SlideShare a Scribd company logo
1 of 6
Download to read offline
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
DOI : 10.5121/ijwsc.2013.4202 23
SCHEMA BASED STORAGE OF XML DOCUMENTS IN
RELATIONAL DATABASES
Dr. Pushpa Suri1
and Divyesh Sharma2
1
Department of Computer Science and Applications, Kurukshetra University,
Kurukshetra
pushpa.suri@yahoo.co.in
2
Department of Computer Science and Applications, Kurukshetra University,
Kurukshetra
divyeshsharma89@gmail.com
ABSTRACT
XML (Extensible Mark up language) is emerging as a tool for representing and exchanging data over the
internet. When we want to store and query XML data, we can use two approaches either by using native
databases or XML enabled databases. In this paper we deal with XML enabled databases. We use
relational databases to store XML documents. In this paper we focus on mapping of XML DTD into
relations. Mapping needs three steps: 1) Simplify Complex DTD’s 2) Make DTD graph by using simplified
DTD’s 3) Generate Relational schema. We present an inlining algorithm for generating relational schemas
from available DTD’s. This algorithm also handles recursion in an XML document.
KEYWORDS
XML, Relational databases, Schema Based Storage
1. INTRODUCTION
As XML (extensible markup language) documents needs efficient storage. The concept of
Relational data bases provides more mature way to store and query these documents. Research on
storage of XML documents using relational databases are classified by two kinds : Schema
oblivious storage, Schema Based storage .Schema oblivious storage does not use any DTD
(document type definition) or XML schema to map XML document in to
relations[4][5][6][7][8][9][10]. Schema based storage requires DTD attached with the XML
documents itself . In this paper we discuss mapping of XML DTD into relations with handling of
recursion.
2. RELATED WORK
Shanmugasundaram [1] , provides three basic Inlining algorithims for mapping XML DTD into
relations :Basic, shared and Hybrid. In basic inlining, for each element in XML document, a
relation is created. This will lead to excessive relations and when we process queries to these
relations, this will lead to excessive joins. In shared inlining, the elements having single
occurance are inlined to its parent elements. This will reduce the number of relations as compared
to Basic. In Hybrid, shared element i.e. the element having single occurance but shared by many
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
24
parents are also inlined to its parent elements. Mustafa Atay [2] extends these algorithims by
providing the concept of endID. endID is the maximum id of the descendants of an element . For
every element that is to be inlined, an endID is stored. This is helpful when we reconstruct the
document. .Lu [3] provides a seperate edge table for storing elements having multiple occurances.
3. PROPOSED WORK
We discuss the concept of Inlining in detail. Inlining is basically to store the information of the
child element with its parent element in the same relation. Elements having indegree = 1 i.e. the
element having one parent can be inlined into same relation of its parent element. Indegree is the
number of incoming edges to a node where a node represents an element itself in XML document.
The first step to generate the relational schema from available DTD is to simplify the
complexities of DTD i.e. solve +, *, |,? operators of DTD.
+ Operator means one or many. i.e. one element can occur one or many times in XML document.
* Operator means zero or many i.e. one element can occur zero or more times in XML document.
| Operator means choice.
? Optional operator means whether element occurs or not
Suppose e, e1, e2 denotes elements itself. Then Resolve DTD by these rules :-
1. e** → e*
2. (e+)+ → e*
3. e+ → e*
4. e1e2→ (e1, e2)
5. (e, e1)  (e, e2) → (e, e1, e2)
6. e? → e
7. e - - e - - → e*
8. e*-- -- e → e*
9. e - --- e* → e*
10. e* - - - e* → e*
Next step is to make the graph of the simplified DTD. A DTD graph is the graphical
representation of the XML document type definitions. In this graph, nodes are represented by
elements and attributes of DTD and edges represents their parent-child relationships. Cycles
represents recursion between elements of the documents. We represent two edges in this graph.
Single edge →
Multiple edge →*
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
25
Elements having single occurrences are denoted by single edge and having multiple occurrences
are denoted by multiple *edges. After making simplified DTD graph, last step is to generate
relational schema based on simplified DTD graph. This is the algorithm for generating relational
schema:-
Algorithm1: XML DTD Mapping
Input: DTD graph
Output: Relational Schema
1. For the nodes in the DTD graph having indegree = 0 ,Create Relation.
1.1. In relation, introduce an attribute id which serves as primary key in the relation.
1.2. Introduce an attribute Rootelem for that element to specify whether the element is a root
or not.
2. For the nodes having indegree = 1 Inline them with their parent elements.
3. For the nodes having indegree >1 with single occurrence, inline them with their parent
elements.
4. For the nodes having multiple occurrence i.e. *edge in DTD graph having indegree =1,
Create relation.
4.1. In Relation, introduce the attribute id.
4.2. Introduce attribute parent id to specify the id of parent element.
5. Nodes having multiple occurrences with indegree > 1 ,Create Relation.
5.1. In Relation, introduce an attribute id for identification for that node.
5.2. Introduce an attribute Parent type to specify the parent element.
5.3. Introduce an attribute Parent id to specify the id of the parent.
5.4. Introduce an attribute Node type to specify the name of node.
6. If the nodes having cycles in the DTD graph then
6.1. If cycle contains only single edges create relation for one of the element in the cycle.
6.2. If the cycle contains one *edge and one single edge. Make Relations for both of the
elements with an id reference of single edge element stored in *edge element.
6.3. If the cycle contains both *edges then create Relations for both of the elements.
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
26
4. A COMPLETE EXAMPLE
<! Doctype Document [
<! ELEMENT Document (company)>
<! ELEMENT company (dept*, employee+, projects+, cname)>
<! ELEMENT dept (dname?, employee+, company)>
<! ELEMENT employee (id, name, address, designation, project+)>
<! ELEMENT project (ptitle, pno  location)>
<! ELEMENT name (firstname?, lastname?)>
<! ELEMENT ptitle (# PCDATA) >.
<! ELEMENT dname (# PCDATA) >.
<! ELEMENT pno (# PCDATA>)
<! ELEMENT location (# PCDATA) >
<! ELEMENT address (# PCDATA) >
<! ELEMENT designation (# PCDATA)>
<! ELEMENT cname (# PCDATA)>
<! ELEMENT firstname (# PCDATA)>
<! ELEMENT lastname (# PCDATA)>
]
After simplification of DTD’s by applying the rules given above, the DTD becomes:
<! ELEMENT company (dept*, employee*, project*, cname)>
<! ELEMENT dept (dname, employee*, company)>
<! ELEMENT employee (id, name, designation, project*)>
<! ELEMENT project(ptitle, pno, location)>
<! ELEMENT name (first name, last name)>
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
27
Figure 1: DTD graph
In this case, we are having cycles at root element. Two kinds of cycles exist here. One cycle
exists between employee and project element. Both elements have multiple occurances and
recursive to each other. Second cycle exists between company and dept. Company is the root
element of the document. Dept has multiple occurances. Both cycles will be handled according to
the algorithm given above.
The corresponding relational schema. becomes:-
Company (id, root elem : true, cname)
Dept (id, parent id, dname ,company. id)
Employee (id, parent type, parent id, node type, name.firstname, name.lastname, address,
designation)
Project (id, parent id, parent type, node type, pno, ptitle, location)
5. CONCLUSIONS
In this paper we have given the inlining algorithm which will generate relational schema of the
given XML DTD. Although recursion in XML documents is not very common but we have
discussed it and also elaborate that how it will be handled while generating the relations. In future
research will extend this concept in query processing of XML documents and translation of XML
queries in to SQL queries.
REFERENCES
[1] J.Shanmugasundaram, K. Tufte, C. Zhang, G.He, D. Dewitt, J. Naughton, Relational Databases for
Querying XML Documents: Limitations and opportunities, VLDB 1999, pp : 302-314.
[2] M. Atay, A Chebotko, D. L iu, S. Lu, F. Fotoubi , Efficient schema based XML to relational data
mapping, Information systems, Elesevier 2005.
[3] S.Lu,Y. Sun, M.Atay, F. Fotouhi, A New inlining algorithm for mapping XML DTDS to relational
schema, In Proceedings of the First International Work-shop on XML Schema and Data Management
, in conjuction with the 22nd ACM International Conference on Conceptual Modeling , Chicago, IL,
October 2003.
[4] M. YoshiKawa., T.Amagasa, T. Shtimura, XREL: A path based approach to storage and retrieval of
XML documents using relational databases, ACM Transactions on Internet Technology, 1(1) ,
pp:110-141, August 2001.
[5] J. Qin, S.Zhao, S. Yang, W. Dau, XPEV: A Storage Approach for Well-Formed XML Documents,
FKSD, LNAI 3613, 2005, pp.360-369.
International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013
28
[6] D.Florescu, D. Kossman, A Performance Evaluation of Alternative Mapping Schemes for storing
XML Data in a Relational Database, Rapport de Recherche No. 3680 INRIA, Rocquencourt,
France,1999.
[7] A.Salminen, F.Wm, Requirements for XML Document Database Systems. First ACM Symposium on
Document Engineering, Atlanta. 2001 pp:85-94.
[8] H. Zafari, K. Hasami, M.Ebrahim Shiri, Xlight, An efficient relational schema to store and query
XML data, In the proceedings of the IEEE International conference in Data Store and Data
Engineering 2011, pp:254-257.
[9] M.Ibrahim Fakharaldien, J.Mohamed Zain, N. Sulaiman, XRecursive: An efficient method to store
and query XML documents, Australian Journal of basic and Applied Sciences, 5(12) 2011 pp: 2910-
2916.
[10] M.Sharkawi, N. Tazi, LNV: Relational database Storage structure for XML documents, The 3rd
ACS/IEEE International Conference On Computer Systems and Applications, 2005, pp:49-56.

More Related Content

What's hot

Data structures and algorithms alfred v. aho, john e. hopcroft and jeffrey ...
Data structures and algorithms   alfred v. aho, john e. hopcroft and jeffrey ...Data structures and algorithms   alfred v. aho, john e. hopcroft and jeffrey ...
Data structures and algorithms alfred v. aho, john e. hopcroft and jeffrey ...Chethan Nt
 
Efficiency of TreeMatch Algorithm in XML Tree Pattern Matching
Efficiency of TreeMatch Algorithm in XML Tree Pattern  MatchingEfficiency of TreeMatch Algorithm in XML Tree Pattern  Matching
Efficiency of TreeMatch Algorithm in XML Tree Pattern MatchingIOSR Journals
 
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...ijdms
 
Overview of Object-Oriented Concepts Characteristics by vikas jagtap
Overview of Object-Oriented Concepts Characteristics by vikas jagtapOverview of Object-Oriented Concepts Characteristics by vikas jagtap
Overview of Object-Oriented Concepts Characteristics by vikas jagtapVikas Jagtap
 
Linked List Problems
Linked List ProblemsLinked List Problems
Linked List ProblemsSriram Raj
 
Data Structures and Algorithms for Big Databases
Data Structures and Algorithms for Big DatabasesData Structures and Algorithms for Big Databases
Data Structures and Algorithms for Big Databasesomnidba
 
Exploiting web search engines to search structured
Exploiting web search engines to search structuredExploiting web search engines to search structured
Exploiting web search engines to search structuredNita Pawar
 
Object oriented concepts
Object oriented conceptsObject oriented concepts
Object oriented conceptschristradus
 
QUERY INVERSION TO FIND DATA PROVENANCE
QUERY INVERSION TO FIND DATA PROVENANCE QUERY INVERSION TO FIND DATA PROVENANCE
QUERY INVERSION TO FIND DATA PROVENANCE cscpconf
 
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE ijdms
 
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)Beat Signer
 
Introduction to Data Science With R Notes
Introduction to Data Science With R NotesIntroduction to Data Science With R Notes
Introduction to Data Science With R NotesLakshmiSarvani6
 
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...IJECEIAES
 
Image-Based Literal Node Matching for Linked Data Integration
Image-Based Literal Node Matching for Linked Data IntegrationImage-Based Literal Node Matching for Linked Data Integration
Image-Based Literal Node Matching for Linked Data IntegrationIJwest
 
Dbms important questions and answers
Dbms important questions and answersDbms important questions and answers
Dbms important questions and answersLakshmiSarvani6
 

What's hot (20)

Data structures and algorithms alfred v. aho, john e. hopcroft and jeffrey ...
Data structures and algorithms   alfred v. aho, john e. hopcroft and jeffrey ...Data structures and algorithms   alfred v. aho, john e. hopcroft and jeffrey ...
Data structures and algorithms alfred v. aho, john e. hopcroft and jeffrey ...
 
Efficiency of TreeMatch Algorithm in XML Tree Pattern Matching
Efficiency of TreeMatch Algorithm in XML Tree Pattern  MatchingEfficiency of TreeMatch Algorithm in XML Tree Pattern  Matching
Efficiency of TreeMatch Algorithm in XML Tree Pattern Matching
 
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...
NOSQL IMPLEMENTATION OF A CONCEPTUAL DATA MODEL: UML CLASS DIAGRAM TO A DOCUM...
 
Sql ppt
Sql pptSql ppt
Sql ppt
 
Overview of Object-Oriented Concepts Characteristics by vikas jagtap
Overview of Object-Oriented Concepts Characteristics by vikas jagtapOverview of Object-Oriented Concepts Characteristics by vikas jagtap
Overview of Object-Oriented Concepts Characteristics by vikas jagtap
 
Linked List Problems
Linked List ProblemsLinked List Problems
Linked List Problems
 
Oodbms ch 20
Oodbms ch 20Oodbms ch 20
Oodbms ch 20
 
Data Structures and Algorithms for Big Databases
Data Structures and Algorithms for Big DatabasesData Structures and Algorithms for Big Databases
Data Structures and Algorithms for Big Databases
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Exploiting web search engines to search structured
Exploiting web search engines to search structuredExploiting web search engines to search structured
Exploiting web search engines to search structured
 
Object oriented concepts
Object oriented conceptsObject oriented concepts
Object oriented concepts
 
QUERY INVERSION TO FIND DATA PROVENANCE
QUERY INVERSION TO FIND DATA PROVENANCE QUERY INVERSION TO FIND DATA PROVENANCE
QUERY INVERSION TO FIND DATA PROVENANCE
 
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE
O NTOLOGY B ASED D OCUMENT C LUSTERING U SING M AP R EDUCE
 
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)
Relational Database Design - Lecture 4 - Introduction to Databases (1007156ANR)
 
Introduction to Data Science With R Notes
Introduction to Data Science With R NotesIntroduction to Data Science With R Notes
Introduction to Data Science With R Notes
 
Object oriented database
Object oriented databaseObject oriented database
Object oriented database
 
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...
Optimisation towards Latent Dirichlet Allocation: Its Topic Number and Collap...
 
Image-Based Literal Node Matching for Linked Data Integration
Image-Based Literal Node Matching for Linked Data IntegrationImage-Based Literal Node Matching for Linked Data Integration
Image-Based Literal Node Matching for Linked Data Integration
 
Dbms important questions and answers
Dbms important questions and answersDbms important questions and answers
Dbms important questions and answers
 
Bt0066 dbms
Bt0066 dbmsBt0066 dbms
Bt0066 dbms
 

Viewers also liked

SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHY
SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHYSERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHY
SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHYijwscjournal
 
Nch taps new medical chief
Nch taps new medical chiefNch taps new medical chief
Nch taps new medical chiefNCHFoundation
 
Mobile web service provisioning and
Mobile web service provisioning andMobile web service provisioning and
Mobile web service provisioning andijwscjournal
 
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...ijwscjournal
 
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...ijwscjournal
 
Bonfire night
Bonfire nightBonfire night
Bonfire nightaumatell
 
EDUC 7101 Assistive Technology and the iPad
EDUC 7101 Assistive Technology and the iPadEDUC 7101 Assistive Technology and the iPad
EDUC 7101 Assistive Technology and the iPadTroy Gordon
 
Marketing plan 11 18-06
Marketing plan 11 18-06Marketing plan 11 18-06
Marketing plan 11 18-06Amy Virgo CTA
 
A Literature Review on Trust Management in Web Services Access Control
A Literature Review on Trust Management in Web Services Access ControlA Literature Review on Trust Management in Web Services Access Control
A Literature Review on Trust Management in Web Services Access Controlijwscjournal
 
Semantic web approach towards interoperability and privacy issues in social n...
Semantic web approach towards interoperability and privacy issues in social n...Semantic web approach towards interoperability and privacy issues in social n...
Semantic web approach towards interoperability and privacy issues in social n...ijwscjournal
 
Menuju keseimbangan diri
Menuju keseimbangan diriMenuju keseimbangan diri
Menuju keseimbangan diriLembayung Senja
 
Modul Libre office calc
Modul Libre office calcModul Libre office calc
Modul Libre office calcRaul Amaral
 
Ordem basiku iha Ubuntu
Ordem basiku iha UbuntuOrdem basiku iha Ubuntu
Ordem basiku iha UbuntuRaul Amaral
 
Prostata ir jos masažas sekso prekes
Prostata  ir jos masažas sekso prekesProstata  ir jos masažas sekso prekes
Prostata ir jos masažas sekso prekesIntymu2010
 
5 analinio sekso patarimai
5 analinio sekso patarimai5 analinio sekso patarimai
5 analinio sekso patarimaiIntymu2010
 
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATION
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATIONASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATION
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATIONijwscjournal
 

Viewers also liked (17)

SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHY
SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHYSERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHY
SERVICE SELECTION BASED ON DOMINANT ROLE OF THE CHOREOGRAPHY
 
Nch taps new medical chief
Nch taps new medical chiefNch taps new medical chief
Nch taps new medical chief
 
Mobile web service provisioning and
Mobile web service provisioning andMobile web service provisioning and
Mobile web service provisioning and
 
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...
SOME INTEROPERABILITY ISSUES IN THE DESIGNING OF WEB SERVICES : CASE STUDY ON...
 
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...
SPATIAL ANALYSIS ABOUT USERS COLLABORATION ON GEO-SOCIAL NETWORKS IN A BRAZIL...
 
Bonfire night
Bonfire nightBonfire night
Bonfire night
 
EDUC 7101 Assistive Technology and the iPad
EDUC 7101 Assistive Technology and the iPadEDUC 7101 Assistive Technology and the iPad
EDUC 7101 Assistive Technology and the iPad
 
Marketing plan 11 18-06
Marketing plan 11 18-06Marketing plan 11 18-06
Marketing plan 11 18-06
 
A Literature Review on Trust Management in Web Services Access Control
A Literature Review on Trust Management in Web Services Access ControlA Literature Review on Trust Management in Web Services Access Control
A Literature Review on Trust Management in Web Services Access Control
 
Semantic web approach towards interoperability and privacy issues in social n...
Semantic web approach towards interoperability and privacy issues in social n...Semantic web approach towards interoperability and privacy issues in social n...
Semantic web approach towards interoperability and privacy issues in social n...
 
4. metode transportasi
4. metode transportasi4. metode transportasi
4. metode transportasi
 
Menuju keseimbangan diri
Menuju keseimbangan diriMenuju keseimbangan diri
Menuju keseimbangan diri
 
Modul Libre office calc
Modul Libre office calcModul Libre office calc
Modul Libre office calc
 
Ordem basiku iha Ubuntu
Ordem basiku iha UbuntuOrdem basiku iha Ubuntu
Ordem basiku iha Ubuntu
 
Prostata ir jos masažas sekso prekes
Prostata  ir jos masažas sekso prekesProstata  ir jos masažas sekso prekes
Prostata ir jos masažas sekso prekes
 
5 analinio sekso patarimai
5 analinio sekso patarimai5 analinio sekso patarimai
5 analinio sekso patarimai
 
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATION
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATIONASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATION
ASPECTUAL PATTERNS FOR WEB SERVICES ADAPTATION
 

Similar to SCHEMA BASED STORAGE OF XML DOCUMENTS IN RELATIONAL DATABASES

Innovative way for normalizing xml document
Innovative way for normalizing xml documentInnovative way for normalizing xml document
Innovative way for normalizing xml documentAlexander Decker
 
Xml Presentation-1
Xml Presentation-1Xml Presentation-1
Xml Presentation-1Sudharsan S
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalFaegheh Hasibi
 
Semantic Knowledge Acquisition of Information for Syntactic web
Semantic Knowledge Acquisition of Information for Syntactic web Semantic Knowledge Acquisition of Information for Syntactic web
Semantic Knowledge Acquisition of Information for Syntactic web dannyijwest
 
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSING
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSINGFEATURES MATCHING USING NATURAL LANGUAGE PROCESSING
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSINGIJCI JOURNAL
 
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...BRNSSPublicationHubI
 
Catalog-based Conversion from Relational Database into XML Schema (XSD)
Catalog-based Conversion from Relational Database into XML Schema (XSD)Catalog-based Conversion from Relational Database into XML Schema (XSD)
Catalog-based Conversion from Relational Database into XML Schema (XSD)CSCJournals
 
chapter 1 Introduction to Ds and Algorithm Anyasis.pptx
chapter 1 Introduction to Ds and Algorithm Anyasis.pptxchapter 1 Introduction to Ds and Algorithm Anyasis.pptx
chapter 1 Introduction to Ds and Algorithm Anyasis.pptxAmrutaNavale2
 
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASESEFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASESIJCSEIT Journal
 
Duplicate Detection in Hierarchical Data Using XPath
Duplicate Detection in Hierarchical Data Using XPathDuplicate Detection in Hierarchical Data Using XPath
Duplicate Detection in Hierarchical Data Using XPathiosrjce
 
RELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESRELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESijwscjournal
 
RELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESRELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESijwscjournal
 
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...IRJET Journal
 
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLsMethodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLsIRJET Journal
 

Similar to SCHEMA BASED STORAGE OF XML DOCUMENTS IN RELATIONAL DATABASES (20)

Innovative way for normalizing xml document
Innovative way for normalizing xml documentInnovative way for normalizing xml document
Innovative way for normalizing xml document
 
Xml11
Xml11Xml11
Xml11
 
Xml Presentation-1
Xml Presentation-1Xml Presentation-1
Xml Presentation-1
 
Xml
XmlXml
Xml
 
XML.ppt
XML.pptXML.ppt
XML.ppt
 
Xml
XmlXml
Xml
 
Exploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity RetrievalExploiting Entity Linking in Queries For Entity Retrieval
Exploiting Entity Linking in Queries For Entity Retrieval
 
Semantic Knowledge Acquisition of Information for Syntactic web
Semantic Knowledge Acquisition of Information for Syntactic web Semantic Knowledge Acquisition of Information for Syntactic web
Semantic Knowledge Acquisition of Information for Syntactic web
 
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSING
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSINGFEATURES MATCHING USING NATURAL LANGUAGE PROCESSING
FEATURES MATCHING USING NATURAL LANGUAGE PROCESSING
 
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...
A Study on Mapping Semi-Structured Data to a Structured Data for Inverted Ind...
 
Catalog-based Conversion from Relational Database into XML Schema (XSD)
Catalog-based Conversion from Relational Database into XML Schema (XSD)Catalog-based Conversion from Relational Database into XML Schema (XSD)
Catalog-based Conversion from Relational Database into XML Schema (XSD)
 
chapter 1 Introduction to Ds and Algorithm Anyasis.pptx
chapter 1 Introduction to Ds and Algorithm Anyasis.pptxchapter 1 Introduction to Ds and Algorithm Anyasis.pptx
chapter 1 Introduction to Ds and Algorithm Anyasis.pptx
 
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASESEFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
EFFICIENT SCHEMA BASED KEYWORD SEARCH IN RELATIONAL DATABASES
 
J017616976
J017616976J017616976
J017616976
 
Duplicate Detection in Hierarchical Data Using XPath
Duplicate Detection in Hierarchical Data Using XPathDuplicate Detection in Hierarchical Data Using XPath
Duplicate Detection in Hierarchical Data Using XPath
 
RELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESRELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULES
 
RELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULESRELATIONAL STORAGE FOR XML RULES
RELATIONAL STORAGE FOR XML RULES
 
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...
Semantic Semi-Structured Documents of Least Edit Distance (LED) Calculation f...
 
XML Schema.pptx
XML Schema.pptxXML Schema.pptx
XML Schema.pptx
 
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLsMethodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
Methodology for Managing Dynamic Collections on Semantic Semi-Structured XMLs
 

Recently uploaded

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Recently uploaded (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

SCHEMA BASED STORAGE OF XML DOCUMENTS IN RELATIONAL DATABASES

  • 1. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 DOI : 10.5121/ijwsc.2013.4202 23 SCHEMA BASED STORAGE OF XML DOCUMENTS IN RELATIONAL DATABASES Dr. Pushpa Suri1 and Divyesh Sharma2 1 Department of Computer Science and Applications, Kurukshetra University, Kurukshetra pushpa.suri@yahoo.co.in 2 Department of Computer Science and Applications, Kurukshetra University, Kurukshetra divyeshsharma89@gmail.com ABSTRACT XML (Extensible Mark up language) is emerging as a tool for representing and exchanging data over the internet. When we want to store and query XML data, we can use two approaches either by using native databases or XML enabled databases. In this paper we deal with XML enabled databases. We use relational databases to store XML documents. In this paper we focus on mapping of XML DTD into relations. Mapping needs three steps: 1) Simplify Complex DTD’s 2) Make DTD graph by using simplified DTD’s 3) Generate Relational schema. We present an inlining algorithm for generating relational schemas from available DTD’s. This algorithm also handles recursion in an XML document. KEYWORDS XML, Relational databases, Schema Based Storage 1. INTRODUCTION As XML (extensible markup language) documents needs efficient storage. The concept of Relational data bases provides more mature way to store and query these documents. Research on storage of XML documents using relational databases are classified by two kinds : Schema oblivious storage, Schema Based storage .Schema oblivious storage does not use any DTD (document type definition) or XML schema to map XML document in to relations[4][5][6][7][8][9][10]. Schema based storage requires DTD attached with the XML documents itself . In this paper we discuss mapping of XML DTD into relations with handling of recursion. 2. RELATED WORK Shanmugasundaram [1] , provides three basic Inlining algorithims for mapping XML DTD into relations :Basic, shared and Hybrid. In basic inlining, for each element in XML document, a relation is created. This will lead to excessive relations and when we process queries to these relations, this will lead to excessive joins. In shared inlining, the elements having single occurance are inlined to its parent elements. This will reduce the number of relations as compared to Basic. In Hybrid, shared element i.e. the element having single occurance but shared by many
  • 2. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 24 parents are also inlined to its parent elements. Mustafa Atay [2] extends these algorithims by providing the concept of endID. endID is the maximum id of the descendants of an element . For every element that is to be inlined, an endID is stored. This is helpful when we reconstruct the document. .Lu [3] provides a seperate edge table for storing elements having multiple occurances. 3. PROPOSED WORK We discuss the concept of Inlining in detail. Inlining is basically to store the information of the child element with its parent element in the same relation. Elements having indegree = 1 i.e. the element having one parent can be inlined into same relation of its parent element. Indegree is the number of incoming edges to a node where a node represents an element itself in XML document. The first step to generate the relational schema from available DTD is to simplify the complexities of DTD i.e. solve +, *, |,? operators of DTD. + Operator means one or many. i.e. one element can occur one or many times in XML document. * Operator means zero or many i.e. one element can occur zero or more times in XML document. | Operator means choice. ? Optional operator means whether element occurs or not Suppose e, e1, e2 denotes elements itself. Then Resolve DTD by these rules :- 1. e** → e* 2. (e+)+ → e* 3. e+ → e* 4. e1e2→ (e1, e2) 5. (e, e1)  (e, e2) → (e, e1, e2) 6. e? → e 7. e - - e - - → e* 8. e*-- -- e → e* 9. e - --- e* → e* 10. e* - - - e* → e* Next step is to make the graph of the simplified DTD. A DTD graph is the graphical representation of the XML document type definitions. In this graph, nodes are represented by elements and attributes of DTD and edges represents their parent-child relationships. Cycles represents recursion between elements of the documents. We represent two edges in this graph. Single edge → Multiple edge →*
  • 3. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 25 Elements having single occurrences are denoted by single edge and having multiple occurrences are denoted by multiple *edges. After making simplified DTD graph, last step is to generate relational schema based on simplified DTD graph. This is the algorithm for generating relational schema:- Algorithm1: XML DTD Mapping Input: DTD graph Output: Relational Schema 1. For the nodes in the DTD graph having indegree = 0 ,Create Relation. 1.1. In relation, introduce an attribute id which serves as primary key in the relation. 1.2. Introduce an attribute Rootelem for that element to specify whether the element is a root or not. 2. For the nodes having indegree = 1 Inline them with their parent elements. 3. For the nodes having indegree >1 with single occurrence, inline them with their parent elements. 4. For the nodes having multiple occurrence i.e. *edge in DTD graph having indegree =1, Create relation. 4.1. In Relation, introduce the attribute id. 4.2. Introduce attribute parent id to specify the id of parent element. 5. Nodes having multiple occurrences with indegree > 1 ,Create Relation. 5.1. In Relation, introduce an attribute id for identification for that node. 5.2. Introduce an attribute Parent type to specify the parent element. 5.3. Introduce an attribute Parent id to specify the id of the parent. 5.4. Introduce an attribute Node type to specify the name of node. 6. If the nodes having cycles in the DTD graph then 6.1. If cycle contains only single edges create relation for one of the element in the cycle. 6.2. If the cycle contains one *edge and one single edge. Make Relations for both of the elements with an id reference of single edge element stored in *edge element. 6.3. If the cycle contains both *edges then create Relations for both of the elements.
  • 4. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 26 4. A COMPLETE EXAMPLE <! Doctype Document [ <! ELEMENT Document (company)> <! ELEMENT company (dept*, employee+, projects+, cname)> <! ELEMENT dept (dname?, employee+, company)> <! ELEMENT employee (id, name, address, designation, project+)> <! ELEMENT project (ptitle, pno  location)> <! ELEMENT name (firstname?, lastname?)> <! ELEMENT ptitle (# PCDATA) >. <! ELEMENT dname (# PCDATA) >. <! ELEMENT pno (# PCDATA>) <! ELEMENT location (# PCDATA) > <! ELEMENT address (# PCDATA) > <! ELEMENT designation (# PCDATA)> <! ELEMENT cname (# PCDATA)> <! ELEMENT firstname (# PCDATA)> <! ELEMENT lastname (# PCDATA)> ] After simplification of DTD’s by applying the rules given above, the DTD becomes: <! ELEMENT company (dept*, employee*, project*, cname)> <! ELEMENT dept (dname, employee*, company)> <! ELEMENT employee (id, name, designation, project*)> <! ELEMENT project(ptitle, pno, location)> <! ELEMENT name (first name, last name)>
  • 5. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 27 Figure 1: DTD graph In this case, we are having cycles at root element. Two kinds of cycles exist here. One cycle exists between employee and project element. Both elements have multiple occurances and recursive to each other. Second cycle exists between company and dept. Company is the root element of the document. Dept has multiple occurances. Both cycles will be handled according to the algorithm given above. The corresponding relational schema. becomes:- Company (id, root elem : true, cname) Dept (id, parent id, dname ,company. id) Employee (id, parent type, parent id, node type, name.firstname, name.lastname, address, designation) Project (id, parent id, parent type, node type, pno, ptitle, location) 5. CONCLUSIONS In this paper we have given the inlining algorithm which will generate relational schema of the given XML DTD. Although recursion in XML documents is not very common but we have discussed it and also elaborate that how it will be handled while generating the relations. In future research will extend this concept in query processing of XML documents and translation of XML queries in to SQL queries. REFERENCES [1] J.Shanmugasundaram, K. Tufte, C. Zhang, G.He, D. Dewitt, J. Naughton, Relational Databases for Querying XML Documents: Limitations and opportunities, VLDB 1999, pp : 302-314. [2] M. Atay, A Chebotko, D. L iu, S. Lu, F. Fotoubi , Efficient schema based XML to relational data mapping, Information systems, Elesevier 2005. [3] S.Lu,Y. Sun, M.Atay, F. Fotouhi, A New inlining algorithm for mapping XML DTDS to relational schema, In Proceedings of the First International Work-shop on XML Schema and Data Management , in conjuction with the 22nd ACM International Conference on Conceptual Modeling , Chicago, IL, October 2003. [4] M. YoshiKawa., T.Amagasa, T. Shtimura, XREL: A path based approach to storage and retrieval of XML documents using relational databases, ACM Transactions on Internet Technology, 1(1) , pp:110-141, August 2001. [5] J. Qin, S.Zhao, S. Yang, W. Dau, XPEV: A Storage Approach for Well-Formed XML Documents, FKSD, LNAI 3613, 2005, pp.360-369.
  • 6. International Journal on Web Service Computing (IJWSC), Vol.4, No.2, June 2013 28 [6] D.Florescu, D. Kossman, A Performance Evaluation of Alternative Mapping Schemes for storing XML Data in a Relational Database, Rapport de Recherche No. 3680 INRIA, Rocquencourt, France,1999. [7] A.Salminen, F.Wm, Requirements for XML Document Database Systems. First ACM Symposium on Document Engineering, Atlanta. 2001 pp:85-94. [8] H. Zafari, K. Hasami, M.Ebrahim Shiri, Xlight, An efficient relational schema to store and query XML data, In the proceedings of the IEEE International conference in Data Store and Data Engineering 2011, pp:254-257. [9] M.Ibrahim Fakharaldien, J.Mohamed Zain, N. Sulaiman, XRecursive: An efficient method to store and query XML documents, Australian Journal of basic and Applied Sciences, 5(12) 2011 pp: 2910- 2916. [10] M.Sharkawi, N. Tazi, LNV: Relational database Storage structure for XML documents, The 3rd ACS/IEEE International Conference On Computer Systems and Applications, 2005, pp:49-56.