SlideShare a Scribd company logo
1 of 28
By-
Neha Kulkarni
ME Computer
Pune Insitute of Computer Technology
๏ฝ What is spatial data?
๏ฝ Types of spatial data
๏ฝ Types of queries
๏ฝ Applications
๏ฝ Indexing Techniques
๏ฝ Comparison of Indexing techniques
๏ฝ GiST
๏ฝ Indexing High-dimensional data
๏ฝ Conclusion
โ€ข Spatial data represent the location ,size
and shape of an object on earth
โ€ข Ex. Building, lake
Point data: Line data: polygon data:
๏ฝ Point Data:
๏ฝ Simplest form of representing spatial data
๏ฝ No space and has no associated area or volume
๏ฝ Consists of collection of points
๏ฝ Ex. Raster data
๏ฝ Region data:
๏ฝ Has spatial extend with location and boundaries
๏ฝ Represented using of points, line, polygons
๏ฝ Ex. roads, rivers: line data
1) Spatial range queries:
Related with region data
Ex. โ€œFind all cities within 50 miles of Puneโ€
2)Nearest Neighbor queries:
Related with point data
Ex. โ€œFind 10 cities nearest to Puneโ€
In ordered cities
Use in multimedia database
3) Spatial Join Queries:
- use both point and region data.
- Ex. โ€œFind pairs of cities within 200 miles of each other
AND โ€œ Find all cities near a lakeโ€
- More complex
- Expensive to evaluate
1) Geographic Information System(GIS)
Ex. MAP
2) Computer Aided Design/Manufacturing(CAD/CAM)
Ex. Surface of design object
Range and Spatial join queries used
3) Multimedia Database
video, audio, image, text also required spatial data
Nearest neighbor queries and point data
Point Data: Grid files, แธฅE trees, Kdtrees, point quad
trees
Region data: Quad trees, R trees, SKD trees,
-Yet no best indexing technique
- R trees are commonly used :
due to simplicity, ability to handle both data
performance to complex queries
Three main indexing techniques :
๏ฝ Region Quad-Trees and Z-Ordering โ€“ handle
both point and region data
๏ฝ Grid Files โ€“ only point data
๏ฝ R-Trees โ€“ handle both point and region data
๏ฝ Z-ordering gives us a way to group points
according to spatial proximity.
๏ฝ Consider X-01 and Y-11
๏ฝ Z-value is 0111 by interleaving
X and Y values.
This gives us the value for the
point 7.
Space filling curves
The Region Quad tree structure corresponds directly to the
recursive decomposition of the data space.
Each node in the tree corresponds to a square-shaped region of
the data space.
๏ฝ Grid files rely upon a grid directory to identify the
data page containing a desired point.
๏ฝ The Grid file partitions space into rectangular
regions using lines that are parallel to the axes.
๏ฝ If the X axis is cut into i segments and the Y
axis is cut into j segments, we have a total of i
x j partitions. The grid directory is an i by j
array with one entry per partition.
๏ฝ This description is maintained in an array
called a linear scale; there is one linear scale
per axis.
Searching for a point in a grid file
Inserting points in a Grid File
๏ฝ Adaptation of B+ Tree
๏ฝ Height-balanced data structure
๏ฝ Search key values are referred to as Bounding
Boxes
๏ฝ A data entry consists of a pair (n-dimensional
box, Rid)
๏ฝ Rid โ€“ object Identifier
๏ฝ N-dimensional box is the smallest box that
contains the object
An example R-Tree
๏ฝ Search for Objects Overlapping Box Q
Start at root.
1. If current node is non-leaf, for each
entry <E, ptr>, if box E overlaps Q,
search subtree identified by ptr.
2. If current node is leaf, for each entry
<E, rid>, if E overlaps Q, rid identifies
an object that might overlap Q.
Insert Entry <B, ptr>
๏ฝ Start at root and go down to โ€œbest-fitโ€ leaf L.
๏ฝ Go to child whose box needs least enlargement to
cover B; resolve ties by going to smallest area child.
๏ฝ If best-fit leaf L has space, insert entry and stop.
Otherwise, split L into L1 and L2.
๏ฝ Adjust entry for L in its parent so that the box now
covers (only) L1.
๏ฝ Add an entry (in the parent node of L) for L2. (This
could cause the parent node to recursively split.)
Region Quad
Trees
Grid Files(point
data)
R-Trees
Range Queries Easily handled Easily handled
for point data.
Handled by
calculating
bounding box
Nearest
Neighbour
Queries
Can be
handled.
Sometimes
tricky due to
long diagonal
jumps
Easily handled
for point data.
Handled well
by traversing
for the point
or region
Spatial Joins Can be
handled with
some
extension to
range queries
Easily handled
for point data.
Handled very
well
๏ฝ The Generalized Search Tree (GiST) abstracts the
โ€œtreeโ€ nature of a class of indexes including B+ trees
and R-tree variants.
๏ฝ Striking similarities in insert/delete/search and even
concurrency control algorithms make it possible to
provide โ€œtemplatesโ€ for these algorithms that can be
customized to obtain the many different tree index
structures.
๏ฝ GiST provides an alternative for implementing other
tree indexes in an ORDBMS.
๏ฝ Typically, high-dimensional datasets are collections
of points, not regions.
๏ฝ E.g., Feature vectors in multimedia applications.
๏ฝ Very sparse
๏ฝ Nearest neighbor queries are common.
๏ฝ R-tree becomes worse than sequential scan for most
datasets with more than a dozen dimensions.
๏ฝ As dimensionality increases contrast (ratio of
distances between nearest and farthest points)
usually decreases; โ€œnearest neighborโ€ is not
meaningful.
๏ฝ Spatial data management has many
applications, including GIS, CAD/CAM,
multimedia indexing, Point and region data
๏ฝ R-tree approach is widely used in GIS
systems
๏ฝ Used in spatial data mining approaches.
๏ฝ Popular SDBMS : MySQL(geometry datatype),
Neo4j, AllegroGraph, SpaceBase, CouchDB,
PostGreSQL, SpatialDB
๏ฝ โ€œDatabase Management Systemsโ€ by Raghu
Ramakrishnan, 3rd Edition
๏ฝ www.techopedia.com/definition
๏ฝ dna.fernuni-hagen.de/IntroSpatialDBMS
๏ฝ www.geol-amu.org/notes
Spatial databases
Spatial databases

More Related Content

What's hot

Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
ankur bhalla
ย 
Spatial enhancement
Spatial enhancement Spatial enhancement
Spatial enhancement
abinarkt
ย 

What's hot (20)

Spatial Data Model
Spatial Data ModelSpatial Data Model
Spatial Data Model
ย 
TYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing SystemsTYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
TYBSC IT PGIS Unit II Chapter I Data Management and Processing Systems
ย 
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
TYBSC IT PGIS Unit I  Chapter I- Introduction to Geographic Information SystemsTYBSC IT PGIS Unit I  Chapter I- Introduction to Geographic Information Systems
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
ย 
215 spatial db
215 spatial db215 spatial db
215 spatial db
ย 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
ย 
Geographic Phenomena
Geographic PhenomenaGeographic Phenomena
Geographic Phenomena
ย 
TYBSC IT PGIS Unit IV Spacial Data Analysis
TYBSC IT PGIS Unit IV  Spacial Data AnalysisTYBSC IT PGIS Unit IV  Spacial Data Analysis
TYBSC IT PGIS Unit IV Spacial Data Analysis
ย 
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, dataData mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, data
ย 
TYBSC IT PGIS Unit V Data Visualization
TYBSC IT PGIS Unit V  Data VisualizationTYBSC IT PGIS Unit V  Data Visualization
TYBSC IT PGIS Unit V Data Visualization
ย 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
ย 
Spatial enhancement
Spatial enhancement Spatial enhancement
Spatial enhancement
ย 
GIS - Topology
GIS - Topology GIS - Topology
GIS - Topology
ย 
Frequent itemset mining methods
Frequent itemset mining methodsFrequent itemset mining methods
Frequent itemset mining methods
ย 
Components of Spatial Data Quality in GIS
Components of Spatial Data Quality in GISComponents of Spatial Data Quality in GIS
Components of Spatial Data Quality in GIS
ย 
My ppt on gis
My ppt on gisMy ppt on gis
My ppt on gis
ย 
Conversion of Existing Data
Conversion of Existing DataConversion of Existing Data
Conversion of Existing Data
ย 
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial DatabaseTYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
ย 
Gis unit 3
Gis   unit 3Gis   unit 3
Gis unit 3
ย 
Geographic Information System unit 2
Geographic Information System unit 2Geographic Information System unit 2
Geographic Information System unit 2
ย 
SPATIAL DATABASES.pptx
SPATIAL DATABASES.pptxSPATIAL DATABASES.pptx
SPATIAL DATABASES.pptx
ย 

Similar to Spatial databases

Optimizing spatial database
Optimizing spatial databaseOptimizing spatial database
Optimizing spatial database
Ishraq Al Fataftah
ย 
23. Advanced Datatypes and New Application in DBMS
23. Advanced Datatypes and New Application in DBMS23. Advanced Datatypes and New Application in DBMS
23. Advanced Datatypes and New Application in DBMS
koolkampus
ย 
UNIT - III GIS DATA STRUCTURES (2).ppt
UNIT - III GIS DATA STRUCTURES (2).pptUNIT - III GIS DATA STRUCTURES (2).ppt
UNIT - III GIS DATA STRUCTURES (2).ppt
RamMishra65
ย 
UNIT - III GIS DATA STRUCTURES (1).ppt
UNIT - III GIS DATA STRUCTURES (1).pptUNIT - III GIS DATA STRUCTURES (1).ppt
UNIT - III GIS DATA STRUCTURES (1).ppt
RamMishra65
ย 

Similar to Spatial databases (20)

spatial databases ADBMS ppt
spatial databases ADBMS pptspatial databases ADBMS ppt
spatial databases ADBMS ppt
ย 
Optimizing spatial database
Optimizing spatial databaseOptimizing spatial database
Optimizing spatial database
ย 
Geographic Phenomena and their Representations
Geographic Phenomena and their RepresentationsGeographic Phenomena and their Representations
Geographic Phenomena and their Representations
ย 
23. Advanced Datatypes and New Application in DBMS
23. Advanced Datatypes and New Application in DBMS23. Advanced Datatypes and New Application in DBMS
23. Advanced Datatypes and New Application in DBMS
ย 
Spot db consistency checking and optimization in spatial database
Spot db  consistency checking and optimization in spatial databaseSpot db  consistency checking and optimization in spatial database
Spot db consistency checking and optimization in spatial database
ย 
UNIT - III GIS DATA STRUCTURES (2).ppt
UNIT - III GIS DATA STRUCTURES (2).pptUNIT - III GIS DATA STRUCTURES (2).ppt
UNIT - III GIS DATA STRUCTURES (2).ppt
ย 
UNIT - III GIS DATA STRUCTURES (1).ppt
UNIT - III GIS DATA STRUCTURES (1).pptUNIT - III GIS DATA STRUCTURES (1).ppt
UNIT - III GIS DATA STRUCTURES (1).ppt
ย 
Algorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial OperationsAlgorithms for Query Processing and Optimization of Spatial Operations
Algorithms for Query Processing and Optimization of Spatial Operations
ย 
Data models in geographical information system(GIS)
Data models in geographical information system(GIS)Data models in geographical information system(GIS)
Data models in geographical information system(GIS)
ย 
Chapter7.Revised
Chapter7.RevisedChapter7.Revised
Chapter7.Revised
ย 
DATA in GIS and DATA Query
DATA in GIS and DATA QueryDATA in GIS and DATA Query
DATA in GIS and DATA Query
ย 
Mujungi Davis
Mujungi DavisMujungi Davis
Mujungi Davis
ย 
Mujungi Davis
Mujungi DavisMujungi Davis
Mujungi Davis
ย 
Geoinformatics.pptx
Geoinformatics.pptxGeoinformatics.pptx
Geoinformatics.pptx
ย 
Lect 7 &amp; 8 types of vector data model-gis
Lect 7 &amp; 8 types of vector data model-gisLect 7 &amp; 8 types of vector data model-gis
Lect 7 &amp; 8 types of vector data model-gis
ย 
3D Analyst
3D Analyst3D Analyst
3D Analyst
ย 
Lect 5 data models-gis
Lect 5 data models-gisLect 5 data models-gis
Lect 5 data models-gis
ย 
Scattered gis handbook
Scattered gis handbookScattered gis handbook
Scattered gis handbook
ย 
Trees Information
Trees InformationTrees Information
Trees Information
ย 
Spatial database managment.pptx
Spatial database managment.pptxSpatial database managment.pptx
Spatial database managment.pptx
ย 

Recently uploaded

Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar โ‰ผ๐Ÿ” Delhi door step de...
Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar  โ‰ผ๐Ÿ” Delhi door step de...Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar  โ‰ผ๐Ÿ” Delhi door step de...
Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar โ‰ผ๐Ÿ” Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
SUHANI PANDEY
ย 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
ย 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
ย 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
MsecMca
ย 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
ย 

Recently uploaded (20)

Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
ย 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
ย 
Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar โ‰ผ๐Ÿ” Delhi door step de...
Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar  โ‰ผ๐Ÿ” Delhi door step de...Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar  โ‰ผ๐Ÿ” Delhi door step de...
Call Now โ‰ฝ 9953056974 โ‰ผ๐Ÿ” Call Girls In New Ashok Nagar โ‰ผ๐Ÿ” Delhi door step de...
ย 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
ย 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
ย 
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
ย 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
ย 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
ย 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
ย 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
ย 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
ย 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
ย 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
ย 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
ย 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
ย 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
ย 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
ย 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
ย 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
ย 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ย 

Spatial databases

  • 1. By- Neha Kulkarni ME Computer Pune Insitute of Computer Technology
  • 2. ๏ฝ What is spatial data? ๏ฝ Types of spatial data ๏ฝ Types of queries ๏ฝ Applications ๏ฝ Indexing Techniques ๏ฝ Comparison of Indexing techniques ๏ฝ GiST ๏ฝ Indexing High-dimensional data ๏ฝ Conclusion
  • 3. โ€ข Spatial data represent the location ,size and shape of an object on earth โ€ข Ex. Building, lake
  • 4. Point data: Line data: polygon data:
  • 5. ๏ฝ Point Data: ๏ฝ Simplest form of representing spatial data ๏ฝ No space and has no associated area or volume ๏ฝ Consists of collection of points ๏ฝ Ex. Raster data
  • 6. ๏ฝ Region data: ๏ฝ Has spatial extend with location and boundaries ๏ฝ Represented using of points, line, polygons ๏ฝ Ex. roads, rivers: line data
  • 7.
  • 8. 1) Spatial range queries: Related with region data Ex. โ€œFind all cities within 50 miles of Puneโ€ 2)Nearest Neighbor queries: Related with point data Ex. โ€œFind 10 cities nearest to Puneโ€ In ordered cities Use in multimedia database
  • 9. 3) Spatial Join Queries: - use both point and region data. - Ex. โ€œFind pairs of cities within 200 miles of each other AND โ€œ Find all cities near a lakeโ€ - More complex - Expensive to evaluate
  • 10. 1) Geographic Information System(GIS) Ex. MAP 2) Computer Aided Design/Manufacturing(CAD/CAM) Ex. Surface of design object Range and Spatial join queries used 3) Multimedia Database video, audio, image, text also required spatial data Nearest neighbor queries and point data
  • 11. Point Data: Grid files, แธฅE trees, Kdtrees, point quad trees Region data: Quad trees, R trees, SKD trees, -Yet no best indexing technique - R trees are commonly used : due to simplicity, ability to handle both data performance to complex queries
  • 12. Three main indexing techniques : ๏ฝ Region Quad-Trees and Z-Ordering โ€“ handle both point and region data ๏ฝ Grid Files โ€“ only point data ๏ฝ R-Trees โ€“ handle both point and region data
  • 13. ๏ฝ Z-ordering gives us a way to group points according to spatial proximity. ๏ฝ Consider X-01 and Y-11 ๏ฝ Z-value is 0111 by interleaving X and Y values. This gives us the value for the point 7. Space filling curves
  • 14. The Region Quad tree structure corresponds directly to the recursive decomposition of the data space. Each node in the tree corresponds to a square-shaped region of the data space.
  • 15. ๏ฝ Grid files rely upon a grid directory to identify the data page containing a desired point. ๏ฝ The Grid file partitions space into rectangular regions using lines that are parallel to the axes. ๏ฝ If the X axis is cut into i segments and the Y axis is cut into j segments, we have a total of i x j partitions. The grid directory is an i by j array with one entry per partition. ๏ฝ This description is maintained in an array called a linear scale; there is one linear scale per axis.
  • 16. Searching for a point in a grid file
  • 17. Inserting points in a Grid File
  • 18. ๏ฝ Adaptation of B+ Tree ๏ฝ Height-balanced data structure ๏ฝ Search key values are referred to as Bounding Boxes ๏ฝ A data entry consists of a pair (n-dimensional box, Rid) ๏ฝ Rid โ€“ object Identifier ๏ฝ N-dimensional box is the smallest box that contains the object
  • 20. ๏ฝ Search for Objects Overlapping Box Q Start at root. 1. If current node is non-leaf, for each entry <E, ptr>, if box E overlaps Q, search subtree identified by ptr. 2. If current node is leaf, for each entry <E, rid>, if E overlaps Q, rid identifies an object that might overlap Q.
  • 21. Insert Entry <B, ptr> ๏ฝ Start at root and go down to โ€œbest-fitโ€ leaf L. ๏ฝ Go to child whose box needs least enlargement to cover B; resolve ties by going to smallest area child. ๏ฝ If best-fit leaf L has space, insert entry and stop. Otherwise, split L into L1 and L2. ๏ฝ Adjust entry for L in its parent so that the box now covers (only) L1. ๏ฝ Add an entry (in the parent node of L) for L2. (This could cause the parent node to recursively split.)
  • 22. Region Quad Trees Grid Files(point data) R-Trees Range Queries Easily handled Easily handled for point data. Handled by calculating bounding box Nearest Neighbour Queries Can be handled. Sometimes tricky due to long diagonal jumps Easily handled for point data. Handled well by traversing for the point or region Spatial Joins Can be handled with some extension to range queries Easily handled for point data. Handled very well
  • 23. ๏ฝ The Generalized Search Tree (GiST) abstracts the โ€œtreeโ€ nature of a class of indexes including B+ trees and R-tree variants. ๏ฝ Striking similarities in insert/delete/search and even concurrency control algorithms make it possible to provide โ€œtemplatesโ€ for these algorithms that can be customized to obtain the many different tree index structures. ๏ฝ GiST provides an alternative for implementing other tree indexes in an ORDBMS.
  • 24. ๏ฝ Typically, high-dimensional datasets are collections of points, not regions. ๏ฝ E.g., Feature vectors in multimedia applications. ๏ฝ Very sparse ๏ฝ Nearest neighbor queries are common. ๏ฝ R-tree becomes worse than sequential scan for most datasets with more than a dozen dimensions. ๏ฝ As dimensionality increases contrast (ratio of distances between nearest and farthest points) usually decreases; โ€œnearest neighborโ€ is not meaningful.
  • 25. ๏ฝ Spatial data management has many applications, including GIS, CAD/CAM, multimedia indexing, Point and region data ๏ฝ R-tree approach is widely used in GIS systems ๏ฝ Used in spatial data mining approaches. ๏ฝ Popular SDBMS : MySQL(geometry datatype), Neo4j, AllegroGraph, SpaceBase, CouchDB, PostGreSQL, SpatialDB
  • 26. ๏ฝ โ€œDatabase Management Systemsโ€ by Raghu Ramakrishnan, 3rd Edition ๏ฝ www.techopedia.com/definition ๏ฝ dna.fernuni-hagen.de/IntroSpatialDBMS ๏ฝ www.geol-amu.org/notes