This document discusses dashDB, a cloud data warehouse service from IBM that provides fully managed analytics capabilities including geospatial support. Key features highlighted include:
- In-database analytics built on DB2 BLU and Netezza technologies for fast performance.
- Integrated R analytics and predictive modeling capabilities.
- Support for geospatial data and analytics using Open Geospatial Consortium standards.
- Plans available as fully managed cloud service, on-premises, or hybrid architectures.
11. Enterprise MPP
• Dedicated, single tenant environment
• Bare metal
• 3+ node clusters
• 24 cores per node
• 256 GB memory per node
• SSD storage (for about 4 TB of
preload data per node)
• $5410 / month (USD) per node
Entry
• Shared, multi-tenant
environment
• 20 GB SAN storage
capacity
• Freemium: < 1 GB of
raw data is free
• $50 / month (USD)
flat rate
Enterprise - 1TB
• Dedicated, single
tenant environment
• Virtual environment
• 16 cores
• 64 GB memory
• SAN storage (for about
1 TB of preload data)
• $1170 / month (USD)
Enterprise - 4
TB
• Dedicated, single
tenant environment
• Bare metal
• 32 cores
• 256 GB memory
• SAN storage (for about
4 TB of preload data)
• $4700 / month (USD)
Enterprise - 12
TB
• Dedicated, single
tenant environment
• Bare metal
• 32 cores
• 256 GB memory
• SAN storage (for about
12 TB of preload data)
• $7370 / month (USD)
dashDB Plans Suit a Variety of Needs
12. Geospatial Analytics In dashDB
Implements OGC SFS & ISO SQL/MM part 3 standards for spatial
See http://www.iso.org/iso/catalogue_detail.htm?csnumber=38651
Spatial data type ST_GEOMETRY (hierarchy)
Enables spatial operations (e.g. joins) in database through spatial
operators available as user defined functions
Dedicated support in ESRI tools starting V 10.3
dashDB - R support through extension to ibmdbR package
13. Spatial Functions & Predicates in dashDB
SELECT a.name, a.type
FROM highways a, floodzones b
WHERE ST_Intersects(a.location,b.location) = 1
AND b.last_flood > 1950
SELECT a.road_id, a.time, i.id,
ST_Distance(a.loc, i.loc,’METER’) as distance
FROM accidents a, intersections i
WHERE ST_Distance(a.loc,i.loc,’METER’) < 10000
AND a.weather = ‘RAIN’
- accidents near intersections
- highways in flood zones
ST_Distance(g1,g2)
?
ST_Intersects(g1,g2)
?
14. Spatial Constructor Functions
ST_Point(x, y, srs_id) – create point at this location
ST_Point(‘POINT (-121.5, 37.2)’, 1)
ST_Linestring(‘LINESTRING (-121.5 37.2,-121.7 37.1)’,1)
ST_Polygon(CAST (? AS CLOB(1M)),1)
For host variable containing well-known text, well-known binary, or shape representation
18. Harness the Full Power of SQL
Outer join
Common table expressions
Recursive queries, sub-queries
Aggregate functions
Order by, group by, having clauses
OLAP, XML, and more ...
WITH sdStores AS (SELECT * FROM stores
WHERE st_within(location, :sandiego) = 1)
SELECT s.id, s.name, AVG(h.income) FROM houseHolds h, sdStores s
WHERE st_intersects(s.zone, h.location) = 1
GROUP BY s.id, s.name
ORDER BY s.name
Example problem: Determine the average household income for the sales zone of
each store in the San Diego area.