SlideShare a Scribd company logo
University of Minnesota 
GeoJinni 
Spatial Data processing with Hadoop 
http://spatialhadoop.cs.umn.edu/ 
@spatialhadoop 
Ahmed Eldawy
Claudius Ptolemy (AD 90 – AD 168)
Al Idrisi (1099–1165)
Cholera cases in the London epidemic of 1854
Cool technology..!! 
Can I use it in my 
application? 
Oh..!! But, it is not 
made for me. Can’t 
make use of it as is 
My pleasure. Here 
it is..
Kindly let me get 
the technology 
you have 
Kindly let me 
understand your needs
HELP..!! I have too 
much data. Your 
technology is not 
helping me 
mmm…Let me 
check with my good 
friends there. 
My pleasure. Here 
it is.. 
Cool DBMS 
technology..!! 
Can I use it in my 
application? 
Oh..!! But, it is not 
made for me. Can’t 
make use of it as is
Kindly let me 
understand your needs 
Kindly let me 
get the 
technology you 
have
Let me check with 
my other good 
friends there. 
HELP..!! Again, I have 
too much data. Your 
technology is not 
helping me 
Cool MapReduce technology..!! 
Can I use it in my application? 
Sorry, seems like the 
DBMS technology 
cannot scale more 
My pleasure. Here 
Oh..!! But, it is not 
made for me. Can’t 
make use of it as is 
it is..
Kindly let me 
understand your needs 
Kindly let me 
get the 
technology you 
have
Kindly let me 
understand your needs 
Kindly let me 
get the 
technology you 
have 
aka 
GeoJinni
VGI Sensor networks 
27 
Tons of Spatial data out there… 
Smart phones Satellite Images 
Medical data 
Traffic data 
Geotagged Microblogs 
Geotagged pictures
GeoJinni 
Website: http://spatialhadoop.cs.umn.edu/ 
Download source code, binary distribution, and instructions 
Email us at: shadoop@cs.umn.edu 
■ Released in March 2013; 75,000 downloads since then 
Spatial language Built-in spatial data types 
28 
Spatial Indexes Spatial Operations
User Programs 
Pig 
Latin 
Hadoop 
Java APIS 
Job Monitoring and 
29 
The Built-in Approach of GeoJinni 
Spatial Modules 
User Programs 
Pig 
Latin 
Hadoop 
Java APIS 
Job Monitoring and 
Scheduling 
MapReduce 
Runtime 
Storage (HDFS) 
(Spatial) 
User Program 
+ 
MapReduce 
APIs 
+ 
Job Monitoring 
and Scheduling 
+ 
MapReduce 
Runtime 
+ 
Storage 
+ 
… 
Scheduling 
MapReduce 
Runtime 
Storage (HDFS) 
Spatial 
Language 
Spatial 
Operators 
Early 
Pruning 
Spatial 
Indexing 
The On-top 
Approach 
From Scratch 
Approach 
The Built-in Approach 
(GeoJinni)
30 
Spatial Data & Hadoop 
Spatial Data Hadoop 
points = LOAD ’points’ AS 
(id:int, x:int, y:int); 
result = FILTER points BY 
x < xmax AND x >= xmin AND 
y < ymax AND y >= ymin; 
Takes 193 seconds 
 GeoJinni 
GeoJinni 
points = LOAD ’points’ AS 
(id:int, location:point); 
result = FILTER points BY 
IsOverlap(location, rectangle 
(xmin, ymin, xmax, ymax)); 
Finishes in 2 seconds
31 
GeoJinni Architecture 
Applications: MNTG [SSTD’13, ICDE’14] 
SHAHED [ICDE’15] – TAREEG [SIGMOD’14, SIGSPATIAL’14] 
Spatio-temporal Hadoop 
Language: Pigeon [ICDE’14] 
Operations: Basic [VLDB’13] – CG_Hadoop [SIGSPATIAL’13] 
Data Mining – Visualization [Under submission] 
MapReduce: Spatial File Splitter – Spatial Record Reader 
Indexing: Grid File – R-tree – R+-tree [ICDE’15]
32 
Language Layer: Pigeon 
■ Extends Pig Latin with OGC-compliant primitives 
 Spatial data types (e.g., Polygon) 
 Basic operations (e.g., Area) 
 Spatial predicates (e.g., Touches) 
 Spatial analysis (e.g., Union) 
 Spatial aggregate functions (e.g., Convex Hull) 
cities = LOAD ’cities’ 
AS (city_id: int, city_geom); 
City_area = FOREACH cities 
GENERATE Area(city_geom) AS area; 
A. Eldawy and M. F. Mokbel. Pigeon: A Spatial MapReduce Language. In ICDE, 2014
33 
Indexing Layer: R+-tree
34 
Indexing Layer: Grid File
35 
Non-indexed Heap File
36 
Range Query 
SpatialFileSplitter 
prunes blocks 
outside the query 
range 
SpatialRecordReader 
passes local indexes 
to the map function 
Map function selects 
records in range
37 
CG_Hadoop 
■ Make use of GeoJinni to speedup 
computational geometry algorithms 
 Polygon union, Skyline, Convex Hull, 
Farthest/Closest Pair 
■ Single machine implementation 
 E.g., Skyline of 4 billion points takes three hours 
■ Straight forward implementation in Hadoop 
 Hadoop parallel execution 
■ More efficient implementation 
in GeoJinni 
 Spatial indexing 
 Early pruning 
■ Free open source as part of GeoJinni 
Single 
Machine Hadoop 
GeoJinni 
29x 
260x 
1x
38 
Convex Hull 
Find the minimal convex polygon that contains all points 
Input Output
39 
Convex Hull in CG_Hadoop 
Hadoop CG_Hadoop 
Partition 
Pruning 
Local hull 
Global hull
40 
Map rendering 
■ Map rendering creates an image that represents the 
data 
■ Visualization is an international language 
■ Can reveal patterns that are otherwise hard to spot 
■ The visual system occupies about one third of the 
human brain 
210 LINESTRING (-2.3634904 51.3845649, -2.3634254 51.3843983, - 
2.3631927 51.3838436) [highway#primary,ref#A4,name#Gay Street] 
420 LINESTRING (-1.8230973 52.5541131, -1.8230368 52.5540756, - 
1.8229324 52.5540109, -1.8227961 52.5539014, -1.8227365 52.5538461, - 
1.8226952 52.5538058, -1.8226204 52.5537103, -1.8223988 52.5534041, - 
1.8221814 52.5531498, -1.8218478 52.5528188, -1.8215581 52.5525626, - 
1.8213525 52.5524042) [source#GPS 
Survey,highway#residential,postal_code#B72,name#Moss 
Drive,is_in#Sutton Coldfield,maxspeed#30,abutters#residential] 
490 LINESTRING (-0.1896508 51.6456414, -0.1895803 51.6456036, - 
0.1895245 51.645551, -0.1890055 51.6450801, -0.1887808 51.6448764, - 
0.1885605 51.6446756, -0.1883084 51.6443753, -0.1875496 51.6433375, - 
0.1864572 51.6415288, -0.1862165 51.6411939, -0.1859495 51.6406583, - 
0.1858855 51.6405461) [lit#yes,surface#asphalt,maxspeed#30 
mph,highway#residential,abutters#residential,name#Sherrards Way] 
770 LINESTRING (-1.8184653 52.5723683, -1.8182353 52.5723576, - 
…
41 
Smoothing 
Input Buffer 
Only 
Buffer + 
Merge
42 
Multi-level Image 
■ Many images at 
different zoom 
levels 
 Pan 
 Zoom in/out 
 Fly to 
■ More details as 
the zoom level 
increases
43 
MNTG - World-wide traffic generator 
for road networks 
http://mntg.cs.umn.edu/ 
M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. 
Yackel. MNTG: An Extensible Web-based Traffic Generator. In SSTD, 2013
44 
SHAHED – A tool for querying and 
visualizing spatio-temporal satellite data 
http://shahed.cs.umn.edu/ 
"SHAHED: A MapReduce-based System for Querying and Visualizing Spatio-temporal 
Satellite Data“, Ahmed Eldawy et al, ICDE 2015
45 
World Temperature
46 
Smooth World Temperature
47 
World Heat Map on Google Earth
48 
TAREEG – Web-based extractor for 
OpenStreetMap data using MapReduce 
http://tareeg.net/ 
L. Alarabi, A. Eldawy, R. Alghamdi, and M. F. Mokbel. TAREEG: A MapReduce-Based 
Web Service for Extracting Spatial Data from OpenStreetMap. In SIGMOD, 2014
49 
Extracted Road Network
GeoJinni 
Analyze your spatial data efficiently 
50 
Built-in spatial data types 
Spatial high level language 
Efficient Spatial Operations 
Language 
Data types 
Spatial Indexes 
Indexes Operations 
Analyze Datasets your are organized data on large efficiently clusters using with spatial built-in indexes 
spatial 
operations that runs efficiently using spatial indexes 
Interact Have with all your the system spatial and datasets express ready your to queries load in 
in a 
simple SpatialHadoop (Grid Website: high or level R-tree) http://language with that spatialhadoop.the are with built-adapted built-in spatial cs.to in umn.MapReduce 
spatial data edu/ 
support 
types 
Download source code, binary distribution, and instructions 
Email us at: shadoop@cs.umn.edu

More Related Content

What's hot

Map Reduce introduction
Map Reduce introductionMap Reduce introduction
Map Reduce introduction
Muralidharan Deenathayalan
 
Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map Reduce
Apache Apex
 
Analysing of big data using map reduce
Analysing of big data using map reduceAnalysing of big data using map reduce
Analysing of big data using map reduce
Paladion Networks
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
DataWorks Summit
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
Michel Bruley
 
Working with Scientific Data in MATLAB
Working with Scientific Data in MATLABWorking with Scientific Data in MATLAB
Working with Scientific Data in MATLAB
The HDF-EOS Tools and Information Center
 
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTechGeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
Rob Emanuele
 
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Alexey Zinoviev
 
Apache Hadoop Big Data Technology
Apache Hadoop Big Data TechnologyApache Hadoop Big Data Technology
Apache Hadoop Big Data Technology
Jay Nagar
 
Scaling Storage and Computation with Hadoop
Scaling Storage and Computation with HadoopScaling Storage and Computation with Hadoop
Scaling Storage and Computation with Hadoop
yaevents
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
Vigen Sahakyan
 
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other OptimizationsMastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
scottcrespo
 
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014soujavajug
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
npinto
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
Carol McDonald
 
MapReduce Algorithm Design
MapReduce Algorithm DesignMapReduce Algorithm Design
MapReduce Algorithm Design
Gabriela Agustini
 
Extending Hadoop for Fun & Profit
Extending Hadoop for Fun & ProfitExtending Hadoop for Fun & Profit
Extending Hadoop for Fun & Profit
Milind Bhandarkar
 
Hadoop eco system-first class
Hadoop eco system-first classHadoop eco system-first class
Hadoop eco system-first classalogarg
 
Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)
PyData
 

What's hot (20)

Map Reduce introduction
Map Reduce introductionMap Reduce introduction
Map Reduce introduction
 
Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map Reduce
 
Analysing of big data using map reduce
Analysing of big data using map reduceAnalysing of big data using map reduce
Analysing of big data using map reduce
 
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons LearnedHadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Working with Scientific Data in MATLAB
Working with Scientific Data in MATLABWorking with Scientific Data in MATLAB
Working with Scientific Data in MATLAB
 
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTechGeoSpatially enabling your Spark and Accumulo clusters with LocationTech
GeoSpatially enabling your Spark and Accumulo clusters with LocationTech
 
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
Thorny path to the Large-Scale Graph Processing (Highload++, 2014)
 
Apache Hadoop Big Data Technology
Apache Hadoop Big Data TechnologyApache Hadoop Big Data Technology
Apache Hadoop Big Data Technology
 
Scaling Storage and Computation with Hadoop
Scaling Storage and Computation with HadoopScaling Storage and Computation with Hadoop
Scaling Storage and Computation with Hadoop
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other OptimizationsMastering Hadoop Map Reduce - Custom Types and Other Optimizations
Mastering Hadoop Map Reduce - Custom Types and Other Optimizations
 
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
Hadoop - Introduction to map reduce programming - Reunião 12/04/2014
 
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
[Harvard CS264] 08b - MapReduce and Hadoop (Zak Stone, Harvard)
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
MapReduce Algorithm Design
MapReduce Algorithm DesignMapReduce Algorithm Design
MapReduce Algorithm Design
 
Extending Hadoop for Fun & Profit
Extending Hadoop for Fun & ProfitExtending Hadoop for Fun & Profit
Extending Hadoop for Fun & Profit
 
Pig Experience
Pig ExperiencePig Experience
Pig Experience
 
Hadoop eco system-first class
Hadoop eco system-first classHadoop eco system-first class
Hadoop eco system-first class
 
Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)Python in an Evolving Enterprise System (PyData SV 2013)
Python in an Evolving Enterprise System (PyData SV 2013)
 

Viewers also liked

Sept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical IntroductionSept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical Introduction
Adam Muise
 
Geo-based content processing using hbase
Geo-based content processing using hbaseGeo-based content processing using hbase
Geo-based content processing using hbase
Ravi Veeramachaneni
 
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...nishimurashoji
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterAbhishek Sagar
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0
Matt Turck
 
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913Allwyn Antony
 
Spatial Indexing
Spatial IndexingSpatial Indexing
Spatial Indexing
torp42
 
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQLThe DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
torp42
 
VO Course 11: Spatial indexing
VO Course 11: Spatial indexingVO Course 11: Spatial indexing
VO Course 11: Spatial indexing
Joint ALMA Observatory
 
SQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial IndexingSQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial Indexing
Michael Rys
 
European creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshopEuropean creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshop
Outliers Collective
 
Recasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) DataRecasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) Data
Merck
 
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial DataBecoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
Patrick Stotz
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Rob Emanuele
 
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
nacis_slides
 
Application of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineeringApplication of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineering
Engr Habib ur Rehman
 
a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application ResourcesDataWorks Summit
 
Improving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceImproving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceDataWorks Summit
 
Map reduce: beyond word count
Map reduce: beyond word countMap reduce: beyond word count
Map reduce: beyond word count
Jeff Patti
 
How to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsHow to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsDataWorks Summit
 

Viewers also liked (20)

Sept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical IntroductionSept 17 2013 - THUG - HBase a Technical Introduction
Sept 17 2013 - THUG - HBase a Technical Introduction
 
Geo-based content processing using hbase
Geo-based content processing using hbaseGeo-based content processing using hbase
Geo-based content processing using hbase
 
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware...
 
Computation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop ClusterComputation of spatial data on Hadoop Cluster
Computation of spatial data on Hadoop Cluster
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0
 
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
AECOsim_Building_Designer_LTR_LoRes_SinglePgs_F_913
 
Spatial Indexing
Spatial IndexingSpatial Indexing
Spatial Indexing
 
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQLThe DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
The DE-9IM Matrix in Details using ST_Relate: In Picture and SQL
 
VO Course 11: Spatial indexing
VO Course 11: Spatial indexingVO Course 11: Spatial indexing
VO Course 11: Spatial indexing
 
SQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial IndexingSQLBits X SQL Server 2012 Spatial Indexing
SQLBits X SQL Server 2012 Spatial Indexing
 
European creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshopEuropean creativity festival 2014: DataViz workshop
European creativity festival 2014: DataViz workshop
 
Recasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) DataRecasting the Role of Big (or Little) Data
Recasting the Role of Big (or Little) Data
 
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial DataBecoming a Smarter City by Analyzing & Visualizing Spatial Data
Becoming a Smarter City by Analyzing & Visualizing Spatial Data
 
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projectsEnabling Access to Big Geospatial Data with LocationTech and Apache projects
Enabling Access to Big Geospatial Data with LocationTech and Apache projects
 
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
Where Do We Put It All? Lessons Learned Housing Large Geospatial Data Collect...
 
Application of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineeringApplication of probability in daily life and in civil engineering
Application of probability in daily life and in civil engineering
 
a Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resourcesa Secure Public Cache for YARN Application Resources
a Secure Public Cache for YARN Application Resources
 
Improving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of ServiceImproving HDFS Availability with IPC Quality of Service
Improving HDFS Availability with IPC Quality of Service
 
Map reduce: beyond word count
Map reduce: beyond word countMap reduce: beyond word count
Map reduce: beyond word count
 
How to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and AnalyticsHow to use Parquet as a Sasis for ETL and Analytics
How to use Parquet as a Sasis for ETL and Analytics
 

Similar to Spatial Data processing with Hadoop

How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1wang yaohui
 
Large Scale Geo Processing on Hadoop
Large Scale Geo Processing on HadoopLarge Scale Geo Processing on Hadoop
Large Scale Geo Processing on Hadoop
Christoph Körner
 
Visualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographicsVisualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographics
Dr Muhammad Adnan
 
Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in Robotics
Prabindh Sundareson
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street data
Fabion Kauker
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
IOSR Journals
 
mini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptxmini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptx
tusharpawar803067
 
Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)
Ranel Padon
 
Field Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesField Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service Technologies
Niroshan Sanjaya
 
Walking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial WorldWalking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial World
ICIMOD
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial data
Kudos S.A.S
 
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
Si Chen
 
Introduction of MAGO3D
Introduction of MAGO3DIntroduction of MAGO3D
Introduction of MAGO3D
SANGHEE SHIN
 
Geolocation in Drupal
Geolocation in DrupalGeolocation in Drupal
Geolocation in Drupal
Mediacurrent
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open data
Twinbit
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open data
DrupalDay
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
Joe Larson
 
Web Mapping with Drupal
Web Mapping with DrupalWeb Mapping with Drupal
Web Mapping with DrupalRanel Padon
 
Arnold webuquerque20110302
Arnold webuquerque20110302Arnold webuquerque20110302
Arnold webuquerque20110302
lisaarn
 

Similar to Spatial Data processing with Hadoop (20)

How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1
 
Large Scale Geo Processing on Hadoop
Large Scale Geo Processing on HadoopLarge Scale Geo Processing on Hadoop
Large Scale Geo Processing on Hadoop
 
Visualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographicsVisualising large spatial databases and Building bespoke geodemographics
Visualising large spatial databases and Building bespoke geodemographics
 
Synthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in RoboticsSynthetic Data and Graphics Techniques in Robotics
Synthetic Data and Graphics Techniques in Robotics
 
Euro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street dataEuro30 2019 - Benchmarking tree approaches on street data
Euro30 2019 - Benchmarking tree approaches on street data
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
 
mini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptxmini project_shortest path visualizer.pptx
mini project_shortest path visualizer.pptx
 
Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)Of Nodes and Maps (Web Mapping with Drupal - Part II)
Of Nodes and Maps (Web Mapping with Drupal - Part II)
 
Field Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesField Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service Technologies
 
Walking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial WorldWalking in the Cloud: A New Paradigm in Geospatial World
Walking in the Cloud: A New Paradigm in Geospatial World
 
Using python to analyze spatial data
Using python to analyze spatial dataUsing python to analyze spatial data
Using python to analyze spatial data
 
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
CrowdMap: Accurate Reconstruction of Indoor Floor Plan from Crowdsourced Sens...
 
Introduction of MAGO3D
Introduction of MAGO3DIntroduction of MAGO3D
Introduction of MAGO3D
 
Geolocation in Drupal
Geolocation in DrupalGeolocation in Drupal
Geolocation in Drupal
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open data
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open data
 
Open Source GIS
Open Source GISOpen Source GIS
Open Source GIS
 
Web Mapping with Drupal
Web Mapping with DrupalWeb Mapping with Drupal
Web Mapping with Drupal
 
Arnold webuquerque20110302
Arnold webuquerque20110302Arnold webuquerque20110302
Arnold webuquerque20110302
 
1 mapreduce-fest
1 mapreduce-fest1 mapreduce-fest
1 mapreduce-fest
 

More from VisionGEOMATIQUE2014

Géomatique appliquée : revue des solutions novatrices mises en place en 2014
Géomatique appliquée : revue des solutions novatrices mises en place en 2014Géomatique appliquée : revue des solutions novatrices mises en place en 2014
Géomatique appliquée : revue des solutions novatrices mises en place en 2014
VisionGEOMATIQUE2014
 
Indoor location with the Bluetooth Low Energy standard
Indoor location with the Bluetooth Low Energy standardIndoor location with the Bluetooth Low Energy standard
Indoor location with the Bluetooth Low Energy standard
VisionGEOMATIQUE2014
 
ScribeUI: La productivité avec MapServer
ScribeUI: La productivité avec MapServerScribeUI: La productivité avec MapServer
ScribeUI: La productivité avec MapServer
VisionGEOMATIQUE2014
 
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellisFast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
VisionGEOMATIQUE2014
 
OpenGL ES pour le développement d’applications géospatiales sur Android
OpenGL ES pour le développement d’applications géospatiales sur AndroidOpenGL ES pour le développement d’applications géospatiales sur Android
OpenGL ES pour le développement d’applications géospatiales sur Android
VisionGEOMATIQUE2014
 
Accès ouvert aux données météorologiques d’Environnement Canada
Accès ouvert aux données météorologiques d’Environnement CanadaAccès ouvert aux données météorologiques d’Environnement Canada
Accès ouvert aux données météorologiques d’Environnement Canada
VisionGEOMATIQUE2014
 
LocationTech Data Commons
LocationTech Data CommonsLocationTech Data Commons
LocationTech Data Commons
VisionGEOMATIQUE2014
 
TDW FOSS GEO-STACK FOR MINERAL EXPLORATION
TDW FOSS GEO-STACK FOR MINERAL EXPLORATIONTDW FOSS GEO-STACK FOR MINERAL EXPLORATION
TDW FOSS GEO-STACK FOR MINERAL EXPLORATION
VisionGEOMATIQUE2014
 
Solution Geoctopus : améliorations et défis
Solution Geoctopus : améliorations et défisSolution Geoctopus : améliorations et défis
Solution Geoctopus : améliorations et défis
VisionGEOMATIQUE2014
 
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
VisionGEOMATIQUE2014
 
GeoMesa: Scalable Geospatial Analytics
GeoMesa:  Scalable Geospatial AnalyticsGeoMesa:  Scalable Geospatial Analytics
GeoMesa: Scalable Geospatial Analytics
VisionGEOMATIQUE2014
 
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
VisionGEOMATIQUE2014
 
Automatisation de la cartographie et de l'analyse des données de comptage de ...
Automatisation de la cartographie et de l'analyse des données de comptage de ...Automatisation de la cartographie et de l'analyse des données de comptage de ...
Automatisation de la cartographie et de l'analyse des données de comptage de ...
VisionGEOMATIQUE2014
 
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORINGMACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
VisionGEOMATIQUE2014
 
Les contributions de la géomatique au développement de la ville intelligente
Les contributions de la géomatique au développement de la ville intelligenteLes contributions de la géomatique au développement de la ville intelligente
Les contributions de la géomatique au développement de la ville intelligente
VisionGEOMATIQUE2014
 
SIGim la plateforme adaptée à la gestion municipale
SIGim la plateforme adaptée à la gestion municipaleSIGim la plateforme adaptée à la gestion municipale
SIGim la plateforme adaptée à la gestion municipale
VisionGEOMATIQUE2014
 
Optimisation et analyse des parcours de déneigement à la Ville de Shawinigan
Optimisation et analyse des parcours de déneigement à la Ville de ShawiniganOptimisation et analyse des parcours de déneigement à la Ville de Shawinigan
Optimisation et analyse des parcours de déneigement à la Ville de Shawinigan
VisionGEOMATIQUE2014
 
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
VisionGEOMATIQUE2014
 
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
VisionGEOMATIQUE2014
 
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
VisionGEOMATIQUE2014
 

More from VisionGEOMATIQUE2014 (20)

Géomatique appliquée : revue des solutions novatrices mises en place en 2014
Géomatique appliquée : revue des solutions novatrices mises en place en 2014Géomatique appliquée : revue des solutions novatrices mises en place en 2014
Géomatique appliquée : revue des solutions novatrices mises en place en 2014
 
Indoor location with the Bluetooth Low Energy standard
Indoor location with the Bluetooth Low Energy standardIndoor location with the Bluetooth Low Energy standard
Indoor location with the Bluetooth Low Energy standard
 
ScribeUI: La productivité avec MapServer
ScribeUI: La productivité avec MapServerScribeUI: La productivité avec MapServer
ScribeUI: La productivité avec MapServer
 
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellisFast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
Fast, Distributed Geoprocessing with Scala, Spark and GeoTrellis
 
OpenGL ES pour le développement d’applications géospatiales sur Android
OpenGL ES pour le développement d’applications géospatiales sur AndroidOpenGL ES pour le développement d’applications géospatiales sur Android
OpenGL ES pour le développement d’applications géospatiales sur Android
 
Accès ouvert aux données météorologiques d’Environnement Canada
Accès ouvert aux données météorologiques d’Environnement CanadaAccès ouvert aux données météorologiques d’Environnement Canada
Accès ouvert aux données météorologiques d’Environnement Canada
 
LocationTech Data Commons
LocationTech Data CommonsLocationTech Data Commons
LocationTech Data Commons
 
TDW FOSS GEO-STACK FOR MINERAL EXPLORATION
TDW FOSS GEO-STACK FOR MINERAL EXPLORATIONTDW FOSS GEO-STACK FOR MINERAL EXPLORATION
TDW FOSS GEO-STACK FOR MINERAL EXPLORATION
 
Solution Geoctopus : améliorations et défis
Solution Geoctopus : améliorations et défisSolution Geoctopus : améliorations et défis
Solution Geoctopus : améliorations et défis
 
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
Infrastructure de géomatique ouverte (IGO) : un modèle inspirant de développe...
 
GeoMesa: Scalable Geospatial Analytics
GeoMesa:  Scalable Geospatial AnalyticsGeoMesa:  Scalable Geospatial Analytics
GeoMesa: Scalable Geospatial Analytics
 
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
Montrajet.ca : une solution multimodale de covoiturage et de planification d'...
 
Automatisation de la cartographie et de l'analyse des données de comptage de ...
Automatisation de la cartographie et de l'analyse des données de comptage de ...Automatisation de la cartographie et de l'analyse des données de comptage de ...
Automatisation de la cartographie et de l'analyse des données de comptage de ...
 
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORINGMACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
 
Les contributions de la géomatique au développement de la ville intelligente
Les contributions de la géomatique au développement de la ville intelligenteLes contributions de la géomatique au développement de la ville intelligente
Les contributions de la géomatique au développement de la ville intelligente
 
SIGim la plateforme adaptée à la gestion municipale
SIGim la plateforme adaptée à la gestion municipaleSIGim la plateforme adaptée à la gestion municipale
SIGim la plateforme adaptée à la gestion municipale
 
Optimisation et analyse des parcours de déneigement à la Ville de Shawinigan
Optimisation et analyse des parcours de déneigement à la Ville de ShawiniganOptimisation et analyse des parcours de déneigement à la Ville de Shawinigan
Optimisation et analyse des parcours de déneigement à la Ville de Shawinigan
 
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
AutoTri, une application automatisant l’analyse du stationnement de l’arrondi...
 
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
Requirements for Geospatial Agent Simulation to Strengthen the 'Property-Powe...
 
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
JMap 6.0 : une solution complète et évolutive pour l'intégration, la diffusio...
 

Recently uploaded

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 

Recently uploaded (20)

Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 

Spatial Data processing with Hadoop

  • 1. University of Minnesota GeoJinni Spatial Data processing with Hadoop http://spatialhadoop.cs.umn.edu/ @spatialhadoop Ahmed Eldawy
  • 2.
  • 3. Claudius Ptolemy (AD 90 – AD 168)
  • 5.
  • 6.
  • 7.
  • 8. Cholera cases in the London epidemic of 1854
  • 9.
  • 10.
  • 11.
  • 12. Cool technology..!! Can I use it in my application? Oh..!! But, it is not made for me. Can’t make use of it as is My pleasure. Here it is..
  • 13.
  • 14. Kindly let me get the technology you have Kindly let me understand your needs
  • 15.
  • 16. HELP..!! I have too much data. Your technology is not helping me mmm…Let me check with my good friends there. My pleasure. Here it is.. Cool DBMS technology..!! Can I use it in my application? Oh..!! But, it is not made for me. Can’t make use of it as is
  • 17.
  • 18. Kindly let me understand your needs Kindly let me get the technology you have
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. Let me check with my other good friends there. HELP..!! Again, I have too much data. Your technology is not helping me Cool MapReduce technology..!! Can I use it in my application? Sorry, seems like the DBMS technology cannot scale more My pleasure. Here Oh..!! But, it is not made for me. Can’t make use of it as is it is..
  • 24.
  • 25. Kindly let me understand your needs Kindly let me get the technology you have
  • 26. Kindly let me understand your needs Kindly let me get the technology you have aka GeoJinni
  • 27. VGI Sensor networks 27 Tons of Spatial data out there… Smart phones Satellite Images Medical data Traffic data Geotagged Microblogs Geotagged pictures
  • 28. GeoJinni Website: http://spatialhadoop.cs.umn.edu/ Download source code, binary distribution, and instructions Email us at: shadoop@cs.umn.edu ■ Released in March 2013; 75,000 downloads since then Spatial language Built-in spatial data types 28 Spatial Indexes Spatial Operations
  • 29. User Programs Pig Latin Hadoop Java APIS Job Monitoring and 29 The Built-in Approach of GeoJinni Spatial Modules User Programs Pig Latin Hadoop Java APIS Job Monitoring and Scheduling MapReduce Runtime Storage (HDFS) (Spatial) User Program + MapReduce APIs + Job Monitoring and Scheduling + MapReduce Runtime + Storage + … Scheduling MapReduce Runtime Storage (HDFS) Spatial Language Spatial Operators Early Pruning Spatial Indexing The On-top Approach From Scratch Approach The Built-in Approach (GeoJinni)
  • 30. 30 Spatial Data & Hadoop Spatial Data Hadoop points = LOAD ’points’ AS (id:int, x:int, y:int); result = FILTER points BY x < xmax AND x >= xmin AND y < ymax AND y >= ymin; Takes 193 seconds  GeoJinni GeoJinni points = LOAD ’points’ AS (id:int, location:point); result = FILTER points BY IsOverlap(location, rectangle (xmin, ymin, xmax, ymax)); Finishes in 2 seconds
  • 31. 31 GeoJinni Architecture Applications: MNTG [SSTD’13, ICDE’14] SHAHED [ICDE’15] – TAREEG [SIGMOD’14, SIGSPATIAL’14] Spatio-temporal Hadoop Language: Pigeon [ICDE’14] Operations: Basic [VLDB’13] – CG_Hadoop [SIGSPATIAL’13] Data Mining – Visualization [Under submission] MapReduce: Spatial File Splitter – Spatial Record Reader Indexing: Grid File – R-tree – R+-tree [ICDE’15]
  • 32. 32 Language Layer: Pigeon ■ Extends Pig Latin with OGC-compliant primitives  Spatial data types (e.g., Polygon)  Basic operations (e.g., Area)  Spatial predicates (e.g., Touches)  Spatial analysis (e.g., Union)  Spatial aggregate functions (e.g., Convex Hull) cities = LOAD ’cities’ AS (city_id: int, city_geom); City_area = FOREACH cities GENERATE Area(city_geom) AS area; A. Eldawy and M. F. Mokbel. Pigeon: A Spatial MapReduce Language. In ICDE, 2014
  • 34. 34 Indexing Layer: Grid File
  • 36. 36 Range Query SpatialFileSplitter prunes blocks outside the query range SpatialRecordReader passes local indexes to the map function Map function selects records in range
  • 37. 37 CG_Hadoop ■ Make use of GeoJinni to speedup computational geometry algorithms  Polygon union, Skyline, Convex Hull, Farthest/Closest Pair ■ Single machine implementation  E.g., Skyline of 4 billion points takes three hours ■ Straight forward implementation in Hadoop  Hadoop parallel execution ■ More efficient implementation in GeoJinni  Spatial indexing  Early pruning ■ Free open source as part of GeoJinni Single Machine Hadoop GeoJinni 29x 260x 1x
  • 38. 38 Convex Hull Find the minimal convex polygon that contains all points Input Output
  • 39. 39 Convex Hull in CG_Hadoop Hadoop CG_Hadoop Partition Pruning Local hull Global hull
  • 40. 40 Map rendering ■ Map rendering creates an image that represents the data ■ Visualization is an international language ■ Can reveal patterns that are otherwise hard to spot ■ The visual system occupies about one third of the human brain 210 LINESTRING (-2.3634904 51.3845649, -2.3634254 51.3843983, - 2.3631927 51.3838436) [highway#primary,ref#A4,name#Gay Street] 420 LINESTRING (-1.8230973 52.5541131, -1.8230368 52.5540756, - 1.8229324 52.5540109, -1.8227961 52.5539014, -1.8227365 52.5538461, - 1.8226952 52.5538058, -1.8226204 52.5537103, -1.8223988 52.5534041, - 1.8221814 52.5531498, -1.8218478 52.5528188, -1.8215581 52.5525626, - 1.8213525 52.5524042) [source#GPS Survey,highway#residential,postal_code#B72,name#Moss Drive,is_in#Sutton Coldfield,maxspeed#30,abutters#residential] 490 LINESTRING (-0.1896508 51.6456414, -0.1895803 51.6456036, - 0.1895245 51.645551, -0.1890055 51.6450801, -0.1887808 51.6448764, - 0.1885605 51.6446756, -0.1883084 51.6443753, -0.1875496 51.6433375, - 0.1864572 51.6415288, -0.1862165 51.6411939, -0.1859495 51.6406583, - 0.1858855 51.6405461) [lit#yes,surface#asphalt,maxspeed#30 mph,highway#residential,abutters#residential,name#Sherrards Way] 770 LINESTRING (-1.8184653 52.5723683, -1.8182353 52.5723576, - …
  • 41. 41 Smoothing Input Buffer Only Buffer + Merge
  • 42. 42 Multi-level Image ■ Many images at different zoom levels  Pan  Zoom in/out  Fly to ■ More details as the zoom level increases
  • 43. 43 MNTG - World-wide traffic generator for road networks http://mntg.cs.umn.edu/ M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. Yackel. MNTG: An Extensible Web-based Traffic Generator. In SSTD, 2013
  • 44. 44 SHAHED – A tool for querying and visualizing spatio-temporal satellite data http://shahed.cs.umn.edu/ "SHAHED: A MapReduce-based System for Querying and Visualizing Spatio-temporal Satellite Data“, Ahmed Eldawy et al, ICDE 2015
  • 46. 46 Smooth World Temperature
  • 47. 47 World Heat Map on Google Earth
  • 48. 48 TAREEG – Web-based extractor for OpenStreetMap data using MapReduce http://tareeg.net/ L. Alarabi, A. Eldawy, R. Alghamdi, and M. F. Mokbel. TAREEG: A MapReduce-Based Web Service for Extracting Spatial Data from OpenStreetMap. In SIGMOD, 2014
  • 49. 49 Extracted Road Network
  • 50. GeoJinni Analyze your spatial data efficiently 50 Built-in spatial data types Spatial high level language Efficient Spatial Operations Language Data types Spatial Indexes Indexes Operations Analyze Datasets your are organized data on large efficiently clusters using with spatial built-in indexes spatial operations that runs efficiently using spatial indexes Interact Have with all your the system spatial and datasets express ready your to queries load in in a simple SpatialHadoop (Grid Website: high or level R-tree) http://language with that spatialhadoop.the are with built-adapted built-in spatial cs.to in umn.MapReduce spatial data edu/ support types Download source code, binary distribution, and instructions Email us at: shadoop@cs.umn.edu