SlideShare a Scribd company logo
1 of 20
Download to read offline
Department of Computer Science and Engineering - DISI
University of Bologna, Italy
{isam.aljawarneh3, luca.Foschini, paolo.bellavista,
antonio.corradi,Rebecca.montanari}@unibo.it,
Spatially Representative Online Big
Data Sampling for Smart Cities
Isam Mashhour Al Jawarneh, Luca Foschini ,
Paolo Bellavista, Antonio Corradi, Rebecca Montanari
IEEE CAMAD 2020
14-16 September Session 6 Next Generation Networks
Agenda
➢ Spatial Approximate Query Processing: Background and Motivations
➢ QoS-aware Spatial Sampling
○ Ex-SAOS spatial online sampling
○ Supported interactive spatial queries
➢ Ex-SAOS Deployment
○ Baselines
○ Testing setup
➢ Results and Discussion
○ Azure Cloud Spark Test
➢ Concluding remarks
Motivating Application Scenario: Smart
Cities
patient
Community
volunteers
A mixed-workload scenario requiring at least:
• Traffic Light Controller. Actuator decides to
change lights consistently for ambulance to
pass
• Smart Real-time Pathfinder. Interactive
navigation map for ambulances and other
vehicles
• Real-time Community Detector. Identify
volunteers' communities in the surroundings
of the patient
participatory healthcare
 geospatial queries (expensive!)
– Proximity queries
– Spatial join
– Spatial clustering
– Spatial geo-statistics.
– k-Nearest Neighborhoods
© 2020 Isam Al Jawarneh
 Data arrives fast during peak hours
 Exceeds the capacity of ingestion and processing
systems
● Spatial Approximate Query Processing (SAQP) is
the key.
QoS Tension for achieving SLAs
latency
Spatial (Approximate) Query Processing (S(A)QP)
Throughput
Accuracy
Tension
4
● Exactness is not necessary for
decision making in smart
cities!
 After 1 second, we obtain a
99.5 accurate early result,
which is satisfactory for
decision making, which then
makes the final exact result not
needed.
Spatial Approximate Query Processing
(SAQP)
• SAQP employs a spatial sampling
design aiming at resolving gracefully
the tension between low-latency and
high-accuracy QoS goals.
• Spatial sampling. Selecting a
miniscule version of the population
to compute geo-statistics: mean,
range, total.
– Based on the fact that decision
makers are can withstand a tiny
loss in error-bounded
accuracy in exchange for a
plausible latency gain.
– In streaming settings, data
keeps arriving, the ‘population’
metaphor vanishes.
Spatial Sampling
Spatial
sample
Incrementalizing
results w/ rigorous
Error bounds
Incrementalizing
results w/ rigorous
Error bounds
SAQP
Relying on samples for computing geo-
stats
Input
Stream
SLA:
Latency/throughput
QoS goals
Spatial Aware Online Sampling (SAOS):
overview
SAOS
%
Granular
• Nearby points share the same geohash prefixes
• Only the ‘filter’ stage of the ‘filter-and-refine’!
• SAOS focuses on SDL preservation, but with ‘false positives’
• ‘False positives’ are those tuples that have the same geohash, but do
not belong to the same neighborhood
• Geohash indexing. An ordering (string
representation) imposed on grid
surface earth planar representation.
• Nearby spatial objects share a pairwise
relationship
– spatially well-balanced
representative samples  are
known to yield better results for geo-
stats (average, median, etc.) in terms
of accuracy.
• Example online spatial query. “what is the
average trip distance travelled by taxis from
each neighbourhood in the city of Rome,
Italy”
• By sampling the same fraction from each
geohash, we approximately guarantee that
each neighborhood (stratum in statistics
parlance) is fairly represented
• Continuous results are updated by
incrementalization.
Plain SpatialSPE
SpatialSPE overview
• Geohash indexing. An ordering (string
representation) imposed on grid surface earth
planar representation.
Extended Spatial Aware Online Sampling
(SAOS): overview
Ex-SAOS
%
Coarser
• Applying all stages of ‘filter-and-refine’!
• Ex-SAOS focuses on SDL preservation, without ‘false positives’
• More accurate.
• Applying ‘filter-and-refine’ to solve
the PIP test before sampling.
• Discarding ‘false positives’.
• We exactly sample same fractions
from each neighbourhood
(borough, district, etc.,)
• Yields more accurate results.
Incorporating Ex-SAOS in SpatialSPE
Extended SpatialSPE
overview
Extended Spatial Aware Online Sampling
(EX-SAOS)
• Sampling proportionally
balanced tuples from each
administrative division
(neighborhood, district, borough,
etc.,) independently.
• ‘Filter-and-refine’ spatial join for
resolving the Point-in-Polygon
(PIP).
heuristic overview
Producing faster
than consuming
 Congestion!
Typical pipeline architecture w/o SAOS or Ex-
SAOS
Same
serving
speed, but
serving less
data!
The improved architecture w/ Ex-SAOS
SQL-alike
continuous
geospatial query
Supported Queries
“average trip distance
travelled by taxis in Rome,
Italy during the last three
months”
Online
continuous
query
Incremental
geo-stats
error bounds (e.g., relative error RE)
and spearman’s rho
̂
N
=
̂
N
=
( )
=
/ ( )
Single
queries
Stateful
aggregation
“top-3 boroughs in Bologna
city in Italy where people
tend to check out shared
bikes”.
Top-N online aggregation function
( , )
=
( , )
No pre-knowledge on the streaming geo-statistics is required, we
depends on incrementalization
Baselines
Spark Structures
Streaming Sampling is not supported
Spark Streaming SRS, Stratified Sampling
Depending on SRS for selecting a miniscule sample from a highly
skewed spatial dataset yields highly inaccurate results.
This is so because it tends to overlook regions and underlook other
regions.
SpatilSPE (SAOS) Only granular level
Spatial-awareness is
not supported
Accuracy can be
improved
Experimental setup
• Evaluation metrics
– Sampling fraction vs accuracy
– Sampling fraction vs rho
• Testbed
– Cluster: 6 nodes (Microsoft Azure
HDInsight Cluster )
– Datasets:
• NY City taxicab trips datasets (cohort
of six months dataset (around nine
million units)
Sampling fraction vs Accuracy
• We define the accuracy loss as accLoss = |estimatedMean – trueMean| / trueMean.
• ex-SAOS outperforms SpSS-based SRS in addition to SAOS for all geohash values
for all measures, accuracy loss and relative error.
• Ex-SAOS has lower accuracy loss compared to all geohash precisions applied to
SAOS (30 & 35)
0.00%
0.02%
0.04%
0.06%
0.08%
0.10%
0.12%
0.00%
0.10%
0.20%
0.30%
0.40%
0.50%
20% 40% 60% 80%
Accuracy
Loss
(lower
is
better)
Relative
Error(lower
is
better)
sampling fraction
95CI_SAOS_RE 95CI_ex-SAOS_RE
SAOS_loss ex-SAOS_loss
SAOS_loss35 SRS_loss
Spearman’s rho vs Sampling fraction
• rho is a measure for statistical dependency between the
ranking of two variables in a dataset.
• Ranking precision of ex-SAOS outperforms those for
SAOS( with all precisions 30 & 35) in addition to SRS.
0.75
0.8
0.85
0.9
0.95
1
1.05
0.8
0.85
0.9
0.95
1
1.05
20% 40% 60% 80%
rho
35
(higher
better)
rho
30
(higher
better)
sampling rate
SAOS_30 SRS ex-SAOS SAOS_35
Concluding remarks
• Most interesting locational intelligence
queries are required during high-pace data
streaming arrival rates, where SPEs can not
withstand the speed!
• Spatial sampling based on stratified designs
is proven to yield more plausible geo-stats.
• We have extended a plain SAOS by
enabling a spatial sampling design on a
coarser level that results in more accuracy.
Q&A and Contacts
Isam Al Jawarneh
Email: isam.aljawarneh3@unibo.it
Luca Foschini
Email: luca.foschini@unibo.it
Thanks for your attention!
Questions time...
IEEE CAMAD 2020
14-16 September

More Related Content

What's hot

Adaptive Intelligent Mobile Robotics
Adaptive Intelligent Mobile RoboticsAdaptive Intelligent Mobile Robotics
Adaptive Intelligent Mobile Roboticsahmad bassiouny
 
Interpolation
InterpolationInterpolation
Interpolationseidmmd
 
Entropy and its significance related to GIS
Entropy and its significance related to GISEntropy and its significance related to GIS
Entropy and its significance related to GISNaina Gupta
 
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...hpaocec
 
Spatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISSpatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISKU Leuven
 
Interpolation 2013
Interpolation 2013Interpolation 2013
Interpolation 2013Atiqa Khan
 
Geospatial analytics -Spatio-Temporal analysis
Geospatial analytics -Spatio-Temporal analysisGeospatial analytics -Spatio-Temporal analysis
Geospatial analytics -Spatio-Temporal analysisKriti Srivastava
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017. Holistic Benchmarking of Big Linked Data
 
Interpolation techniques in ArcGIS
Interpolation techniques in ArcGISInterpolation techniques in ArcGIS
Interpolation techniques in ArcGISHarsha Chamara
 
Hyougo iv2014 slide
Hyougo iv2014 slideHyougo iv2014 slide
Hyougo iv2014 slideYuuki Hyougo
 
spatial interoplation in GIS
spatial interoplation in GISspatial interoplation in GIS
spatial interoplation in GISGopalKharka
 
Charlotte Gallant Presentation
Charlotte Gallant PresentationCharlotte Gallant Presentation
Charlotte Gallant PresentationCOGS Presentations
 
Mapping Water features from SAR Imagery
Mapping Water features from SAR ImageryMapping Water features from SAR Imagery
Mapping Water features from SAR Imagerybramantiyo marjuki
 
Interpolation 2013
Interpolation 2013Interpolation 2013
Interpolation 2013Atiqa khan
 
Watershed Delineation Using ArcMap
Watershed Delineation Using ArcMapWatershed Delineation Using ArcMap
Watershed Delineation Using ArcMapArthur Green
 
Gis cience 2010 presentation
Gis cience 2010 presentationGis cience 2010 presentation
Gis cience 2010 presentationtemporalista
 
Watershed analysis using GIS
Watershed analysis using GISWatershed analysis using GIS
Watershed analysis using GISAya Osman
 

What's hot (20)

Adaptive Intelligent Mobile Robotics
Adaptive Intelligent Mobile RoboticsAdaptive Intelligent Mobile Robotics
Adaptive Intelligent Mobile Robotics
 
Interpolation
InterpolationInterpolation
Interpolation
 
Entropy and its significance related to GIS
Entropy and its significance related to GISEntropy and its significance related to GIS
Entropy and its significance related to GIS
 
Space Tug Rendezvous
Space Tug RendezvousSpace Tug Rendezvous
Space Tug Rendezvous
 
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
 
Spatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISSpatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GIS
 
Interpolation 2013
Interpolation 2013Interpolation 2013
Interpolation 2013
 
Geospatial analytics -Spatio-Temporal analysis
Geospatial analytics -Spatio-Temporal analysisGeospatial analytics -Spatio-Temporal analysis
Geospatial analytics -Spatio-Temporal analysis
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
 
Interpolation techniques in ArcGIS
Interpolation techniques in ArcGISInterpolation techniques in ArcGIS
Interpolation techniques in ArcGIS
 
Hyougo iv2014 slide
Hyougo iv2014 slideHyougo iv2014 slide
Hyougo iv2014 slide
 
spatial interoplation in GIS
spatial interoplation in GISspatial interoplation in GIS
spatial interoplation in GIS
 
Mobile mapping system
Mobile mapping systemMobile mapping system
Mobile mapping system
 
Mobile mapping
Mobile mappingMobile mapping
Mobile mapping
 
Charlotte Gallant Presentation
Charlotte Gallant PresentationCharlotte Gallant Presentation
Charlotte Gallant Presentation
 
Mapping Water features from SAR Imagery
Mapping Water features from SAR ImageryMapping Water features from SAR Imagery
Mapping Water features from SAR Imagery
 
Interpolation 2013
Interpolation 2013Interpolation 2013
Interpolation 2013
 
Watershed Delineation Using ArcMap
Watershed Delineation Using ArcMapWatershed Delineation Using ArcMap
Watershed Delineation Using ArcMap
 
Gis cience 2010 presentation
Gis cience 2010 presentationGis cience 2010 presentation
Gis cience 2010 presentation
 
Watershed analysis using GIS
Watershed analysis using GISWatershed analysis using GIS
Watershed analysis using GIS
 

Similar to IEEE Camad20 presentation - Isam Al Jawarneh

Driving Behavior for ADAS and Autonomous Driving VII
Driving Behavior for ADAS and Autonomous Driving VIIDriving Behavior for ADAS and Autonomous Driving VII
Driving Behavior for ADAS and Autonomous Driving VIIYu Huang
 
Driving behaviors for adas and autonomous driving XII
Driving behaviors for adas and autonomous driving XIIDriving behaviors for adas and autonomous driving XII
Driving behaviors for adas and autonomous driving XIIYu Huang
 
Ace an accurate and efficient multi entity device-free wlan localization system
Ace an accurate and efficient multi entity device-free wlan localization systemAce an accurate and efficient multi entity device-free wlan localization system
Ace an accurate and efficient multi entity device-free wlan localization systemieeeprojectschennai
 
Driving Behavior for ADAS and Autonomous Driving X
Driving Behavior for ADAS and Autonomous Driving XDriving Behavior for ADAS and Autonomous Driving X
Driving Behavior for ADAS and Autonomous Driving XYu Huang
 
Pedestrian Behavior/Intention Modeling for Autonomous Driving VI
Pedestrian Behavior/Intention Modeling for Autonomous Driving VIPedestrian Behavior/Intention Modeling for Autonomous Driving VI
Pedestrian Behavior/Intention Modeling for Autonomous Driving VIYu Huang
 
Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSwapnil Shrivastav
 
Pedestrian behavior/intention modeling for autonomous driving III
Pedestrian behavior/intention modeling for autonomous driving IIIPedestrian behavior/intention modeling for autonomous driving III
Pedestrian behavior/intention modeling for autonomous driving IIIYu Huang
 
Trajectory improves data delivery in urban vehicular networks
Trajectory improves data delivery in urban vehicular networks Trajectory improves data delivery in urban vehicular networks
Trajectory improves data delivery in urban vehicular networks Papitha Velumani
 
Pedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVPedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVYu Huang
 
Pedestrian behavior/intention modeling for autonomous driving V
Pedestrian behavior/intention modeling for autonomous driving VPedestrian behavior/intention modeling for autonomous driving V
Pedestrian behavior/intention modeling for autonomous driving VYu Huang
 
report2.doc
report2.docreport2.doc
report2.docbutest
 
Fisheye-Omnidirectional View in Autonomous Driving III
Fisheye-Omnidirectional View in Autonomous Driving IIIFisheye-Omnidirectional View in Autonomous Driving III
Fisheye-Omnidirectional View in Autonomous Driving IIIYu Huang
 
Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
Deep Multi-View Spatial-Temporal Network for Taxi Demand PredictionDeep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
Deep Multi-View Spatial-Temporal Network for Taxi Demand Predictionivaderivader
 
Driving behaviors for adas and autonomous driving XI
Driving behaviors for adas and autonomous driving XIDriving behaviors for adas and autonomous driving XI
Driving behaviors for adas and autonomous driving XIYu Huang
 
RIMS Update - Guideline for Pavement Strength Testing
RIMS Update - Guideline for Pavement Strength TestingRIMS Update - Guideline for Pavement Strength Testing
RIMS Update - Guideline for Pavement Strength TestingSimon Gough
 
Traffic models and estimation
Traffic models and estimation Traffic models and estimation
Traffic models and estimation Mina Yonan
 
Predict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTMPredict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTMAfzaal Subhani
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesVincenzo Gulisano
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstracttsysglobalsolutions
 

Similar to IEEE Camad20 presentation - Isam Al Jawarneh (20)

Driving Behavior for ADAS and Autonomous Driving VII
Driving Behavior for ADAS and Autonomous Driving VIIDriving Behavior for ADAS and Autonomous Driving VII
Driving Behavior for ADAS and Autonomous Driving VII
 
Driving behaviors for adas and autonomous driving XII
Driving behaviors for adas and autonomous driving XIIDriving behaviors for adas and autonomous driving XII
Driving behaviors for adas and autonomous driving XII
 
Ace an accurate and efficient multi entity device-free wlan localization system
Ace an accurate and efficient multi entity device-free wlan localization systemAce an accurate and efficient multi entity device-free wlan localization system
Ace an accurate and efficient multi entity device-free wlan localization system
 
Driving Behavior for ADAS and Autonomous Driving X
Driving Behavior for ADAS and Autonomous Driving XDriving Behavior for ADAS and Autonomous Driving X
Driving Behavior for ADAS and Autonomous Driving X
 
Pedestrian Behavior/Intention Modeling for Autonomous Driving VI
Pedestrian Behavior/Intention Modeling for Autonomous Driving VIPedestrian Behavior/Intention Modeling for Autonomous Driving VI
Pedestrian Behavior/Intention Modeling for Autonomous Driving VI
 
Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis Tools
 
Pedestrian behavior/intention modeling for autonomous driving III
Pedestrian behavior/intention modeling for autonomous driving IIIPedestrian behavior/intention modeling for autonomous driving III
Pedestrian behavior/intention modeling for autonomous driving III
 
Trajectory improves data delivery in urban vehicular networks
Trajectory improves data delivery in urban vehicular networks Trajectory improves data delivery in urban vehicular networks
Trajectory improves data delivery in urban vehicular networks
 
Pedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IVPedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving IV
 
Pedestrian behavior/intention modeling for autonomous driving V
Pedestrian behavior/intention modeling for autonomous driving VPedestrian behavior/intention modeling for autonomous driving V
Pedestrian behavior/intention modeling for autonomous driving V
 
report2.doc
report2.docreport2.doc
report2.doc
 
Fisheye-Omnidirectional View in Autonomous Driving III
Fisheye-Omnidirectional View in Autonomous Driving IIIFisheye-Omnidirectional View in Autonomous Driving III
Fisheye-Omnidirectional View in Autonomous Driving III
 
Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
Deep Multi-View Spatial-Temporal Network for Taxi Demand PredictionDeep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction
 
Driving behaviors for adas and autonomous driving XI
Driving behaviors for adas and autonomous driving XIDriving behaviors for adas and autonomous driving XI
Driving behaviors for adas and autonomous driving XI
 
RIMS Update - Guideline for Pavement Strength Testing
RIMS Update - Guideline for Pavement Strength TestingRIMS Update - Guideline for Pavement Strength Testing
RIMS Update - Guideline for Pavement Strength Testing
 
Traffic models and estimation
Traffic models and estimation Traffic models and estimation
Traffic models and estimation
 
Predict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTMPredict Traffic flow with KNN and LSTM
Predict Traffic flow with KNN and LSTM
 
Akbas-DASC-2022.pdf
Akbas-DASC-2022.pdfAkbas-DASC-2022.pdf
Akbas-DASC-2022.pdf
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architectures
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 

Recently uploaded

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 

Recently uploaded (20)

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 

IEEE Camad20 presentation - Isam Al Jawarneh

  • 1. Department of Computer Science and Engineering - DISI University of Bologna, Italy {isam.aljawarneh3, luca.Foschini, paolo.bellavista, antonio.corradi,Rebecca.montanari}@unibo.it, Spatially Representative Online Big Data Sampling for Smart Cities Isam Mashhour Al Jawarneh, Luca Foschini , Paolo Bellavista, Antonio Corradi, Rebecca Montanari IEEE CAMAD 2020 14-16 September Session 6 Next Generation Networks
  • 2. Agenda ➢ Spatial Approximate Query Processing: Background and Motivations ➢ QoS-aware Spatial Sampling ○ Ex-SAOS spatial online sampling ○ Supported interactive spatial queries ➢ Ex-SAOS Deployment ○ Baselines ○ Testing setup ➢ Results and Discussion ○ Azure Cloud Spark Test ➢ Concluding remarks
  • 3. Motivating Application Scenario: Smart Cities patient Community volunteers A mixed-workload scenario requiring at least: • Traffic Light Controller. Actuator decides to change lights consistently for ambulance to pass • Smart Real-time Pathfinder. Interactive navigation map for ambulances and other vehicles • Real-time Community Detector. Identify volunteers' communities in the surroundings of the patient participatory healthcare  geospatial queries (expensive!) – Proximity queries – Spatial join – Spatial clustering – Spatial geo-statistics. – k-Nearest Neighborhoods © 2020 Isam Al Jawarneh  Data arrives fast during peak hours  Exceeds the capacity of ingestion and processing systems ● Spatial Approximate Query Processing (SAQP) is the key.
  • 4. QoS Tension for achieving SLAs latency Spatial (Approximate) Query Processing (S(A)QP) Throughput Accuracy Tension 4
  • 5. ● Exactness is not necessary for decision making in smart cities!  After 1 second, we obtain a 99.5 accurate early result, which is satisfactory for decision making, which then makes the final exact result not needed. Spatial Approximate Query Processing (SAQP)
  • 6. • SAQP employs a spatial sampling design aiming at resolving gracefully the tension between low-latency and high-accuracy QoS goals. • Spatial sampling. Selecting a miniscule version of the population to compute geo-statistics: mean, range, total. – Based on the fact that decision makers are can withstand a tiny loss in error-bounded accuracy in exchange for a plausible latency gain. – In streaming settings, data keeps arriving, the ‘population’ metaphor vanishes. Spatial Sampling Spatial sample Incrementalizing results w/ rigorous Error bounds Incrementalizing results w/ rigorous Error bounds SAQP Relying on samples for computing geo- stats Input Stream SLA: Latency/throughput QoS goals
  • 7. Spatial Aware Online Sampling (SAOS): overview SAOS % Granular • Nearby points share the same geohash prefixes • Only the ‘filter’ stage of the ‘filter-and-refine’! • SAOS focuses on SDL preservation, but with ‘false positives’ • ‘False positives’ are those tuples that have the same geohash, but do not belong to the same neighborhood • Geohash indexing. An ordering (string representation) imposed on grid surface earth planar representation.
  • 8. • Nearby spatial objects share a pairwise relationship – spatially well-balanced representative samples  are known to yield better results for geo- stats (average, median, etc.) in terms of accuracy. • Example online spatial query. “what is the average trip distance travelled by taxis from each neighbourhood in the city of Rome, Italy” • By sampling the same fraction from each geohash, we approximately guarantee that each neighborhood (stratum in statistics parlance) is fairly represented • Continuous results are updated by incrementalization. Plain SpatialSPE SpatialSPE overview • Geohash indexing. An ordering (string representation) imposed on grid surface earth planar representation.
  • 9. Extended Spatial Aware Online Sampling (SAOS): overview Ex-SAOS % Coarser • Applying all stages of ‘filter-and-refine’! • Ex-SAOS focuses on SDL preservation, without ‘false positives’ • More accurate.
  • 10. • Applying ‘filter-and-refine’ to solve the PIP test before sampling. • Discarding ‘false positives’. • We exactly sample same fractions from each neighbourhood (borough, district, etc.,) • Yields more accurate results. Incorporating Ex-SAOS in SpatialSPE Extended SpatialSPE overview
  • 11. Extended Spatial Aware Online Sampling (EX-SAOS) • Sampling proportionally balanced tuples from each administrative division (neighborhood, district, borough, etc.,) independently. • ‘Filter-and-refine’ spatial join for resolving the Point-in-Polygon (PIP). heuristic overview
  • 12. Producing faster than consuming  Congestion! Typical pipeline architecture w/o SAOS or Ex- SAOS
  • 13. Same serving speed, but serving less data! The improved architecture w/ Ex-SAOS SQL-alike continuous geospatial query
  • 14. Supported Queries “average trip distance travelled by taxis in Rome, Italy during the last three months” Online continuous query Incremental geo-stats error bounds (e.g., relative error RE) and spearman’s rho ̂ N = ̂ N = ( ) = / ( ) Single queries Stateful aggregation “top-3 boroughs in Bologna city in Italy where people tend to check out shared bikes”. Top-N online aggregation function ( , ) = ( , ) No pre-knowledge on the streaming geo-statistics is required, we depends on incrementalization
  • 15. Baselines Spark Structures Streaming Sampling is not supported Spark Streaming SRS, Stratified Sampling Depending on SRS for selecting a miniscule sample from a highly skewed spatial dataset yields highly inaccurate results. This is so because it tends to overlook regions and underlook other regions. SpatilSPE (SAOS) Only granular level Spatial-awareness is not supported Accuracy can be improved
  • 16. Experimental setup • Evaluation metrics – Sampling fraction vs accuracy – Sampling fraction vs rho • Testbed – Cluster: 6 nodes (Microsoft Azure HDInsight Cluster ) – Datasets: • NY City taxicab trips datasets (cohort of six months dataset (around nine million units)
  • 17. Sampling fraction vs Accuracy • We define the accuracy loss as accLoss = |estimatedMean – trueMean| / trueMean. • ex-SAOS outperforms SpSS-based SRS in addition to SAOS for all geohash values for all measures, accuracy loss and relative error. • Ex-SAOS has lower accuracy loss compared to all geohash precisions applied to SAOS (30 & 35) 0.00% 0.02% 0.04% 0.06% 0.08% 0.10% 0.12% 0.00% 0.10% 0.20% 0.30% 0.40% 0.50% 20% 40% 60% 80% Accuracy Loss (lower is better) Relative Error(lower is better) sampling fraction 95CI_SAOS_RE 95CI_ex-SAOS_RE SAOS_loss ex-SAOS_loss SAOS_loss35 SRS_loss
  • 18. Spearman’s rho vs Sampling fraction • rho is a measure for statistical dependency between the ranking of two variables in a dataset. • Ranking precision of ex-SAOS outperforms those for SAOS( with all precisions 30 & 35) in addition to SRS. 0.75 0.8 0.85 0.9 0.95 1 1.05 0.8 0.85 0.9 0.95 1 1.05 20% 40% 60% 80% rho 35 (higher better) rho 30 (higher better) sampling rate SAOS_30 SRS ex-SAOS SAOS_35
  • 19. Concluding remarks • Most interesting locational intelligence queries are required during high-pace data streaming arrival rates, where SPEs can not withstand the speed! • Spatial sampling based on stratified designs is proven to yield more plausible geo-stats. • We have extended a plain SAOS by enabling a spatial sampling design on a coarser level that results in more accuracy.
  • 20. Q&A and Contacts Isam Al Jawarneh Email: isam.aljawarneh3@unibo.it Luca Foschini Email: luca.foschini@unibo.it Thanks for your attention! Questions time... IEEE CAMAD 2020 14-16 September