SlideShare a Scribd company logo
1 of 12
Download to read offline
Big Data in GIS
Environment
Shivaprakash Yaragal
M.Tech GIS(2015-17)
Objective
1. To investigate the existing capabilities of esri
products in handling huge data sets. Processing
and analysis of data sets using esri products.
2. Conduct study on recent esri architecture for Big
Data processing.
Objective 1
• To investigate the existing capabilities of esri
products in handling huge data sets. Processing
and analysis of data sets using esri products.
Tasks involved
• Understanding Big Data in GIS
• Identifying python packages and tools used for
data processing with respect to esri products.
• Identifying visualization package and resources to
be used with esri products
• Working on New York Taxi data
Spatio-
Temporal Big
Data
Data Source
Type Open Source
Pandas Python package Yes
ArcPy Python package No
IPython Python Package Yes
Anaconda IDE Yes
Tableau Public Software Free public version
FME ArcGIS Interoperability
Extension
No
Figure A: New York Taxi Data(Green Taxi)
Data Source :
http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
Objective 1
Objective 1
Green Taxi Data
Ancillary dataDropOff dataPickup data
csv splitting
NY locality polygon
Spatial Join Spatial Join
Merge Data
Data Filtering
Data Visualization
Methodolo
gy
Python
ArcPy
Tableau Public
Python
Interoperability Extension
Python
Python
ArcPy
Python
Spatial Processing
Pre-Processing
Visualization
Method 1 Method 2 Method 3
Method
1
Preprocessing
and Spatial Processing
Data splitting, Spatial
processing and merging
Tool
Post-processing Tool
Visualization
IPython Analysis-
Visualization
Figure E : From Bushwick South
to Crown Height North(BK78-
BK61)
Figure F From Crown Heights
South-Clinton Hill(BK63-BK69)
Figure B
Figure F
Figure C
Spatial Processing
Method 2
Spatial Join Data Merging
Visualization could be
IPython
Or Tableau Public
Method 3
Spatial Processing by
either ArcPy or
Interoperability tool
Tableau Public
Visualization
Data Analytics and Visualization using Tableau Public
Figure G
Figure I
Figure H
Figure J
Can we answer some question?
Figure K : Peak Traffic: 7 am to 11 am . Fair
remains in and around average even during peak
hour. Hence no dynamic fairing
Figure L: Circle marked in Green are anomalies
that deviates from patter and these should be
investigated. Circle in red are outliers
Objective 1What can inference:
• Most of passenger travel alone.
• Peak pickup of individual passenger is between 7
am to 10 am and then peaks again between 4 PM
to 10 PM.
• For Sustainability point of view 2 sitter vehicle(1
passenger) could be applied to have efficient
transport system.
• Car polling technique could be devised between
evening peak hours between 3 PM to 9 PM.
Figure M
Figure N : 2 sitter
vehicle(1
passenger)
https://public.tableau.com/views/NYTaxiDataMultiDimentionalVisualizationRegionWise12468sitterdistribution/Regio
nWise12468sitterdistribution?:embed=y&:display_count=yes
Objective 1 : ConclusionWhich is the best of 3 methods???
• Data Used : 235 MB, Point Data, 1.2 million rows
• System Used: Windows 10 64 bit OS, 8 GB RAM, 1 core processor
ArcPy (D) ArcPy (S) Python Interoperability
Extension
Python
visualization
Tableau Public
Pre-Processing 15 Min 2 Min 2 Min - - -
Spatial Processing 1.5 Hr 20 Min - 38 Min - -
Post-Processing 18 Min 3 Min 3 Min - - -
Visualization - - - - Basic for Data
and Analytics
Advanced for
Data Analytics
Design Timing 4 Weeks 4 weeks 4 weeks 1 Week 1 Weeks(3 Types
of Graphs)
3 days
Open/License/Publi
c
License License Open License Open Public and Free
Dependency Independent Independent Independent Depends on
ArcGIS License
Independent Independent
ArcPy(D)- Desktop
ArcPy(S)- Server
Objective 2
• Conduct study on esri capabilities in Big Data domain : Architecture
Machine 2
Machine 3
Machine 1: Base ArcGIS
Enterprise
Hosting Server
Web
Adaptor
(Portal)
Web Adaptor
Hosting
server
Portal for
ArcGIS
Web Adaptor
(GeoAnalytics
Server)
(GeoAnalytics
Server)
BigData File
Share . HDFS
folder
ArcGIS Relational
Data Store
ArcGIS
Spatiotemp
oral Store Machine
4
Thank You

More Related Content

What's hot

Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...Benjamin Hennig
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Neal Lathia
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISShaun Lewis
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaLeo Salemann
 
Creating gridded cartograms: Israel and the Palestine Territories
Creating gridded cartograms: Israel and the Palestine TerritoriesCreating gridded cartograms: Israel and the Palestine Territories
Creating gridded cartograms: Israel and the Palestine TerritoriesBenjamin Hennig
 
Temporary Coherence 3D Animation
Temporary Coherence 3D AnimationTemporary Coherence 3D Animation
Temporary Coherence 3D AnimationAkshat Singh
 
A Study on New York City Taxi Rides
A Study on New York City Taxi RidesA Study on New York City Taxi Rides
A Study on New York City Taxi RidesCaglar Subasi
 
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...Hiroyuki Miyazaki
 
Using R to Visualize Spatial Data: R as GIS - Guy Lansley
Using R to Visualize Spatial Data: R as GIS - Guy LansleyUsing R to Visualize Spatial Data: R as GIS - Guy Lansley
Using R to Visualize Spatial Data: R as GIS - Guy LansleyGuy Lansley
 
PLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your FieldPLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your FieldPlotly
 
SC10 project slides
SC10 project slidesSC10 project slides
SC10 project slidesJason Riedy
 
Prediction of taxi rides ETA
Prediction of taxi rides ETAPrediction of taxi rides ETA
Prediction of taxi rides ETADaniel Marcous
 
SexTant: Visualizing Time-Evolving Linked Geospatial Data
SexTant: Visualizing Time-Evolving Linked Geospatial DataSexTant: Visualizing Time-Evolving Linked Geospatial Data
SexTant: Visualizing Time-Evolving Linked Geospatial DataCharalampos (Babis) Nikolaou
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Viet-Trung TRAN
 

What's hot (19)

Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
Yet another population cartogram: Creating gridded cartograms using ArcGIS an...
 
Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)Mining Smartphone Data (with Python)
Mining Smartphone Data (with Python)
 
Icbai 2018 ver_1
Icbai 2018 ver_1Icbai 2018 ver_1
Icbai 2018 ver_1
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GIS
 
Studying Migrations Routes: New data and Tools
Studying Migrations Routes: New data and ToolsStudying Migrations Routes: New data and Tools
Studying Migrations Routes: New data and Tools
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
 
Creating gridded cartograms: Israel and the Palestine Territories
Creating gridded cartograms: Israel and the Palestine TerritoriesCreating gridded cartograms: Israel and the Palestine Territories
Creating gridded cartograms: Israel and the Palestine Territories
 
Temporary Coherence 3D Animation
Temporary Coherence 3D AnimationTemporary Coherence 3D Animation
Temporary Coherence 3D Animation
 
A Study on New York City Taxi Rides
A Study on New York City Taxi RidesA Study on New York City Taxi Rides
A Study on New York City Taxi Rides
 
Studying Migrations Routes: New data and Tools
Studying Migrations Routes: New data and ToolsStudying Migrations Routes: New data and Tools
Studying Migrations Routes: New data and Tools
 
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...
Automated Construction of Coverage Catalogues of ASTER Satellite Image for Ur...
 
Using R to Visualize Spatial Data: R as GIS - Guy Lansley
Using R to Visualize Spatial Data: R as GIS - Guy LansleyUsing R to Visualize Spatial Data: R as GIS - Guy Lansley
Using R to Visualize Spatial Data: R as GIS - Guy Lansley
 
PLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your FieldPLOTCON NYC: Custom Colormaps for Your Field
PLOTCON NYC: Custom Colormaps for Your Field
 
SC10 project slides
SC10 project slidesSC10 project slides
SC10 project slides
 
Prediction of taxi rides ETA
Prediction of taxi rides ETAPrediction of taxi rides ETA
Prediction of taxi rides ETA
 
SexTant: Visualizing Time-Evolving Linked Geospatial Data
SexTant: Visualizing Time-Evolving Linked Geospatial DataSexTant: Visualizing Time-Evolving Linked Geospatial Data
SexTant: Visualizing Time-Evolving Linked Geospatial Data
 
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
Paper@Soict2015: GPSInsights: towards a scalable framework for mining massive...
 
NASA Terra Data Fusion
NASA Terra Data FusionNASA Terra Data Fusion
NASA Terra Data Fusion
 

Similar to Big data in GIS Environment

SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...IOSR Journals
 
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...KamleshKumar394
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupalDay
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open dataTwinbit
 
Service Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningService Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningIIRindia
 
A Vehicles for Open-Pit Mining with Smart Scheduling System for Transportati...
A Vehicles for Open-Pit Mining with Smart Scheduling  System for Transportati...A Vehicles for Open-Pit Mining with Smart Scheduling  System for Transportati...
A Vehicles for Open-Pit Mining with Smart Scheduling System for Transportati...IJMSIRJOURNAL
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsIRJET Journal
 
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial Intelligence
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial IntelligenceLIDAR Magizine 2015: The Birth of 3D Mapping Artificial Intelligence
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial IntelligenceJason Creadore 🌐
 
Scaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTOScaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTOCARTO
 
How to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsHow to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsAlibaba Cloud
 
Big data analytics for transport
Big data analytics for transportBig data analytics for transport
Big data analytics for transportUKinItaly
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoopIRJET Journal
 
Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportMap reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportAhmad El Tawil
 
Complex Analysis in Public Transportation: A Step towards Smart Cities
Complex Analysis in Public Transportation: A Step towards Smart CitiesComplex Analysis in Public Transportation: A Step towards Smart Cities
Complex Analysis in Public Transportation: A Step towards Smart CitiesDataWorks Summit
 
Enabling Application Integrated Proactive Fault Tolerance
Enabling Application Integrated Proactive Fault ToleranceEnabling Application Integrated Proactive Fault Tolerance
Enabling Application Integrated Proactive Fault ToleranceDai Yang
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesCédric Fauvet
 

Similar to Big data in GIS Environment (20)

SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
 
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...
Mining on Relationships in Big Data era using Improve Apriori Algorithm with ...
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Drupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open dataDrupal Day 2011 - Thinking spatially with your open data
Drupal Day 2011 - Thinking spatially with your open data
 
Thinking spatially with your open data
Thinking spatially with your open dataThinking spatially with your open data
Thinking spatially with your open data
 
Service Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningService Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data Mining
 
A Vehicles for Open-Pit Mining with Smart Scheduling System for Transportati...
A Vehicles for Open-Pit Mining with Smart Scheduling  System for Transportati...A Vehicles for Open-Pit Mining with Smart Scheduling  System for Transportati...
A Vehicles for Open-Pit Mining with Smart Scheduling System for Transportati...
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather Conditions
 
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial Intelligence
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial IntelligenceLIDAR Magizine 2015: The Birth of 3D Mapping Artificial Intelligence
LIDAR Magizine 2015: The Birth of 3D Mapping Artificial Intelligence
 
Scaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTOScaling Spatial Analytics with Google Cloud & CARTO
Scaling Spatial Analytics with Google Cloud & CARTO
 
Ramabrahmachary Sattenapalli
Ramabrahmachary SattenapalliRamabrahmachary Sattenapalli
Ramabrahmachary Sattenapalli
 
How to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart LogisticsHow to Leverage Big Data to Deliver Smart Logistics
How to Leverage Big Data to Deliver Smart Logistics
 
Big data analytics for transport
Big data analytics for transportBig data analytics for transport
Big data analytics for transport
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoop
 
Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases reportMap reduce advantages over parallel databases report
Map reduce advantages over parallel databases report
 
Complex Analysis in Public Transportation: A Step towards Smart Cities
Complex Analysis in Public Transportation: A Step towards Smart CitiesComplex Analysis in Public Transportation: A Step towards Smart Cities
Complex Analysis in Public Transportation: A Step towards Smart Cities
 
Enabling Application Integrated Proactive Fault Tolerance
Enabling Application Integrated Proactive Fault ToleranceEnabling Application Integrated Proactive Fault Tolerance
Enabling Application Integrated Proactive Fault Tolerance
 
Big data analysis
Big data analysisBig data analysis
Big data analysis
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"
 

Recently uploaded

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 

Recently uploaded (20)

NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 

Big data in GIS Environment

  • 1. Big Data in GIS Environment Shivaprakash Yaragal M.Tech GIS(2015-17)
  • 2. Objective 1. To investigate the existing capabilities of esri products in handling huge data sets. Processing and analysis of data sets using esri products. 2. Conduct study on recent esri architecture for Big Data processing.
  • 3. Objective 1 • To investigate the existing capabilities of esri products in handling huge data sets. Processing and analysis of data sets using esri products. Tasks involved • Understanding Big Data in GIS • Identifying python packages and tools used for data processing with respect to esri products. • Identifying visualization package and resources to be used with esri products • Working on New York Taxi data
  • 4. Spatio- Temporal Big Data Data Source Type Open Source Pandas Python package Yes ArcPy Python package No IPython Python Package Yes Anaconda IDE Yes Tableau Public Software Free public version FME ArcGIS Interoperability Extension No Figure A: New York Taxi Data(Green Taxi) Data Source : http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml Objective 1
  • 5. Objective 1 Green Taxi Data Ancillary dataDropOff dataPickup data csv splitting NY locality polygon Spatial Join Spatial Join Merge Data Data Filtering Data Visualization Methodolo gy Python ArcPy Tableau Public Python Interoperability Extension Python Python ArcPy Python Spatial Processing Pre-Processing Visualization Method 1 Method 2 Method 3
  • 6. Method 1 Preprocessing and Spatial Processing Data splitting, Spatial processing and merging Tool Post-processing Tool Visualization IPython Analysis- Visualization Figure E : From Bushwick South to Crown Height North(BK78- BK61) Figure F From Crown Heights South-Clinton Hill(BK63-BK69) Figure B Figure F Figure C
  • 7. Spatial Processing Method 2 Spatial Join Data Merging Visualization could be IPython Or Tableau Public Method 3 Spatial Processing by either ArcPy or Interoperability tool Tableau Public Visualization Data Analytics and Visualization using Tableau Public Figure G Figure I Figure H Figure J
  • 8. Can we answer some question? Figure K : Peak Traffic: 7 am to 11 am . Fair remains in and around average even during peak hour. Hence no dynamic fairing Figure L: Circle marked in Green are anomalies that deviates from patter and these should be investigated. Circle in red are outliers
  • 9. Objective 1What can inference: • Most of passenger travel alone. • Peak pickup of individual passenger is between 7 am to 10 am and then peaks again between 4 PM to 10 PM. • For Sustainability point of view 2 sitter vehicle(1 passenger) could be applied to have efficient transport system. • Car polling technique could be devised between evening peak hours between 3 PM to 9 PM. Figure M Figure N : 2 sitter vehicle(1 passenger) https://public.tableau.com/views/NYTaxiDataMultiDimentionalVisualizationRegionWise12468sitterdistribution/Regio nWise12468sitterdistribution?:embed=y&:display_count=yes
  • 10. Objective 1 : ConclusionWhich is the best of 3 methods??? • Data Used : 235 MB, Point Data, 1.2 million rows • System Used: Windows 10 64 bit OS, 8 GB RAM, 1 core processor ArcPy (D) ArcPy (S) Python Interoperability Extension Python visualization Tableau Public Pre-Processing 15 Min 2 Min 2 Min - - - Spatial Processing 1.5 Hr 20 Min - 38 Min - - Post-Processing 18 Min 3 Min 3 Min - - - Visualization - - - - Basic for Data and Analytics Advanced for Data Analytics Design Timing 4 Weeks 4 weeks 4 weeks 1 Week 1 Weeks(3 Types of Graphs) 3 days Open/License/Publi c License License Open License Open Public and Free Dependency Independent Independent Independent Depends on ArcGIS License Independent Independent ArcPy(D)- Desktop ArcPy(S)- Server
  • 11. Objective 2 • Conduct study on esri capabilities in Big Data domain : Architecture Machine 2 Machine 3 Machine 1: Base ArcGIS Enterprise Hosting Server Web Adaptor (Portal) Web Adaptor Hosting server Portal for ArcGIS Web Adaptor (GeoAnalytics Server) (GeoAnalytics Server) BigData File Share . HDFS folder ArcGIS Relational Data Store ArcGIS Spatiotemp oral Store Machine 4