The State of Big Data for Geo - ESRI Big Data Meetup
Presentation Transcript
The State of Big Data for Geo @ajturner CTO DC Dev Center @seangorman Strategist DC Dev Center
Of the 318 million mobile handsets shipped in 2011, 79.9% were GPS enabled (iSuppli 2010)
In the United States, 72.2% of users accessed social media sites and blogs through their mobile devices – up 37% from 2010 (ComScore 2011)
What is Big Data?
MapReduce Geo (MrGeo)
• DIA project initiated by Terry Busch to extend geoprocessing to very large data sets
• Built by SPADAC -> GeoEye -> Digital Globe
• Uses HDFS and MapReduce to store, process, and index geospatial imagery and vector data
• Interoperable with:
  – ArcGIS Desktop
  – COMET
  – Google Earth
  – WMS clients
  – Adobe Flex and Silverlight environments
• Listed on the Hadoop Apache page to be open sourced
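To make the MapReduce idea on this slide concrete, here is a minimal single-process sketch in plain Python of the map, shuffle, and reduce steps applied to point data. It is not MrGeo's API and does not use Hadoop or HDFS; the function names, the one-degree grid, and the sample coordinates are all assumptions chosen for illustration.

```python
from collections import defaultdict
from math import floor


def map_point(point, cell_size=1.0):
    """Map step: emit (tile_key, 1) for the grid cell containing the point."""
    lon, lat = point
    return (floor(lon / cell_size), floor(lat / cell_size)), 1


def shuffle(pairs):
    """Shuffle step: group the emitted values by tile key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each tile."""
    return {key: sum(values) for key, values in groups.items()}


def count_points_per_tile(points, cell_size=1.0):
    """Run map -> shuffle -> reduce in one process (hypothetical helper)."""
    return reduce_counts(shuffle(map_point(p, cell_size) for p in points))


# Made-up sample (lon, lat) pairs, for illustration only.
points = [(-77.03, 38.90), (-77.50, 38.20), (-73.98, 40.75), (-104.99, 39.74)]
tile_counts = count_points_per_tile(points)
print(tile_counts)
```

In a real cluster the map and reduce functions would run on many machines over data split across the file system; the per-tile key is what lets the work be partitioned.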
Brian Levy 2010
GCEP
• Geospatial complex event processing
• Extends Esper to include the ability to use geospatial constructs in the rules for filtering events
• The ability to utilize the OGC geospatial functions within Esper's Event Processing Language (EPL).
  – Contains, within, disjoint, intersects, overlaps, crosses, intersection, touches, buffer, relate, union, convex hull
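As an illustration of using a geospatial construct as an event filter, the sketch below applies a "within" test to a stream of events in plain Python. It is not Esper or EPL syntax; the event fields (`id`, `lon`, `lat`), the function names, and the square fence are hypothetical, and the ray-casting test is a simple stand-in for a full OGC implementation (points exactly on the boundary are not handled consistently).

```python
def point_within_polygon(point, polygon):
    """Ray-casting test: True if the point lies inside the polygon ring."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point count.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def filter_events(events, polygon):
    """Yield only the events whose location falls within the polygon."""
    for event in events:
        if point_within_polygon((event["lon"], event["lat"]), polygon):
            yield event


# Hypothetical fence and events, for illustration only.
fence = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0), (0.0, 10.0)]
events = [
    {"id": "a", "lon": 5.0, "lat": 5.0},
    {"id": "b", "lon": 15.0, "lat": 5.0},
    {"id": "c", "lon": 1.0, "lat": 9.0},
]
matched_ids = [e["id"] for e in filter_events(events, fence)]
print(matched_ids)
```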
Neo4j Spatial
• Utilities for importing from ESRI Shapefile as well as Open Street Map files
• Support for all the common geometry types
• An RTree index for fast searches on geometries
• Support for topology operations during the search (contains, within, intersects, covers, disjoint, etc.)
• The possibility to enable spatial operations on any graph of data, regardless of the way the spatial data is stored, as long as an adapter is provided to map from the graph to the geometries.
• Ability to split a single layer or dataset into multiple sub-layers or views with pre-configured filters
Peter Neubauer 2011
Peter Neubauer 2011
ESRI and Big Data
Why Big Data?
1. Connecting Big Data to GIS
2. Scaling Geoprocessing
Case Study: NYC Marathon
Case Study: Colorado Wildfires
3. Evolving GeoAnalysis
Are we doing it backwards?
May 2nd OBL Tweets
6,454 Tweets with GPS
6,000,000 total Tweets
Why run our analysis with six thousand instead of six million?
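For scale, the two tweet counts quoted on the slide work out as follows. This is plain arithmetic on the slide's figures; the variable names are illustrative.

```python
gps_tweets = 6_454        # tweets with GPS coordinates (from the slide)
total_tweets = 6_000_000  # total tweets (from the slide)

share = gps_tweets / total_tweets
ratio = total_tweets / gps_tweets

print(f"{share:.4%} of tweets carry GPS coordinates")  # 0.1076%
print(f"about 1 in {round(ratio)} tweets")             # about 1 in 930
```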