Why and how to scrape geospatial data from the web

This presentation highlights various use cases of geospatial data and its ability to augment existing data for more robust insights. It also covers some prominent sources of geospatial data.

  1. 1. Why and How to Scrape Geospatial Data from the Web
  2. 2. What is geospatial data? In simple terms, any data set that contains geographic fields in the form of location information, such as coordinates, city, address, or zip code, can be considered geospatial data.
  3. 3. The importance of geospatial data
  4. 4. Predictive analytics: SAP has been collaborating with Esri, a leading company in geographic information systems (GIS), and together they launched an offering that combines SAP HANA with Esri’s geodatabase. It allows customers to analyze geographic information by combining it with data from other sources.
  5. 5. SAP’s prototype calculates a risk prediction based on four indexes: soil, water, steepness, and vegetation. Regional governments can use the software to issue warnings to people living in high-risk areas.
  6. 6. Operational Intelligence: Many companies that provide operational intelligence solutions support the use of geospatial data among other data streams.
  7. 7. One of the simplest examples is network operators using geospatial data to decide where to set up their mobile towers. Uniform placement of towers is not always the best option. If a small area sits at a higher elevation than its surroundings, a tower placed there can cover a larger area.
  8. 8. Situational Intelligence: This technique draws on large volumes of multidimensional real-time data as well as historical data to find and solve problems. Part of that data is often geospatial. Visualizing and analyzing it can help answer the why, where, and how of events that occur suddenly.
  9. 9. Ground Analysis: Exploring geospatial data for areas taken over by ISIS shows how much of the land the group had taken over has been recovered, and which areas are currently prone to violence due to conflict between rebels and the terrorists.
  10. 10. Such ground analysis, carried out with geospatial data collected by satellites, can be used to benefit people.
  11. 11. Geolocating Footage: Scraped geospatial data can be useful for geolocating footage. Suppose you are watching a video that shows an unusual-looking building in which a couple of terrorists are hiding. You know the general area but not the exact location. You can cross-check the building against geospatial data for the area to pinpoint its location.
  12. 12. Archiving Data: Not all geospatial data that is collected is needed for immediate use, but it can be archived. Archiving is especially relevant for data from areas under conflict or related to war, since such data can be deleted or hidden due to government or political intervention.
  13. 13. Why is the use of different data sources recommended?
  14. 14. Geospatial data is not usually used on its own. It is primarily combined with other data sources and serves to augment existing data: it helps ensure that no incorrect data creeps into the analysis, and it bolsters the insights the analysis delivers.
  15. 15. Collecting more specific data: When using more than one data source, you can reduce data wastage by collecting only the data that you need.
  16. 16. Improving data quality: Surveys and other data collected through human intervention cannot be relied upon completely. In these cases, having more than one data format helps confirm anomalies in the data or in the fields that are most prone to small errors.
  17. 17. Getting the complete picture: Online and offline are the two sources of information that together build facts today. Certain factors like social networks and chat forums are becoming more and more important for brands. Geospatial data might show how opening new branches of a popular coffee shop gradually increases customer footfall.
  18. 18. How to get geospatial data from the web?
  19. 19. Data Sources
        • Any website with geo data (for example, Airbnb, Twitter)
        • NaturalEarthData.com
        • OpenData.arcgis.com (Esri Open Data)
        • EarthExplorer.usgs.gov (USGS Earth Explorer)
        • OpenStreetMap.org
        • sedac.ciesin.columbia.edu (NASA’s Socioeconomic Data and Applications Center)
        • geodata.grid.unep.ch (United Nations Environmental Data Explorer’s online database)
        • neo.sci.gsfc.nasa.gov (NASA’s Earth Observations)
        • scihub.copernicus.eu/dhus (Sentinel Satellite Data)
        • terrapop.org (Terra Populus)
  20. 20. Getting the complete picture: Online and offline are the two sources of information that together build facts today. Certain factors like social networks and chat forums are becoming more and more important for brands. Geospatial data might show how opening new branches of a popular coffee shop gradually increases customer footfall.
  21. 21. A specialized web crawling service provider like PromptCloud can help you extract data from specific websites on a recurring basis by leveraging its proven web data extraction platform.
  22. 22. A pioneer in custom and large-scale web data extraction. www.promptcloud.com | sales@promptcloud.com
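
As a rough illustration of slides 16 and 18-19, the sketch below pulls location fields out of a web page and cross-checks them against a second source. It is a minimal sketch under stated assumptions, not a description of any particular provider's method: it assumes the page embeds its coordinates as schema.org JSON-LD, which some listing sites do and many do not. The sample HTML, the place name, the reference coordinates, and the helper names (`JsonLdGeoParser`, `extract_places`, `is_consistent`) are all hypothetical.

```python
import json
import math
from html.parser import HTMLParser

# Hypothetical page fragment, standing in for HTML fetched from a real site.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@type": "Place", "name": "Example Cafe",
 "geo": {"@type": "GeoCoordinates", "latitude": 12.9716, "longitude": 77.5946}}
</script>
</head><body>...</body></html>
"""


class JsonLdGeoParser(HTMLParser):
    """Collects (name, latitude, longitude) tuples from JSON-LD script blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.places = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != "script" or not self._in_jsonld:
            return
        self._in_jsonld = False
        try:
            doc = json.loads("".join(self._buffer))
        except json.JSONDecodeError:
            return  # skip malformed blocks instead of failing the whole page
        items = doc if isinstance(doc, list) else [doc]
        for item in items:
            geo = item.get("geo") if isinstance(item, dict) else None
            if isinstance(geo, dict) and "latitude" in geo and "longitude" in geo:
                self.places.append(
                    (item.get("name"), float(geo["latitude"]), float(geo["longitude"]))
                )


def extract_places(html):
    """Return every (name, lat, lon) found in the page's JSON-LD blocks."""
    parser = JsonLdGeoParser()
    parser.feed(html)
    return parser.places


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres on a sphere of radius 6371 km."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def is_consistent(place, ref_lat, ref_lon, tolerance_km=1.0):
    """True if the scraped point lies within tolerance_km of a second source's point."""
    _, lat, lon = place
    return haversine_km(lat, lon, ref_lat, ref_lon) <= tolerance_km


if __name__ == "__main__":
    for place in extract_places(SAMPLE_HTML):
        # The reference coordinates are made up; in practice they would come
        # from another data set, such as a survey or an open geodata source.
        print(place, "consistent:", is_consistent(place, 12.9720, 77.5950))
```

The 1 km tolerance is an arbitrary choice for the example. A record that fails the check is not necessarily wrong, only a candidate for closer inspection.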
