Terra Populus: Integrated Data on Population and Environment

Terra Populus is an NSF-funded DataNet project that seeks to lower the barriers for conducting human-environment interactions research. TerraPop provides access to hundreds of census and survey microdata samples, area-level data describing geographic units, and environmental data, commonly stored as raster data, describing land use, land cover, and climate. The data access system adds value to these data by supporting transformations across microdata, area-level data, and raster data. Users may select variables of interest from any of the three formats and obtain output in their desired format. This presentation will provide an overview of the data available in the TerraPop data access system and the system’s transformation functionality, as well as a demonstration of the data access system.

  1. TerraPop Goals: Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different scientific domains easily interoperable. Provide an organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing population and the environment.
  2. TerraPop in Context: Collaborating Organizations. Data integration expertise; large census and survey data collections and expertise; institutional foundation; human-environment interactions research expertise; environmentally-oriented data collections and expertise; data preservation and sustainability expertise; social science data collections and expertise; major producers and distributors of data on both humans and their environment; major producers of tools for integrating and transforming data across formats; leaders in preservation and sustainability.
  3. Background: Sustainable Digital Data Access and Preservation Network (DataNet). Provide reliable digital preservation, access, integration, and analysis; anticipate and adapt to technological change and user needs; engage with frontiers of computer/information science and cyberinfrastructure (CI); serve as component elements of an interoperable data preservation and access network.
  4. Established in 2009
  5. Established in 2011
  6. TerraPop in Context: DataNet Cyberinfrastructure. Curated population and environment data collection: exposed through DataONE, SEAD; extracts exportable to DFC. Integration services: potentially available through DFC, SEAD; open source components and API.
  7. Source Data. Two domains: population and environment. Three data structures: microdata, area-level data, rasters.
  8. Making disparate data formats interoperable. Microdata: characteristics of individuals and households. Area-level data: characteristics of places defined by boundaries. Raster data: values tied to spatial coordinates.
  9. Location-Based Integration (microdata, area-level data, rasters): Mix and match variables originating in any of the data structures. Obtain output in the data structure most useful to you.
  10. Location-Based Integration (microdata, area-level data, rasters): Individuals and households with their environmental and social context.
      Age  Sex  Landcover
      36   M    Forest
      34   F    Forest
      11   M    Forest
      8    M    Forest
      42   M    Grassland
      39   F    Grassland
      15   F    Grassland
  11. Location-Based Integration (microdata, area-level data, rasters): Summarized environmental and population characteristics for administrative districts.
      County ID  Mean Ann. Precip.  Median HH Income
      G01001     768                50,500
      G01003     589                48,500
      G01005     867                51,000
      G01007     701                50,750
  12. Location-Based Integration (microdata, area-level data, rasters): Rasters of population and environment data.
  13. Why TerraPop? Data; access; preservation; documentation; creation; transformations.
  14. Improved Data Access
  15. Preservation: Data producers have no preservation plan (example: GLI crops data). Previous versions of data are difficult or impossible to find (example: MODIS Land Cover Collection 4 was superseded by Collection 5, but Collection 4 is unavailable).
  16. Documentation: Data lacks sufficient (or any) metadata. http://gli.environment.umn.edu/
  17. Documentation: GLI crops data were originally provided through an anonymous FTP site. No metadata were provided with the data files. So, we wrote it!
  18. http://www.earthstat.org/
  19. http://www.earthstat.org/wp-content/uploads/METADATA_HarvestedAreaYield175Crops.pdf
  20. Abaca – Harvested Area GeoTIFF Metadata (http://www.earthstat.org)
  21. Abaca – Harvested Area GeoTIFF Metadata (http://www.terrapop.org)
  22. Creation: Historical subnational GIS data, matched to census data and aligned with the most recent GIS data available for a given country.
  23. Photographed Countries Census Bureau Library, Library of Congress, Harvard
  24. Creation: Historical subnational GIS data, matched to census data and aligned with the most recent GIS data available for a given country. Area-level data, tabulated from census microdata or obtained from census agencies as digital files, PDFs, or HTML tables.
  25. Transformations. Raster types: Continuous, Binary, Categorical. Summary operations: Min, Percent area, Mode, Max, Total area, Number of Classes, Mean, Count, Percent area*, Total area*. (* Available for some continuous agricultural rasters.)
  26. Area-Level Summary of Raster Data
  27. Data in TerraPop
  28. GIS Boundary Files: Completed; In Progress
  29. Beta Raster Data. Global Landscapes Initiative (GLI): yield and harvested area for 175 crops. Global Land Cover 2000 (GLC2000): land cover data, circa 2000, derived from the VEGETATION instrument on the SPOT 4 satellite. WorldClim: climate data describing temperature, precipitation, and bioclimatic variables, created from weather station data collected from approximately 1950-2000.
  30. New Raster Data. MODIS Land Cover Type (MCD12Q1): yearly land cover data derived from the MODIS Terra and Aqua satellites, available for 2001-2012; 500 meter spatial resolution; available in five land cover classifications (IGBP, University of Maryland, LAI/fPAR, Net Primary Productivity, Plant Functional Type). Now available on our staging site.
  31. Project Status: Currently in project year 4; prepping a rollout of new data, which you can preview at http://beta2.terrapop.org ; prepping a new UI for summer 2015; always creating new data!
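
The area-level summary of raster data named in slides 25 and 26 can be illustrated with a small zonal-statistics sketch. This is not TerraPop's implementation: the function names, the tiny grids, and the zone ids are all hypothetical, rasters are plain nested lists rather than GeoTIFFs, and every cell is assumed to cover the same area so that a share of cells can stand in for percent area. The sketch uses only the operations whose raster type is unambiguous on the slide (min, max, and mean for continuous; percent area for binary; mode and number of classes for categorical).

```python
from collections import Counter, defaultdict
from statistics import mean


def cells_by_zone(raster, zones):
    """Group raster cell values by the zone id found at the same row/column."""
    grouped = defaultdict(list)
    for raster_row, zone_row in zip(raster, zones):
        for value, zone_id in zip(raster_row, zone_row):
            grouped[zone_id].append(value)
    return grouped


def summarize_continuous(raster, zones):
    """Min, max, and mean of a continuous raster within each zone."""
    return {
        zone_id: {"min": min(cells), "max": max(cells), "mean": mean(cells)}
        for zone_id, cells in cells_by_zone(raster, zones).items()
    }


def summarize_binary(raster, zones):
    """Percent of each zone's cells equal to 1 (assumes equal-area cells)."""
    return {
        zone_id: 100.0 * sum(cells) / len(cells)
        for zone_id, cells in cells_by_zone(raster, zones).items()
    }


def summarize_categorical(raster, zones):
    """Most frequent class (mode) and number of distinct classes per zone."""
    summary = {}
    for zone_id, cells in cells_by_zone(raster, zones).items():
        counts = Counter(cells)
        # On a tie, most_common keeps the class that was encountered first.
        summary[zone_id] = {
            "mode": counts.most_common(1)[0][0],
            "n_classes": len(counts),
        }
    return summary


# Hypothetical 2x4 grids: zone 1 covers the left half, zone 2 the right half.
zones = [[1, 1, 2, 2],
         [1, 1, 2, 2]]
continuous = [[1.0, 2.0, 5.0, 7.0],
              [3.0, 4.0, 6.0, 8.0]]
binary = [[1, 0, 1, 1],
          [0, 0, 1, 1]]
categorical = [[10, 10, 20, 30],
               [10, 20, 30, 30]]

continuous_summary = summarize_continuous(continuous, zones)
binary_summary = summarize_binary(binary, zones)
categorical_summary = summarize_categorical(categorical, zones)

print(continuous_summary)
print(binary_summary)
print(categorical_summary)
```

With real data the zone grid would come from rasterizing boundary polygons onto the raster's grid, and cell areas would need to be accounted for wherever cells are not equal in area.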

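The location-based integration shown in slides 10 and 11 can be sketched as a join on a shared geographic identifier. The person records and the county table below reuse the values shown on those slides, but the assignment of persons to counties, the field names, and both function names are assumptions made for illustration; the slides do not say which county each person lives in, and this is not a description of the TerraPop API.

```python
from collections import defaultdict

# Area-level table with the values shown on slide 11, keyed by county ID.
area_level = {
    "G01001": {"mean_ann_precip": 768, "median_hh_income": 50500},
    "G01003": {"mean_ann_precip": 589, "median_hh_income": 48500},
    "G01005": {"mean_ann_precip": 867, "median_hh_income": 51000},
    "G01007": {"mean_ann_precip": 701, "median_hh_income": 50750},
}

# Person records with the ages and sexes shown on slide 10.
# The county_id values are hypothetical.
microdata = [
    {"age": 36, "sex": "M", "county_id": "G01001"},
    {"age": 34, "sex": "F", "county_id": "G01001"},
    {"age": 11, "sex": "M", "county_id": "G01001"},
    {"age": 8, "sex": "M", "county_id": "G01001"},
    {"age": 42, "sex": "M", "county_id": "G01003"},
    {"age": 39, "sex": "F", "county_id": "G01003"},
    {"age": 15, "sex": "F", "county_id": "G01003"},
]


def attach_context(records, area_table):
    """Return copies of the person records with their county's attributes added."""
    return [{**record, **area_table[record["county_id"]]} for record in records]


def tabulate_by_area(records):
    """Count person records per county, turning microdata into area-level data."""
    counts = defaultdict(int)
    for record in records:
        counts[record["county_id"]] += 1
    return dict(counts)


enriched = attach_context(microdata, area_level)
persons_per_county = tabulate_by_area(microdata)

print(enriched[0])
print(persons_per_county)
```

The same key supports both directions: area-level attributes flow down to individuals as context, and individual records roll up into area-level tabulations.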