SlideShare a Scribd company logo
1 of 33
See the Earth as it could be.
Mapping Deforestation Trends
in the Brazilian Amazon Using
Machine Learning
Jason Brown, Senior Data Scientist
Courtney Whalen, Data Scientist
See the Earth as it could be.
Agenda
• Overview
• Data
• Software
• Model
• Results
• Conclusions
Extracurricular
• Machine learning happy hour 7pm
in Lindbergh Room
• Visit us in booth 14 all week!
• "Using Deep Learning to Derive 3D
Cities from Satellite Imagery"
Wednesday at 2pm in Gateway II
2
See the Earth as it could be. 3
Overview
See the Earth as it could be.
Goal
• Identify areas of the planet
where deforestation is occurring
• Monitoring deforestation is
important for understanding
impacts on climate change
4
See the Earth as it could be.
Methodology
• Supervised classification model
using remote sensing data
• Binary target
• Post-classification change
detection
5
See the Earth as it could be. 6
Data
See the Earth as it could be.
Labeled Data
• System for Terrestrial Ecosystem Parameterization (STEP) reference data
• ~2,000 hand-labeled sites scattered across every continent; 10,000 km2 or 4000 mi2
• 17 land cover types, 5 forest types
7
Remote Sensing
8
See the Earth as it could be.
Feature Data
• Source:
• Moderate Resolution Imaging
Spectroradiometer (MODIS)
• Surface reflectance
• Revisit rate: 1-2 days
• Spatial resolution: 500 m
• Features:
• 7 spectral bands & normalized
difference vegetation index (NDVI)
• Monthly mean
• Yearly aggregates: min, max, mean,
variance
• 128 features in all
Band # Spectral band Bandwidth (nm)
1 Red 620 - 670
2 NIR 841 - 876
3 Blue 459 - 479
4 Green 545 - 565
5 SWIR 1230 - 1250
6 SWIR 1628 - 1652
7 SWIR 2105 - 2155
9
See the Earth as it could be. 10
Software
See the Earth as it could be.
• Why Spark?
• Distributed computation allows global scale processing
• SQL and ML Pipeline APIs
• Top level Apache project
• Why RasterFrames?
• Spark SQL interface to geospatial raster data
• Available in Python and Scala
• LocationTech incubator project
• Apache 2.0 licensed
11
Learn more and try it at RasterFrames.io
See the Earth as it could be.
Label and Feature Rasters
• Label data: Land cover polygons -> binary forest attribute -> bit type
raster
• Feature data: Monthly and yearly surface reflectance and NDVI
summaries
12
Feature Engineering with
13
SELECT spatial_key,
rf_localAggMin(red) as red_min,
rf_localAggMax(red) as red_max,
rf_localAggMean(red) as red_mean
FROM df
GROUP BY spatial_key
df.groupBy("spatial_key").agg(
localAggMin($"red") as "red_min",
localAggMax($"red") as "red_max",
localAggMean($"red") as "red_mean")
df.groupBy(df.spatial_key).agg( 
localAggMin(df.red).alias('red_min'), 
localAggMax(df.red).alias('red_max'), 
localAggMean(df.red).alias('red_mean'))
See the Earth as it could be. 14
Model
Model Training
• Trained on MODIS 2012 data at pixel level
• 80% train, 20% test
• Train/test split strategy:
• All pixels from each STEP site kept within the same set
• Balanced forest/not forest pixels within each set
15
Model Selection
16
Predicted
Forest? True False
Actual
True 1642 452
False 70 8496
Final model: Random Forest
• 1000 trees
• Depth of 15
• Minimum of 10 instances
per node
Precision = 0.96
Recall = 0.78
See the Earth as it could be.
Feature Importance
1. Red band - year aggregate minimum
2. Red band - year aggregate mean
3. NDVI - year aggregate mean
4. Green band - year aggregate mean
5. NDVI - year aggregate maximum
6. Blue band - year aggregate minimum
7. Blue band - year aggregate mean
8. Green band - year aggregate minimum
9. Green band - year aggregate maximum
10. NDVI - June aggregate mean
11. Red band - August aggregate mean
12. NDVI - year aggregate minimum
13. Blue band - year aggregate minimum
14. Red band - year aggregate minimum
15. Red band - June aggregate mean
17
See the Earth as it could be.
Forest Threshold
Threshold Accuracy
0 0.196
0.1 0.807
0.2 0.943
0.3 0.957
0.4 0.959
0.5 0.951
0.6 0.938
0.7 0.920
0.8 0.897
0.9 0.866
1 0.804
18
See the Earth as it could be.
Mato Grosso
• Intense deforestation in Mato
Grasso
• Area: 348,788 mi² or 903,357 km2
19
See the Earth as it could be.
Training vs Scoring
• Training:
• 52,956 pixels
• Mato Grosso:
• 3,613,428 pixels
• 61,428,276 pixels over 17 years
20
= 10,000 pixels
= Mato Grosso
= Train
Scoring the Model
• Score 17 years
• Enables Post Classification Change Detection
• Compute time: 6-7 hours per year
• Spark Standalone cluster: 6 workers, 48 cores, 24 GB per executor
• Write GeoTIFF
21
import geotrellis.raster.io.geotiff._
val raster = scoredRf.toRaster($"pForest", 1800, 1440)
GeoTiff(raster).write("scored.tiff")
See the Earth as it could be. 22
Results
23
2001-Jul-30 Landsat 7 2014-Aug-11 Landsat 8
Full size interactive at http://forest.astraea.earth/
24
Full size interactive at
http://forest.astraea.earth/
See the Earth as it could be. 25
See the Earth as it could be.
Forest Loss Comparison
globalforestwatch.org Forest Model
26
0.88 correlation
Source: Tree cover loss: Hansen/UMD/Google/USGS/NASA
See the Earth as it could be. 27
Conclusion
See the Earth as it could be.
Next Steps
Improving model:
• Add more features
• Higher spatial resolution
• Better handling of seasonal
differences across world
Applying model:
• Shorten time window for
change detection
• Score the whole world
28
See the Earth as it could be. 29
Questions?
astraea.earth
rasterframes.io
forest.astraea.earth
See the Earth as it could be. 30
Appendix
NDVI
31
See the Earth as it could be.
Tile table with colors but no label
32
IGBP Land Cover Classification System
33
Class Class name Description
1
Evergreen needleleaf
forests
Lands dominated by needleleaf woody vegetation with a percent cover >60% and height exceeding 2 m. Almost all trees remain green all
year. Canopy is never without green foliage.
2
Evergreen broadleaf
forests
Lands dominated by broadleaf woody vegetation with a percent cover >60% and height exceeding 2 m. Almost all trees and shrubs
remain green year round. Canopy is never without green foliage.
3
Deciduous needleleaf
forests
Lands dominated by woody vegetation with a percent cover >60% and height exceeding 2 m. Consists of seasonal needleleaf tree
communities with an annual cycle of leaf-on and leaf-off periods.
4
Deciduous broadleaf
forests
Lands dominated by woody vegetation with a percent cover >60% and height exceeding 2 m. Consists of broadleaf tree communities with
an annual cycle of leaf-on and leaf-off periods.
5 Mixed forests
Lands dominated by trees with a percent cover >60% and height exceeding 2 m. Consists of tree communities with interspersed mixtures
or mosaics of the other four forest types. None of the forest types exceeds 60% of landscape.
6 Closed shrublands
Lands with woody vegetation less than 2 m tall and with shrub canopy cover >60%. The shrub foliage can be either evergreen or
deciduous.
7 Open shrublands
Lands with woody vegetation less than 2 m tall and with shrub canopy cover between 10% and 60%. The shrub foliage can be either
evergreen or deciduous.
8 Woody savannas
Lands with herbaceous and other understory systems, and with forest canopy cover between 30% and 60%. The forest cover height
exceeds 2 m.
9 Savannas
Lands with herbaceous and other understory systems, and with forest canopy cover between 10% and 30%. The forest cover height
exceeds 2 m.
10 Grasslands Lands with herbaceous types of cover. Tree and shrub cover is less than 10%.
11 Permanent wetlands
Lands with a permanent mixture of water and herbaceous or woody vegetation. The vegetation can be present either in salt, brackish, or
fresh water
12 Croplands
Lands covered with temporary crops followed by harvest and a bare soil period (e.g., single and multiple cropping systems). Note that
perennial woody crops will be classified as the appropriate forest or shrub land cover type
13
Urban and built-up
lands
Land covered by buildings and other man-made structures.
14
Cropland/natural
vegetation mosaics
Lands with a mosaic of croplands, forests, shrubland, and grasslands in which no one component comprises more than 60% of the
landscape.
15 Snow and ice Lands under snow/ice cover throughout the year.
16 Barren Lands with exposed soil, sand, rocks, or snow and never have more than 10% vegetated cover during any time of the year.
17 Water bodies Oceans, seas, lakes, reservoirs, and rivers. Can be either fresh or saltwater bodies.

More Related Content

Similar to Mapping Deforestation Trends in the Brazilian Amazon Using Machine Learning

Basin planning & watershed management
Basin planning & watershed managementBasin planning & watershed management
Basin planning & watershed managementAmol Inamdar
 
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest Disturbance
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest DisturbanceNISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest Disturbance
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest DisturbanceDr. Pankaj Dhussa
 
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...World Resources Institute (WRI)
 
Caldwell community sustainability and land use policy
Caldwell community sustainability and land use policyCaldwell community sustainability and land use policy
Caldwell community sustainability and land use policyGeCo in the Rockies
 
Remote Sensing and GIS techniques.pptx
Remote Sensing and GIS techniques.pptxRemote Sensing and GIS techniques.pptx
Remote Sensing and GIS techniques.pptxDebashisSatapathy2
 
Geofolio North Norfolk
Geofolio North NorfolkGeofolio North Norfolk
Geofolio North NorfolkGeoBlogs
 
Landuse landcover mapping
Landuse landcover mappingLanduse landcover mapping
Landuse landcover mappingAditya Kushwaha
 
2015 05 Scaling from seeds to ecosystems
2015 05 Scaling from seeds to ecosystems2015 05 Scaling from seeds to ecosystems
2015 05 Scaling from seeds to ecosystemsTimeScience
 
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...CIFOR-ICRAF
 
Morris highres asprs_pecora_final
Morris highres asprs_pecora_finalMorris highres asprs_pecora_final
Morris highres asprs_pecora_finalEric Morris
 
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]TimeScience
 
GIS & RS in Forest Mapping
GIS & RS in Forest MappingGIS & RS in Forest Mapping
GIS & RS in Forest MappingKamlesh Kumar
 
AGRICULTURE INFORMATION SYSTEM USING REMOTE SENSING, GEOGRAPHICAL ANALYSIS ...
AGRICULTURE INFORMATION SYSTEM USING  REMOTE SENSING, GEOGRAPHICAL  ANALYSIS ...AGRICULTURE INFORMATION SYSTEM USING  REMOTE SENSING, GEOGRAPHICAL  ANALYSIS ...
AGRICULTURE INFORMATION SYSTEM USING REMOTE SENSING, GEOGRAPHICAL ANALYSIS ...Alok Singh
 
Geographical Information System
Geographical Information SystemGeographical Information System
Geographical Information SystemDevegowda S R
 
Deforestation diagnostics
Deforestation diagnosticsDeforestation diagnostics
Deforestation diagnosticsCIFOR-ICRAF
 
Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...AngelosAlamanos
 
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest Resources
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest ResourcesNISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest Resources
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest ResourcesDr. Pankaj Dhussa
 

Similar to Mapping Deforestation Trends in the Brazilian Amazon Using Machine Learning (20)

study of dams
study of damsstudy of dams
study of dams
 
Basin planning & watershed management
Basin planning & watershed managementBasin planning & watershed management
Basin planning & watershed management
 
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest Disturbance
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest DisturbanceNISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest Disturbance
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Timber and Forest Disturbance
 
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...
GFW Partner Meeting 2017 - Parallel Discussions 2: Global Forest Watch at the...
 
Caldwell community sustainability and land use policy
Caldwell community sustainability and land use policyCaldwell community sustainability and land use policy
Caldwell community sustainability and land use policy
 
Remote Sensing and GIS techniques.pptx
Remote Sensing and GIS techniques.pptxRemote Sensing and GIS techniques.pptx
Remote Sensing and GIS techniques.pptx
 
Geofolio North Norfolk
Geofolio North NorfolkGeofolio North Norfolk
Geofolio North Norfolk
 
Inventory and monitoring of tree resources in agroecosystems
Inventory and monitoring of tree resources in agroecosystemsInventory and monitoring of tree resources in agroecosystems
Inventory and monitoring of tree resources in agroecosystems
 
Landuse landcover mapping
Landuse landcover mappingLanduse landcover mapping
Landuse landcover mapping
 
2015 05 Scaling from seeds to ecosystems
2015 05 Scaling from seeds to ecosystems2015 05 Scaling from seeds to ecosystems
2015 05 Scaling from seeds to ecosystems
 
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...
Carbon dynamics in mangrove ecosystems of Andaman Islands: A living laborator...
 
Morris highres asprs_pecora_final
Morris highres asprs_pecora_finalMorris highres asprs_pecora_final
Morris highres asprs_pecora_final
 
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]
TraitCapture: NextGen phenomics tools for lab and field [ComBio2015]
 
Carbon Benefits Kds V3 07
Carbon Benefits Kds V3 07Carbon Benefits Kds V3 07
Carbon Benefits Kds V3 07
 
GIS & RS in Forest Mapping
GIS & RS in Forest MappingGIS & RS in Forest Mapping
GIS & RS in Forest Mapping
 
AGRICULTURE INFORMATION SYSTEM USING REMOTE SENSING, GEOGRAPHICAL ANALYSIS ...
AGRICULTURE INFORMATION SYSTEM USING  REMOTE SENSING, GEOGRAPHICAL  ANALYSIS ...AGRICULTURE INFORMATION SYSTEM USING  REMOTE SENSING, GEOGRAPHICAL  ANALYSIS ...
AGRICULTURE INFORMATION SYSTEM USING REMOTE SENSING, GEOGRAPHICAL ANALYSIS ...
 
Geographical Information System
Geographical Information SystemGeographical Information System
Geographical Information System
 
Deforestation diagnostics
Deforestation diagnosticsDeforestation diagnostics
Deforestation diagnostics
 
Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...Drought monitoring, Precipitation statistics, and water balance with freely a...
Drought monitoring, Precipitation statistics, and water balance with freely a...
 
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest Resources
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest ResourcesNISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest Resources
NISAR NASA-ISRO Synthetic Aperture Radar (NISAR) Forest Resources
 

More from Astraea, Inc.

Building a Geospatial Analysis Platform - Dr. Kimberly Scott
Building a Geospatial Analysis Platform - Dr. Kimberly ScottBuilding a Geospatial Analysis Platform - Dr. Kimberly Scott
Building a Geospatial Analysis Platform - Dr. Kimberly ScottAstraea, Inc.
 
Detecting Solar Farms Using Deep Learning
Detecting Solar Farms Using Deep LearningDetecting Solar Farms Using Deep Learning
Detecting Solar Farms Using Deep LearningAstraea, Inc.
 
2018 IEEE WIE Presentation - Dr. Kimberly Scott
2018 IEEE WIE Presentation - Dr. Kimberly Scott2018 IEEE WIE Presentation - Dr. Kimberly Scott
2018 IEEE WIE Presentation - Dr. Kimberly ScottAstraea, Inc.
 
2018 Charlottesville Open Data Challenge - Team DSB
2018 Charlottesville Open Data Challenge - Team DSB2018 Charlottesville Open Data Challenge - Team DSB
2018 Charlottesville Open Data Challenge - Team DSBAstraea, Inc.
 
2018 Charlottesville Open Data Challenge - Alex Miller
2018 Charlottesville Open Data Challenge - Alex Miller2018 Charlottesville Open Data Challenge - Alex Miller
2018 Charlottesville Open Data Challenge - Alex MillerAstraea, Inc.
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryAstraea, Inc.
 
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine LearningRasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine LearningAstraea, Inc.
 

More from Astraea, Inc. (7)

Building a Geospatial Analysis Platform - Dr. Kimberly Scott
Building a Geospatial Analysis Platform - Dr. Kimberly ScottBuilding a Geospatial Analysis Platform - Dr. Kimberly Scott
Building a Geospatial Analysis Platform - Dr. Kimberly Scott
 
Detecting Solar Farms Using Deep Learning
Detecting Solar Farms Using Deep LearningDetecting Solar Farms Using Deep Learning
Detecting Solar Farms Using Deep Learning
 
2018 IEEE WIE Presentation - Dr. Kimberly Scott
2018 IEEE WIE Presentation - Dr. Kimberly Scott2018 IEEE WIE Presentation - Dr. Kimberly Scott
2018 IEEE WIE Presentation - Dr. Kimberly Scott
 
2018 Charlottesville Open Data Challenge - Team DSB
2018 Charlottesville Open Data Challenge - Team DSB2018 Charlottesville Open Data Challenge - Team DSB
2018 Charlottesville Open Data Challenge - Team DSB
 
2018 Charlottesville Open Data Challenge - Alex Miller
2018 Charlottesville Open Data Challenge - Alex Miller2018 Charlottesville Open Data Challenge - Alex Miller
2018 Charlottesville Open Data Challenge - Alex Miller
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite Imagery
 
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine LearningRasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
 

Recently uploaded

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 

Recently uploaded (20)

Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 

Mapping Deforestation Trends in the Brazilian Amazon Using Machine Learning

  • 1. See the Earth as it could be. Mapping Deforestation Trends in the Brazilian Amazon Using Machine Learning Jason Brown, Senior Data Scientist Courtney Whalen, Data Scientist
  • 2. See the Earth as it could be. Agenda • Overview • Data • Software • Model • Results • Conclusions Extracurricular • Machine learning happy hour 7pm in Lindbergh Room • Visit us in booth 14 all week! • "Using Deep Learning to Derive 3D Cities from Satellite Imagery" Wednesday at 2pm in Gateway II 2
  • 3. See the Earth as it could be. 3 Overview
  • 4. See the Earth as it could be. Goal • Identify areas of the planet where deforestation is occurring • Monitoring deforestation is important for understanding impacts on climate change 4
  • 5. See the Earth as it could be. Methodology • Supervised classification model using remote sensing data • Binary target • Post-classification change detection 5
  • 6. See the Earth as it could be. 6 Data
  • 7. See the Earth as it could be. Labeled Data • System for Terrestrial Ecosystem Parameterization (STEP) reference data • ~2,000 hand-labeled sites scattered across every continent; 10,000 km2 or 4000 mi2 • 17 land cover types, 5 forest types 7
  • 9. See the Earth as it could be. Feature Data • Source: • Moderate Resolution Imaging Spectroradiometer (MODIS) • Surface reflectance • Revisit rate: 1-2 days • Spatial resolution: 500 m • Features: • 7 spectral bands & normalized difference vegetation index (NDVI) • Monthly mean • Yearly aggregates: min, max, mean, variance • 128 features in all Band # Spectral band Bandwidth (nm) 1 Red 620 - 670 2 NIR 841 - 876 3 Blue 459 - 479 4 Green 545 - 565 5 SWIR 1230 - 1250 6 SWIR 1628 - 1652 7 SWIR 2105 - 2155 9
  • 10. See the Earth as it could be. 10 Software
  • 11. See the Earth as it could be. • Why Spark? • Distributed computation allows global scale processing • SQL and ML Pipeline APIs • Top level Apache project • Why RasterFrames? • Spark SQL interface to geospatial raster data • Available in Python and Scala • LocationTech incubator project • Apache 2.0 licensed 11 Learn more and try it at RasterFrames.io
  • 12. See the Earth as it could be. Label and Feature Rasters • Label data: Land cover polygons -> binary forest attribute -> bit type raster • Feature data: Monthly and yearly surface reflectance and NDVI summaries 12
  • 13. Feature Engineering with 13 SELECT spatial_key, rf_localAggMin(red) as red_min, rf_localAggMax(red) as red_max, rf_localAggMean(red) as red_mean FROM df GROUP BY spatial_key df.groupBy("spatial_key").agg( localAggMin($"red") as "red_min", localAggMax($"red") as "red_max", localAggMean($"red") as "red_mean") df.groupBy(df.spatial_key).agg( localAggMin(df.red).alias('red_min'), localAggMax(df.red).alias('red_max'), localAggMean(df.red).alias('red_mean'))
  • 14. See the Earth as it could be. 14 Model
  • 15. Model Training • Trained on MODIS 2012 data at pixel level • 80% train, 20% test • Train/test split strategy: • All pixels from each STEP site kept within the same set • Balanced forest/not forest pixels within each set 15
  • 16. Model Selection 16 Predicted Forest? True False Actual True 1642 452 False 70 8496 Final model: Random Forest • 1000 trees • Depth of 15 • Minimum of 10 instances per node Precision = 0.96 Recall = 0.78
  • 17. See the Earth as it could be. Feature Importance 1. Red band - year aggregate minimum 2. Red band - year aggregate mean 3. NDVI - year aggregate mean 4. Green band - year aggregate mean 5. NDVI - year aggregate maximum 6. Blue band - year aggregate minimum 7. Blue band - year aggregate mean 8. Green band - year aggregate minimum 9. Green band - year aggregate maximum 10. NDVI - June aggregate mean 11. Red band - August aggregate mean 12. NDVI - year aggregate minimum 13. Blue band - year aggregate minimum 14. Red band - year aggregate minimum 15. Red band - June aggregate mean 17
  • 18. See the Earth as it could be. Forest Threshold Threshold Accuracy 0 0.196 0.1 0.807 0.2 0.943 0.3 0.957 0.4 0.959 0.5 0.951 0.6 0.938 0.7 0.920 0.8 0.897 0.9 0.866 1 0.804 18
  • 19. See the Earth as it could be. Mato Grosso • Intense deforestation in Mato Grasso • Area: 348,788 mi² or 903,357 km2 19
  • 20. See the Earth as it could be. Training vs Scoring • Training: • 52,956 pixels • Mato Grosso: • 3,613,428 pixels • 61,428,276 pixels over 17 years 20 = 10,000 pixels = Mato Grosso = Train
  • 21. Scoring the Model • Score 17 years • Enables Post Classification Change Detection • Compute time: 6-7 hours per year • Spark Standalone cluster: 6 workers, 48 cores, 24 GB per executor • Write GeoTIFF 21 import geotrellis.raster.io.geotiff._ val raster = scoredRf.toRaster($"pForest", 1800, 1440) GeoTiff(raster).write("scored.tiff")
  • 22. See the Earth as it could be. 22 Results
  • 23. 23 2001-Jul-30 Landsat 7 2014-Aug-11 Landsat 8 Full size interactive at http://forest.astraea.earth/
  • 24. 24 Full size interactive at http://forest.astraea.earth/
  • 25. See the Earth as it could be. 25
  • 26. See the Earth as it could be. Forest Loss Comparison globalforestwatch.org Forest Model 26 0.88 correlation Source: Tree cover loss: Hansen/UMD/Google/USGS/NASA
  • 27. See the Earth as it could be. 27 Conclusion
  • 28. See the Earth as it could be. Next Steps Improving model: • Add more features • Higher spatial resolution • Better handling of seasonal differences across world Applying model: • Shorten time window for change detection • Score the whole world 28
  • 29. See the Earth as it could be. 29 Questions? astraea.earth rasterframes.io forest.astraea.earth
  • 30. See the Earth as it could be. 30 Appendix
  • 32. See the Earth as it could be. Tile table with colors but no label 32
  • 33. IGBP Land Cover Classification System 33 Class Class name Description 1 Evergreen needleleaf forests Lands dominated by needleleaf woody vegetation with a percent cover >60% and height exceeding 2 m. Almost all trees remain green all year. Canopy is never without green foliage. 2 Evergreen broadleaf forests Lands dominated by broadleaf woody vegetation with a percent cover >60% and height exceeding 2 m. Almost all trees and shrubs remain green year round. Canopy is never without green foliage. 3 Deciduous needleleaf forests Lands dominated by woody vegetation with a percent cover >60% and height exceeding 2 m. Consists of seasonal needleleaf tree communities with an annual cycle of leaf-on and leaf-off periods. 4 Deciduous broadleaf forests Lands dominated by woody vegetation with a percent cover >60% and height exceeding 2 m. Consists of broadleaf tree communities with an annual cycle of leaf-on and leaf-off periods. 5 Mixed forests Lands dominated by trees with a percent cover >60% and height exceeding 2 m. Consists of tree communities with interspersed mixtures or mosaics of the other four forest types. None of the forest types exceeds 60% of landscape. 6 Closed shrublands Lands with woody vegetation less than 2 m tall and with shrub canopy cover >60%. The shrub foliage can be either evergreen or deciduous. 7 Open shrublands Lands with woody vegetation less than 2 m tall and with shrub canopy cover between 10% and 60%. The shrub foliage can be either evergreen or deciduous. 8 Woody savannas Lands with herbaceous and other understory systems, and with forest canopy cover between 30% and 60%. The forest cover height exceeds 2 m. 9 Savannas Lands with herbaceous and other understory systems, and with forest canopy cover between 10% and 30%. The forest cover height exceeds 2 m. 10 Grasslands Lands with herbaceous types of cover. Tree and shrub cover is less than 10%. 11 Permanent wetlands Lands with a permanent mixture of water and herbaceous or woody vegetation. The vegetation can be present either in salt, brackish, or fresh water 12 Croplands Lands covered with temporary crops followed by harvest and a bare soil period (e.g., single and multiple cropping systems). Note that perennial woody crops will be classified as the appropriate forest or shrub land cover type 13 Urban and built-up lands Land covered by buildings and other man-made structures. 14 Cropland/natural vegetation mosaics Lands with a mosaic of croplands, forests, shrubland, and grasslands in which no one component comprises more than 60% of the landscape. 15 Snow and ice Lands under snow/ice cover throughout the year. 16 Barren Lands with exposed soil, sand, rocks, or snow and never have more than 10% vegetated cover during any time of the year. 17 Water bodies Oceans, seas, lakes, reservoirs, and rivers. Can be either fresh or saltwater bodies.

Editor's Notes

  1. https://jungleheroes.wordpress.com/2012/09/05/palm-oil-free-ritz-crackers/forest-slash-and-burn/
  2. https://blog.conservation.org/2016/08/stopping-deforestation-in-the-amazon-by-2020-its-possible/
  3. https://fsapps.nwcg.gov/imagery.php?op=modis_bands
  4. Check out rasterframes.io to learn more
  5. Big ideas: 1 – this is a data frame 2 – this is a map  https://gisgeography.com/ndvi-normalized-difference-vegetation-index/ https://sentera.com/understanding-ndvi-plant-health/
  6. List all the operations we need to do: Apply QA mask Apply polygon bit mask Scale integer to float Compute NDVI Compute yearly stats (shown) Compute monthly stats
  7. 61,428,276 pixels 
  8. https://www.ucsusa.org/global-warming/solutions/stop-deforestation/brazils-reduction-deforestation.html#.WsTz3NPwbBI
  9. https://www.globalforestwatch.org/country/BRA/12?category=forest-change  https://www.globalforestwatch.org/map/3/15.00/27.00/ALL/grayscale/loss?tab=analysis-tab&begin=2001-01-01&end=2017-01-01&threshold=30&dont_analyze=true  https://earthenginepartners.appspot.com/science-2013-global-forest Global Forest Watch data comes from UMD Department of Geographical Sciences who did a "time-series analysis of Landsat images characterizing forest extent and change". From UMD: "‘Forest Cover Loss’ is defined as a change from a forest to non-forest state, during the period 2000–2016."  In the chart on the right, I am defining forest loss as a greater than 0.5 decrease between years in the probability of a pixel being a forest.