Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

•Download as PPTX, PDF•

1 like•124 views

The Statistical and Applied Mathematical Sciences Institute

Imaging spectrometers housed on satellites are used to obtain data on vegetated surfaces by measuring reflectance from the Earth’s surface. These data are very useful as they provide information on changes in vegetation over time on global scales, which is important to assess the impacts of changes in weather and climate, and the effects of agricultural practices. However, the information provided by these data can be limited due to the resolution of the sensors and inhibiting factors such as cloud cover. In this project we will use two remote sensing sources of the Enhanced Vegetation Index (EVI) to analyze vegetation over Nebraska. The first, Landsat EVI, is available at fine spatial resolution, but is sparse in time. The second, MODIS EVI, is obtained regularly in time, but is available at a much coarser spatial resolution. We will use these data to explore the relationships between vegetation and changes in temperature and landcover (e.g. corn fields versus grasslands), as well as to classify the landcover in unknown regions. Group members: Samuel Hood, Zhihan Lu, Rita Pradhudesai, Thomas Rechtman, Meghana Tatneni, Ganlin Ye

Education

Classifying Vegetation in
Nebraska using Landsat
data
Mentor: Maggie Johnson
Group member: Samuel Hood, Riya Prabbhudesai, Thomas Rechtman,
Zhihan Lu, Meghana Tatineni, Ganlin Ye

Introduction
Land cover: the surface of the ground (i.e, types of vegetation and water)
Knowledge of Land cover is important because:
● Vegetation affects climate and climate affects vegetation
● Landcover can be an important input into various models (ex.climate change,
air pollution)
● Being able to identify changes in land cover helps us understand changes in
agriculture practices and implications of deforestation and how bodies of
water change over time.

Data Available
Enhanced Vegetation Index: A measurement of how much chlorophyll is present
*Formula of EVI* -> Uses Reflectance Values
Other Data available:
● Dates Corresponding to when the EVI value was calculated
● Land cover type of each location
● Temperature over the region for each day
● X,Y coordinates of each location
● Longitude, Latitude coordinates of each location

Goal
To determine whether remote sensing data can be used to classify the land cover of a region in Nebraska
at high spatial resolution
We’ll accomplish this goal by training our models using the USDA NASS 2008 cropland data layer

Feature Selection Plots
Location (Longitude*Latitude)

Introduction to Features
Feature Ideas
Latitude Blue average reflectance Red average reflectance
Longitude Green average reflectance Temperature at max EVI
Duration of the season Nir Average reflectance Temperature at min EVI
Maximum EVI Max-Min blue reflectance Rate of spring green-up
Time of maximum EVI Max-Min green reflectance Starting date of the season
Max-Min EVI Max-Min NIR reflectance Starting EVI of the season

Logistic Regression
Background
- Generalized linear model with bernoulli random component and logit link function
- Binary outcome
- Formula
- Advantages: Simple
- Disadvantages: Binary Outcome
Model Outcome
- Two outcomes: open water and vegetation
- One covariate used: Average Green Reflectance values
- Error rate of .82%

Multinomial Regression
- Extension of logistic regression
- Outcomes can be more than two categories
- Forward and backward selection used to select features.
- Advantages: Simple
- Disadvantages: Linear Prediction, Overfitting
- 13 out of the 19 features are used in the model

Random Forest (will need more slides)
Machine Learning Algorithm that uses decision trees
Since one decision tree would over fit our data, an average of many random trees
is taken
The processes to random forest is similar to tree bagging (insert equation)
Random Forest differs in the fact that it choses a new random subtree at every
vertex
In our models we looked at classifying through a random forest as well as
concatenating random forest through in steps by grouping our different
classifications.

Cross Validation
- Separate the data into n
non-overlapping sets of
equal size.
- Train on n-1 of these sets
and test on the other 1
set.
- Average all the accuracy
values for final
assessment of the model
- Benefit:
Reduce bias in training
and testing

Method
Error Rates on
randomized test data
Mean error rate in 10-fold
cross validation
Multinomial Logistic
Regression
32.48% 33.38%
Random Forests 16.94% 15.62%
Multi-Layer Random
Forest
16.71% 15.81%
Model Comparison

Confusion Matrices
Multi-Layer Random
Forest
Traditional Random
Forest

Conclusions
Random forest is the best model for classifying landcover.

Future Work
There is missing data in the landsat data and it is collected every 16 days.
Missing data can be filled in with motis data but the motis data has low spatial
resolution.
Possible abrupt changes in EVI and temperature are not recorded in landsat data.

Similar to Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

Workshop usgs brasil_2015_01Terra-i

Agro-Farm Care – Crop, Fertilizer & Disease Prediction (Web App)IRJET Journal

Models for Stability Analysis (AMMI andBiplots).pptxprasannamodali

Assessing ecosystem services over large areasAlessandro Gimona

A Review on the Application of Natural Computing in Environmental InformaticsAndreas Kamilaris

NDGeospatialSummit2019 - Classification and Calculation of Vegetation Indices...North Dakota GIS Hub

Kim_WE3_T05_2.pptxgrssieee

Mintewab Biodiversity And Productivity1a95osksj

Agroclimatic modeling : CERES Wheat Yassine ADRAB

Climate and crop modeling by Gummadi Sridhar,Gizachew Legesse,Pauline Chiveng...ICRISAT

Forest Change Detection in incomplete satellite images with deep neural networksAatif Sohail

What would farmers doSoil and Water Conservation Society

Crow.IGARSS.talk.pptxgrssieee

Benchmarking grounds in australiaGilba Solutions Pty Ltd

Monitoring Global Biome Dynamics from Space Cassidy Rankine

Advanced biometrical and quantitative genetics akshayAkshay Deshmukh

RemoteSensingProjectPaperJames Sherwood

EcoTas13 BradEvans e-Mast UNSWTERN Australia

EcoTas13 BradEvans e-MASTTERN Australia

Caldwell community sustainability and land use policyGeCo in the Rockies

Similar to Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018 (20)

Workshop usgs brasil_2015_01

Agro-Farm Care – Crop, Fertilizer & Disease Prediction (Web App)

Models for Stability Analysis (AMMI andBiplots).pptx

Assessing ecosystem services over large areas

A Review on the Application of Natural Computing in Environmental Informatics

NDGeospatialSummit2019 - Classification and Calculation of Vegetation Indices...

Kim_WE3_T05_2.pptx

Mintewab Biodiversity And Productivity1

Agroclimatic modeling : CERES Wheat

Climate and crop modeling by Gummadi Sridhar,Gizachew Legesse,Pauline Chiveng...

Forest Change Detection in incomplete satellite images with deep neural networks

What would farmers do

Crow.IGARSS.talk.pptx

Benchmarking grounds in australia

Monitoring Global Biome Dynamics from Space

Advanced biometrical and quantitative genetics akshay

RemoteSensingProjectPaper

EcoTas13 BradEvans e-Mast UNSW

EcoTas13 BradEvans e-MAST

Caldwell community sustainability and land use policy

More from The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...The Statistical and Applied Mathematical Sciences Institute

2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - A Bracketing Relationship between Differe...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Difference-in-differences: more than meet...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...The Statistical and Applied Mathematical Sciences Institute

Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...The Statistical and Applied Mathematical Sciences Institute

2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...The Statistical and Applied Mathematical Sciences Institute

2019 Fall Series: Professional Development, Writing Academic Papers…What Work...The Statistical and Applied Mathematical Sciences Institute

2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...The Statistical and Applied Mathematical Sciences Institute

2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...The Statistical and Applied Mathematical Sciences Institute

More from The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...

2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...

Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...

Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...

Causal Inference Opening Workshop - A Bracketing Relationship between Differe...

Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...

Causal Inference Opening Workshop - Difference-in-differences: more than meet...

Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...

Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...

Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...

Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...

Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...

Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...

Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...

Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...

Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...

2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...

2019 Fall Series: Professional Development, Writing Academic Papers…What Work...

2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...

2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...

Recently uploaded

Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop

Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva

FSB Advising Checklist - Orientation 2024Elizabeth Walsh

Interdisciplinary_Insights_Data_Collection_Methods.pptxPooja Bhuva

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1

Jamworks pilot and AI at Jisc (20/03/2024)Jisc

Holdier Curriculum Vitae (April 2024).pdfagholdier

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade

This PowerPoint helps students to consider the concept of infinity.christianmathematics

Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University of Engineering & Technology, Jamshoro

Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand

REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda

Single or Multiple melodic lines structuredhanjurrannsibayan2

How to Give a Domain for a Field in Odoo 17Celine George

How to Manage Global Discount in Odoo 17 POSCeline George

Sociology 101 Demonstration of Learning Exhibitjbellavia9

Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid

Recently uploaded (20)

Python Notes for mca i year students osmania university.docx

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...

FSB Advising Checklist - Orientation 2024

Interdisciplinary_Insights_Data_Collection_Methods.pptx

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf

HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx

Jamworks pilot and AI at Jisc (20/03/2024)

Holdier Curriculum Vitae (April 2024).pdf

Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx

This PowerPoint helps students to consider the concept of infinity.

Mehran University Newsletter Vol-X, Issue-I, 2024

Google Gemini An AI Revolution in Education.pptx

REMIFENTANIL: An Ultra short acting opioid.pptx

Single or Multiple melodic lines structure

How to Give a Domain for a Field in Odoo 17

How to Manage Global Discount in Odoo 17 POS

Sociology 101 Demonstration of Learning Exhibit

Basic Civil Engineering first year Notes- Chapter 4 Building.pptx

Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

1. Classifying Vegetation in Nebraska using Landsat data Mentor: Maggie Johnson Group member: Samuel Hood, Riya Prabbhudesai, Thomas Rechtman, Zhihan Lu, Meghana Tatineni, Ganlin Ye

2. Introduction Land cover: the surface of the ground (i.e, types of vegetation and water) Knowledge of Land cover is important because: ● Vegetation affects climate and climate affects vegetation ● Landcover can be an important input into various models (ex.climate change, air pollution) ● Being able to identify changes in land cover helps us understand changes in agriculture practices and implications of deforestation and how bodies of water change over time.

3. Data Available Enhanced Vegetation Index: A measurement of how much chlorophyll is present *Formula of EVI* -> Uses Reflectance Values Other Data available: ● Dates Corresponding to when the EVI value was calculated ● Land cover type of each location ● Temperature over the region for each day ● X,Y coordinates of each location ● Longitude, Latitude coordinates of each location

4. Goal To determine whether remote sensing data can be used to classify the land cover of a region in Nebraska at high spatial resolution We’ll accomplish this goal by training our models using the USDA NASS 2008 cropland data layer

6. From time series to features

7. Feature Selection Plots

8. Feature Selection Plots Location (Longitude*Latitude)

9. Feature Selection Plots Minimum EVI

10. Introduction to Features Feature Ideas Latitude Blue average reflectance Red average reflectance Longitude Green average reflectance Temperature at max EVI Duration of the season Nir Average reflectance Temperature at min EVI Maximum EVI Max-Min blue reflectance Rate of spring green-up Time of maximum EVI Max-Min green reflectance Starting date of the season Max-Min EVI Max-Min NIR reflectance Starting EVI of the season

11. Logistic Regression Background - Generalized linear model with bernoulli random component and logit link function - Binary outcome - Formula - Advantages: Simple - Disadvantages: Binary Outcome Model Outcome - Two outcomes: open water and vegetation - One covariate used: Average Green Reflectance values - Error rate of .82%

12.

13. Multinomial Regression - Extension of logistic regression - Outcomes can be more than two categories - Forward and backward selection used to select features. - Advantages: Simple - Disadvantages: Linear Prediction, Overfitting - 13 out of the 19 features are used in the model

14. Random Forest (will need more slides) Machine Learning Algorithm that uses decision trees Since one decision tree would over fit our data, an average of many random trees is taken The processes to random forest is similar to tree bagging (insert equation) Random Forest differs in the fact that it choses a new random subtree at every vertex In our models we looked at classifying through a random forest as well as concatenating random forest through in steps by grouping our different classifications.

15. Cross Validation - Separate the data into n non-overlapping sets of equal size. - Train on n-1 of these sets and test on the other 1 set. - Average all the accuracy values for final assessment of the model - Benefit: Reduce bias in training and testing

16. Method Error Rates on randomized test data Mean error rate in 10-fold cross validation Multinomial Logistic Regression 32.48% 33.38% Random Forests 16.94% 15.62% Multi-Layer Random Forest 16.71% 15.81% Model Comparison

17. Confusion Matrices Multi-Layer Random Forest Traditional Random Forest

18.

19.

20. Conclusions Random forest is the best model for classifying landcover.

21. Future Work There is missing data in the landsat data and it is collected every 16 days. Missing data can be filled in with motis data but the motis data has low spatial resolution. Possible abrupt changes in EVI and temperature are not recorded in landsat data.

Editor's Notes

Ultimately have a global landcover so categrfdd this by using remote sensing data Landsat and time series example corn and something else How remote sensing works
1 km of data- google map image of region Add landsat data
qplot(ylim=c(0,1))
Building the model
(the percentage of each model, to conclude xxxx might be the best.) We did 99% in Binomial logistic regression on water aspect.

Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

Recommended

Recommended

More Related Content

Similar to Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

Similar to Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018 (20)

More from The Statistical and Applied Mathematical Sciences Institute

More from The Statistical and Applied Mathematical Sciences Institute (20)

Recently uploaded

Recently uploaded (20)

Undergraduate Modeling Workshop - Vegetation Working Group Final Presentation, May 25, 2018

Editor's Notes