Decision tree in GIS using R environment
Omar F. Althuwaynee, Ph.D.
Evaluate and compare the results of applying different
decision tree algorithms to classify landslide occurrence
and understand the distribution of its predictors, using
GIS and the R environment.
Course objectives
Omar F. Althuwaynee, PhD in Geomatics engineering
Before starting, go through the following videos on my
channel about data preparation in ArcGIS:
1. How to process Logistic regression in GIS: Prepare binary training data by ArcGIS?
2. How to easily produce testing binary data set for prediction mapping validation?
Course preparations
1. Create dichotomous (0, 1) training and testing data.
2. Effectively set up your project environment and install
the packages related to the current application.
3. Read spatial data in the R environment.
5. Run statistical analysis using various decision tree
algorithms.
6. Run statistical tests and produce a decision tree.
By the end of this course, you will be able to
Decision trees are data mining approaches based on
successively dividing a problem into several sub-problems
with fewer dimensions, until a solution can be found for
each of the simpler problems.
Decision Trees philosophy
1. Mostly supervised learning methods.
2. Predictive, with high accuracy and stability.
3. Can map non-linear relationships.
4. Used for solving classification or regression problems.
5. Easy to understand: no analytical or statistical background
needed (intuitive graphical representation).
6. Useful in data exploration (finding significant variables and
their relations).
7. Less data cleaning required (fairly robust to outliers
and missing values).
8. Data type is not a constraint (handles both numerical and
categorical variables).
Why use tree-based learning algorithms?
1. Categorical Variable Decision Tree (categorical target variable).
Example: target variable "Student will play cricket or not", i.e. YES or NO.
Natural hazards susceptibility: Yes=1, No=0.
2. Continuous Variable Decision Tree (continuous target variable).
Example: continuous target variable such as students' age (classes Age <=10 and
Age >20), or earthquake intensity (<=x1 and >x2).
Types of decision tree
Regression trees
• Dependent variable is continuous.
• The value of a terminal node is the mean response of the observations
falling in it (the prediction is made with the mean value).
Classification trees
• Dependent variable is categorical.
• The value of a terminal node is the mode response of the observations
falling in it (the prediction is made with the mode value).
Similarity
• Both divide the predictor space (independent variables) into distinct and
non-overlapping boxes.
• Both split the predictor space into two new branches at each step, choosing
the best variable available; the choice considers only the current split, not
future splits (a greedy approach).
• Splitting continues until a user-defined stopping criterion is reached.
• A fully grown tree is likely to overfit the data, leading to poor accuracy on
unseen data. Therefore, we need to do 'pruning'.
Regression vs. Classification
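The difference between the two kinds of terminal node can be sketched in a few lines of base R. The values below are hypothetical and stand for the observations that ended up in a single leaf; they are not taken from the course data.

```r
# Hypothetical observations that ended up in one terminal node (leaf).
leaf_intensity <- c(4.1, 5.0, 4.6, 5.3)   # continuous target (regression tree)
leaf_landslide <- c(1, 1, 0, 1, 1)        # categorical target (classification tree)

# Regression tree: the leaf predicts the mean response.
regression_prediction <- mean(leaf_intensity)

# Classification tree: the leaf predicts the most frequent class (the mode).
class_counts <- table(leaf_landslide)
classification_prediction <- names(class_counts)[which.max(class_counts)]

print(regression_prediction)       # 4.75
print(classification_prediction)   # "1"
```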
The algorithm stops when any one of the following conditions
is true:
• All the samples belong to the same class.
• There are no remaining attributes on which the
samples may be further partitioned.
• There are no samples for the branch test attribute.
Stopping Criteria
Reference:
https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/
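The three stopping conditions can be written as a small check, sketched here in base R. The function name should_stop and its two arguments are hypothetical, chosen only for illustration; real packages implement their own stopping rules.

```r
# labels:     class labels of the samples in the current node
# attributes: names of the predictors still available for splitting
should_stop <- function(labels, attributes) {
  if (length(labels) == 0) return(TRUE)          # no samples reached this branch
  if (length(unique(labels)) == 1) return(TRUE)  # all samples belong to one class
  if (length(attributes) == 0) return(TRUE)      # no attributes left to split on
  FALSE
}

should_stop(c(1, 1, 1), c("Slope", "NDVI"))  # TRUE: node is pure
should_stop(c(1, 0, 1), character(0))        # TRUE: nothing left to split on
should_stop(c(1, 0, 1), c("Slope"))          # FALSE: keep splitting
```

In practice, tree packages also expose user-defined criteria; in rpart, for instance, rpart.control() takes minsplit, maxdepth and cp.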
Pruning removes branches that reflect anomalies in the training data due to
noise or outliers, and tackles overfitting.
• Pruned trees are smaller and less complex (pruning looks a few
steps ahead before making a choice).
• Two approaches: pre-pruning (stop growing the tree early) and
post-pruning (remove a sub-tree from a fully grown tree).
Pruning
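A minimal sketch of post-pruning, assuming the rpart package (the slides do not say which package the course uses for this step). The data are synthetic and hypothetical: a landslide label that depends on slope plus noise. The tree is first grown with no complexity penalty, then cut back at the complexity parameter with the lowest cross-validated error.

```r
library(rpart)

set.seed(42)
n <- 200
# Synthetic, hypothetical data: landslide depends on slope plus noise.
d <- data.frame(
  Slope = runif(n, 0, 40),
  NDVI  = runif(n, 0, 1)
)
d$Landslide <- factor(ifelse(d$Slope + rnorm(n, sd = 8) > 20, 1, 0))

# Grow a deliberately large tree (cp = 0 disables the complexity penalty).
full_tree <- rpart(Landslide ~ Slope + NDVI, data = d, method = "class",
                   control = rpart.control(cp = 0, minsplit = 2))

# Post-pruning: pick the cp with the lowest cross-validated error and cut back.
cp_table    <- full_tree$cptable
best_cp     <- cp_table[which.min(cp_table[, "xerror"]), "CP"]
pruned_tree <- prune(full_tree, cp = best_cp)

# Count terminal nodes before and after pruning.
n_leaves <- function(tree) sum(tree$frame$var == "<leaf>")
cat("Leaves before pruning:", n_leaves(full_tree), "\n")
cat("Leaves after pruning: ", n_leaves(pruned_tree), "\n")
```

Pre-pruning would instead limit growth up front, for example through the minsplit or maxdepth arguments of rpart.control().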
What are the various decision tree algorithms, and how
do they differ from each other?
Common algorithms: C4.5, ID3, CART, CHAID, Random forest.
They differ in:
1. Support for classification and regression.
2. Handling of numeric (continuous) or categorical targets and factors.
3. Whether tree pruning is used.
4. Amount of memory used.
5. Amount of information and outcomes provided.
6. Statistical background.
7. Stand-alone or ensemble-learning-based classifiers.
To predict whether a landslide will happen in a certain
area (yes/no).
• Slope angle is a significant variable, but we don't have enough
details about all the related conditions of previous events.
• Since we know there are additional important variables,
we can build a decision tree to predict landslides (or any
other target) based on:
elevation, aspect, soil type, vegetation density and various other
variables.
Case study
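[Figure: decision tree for landslides. Input variables: slope angle, elevation, NDVI, soil type, aspect.
Root node: Yes=100, No=100, split on slope angle into Slope ≤5° (Yes=20, No=50) and Slope >5° (Yes=80, No=50).
Lower-level split labels: NDVI=0.5, S.Type=Silt, Aspect=NE, Elev.≥300m.
Lower-level node counts: Yes=60/No=10, Yes=20/No=40, Yes=20/No=10, Yes=40/No=0.]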
Typical Decision Tree
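To see why the slope split in the tree above is useful, the node counts can be turned into an impurity measure. The slides do not say which split criterion is used; the sketch below assumes Gini impurity (the criterion used by CART) and applies it to the root node (Yes=100, No=100) and its two children.

```r
# Gini impurity of a node, computed from its class counts.
gini <- function(counts) {
  p <- counts / sum(counts)
  1 - sum(p^2)
}

root  <- c(yes = 100, no = 100)
left  <- c(yes = 20,  no = 50)   # Slope <= 5 degrees
right <- c(yes = 80,  no = 50)   # Slope  > 5 degrees

# Impurity of the children, weighted by the share of samples in each child.
n <- sum(root)
weighted_children <- sum(left) / n * gini(left) + sum(right) / n * gini(right)

# Impurity reduction achieved by the split.
gain <- gini(root) - weighted_children

print(round(c(root = gini(root), left = gini(left),
              right = gini(right), gain = gain), 3))
```

The root is maximally mixed (Gini = 0.5); both children are purer, so the split yields a positive impurity reduction.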
• To predict the probability that a landslide will
occur in a particular place.
Data:
• Dependent factor: landslide locations, split into training (75 observations)
and testing (25 observations) data.
• Independent factors: elevation, slope, NDVI, curvature.
Note:
• The analysis depends on the number of observations:
more training observations will increase the model efficiency.
Current Application
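The workflow of this application can be sketched as follows, assuming the rpart package and a synthetic stand-in for the course data. The column names follow the slide; the values, the label rule and the object names (pts, training, testing) are hypothetical.

```r
library(rpart)

set.seed(1)
# Synthetic stand-in for the course data (hypothetical values):
# 100 points with the four independent factors named on the slide.
n <- 100
pts <- data.frame(
  Elevation = runif(n, 50, 400),
  Slope     = runif(n, 0, 40),
  NDVI      = runif(n, 0, 1),
  Curvature = runif(n, -0.5, 0.5)
)
pts$Landslide <- factor(ifelse(pts$Slope + rnorm(n, sd = 5) > 15, 1, 0))

# 75 training and 25 testing observations, as on the slide.
train_idx <- sample(n, 75)
training  <- pts[train_idx, ]
testing   <- pts[-train_idx, ]

# Classification tree on the training data.
fit <- rpart(Landslide ~ Elevation + Slope + NDVI + Curvature,
             data = training, method = "class")

# Predicted probability of a landslide (class "1") at each testing point.
prob <- predict(fit, newdata = testing, type = "prob")[, "1"]

# Simple validation: share of testing points classified correctly at 0.5.
pred     <- ifelse(prob >= 0.5, 1, 0)
accuracy <- mean(pred == as.numeric(as.character(testing$Landslide)))
cat("Testing accuracy:", accuracy, "\n")
```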
1. Prepare the GIS data.
2. Resample all layers to the same extent and resolution.
3. Check data quality.
4. Convert to a statistical data format such as .txt or .csv.
5. Check the data in the R environment, e.g. with summary, str, head and
plot.
Data input
Testing_points  Elevation  Curvature  Slope     NDVI
1               275        -0.0625    13.55703  0.516273
0               363         0.0625    16.73342  0.469728
0               267         0.1875    13.2819   0.435414
0               92          0.125     10.01578  0.396327
…               …           …         …         …
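A first check of such a table in R might look like the sketch below. The four rows are typed in by hand from the sample shown above; with the real export, a call such as read.csv() on the course file would replace the data.frame() call (the file name is not given in the slides, so none is assumed here).

```r
# The four sample rows shown above, entered by hand for illustration.
testing <- data.frame(
  Testing_points = c(1, 0, 0, 0),
  Elevation      = c(275, 363, 267, 92),
  Curvature      = c(-0.0625, 0.0625, 0.1875, 0.125),
  Slope          = c(13.55703, 16.73342, 13.2819, 10.01578),
  NDVI           = c(0.516273, 0.469728, 0.435414, 0.396327)
)

# Treat the 0/1 target as a category rather than a number.
testing$Testing_points <- factor(testing$Testing_points)

str(testing)      # column types and dimensions
summary(testing)  # ranges, quartiles, class counts
head(testing)     # first rows
```

Calling plot(testing) on the same data frame would additionally draw a scatterplot matrix of all columns.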
8,399 respondents to the survey: 57% are best customers (1) and 43% are other (0).
Left side (Total lifetime)
• Females (F) are more likely to be best customers than males (M).
• 1st row: the difference between males and females is statistically significant: (59 - 54)% = 5%, i.e.
females are 5 percentage points more likely to be best customers.
• Is 5% significant from a business point of view? (Ask a business analyst.)
Right side (Net sales)
• This suggests that net sales is a stronger or more relevant predictor of customer status for men than
total lifetime visits (which was used to analyse females).
To conclude: female and male behaviour have different nuances.
Further illustration using Categorical Target (1–0)
1. Download the current course data.
2. Open RStudio and connect to the internet.
3. And let us begin!
Happy learning!
• https://www.tutorialspoint.com/data_mining/dm_dti.html
• http://scikit-learn.org/stable/modules/tree.html
• https://goo.Gl/uk6i3x
• https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/
• https://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Classification/Decision_Trees
• https://cran.r-project.org/web/views/MachineLearning.html
• http://machinelearningmastery.com/non-linear-classification-in-r-with-decision-trees/
• http://dni-institute.in/blogs/random-forest-using-r-step-by-step-tutorial/
References

How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
Life upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for studentLife upper-Intermediate B2 Workbook for student
Life upper-Intermediate B2 Workbook for student
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024World environment day ppt For 5 June 2024
World environment day ppt For 5 June 2024
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
The Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collectionThe Diamonds of 2023-2024 in the IGRA collection
The Diamonds of 2023-2024 in the IGRA collection
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5Community pharmacy- Social and preventive pharmacy UNIT 5
Community pharmacy- Social and preventive pharmacy UNIT 5
 
Cognitive Development Adolescence Psychology
Cognitive Development Adolescence PsychologyCognitive Development Adolescence Psychology
Cognitive Development Adolescence Psychology
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 

Using Decision trees with GIS data for modeling and prediction

  • 1. Decision tree in GIS using R environment, by Omar F. Althuwaynee, Ph.D.
  • 2. Course objectives
      Evaluate and compare the results of applying different decision tree algorithms to classify landslide occurrences and understand the distribution of their predictors, using GIS and the R environment.
      Omar F. Althuwaynee, PhD in Geomatics engineering
  • 3. Course preparations
      Before starting, go through the following videos on my channel about data preparation in ArcGIS:
        1. How to process Logistic regression in GIS: Prepare binary training data by ArcGIS?
        2. How to easily produce testing binary data set for prediction mapping validation?
  • 4. By the end of this course, you will be able to:
        1. Create dichotomous (0, 1) training and testing data.
        2. Set up your project environment effectively and install the packages needed for the current application.
        3. Read spatial data in the R environment.
        4. Run statistical analysis using various decision tree algorithms.
        5. Run statistical tests and produce a decision tree.
  • 5. Decision trees philosophy
      A data mining approach based on successively dividing a problem into several sub-problems with fewer dimensions, until a solution can be found for each of the simpler problems.
  • 6. Why use tree-based learning algorithms?
        1. They mostly use supervised learning methods.
        2. They are predictive, with high accuracy and stability.
        3. They can map non-linear relationships.
        4. They can be used for classification or regression problems.
        5. Easy to understand: no analytical or statistical background is needed (intuitive graphical representation).
        6. Useful in data exploration: they help find significant variables and the relations between them.
        7. Less data cleaning required: they are fairly robust to outliers and missing values.
        8. Data type is not a constraint: they handle both numerical and categorical variables.
  • 7. Types of decision tree
        1. Categorical variable decision tree (categorical target variable). Example: the target variable "Student will play cricket or not", i.e. YES or NO; natural hazard susceptibility, Yes = 1, No = 0.
        2. Continuous variable decision tree (continuous target variable). Example: a continuous target such as student age, split at thresholds (age ≤ 10, age > 20); earthquake intensity (≤ x1, > x2).
  • 8. Regression vs. classification
      Regression trees:
        • The dependent variable is continuous.
        • The value of a terminal node is the mean response of the observations in it, so the prediction is the mean value.
      Classification trees:
        • The dependent variable is categorical.
        • The value of a terminal node is the mode response of the observations in it, so the prediction is the mode value.
      Similarities:
        • Both divide the predictor space (the independent variables) into distinct, non-overlapping boxes.
        • Both split the predictor space into two new branches further down the tree, choosing the best variable available; each step considers only the current split, not future splits.
        • Splitting continues until a user-defined stopping criterion is reached. A fully grown tree is likely to overfit the data, leading to poor accuracy on unseen data, so pruning is needed.
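The difference between the two terminal-node values can be sketched in a few lines. This is a minimal illustration in plain Python rather than the R workflow used in the course; the function name `leaf_prediction` is hypothetical, and the values are the four sample rows from the Data input slide, with elevation treated as a continuous target purely for illustration.

```python
from statistics import mean, mode


def leaf_prediction(responses, task):
    """Return the value a terminal node predicts for the observations it holds."""
    if task == "regression":
        # Regression tree: predict the mean response in the node.
        return mean(responses)
    if task == "classification":
        # Classification tree: predict the most frequent class in the node.
        return mode(responses)
    raise ValueError("task must be 'regression' or 'classification'")


# Sample rows from the Data input slide (Elevation and Testing_points columns).
elevations = [275, 363, 267, 92]   # continuous response (illustrative use only)
landslide_labels = [1, 0, 0, 0]    # categorical response (1 = landslide)

regression_leaf = leaf_prediction(elevations, "regression")
classification_leaf = leaf_prediction(landslide_labels, "classification")
print(regression_leaf, classification_leaf)
```

In a real tree each terminal node would hold only the observations that reach it after all the splits above it; here a single node holds all four rows.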
  • 9. Stopping criteria
      The algorithm stops when any one of the following conditions is true:
        • All the samples belong to the same class.
        • There are no remaining attributes on which the samples may be further partitioned.
        • There are no samples for the branch test attribute.
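The three stopping conditions translate directly into a small check that a tree builder could call before each split. This is a sketch in plain Python, not code from the course; `should_stop` and its arguments are hypothetical names chosen for illustration.

```python
def should_stop(labels, remaining_attributes):
    """Check the three stopping conditions for one branch of the tree.

    labels: class labels of the samples that reach this branch.
    remaining_attributes: attributes not yet used for splitting.
    Returns (stop, reason).
    """
    if not labels:
        return True, "no samples for this branch"
    if len(set(labels)) == 1:
        return True, "all samples belong to the same class"
    if not remaining_attributes:
        return True, "no remaining attributes to partition on"
    return False, "keep splitting"


# A mixed branch with attributes left is split further.
print(should_stop([1, 0, 0, 1], ["Slope", "NDVI"]))
# A pure branch becomes a terminal node.
print(should_stop([1, 1, 1], ["Slope", "NDVI"]))
```

Practical implementations usually add further user-defined limits, such as a maximum depth or a minimum node size, on top of these three conditions.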
  • 10. Pruning
      Pruning removes anomalies in the training data caused by noise or outliers, and tackles overfitting.
        • Pruned trees are smaller and less complex (pruning looks a few steps ahead before making a choice).
        • There are two kinds: pre-pruning and post-pruning (which removes a sub-tree from a fully grown tree).
      Reference: https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/
  • 11. What are the various decision tree algorithms, and how do they differ from each other?
      Common algorithms include C4.5, ID3, CART, CHAID and random forest. They differ in:
        1. Support for classification and regression.
        2. Support for numeric (continuous) or categorical target and factor data.
        3. Whether tree pruning is used.
        4. Amount of memory used.
        5. Amount of information and outcomes provided.
        6. Statistical background.
        7. Whether they are stand-alone or ensemble-learning-based classifiers.
  • 12. Case study
      The goal is to predict whether a landslide will happen in a certain area (yes/no).
        • Slope angle is a significant variable, but we do not have enough details about all the related conditions of previous events.
        • Since we know there are additional important variables, we can build a decision tree to predict landslides (or any other target) based on elevation, aspect, soil type, vegetation density and various other variables.
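  • 13. Typical decision tree
      [Figure: decision tree for landslides. The root node (Yes=100, No=100) splits on slope angle into Slope ≤5° and Slope >5°, with node counts Yes=20/No=50 and Yes=80/No=50. Deeper splits use elevation (Elev. ≥300 m), NDVI (NDVI=0.5), soil type (S.Type=Silt) and aspect (Aspect=NE), with node counts Yes=60/No=10, Yes=20/No=40, Yes=20/No=10 and Yes=40/No=0.]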
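The node counts on this slide are enough to work out how much the slope-angle split improves class purity. The slides do not say which splitting criterion is used, so the sketch below assumes Gini impurity (the criterion commonly associated with CART) and is written in plain Python rather than R. The function names are hypothetical; the counts are the root (Yes=100, No=100) and its two child nodes (Yes=20, No=50 and Yes=80, No=50).

```python
def gini(yes, no):
    """Gini impurity of a node holding `yes` and `no` observations."""
    total = yes + no
    if total == 0:
        return 0.0
    p_yes = yes / total
    p_no = no / total
    return 1.0 - p_yes ** 2 - p_no ** 2


def weighted_gini(children):
    """Impurity after a split: child impurities weighted by child size."""
    total = sum(yes + no for yes, no in children)
    return sum((yes + no) / total * gini(yes, no) for yes, no in children)


# Counts read from the slide; Gini as the criterion is an assumption.
root = (100, 100)
children = [(20, 50), (80, 50)]   # the two slope-angle branches

root_impurity = gini(*root)
split_impurity = weighted_gini(children)
impurity_decrease = root_impurity - split_impurity

print(round(root_impurity, 4), round(split_impurity, 4), round(impurity_decrease, 4))
```

A tree-growing algorithm would compute this decrease for every candidate variable and threshold and keep the split with the largest value, which is the greedy "best variable available" step described earlier.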
  • 14. Current application
      The goal is to predict the probability that a landslide will occur at a particular place.
      Data:
        • Dependent factor: landslide training (75 observations) and testing (25 observations) data locations.
        • Independent factors: elevation, slope, NDVI, curvature.
      Note:
        • The analysis depends on the number of observations: more training observations will increase the model efficiency.
  • 15. Data input
        1. Prepare the GIS data.
        2. Resample to the same extent and resolution.
        3. Check data quality.
        4. Convert to a statistical data format such as .txt or .csv.
        5. Check the data in the R environment, e.g. with summary, str, head, plot.

      Testing_points  Elevation  Curvature  Slope     NDVI
      1               275        -0.0625    13.55703  0.516273
      0               363        0.0625     16.73342  0.469728
      0               267        0.1875     13.2819   0.435414
      0               92         0.125      10.01578  0.396327
      …               …          …          …         …
  • 16. Further illustration using a categorical target (1–0)
      The survey has 8399 respondents: 57% are best customers (1) and 43% are other (0).
      Left side (total lifetime visits):
        • Females (F) are more likely to be best customers than males (M).
        • First row: the difference between males and females is statistically significant; females are (59 − 54)% = 5% more likely to be a best customer.
        • Whether 5% is significant from a business point of view is a question for a business analyst.
      Right side (net sales):
        • Net sales appears to be a stronger or more relevant predictor of customer status for men than total lifetime visits (which was used to analyse females).
      To conclude: female and male behaviour have different nuances.
  • 17. Getting started
        1. Download the current course data.
        2. Open RStudio and connect to the internet.
        3. Let us begin! Happy learning!
  • 18. References
        • https://www.tutorialspoint.com/data_mining/dm_dti.html
        • http://scikit-learn.org/stable/modules/tree.html
        • https://goo.Gl/uk6i3x
        • https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/
        • https://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Classification/Decision_Trees
        • https://cran.r-project.org/web/views/MachineLearning.html
        • http://machinelearningmastery.com/non-linear-classification-in-r-with-decision-trees/
        • http://dni-institute.in/blogs/random-forest-using-r-step-by-step-tutorial/