SlideShare a Scribd company logo
1 of 25
Download to read offline
Modeling Weather-induced home Insurance
Risks: A Machine Learning Approach
Asim Dey
The University of Texas at Dallas
joint with
Yulia R. Gel,The University of Texas at Dallas
Slava Lyubchich, University of Maryland Center for Environmental Science.
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Outline
1 Introduction
2 Data
3 Methods
Support Vector Regression (SVR)
Neural Network (NN)
4 Prediction
5 Uncertainty due to Climate Models
6 Future Work
Asim Dey 2/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
US trend in heavy precipitation
Figure: 1. The relative occurrences of 2-day precipitation totals that exceed
on average only once in a five-year period. Changes are compared to
1901-1960 average (Source: GlobalChange.gov).
Since 2008, the United States has seen six floods costing at least $1
billion each (SOA).
Asim Dey 3/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Flood disasters in Canada
1900 1910 1920 1930 1940 1950 1960 1970 1980 1990
Numberofflooddisasters
0102030405060 3
4
9
7 8
13 12
44
50 51
Figure: 2. Frequency of flood disaster in Canada (Figure adapted from
Cheng et al. 2012).
In Canada, from 2009 to 2014, total insured losses from catastrophic
events were close to or above $1 billion each year (IBC).
Asim Dey 4/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Flood disasters in Canada
1900 1910 1920 1930 1940 1950 1960 1970 1980 1990
Numberofflooddisasters
0102030405060 3
4
9
7 8
13 12
44
50 51
Figure: 2. Frequency of flood disaster in Canada (Figure adapted from
Cheng et al. 2012).
In Canada, from 2009 to 2014, total insured losses from catastrophic
events were close to or above $1 billion each year (IBC).
The Intergovernmental Panel on Climate Change (IPCC) has projected
that the severity and frequency of extreme rainfalls will further increase
(IPCC, 2014).
Asim Dey 4/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Literature Review
Statistical approaches for claim frequency: GLM (Haug et al., 2011),
GARMA (Soliman et al., 2015) , Bayesian hierarchical model (Scheel et
al., 2013), data-driven nonparametric procedure (Lyubchich and Gel,
2017).
Machine learning techniques for claim frequency: Neural Network (NN)
(Caldeira et al., 2015), SVR (Wu and Akbarov, 2013).
Goals:
1 Model and forecast a joint dynamics of weather-related home insurance
claims (frequencies) and losses (severities).
2 Utility of Support Vector Regression (SVR) and Neural Network (NN), in
forecasting future claim dynamics.
Asim Dey 5/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Data
Table: 1. Overview of the data sets
Period Data type/Climate model Variables
Control Observations Precipitation,
(2002–2011) Number of claims,
Aggregate loss
Scenario Projections from
CanESM2 4.5,
(2021–2080) CanESM2 8.5,
GFDL ESM2M R 8.5,
GFDL ESM2M W 8.5, Precipitation
MPI ESM 8.5,
HadGEM2 ES 8.5
.
Representative Concentration Pathways (RCPs) 4.5 (RCP 4.5) assumes that
the emissions will peak around 2040, then decline. Under the RCP 8.5
scenario, emissions continue to rise throughout the period 2000–2100.
Asim Dey 6/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Normalized Data
Year
Precipitation(mm/week)
0
20
40
60
80
100
120
2001 2003 2005 2007 2009 2011
x = 7.698
x~ = 3.80
s = 10.611
Year
NumberofCliam/week
051015202530
2001 2003 2005 2007 2009 2011
x = 3.608
x~ = 3.0
s = 8.01
Year
Lossinthousanddollars/week
0100200300400500600
2001 2003 2005 2007 2009 2011
x = 46620
x~ = 22300
s = 215961
Figure: 3. Weekly observed precipitation, number of claims and total loss
(2001-2011) in Canadian city A , ¯x is mean, ˜x is median, and s is standard
deviation.
Asim Dey 7/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Modeling Precipitation Related Risk
Number of claims at week t, Nt and aggregate loss at week t, Lt can be
modeled as
Nt = f (Rt , Rt−1) (1)
Lt = f (Rt , Rt−1, Nt ) (2)
where, Rt is total precipitation at week t, and Rt−1 is total precipitation at
week t − 1.
Asim Dey 8/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Support Vector Regression (SVR)
Least Squares Regression minimizes in-sample residual sum of squares.
SVR attempts to minimize the generalized error bound (Eq. 4).
SVR only depends on a particular subset (Residual > ε) of the training data.
− ε 0 ε
0
r
Vε
(r)
Figure: 4. SVR ε-insensitive error function.
f (x) = β, x + β0, (3)
H(β, β0) =
n
i=1
Vε (yi − f (xi )) +
λ
2
β
2
, (4)
k(xi , x) = exp −
x − xi
2
2σ2
(5)
Minimization of loss function leads to a quadratic programming problem.
Asim Dey 9/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Genetic Algorithm (GA)
The results of SVR modeling highly depend on the three user-defined
parameters (hyper-parameters):
1 Regularization parameter (λ),
2 Tube size of ε-insensitive loss function (ε),
3 Bandwidth of the kernel function (σ2
).
Inappropriate choice of hyper-parameters leads to over-fitting or under-fitting.
Genetic Algorithm (GA) is applied to simultaneously optimize all SVR
parameters (Goldberg, 1989).
Asim Dey 10/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Genetic Algorithm (GA) cont...
Each generation is like a iteration in numerical optimization problem. At
each iteration there is a progressive improvement of objective function.
Define:
- Parameters (C, σ, ε)
- Population size
- Fitness function
- Stopping criteria
Generate initial random population
Train SVR model
and calculate fitness
i = n?
Create new population by:
- Reproduction
- Crossover
- Mutation
Select optimal (C, σ, ε)
Train SVR model using ob-
tained hyper-parameters
i = 1
No
i = i + 1
Yes
Figure: 5. Flow-chart of the GA-SVR.
Asim Dey 11/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Genetic Algorithm (GA)
Table: 2. GA parameter settings
Parameter Value
Number of generations
(stopping criterion): 1000
Population size: 50
Fitness function: RMSE
Search domain (Hsu et al., 2003):
λ (10−3, 103)
σ2 (10−3, 24)
(10−2, 23)
Asim Dey 12/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Claims model-Neural Network (NN)
NN models the response as a nonlinear function of linear combinations of the
predictors.
X1
X2
X3
X4
Y
Input
layer
Output
layer
Figure 6(a): Linear Regression
Model.
Y = g
4
j=1
βj Xj + ,
g(v) = v,
X1
X2
X3
X4
Z1
Z2
Y
Input
layer
Hidden
layer
Ouput
layer
Figure 6(b): Neural Network
Model.
Zj = f
4
i=1
αij Xi ,
Y = g
2
j=1
βj Zj + ,
where, f (v) = 1/(1 + e−v ) is the sigmoid activation function. The unknown
parameters αij , αoj , βj , and βo are estimated by back-propagation method
(Bishop, 2006).
Asim Dey 13/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Number of hidden layers and nodes in a hidden
layer
The rules of thumb:
One hidden layer network can approximate any function that contains a
continuous mapping from one finite space to another (Heaton, 2008).
0.02750.02850.02950.0305
Number of claim model
Hidden neurons
Testerror(rRMSE)
1 2 3 4 1 2 3 4 5
0.0270.0280.0290.030
Loss Model
Hidden neurons
Testerror(rRMSE)
Figure 7: Number of nodes selection in a hidden layer, number of resamples
K is 100.
Asim Dey 14/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Model Selection
05102030
Year
Numberofclaims/week
Observed
GA−SVR
GLM
NN
2008 2009 2010 2011 2012
Year
Weeklytotalloss(1000CAD2002)
0
100
200
300
400
500
600
2008 2009 2010 2011 2012
Observed
GLM
NN
GA−SVR
Figure 8: Observed values versus fitted values from three different models.
GA-SVR captures the variability of observed data better than other two
model, particularly, the sudden high spikes in the number of claims and total
losses.
Asim Dey 15/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Cross-validation
Table: Number of claim model: Cross-validated Normalized average RMSE
(Em), City A
Model 5 years-RMSE (Em)
NN 29
SVR 27
GA-SVR 24
Em =
aveRMSEm
m × 1
N
N
t=1
yt
100%, (6)
where, aveRMSEm = 1
K
K
i=1
RMSEmi , the denominator of Eq. 6 is average
total number of claims (or, average total loss) in the period of m years,
N = 365 × 10, and K = 100 is number of resamples.
Asim Dey 16/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Prediction
The change from the control period (2002 − 2011) to the 6 sub periods, 10
years each, of projection (scenario) period (2021 − 2080) is
∆ = t∈scn
ˆYt /10
t∈ctr
Yt /10
− 1, (7)
where, ‘scn’ and ‘ctr’ refers to the scenario period and control period,
respectively.
Asim Dey 17/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Risk Prediction using GA-SVR
2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080
Number of claims
Aggregate loss
GA−SVR Model
Year
Changeinannualnumberofclaimsandloss,%
0
10
20
30
40
50
Figure 9: Projected percentage changes relative to the control period of
2002–2011. Climate model is CanESM2 4.5.
The annual number of home insurance claims and annual aggregate loss will
increase noticeably in the scenario periods.
Asim Dey 18/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
GA-SVR claims
2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080
Climate scenario
CanESM2 4.5
CanESM2 8.5
MPI ESM 8.5
GFDL ESM2MR 8.5
GFDL ESM2MW 8.5
HadGEM2 ES 8.5
GA−SVR Model
Year
Changeinannualnumberofclaims,%
0
10
20
30
40
50
60
Figure 10: Predicted claims from different climate models.
Asim Dey 19/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
GA-SVR losses
2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080
Climate scenario
CanESM2 4.5
CanESM2 8.5
MPI ESM 8.5
GFDL ESM2MR 8.5
GFDL ESM2MW 8.5
HadGEM2 ES 8.5
GA−SVR Model
Year
Changeinannualaggregateloss,%
0
10
20
30
40
50
Figure 11: Predicted losses from different climate models.
Asim Dey 20/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Quantile Regression (QR)
Number of claims at week t, Nt :
Nt = f (Rt , Rt−1)
where, Rt is total precipitation at week t, and Rt−1 is total precipitation at
week t − 1.
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.8
pcp0
0.0 0.2 0.4 0.6 0.8 1.0
0.00.10.20.30.4
pcp0l1
Figure 12: Regression coefficients (y-axis) for different quantiles of NCL
Asim Dey 21/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Observed vs predicted quantiles
Figure 13: Observed and fitted quantiles for weekly number of claims for
probabilities from 1% to 99%.
Asim Dey 22/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Predictions from different climate model
0
5
10
15
20
25
Quantileofweeklynumberofclaims
2002−2012 2021−2030 2061−2070
Observed
CanESM2rcp45CanRCM4
CanESM2rcp85CanRCM4
GFDLESM2Mrcp85RegCM4
GFDLESM2Mrcp85WRF
HadGEM2ESrcp85RegCM4
MPIESMLRrcp85RegCM4
Figure 13: Observed and predicted quantiles for weekly number of claims for
probabilities from 1% to 99%.
Asim Dey 23/24
Introduction
Data
Methods
Support Vector Regression
(SVR)
Neural Network (NN)
Prediction
Uncertainty due
to Climate
Models
Future Work
Future Works
Combine predictions from different climate models to a single distribution-
Ensembles of climate models.
1 Bayesian Models (Smith et al., 2009).
2 Bayesian Hierarchical Models (Sansom et al., 2017).
Expand the spatial domain of our analysis to other cities.
Incorporate other information, e.g., seasonal component, location and value
of assets.
Asim Dey 24/24

More Related Content

Similar to CLIM: Transition Workshop - Modeling Weather-related House Insurance Claims with Machine Learning Approach - Asim Dey, May 15, 2018

MFBLP Method Forecast for Regional Load Demand System
MFBLP Method Forecast for Regional Load Demand SystemMFBLP Method Forecast for Regional Load Demand System
MFBLP Method Forecast for Regional Load Demand SystemCSCJournals
 
IEOR 265 Final Paper_Minchao Lin
IEOR 265 Final Paper_Minchao LinIEOR 265 Final Paper_Minchao Lin
IEOR 265 Final Paper_Minchao LinMinchao Lin
 
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...IRJET Journal
 
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...Detection of Damage in Beam from Measured Natural Frequencies Using Support V...
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...Subhajit Mondal
 
Random Forest Ensemble of Support Vector Regression for Solar Power Forecasting
Random Forest Ensemble of Support Vector Regression for Solar Power ForecastingRandom Forest Ensemble of Support Vector Regression for Solar Power Forecasting
Random Forest Ensemble of Support Vector Regression for Solar Power ForecastingMohamed Abuella
 
Short-term load forecasting with using multiple linear regression
Short-term load forecasting with using multiple  linear regression Short-term load forecasting with using multiple  linear regression
Short-term load forecasting with using multiple linear regression IJECEIAES
 
Expert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnExpert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnijcsa
 
KatRisk RAA 2018 Highlights
KatRisk RAA 2018 HighlightsKatRisk RAA 2018 Highlights
KatRisk RAA 2018 HighlightsDag Lohmann
 
Short Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsShort Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsMohamed Abuella
 
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...Dr. Amarjeet Singh
 
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...Environmental Intelligence Lab
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...AIST
 
Panel data methods for microeconometrics using Stata! Short and good one :)
Panel data methods for microeconometrics using Stata! Short and good one :)Panel data methods for microeconometrics using Stata! Short and good one :)
Panel data methods for microeconometrics using Stata! Short and good one :)Wondmagegn Tafesse
 
The internal-multiple elimination algorithm for all first-order internal mult...
The internal-multiple elimination algorithm for all first-order internal mult...The internal-multiple elimination algorithm for all first-order internal mult...
The internal-multiple elimination algorithm for all first-order internal mult...Arthur Weglein
 
American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1Double Check ĆŐNSULTING
 
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...IJDKP
 
Transportation and logistics modeling 2
Transportation and logistics modeling 2Transportation and logistics modeling 2
Transportation and logistics modeling 2karim sal3awi
 

Similar to CLIM: Transition Workshop - Modeling Weather-related House Insurance Claims with Machine Learning Approach - Asim Dey, May 15, 2018 (20)

MFBLP Method Forecast for Regional Load Demand System
MFBLP Method Forecast for Regional Load Demand SystemMFBLP Method Forecast for Regional Load Demand System
MFBLP Method Forecast for Regional Load Demand System
 
IEOR 265 Final Paper_Minchao Lin
IEOR 265 Final Paper_Minchao LinIEOR 265 Final Paper_Minchao Lin
IEOR 265 Final Paper_Minchao Lin
 
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...
IRJET- Rainfall Simulation using ANN based Generealized Feed Forward and MLR ...
 
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...Detection of Damage in Beam from Measured Natural Frequencies Using Support V...
Detection of Damage in Beam from Measured Natural Frequencies Using Support V...
 
Random Forest Ensemble of Support Vector Regression for Solar Power Forecasting
Random Forest Ensemble of Support Vector Regression for Solar Power ForecastingRandom Forest Ensemble of Support Vector Regression for Solar Power Forecasting
Random Forest Ensemble of Support Vector Regression for Solar Power Forecasting
 
Short-term load forecasting with using multiple linear regression
Short-term load forecasting with using multiple  linear regression Short-term load forecasting with using multiple  linear regression
Short-term load forecasting with using multiple linear regression
 
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
Undergraduate Modeling Workshop - Southeastern US Rainfall Working Group Fina...
 
50120130405032
5012013040503250120130405032
50120130405032
 
Expert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnnExpert system design for elastic scattering neutrons optical model using bpnn
Expert system design for elastic scattering neutrons optical model using bpnn
 
KatRisk RAA 2018 Highlights
KatRisk RAA 2018 HighlightsKatRisk RAA 2018 Highlights
KatRisk RAA 2018 Highlights
 
Short Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research HighlightsShort Presentation: Mohamed abuella's Research Highlights
Short Presentation: Mohamed abuella's Research Highlights
 
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
 
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
Curses, tradeoffs, and scalable management: advancing evolutionary direct pol...
 
Joint GWP CEE/DMCSEE training: From Drought Management Strategies to Drought...
Joint GWP CEE/DMCSEE training: From Drought Management Strategies to  Drought...Joint GWP CEE/DMCSEE training: From Drought Management Strategies to  Drought...
Joint GWP CEE/DMCSEE training: From Drought Management Strategies to Drought...
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
Panel data methods for microeconometrics using Stata! Short and good one :)
Panel data methods for microeconometrics using Stata! Short and good one :)Panel data methods for microeconometrics using Stata! Short and good one :)
Panel data methods for microeconometrics using Stata! Short and good one :)
 
The internal-multiple elimination algorithm for all first-order internal mult...
The internal-multiple elimination algorithm for all first-order internal mult...The internal-multiple elimination algorithm for all first-order internal mult...
The internal-multiple elimination algorithm for all first-order internal mult...
 
American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1
 
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
DEEP LEARNING BASED MULTIPLE REGRESSION TO PREDICT TOTAL COLUMN WATER VAPOR (...
 
Transportation and logistics modeling 2
Transportation and logistics modeling 2Transportation and logistics modeling 2
Transportation and logistics modeling 2
 

More from The Statistical and Applied Mathematical Sciences Institute

More from The Statistical and Applied Mathematical Sciences Institute (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
 
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
 
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
 
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
 
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
 
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
 
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
 
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
 
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
 
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
 
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
 
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
 
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
 
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
 
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
 
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
 
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
 
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

Recently uploaded

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 

Recently uploaded (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 

CLIM: Transition Workshop - Modeling Weather-related House Insurance Claims with Machine Learning Approach - Asim Dey, May 15, 2018

  • 1. Modeling Weather-induced home Insurance Risks: A Machine Learning Approach Asim Dey The University of Texas at Dallas joint with Yulia R. Gel,The University of Texas at Dallas Slava Lyubchich, University of Maryland Center for Environmental Science.
  • 2. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Outline 1 Introduction 2 Data 3 Methods Support Vector Regression (SVR) Neural Network (NN) 4 Prediction 5 Uncertainty due to Climate Models 6 Future Work Asim Dey 2/24
  • 3. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work US trend in heavy precipitation Figure: 1. The relative occurrences of 2-day precipitation totals that exceed on average only once in a five-year period. Changes are compared to 1901-1960 average (Source: GlobalChange.gov). Since 2008, the United States has seen six floods costing at least $1 billion each (SOA). Asim Dey 3/24
  • 4. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Flood disasters in Canada 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 Numberofflooddisasters 0102030405060 3 4 9 7 8 13 12 44 50 51 Figure: 2. Frequency of flood disaster in Canada (Figure adapted from Cheng et al. 2012). In Canada, from 2009 to 2014, total insured losses from catastrophic events were close to or above $1 billion each year (IBC). Asim Dey 4/24
  • 5. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Flood disasters in Canada 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 Numberofflooddisasters 0102030405060 3 4 9 7 8 13 12 44 50 51 Figure: 2. Frequency of flood disaster in Canada (Figure adapted from Cheng et al. 2012). In Canada, from 2009 to 2014, total insured losses from catastrophic events were close to or above $1 billion each year (IBC). The Intergovernmental Panel on Climate Change (IPCC) has projected that the severity and frequency of extreme rainfalls will further increase (IPCC, 2014). Asim Dey 4/24
  • 6. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Literature Review Statistical approaches for claim frequency: GLM (Haug et al., 2011), GARMA (Soliman et al., 2015) , Bayesian hierarchical model (Scheel et al., 2013), data-driven nonparametric procedure (Lyubchich and Gel, 2017). Machine learning techniques for claim frequency: Neural Network (NN) (Caldeira et al., 2015), SVR (Wu and Akbarov, 2013). Goals: 1 Model and forecast a joint dynamics of weather-related home insurance claims (frequencies) and losses (severities). 2 Utility of Support Vector Regression (SVR) and Neural Network (NN), in forecasting future claim dynamics. Asim Dey 5/24
  • 7. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Data Table: 1. Overview of the data sets Period Data type/Climate model Variables Control Observations Precipitation, (2002–2011) Number of claims, Aggregate loss Scenario Projections from CanESM2 4.5, (2021–2080) CanESM2 8.5, GFDL ESM2M R 8.5, GFDL ESM2M W 8.5, Precipitation MPI ESM 8.5, HadGEM2 ES 8.5 . Representative Concentration Pathways (RCPs) 4.5 (RCP 4.5) assumes that the emissions will peak around 2040, then decline. Under the RCP 8.5 scenario, emissions continue to rise throughout the period 2000–2100. Asim Dey 6/24
  • 8. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Normalized Data Year Precipitation(mm/week) 0 20 40 60 80 100 120 2001 2003 2005 2007 2009 2011 x = 7.698 x~ = 3.80 s = 10.611 Year NumberofCliam/week 051015202530 2001 2003 2005 2007 2009 2011 x = 3.608 x~ = 3.0 s = 8.01 Year Lossinthousanddollars/week 0100200300400500600 2001 2003 2005 2007 2009 2011 x = 46620 x~ = 22300 s = 215961 Figure: 3. Weekly observed precipitation, number of claims and total loss (2001-2011) in Canadian city A , ¯x is mean, ˜x is median, and s is standard deviation. Asim Dey 7/24
  • 9. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Modeling Precipitation Related Risk Number of claims at week t, Nt and aggregate loss at week t, Lt can be modeled as Nt = f (Rt , Rt−1) (1) Lt = f (Rt , Rt−1, Nt ) (2) where, Rt is total precipitation at week t, and Rt−1 is total precipitation at week t − 1. Asim Dey 8/24
  • 10. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Support Vector Regression (SVR) Least Squares Regression minimizes in-sample residual sum of squares. SVR attempts to minimize the generalized error bound (Eq. 4). SVR only depends on a particular subset (Residual > ε) of the training data. − ε 0 ε 0 r Vε (r) Figure: 4. SVR ε-insensitive error function. f (x) = β, x + β0, (3) H(β, β0) = n i=1 Vε (yi − f (xi )) + λ 2 β 2 , (4) k(xi , x) = exp − x − xi 2 2σ2 (5) Minimization of loss function leads to a quadratic programming problem. Asim Dey 9/24
  • 11. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Genetic Algorithm (GA) The results of SVR modeling highly depend on the three user-defined parameters (hyper-parameters): 1 Regularization parameter (λ), 2 Tube size of ε-insensitive loss function (ε), 3 Bandwidth of the kernel function (σ2 ). Inappropriate choice of hyper-parameters leads to over-fitting or under-fitting. Genetic Algorithm (GA) is applied to simultaneously optimize all SVR parameters (Goldberg, 1989). Asim Dey 10/24
  • 12. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Genetic Algorithm (GA) cont... Each generation is like a iteration in numerical optimization problem. At each iteration there is a progressive improvement of objective function. Define: - Parameters (C, σ, ε) - Population size - Fitness function - Stopping criteria Generate initial random population Train SVR model and calculate fitness i = n? Create new population by: - Reproduction - Crossover - Mutation Select optimal (C, σ, ε) Train SVR model using ob- tained hyper-parameters i = 1 No i = i + 1 Yes Figure: 5. Flow-chart of the GA-SVR. Asim Dey 11/24
  • 13. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Genetic Algorithm (GA) Table: 2. GA parameter settings Parameter Value Number of generations (stopping criterion): 1000 Population size: 50 Fitness function: RMSE Search domain (Hsu et al., 2003): λ (10−3, 103) σ2 (10−3, 24) (10−2, 23) Asim Dey 12/24
  • 14. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Claims model-Neural Network (NN) NN models the response as a nonlinear function of linear combinations of the predictors. X1 X2 X3 X4 Y Input layer Output layer Figure 6(a): Linear Regression Model. Y = g 4 j=1 βj Xj + , g(v) = v, X1 X2 X3 X4 Z1 Z2 Y Input layer Hidden layer Ouput layer Figure 6(b): Neural Network Model. Zj = f 4 i=1 αij Xi , Y = g 2 j=1 βj Zj + , where, f (v) = 1/(1 + e−v ) is the sigmoid activation function. The unknown parameters αij , αoj , βj , and βo are estimated by back-propagation method (Bishop, 2006). Asim Dey 13/24
  • 15. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Number of hidden layers and nodes in a hidden layer The rules of thumb: One hidden layer network can approximate any function that contains a continuous mapping from one finite space to another (Heaton, 2008). 0.02750.02850.02950.0305 Number of claim model Hidden neurons Testerror(rRMSE) 1 2 3 4 1 2 3 4 5 0.0270.0280.0290.030 Loss Model Hidden neurons Testerror(rRMSE) Figure 7: Number of nodes selection in a hidden layer, number of resamples K is 100. Asim Dey 14/24
  • 16. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Model Selection 05102030 Year Numberofclaims/week Observed GA−SVR GLM NN 2008 2009 2010 2011 2012 Year Weeklytotalloss(1000CAD2002) 0 100 200 300 400 500 600 2008 2009 2010 2011 2012 Observed GLM NN GA−SVR Figure 8: Observed values versus fitted values from three different models. GA-SVR captures the variability of observed data better than other two model, particularly, the sudden high spikes in the number of claims and total losses. Asim Dey 15/24
  • 17. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Cross-validation Table: Number of claim model: Cross-validated Normalized average RMSE (Em), City A Model 5 years-RMSE (Em) NN 29 SVR 27 GA-SVR 24 Em = aveRMSEm m × 1 N N t=1 yt 100%, (6) where, aveRMSEm = 1 K K i=1 RMSEmi , the denominator of Eq. 6 is average total number of claims (or, average total loss) in the period of m years, N = 365 × 10, and K = 100 is number of resamples. Asim Dey 16/24
  • 18. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Prediction The change from the control period (2002 − 2011) to the 6 sub periods, 10 years each, of projection (scenario) period (2021 − 2080) is ∆ = t∈scn ˆYt /10 t∈ctr Yt /10 − 1, (7) where, ‘scn’ and ‘ctr’ refers to the scenario period and control period, respectively. Asim Dey 17/24
  • 19. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Risk Prediction using GA-SVR 2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080 Number of claims Aggregate loss GA−SVR Model Year Changeinannualnumberofclaimsandloss,% 0 10 20 30 40 50 Figure 9: Projected percentage changes relative to the control period of 2002–2011. Climate model is CanESM2 4.5. The annual number of home insurance claims and annual aggregate loss will increase noticeably in the scenario periods. Asim Dey 18/24
  • 20. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work GA-SVR claims 2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080 Climate scenario CanESM2 4.5 CanESM2 8.5 MPI ESM 8.5 GFDL ESM2MR 8.5 GFDL ESM2MW 8.5 HadGEM2 ES 8.5 GA−SVR Model Year Changeinannualnumberofclaims,% 0 10 20 30 40 50 60 Figure 10: Predicted claims from different climate models. Asim Dey 19/24
  • 21. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work GA-SVR losses 2021−2030 2031−2040 2041−2050 2051−2060 2061−2070 2071−2080 Climate scenario CanESM2 4.5 CanESM2 8.5 MPI ESM 8.5 GFDL ESM2MR 8.5 GFDL ESM2MW 8.5 HadGEM2 ES 8.5 GA−SVR Model Year Changeinannualaggregateloss,% 0 10 20 30 40 50 Figure 11: Predicted losses from different climate models. Asim Dey 20/24
  • 22. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Quantile Regression (QR) Number of claims at week t, Nt : Nt = f (Rt , Rt−1) where, Rt is total precipitation at week t, and Rt−1 is total precipitation at week t − 1. 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.8 pcp0 0.0 0.2 0.4 0.6 0.8 1.0 0.00.10.20.30.4 pcp0l1 Figure 12: Regression coefficients (y-axis) for different quantiles of NCL Asim Dey 21/24
  • 23. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Observed vs predicted quantiles Figure 13: Observed and fitted quantiles for weekly number of claims for probabilities from 1% to 99%. Asim Dey 22/24
  • 24. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Predictions from different climate model 0 5 10 15 20 25 Quantileofweeklynumberofclaims 2002−2012 2021−2030 2061−2070 Observed CanESM2rcp45CanRCM4 CanESM2rcp85CanRCM4 GFDLESM2Mrcp85RegCM4 GFDLESM2Mrcp85WRF HadGEM2ESrcp85RegCM4 MPIESMLRrcp85RegCM4 Figure 13: Observed and predicted quantiles for weekly number of claims for probabilities from 1% to 99%. Asim Dey 23/24
  • 25. Introduction Data Methods Support Vector Regression (SVR) Neural Network (NN) Prediction Uncertainty due to Climate Models Future Work Future Works Combine predictions from different climate models to a single distribution- Ensembles of climate models. 1 Bayesian Models (Smith et al., 2009). 2 Bayesian Hierarchical Models (Sansom et al., 2017). Expand the spatial domain of our analysis to other cities. Incorporate other information, e.g., seasonal component, location and value of assets. Asim Dey 24/24