SlideShare a Scribd company logo
4th International Summer School
Achievements and Applications of Contemporary
Informatics, Mathematics and Physics
National University of Technology of the Ukraine
Kiev, Ukraine, August 5-16, 2009




                        Prediction of Credit Default
                            by Continuous Optimization


                                   Gerhard-
                                   Gerhard-Wilhelm Weber           *

                             Efsun Kürüm, Kasırga Yıldırak
                               Institute of Applied Mathematics
                       Middle East Technical University, Ankara, Turkey
           * Faculty of Economics, Management and Law, University of Siegen, Germany
           Center for Research on Optimization and Control, University of Aveiro, Portugal
Outline

•   Main Problem from Credit Default

•   Logistic Regression and Performance Evaluation

•   Cut-Off Values and Thresholds

•   Classification and Optimization

•   Nonlinear Regression

•   Numerical Results

•   Outlook and Conclusion
Main Problem from Credit Default



    Whether a credit application should be consented or rejected.




Solution

    Learning about the default probability of the applicant.
Main Problem from Credit Default



    Whether a credit application should be consented or rejected.




Solution

    Learning about the default probability of the applicant.
Logistic Regression



     P(Y = 1 X = xl ) 
log                     = β0 + β1 ⋅ xl1 + β2 ⋅ xl 2 + K + β p ⋅ xlp
     P(Y = 0 X = x ) 
                   l 


                                                         (l = 1, 2,..., N )
Goal

Our study is based on one of the Basel II criteria which
recommend that the bank should divide corporate firms by
8 rating degrees with one of them being the default class.


We have two problems to solve here:
 To distinguish the defaults from non-defaults.
 To put non-default firms in an order based on their credit quality
  and classify them into (sub) classes.
Data

Data have been collected by a bank from the firms operating in the
manufacturing sector in Turkey.
They cover the period between 2001 and 2006.
There are 54 qualitative variables and 36 quantitative variables originally.
Data on quantitative variables are formed based on a balance sheet
submitted by the firms’ accountants.
Essentially, they are the well-known financial ratios.
The data set covers 3150 firms from which 92 are in the state of default.
As the number of default is small, in order to overcome the possible
statistical problems, we downsize the number to 551,
keeping all the default cases in the set.
We evaluate performance of the model

non-default                      default
cases                            cases
              cut-off value



                                                                       ROC curve

                              test result value




                                                  TPF, sensitivity



                                                                     FPF, 1-specificity
Model outcome versus truth



                                           truth

                                    d                n

                               True Positive    False Positive
                                  Fraction         Fraction
                          dı
                                    TPF              FPF
          model outcome

                               False Negative   True Negative
                                  Fraction         Fraction
                          nı
                                    FNF             TNF


                                     1                1

                                            total
Definitions


• sensitivity (TPF) := P( Dı | D)
• specificity        := P( NDı | ND )
• 1-specificity (FPF) := P( Dı | ND )

• points (TPF, FPF) constitute the ROC curve
• c := cut-off value
• c takes values between - ∞ and ∞


• TPF(c) := P( z>c | D )
• FPF(c) := P( z>c | ND )
normal-deviate axes
TPF




                                         Normal Deviate (TPF)
                                   FPF




  FPF (ci ) := Φ(ci )
  TPF (ci ) := Φ(a + b ⋅ ci )

             µn - µs          σn
      a :=             b :=
               σs             σs
                                                                Normal Deviate (FPF)
normal-deviate axes
TPF




                                   t

                                             Normal Deviate (TPF)
                                       FPF




  FPF (ci ) := Φ(ci )
  TPF (ci ) := Φ(a + b ⋅ ci )
                                                                                      c

             µn - µs          σn
      a :=             b :=
               σs             σs
                                                                    Normal Deviate (FPF)
Classification

    Ex.:              cut-off values




                actually non-default                                          actually default
                       cases                                                            cases

                                                                                                 c

           −∞                    class I   class II   class III class IV   class V                   ∞


   To assess discriminative power of such a model,
   we calculate the Area Under (ROC) Curve:

                                           ∞
                             AUC :=         ∫   Φ(a + b ⋅ c) d Φ(c).
                                           −∞
relationship between thresholds and cut-off values


    Ex.:
               TPF




                                                   FPF

                     t0   t1   t2   t3   t4   t5         R=5



                  Φ( c ) = t   ⇔     c = Φ − 1 (t )
Optimization in Credit Default



    Problem:


    Simultaneously to obtain the thresholds and the parameters a and b
    that maximize AUC,

    while balancing the size of the classes (regularization)


    and guaranteeing a good accuracy.
Optimization Problem


                                                                                         2
                 1
                                         -1
          α 1 ⋅ ∫ Φ( a + b ⋅ Φ (t )) dt − α 2 ⋅∑ 
                                                    γi       R −1      
 max                                                    − (ti +1 − ti ) 
                                              i =0                     
                                                     n
 a,b,τ         0


                             ti +1
         subject to
                               ∫     Φ(a + b ⋅ Φ −1 (t ))d t ≥ δi       (i = 0,1,..., R − 1)
                               ti




          τ   := ( t1 , t 2 ,..., t R -1 ) T     t0   = 0,   tR   = 1
Optimization Problem


                                                                                              2
                 1
                                         -1
          α 1 ⋅ ∫ Φ( a + b ⋅ Φ (t )) dt − α 2 ⋅∑ 
                                                    γi       R −1      
 max                                                    − (ti +1 − ti ) 
                                              i =0                     
                                                     n
 a,b,τ         0


                              ti +1
         subject to
                                ∫     Φ(a + b ⋅ Φ−1(t ))d t ≥ δi        >0   (i = 0,1,..., R − 1)
                               ti
                                                              ⇒ ti +1 > ti



          τ   := ( t1 , t 2 ,..., t R -1 ) T     t0   = 0,   tR   = 1
Over the ROC Curve

         TPF

                    1-AUC



                                    AUC




                                                     FPF

               t0     t1       t2     t3   t4   t5


                           1
        AOC : =            ∫ (1 − Φ( a + b ⋅ Φ − 1 (t ))) dt
                           0
New Version of the Optimization Problem



               R −1                      2
                       γi                    1
 min       α 2 ⋅ ∑  − (ti +1 − ti )  + α 1 ⋅ ∫ (1 − Φ(a + b ⋅ Φ −1 (t ))) dt
                 i =0               
 a, b, τ                n                      0




      subject to

                   t
                       j +1
                       ∫ (1− Φ(a + b ⋅Φ−1(t ))) dt ≤ t j +1 − t j − δ j   ( j = 0,1, ..., R −1)
                       t
                           j
Regression in Credit Default


    Optimization problem:

    Simultaneously to obtain the thresholds and the parameters a and b
    that maximize AUC,
    while balancing the size of the classes (regularization)
    and guaranteeing a good accuracy




                                            discretization of integral
                                            nonlinear regression problem
Discretization of the Integral


    Riemann-Stieltjes integral
                              ∞
                 AUC =        ∫   Φ (a + b ⋅ c ) dΦ(c )
                           −∞
     Riemann integral
                          1
                 AUC =    ∫ Φ( a + b ⋅ Φ −1 (t )) dt
                          0
     Discretization
                              R
                 AUC ≈ ∑ Φ( a + b ⋅ Φ −1 (tk )) ⋅ ∆tk
                           k =1
Optimization Problem with Penalty Parameters

  In the case of violation of anyone of these constraints, we introduce penalty
  parameters. As some penalty becomes increased, the iterates are forced
  towards the feasible set of the optimization problem.



                           R −1                        2
                                  γi                          1
     ΠΘ ( a,b, τ ) := α 2 ⋅ ∑        − (ti +1 − ti )  − α 1 ⋅ ∫ (1 - Φ( a + b ⋅ Φ -1 (t ))) dt +
                            i =0                     
                                   n                            0




                               R -1           t j +1                      
                                                                         −1
                        α 3 ⋅ ∑ θ j ⋅  δ j −  ∫ Φ( a + b ⋅ Φ (t ))) dt  
                                              tj                          
                              j =0           
                                      1444442444444      4                 
                                                                            3
                                                       =: Ψ j ( a , b , τ )


 Θ := ( θ1 , θ 2 , ..., θ R − 1 ) T                             θj ≥0           ( j = 0,1, ..., R − 1)
Optimization Problem                           further discretized



                 R−1                 2
                        γi
                                               R
                                    
ΠΘ(a,b,τ ) = α2 ⋅ ∑  − (ti+1 −ti )  + α1 ⋅ ∑( (1-Φ(a + b⋅Φ−1(t j ))) ∆t j )2 +
                  i =0             
                         n                   j =1



                                                                                            2
                                       nj                                                 
                                R-1
                           α 3. ∑ θ j  ∑                 −1(ην ) ) − δ j              ν 
                                                Φ(a + b ⋅Φ
                                                              j                   
                                                                                   
                                                                                       ∆η j
                               j =0   ν=0                          t j +1 − t j          
                                                                                           
                                                                                             
Optimization Problem                         further discretized



                 R−1                 2
                        γi
                                              R
                                    
ΠΘ(a,b,τ ) = α2 ⋅ ∑  − (ti+1 −ti )  + α1 ⋅ ∑( (1-Φ(a + b⋅Φ−1(t j ))) ∆t j )2 +
                  i =0             
                         n                   j =1




                                                                                      2
                                 R-1    nj            −1(ην ) ) − δ j  ∆ην  
                           α 3 . ∑ θ j  ∑   Φ(a + b ⋅Φ                           
                                                                                   j  
                                            
                                       ν=0  
                                                             j                   
                                                                    t j +1 − t j 
                                j =0                                                  
                                       
                                                                                       
                                                                                        
Nonlinear Regression


                                                   2

                           ∑ j ( j )
                            N
         min f ( β ) =           d − g x ,β 
                                             
                           j =1 
                            N
                      =:   ∑       f j2 ( β )
                            j =1




                                                F ( β ) := ( f1 ( β ),..., f N ( β ) )
                                                                                     T




        min f ( β ) = F T ( β ) F ( β )
Nonlinear Regression

                                                                        β k +1 := β k + qk
 • Gauss-Newton method :


                      ∇F ( β )∇T F ( β )q = −∇F ( β ) F ( β )




 • Levenberg-Marquardt method :
                                                                                   λ ≥0

                  (                            )
                      ∇F ( β )∇T F (β ) + λ I p q = −∇F ( β ) F ( β )
Nonlinear Regression


alternative solution



 min    t,
  t,q


 subject to     ( ∇F (β )∇   T
                                               )
                                 F ( β ) + λ I p q − ( −∇F ( β ) F ( β ) )
                                                                             2
                                                                                 ≤ t , t ≥ 0,

                  || Lq || 2 ≤ M




conic quadratic programming
Nonlinear Regression


alternative solution



 min    t,
  t,q


 subject to     ( ∇F (β )∇   T
                                               )
                                 F ( β ) + λ I p q − ( −∇F ( β ) F ( β ) )
                                                                             2
                                                                                 ≤ t , t ≥ 0,

                  || Lq || 2 ≤ M




conic quadratic programming

interior point methods
Numerical Results

                                         Initial Parameters
                a          b                      Threshold values (t)

                1         0.95       0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35

               1.5        0.85       0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35
               0.80       0.95       0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35
                2         0.70       0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35




                                         Optimization Results
           a          b                        Threshold values (t)               AUC

          0.9999 0.9501          0.0004 0.0020 0.0032 0.012 0.03537 0.09 0.3400   0.8447

          1.4999 0.8501          0.0003 0.0017 0.0036 0.011 0.03537 0.10 0.3500   0.9167

          0.7999 0.9501          0.0004 0.0018 0.0032 0.011 0.03400 0.10 0.3300   0.8138

          2.0001 0.7001          0.0004 0.0020 0.0031 0.012 0.03343 0.11 0.3400   0.9671
Numerical Results

                                Accuracy Error in Each Class
               I          II         III        IV           V         VI        VII       VIII
            0.0000     0.0000     0.0000     0.0001 0.0001 0.0010              0.0010 0.0075
            0.0000     0.0000     0.0000     0.0001 0.0001 0.0010              0.0018 0.0094

            0.0000     0.0000     0.0000     0.0000 0.0001 0.0002              0.0018 0.0059

            0.0000     0.0000     0.0000     0.0001 0.0001 0.0006              0.0018 0.0075




                                Number of Firms in Each Class
               I          II         III        IV           V         VI         VII       VIII
               4          56         27        133       115          102         129        61
               2          42         52        120       119          111         120        61

               4          43         40        129       114          116         120        61

               4          56         24        136       106          129         111        61
           Number of firms in each class at the beginning:       10, 26, 58, 106, 134, 121, 111, 61
Generalized Additive Models




                              http://144.122.137.55/gweber/
Generalized Additive Models
References

Aster, A., Borchers, B., and Thurber, C., Parameter Estimation and Inverse Problems, Academic Press, 2004.

Boyd, S., and Vandenberghe, L., Convex Optimization, Cambridge University Press, 2004.

Buja, A., Hastie, T., and Tibshirani, R., Linear smoothers and additive models, The Ann. Stat. 17, 2 (1989)
453-510.
Fox, J., Nonparametric regression, Appendix to an R and S-Plus Companion to Applied Regression,
Sage Publications, 2002.

Friedman, J.H., Multivariate adaptive regression splines, Annals of Statistics 19, 1 (1991) 1-141.

Friedman, J.H., and Stuetzle, W., Projection pursuit regression, J. Amer. Statist Assoc. 76 (1981) 817-823.

Hastie, T., and Tibshirani, R., Generalized additive models, Statist. Science 1, 3 (1986) 297-310.

Hastie, T., and Tibshirani, R., Generalized additive models: some applications, J. Amer. Statist. Assoc.
82, 398 (1987) 371-386.

Hastie, T., Tibshirani, R., and Friedman, J.H., The Element of Statistical Learning, Springer, 2001.

Hastie, T.J., and Tibshirani, R.J., Generalized Additive Models, New York, Chapman and Hall, 1990.
Nash, G., and Sofer, A., Linear and Nonlinear Programming, McGraw-Hill, New York, 1996.
Nemirovski, A., Lectures on modern convex optimization, Israel Institute of Technology (2002).
References

Nemirovski, A., Modern Convex Optimization, lecture notes, Israel Institute of Technology (2005).

Nesterov, Y.E , and Nemirovskii, A.S., Interior Point Methods in Convex Programming, SIAM, 1993.

Önalan, Ö., Martingale measures for NIG Lévy processes with applications to mathematical finance,
presentation in: Advanced Mathematical Methods for Finance, Side, Antalya, Turkey, April 26-29, 2006.

Taylan, P., Weber, G.-W., and Yerlikaya, F., A new approach to multivariate adaptive regression spline
by using Tikhonov regularization and continuous optimization, to appear in TOP, Selected Papers at the
Occasion of 20th EURO Mini Conference (Neringa, Lithuania, May 20-23, 2008).

Stone, C.J., Additive regression and other nonparametric models, Annals of Statistics 13, 2 (1985) 689-705.

Weber, G.-W., Taylan, P., Akteke-Öztürk, B., and Uğur, Ö., Mathematical and data mining contributions
dynamics and optimization of gene-environment networks, in the special issue Organization in Matter
from Quarks to Proteins of Electronic Journal of Theoretical Physics.

Weber, G.-W., Taylan, P., Yıldırak, K., and Görgülü, Z.K., Financial regression and organization, to appear
in the Special Issue on Optimization in Finance, of DCDIS-B (Dynamics of Continuous, Discrete and
Impulsive Systems (Series B)).

More Related Content

Similar to Prediction of Credit Default by Continuous Optimization

modelperfcheatsheet.pdf
modelperfcheatsheet.pdfmodelperfcheatsheet.pdf
modelperfcheatsheet.pdf
ERNESTOVEIGA
 
Data Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemData Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemJamie (Taka) Wang
 
CreditRisk+.pdf
CreditRisk+.pdfCreditRisk+.pdf
CreditRisk+.pdf
OstapKhomenko
 
Dr. Amir Nejat
Dr. Amir NejatDr. Amir Nejat
Dr. Amir Nejat
knowdiff
 
Regression Theory
Regression TheoryRegression Theory
Regression Theory
SSA KPI
 
Skew Berlin2009
Skew Berlin2009Skew Berlin2009
Skew Berlin2009
matthewcpollard
 
Semi-Infinite and Robust Optimization
Semi-Infinite and Robust OptimizationSemi-Infinite and Robust Optimization
Semi-Infinite and Robust Optimization
SSA KPI
 
Further Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical OptimizationFurther Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical Optimization
SSA KPI
 
Presentation
PresentationPresentation
Presentation
guest635cb8
 
Power Market and Models Convergence ?
Power Market and Models Convergence ?Power Market and Models Convergence ?
Power Market and Models Convergence ?
NicolasRR
 
NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)
Christian Robert
 
An Empirical Study of Exposure at Default
An Empirical Study of Exposure at DefaultAn Empirical Study of Exposure at Default
An Empirical Study of Exposure at DefaultMichael Jacobs, Jr.
 
Portfolios and Risk Premia for the Long Run
Portfolios and Risk Premia for the Long RunPortfolios and Risk Premia for the Long Run
Portfolios and Risk Premia for the Long Runguasoni
 
Skiena algorithm 2007 lecture02 asymptotic notation
Skiena algorithm 2007 lecture02 asymptotic notationSkiena algorithm 2007 lecture02 asymptotic notation
Skiena algorithm 2007 lecture02 asymptotic notationzukun
 
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
STAIR Lab, Chiba Institute of Technology
 
CDT 22 slides.pdf
CDT 22 slides.pdfCDT 22 slides.pdf
CDT 22 slides.pdf
Christian Robert
 
NIPS2007: learning using many examples
NIPS2007: learning using many examplesNIPS2007: learning using many examples
NIPS2007: learning using many exampleszukun
 
Captum Palisades UC 2011
Captum Palisades UC 2011Captum Palisades UC 2011
Captum Palisades UC 2011
Captum Capital Limited
 
chap4_imbalanced_classes.pptx
chap4_imbalanced_classes.pptxchap4_imbalanced_classes.pptx
chap4_imbalanced_classes.pptx
ssuser6dd52e
 
Lecture6
Lecture6Lecture6
Lecture6voracle
 

Similar to Prediction of Credit Default by Continuous Optimization (20)

modelperfcheatsheet.pdf
modelperfcheatsheet.pdfmodelperfcheatsheet.pdf
modelperfcheatsheet.pdf
 
Data Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification SystemData Mining With A Simulated Annealing Based Fuzzy Classification System
Data Mining With A Simulated Annealing Based Fuzzy Classification System
 
CreditRisk+.pdf
CreditRisk+.pdfCreditRisk+.pdf
CreditRisk+.pdf
 
Dr. Amir Nejat
Dr. Amir NejatDr. Amir Nejat
Dr. Amir Nejat
 
Regression Theory
Regression TheoryRegression Theory
Regression Theory
 
Skew Berlin2009
Skew Berlin2009Skew Berlin2009
Skew Berlin2009
 
Semi-Infinite and Robust Optimization
Semi-Infinite and Robust OptimizationSemi-Infinite and Robust Optimization
Semi-Infinite and Robust Optimization
 
Further Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical OptimizationFurther Advanced Methods from Mathematical Optimization
Further Advanced Methods from Mathematical Optimization
 
Presentation
PresentationPresentation
Presentation
 
Power Market and Models Convergence ?
Power Market and Models Convergence ?Power Market and Models Convergence ?
Power Market and Models Convergence ?
 
NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)
 
An Empirical Study of Exposure at Default
An Empirical Study of Exposure at DefaultAn Empirical Study of Exposure at Default
An Empirical Study of Exposure at Default
 
Portfolios and Risk Premia for the Long Run
Portfolios and Risk Premia for the Long RunPortfolios and Risk Premia for the Long Run
Portfolios and Risk Premia for the Long Run
 
Skiena algorithm 2007 lecture02 asymptotic notation
Skiena algorithm 2007 lecture02 asymptotic notationSkiena algorithm 2007 lecture02 asymptotic notation
Skiena algorithm 2007 lecture02 asymptotic notation
 
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
Higher-order Factorization Machines(第5回ステアラボ人工知能セミナー)
 
CDT 22 slides.pdf
CDT 22 slides.pdfCDT 22 slides.pdf
CDT 22 slides.pdf
 
NIPS2007: learning using many examples
NIPS2007: learning using many examplesNIPS2007: learning using many examples
NIPS2007: learning using many examples
 
Captum Palisades UC 2011
Captum Palisades UC 2011Captum Palisades UC 2011
Captum Palisades UC 2011
 
chap4_imbalanced_classes.pptx
chap4_imbalanced_classes.pptxchap4_imbalanced_classes.pptx
chap4_imbalanced_classes.pptx
 
Lecture6
Lecture6Lecture6
Lecture6
 

More from SSA KPI

Germany presentation
Germany presentationGermany presentation
Germany presentationSSA KPI
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energySSA KPI
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainabilitySSA KPI
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentSSA KPI
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering educationSSA KPI
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginersSSA KPI
 
DAAD-10.11.2011
DAAD-10.11.2011DAAD-10.11.2011
DAAD-10.11.2011
SSA KPI
 
Talking with money
Talking with moneyTalking with money
Talking with money
SSA KPI
 
'Green' startup investment
'Green' startup investment'Green' startup investment
'Green' startup investment
SSA KPI
 
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea wavesFrom Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
SSA KPI
 
Dynamics of dice games
Dynamics of dice gamesDynamics of dice games
Dynamics of dice games
SSA KPI
 
Energy Security Costs
Energy Security CostsEnergy Security Costs
Energy Security Costs
SSA KPI
 
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environmentsNaturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
SSA KPI
 
Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5
SSA KPI
 
Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4
SSA KPI
 
Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3
SSA KPI
 
Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2
SSA KPI
 
Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1
SSA KPI
 
Fluorescent proteins in current biology
Fluorescent proteins in current biologyFluorescent proteins in current biology
Fluorescent proteins in current biology
SSA KPI
 
Neurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functionsNeurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functions
SSA KPI
 

More from SSA KPI (20)

Germany presentation
Germany presentationGermany presentation
Germany presentation
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energy
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainability
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable development
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering education
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginers
 
DAAD-10.11.2011
DAAD-10.11.2011DAAD-10.11.2011
DAAD-10.11.2011
 
Talking with money
Talking with moneyTalking with money
Talking with money
 
'Green' startup investment
'Green' startup investment'Green' startup investment
'Green' startup investment
 
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea wavesFrom Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
 
Dynamics of dice games
Dynamics of dice gamesDynamics of dice games
Dynamics of dice games
 
Energy Security Costs
Energy Security CostsEnergy Security Costs
Energy Security Costs
 
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environmentsNaturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
 
Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5
 
Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4
 
Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3
 
Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2
 
Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1
 
Fluorescent proteins in current biology
Fluorescent proteins in current biologyFluorescent proteins in current biology
Fluorescent proteins in current biology
 
Neurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functionsNeurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functions
 

Recently uploaded

Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
chanes7
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
Wasim Ak
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
The Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptxThe Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptx
DhatriParmar
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 

Recently uploaded (20)

Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
The Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptxThe Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptx
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 

Prediction of Credit Default by Continuous Optimization

  • 1. 4th International Summer School Achievements and Applications of Contemporary Informatics, Mathematics and Physics National University of Technology of the Ukraine Kiev, Ukraine, August 5-16, 2009 Prediction of Credit Default by Continuous Optimization Gerhard- Gerhard-Wilhelm Weber * Efsun Kürüm, Kasırga Yıldırak Institute of Applied Mathematics Middle East Technical University, Ankara, Turkey * Faculty of Economics, Management and Law, University of Siegen, Germany Center for Research on Optimization and Control, University of Aveiro, Portugal
  • 2. Outline • Main Problem from Credit Default • Logistic Regression and Performance Evaluation • Cut-Off Values and Thresholds • Classification and Optimization • Nonlinear Regression • Numerical Results • Outlook and Conclusion
  • 3. Main Problem from Credit Default Whether a credit application should be consented or rejected. Solution Learning about the default probability of the applicant.
  • 4. Main Problem from Credit Default Whether a credit application should be consented or rejected. Solution Learning about the default probability of the applicant.
  • 5. Logistic Regression  P(Y = 1 X = xl )  log  = β0 + β1 ⋅ xl1 + β2 ⋅ xl 2 + K + β p ⋅ xlp  P(Y = 0 X = x )   l  (l = 1, 2,..., N )
  • 6. Goal Our study is based on one of the Basel II criteria which recommend that the bank should divide corporate firms by 8 rating degrees with one of them being the default class. We have two problems to solve here: To distinguish the defaults from non-defaults. To put non-default firms in an order based on their credit quality and classify them into (sub) classes.
  • 7. Data Data have been collected by a bank from the firms operating in the manufacturing sector in Turkey. They cover the period between 2001 and 2006. There are 54 qualitative variables and 36 quantitative variables originally. Data on quantitative variables are formed based on a balance sheet submitted by the firms’ accountants. Essentially, they are the well-known financial ratios. The data set covers 3150 firms from which 92 are in the state of default. As the number of default is small, in order to overcome the possible statistical problems, we downsize the number to 551, keeping all the default cases in the set.
  • 8. We evaluate performance of the model non-default default cases cases cut-off value ROC curve test result value TPF, sensitivity FPF, 1-specificity
  • 9. Model outcome versus truth truth d n True Positive False Positive Fraction Fraction dı TPF FPF model outcome False Negative True Negative Fraction Fraction nı FNF TNF 1 1 total
  • 10. Definitions • sensitivity (TPF) := P( Dı | D) • specificity := P( NDı | ND ) • 1-specificity (FPF) := P( Dı | ND ) • points (TPF, FPF) constitute the ROC curve • c := cut-off value • c takes values between - ∞ and ∞ • TPF(c) := P( z>c | D ) • FPF(c) := P( z>c | ND )
  • 11. normal-deviate axes TPF Normal Deviate (TPF) FPF FPF (ci ) := Φ(ci ) TPF (ci ) := Φ(a + b ⋅ ci ) µn - µs σn a := b := σs σs Normal Deviate (FPF)
  • 12. normal-deviate axes TPF t Normal Deviate (TPF) FPF FPF (ci ) := Φ(ci ) TPF (ci ) := Φ(a + b ⋅ ci ) c µn - µs σn a := b := σs σs Normal Deviate (FPF)
  • 13. Classification Ex.: cut-off values actually non-default actually default cases cases c −∞ class I class II class III class IV class V ∞ To assess discriminative power of such a model, we calculate the Area Under (ROC) Curve: ∞ AUC := ∫ Φ(a + b ⋅ c) d Φ(c). −∞
  • 14. relationship between thresholds and cut-off values Ex.: TPF FPF t0 t1 t2 t3 t4 t5 R=5 Φ( c ) = t ⇔ c = Φ − 1 (t )
  • 15. Optimization in Credit Default Problem: Simultaneously to obtain the thresholds and the parameters a and b that maximize AUC, while balancing the size of the classes (regularization) and guaranteeing a good accuracy.
  • 16. Optimization Problem 2 1 -1 α 1 ⋅ ∫ Φ( a + b ⋅ Φ (t )) dt − α 2 ⋅∑   γi R −1  max − (ti +1 − ti )  i =0   n a,b,τ 0 ti +1 subject to ∫ Φ(a + b ⋅ Φ −1 (t ))d t ≥ δi (i = 0,1,..., R − 1) ti τ := ( t1 , t 2 ,..., t R -1 ) T t0 = 0, tR = 1
  • 17. Optimization Problem 2 1 -1 α 1 ⋅ ∫ Φ( a + b ⋅ Φ (t )) dt − α 2 ⋅∑   γi R −1  max − (ti +1 − ti )  i =0   n a,b,τ 0 ti +1 subject to ∫ Φ(a + b ⋅ Φ−1(t ))d t ≥ δi >0 (i = 0,1,..., R − 1) ti ⇒ ti +1 > ti τ := ( t1 , t 2 ,..., t R -1 ) T t0 = 0, tR = 1
  • 18. Over the ROC Curve TPF 1-AUC AUC FPF t0 t1 t2 t3 t4 t5 1 AOC : = ∫ (1 − Φ( a + b ⋅ Φ − 1 (t ))) dt 0
  • 19. New Version of the Optimization Problem R −1 2  γi  1 min α 2 ⋅ ∑  − (ti +1 − ti )  + α 1 ⋅ ∫ (1 − Φ(a + b ⋅ Φ −1 (t ))) dt i =0   a, b, τ n 0 subject to t j +1 ∫ (1− Φ(a + b ⋅Φ−1(t ))) dt ≤ t j +1 − t j − δ j ( j = 0,1, ..., R −1) t j
  • 20. Regression in Credit Default Optimization problem: Simultaneously to obtain the thresholds and the parameters a and b that maximize AUC, while balancing the size of the classes (regularization) and guaranteeing a good accuracy discretization of integral nonlinear regression problem
  • 21. Discretization of the Integral Riemann-Stieltjes integral ∞ AUC = ∫ Φ (a + b ⋅ c ) dΦ(c ) −∞ Riemann integral 1 AUC = ∫ Φ( a + b ⋅ Φ −1 (t )) dt 0 Discretization R AUC ≈ ∑ Φ( a + b ⋅ Φ −1 (tk )) ⋅ ∆tk k =1
  • 22. Optimization Problem with Penalty Parameters In the case of violation of anyone of these constraints, we introduce penalty parameters. As some penalty becomes increased, the iterates are forced towards the feasible set of the optimization problem. R −1 2  γi  1 ΠΘ ( a,b, τ ) := α 2 ⋅ ∑  − (ti +1 − ti )  − α 1 ⋅ ∫ (1 - Φ( a + b ⋅ Φ -1 (t ))) dt + i =0   n 0 R -1   t j +1  −1 α 3 ⋅ ∑ θ j ⋅  δ j −  ∫ Φ( a + b ⋅ Φ (t ))) dt     tj  j =0   1444442444444 4  3 =: Ψ j ( a , b , τ ) Θ := ( θ1 , θ 2 , ..., θ R − 1 ) T θj ≥0 ( j = 0,1, ..., R − 1)
  • 23. Optimization Problem further discretized R−1 2  γi R  ΠΘ(a,b,τ ) = α2 ⋅ ∑  − (ti+1 −ti )  + α1 ⋅ ∑( (1-Φ(a + b⋅Φ−1(t j ))) ∆t j )2 + i =0   n j =1  2  nj     R-1 α 3. ∑ θ j  ∑  −1(ην ) ) − δ j  ν   Φ(a + b ⋅Φ  j   ∆η j j =0 ν=0   t j +1 − t j         
  • 24. Optimization Problem further discretized R−1 2  γi R  ΠΘ(a,b,τ ) = α2 ⋅ ∑  − (ti+1 −ti )  + α1 ⋅ ∑( (1-Φ(a + b⋅Φ−1(t j ))) ∆t j )2 + i =0   n j =1  2 R-1  nj  −1(ην ) ) − δ j  ∆ην   α 3 . ∑ θ j  ∑   Φ(a + b ⋅Φ   j    ν=0   j  t j +1 − t j  j =0      
  • 25. Nonlinear Regression 2 ∑ j ( j ) N min f ( β ) =  d − g x ,β   j =1  N =: ∑ f j2 ( β ) j =1 F ( β ) := ( f1 ( β ),..., f N ( β ) ) T min f ( β ) = F T ( β ) F ( β )
  • 26. Nonlinear Regression β k +1 := β k + qk • Gauss-Newton method : ∇F ( β )∇T F ( β )q = −∇F ( β ) F ( β ) • Levenberg-Marquardt method : λ ≥0 ( ) ∇F ( β )∇T F (β ) + λ I p q = −∇F ( β ) F ( β )
  • 27. Nonlinear Regression alternative solution min t, t,q subject to ( ∇F (β )∇ T ) F ( β ) + λ I p q − ( −∇F ( β ) F ( β ) ) 2 ≤ t , t ≥ 0, || Lq || 2 ≤ M conic quadratic programming
  • 28. Nonlinear Regression alternative solution min t, t,q subject to ( ∇F (β )∇ T ) F ( β ) + λ I p q − ( −∇F ( β ) F ( β ) ) 2 ≤ t , t ≥ 0, || Lq || 2 ≤ M conic quadratic programming interior point methods
  • 29. Numerical Results Initial Parameters a b Threshold values (t) 1 0.95 0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35 1.5 0.85 0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35 0.80 0.95 0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35 2 0.70 0.0006 0.0015 0.0035 0.01 0.035 0.11 0.35 Optimization Results a b Threshold values (t) AUC 0.9999 0.9501 0.0004 0.0020 0.0032 0.012 0.03537 0.09 0.3400 0.8447 1.4999 0.8501 0.0003 0.0017 0.0036 0.011 0.03537 0.10 0.3500 0.9167 0.7999 0.9501 0.0004 0.0018 0.0032 0.011 0.03400 0.10 0.3300 0.8138 2.0001 0.7001 0.0004 0.0020 0.0031 0.012 0.03343 0.11 0.3400 0.9671
  • 30. Numerical Results Accuracy Error in Each Class I II III IV V VI VII VIII 0.0000 0.0000 0.0000 0.0001 0.0001 0.0010 0.0010 0.0075 0.0000 0.0000 0.0000 0.0001 0.0001 0.0010 0.0018 0.0094 0.0000 0.0000 0.0000 0.0000 0.0001 0.0002 0.0018 0.0059 0.0000 0.0000 0.0000 0.0001 0.0001 0.0006 0.0018 0.0075 Number of Firms in Each Class I II III IV V VI VII VIII 4 56 27 133 115 102 129 61 2 42 52 120 119 111 120 61 4 43 40 129 114 116 120 61 4 56 24 136 106 129 111 61 Number of firms in each class at the beginning: 10, 26, 58, 106, 134, 121, 111, 61
  • 31. Generalized Additive Models http://144.122.137.55/gweber/
  • 33. References Aster, A., Borchers, B., and Thurber, C., Parameter Estimation and Inverse Problems, Academic Press, 2004. Boyd, S., and Vandenberghe, L., Convex Optimization, Cambridge University Press, 2004. Buja, A., Hastie, T., and Tibshirani, R., Linear smoothers and additive models, The Ann. Stat. 17, 2 (1989) 453-510. Fox, J., Nonparametric regression, Appendix to an R and S-Plus Companion to Applied Regression, Sage Publications, 2002. Friedman, J.H., Multivariate adaptive regression splines, Annals of Statistics 19, 1 (1991) 1-141. Friedman, J.H., and Stuetzle, W., Projection pursuit regression, J. Amer. Statist Assoc. 76 (1981) 817-823. Hastie, T., and Tibshirani, R., Generalized additive models, Statist. Science 1, 3 (1986) 297-310. Hastie, T., and Tibshirani, R., Generalized additive models: some applications, J. Amer. Statist. Assoc. 82, 398 (1987) 371-386. Hastie, T., Tibshirani, R., and Friedman, J.H., The Element of Statistical Learning, Springer, 2001. Hastie, T.J., and Tibshirani, R.J., Generalized Additive Models, New York, Chapman and Hall, 1990. Nash, G., and Sofer, A., Linear and Nonlinear Programming, McGraw-Hill, New York, 1996. Nemirovski, A., Lectures on modern convex optimization, Israel Institute of Technology (2002).
  • 34. References Nemirovski, A., Modern Convex Optimization, lecture notes, Israel Institute of Technology (2005). Nesterov, Y.E , and Nemirovskii, A.S., Interior Point Methods in Convex Programming, SIAM, 1993. Önalan, Ö., Martingale measures for NIG Lévy processes with applications to mathematical finance, presentation in: Advanced Mathematical Methods for Finance, Side, Antalya, Turkey, April 26-29, 2006. Taylan, P., Weber, G.-W., and Yerlikaya, F., A new approach to multivariate adaptive regression spline by using Tikhonov regularization and continuous optimization, to appear in TOP, Selected Papers at the Occasion of 20th EURO Mini Conference (Neringa, Lithuania, May 20-23, 2008). Stone, C.J., Additive regression and other nonparametric models, Annals of Statistics 13, 2 (1985) 689-705. Weber, G.-W., Taylan, P., Akteke-Öztürk, B., and Uğur, Ö., Mathematical and data mining contributions dynamics and optimization of gene-environment networks, in the special issue Organization in Matter from Quarks to Proteins of Electronic Journal of Theoretical Physics. Weber, G.-W., Taylan, P., Yıldırak, K., and Görgülü, Z.K., Financial regression and organization, to appear in the Special Issue on Optimization in Finance, of DCDIS-B (Dynamics of Continuous, Discrete and Impulsive Systems (Series B)).