SlideShare a Scribd company logo
1 of 32
Radial Basis Function (RBF)


       Syed Zohaib Hassan
       Muhammad Awais Bawazir
CONTENTS

 Introduction
 Architecture
 Types of RBF
 Example
 Advantages/Disadvantages
Radial Basis Function (RBF)

A radial basis function (RBF) is a real-valued
 function whose value depends only on the distance
 from the origin, so that ᶲ(x)=ᶲ(|x|) ; or alternatively
 on the distance from some other point c, called a
 center, so that ᶲ(x,c)=ᶲ(|x-c|). Any function that
 satisfies the property is a radial function.
RBF network

This is becoming an increasingly popular neural
 network with diverse applications and is probably
 the main rival to the multi-layered perceptron
 Much of the inspiration for RBF networks has
 come from traditional statistical pattern
 classification techniques
Radial Basis Function Network

Approximate function with linear combination of
 Radial basis functions


           F(x) = Σ wi h(x)

h(x) is mostly Gaussian function
Radial basis function

         m

f(x) = ∑ w j h j (x)
        j=
        1


h j (x) = exp( -(x-c j ) 2 / r j 2 )

             Where c j is center of a region,
                     r j is width of the receptive field
Architecture


  x1        h1
                   W1
  x2        h2
                   W2
  x3        h3     W3         f(x)
                   Wm
  xn        hm



Input   Hidden layer    Output layer
layer
Three layers
 Input layer
  o   Source nodes that connect to the network to its
      environment
 Hidden layer
o Hidden units provide a set of basis function
o hidden layer performs a non-linear mapping
  from the input space into a (usually) higher
  dimensional space in which the patterns
  become linearly separable.
 Output layer
  o   Linear combination of hidden functions
Clustering

The unique feature of the RBF network is the
 process performed in the hidden layer.
The idea is that the patterns in the input space
 form clusters.
If the centres of these clusters are known, then the
 distance from the cluster centre can be measured.
Furthermore, this distance measure is made non-
 linear, so that if a pattern is in an area that is close
 to a cluster centre it gives a value close to 1.
Types of RBF

 Gaussian
 Multiquadric
 Inverse quadratic
 Inverse multiquadric
 Polyharmonic spline
 Thin plate spline
Gaussian function

The most commonly used radial-basis function is a
 Gaussian function
In a RBF network, r is the distance from the
 cluster centre.
The equation represents a Gaussian bell-shaped
 curve, as shown in Fig.
1
  0.9
  0.8
  0.7
φ 0.6
  0.5
  0.4
  0.3
  0.2
  0.1
    0
                                         0




                                                                     1
                                                   0.4

                                                         0.6

                                                               0.8
                                             0.2
        -1

             -0.8




                                  -0.2
                    -0.6

                           -0.4




                                         x
Distance measure

The distance measured from the cluster centre is
 usually the Euclidean distance.
For each neuron in the hidden layer, the weights
 represent the co-ordinates of the centre of the
 cluster.
Therefore, when that neuron receives an input
 pattern, X, the distance is found using the
 following equation:
Distance measure



        n
rj =   ∑ (x − w )
       i =1
              i      ij
                          2
Polyharmonic splines

 Polyharmonic splines are used for function
 approximation and data interpolation. They are
 very useful for interpolation of scattered data in
 many dimension
Multiquadric RBF




         r + ( x − c)
          2              2
h( x ) =
             r
Width of hidden unit basis function


                                          n

                                        ∑ (x    i   − wij )   2


     (hidden _ unit )ϕ j = exp(−         i =1
                                                                  )
                                                2σ   2



The variable sigma, σ, defines the width or radius of the
bell-shape and is something that has to be determined
empirically. When the distance from the centre of the
Gaussian reaches σ, the output drops from 1 to 0.6.
Basis functions of polyharmonic splines
    are radial basis functions of the for
Example

An often quoted example which shows how the
 RBF network can handle a non-linearly separable
 function is the exclusive-or problem.
One solution has 2 inputs, 2 hidden units and 1
 output.
The centres for the two hidden units are set at c1 =
 0,0 and c2 = 1,1, and the value of radius σ is chosen
 such that 2σ2 = 1.
Example

The inputs are x, the distances from the centres
 squared are r, and the outputs from the hidden
 units are ϕ.
When all four examples of input patterns are
 shown to the network, the outputs of the two
 hidden units are shown in the following table.
x1   x2   r1   r2   ϕ1    ϕ2




0    0    0    2    1     0.1




0    1    1    1    0.4   0.4




1    0    1    1    0.4   0.4




1    1    2    0    0.1   1
Example

Next Fig. shows the position of the four input
 patterns using the output of the two hidden units
 as the axes on the graph - it can be seen that the
 patterns are now linearly separable.
This is an ideal solution - the centres were chosen
 carefully to show this result.
Methods generally adopted for learning in an RBF
 network would find it impossible to arrive at those
 centre values - later learning methods that are
 usually adopted will be described.
φ1
 1




0.5
          XX




      0    0.5   1 φ2
Training hidden layer

The hidden layer in a RBF network has units which
 have weights that correspond to the vector
 representation of the centre of a cluster.
These weights are found either using a traditional
 clustering algorithm such as the k-means
 algorithm, or adaptively using essentially the
 Kohonen algorithm.
Training hidden layer

In either case, the training is unsupervised but the
 number of clusters that you expect, k, is set in
 advance. The algorithms then find the best fit to
 these clusters.
The k -means algorithm will be briefly outlined.
Initially k points in the pattern space are randomly
 set.
Training hidden layer

Then for each item of data in the training set, the
 distances are found from all of the k centres.
The closest centre is chosen for each item of data -
 this is the initial classification, so all items of data
 will be assigned a class from 1 to k.
Then, for all data which has been found to be class
 1, the average or mean values are found for each of
 co-ordinates.
Training hidden layer

These become the new values for the centre
 corresponding to class 1.
Repeated for all data found to be in class 2, then
 class 3 and so on until class k is dealt with - we now
 have k new centres.
Process of measuring the distance between the
 centres and each item of data and re-classifying the
 data is repeated until there is no further change –
 i.e. the sum of the distances monitored and
 training halts when the total distance no longer
 falls.
Adaptive k-means

The alternative is to use an adaptive k -means
 algorithm which similar to Kohonen learning.
Input patterns are presented to all of the cluster
 centres one at a time, and the cluster centres
 adjusted after each one. The cluster centre that is
 nearest to the input data wins, and is shifted
 slightly towards the new data.
This has the advantage that you don’t have to store
 all of the training data so can be done on-line.
Finding radius of Gaussians

Having found the cluster centres using one or
 other of these methods, the next step is
 determining the radius of the Gaussian curves.
This is usually done using the P-nearest neighbour
 algorithm.
A number P is chosen, and for each centre, the P
 nearest centres are found.
Finding radius of Gaussians

The root-mean squared distance between the
 current cluster centre and its P nearest neighbours
 is calculated, and this is the value chosen for σ.
So, if the current cluster centre is cj, the value is:



                        P
          1
      σj=   ∑ (ck − ci )
          P i =1
                         2
Finding radius of Gaussians

                                    σ is set to
A typical value for P is 2, in which case
 be the average distance from the two nearest
 neighbouring cluster centres.
Advantages/Disadvantages

RBF trains faster than a MLP
Another advantage that is claimed is that the
 hidden layer is easier to interpret than the hidden
 layer in an MLP.
Although the RBF is quick to train, when training
 is finished and it is being used it is slower than a
 MLP, so where speed is a factor a MLP may be
 more appropriate.

More Related Content

What's hot

Case Study of Convolutional Neural Network
Case Study of Convolutional Neural NetworkCase Study of Convolutional Neural Network
Case Study of Convolutional Neural NetworkNamHyuk Ahn
 
Self Organizing Maps: Fundamentals
Self Organizing Maps: FundamentalsSelf Organizing Maps: Fundamentals
Self Organizing Maps: FundamentalsSpacetoshare
 
201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture SearchDaeJin Kim
 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with PerformersJoonhyung Lee
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural NetworkOffline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural Networkijaia
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetSungminYou
 
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationReview : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationDongmin Choi
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMLAI2
 
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...Tahmid Abtahi
 
Review-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningReview-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningTrong-An Bui
 
Performance Analysis of Various Activation Functions in Generalized MLP Archi...
Performance Analysis of Various Activation Functions in Generalized MLP Archi...Performance Analysis of Various Activation Functions in Generalized MLP Archi...
Performance Analysis of Various Activation Functions in Generalized MLP Archi...Waqas Tariq
 
Convolutional neural networks
Convolutional neural networks Convolutional neural networks
Convolutional neural networks Roozbeh Sanaei
 
Modern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationModern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationGioele Ciaparrone
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNNShuai Zhang
 

What's hot (20)

Cnn
CnnCnn
Cnn
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
 
Case Study of Convolutional Neural Network
Case Study of Convolutional Neural NetworkCase Study of Convolutional Neural Network
Case Study of Convolutional Neural Network
 
Self Organizing Maps: Fundamentals
Self Organizing Maps: FundamentalsSelf Organizing Maps: Fundamentals
Self Organizing Maps: Fundamentals
 
201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search201907 AutoML and Neural Architecture Search
201907 AutoML and Neural Architecture Search
 
Rethinking Attention with Performers
Rethinking Attention with PerformersRethinking Attention with Performers
Rethinking Attention with Performers
 
Offline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural NetworkOffline Character Recognition Using Monte Carlo Method and Neural Network
Offline Character Recognition Using Monte Carlo Method and Neural Network
 
Convolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNetConvolutional neural network from VGG to DenseNet
Convolutional neural network from VGG to DenseNet
 
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic SegmentationReview : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
 
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
A Framework for Scene Recognition Using Convolutional Neural Network as Featu...
 
Review-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learningReview-image-segmentation-by-deep-learning
Review-image-segmentation-by-deep-learning
 
Performance Analysis of Various Activation Functions in Generalized MLP Archi...
Performance Analysis of Various Activation Functions in Generalized MLP Archi...Performance Analysis of Various Activation Functions in Generalized MLP Archi...
Performance Analysis of Various Activation Functions in Generalized MLP Archi...
 
Cnn method
Cnn methodCnn method
Cnn method
 
Sefl Organizing Map
Sefl Organizing MapSefl Organizing Map
Sefl Organizing Map
 
Deep learning
Deep learningDeep learning
Deep learning
 
Convolutional neural networks
Convolutional neural networks Convolutional neural networks
Convolutional neural networks
 
Cnn
CnnCnn
Cnn
 
Modern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationModern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentation
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNN
 

Viewers also liked

Introduction to Radial Basis Function Networks
Introduction to Radial Basis Function NetworksIntroduction to Radial Basis Function Networks
Introduction to Radial Basis Function NetworksESCOM
 
Artificial neural networks
Artificial neural networksArtificial neural networks
Artificial neural networksstellajoseph
 
LLNL Poster Symposium 2015
LLNL Poster Symposium 2015LLNL Poster Symposium 2015
LLNL Poster Symposium 2015Andrew Dublin
 
Fokker–Planck equation and DPD simulations
Fokker–Planck equation and DPD simulationsFokker–Planck equation and DPD simulations
Fokker–Planck equation and DPD simulationsKotaro Tanahashi
 
Radial Basis Function Interpolation
Radial Basis Function InterpolationRadial Basis Function Interpolation
Radial Basis Function InterpolationJesse Bettencourt
 
Counter propagation Network
Counter propagation NetworkCounter propagation Network
Counter propagation NetworkAkshay Dhole
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkMuhammad Ishaq
 
Adaptive Resonance Theory
Adaptive Resonance TheoryAdaptive Resonance Theory
Adaptive Resonance TheoryNaveen Kumar
 
lecture 30
lecture 30lecture 30
lecture 30sajinsc
 
Gradient Search Cauchy Method
Gradient Search Cauchy MethodGradient Search Cauchy Method
Gradient Search Cauchy MethodAnuja Joshi
 
4 stochastic processes
4 stochastic processes4 stochastic processes
4 stochastic processesSolo Hermelin
 
Artificial Neural Networks
Artificial Neural NetworksArtificial Neural Networks
Artificial Neural NetworksSaif Al-Kalbani
 
Stochastic modelling and its applications
Stochastic modelling and its applicationsStochastic modelling and its applications
Stochastic modelling and its applicationsKartavya Jain
 
WCSMO-WFLO-2015-mehmani
WCSMO-WFLO-2015-mehmaniWCSMO-WFLO-2015-mehmani
WCSMO-WFLO-2015-mehmaniOptiModel
 
Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Mostafa G. M. Mostafa
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkBurhan Muzafar
 

Viewers also liked (20)

Introduction to Radial Basis Function Networks
Introduction to Radial Basis Function NetworksIntroduction to Radial Basis Function Networks
Introduction to Radial Basis Function Networks
 
Artificial neural networks
Artificial neural networksArtificial neural networks
Artificial neural networks
 
LLNL Poster Symposium 2015
LLNL Poster Symposium 2015LLNL Poster Symposium 2015
LLNL Poster Symposium 2015
 
Fokker–Planck equation and DPD simulations
Fokker–Planck equation and DPD simulationsFokker–Planck equation and DPD simulations
Fokker–Planck equation and DPD simulations
 
Radial Basis Function Interpolation
Radial Basis Function InterpolationRadial Basis Function Interpolation
Radial Basis Function Interpolation
 
main
mainmain
main
 
Counter propagation Network
Counter propagation NetworkCounter propagation Network
Counter propagation Network
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
Adaptive Resonance Theory
Adaptive Resonance TheoryAdaptive Resonance Theory
Adaptive Resonance Theory
 
lecture 30
lecture 30lecture 30
lecture 30
 
Gradient Search Cauchy Method
Gradient Search Cauchy MethodGradient Search Cauchy Method
Gradient Search Cauchy Method
 
Meetup_talk
Meetup_talkMeetup_talk
Meetup_talk
 
4 stochastic processes
4 stochastic processes4 stochastic processes
4 stochastic processes
 
Artificial Neural Networks
Artificial Neural NetworksArtificial Neural Networks
Artificial Neural Networks
 
Stochastic modelling and its applications
Stochastic modelling and its applicationsStochastic modelling and its applications
Stochastic modelling and its applications
 
WCSMO-WFLO-2015-mehmani
WCSMO-WFLO-2015-mehmaniWCSMO-WFLO-2015-mehmani
WCSMO-WFLO-2015-mehmani
 
Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)Neural Networks: Radial Bases Functions (RBF)
Neural Networks: Radial Bases Functions (RBF)
 
hopfield neural network
hopfield neural networkhopfield neural network
hopfield neural network
 
Hopfield Networks
Hopfield NetworksHopfield Networks
Hopfield Networks
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 

Similar to RBF Neural Network Explained in 40 Characters

Classification of Iris Data using Kernel Radial Basis Probabilistic Neural N...
Classification of Iris Data using Kernel Radial Basis Probabilistic  Neural N...Classification of Iris Data using Kernel Radial Basis Probabilistic  Neural N...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural N...Scientific Review SR
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Scientific Review
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier홍배 김
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Recognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkRecognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkeSAT Journals
 
Recognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkRecognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkeSAT Publishing House
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function홍배 김
 
Knowledge extraction from support vector machines
Knowledge extraction from support vector machinesKnowledge extraction from support vector machines
Knowledge extraction from support vector machinesEyad Alshami
 
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.Anirbit Mukherjee
 
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...cscpconf
 
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...ArchiLab 7
 
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...ArchiLab 7
 
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...Anirbit Mukherjee
 
An evolutionary method for constructing complex SVM kernels
An evolutionary method for constructing complex SVM kernelsAn evolutionary method for constructing complex SVM kernels
An evolutionary method for constructing complex SVM kernelsinfopapers
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsNaughty Dog
 
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdf
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdfNEURALNETWORKS_DM_SOWMYAJYOTHI.pdf
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdfSowmyaJyothi3
 
Introduction to Support Vector Machines
Introduction to Support Vector MachinesIntroduction to Support Vector Machines
Introduction to Support Vector MachinesSilicon Mentor
 

Similar to RBF Neural Network Explained in 40 Characters (20)

Circuitanlys2
Circuitanlys2Circuitanlys2
Circuitanlys2
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural N...
Classification of Iris Data using Kernel Radial Basis Probabilistic  Neural N...Classification of Iris Data using Kernel Radial Basis Probabilistic  Neural N...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural N...
 
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
Classification of Iris Data using Kernel Radial Basis Probabilistic Neural Ne...
 
Anomaly detection using deep one class classifier
Anomaly detection using deep one class classifierAnomaly detection using deep one class classifier
Anomaly detection using deep one class classifier
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
tutorial.ppt
tutorial.ppttutorial.ppt
tutorial.ppt
 
Recognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkRecognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural network
 
Recognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural networkRecognition of handwritten digits using rbf neural network
Recognition of handwritten digits using rbf neural network
 
The world of loss function
The world of loss functionThe world of loss function
The world of loss function
 
Knowledge extraction from support vector machines
Knowledge extraction from support vector machinesKnowledge extraction from support vector machines
Knowledge extraction from support vector machines
 
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.
My 2hr+ survey talk at the Vector Institute, on our deep learning theorems.
 
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...
DESIGN AND IMPLEMENTATION OF BINARY NEURAL NETWORK LEARNING WITH FUZZY CLUSTE...
 
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...
Fessant aknin oukhellou_midenet_2001:comparison_of_supervised_self_organizing...
 
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...
Erwin. e. obermayer k._schulten. k. _1992: self-organising maps_stationary st...
 
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
My invited talk at the 2018 Annual Meeting of SIAM (Society of Industrial and...
 
An evolutionary method for constructing complex SVM kernels
An evolutionary method for constructing complex SVM kernelsAn evolutionary method for constructing complex SVM kernels
An evolutionary method for constructing complex SVM kernels
 
Practical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT MethodsPractical Spherical Harmonics Based PRT Methods
Practical Spherical Harmonics Based PRT Methods
 
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdf
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdfNEURALNETWORKS_DM_SOWMYAJYOTHI.pdf
NEURALNETWORKS_DM_SOWMYAJYOTHI.pdf
 
Introduction to Support Vector Machines
Introduction to Support Vector MachinesIntroduction to Support Vector Machines
Introduction to Support Vector Machines
 
Bz02516281633
Bz02516281633Bz02516281633
Bz02516281633
 

Recently uploaded

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Recently uploaded (20)

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

RBF Neural Network Explained in 40 Characters

  • 1. Radial Basis Function (RBF) Syed Zohaib Hassan Muhammad Awais Bawazir
  • 2. CONTENTS  Introduction  Architecture  Types of RBF  Example  Advantages/Disadvantages
  • 3. Radial Basis Function (RBF) A radial basis function (RBF) is a real-valued function whose value depends only on the distance from the origin, so that ᶲ(x)=ᶲ(|x|) ; or alternatively on the distance from some other point c, called a center, so that ᶲ(x,c)=ᶲ(|x-c|). Any function that satisfies the property is a radial function.
  • 4. RBF network This is becoming an increasingly popular neural network with diverse applications and is probably the main rival to the multi-layered perceptron  Much of the inspiration for RBF networks has come from traditional statistical pattern classification techniques
  • 5. Radial Basis Function Network Approximate function with linear combination of Radial basis functions F(x) = Σ wi h(x) h(x) is mostly Gaussian function
  • 6. Radial basis function m f(x) = ∑ w j h j (x) j= 1 h j (x) = exp( -(x-c j ) 2 / r j 2 ) Where c j is center of a region, r j is width of the receptive field
  • 7. Architecture x1 h1 W1 x2 h2 W2 x3 h3 W3 f(x) Wm xn hm Input Hidden layer Output layer layer
  • 8. Three layers  Input layer o Source nodes that connect to the network to its environment  Hidden layer o Hidden units provide a set of basis function o hidden layer performs a non-linear mapping from the input space into a (usually) higher dimensional space in which the patterns become linearly separable.  Output layer o Linear combination of hidden functions
  • 9. Clustering The unique feature of the RBF network is the process performed in the hidden layer. The idea is that the patterns in the input space form clusters. If the centres of these clusters are known, then the distance from the cluster centre can be measured. Furthermore, this distance measure is made non- linear, so that if a pattern is in an area that is close to a cluster centre it gives a value close to 1.
  • 10. Types of RBF  Gaussian  Multiquadric  Inverse quadratic  Inverse multiquadric  Polyharmonic spline  Thin plate spline
  • 11. Gaussian function The most commonly used radial-basis function is a Gaussian function In a RBF network, r is the distance from the cluster centre. The equation represents a Gaussian bell-shaped curve, as shown in Fig.
  • 12. 1 0.9 0.8 0.7 φ 0.6 0.5 0.4 0.3 0.2 0.1 0 0 1 0.4 0.6 0.8 0.2 -1 -0.8 -0.2 -0.6 -0.4 x
  • 13. Distance measure The distance measured from the cluster centre is usually the Euclidean distance. For each neuron in the hidden layer, the weights represent the co-ordinates of the centre of the cluster. Therefore, when that neuron receives an input pattern, X, the distance is found using the following equation:
  • 14. Distance measure n rj = ∑ (x − w ) i =1 i ij 2
  • 15. Polyharmonic splines  Polyharmonic splines are used for function approximation and data interpolation. They are very useful for interpolation of scattered data in many dimension
  • 16. Multiquadric RBF r + ( x − c) 2 2 h( x ) = r
  • 17. Width of hidden unit basis function n ∑ (x i − wij ) 2 (hidden _ unit )ϕ j = exp(− i =1 ) 2σ 2 The variable sigma, σ, defines the width or radius of the bell-shape and is something that has to be determined empirically. When the distance from the centre of the Gaussian reaches σ, the output drops from 1 to 0.6.
  • 18. Basis functions of polyharmonic splines are radial basis functions of the for
  • 19. Example An often quoted example which shows how the RBF network can handle a non-linearly separable function is the exclusive-or problem. One solution has 2 inputs, 2 hidden units and 1 output. The centres for the two hidden units are set at c1 = 0,0 and c2 = 1,1, and the value of radius σ is chosen such that 2σ2 = 1.
  • 20. Example The inputs are x, the distances from the centres squared are r, and the outputs from the hidden units are ϕ. When all four examples of input patterns are shown to the network, the outputs of the two hidden units are shown in the following table.
  • 21. x1 x2 r1 r2 ϕ1 ϕ2 0 0 0 2 1 0.1 0 1 1 1 0.4 0.4 1 0 1 1 0.4 0.4 1 1 2 0 0.1 1
  • 22. Example Next Fig. shows the position of the four input patterns using the output of the two hidden units as the axes on the graph - it can be seen that the patterns are now linearly separable. This is an ideal solution - the centres were chosen carefully to show this result. Methods generally adopted for learning in an RBF network would find it impossible to arrive at those centre values - later learning methods that are usually adopted will be described.
  • 23. φ1 1 0.5 XX 0 0.5 1 φ2
  • 24. Training hidden layer The hidden layer in a RBF network has units which have weights that correspond to the vector representation of the centre of a cluster. These weights are found either using a traditional clustering algorithm such as the k-means algorithm, or adaptively using essentially the Kohonen algorithm.
  • 25. Training hidden layer In either case, the training is unsupervised but the number of clusters that you expect, k, is set in advance. The algorithms then find the best fit to these clusters. The k -means algorithm will be briefly outlined. Initially k points in the pattern space are randomly set.
  • 26. Training hidden layer Then for each item of data in the training set, the distances are found from all of the k centres. The closest centre is chosen for each item of data - this is the initial classification, so all items of data will be assigned a class from 1 to k. Then, for all data which has been found to be class 1, the average or mean values are found for each of co-ordinates.
  • 27. Training hidden layer These become the new values for the centre corresponding to class 1. Repeated for all data found to be in class 2, then class 3 and so on until class k is dealt with - we now have k new centres. Process of measuring the distance between the centres and each item of data and re-classifying the data is repeated until there is no further change – i.e. the sum of the distances monitored and training halts when the total distance no longer falls.
  • 28. Adaptive k-means The alternative is to use an adaptive k -means algorithm which similar to Kohonen learning. Input patterns are presented to all of the cluster centres one at a time, and the cluster centres adjusted after each one. The cluster centre that is nearest to the input data wins, and is shifted slightly towards the new data. This has the advantage that you don’t have to store all of the training data so can be done on-line.
  • 29. Finding radius of Gaussians Having found the cluster centres using one or other of these methods, the next step is determining the radius of the Gaussian curves. This is usually done using the P-nearest neighbour algorithm. A number P is chosen, and for each centre, the P nearest centres are found.
  • 30. Finding radius of Gaussians The root-mean squared distance between the current cluster centre and its P nearest neighbours is calculated, and this is the value chosen for σ. So, if the current cluster centre is cj, the value is: P 1 σj= ∑ (ck − ci ) P i =1 2
  • 31. Finding radius of Gaussians σ is set to A typical value for P is 2, in which case be the average distance from the two nearest neighbouring cluster centres.
  • 32. Advantages/Disadvantages RBF trains faster than a MLP Another advantage that is claimed is that the hidden layer is easier to interpret than the hidden layer in an MLP. Although the RBF is quick to train, when training is finished and it is being used it is slower than a MLP, so where speed is a factor a MLP may be more appropriate.