Neural nets

Neural Nets
Kevin Bernhardt
Troy Galvin
Brian Galvin

What is it?
 Based on neuron network in the brain – learning

capability.
 Classification/prediction
 Supports very complex relationship between

predictors and a response.....it learns.

Courtesy of brainblogger.com

Structure - explained
Input Layer
Output Layer

Hidden Layers

Nodes
multilayer feedforward networks

Structure – explained (cont.)
weight

bias
node

Structure – explained (cont.)
 Input nodes
 Consist of the predictors (ex. #bedrooms, ft2 for

home prices)
 Hidden Layer nodes
 Each Hidden Layer node receives input from all

Input nodes
 Output: g(weighted sums of inputs + bias node)
 Where g() = Linear, exponential, or logistic/sigmoidal

function

 Output nodes
 Output = the prediction of the model
 Output: Same equation as Hidden Layer node
 Use a cutoff value for binary responses

Rules of thumb
 Normalize numerical variables

; where variable is within range of



[a,b], (a<b)
 Create dummy variables for categorical

variables
 Ordinal: m fractions between [0,1]
 Nominal: transform into m-1 dummies

 p predictors = p nodes
 Setting weights/node bias‟: Start at 0.00 ± 0.05

 Transform highly skewed predictors. Ex. log

transform

Training the Model
 Back Propagation
 Algorithm – most popular to update weights

(Learning)
 Uses difference between predicted and actual value
(error) to determine the weight
 Weight – skews the input values to each node
 This is distributed evenly to each hidden node back through

the system

 Two updating methods
 Case updating – after each observation (or trial)
 Completely running through every observation (or trial) is
called an Epoch or Sweep or Iteration
 Batch updating – Entire training set is run then sum of
errors is used

Training the Model
 When do we stop?
 We never stop learning…. jk
1. Only incremental differences (diminished return)
2. Misclassification has hit reasonable threshold
3. You cant run no mo….. You have reached your

limit
 Now to the Neural Net

Avoiding Overfitting
 Causes error rate to be too large
 Important to limit the number of training epochs
 Detected by examining the performance of the

validation set and seeing when it starts to
deteriorate

Required User Input
 Deciding the network architecture
 Specify number of hidden layers and nodes in

each layer
 Trial-and-error based on experience

Recommendations
 Number of hidden layers - should usually be 1

because even just one layer can capture complex
relationships between predictors
 Size of hidden layer – start with the same number
of nodes as your number of predictors, then
decrease or increase accordingly. Too many can
lead to overfitting
 Number of Outputs – should equal the number of
classes. For binary, a single node is sufficient.

Advantages of Neural Networks
 Good predictive performance
 Due to ability to capture complicated relationships

and high tolerance to noisy data

Weaknesses
 Cannot provide insight into structure of the

relationship
 Flexibility becomes a weakness when dealing
with small training sets
 Long run-time can hinder performance when realtime predictions are necessary

Facebook and Neural Networks
 Facebook‟s AI Team is building neural networks

to better target advertising and News Feed
inclusion
 “it has to be able to turn „I <3 u babe‟ into a series
of machine learning events, from an increase to
said babe‟s visibility in the News Feed to an alert
if she changes her relationship status.”
 “Deep learning is about making data analysis
sophisticated enough to derive your personality
from your natural social output.”


http://www.extremetech.com/computing/167179-facebook-is-working-on-deep-learning-neural-networks-to-learn-even-more-about-yourpersonal-life

Neural nets

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Neural nets

Similar to Neural nets (20)

Recently uploaded

Recently uploaded (20)

Neural nets