TrisZaska's Machine Learning Blog

Why we need the bias term


The bias term

Look at the picture of the Perceptron and equation \((1)\), where we have an extra term \(w_0x_0\). This is called the bias, so what is it and why do we definitely need it?
First, \(x_0\) is a constant that always equals \(1\), while \(w_0\) is a weight like any other in the Perceptron, so it is updated during learning as well. So what is the role of the bias in a neural network?
The bias allows us to shift the unit step function to the left or right, which is often necessary for successful learning. Usually in the Perceptron we set \(x_0 = 1\) and \(w_0 = -\theta\); look at the images below for an intuition about it.
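To make the shift explicit (a short derivation, using \(\theta\) for the firing threshold as above): the decision rule

\[
\mathbf{w^Tx} \ge \theta
\quad\Longleftrightarrow\quad
\mathbf{w^Tx} - \theta \ge 0
\quad\Longleftrightarrow\quad
w_0x_0 + \mathbf{w^Tx} \ge 0
\]

with \(x_0 = 1\) and \(w_0 = -\theta\). Folding the threshold into the weight vector this way turns it into just another learnable parameter, and changing \(w_0\) slides the step function left or right along the \(\mathbf{w^Tx}\) axis.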
So why would we want to shift the unit step function? Consider the case where all of the inputs equal \(0\), but we want the Perceptron's output to be \(1\). Adjusting the weights is hopeless, because \(\mathbf{z = w^Tx}\) is always \(0\) no matter what the weights are. The bias solves exactly this situation, and that is why we definitely need it. In a multi-layer neural network the bias plays the same role, except that it shifts the sigmoid function instead of the unit step function.
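The all-zeros situation above can be sketched in a few lines of Python. This is a minimal illustration, not code from the post: the function name `perceptron_output` and the example weights are made up for the demo, and the step function here fires when \(z \ge 0\).

```python
import numpy as np

def perceptron_output(x, w, bias):
    """Unit step activation: output 1 if w.x + bias >= 0, else 0."""
    z = np.dot(w, x) + bias  # bias plays the role of w_0 * x_0 with x_0 = 1
    return 1 if z >= 0 else 0

# With an all-zero input, z = w.x is 0 for every choice of w,
# so without a bias the output can never change during learning.
x = np.zeros(3)
w = np.array([0.4, -0.2, 0.7])

print(perceptron_output(x, w, bias=0.0))   # z =  0.0 -> 1
print(perceptron_output(x, w, bias=-0.5))  # z = -0.5 -> 0: bias shifted the step right
print(perceptron_output(x, w, bias=0.5))   # z =  0.5 -> 1: bias shifted the step left
```

Only the bias term changes between the three calls, yet the output flips: the weights are powerless here precisely because \(\mathbf{w^Tx} = 0\) for a zero input.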
