gradient descent – Machine Learning Interviews

What are the optimization algorithms typically used in a neural network?

Posted on February 14, 2019 by MLNerds

Gradient descent is the most commonly used training algorithm. Momentum is a common way to augment gradient descent: the gradient at each step is accumulated over past steps, so the algorithm moves toward the minimum more smoothly. RMSProp attempts to adjust the learning rate for each iteration in an automated…
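To make the difference between the update rules concrete, here is a minimal sketch in plain Python. It is not taken from the post: the toy objective f(x) = (x - 3)^2, the function names, and all hyperparameter values are assumptions chosen for illustration. The momentum and RMSProp updates follow their commonly stated textbook forms.

```python
import math


def grad(x):
    # Gradient of the toy objective f(x) = (x - 3)^2 (an assumed example).
    return 2.0 * (x - 3.0)


def gd_momentum(x0, lr=0.1, beta=0.9, steps=500):
    # Gradient descent with momentum: v accumulates past gradients,
    # so each step follows a smoothed direction rather than the raw gradient.
    x, v = x0, 0.0
    for _ in range(steps):
        v = beta * v + grad(x)
        x = x - lr * v
    return x


def rmsprop(x0, lr=0.01, decay=0.9, eps=1e-8, steps=2000):
    # RMSProp: s tracks a running average of squared gradients, and each
    # step is divided by sqrt(s), which rescales the effective learning rate.
    x, s = x0, 0.0
    for _ in range(steps):
        g = grad(x)
        s = decay * s + (1.0 - decay) * g * g
        x = x - lr * g / (math.sqrt(s) + eps)
    return x


if __name__ == "__main__":
    print("momentum:", gd_momentum(0.0))
    print("rmsprop: ", rmsprop(0.0))
```

Both routines should end up close to the minimizer x = 3 on this toy problem. With a constant learning rate, RMSProp tends to hover near the minimum rather than settle exactly on it, which is one reason it is often paired with a decaying learning rate in practice.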

Given a deep learning model, what are the considerations for setting the mini-batch size?

Posted on February 14, 2019 by MLNerds

The batch size is a hyperparameter. Usually people try various values to see what works best in terms of speed and accuracy. Suppose you have M training instances and a mini-batch size of k: a larger batch size makes a pass over the entire dataset faster, since the pass takes only M/k mini-batch iterations. As long as the data…
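The M/k relationship can be checked with a short sketch. It reads k as the mini-batch size, which is an interpretation of the excerpt rather than something it states outright, and the helper names `iterations_per_epoch` and `minibatches` are hypothetical. When M is not a multiple of k, the count is rounded up because the last mini-batch is smaller.

```python
import math


def iterations_per_epoch(num_examples, batch_size):
    # Number of mini-batch iterations needed for one pass over the data.
    # Rounds up so a final, smaller batch is still counted.
    return math.ceil(num_examples / batch_size)


def minibatches(data, batch_size):
    # Yield consecutive slices of `data`, each with at most `batch_size` items.
    for start in range(0, len(data), batch_size):
        yield data[start:start + batch_size]


if __name__ == "__main__":
    M = 1000
    for k in (10, 32, 100):
        print(f"batch size {k}: {iterations_per_epoch(M, k)} iterations per epoch")
```

Fewer iterations per epoch does not by itself mean better results, which is why the batch size is usually tuned empirically, as the excerpt notes.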