Deep Learning – Machine Learning Interviews

Positional Encoding in the Transformer Model

Posted on May 3, 2024 by MLNerds

Skip or Residual Connections in Deep Networks

Posted on March 1, 2024 by MLNerds

The BERT Score – Evaluating Text Generation

Posted on November 28, 2023 by MLNerds

This video talks about the evaluation metric BERTScore, why it needed over existing metrics such as the BLEU score and so on and how it is computed and evaluated. Traditional metrics look at exact text match. BERTScore looks at semantic similarity leveraging contextual word embeddings of words in the candidate and the reference sentences.

BERT Model

Posted on October 30, 2023 by MLNerds

Batch vs Mini-Batch vs Stochastic Gradient Descent

Posted on September 1, 2023 by MLNerds

Normalization in Deep Neural Networks

Posted on July 20, 2023 by MLNerds

Batch norm and Layer norm are common normalization techniques. This brief video talks about the need for normalization and the types of norms in deep neural networks.

When are deep learning algorithms more appropriate compared to traditional machine learning algorithms?

Posted on May 13, 2019May 13, 2019 by MLInterview

Deep learning algorithms are capable of learning arbitrarily complex non-linear functions by using a deep enough and a wide enough network with the appropriate non-linear activation function. Traditional ML algorithms often require feature engineering of finding the subset of meaningful features to use. Deep learning algorithms often avoid the need for the feature engineering step….

Why do you typically see overflow and underflow when implementing an ML algorithms ?

Posted on March 5, 2019May 13, 2019 by MLInterview

A common pre-processing step is to normalize/rescale inputs so that they are not too high or low. However, even on normalized inputs, overflows and underflows can occur: Underflow: Joint probability distribution often involves multiplying small individual probabilities. Many probabilistic algorithms involve multiplying probabilities of individual data points that leads to underflow. Example : Suppose you…

Older posts →

Category: Deep Learning