Category: Machine Learning
GPT Model
The BERT Score – Evaluating Text Generation
This video covers the evaluation metric BERTScore: why it is needed over existing metrics such as the BLEU score, how it is computed, and how it is evaluated. Traditional metrics look for exact text matches. BERTScore instead measures semantic similarity, using contextual embeddings of the words in the candidate and reference sentences.
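As a rough illustration of the matching idea, here is a minimal sketch in plain Python. It assumes the token embeddings have already been produced by a contextual model; the 2-d vectors and the `greedy_match_score` helper are hypothetical and only for illustration, and the optional IDF weighting and baseline rescaling are left out. Each token is matched to its most similar token on the other side by cosine similarity, and the matches are averaged into precision, recall, and F1.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def greedy_match_score(candidate, reference):
    """Return (precision, recall, f1) from greedy cosine matching.

    candidate, reference: lists of token embedding vectors.
    Hypothetical helper, not the API of any BERTScore library.
    """
    # Precision: each candidate token takes its best match in the reference.
    precision = sum(
        max(cosine(c, r) for r in reference) for c in candidate
    ) / len(candidate)
    # Recall: each reference token takes its best match in the candidate.
    recall = sum(
        max(cosine(r, c) for c in candidate) for r in reference
    ) / len(reference)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy stand-ins for contextual embeddings (made up for this example).
candidate = [[1.0, 0.0], [0.0, 1.0]]
reference = [[1.0, 0.0], [1.0, 1.0]]

p, r, f = greedy_match_score(candidate, reference)
print(f"precision={p:.4f} recall={r:.4f} f1={f:.4f}")
```

Because the comparison happens in embedding space, a candidate token can score well against a reference token that is not an exact string match, which is the contrast with exact-match metrics drawn above.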
BERT Model
The Transformer Architecture
Why are Transformers So Successful?
Batch vs Mini-Batch vs Stochastic Gradient Descent
What is Layer Normalization
What is Batch Normalization
Normalization in Deep Neural Networks
Batch norm and layer norm are common normalization techniques. This brief video explains why normalization is needed and covers the types of normalization used in deep neural networks.
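To make the difference between the two concrete, here is a minimal sketch in plain Python. It assumes a batch stored as a list of samples, each a list of feature values; the function names are illustrative, and the learnable scale and shift parameters (and the running statistics batch norm uses at inference time) are omitted. Batch norm normalizes each feature across the samples in the batch, while layer norm normalizes each sample across its own features.

```python
import math


def normalize(values, eps=1e-5):
    """Shift to zero mean and scale to (roughly) unit variance."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def batch_norm(batch):
    """Normalize each feature (column) across the samples in the batch."""
    columns = [normalize(list(col)) for col in zip(*batch)]
    # Transpose back to the original samples-by-features layout.
    return [list(row) for row in zip(*columns)]


def layer_norm(batch):
    """Normalize each sample (row) across its own features."""
    return [normalize(row) for row in batch]


# Two samples with three features each (made-up values).
batch = [[1.0, 10.0, 100.0],
         [3.0, 30.0, 300.0]]

print(batch_norm(batch))  # statistics taken down each column
print(layer_norm(batch))  # statistics taken along each row
```

Note that layer norm gives the same result for a sample regardless of what else is in the batch, whereas the batch norm output for a sample depends on the other samples it is batched with.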