154. Approaches For Tuning Models

There are mainly two approaches to tuning a model. Panda approach: tune one model at a time. Caviar approach: tune multiple models at once.
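To make the contrast concrete, here is a minimal sketch of the caviar approach, assuming random search over hyperparameters; `evaluate` is a hypothetical stand-in for a real training run.

```python
import random

# Hedged sketch of the "caviar" approach: sample many hyperparameter
# settings up front, train one model per setting, and keep the best,
# instead of babysitting a single model (the "panda" approach).

def sample_hyperparams():
    return {
        "learning_rate": 10 ** random.uniform(-4, -1),  # sample on a log scale
        "batch_size": random.choice([32, 64, 128]),
    }

def evaluate(hp):
    # placeholder for a real training run returning a validation score;
    # here we just pretend smaller learning rates score better
    return 1.0 - hp["learning_rate"]

trials = [sample_hyperparams() for _ in range(10)]
best = max(trials, key=evaluate)
print("best hyperparameters:", best)
```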

153. Non-Local Neural Networks

“Local” means only understanding the CURRENT “time” and “space”. To understand “non-local” nuances (What will the person in the image do next? Where is the kicked soccer ball headed?), if we were to use traditional methods such as…
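For a concrete picture of how “non-local” is realized in practice, here is a minimal sketch of a non-local block in the embedded-Gaussian style of Wang et al. (2018), assuming PyTorch; the layer names are illustrative, not from the post.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NonLocalBlock2D(nn.Module):
    # Minimal embedded-Gaussian non-local block: the response at each
    # position is a weighted sum over ALL positions, not a local window.
    def __init__(self, channels):
        super().__init__()
        self.inter = channels // 2
        self.theta = nn.Conv2d(channels, self.inter, kernel_size=1)
        self.phi = nn.Conv2d(channels, self.inter, kernel_size=1)
        self.g = nn.Conv2d(channels, self.inter, kernel_size=1)
        self.out = nn.Conv2d(self.inter, channels, kernel_size=1)

    def forward(self, x):
        n, _, h, w = x.shape
        theta = self.theta(x).flatten(2).transpose(1, 2)  # (n, h*w, inter)
        phi = self.phi(x).flatten(2)                      # (n, inter, h*w)
        g = self.g(x).flatten(2).transpose(1, 2)          # (n, h*w, inter)
        attn = F.softmax(theta @ phi, dim=-1)             # (n, h*w, h*w): all pairs
        y = (attn @ g).transpose(1, 2).reshape(n, self.inter, h, w)
        return x + self.out(y)                            # residual connection

x = torch.randn(1, 64, 8, 8)
print(NonLocalBlock2D(64)(x).shape)  # torch.Size([1, 64, 8, 8])
```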

152. KL Divergence

KL Divergence measures how different one distribution is from another (it is not symmetric, so it is not a true distance). This can be used to understand Cross-Entropy and deep learning model architectures such as the VAE. For example, let's say there is a coin which has a 50% chance of being HEADS and 50%…
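A quick worked version of the coin setup, assuming a fair coin P and a hypothetical biased coin Q with 70% heads:

```python
import math

# KL(P || Q) = sum over outcomes x of P(x) * log(P(x) / Q(x))
p = {"heads": 0.5, "tails": 0.5}  # fair coin from the example above
q = {"heads": 0.7, "tails": 0.3}  # assumed biased coin for illustration

kl = sum(p[x] * math.log(p[x] / q[x]) for x in p)
print(f"KL(P || Q) = {kl:.4f} nats")  # ~0.0872

# Note the asymmetry: swapping the arguments gives a different value,
# which is why KL divergence is not a true distance metric.
kl_rev = sum(q[x] * math.log(q[x] / p[x]) for x in q)
print(f"KL(Q || P) = {kl_rev:.4f} nats")  # ~0.0823
```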

150. The Mom Test

These are my key takeaways from the book The Mom Test by Rob Fitzpatrick. The Mom Test: Talk about their life instead of your idea, ask about specifics in the past instead of generics or opinions about the future, talk less…

147. Why Squared Loss?

Why do we use squared loss instead of absolute loss? One reason is that squaring magnifies larger errors, which can help train the model. Another reason is that absolute loss is not differentiable when the error equals 0.…
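A small numeric sketch of both points:

```python
# Squaring magnifies larger errors relative to smaller ones, and the
# derivative of e**2 (namely 2e) is defined everywhere, while the
# derivative of abs(e) jumps from -1 to +1 at e = 0.

for e in [0.1, 1.0, 5.0]:
    print(f"error={e:>4}: absolute={abs(e):6.2f}  squared={e**2:6.2f}")
# error= 0.1: absolute=  0.10  squared=  0.01
# error= 1.0: absolute=  1.00  squared=  1.00
# error= 5.0: absolute=  5.00  squared= 25.00
```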

146. BERT

What is BERT? BERT is a deep learning architecture for natural language processing. If you stack the Transformer’s encoders, you get BERT. What can BERT solve? Neural machine translation, question answering, sentiment analysis, text summarization. How to solve the problems…
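As a hedged sketch of getting started (assuming the Hugging Face transformers library, which the post itself does not name), loading pretrained BERT and pulling out its contextual embeddings looks like this; task-specific heads for the problems listed above are stacked on top of these embeddings.

```python
from transformers import AutoTokenizer, AutoModel

# Load the pretrained BERT encoder stack and its tokenizer.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("BERT is a stack of Transformer encoders.", return_tensors="pt")
outputs = model(**inputs)

# One contextual embedding per token, 768 hidden units for bert-base.
print(outputs.last_hidden_state.shape)  # (batch, tokens, 768)
```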

145. Multi-Class vs Multi-Label / SoftMax vs Sigmoid

Multi-Class = 1 class per image. Multi-Label = multiple labels in a single image. Softmax = scale outputs to 0~1 and make the sum equal to 1 so that they become probabilities. Useful for multi-class classification. Sigmoid = Scale…
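A numeric sketch of the contrast, applying both functions to the same three raw scores:

```python
import math

logits = [2.0, 1.0, 0.1]  # raw scores for three classes/labels

# softmax: exponentiate, then normalize so the outputs sum to 1
exps = [math.exp(z) for z in logits]
softmax = [e / sum(exps) for e in exps]
print([round(p, 3) for p in softmax], "sum =", round(sum(softmax), 3))
# [0.659, 0.242, 0.099] sum = 1.0 -> pick the single most likely class

# sigmoid: squash each score to (0, 1) independently; no shared normalizer
sigmoid = [1 / (1 + math.exp(-z)) for z in logits]
print([round(p, 3) for p in sigmoid])
# [0.881, 0.731, 0.525] -> each label can independently be "on"
```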