Cross-Entropy Loss for Next Token Prediction in TransformersIn light of the tremendous success of transformers in the context of the Next Token Prediction task, I’ve decided to create this post to…Aug 6, 2024Aug 6, 2024
Neural Network Series: Give me the formulas already! (Part VII)Today I recommend grabbing a cup of coffee, tea or, in my case, prepare yourself a mate. The conceptual elements of multilayer perceptrons…Apr 28, 20241Apr 28, 20241
Neural Network Series: Backpropagation step-by-step conceptual explanation (Part VI)Little by little, we are moving towards the end of the series. In the previous article, the multi-layer perceptron was introduced and there…Apr 24, 2024Apr 24, 2024
Neural Network Series: Multilayer perceptron, can you really do it all? (Part V)The previous article was the last of the simple perceptron introductory series. Before moving on, let’s summarize the basic concepts that…Apr 22, 2024Apr 22, 2024
Neural Network Series: Is binary classification the best you can do? (Part IV)Something worth noting from the perceptron previously explained, is that the activation function is the element restricting the neuron’s…Apr 20, 2024Apr 20, 2024
Neural Network Series: Rosenblatt’s perceptron algorithm (Part III)Simple perceptrons, as discussed in the previous article, can be useful when dealing with binary classification, especially if the problem…Apr 18, 2024Apr 18, 2024
Neural Network Series: Congratulations, it’s a baby network (Part II).We’ve established in Part I of the series that the goal is to understand the subject from the ground up. Neural Networks is an area that…Apr 16, 2024Apr 16, 2024
Neural Network Series: We’re not releasing autonomous black boxes into the world (Part I).Unless you’ve been living under a rock, there are high chances you can identify some of the technologies displayed on the image below…Apr 15, 2024Apr 15, 2024