Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are What are Transformer Models and how do they work? 1y | Louis Serano The math behind Attention Mechanisms 2y | Louis Serano The Attention Mechanism in Large Language Models 2y | Louis Serano The Binomial and Poisson Distributions 2y | Louis Serano Euler's number, derivatives, and the bank at the end of the universe 3y | Louis Serano Decision tree - A friendly introduction 3y | Louis Serano Thank you for 100K subscribers! I’m planning tons of new content coming soon, so excited! 3y | Louis Serano How do you minimize a function when you can't take derivatives? CMA-ES and PSO 3y | Louis Serano What is Quantum Machine Learning? 3y | Louis Serano Can a false positive also be a false negative? - A COVID short story 3y | Louis Serano << < 2 3 4 5 6 > >> Rejoindre le groupe Membres Chercher ÉtabliUn jour passéQuatre derniers joursMois passé Choose a GroupLouis Serano Choose a User Trier parpar pertinenceUpvotedNouveau en premierNombre de signetscompteur de commentaire Chercher