Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are What are Transformer Models and how do they work? 1y | Louis Serano The math behind Attention Mechanisms 1y | Louis Serano The Attention Mechanism in Large Language Models 1y | Louis Serano The Binomial and Poisson Distributions 2y | Louis Serano Euler's number, derivatives, and the bank at the end of the universe 2y | Louis Serano Decision tree - A friendly introduction 2y | Louis Serano Thank you for 100K subscribers! I’m planning tons of new content coming soon, so excited! 2y | Louis Serano How do you minimize a function when you can't take derivatives? CMA-ES and PSO 2y | Louis Serano What is Quantum Machine Learning? 2y | Louis Serano Can a false positive also be a false negative? - A COVID short story 3y | Louis Serano << < 1 2 3 4 5 > >> Alăturați-vă grupului Membri Căutare CreatăA trecut o ziUltimele patru zileLuna trecuta Choose a GroupLouis Serano Choose a User Filtrează dupădupă relevanțăVotat în susMai întâi nouNumăr marcajeNumăr de comentarii Căutare