Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are The Attention Mechanism in Large Language Models 1y | Louis Serano The Binomial and Poisson Distributions 2y | Louis Serano Euler's number, derivatives, and the bank at the end of the universe 2y | Louis Serano Decision tree - A friendly introduction 2y | Louis Serano Thank you for 100K subscribers! I’m planning tons of new content coming soon, so excited! 2y | Louis Serano How do you minimize a function when you can't take derivatives? CMA-ES and PSO 2y | Louis Serano What is Quantum Machine Learning? 2y | Louis Serano Can a false positive also be a false negative? - A COVID short story 3y | Louis Serano Denoising and Variational Autoencoders 3y | Louis Serano Training Latent Dirichlet Allocation: Gibbs Sampling (Part 2 of 2) 3y | Louis Serano << < 1 2 3 4 5 > >> Join group Members Search CreatedPast one dayPast four dayPast month Choose a GroupLouis Serano Choose a User Sort byby relevanceUpvotedNew firstBookmark countComment count Search