Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are Decision tree - A friendly introduction 2y | Louis Serano Thank you for 100K subscribers! I’m planning tons of new content coming soon, so excited! 2y | Louis Serano How do you minimize a function when you can't take derivatives? CMA-ES and PSO 2y | Louis Serano What is Quantum Machine Learning? 2y | Louis Serano Can a false positive also be a false negative? - A COVID short story 3y | Louis Serano Denoising and Variational Autoencoders 3y | Louis Serano You are much better at math than you think 3y | Louis Serano Training Latent Dirichlet Allocation: Gibbs Sampling (Part 2 of 2) 3y | Louis Serano Live chat with Luis Serrano! 3y | Louis Serano Restricted Boltzmann Machines (RBM) - A friendly introduction 3y | Louis Serano << < 2 3 4 5 6 > >> Pridať sa k skupine Členovia Vyhľadávanie VytvorenéPosledný deňPosledný štyri dniMinulý mesiac Choose a GroupLouis Serano Choose a User Triediť podľapodľa relevantnostiUpvotedNové ako prvéPočet záložiekPočet komentárov Vyhľadávanie