Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are Live with Jay Alammar, Josh Starmer, and Luis Serrano 9h | Louis Serano What is AdaBoost? Friendly explanation with code! 2d | Louis Serano What is Positional Encoding in Transformer Models? 8d | Louis Serano Live with Jay Alammar, Josh Starmer, and Luis Serrano 8d | Louis Serano Model that won the 2024 Physics Nobel Prize - Hopfield Networks 14d | Louis Serano The Fast Fourier Transform 1mo | Louis Serano Detecting Periodicity with the Discrete Fourier Transform 2mo | Louis Serano The Discrete Fourier Transform 2mo | Louis Serano State Space Models (SSMs) and Mamba 4mo | Louis Serano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning 5mo | Louis Serano 1 2 3 4 5 > >> Join group Members Search CreatedPast one dayPast four dayPast month Choose a GroupLouis Serano Choose a User Sort byby relevanceUpvotedNew firstBookmark countComment count Search