Live with Jay Alammar, Josh Starmer, and Luis Serrano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning KL Divergence - How to tell how different two distributions are Live with Jay Alammar, Josh Starmer, and Luis Serrano 4mo | Louis Serano Model that won the 2024 Physics Nobel Prize - Hopfield Networks 4mo | Louis Serano The Fast Fourier Transform 4mo | Louis Serano Detecting Periodicity with the Discrete Fourier Transform 5mo | Louis Serano The Discrete Fourier Transform 6mo | Louis Serano State Space Models (SSMs) and Mamba 7mo | Louis Serano Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning 8mo | Louis Serano KL Divergence - How to tell how different two distributions are 8mo | Louis Serano Josh Starmer and Luis Serrano livestream 2 - Double BAM! 9mo | Louis Serano Bessel correction and a different way to see variance 9mo | Louis Serano << < 1 2 3 4 5 > >> Dołączyć do grupy Członkowie Szukaj UtworzonyMinął jeden dzieńOstatnie cztery dniMiniony miesiąc Choose a GroupLouis Serano Choose a User Sortuj wedługwedług znaczeniaUpvotedNowy pierwszyLiczba zakładekLiczba komentarzy Szukaj