Show HN: Llama 3.2 Interpretability with Sparse Autoencoders

I spent a lot of time and money on this rather big side project, which attempts to replicate the mechanistic interpretability research on proprietary LLMs that was quite popular this year and produced great research papers by Anthropic [1], OpenAI [2] and DeepMind [3].

I am quite proud of this project, and since I consider myself part of the target audience for Hacker News, I thought that some of you might appreciate this open research replication as well. Happy to answer any questions or hear any feedback.

Cheers

[1] https://transformer-circuits.pub/2024/scaling-monosemanticit...

[2] https://arxiv.org/abs/2406.04093

[3] https://arxiv.org/abs/2408.05147

Comments URL: https://news.ycombinator.com/item?id=42208383

Points: 46

# Comments: 1

https://github.com/PaulPauls/llama3_interpretability_sae

Created Nov 21, 2024, 9:40:08 PM
