I spent a lot of time and money on this rather big side project of mine that attempts to replicate the mechanistic interpretability research on proprietary LLMs that was quite popular this year and produced great research papers by Anthropic [1], OpenAI [2] and Deepmind [3].
I am quite proud of this project and since I consider myself the target audience for HackerNews did I think that maybe some of you would appreciate this open research replication as well. Happy to answer any questions or face any feedback.
Cheers
[1] https://transformer-circuits.pub/2024/scaling-monosemanticit...
[2] https://arxiv.org/abs/2406.04093
[3] https://arxiv.org/abs/2408.05147
Comments URL: https://news.ycombinator.com/item?id=42208383
Points: 46
# Comments: 1
Login to add comment
Other posts in this group
Article URL: https://github.com/sbcl/sbcl/commit/42fd0ced76e851fe883f8651b832234a7cbd1fa2
Comments
Article URL: https://github.com/chaosprint/asak
Comments URL: https://news.ycombinat
After decades of tracking my exercise programs in progressively more complex spreadsheets I eventually burned out on metrics and complicated periodization programs to the point where I had almost