Writing an LLM from scratch, part 8 – trainable self-attention

Article URL: https://www.gilesthomas.com/2025/03/llm-from-scratch-8-trainable-self-attention

Comments URL: https://news.ycombinator.com/item?id=43261650

Points: 7

# Comments: 0

Created 1mo | Mar 5, 2025, 4:10:10 AM

Other posts in this group

Scaling Up Reinforcement Learning for Traffic Smoothing

Article URL: https://bair.berkeley.edu/blog/2025/03/25/rl-av-smoothing/

Apr 5, 2025, 6:50:09 AM | Hacker news

404s – gallery of error 404 page designs

Article URL: https://www.404s.design/

Comments URL: https://news.ycombinator.com/item?id=43589

Apr 5, 2025, 6:50:08 AM | Hacker news

Coqui TTS: Free Text-to-Speech

Article URL: https://coquitts.com

Comments URL: https://news.ycombinator.com/item?id=43590570

Apr 5, 2025, 6:50:08 AM | Hacker news

I don't like traveling anymore

Article URL: https://sidverma.io/posts/i-dont-like-traveling-anymore/

Apr 5, 2025, 6:50:07 AM | Hacker news

Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)

Hi HN,

I’ve been working on an OCR pipeline specifically optimized for machine learning dataset preparation. It’s designed to process complex academic materials — including math formulas, tables

Apr 5, 2025, 6:50:06 AM | Hacker news

Recreating Daft Punk's Something About Us

Article URL: https://thoughts-and-things.ghost.io/recreating-daft-punks-something-about-us/

Apr 5, 2025, 6:50:05 AM | Hacker news

Investigating MacPaint's Source Code

Article URL: https://ztoz.blog/posts/macpaint-source-code/

Apr 5, 2025, 4:30:11 AM | Hacker news
