Lossless LLM compression for efficient GPU inference via dynamic-length float

Article URL: https://arxiv.org/abs/2504.11651

Comments URL: https://news.ycombinator.com/item?id=43796935

Points: 137

# Comments: 38

https://arxiv.org/abs/2504.11651

Vytvořeno 11h | 25. 4. 2025 20:30:13

Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

Colossal Cave Adventure (1976)

Article URL: https://github.com/wh0am1-dev/adventure

Comments URL: https://news

26. 4. 2025 5:50:04 | Hacker news

Show HN: Empty Enter Expander – Type less in the terminal with this tool

When you have a lot of aliases it can be difficult to remember how was the one you need named especially if you do not use it very often. You can also have files stored in a bin folder and look th

26. 4. 2025 5:50:03 | Hacker news

A tuition-free school created by Zuckerberg and Chan will shutter next year

Article URL: https://www.cnn.com/2025/04/25/tech/chan-zuckerberg-primary-school-closing/index.html

26. 4. 2025 5:50:02 | Hacker news

ACM's flagship magazine seeks submissions by/for practitioners

Article URL: https://cacm.acm.org/practice/call-for-papers-cacm-practice-section/

Comments URL:

26. 4. 2025 3:30:16 | Hacker news

Reading RSS content is a skilled activity

Article URL: https://www.doliver.org/articles/rss-as-a-skill

Comments URL:

26. 4. 2025 3:30:14 | Hacker news

Your phone isn't secretly listening to you, but the truth is more disturbing

Article URL: https://newatlas.com/computers/smartphone-listening-conversations-ads-facebook/

26. 4. 2025 3:30:14 | Hacker news

I wrote a book called "Crap Towns". It seemed funny at the time

Article URL: https://samj.substack.com/p/that-joke-isnt-funny-any-more

Comments URL:

26. 4. 2025 3:30:12 | Hacker news

Techie