Lossless LLM compression for efficient GPU inference via dynamic-length float

Article URL: https://arxiv.org/abs/2504.11651

Comments URL: https://news.ycombinator.com/item?id=43796935

Points: 137

# Comments: 38

https://arxiv.org/abs/2504.11651

Utworzony 11h | 25 kwi 2025, 20:30:13

Zaloguj się, aby dodać komentarz

Inne posty w tej grupie

Colossal Cave Adventure (1976)

Article URL: https://github.com/wh0am1-dev/adventure

Comments URL: https://news

26 kwi 2025, 05:50:04 | Hacker news

Show HN: Empty Enter Expander – Type less in the terminal with this tool

When you have a lot of aliases it can be difficult to remember how was the one you need named especially if you do not use it very often. You can also have files stored in a bin folder and look th

26 kwi 2025, 05:50:03 | Hacker news

A tuition-free school created by Zuckerberg and Chan will shutter next year

Article URL: https://www.cnn.com/2025/04/25/tech/chan-zuckerberg-primary-school-closing/index.html

26 kwi 2025, 05:50:02 | Hacker news

ACM's flagship magazine seeks submissions by/for practitioners

Article URL: https://cacm.acm.org/practice/call-for-papers-cacm-practice-section/

Comments URL:

26 kwi 2025, 03:30:16 | Hacker news

Reading RSS content is a skilled activity

Article URL: https://www.doliver.org/articles/rss-as-a-skill

Comments URL:

26 kwi 2025, 03:30:14 | Hacker news

Your phone isn't secretly listening to you, but the truth is more disturbing

Article URL: https://newatlas.com/computers/smartphone-listening-conversations-ads-facebook/

26 kwi 2025, 03:30:14 | Hacker news

I wrote a book called "Crap Towns". It seemed funny at the time

Article URL: https://samj.substack.com/p/that-joke-isnt-funny-any-more

Comments URL:

26 kwi 2025, 03:30:12 | Hacker news

Techie