Task-Specific LLM Evals That Do and Don't Work

Article URL: https://eugeneyan.com/writing/evals/

Comments URL: https://news.ycombinator.com/item?id=42366481

Points: 20

# Comments: 7

Created 1mo | 9 Dec 2024, 16:20:08

Other posts in this group

A secure distributed actor language

Article URL: https://mistysystem.com/

Comments URL: https://news.ycombinator.com/item?id=42671

14 Jan 2025, 07:40:13 | Hacker News

Why Can't Programmers Be More Like Ants? Or a Lesson in Stigmergy (2015)

Article URL: https://blog.ubiquity.acm.org/why-cant-programmers-be-more-like-ants-or-a-less

14 Jan 2025, 07:40:09 | Hacker News

Campsite is now open source

Article URL: https://github.com/campsite/campsite

Comments URL: https://news.ycomb

14 Jan 2025, 07:40:08 | Hacker News

ZFS 2.3.0 released with ZFS raidz expansion

Article URL: https://github.com/openzfs/zfs/releases/tag/zfs-2.3.0

Comments URL:

14 Jan 2025, 07:40:07 | Hacker News

James Thomson on the Origins of the macOS Dock

Article URL: https://daringfireball.net/linked/2025/01/10/thomson-dock

Comments URL:

14 Jan 2025, 05:30:08 | Hacker News

Training AI models might not need enormous data centres

Article URL: https://www.economist.com/science-and-technology/2025/01/

14 Jan 2025, 05:30:07 | Hacker News

'Absolutely insane'. Dragonfly's extreme loop-the-loops unparalleled in nature

Article URL: https://www.science.org/content/article/absolutely-insane-dr

14 Jan 2025, 05:30:06 | Hacker News
