Would you board a plane safety-tested by GenAI?

Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable. https://stackoverflow.blog/2024/05/24/would-you-board-a-plane-safety-tested-by-genai/

Creată 1y | 24 mai 2024, 05:50:06

Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

Programming problems that seem easy, but aren't, featuring Jon Skeet

Jon Skeet, the first Stack Overflow user with a million reputation, sits down with Ryan to share his wealth of knowledge on all things development: the deceptively simple but actually complicated prob

1 iul. 2025, 06:40:04 | StackOverflow blog

Reliability for unreliable LLMs

Large language models are non-deterministic by design. Here's how you can inject a little bit of determinism into GenAI workflows. https://stackoverflow.blog/2025/06/30/reliability-for-unreliable-llm

30 iun. 2025, 14:30:02 | StackOverflow blog

You’ve got 99 problems but data shouldn’t be one

Ryan is joined by Tobiko Data co-founders Toby Mao and Iaroslav Zeigerman to talk about the crucial role of rigorous data practices and tooling, the innovations of Tobiko Data’s SQLMesh and SQLGlot, a

27 iun. 2025, 05:20:07 | StackOverflow blog

Not an option, but a necessity: How organizations are adopting and implementing AI internally

AI is no longer just a luxury for the most tech savvy companies — it's now a necessity for organizational transformation. How are real teams successfully leveraging and innovating with these new tools

25 iun. 2025, 13:50:07 | StackOverflow blog

You've vibe coded an app. Now what?

On this episode, Ryan chats with Vish Abrams, chief architect at Heroku, about all the work that needs to be done after you’ve vibe coded your dream app. https://stackoverflow.blog/2025/06/25/you-ve-

25 iun. 2025, 06:50:08 | StackOverflow blog

How to build your prototypes without a 35% tariff

Ryan and Ben welcome Alex Malcoci, CEO and founder of MiniProto, to talk innovations in hardware prototyping, the evolving complexities of the global supply chain, the impact of the US-China trade war

24 iun. 2025, 05:20:11 | StackOverflow blog

Defending the realm: Trust and safety at Stack Overflow

In this special episode, Ryan is joined by our Senior VP of Community, Philippe Beaudette, and the Trust and Safety team at Stack Overflow to discuss maintaining platform integrity and managing user s

20 iun. 2025, 06:20:09 | StackOverflow blog

Tomas_r2