Would you board a plane safety-tested by GenAI?

Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable. https://stackoverflow.blog/2024/05/24/would-you-board-a-plane-safety-tested-by-genai/

Creato 11mo | 24 mag 2024, 05:50:06

Accedi per aggiungere un commento

Altri post in questo gruppo

Visually orchestrating data diagnostics but platform agnostic

Ryan chats with Dataiku CEO and cofounder Florian Douetteau about the complexities of the genAI data stack and how his company is orchestrating it. https://stackoverflow.blog/2025/04/22/visually-orch

22 apr 2025, 05:50:08 | StackOverflow blog

Generating components, not tokens

On today’s episode, Ben and Ryan chat with Laly Bar-Ilan, Chief Scientist at Bit. https://stackoverflow.blog/2025/04/18/generating-components-not-tokens/

18 apr 2025, 06:50:07 | StackOverflow blog

Wait, what is agentic AI?

Is “agentic AI” just a buzzword, or is it the sea change it seems? https://stackoverflow.blog/2025/04/17/wait-what-is-agentic-ai/

17 apr 2025, 14:40:07 | StackOverflow blog

WBIT #6: Be curious, ask questions, and don’t argue with JavaScript

Kyle chats with Jesse Tomchak a software engineer at ClickUp about all the spicy backend takes they could find. https://stackoverflow.blog/2025/04/09/wbit-6-be-curious-ask-questions-and-don-t-argue-w

16 apr 2025, 17:50:07 | StackOverflow blog

Engineering teams need to adapt to AI’s scaling challenges

AI is not a linear process. To scale effectively, engineering leaders must account for varied edge cases, presenting a new set of challenges. https://stackoverflow.blog/2025/04/16/engineering-teams-ne

16 apr 2025, 17:50:07 | StackOverflow blog

WBIT #7: Exploring WebAssembly with the first SO user to get 10k rep

Kyle interviews Michael Stum, a former Stacker who started (and returned) to answering questions on the community site. https://stackoverflow.blog/2025/04/16/wbit-7-exploring-webassembly-with-the-fir

16 apr 2025, 15:30:09 | StackOverflow blog

Smarter insights, stronger teams: New features for Stack Overflow for Teams

2025.3 | Teams Enterprise Release https://stackoverflow.blog/2025/04/15/smarter-insights-stronger-teams-new-features-for-stack-overflow-for-teams/

15 apr 2025, 14:10:03 | StackOverflow blog

Tomas_r2