Ben and Ryan are joined by Robin Gupta for a conversation about benchmarking and testing AI systems. They talk through the lack of trust and confidence in AI, the inherent challenges of nondeterministic systems, the role of human verification, and whether we can (or should) expect an AI to be reliable. https://stackoverflow.blog/2024/05/24/would-you-board-a-plane-safety-tested-by-genai/

Today’s episode is a roundup of spontaneous, on-the-ground conversations from HumanX 2025, featuring guests from CodeConductor, DDN, Cloudflare, and Galileo. https://stackoverflow.blog/2025/04/25/grab

In this episode of Leaders of Code, we chat with guests from Lloyds Banking Group about their focus on engineering excellence and the need for organizations to adapt to new technologies while ensuring

An update on recent launches and the upcoming roadmap https://stackoverflow.blog/2025/04/23/community-products-roadmap-update-april-2025/

Ryan chats with Dataiku CEO and cofounder Florian Douetteau about the complexities of the genAI data stack and how his company is orchestrating it. https://stackoverflow.blog/2025/04/22/visually-orch

On today’s episode, Ben and Ryan chat with Laly Bar-Ilan, Chief Scientist at Bit. https://stackoverflow.blog/2025/04/18/generating-components-not-tokens/

Is “agentic AI” just a buzzword, or is it the sea change it seems? https://stackoverflow.blog/2025/04/17/wait-what-is-agentic-ai/

Kyle chats with Jesse Tomchak, a software engineer at ClickUp, about all the spicy backend takes they could find. https://stackoverflow.blog/2025/04/09/wbit-6-be-curious-ask-questions-and-don-t-argue-w