Detecting errors in AI-generated code

Ben chats with Gias Uddin, an assistant professor at York University in Toronto, where he teaches software engineering, data science, and machine learning. His research focuses on designing intelligent tools for testing, debugging, and summarizing software and AI systems. He recently published a paper about detecting errors in code generated by LLMs. Gias and Ben discuss the concept of hallucinations in AI-generated code, the need for tools to detect and correct those hallucinations, and the potential for AI-powered tools to generate QA tests. https://stackoverflow.blog/2024/09/20/detecting-errors-in-ai-generated-code/

Created 7mo ago | 21 Sep 2024, 09:40:02



Other posts in this group

Grab bag! On the floor at HumanX

Today’s episode is a roundup of spontaneous, on-the-ground conversations from HumanX 2025, featuring guests from CodeConductor, DDN, Cloudflare, and Galileo. https://stackoverflow.blog/2025/04/25/grab

25 Apr 2025, 05:50:06 | StackOverflow blog

Standardization and simplification as key to engineering excellence

In this episode of Leaders of Code, we chat with guests from Lloyds Banking Group about their focus on engineering excellence and the need for organizations to adapt to new technologies while ensuring

24 Apr 2025, 13:40:03 | StackOverflow blog

Visually orchestrating data diagnostics but platform agnostic

Ryan chats with Dataiku CEO and cofounder Florian Douetteau about the complexities of the genAI data stack and how his company is orchestrating it. https://stackoverflow.blog/2025/04/22/visually-orch

22 Apr 2025, 05:50:08 | StackOverflow blog

Generating components, not tokens

On today’s episode, Ben and Ryan chat with Laly Bar-Ilan, Chief Scientist at Bit. https://stackoverflow.blog/2025/04/18/generating-components-not-tokens/

18 Apr 2025, 06:50:07 | StackOverflow blog

Wait, what is agentic AI?

Is “agentic AI” just a buzzword, or is it the sea change it seems? https://stackoverflow.blog/2025/04/17/wait-what-is-agentic-ai/

17 Apr 2025, 14:40:07 | StackOverflow blog

WBIT #6: Be curious, ask questions, and don’t argue with JavaScript

Kyle chats with Jesse Tomchak, a software engineer at ClickUp, about all the spicy backend takes they could find. https://stackoverflow.blog/2025/04/09/wbit-6-be-curious-ask-questions-and-don-t-argue-w

16 Apr 2025, 17:50:07 | StackOverflow blog