Detecting errors in AI-generated code

Ben chats with Gias Uddin, an assistant professor at York University in Toronto, where he teaches software engineering, data science, and machine learning. His research focuses on designing intelligent tools for testing, debugging, and summarizing software and AI systems. He recently published a paper about detecting errors in code generated by LLMs. Gias and Ben discuss the concept of hallucinations in AI-generated code, the need for tools to detect and correct those hallucinations, and the potential for AI-powered tools to generate QA tests. https://stackoverflow.blog/2024/09/20/detecting-errors-in-ai-generated-code/

Created 7d ago | 21 Sep 2024, 09:40:02



Other posts in this group

Deedy Das: from coding at Meta, to search at Google, to investing with Anthropic

We chat with Deedy Das, a Principal at Menlo Ventures, who began his career as a software engineer at Facebook and Google. He then dipped a toe in the startup world, spending time at the company now k

27 Sep 2024, 04:50:06 | StackOverflow blog
Masked self-attention: How LLMs learn relationships between tokens

Masked self-attention is the key building block that allows LLMs to learn rich relationships and patterns between the words of a sentence. Let’s build it together from scratch. https://stackoverflow.b

26 Sep 2024, 15:10:02 | StackOverflow blog
He sold his first company for billions. Now he’s building a better developer experience.

Founder and entrepreneur Jyoti Bansal tells Ben, Cassidy, and Eira about the developer challenges he aims to solve with his new venture, Harness, an AI-driven software development platform meant to ta

24 Sep 2024, 14:20:04 | StackOverflow blog
Elevating your search experience: Stack Overflow for Teams ML-powered reranking experiment

Today, we're excited to share details about our latest experiment that aims to make your search results in Stack Overflow for Teams Enterprise even more relevant and useful. https://stackoverflow.blog

19 Sep 2024, 17:20:04 | StackOverflow blog
Looking under the hood at the tech stack that powers multimodal AI

Ryan chats with Russ d’Sa, cofounder and CEO of LiveKit, about multimodal AI and the technology that makes it possible. They talk through the tech stack required, including the use of WebRTC and UDP p

17 Sep 2024, 04:50:07 | StackOverflow blog
The world’s largest open-source business has plans for enhancing LLMs

Ben and Ryan talk to Scott McCarty, Global Senior Principal Product Manager for Red Hat Enterprise Linux, about the intersection between LLMs (large language models) and open source. They discuss the

13 Sep 2024, 05:10:05 | StackOverflow blog