Google accused of using novices to fact-check Gemini's AI answers

There's no arguing that AI still has quite a few unreliable moments, but one would hope that at least its evaluations would be accurate. However, last week Google allegedly instructed contract workers evaluating Gemini not to skip any prompts, regardless of their expertise, TechCrunch reports based on internal guidance it viewed. Google shared a preview of Gemini 2.0 earlier this month.  

Google reportedly instructed GlobalLogic, an outsourcing firm whose contractors evaluate AI-generated output, not to have reviewers skip prompts outside of their expertise. Previously, contractors could choose to skip any prompt that fell far out of their expertise — such as asking a doctor about laws. The guidelines had stated, "If you do not have critical expertise (e.g. coding, math) to rate this prompt, please skip this task."

Now, contractors have allegedly been instructed, "You should not skip prompts that require specialized domain knowledge" and that they should "rate the parts of the prompt you understand" while adding a note that it's not an area they have knowledge in. Apparently, the only times contracts can skip now are if a big chunk of the information is missing or if it has harmful content which requires specific consent forms for evaluation. 

One contractor aptly responded to the changes stating, "I thought the point of skipping was to increase accuracy by giving it to someone better?" 

Google has not responded to a request for comment. 

This article originally appeared on Engadget at https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss
Établi 12d | 19 déc. 2024 à 15:10:13


Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

The best SSDs in 2025

When it comes to boosting your system’s performance, upgrading to one of the fastest SSDs is a no-brainer. Whether you’re building a gaming PC, speeding up an older laptop or simply craving lightni

31 déc. 2024 à 21:30:11 | Engadget
Squid Game's second season is officially Netflix's biggest TV debut

Squid Game has set a new high for Netflix's television programming.

31 déc. 2024 à 21:30:10 | Engadget
Bluesky and Threads showed us very different visions for a post-X future

There’s no longer any question that Threads and Bluesky have created the most viable alternatives to the platform once known as Twitter. But while the two services may share some of the same goals,

31 déc. 2024 à 19:10:16 | Engadget
Terraform Labs co-founder Do Kwon will face securities fraud charges in the US

Terraforms Labs CEO Do Kwon spent the last day of 2024 getting extradited to the US,

31 déc. 2024 à 16:50:05 | Engadget
Russia is trying to make its own game consoles in a bid for technological independence

It’s no secret that Russia has been slowly working towards eschewing as much Western technology as it can and developing its own, and its latest effort seems to be related to video games. On Decemb

31 déc. 2024 à 16:50:04 | Engadget
Dang, 2024 was a great year for horror game fans

When it comes to new horror games, there are times of feast and famine, and this past year we gorged until our bellies bulged and our mouths dripped with gruesome grease. In 2024, we received a ric

31 déc. 2024 à 16:50:03 | Engadget
The Morning After: A microwave with a 27-inch touchscreen

We’re wrapping up 2024, so why not do it with some frivolous CES announcements? Like this premium (it has to be premium!)

31 déc. 2024 à 14:30:10 | Engadget