Google accused of using novices to fact-check Gemini's AI answers

There's no arguing that AI still has quite a few unreliable moments, but one would hope that at least its evaluations would be accurate. However, last week Google allegedly instructed contract workers evaluating Gemini not to skip any prompts, regardless of their expertise, TechCrunch reports based on internal guidance it viewed. Google shared a preview of Gemini 2.0 earlier this month.  

Google reportedly instructed GlobalLogic, an outsourcing firm whose contractors evaluate AI-generated output, not to have reviewers skip prompts outside of their expertise. Previously, contractors could choose to skip any prompt that fell far out of their expertise — such as asking a doctor about laws. The guidelines had stated, "If you do not have critical expertise (e.g. coding, math) to rate this prompt, please skip this task."

Now, contractors have allegedly been instructed, "You should not skip prompts that require specialized domain knowledge" and that they should "rate the parts of the prompt you understand" while adding a note that it's not an area they have knowledge in. Apparently, the only times contracts can skip now are if a big chunk of the information is missing or if it has harmful content which requires specific consent forms for evaluation. 

One contractor aptly responded to the changes stating, "I thought the point of skipping was to increase accuracy by giving it to someone better?" 

Google has not responded to a request for comment. 

This article originally appeared on Engadget at https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss
Created 8d | Dec 19, 2024, 3:10:13 PM


Login to add comment

Other posts in this group

How to use chatGPT on your iPhone

Since the release of iOS 18.2

Dec 26, 2024, 8:50:21 PM | Engadget
Squid Game will have a third (and final) season in 2025

It looks like we won’t have to wait long to find out what happens in the next installment of Netflix’s addictive and deadly drama Squid Game. The Netflix-owned blog

Dec 26, 2024, 8:50:20 PM | Engadget
The best PS5 games for 2025: Top PlayStation titles to play right now

Got a PlayStation 5 but not sure what to play next? With the massive library available, it’s easy to get a little lost scrolling through titles. From award-winning adventures to intense action RPG

Dec 26, 2024, 8:50:19 PM | Engadget
LG found a new job for your standing lamp

LG is bringing

Dec 26, 2024, 6:40:19 PM | Engadget
Tech's biggest winners in 2024

In recent years, reflecting on the past 12 months has seemed to bring back

Dec 26, 2024, 6:40:18 PM | Engadget
Bluesky launches a Trending Topics feature in search

Social media platform Bluesky has launched Trending Topics into beta, the company announced in a post on

Dec 26, 2024, 2:10:05 PM | Engadget
How to spend your $100 gift card after Christmas

Some consider gift cards not the most personal of gifts, but I say that's not the case. They allow you to get exactly what you want with no confusion, and (typically) both gifter and giftee walk aw

Dec 26, 2024, 2:10:04 PM | Engadget