Google accused of using novices to fact-check Gemini's AI answers

There's no arguing that AI still has quite a few unreliable moments, but one would hope that at least its evaluations would be accurate. However, last week Google allegedly instructed contract workers evaluating Gemini not to skip any prompts, regardless of their expertise, TechCrunch reports based on internal guidance it viewed. Google shared a preview of Gemini 2.0 earlier this month.  

Google reportedly instructed GlobalLogic, an outsourcing firm whose contractors evaluate AI-generated output, not to have reviewers skip prompts outside of their expertise. Previously, contractors could choose to skip any prompt that fell far out of their expertise — such as asking a doctor about laws. The guidelines had stated, "If you do not have critical expertise (e.g. coding, math) to rate this prompt, please skip this task."

Now, contractors have allegedly been instructed, "You should not skip prompts that require specialized domain knowledge" and that they should "rate the parts of the prompt you understand" while adding a note that it's not an area they have knowledge in. Apparently, the only times contracts can skip now are if a big chunk of the information is missing or if it has harmful content which requires specific consent forms for evaluation. 

One contractor aptly responded to the changes stating, "I thought the point of skipping was to increase accuracy by giving it to someone better?" 

Google has not responded to a request for comment. 

This article originally appeared on Engadget at https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss https://www.engadget.com/ai/google-accused-of-using-novices-to-fact-check-geminis-ai-answers-143044552.html?src=rss
Établi 3mo | 19 déc. 2024, 15:10:13


Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

A big Playdate sale discounts 13 of our favorite games

It's the second anniversary of the Playdate's Catalog game store and to celebrate, you can get a bunch of great Playdate gam

7 mars 2025, 02:30:06 | Engadget
The first private asteroid mission probe is probably lost in deep space

It was a swing and a miss for the first private attempt at an asteroid mission, but the company is still chalking it up as a win. California startup AstroForge launched a spacecraft dubbed Odin on

7 mars 2025, 00:10:16 | Engadget
Instagram is experimenting with a Discord-like ‘community chat’ feature

It seems that Instagram is working on a “community chat” feature that allows people to organize groups of up to 250 people in the app. The so-far unreleased feature was

7 mars 2025, 00:10:15 | Engadget
ChatGPT for macOS can now directly edit Xcode projects

ChatGPT on macOS is about to become more useful for coding. With t

6 mars 2025, 21:50:04 | Engadget
House Republicans subpoena Google over alleged censorship

Google is once again in the crosshairs of Republicans in Congress because of alleged censorship,

6 mars 2025, 21:50:03 | Engadget
The MagicX Zero 40 handheld features a vertical display for DS emulation

The Nintendo DS is one of the toughest consoles to emulate, for an obvious reason. It’s the two screens. This is even an issue with ports. Some developers avoid the problem by mushing everything to

6 mars 2025, 19:30:26 | Engadget
Prime Gaming's March freebies include Saints Row: The Third and Mafia II remasters

It's Thursday, which means there are some more PC games that Amazon Prime members can claim for free. Amazon has also revealed the entire slate of freebies that subscribers can snag throughout Marc

6 mars 2025, 19:30:25 | Engadget