An AI watchdog accused OpenAI of using copyrighted books without permission

An artificial intelligence watchdog is accusing OpenAI of training its default ChatGPT model on copyrighted book content without permission.

In a new paper published this week, the AI Disclosures Project alleges that OpenAI likely trained its GPT-4o model using nonpublic material from O’Reilly Media. The researchers used a legally obtained dataset of 34 copyrighted O’Reilly books and found that GPT-4o showed “strong recognition” of the company’s paywalled content. By contrast, GPT-3.5 Turbo appeared more familiar with publicly accessible O’Reilly book samples.

“These results highlight the urgent need for increased corporate transparency regarding pre-training data sources as a means to develop formal licensing frameworks for AI content training,” the authors wrote in the paper. Tim O’Reilly, one of the paper’s authors, is a cofounder and CEO of O’Reilly Media.

An OpenAI spokesperson didn’t immediately respond to Fast Company’s request for comment.

Training data lies at the heart of all artificial intelligence models. Large language models (LLMs) require enormous amounts of information, which they draw on when they generate text or images for users.

OpenAI has struck some licensing deals that allow it to train its models on certain content. But the company, which recently raised new funding and is valued at $300 billion, has also come under fire for how it sources certain content. The New York Times, for example, is leading a charge against OpenAI and minority owner Microsoft over alleged copyright infringement.

The researchers acknowledged limitations in their study but argued that the issue is likely part of a broader systemic problem in how large language models are developed.

“Sustainable ecosystems need to be designed so that both creators and developers can benefit from generative AI,” the authors wrote. “Otherwise, model developers are likely to rapidly plateau in their progress, especially as newer content becomes produced less and less by humans.”


https://www.fastcompany.com/91310223/an-ai-watchdog-accused-openai-of-using-copyrighted-books-without-permission?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created Apr 2, 2025, 20:30:07

