OpenAI's new GPT-4.5 model is a better, more natural conversationalist

In what has already been a busy past few days for new model releases, OpenAI is capping off the week with a research preview of GPT-4.5. The company is touting the new system as its largest and best model for chat yet. In early testing, OpenAI says people found GPT-4.5 to be a more natural conversationalist, with the ability to convey warmth and display a kind of emotional intelligence.

In one example shared by OpenAI, a person tells ChatGPT they're going through a hard time after failing a test. Where the company's previous models, including GPT-4o and o3-mini, might commiserate with the individual before offering a long list of unsolicited advice, GPT-4.5 takes a different tact. "Want to talk about what happened, or do you just need a distraction? I'm here either way," the chatbot says when powered by GPT-4.5.

The gains shown by GPT-4.5 are the result of advancements OpenAI made in unsupervised learning. With unsupervised learning, a machine learning algorithm is given an unlabeled data set and left to its own devices to find patterns and insights. GPT-4.5 doesn't "think" like the company's state-of-the-art reasoning models, but in training the new model OpenAI made architectural enhancements and gave it access to more data and compute power. "The result is a model that has broader knowledge and a deeper understanding of the world, leading to reduced hallucinations," the company says.

Speaking of reduced hallucinations, OpenAI measured how much better GPT-4.5 in that regard. When put through SimpleQA, an OpenAI-designed benchmark that tests large language models on their ability to answer "straightforward but challenging knowledge questions," GPT-4.5 beat out o3-mini, GPT-4o and even o1 with a hallucination rate of 37.1 percent. Obviously, the new model doesn't solve the problem of AI hallucinations altogether, but it is a step in the right direction.

Despite its relative strengths over GPT-4o and o3-mini, GPT-4.5 isn't a direct replacement for those models. Compared to OpenAI's reasoning systems, GPT-4.5 is "a more general-purpose, innately smarter model." Additionally, it's not natively multimodal like GPT-4o, meaning it doesn't work with features like Voice Mode, video or screensharing. It’s also "a very large and compute-intensive model."

It's best to think of GPT-4.5 as a stepping stone to systems OpenAI plans to offer in the future. In fact, Sam Altman said as much earlier this month when he shared the company's roadmap, noting GPT-4.5 would be "our last non-chain-of-thought model" — referring to the fact that the new system doesn't solve problems by tackling them step by step like OpenAI's reasoning models do. Its successor, GPT-5, will likely integrate many of OpenAI's latest technologies, including its frontier o3 model. OpenAI reiterated that today, saying it plans to bring GPT-4.5's "unique strengths, including broader knowledge, stronger intuition, and greater 'EQ,' to all users in future models."

In the meantime, ChatGPT Pro subscribers can begin using GPT-4.5 starting today, with Pro and Team users slated to gain access starting next week.

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-new-gpt-45-model-is-a-better-more-natural-conversationalist-200035185.html?src=rss https://www.engadget.com/ai/openais-new-gpt-45-model-is-a-better-more-natural-conversationalist-200035185.html?src=rss

Établi 2mo | 2 mars 2025, 18:10:47

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

How to watch and follow LlamaCon 2025, Meta's first generative AI developer conference, today

After a couple years of having its open-source Llama AI model be just a part of its Connect conferences, Meta is breaking things out and hosting an entirely generative AI-focused developer conferen

29 avr. 2025, 10:20:15 | Engadget

Wholesome Direct 2025 will premiere on June 7

Wholesome Direct, an annual showcase of cute and cozy games, is returning on Saturday, June 7 at 12PM ET / 9AM PT. This year's event will show off "a vibrant lineup of artistic, uplifting, and emot

28 avr. 2025, 22:40:18 | Engadget

There’s a massive power outage cross Spain, Portugal and parts of France

Spain, Portugal and parts of France have experienced a

28 avr. 2025, 20:30:19 | Engadget

How to delete your Twitter (or X) account

There are plenty of good reasons to delete your X account, whether it's because of a general desire to

28 avr. 2025, 20:30:18 | Engadget

Mycopunk is an upbeat love letter to extraction shooters

The extraction-shooter genre is getting a little more crowded and a lot more stylish with the announcement of Mycopunk, a four-player, first-person romp from indie studio Pigeons at Play a

28 avr. 2025, 20:30:16 | Engadget

Researchers secretly experimented on Reddit users with AI-generated comments

A group of researchers covertly ran a months-long "unauthorized" experiment in one of Reddit’s most popular communities using AI-generated comments to test the persuasiveness of large language mode

28 avr. 2025, 20:30:15 | Engadget

Russian regulators are trying to seize assets from the developers of World of Tanks

Top executives from Wargaming and Lesta Games, the joint developers of World of Tanks, could have their stakes in their respective companies seized by the Russian government, according to

28 avr. 2025, 20:30:14 | Engadget

Techie