OpenAI's new GPT-4.5 model is a better, more natural conversationalist

In what has already been a busy past few days for new model releases, OpenAI is capping off the week with a research preview of GPT-4.5. The company is touting the new system as its largest and best model for chat yet. In early testing, OpenAI says people found GPT-4.5 to be a more natural conversationalist, with the ability to convey warmth and display a kind of emotional intelligence.

In one example shared by OpenAI, a person tells ChatGPT they're going through a hard time after failing a test. Where the company's previous models, including GPT-4o and o3-mini, might commiserate with the individual before offering a long list of unsolicited advice, GPT-4.5 takes a different tack. "Want to talk about what happened, or do you just need a distraction? I'm here either way," the chatbot says when powered by GPT-4.5.

The gains shown by GPT-4.5 are the result of advancements OpenAI made in unsupervised learning. With unsupervised learning, a machine learning algorithm is given an unlabeled data set and left to its own devices to find patterns and insights. GPT-4.5 doesn't "think" like the company's state-of-the-art reasoning models, but in training the new model OpenAI made architectural enhancements and gave it access to more data and compute power. "The result is a model that has broader knowledge and a deeper understanding of the world, leading to reduced hallucinations," the company says.

Speaking of reduced hallucinations, OpenAI measured how much better GPT-4.5 is in that regard. When put through SimpleQA, an OpenAI-designed benchmark that tests large language models on their ability to answer "straightforward but challenging knowledge questions," GPT-4.5 beat out o3-mini, GPT-4o and even o1 with a hallucination rate of 37.1 percent. Obviously, the new model doesn't solve the problem of AI hallucinations altogether, but it is a step in the right direction.

Despite its relative strengths over GPT-4o and o3-mini, GPT-4.5 isn't a direct replacement for those models. Compared to OpenAI's reasoning systems, GPT-4.5 is "a more general-purpose, innately smarter model." Additionally, it's not natively multimodal like GPT-4o, meaning it doesn't work with features like Voice Mode, video or screensharing. It’s also "a very large and compute-intensive model."

It's best to think of GPT-4.5 as a stepping stone to systems OpenAI plans to offer in the future. In fact, Sam Altman said as much earlier this month when he shared the company's roadmap, noting GPT-4.5 would be "our last non-chain-of-thought model" — referring to the fact that the new system doesn't solve problems by tackling them step by step like OpenAI's reasoning models do. Its successor, GPT-5, will likely integrate many of OpenAI's latest technologies, including its frontier o3 model. OpenAI reiterated that today, saying it plans to bring GPT-4.5's "unique strengths, including broader knowledge, stronger intuition, and greater 'EQ,' to all users in future models."

In the meantime, ChatGPT Pro subscribers can begin using GPT-4.5 starting today, with Plus and Team users slated to gain access starting next week.

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-new-gpt-45-model-is-a-better-more-natural-conversationalist-200035185.html?src=rss
Created 24d | March 2, 2025, 18:10:47

