OpenAI’s GPT-4o brings us closer to the ‘Her’ experience

OpenAI held a webcast Monday to roll out a new version of its free ChatGPT app, which sounds and acts a lot like the AI in the 2013 Spike Jonze film, Her.

The experience is powered by a new version of its GPT-4 large language model—available on desktop and mobile—called GPT-4o (“GPT-four-oh”). The new model, OpenAI says, returns answers much faster than GPT-4, and improves on its text, vision, and audio capabilities.

The model is a showcase for OpenAI’s development of multi-modal AI. GPT-4o can recieve and reason about text, audio, and visual inputs, then deliver outputs in natural language and natural-sounding voice.

OpenAI researcher Mark Chen demonstrated the new model’s impressive conversational capabilities during a live demo. He told the chatbot that he was nervous about the demo, and asked her for advice to help calm down. Chen then mock-hyperventilated into phone, to which the app responded “Mark! You’re not a vacuum cleaner.” The AI was spontaneous and funny, much like the voice assistant (voiced by Scarlett Johansson) in Her, which has become a North Star for people developing consumer AI.

The app was asked to tell a story with various levels of “drama” in its voice, which it did, convincingly. The AI then told the same story in a stereotypical robot’s voice, and then again in sing-song fashion.

Chen also demonstrated how he could interrupt the AI voice, and she would quickly stop talking. ChatGPT, in other words, is getting more “emotionally” intelligent. This is very similar to what Inflection.ai was developing with its Pi AI app. But Inflection.ai was essentially bought out by Microsoft, the same tech giant that owns almost half of OpenAI.

The ChatGPT app also has the ability to “see” things and reason about them. Through the phone camera, the app was shown a math problem written on a white board and asked for help in working it out. It was then asked to explain some computer code. The app also did a live translation from Italian to English and back.

The new features in the ChatGPT app will roll out to users of the free version of ChatGPT over the next few weeks. OpenAI says it’s also making GPT-4o available to developers through its API. OpenAI’s live streamed announcement Monday seemed timed to steal some thunder from Google, which is expected to make a series of AI-related announcements at its I/O developer conference Tuesday.

https://www.fastcompany.com/91123206/openai-gpt-4o-announcement?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creado 11mo | 13 may 2024, 18:40:08


Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

Why Amazon is doubling down on movie theaters

Amazon is betting big on movie theaters—even if it isn’t counting on mega profits.

The Silicon Valley giant

8 abr 2025, 13:20:04 | Fast company - tech
Meet the Space Resistance Companies

If you’re at all unsettled by the amount of power that Elon Musk wields on Earth, don’t look up. There are now essentially three big players controlling what goes into space and who gets access to

8 abr 2025, 13:20:03 | Fast company - tech
How ChatGPT is helping bend websites to my will

I’m a writer, not a programmer, so until recently a lot of the hype around ChatGPT’s abitilies as a coding tool went over my head.

But then I realized generative

8 abr 2025, 10:50:06 | Fast company - tech
Meta is bringing stricter parental controls to Facebook and Messenger

Meta is bringing its Teen Accounts, which have stricter parental controls, to its Facebook and Messenger platforms on Tuesday, expanding its teen service from just Instagram.

The social

8 abr 2025, 10:50:05 | Fast company - tech
Europe considers new tariffs that could punish tech companies like Google, Meta, and Apple

As the European Union looks at how best to respond to Donald Trump’s trade war, officials are considering f

7 abr 2025, 21:10:06 | Fast company - tech