This is Nvidia’s lesser-known plan to stay dominant in the AI chip business

Welcome to AI Decoded, Fast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week here.

Nvidia’s big news Monday wasn’t a new chip, it was a strategy

It’s a good time to be Nvidia CEO Jensen Huang. Right now, Nvidia dominates the market for the chips needed to run AI models. Nvidia stock has tripled in value over the past 12 months. And delivering a keynote speech at San Jose’s SAP Center on Monday as part of the company’s GTC developer conference, Huang looked like a literal rock star. No wonder one attendee dubbed the event “AI Woodstock.”

At the keynote Huang announced a new graphics processing unit (GPU) called Blackwell that it says is up to 30 times as fast as its predecessor (Hopper) and far more efficient. That’s obviously big news—Nvidia will have to keep the accelerator pressed down to stay ahead of challengers such as Intel, AMD, Cerberus, and SambaNova—but the bigger story from the conference concerns how Nvidia will ensure its dominant place in AI even when its chips aren’t markedly faster than others.

Nvidia also announced Monday a new product called NIM (Nvidia Inference Microservices), a “container” of all the software an enterprise might need to put AI models to work. This includes application programming interfaces (APIs) to popular foundation models, software needed to deploy open-source models, pre-built models and software needed to access and process the company’s own proprietary data, and software links to popular business software such as SAP and the cybersecurity-focused CrowdStrike.

In 2023, many enterprises learned the hard way that deploying AI models is a messy business that requires building a lot of infrastructure and some PhDs on deck to make it all work. NIM is trying to package up all the major components that fit around the models, and abstract some of the deep technical stuff into controls that non-PhDs can use. Yes, other companies, including the major cloud providers, are doing this, but NIM is focused on making all the components work seamlessly and efficiently with Nvidia’s hardware. It’s similar to Apple’s superpower, which is producing both software and hardware and integrating them so tightly that they bring out the best in each other.

It’s clear that Nvidia isn’t content with being just a chip supplier. It wants to be a tech company on the same level as Apple, Google, and Meta. And becoming a platform player is a tried-and-true way of reaching that rarified air.

Why Apple using Google’s Gemini is disastrous and unlikely

Bloomberg reporter Mark Gurman notes that Google and Apple have been in talks to add Google cloud-based AI service to the iPhone. The service would be powered by Google’s Gemini AI models, Gurman writes, citing unnamed sources. This would be an extension of Google’s current arrangement with Apple in which it pays billions per year to supply the default search experience on the iPhone.

The deal would be a boon for Google’s generative AI efforts; there are currently about two billion active iOS devices in use around the world. The exact use of the Gemini model on the iPhone remains unclear, but it’s possible that the model would anchor some form of chatbot, or perhaps a writing app. It could also power a form of conversational search similar to Google’s experimental Search Generative Experience.

What is certain, however, is that the antitrust environment around tech has changed a lot since Google began paying to put its search on the iPhone. The Federal Trade Commission under the leadership of Lina Khan would almost certainly open an investigation into a big money deal to put Gemini on the iPhone in some form. The FTC last summer opened a probe into Microsoft’s large-scale investment in OpenAI, and Google’s and Amazon’s investments in Anthropic. (Gurman reports that Apple has also held talks with OpenAI to provide some form of AI function.)

A deal with Google would suggest that Apple sees generative AI as the forte of another company. This is somewhat surprising because Apple has been working with machine learning for years, and has deployed features driven by that technology on its devices, including several camera features. In 2018 Apple even poached Google’s then-head of AI John Giannandrea to lead its own AI efforts. Apple was the first tech company to embrace a voice assistant, Siri, on its devices way back in 2011.

Apple has also developed its own generative AI models over the past few years, but the company may not have been able to advance the capabilities of its models as quickly as Google and OpenAI. Apple’s big opportunity is offering privacy-protecting personal AI apps powered by models that run mostly or completely on-device.

InflectionAI wasn’t bought by Microsoft—it was absorbed by it

When I spoke to InflectionAI cofounder Mustafa Suleyman last September, he’d landed a huge $1.3 billion funding round (at a $4 billion valuation). His new book about the future impact of AI had just come out. Inflection’s app, an emotionally intelligent personal AI assistant called Pi, was doing well. “This is the arrival of a new era in computing,” he told me. “This is going to be like bringing your digital life with you wherever you are.” He bragged that his company had been the first to get Nvidia’s latest H100 servers—22,000 of them in a $1.2 billion cluster.

What a difference six months makes. On Tuesday Suleyman confirmed that he and most of Inflection’s 70 employees have taken jobs at Microsoft, which had earlier made an investment in the fledgling company. But don’t call it an acquisition, an Inflection spokesperson was quick to point out on the phone Tuesday. Inflection would stay around as a B2B company and sell API access to the Inflection generative AI model that powers Pi. Microsoft will also sell access to the model via its Azure cloud. The spokesperson declined to say exactly how many employees would be going to Microsoft, nor did he know what would become of the $1.2 billion server cluster.

Terms of the “agreement” were not divulged, but Inflection AI co-founder Reid Hoffman said on LinkedIn Tuesday that the Microsoft deal “means that all of Inflection’s investors will have a good outcome today.” It’s unclear how exactly investors can have a “good outcome” if Microsoft isn’t buying their shares at a premium. Certainly the investors didn’t foresee making their multiples from a B2B company that collects API fees.

When all is said and done a promising company that was developing large AI models independent of Google, Amazon, or Microsoft has vanished. The brains that designed the Inflection LLM and the Pi app (which will live on for the time being) are now under Microsoft’s roof. The brain power in AI continues to converge with the big money in the tech industry.

More AI coverage from Fast Company: 

Want exclusive reporting and trend analysis on technology, business innovation, future of work, and design? Sign up for Fast Company Premium.

https://www.fastcompany.com/91065684/this-is-nvidias-lesser-known-plan-to-stay-dominant-in-the-ai-chip-business?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creată 1y | 20 mar. 2024, 17:50:03


Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

These tech companies are building healthier social media habits for kids

The last year has seen a global reckoning with the effects of social media on kids. Australia banned

23 mar. 2025, 12:30:02 | Fast company - tech
This wellness app is like TikTok for your feelings

Would you share the pages of your journal with a bunch of strangers, because that’s the idea behind social wellness app Exist. 

The new

23 mar. 2025, 05:30:03 | Fast company - tech
Yes, Apple is delaying some AI features. But does it really matter?

Earlier this month, Apple officially announced that it would be postponing the launch of some planned Apple Intelligence features to a later, unspecified date in the future. These features mainly

22 mar. 2025, 10:50:06 | Fast company - tech
Suffering from loneliness? These businesses may have a cure

Loneliness isn’t just a lingering by-product of COVID lockdowns—it’s a public health crisis. The impacts of social isolation are said to be as detrimental to human health as

22 mar. 2025, 10:50:06 | Fast company - tech
Anthropic is adding web search to its Claude chatbot in a very smart way

Anthropic announced Thursday that it has added web search capability to its Claude chatbot. It’s not a new feature to the AI world—but the company’s approach stands as one of the most thoughtful t

21 mar. 2025, 23:20:06 | Fast company - tech
In this horror game, the monster can see you through your webcam

If the thought of being hunted by something that can see your every move makes your skin crawl, you might want to steer clear of Eyes Never Wake.

This viral horror game takes im

21 mar. 2025, 21:10:03 | Fast company - tech
Fewer than 500 people are responsible for $3.2 trillion of artificial crypto trading

Market manipulation in the cryptocurrency world is rampant—and fewer than 500 people are responsible for as much as $250 million a year in profits and over $3.2 trillion in artificial trading, acc

21 mar. 2025, 18:40:04 | Fast company - tech