How DeepSeek engineered a hyper-efficient rival to ChatGPT

DeepSeek is No. 12 on the list of the World’s 50 Most Innovative Companies of 2025. Explore the full list of companies that are reshaping industries and culture.

The Chinese company DeepSeek delivered a one-two punch in December and January, when it released a pair of state-of-the-art AI models that require far less computing power and capital than those of western AI companies. This immediately called into question the belief that the U.S. leads the world in AI—and roiled the markets.

Generative models use a lot of memory and computing power while they’re reasoning through problems because they must “remember” a lot of contextual information. DeepSeek invented a way to compress some of that data, easing the workload of the GPUs during both model training and content generation.

With a U.S. ban preventing DeepSeek from accessing the most powerful Nvidia GPUs, the company innovated on known engineering approaches to achieve efficiencies that conserved GPU horsepower. DeepSeek’s researchers found a
way to improve what’s known as mixture-of-experts architecture that divides a large language model into segments that contain specialized
knowledge.

The company also invented a more efficient way to teach its smaller model, DeepSeek-R1, how to reason. The researchers fed a relatively small amount of reinforcement learning data (questions and answers generated by its larger DeepSeek-V3 model, along with its “train of thought”) to R1. The researchers then gave the model a series of problems to solve, and rewarded it with special code for good answers. Eventually R1 began to “think” about the most promising routes to favorable answers and the reward.

DeepSeek faces fierce competition from other AI labs, but instead of keeping its research breakthroughs a secret, it shared its methods through research papers and by open-sourcing its models for others to use and modify. The message: Cutting-edge large language models are becoming an open secret.

Explore the full 2025 list of Fast Company’s Most Innovative Companies, 609 organizations that are reshaping industries and culture. We’ve selected the companies making the biggest impact across 58 categories, including advertisingapplied AIbiotechretailsustainability, and more.

https://www.fastcompany.com/91270727/deepseek-most-innovative-companies-2025?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Utworzony 21d | 18 mar 2025, 11:50:23


Zaloguj się, aby dodać komentarz

Inne posty w tej grupie

Why Amazon is doubling down on movie theaters

Amazon is betting big on movie theaters—even if it isn’t counting on mega profits.

The Silicon Valley giant

8 kwi 2025, 13:20:04 | Fast company - tech
Meet the Space Resistance Companies

If you’re at all unsettled by the amount of power that Elon Musk wields on Earth, don’t look up. There are now essentially three big players controlling what goes into space and who gets access to

8 kwi 2025, 13:20:03 | Fast company - tech
How ChatGPT is helping bend websites to my will

I’m a writer, not a programmer, so until recently a lot of the hype around ChatGPT’s abitilies as a coding tool went over my head.

But then I realized generative

8 kwi 2025, 10:50:06 | Fast company - tech
Meta is bringing stricter parental controls to Facebook and Messenger

Meta is bringing its Teen Accounts, which have stricter parental controls, to its Facebook and Messenger platforms on Tuesday, expanding its teen service from just Instagram.

The social

8 kwi 2025, 10:50:05 | Fast company - tech
Europe considers new tariffs that could punish tech companies like Google, Meta, and Apple

As the European Union looks at how best to respond to Donald Trump’s trade war, officials are considering f

7 kwi 2025, 21:10:06 | Fast company - tech