DeepSeek was not built for $5 million: Bernstein challenges AI firms' claim

Bernstein said that this figure is misleading and does not reflect the true costs involved in research, experimentation, and infrastructure

Deepseek
Deepseek (Photo: Reuters)
Md Zakariya Khan New Delhi
2 min read Last Updated : Jan 29 2025 | 4:21 PM IST
A recent report by Bernstein has examined the buzz around DeepSeek, an AI firm known for developing advanced language models. While the AI firm claims that it built an OpenAI competitor for just $5 million, Bernstein said that this figure is misleading and does not reflect the true costs involved in research, experimentation, and infrastructure.

DeepSeek’s key AI Models

DeepSeek has introduced two primary AI models: DeepSeek-V3 and DeepSeek R1. The V3 model is a large language model (LLM) that employs a Mixture-of-Experts (MOE) architecture, a method that distributes tasks among smaller models to optimise efficiency.
 
The model comprises 671 billion parameters, with 37 billion active at any given time. It also integrates Multi-Head Latent Attention (MHLA), which reduces memory usage, and mixed-precision training using FP8 computation, enhancing efficiency. To train the V3 model, DeepSeek used 2,048 NVIDIA H800 GPUs over a two-month period, consuming approximately 2.7 million GPU hours for pre-training and 2.8 million GPU hours in total.
 
While estimates suggest a training cost of $5 million based on a USD 2 per GPU hour rental rate, Bernstein’s report argues that this calculation excludes crucial factors such as research, experimentation, and infrastructure expenses.
 

Who benefits from DeepSeek’s innovations?

Bernstein predicts that AI infrastructure providers will gain the most from DeepSeek’s advancements. Companies in data infrastructure, identity verification, observability, and communications are expected to benefit. Key firms identified include:
 
·  Confluent (CFLT) – Data streaming solutions
 
·  Okta (OKTA) – Identity verification
 
·  Datadog (DDOG) – Cloud observability
 
·  Twilio (TWLO) – Communications tooling
 
·  Cloudflare (NET) – Browser delivery services
 
Twilio, in particular, has seen a 93 per cent stock return in the past year, with analysts raising earnings forecasts.
 

The future of AI adoption

While DeepSeek has demonstrated cost-efficient AI training, large-scale enterprise adoption remains slow due to high costs and validation challenges. However, as AI models become more scalable and affordable, software infrastructure firms could see accelerated adoption and growth.
 
While DeepSeek’s models are impressive, claims about their low development costs are exaggerated. The biggest winners in the AI race will likely be the companies powering AI infrastructure, rather than just the AI model creators themselves.
*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

Topics :Deepseekartifical intelligenceChatGPTOpenAIMicrosoftBS Web Reports

First Published: Jan 29 2025 | 4:12 PM IST

Next Story