The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop
DeepSeek's surprisingly affordable AI model challenges industry giants. The company's new chatbot boasts impressive capabilities, contributing to a significant drop in NVIDIA's stock price. Its success stems from a unique combination of innovative technologies and significant, albeit undisclosed, investment.
Image: ensigame.com
DeepSeek V3 leverages several cutting-edge techniques: Multi-token Prediction (MTP) for enhanced accuracy and efficiency; Mixture of Experts (MoE), employing 256 neural networks for accelerated training; and Multi-head Latent Attention (MLA) to ensure crucial details aren't overlooked.
Image: ensigame.com
While DeepSeek initially claimed a mere $6 million training cost, SemiAnalysis revealed a far more substantial infrastructure: approximately 50,000 Nvidia GPUs, totaling around $1.6 billion in server investment and $944 million in operational expenses. This includes a substantial workforce, with some researchers earning over $1.3 million annually.
Image: ensigame.com
DeepSeek's independent structure, fueled by substantial funding from its parent company, High-Flyer, and its ownership of its data centers, allows for rapid innovation and efficient resource allocation. This contrasts sharply with the larger, more bureaucratic structures of its competitors.
Image: ensigame.com
Although the initial $6 million figure is misleading, omitting significant research and infrastructure costs, DeepSeek's overall investment of over $500 million in AI development still represents a comparatively efficient approach. This is highlighted by comparing training costs: DeepSeek's R1 cost $5 million, while ChatGPT-4 cost a reported $100 million. DeepSeek's success underscores the potential of well-funded, agile AI companies to compete effectively, even if the "budgetary revolution" narrative is somewhat inflated.



