Executive Summary

As of 3 March 2026 morning, DeepSeek AI V4 has not yet been released, though it is widely anticipated any time now — ahead of China’s Two Sessions in 2026. Building on the January 2026 Engram paper, V4 is expected to match the capabilities of leading Western AI models at a considerably lower cost. It will be fully open-source, enabling firms to operate it and keep all their data securely on their own servers.

According to credible industry leaks, V4 is expected to introduce native multimodal capabilities (interpreting images and videos alongside text), a substantial 1 million token context window (allowing it to process entire manuals or codebases in one go), and expert-level coding performance that internal benchmarks suggest surpasses both Claude and GPT. The model is also said to be built on a new “manifold Hyperconnection” architecture that addresses the industry-wide problem of catastrophic forgetting, meaning it maintains old knowledge reliably as it learns new skills. Strategically, it is being optimised for domestic chips such as Huawei and Cambricon and Hygon, reducing dependence on foreign infrastructure, and no early access granted to NVIDIA or AMD.

DeepSeek named this breakthrough “Engram” — a term from neuroscience referring to the instant memory trace that allows you to recall simple facts effortlessly. It gives AI the same capability: a quick, cost-effective memory for straightforward questions, saving the full, powerful thought process for complex problems. This could make advanced AI significantly more affordable and secure for regulated industries worldwide.

It would be wise to involve the Technology, Legal, Compliance, and Cybersecurity teams in developing a post-release proof-of-concept for the most valuable use cases, such as thousand-page regulatory filings, contracts, medical records, and codebases. Conducting a structured vendor risk assessment is essential, considering the current geopolitical climate.

DeepSeek V4: What to Expect?

Since DeepSeek has not yet made an official announcement, the following is based on credible leaks and industry speculation. If the rumours are accurate, V4 will introduce several technical improvements important to our business.

DeepSeek is Open Source: An Advantage None of Its Rivals Can Match

Claude, GPT, and Gemini cannot be downloaded. These models are proprietary—your data leaves your organisation, goes to external servers, and is subject to their terms.

DeepSeek V4 will be open-source, enabling organisations to operate it on their own servers and maintain their data locally. For regulated sectors, this significantly alters the risk profile—although, unlike a cloud API, open-source models demand ongoing maintenance, security updates, and internal expertise for safe deployment. It is a capability to develop, not merely a service to purchase. This necessitates a skilled IT team and standard servers—an manageable, one-time investment that permanently reduces ongoing cloud expenses.

DeepSeek Knows When to Think Deep, When to Simply Look It Up

Ask today’s AI, “Who is Albert Einstein?” and it uses the same costly computing power as when you ask it to explain relativity. It cannot tell easy from hard, wasting precious computing resources. In January 2026, DeepSeek published a paper introducing an innovative improvement. They named their creation Engram—a term borrowed from neuroscience, where an “engram” is the brain’s instant memory trace (the reason you can remember a simple fact effortlessly).

DeepSeek’s Engram functions similarly for AI: a quick, cost-effective memory system for simple questions, conserving the full, powerful reasoning engine for truly complex problems. This breakthrough could make advanced AI considerably more affordable and secure for regulated industries worldwide. The results are notable. When tested on retrieving facts from very long documents, standard AI scores 84 out of 100, while Engram scores 97.

For teams reviewing thousand-page contracts or filings, this makes the difference between reliable answers and missing details that turn costly. It also makes AI much more economical to run. Basic knowledge shifts to memory, which costs 15–20 times less, so DeepSeek V4 operates efficiently on standard company servers—not on costly data-centre equipment for each query.

DeepSeek Built at a Fraction of the Cost

Training a high-end AI model like GPT or Claude costs over $1 billion. DeepSeek developed its 2025 V3 model for approximately $5.6 million. V4 is expected to be equally efficient.

When this news emerged in 2025 after the V3 release, it took the stock market by surprise. NVIDIA and other shares collapsed. Investors believed that developing AI required huge costs. DeepSeek proved them wrong. The Engram paper explains how.

DeepSeek Born in a Hedge Fund—and Why That Matters

DeepSeek was founded by Liang Wenfeng, a quantitative hedge fund manager, rather than by a Silicon Valley entrepreneur. In 2015, he co-founded China’s top quant fund, High-Flyer, which utilised machine learning for market predictions. Eight years later, in 2023, Liang spun off DeepSeek as an AI research company funded by High-Flyer.

Quant trading heritage influences DeepSeek’s focus, emphasising efficiency in solving complex problems with limited resources. While US labs may spend billions on breakthroughs, DeepSeek seeks smarter solutions, as the Engram paper shows. This engineering culture—squeezing maximum capability from minimal resources—is exactly why V4 is expected to deliver Western-tier performance at a fraction of the cost.

DeepSeek Distillation Attacks

A distillation attack is when one AI company secretly feeds millions of questions to a rival’s model, then uses those answers to train their own model — absorbing its intelligence without permission.

In February 2026, Anthropic accused DeepSeek of creating 24,000 fake accounts and generating 16 million conversations to secretly copy their AI models—similar to studying a rival’s exam papers. OpenAI made comparable complaints. If true, this was a deliberate and large-scale effort.

Anthropic is justified in criticising it, but this practice, as I understand, is widespread in the industry—both OpenAI and Anthropic have used it themselves. OpenAI faces several copyright lawsuits, and Anthropic have also settled a similar case. No lawsuit has been filed against DeepSeek; they have not responded publicly, and the allegations remain unproven.

DeepSeek in the wider AI Landscape – in brief

DeepSeek compared with others – in brief

DeepSeek related Risks & Challenges

DeepSeek – Next Steps

  1. Monitor official DeepSeek channels for V4 announcement
  2. Prepare cross-functional team (Tech, Legal, Compliance, Cyber) for post-release PoC
  3. Initiate a vendor risk assessment framework adapted for open-source AI
  4. Identify use cases: regulatory filings, contracts, medical records, codebases

Sources:

  1. DeepSeek – Engram Paper (Jan 2026), V3 Technical Report (baseline)
  2. Analysis – Comparison of Models, Comparisons with DeepSeek-V3
  3. Hugging Face – Open-source weights & self-hosting
  4. GitHub  – DeepSeek V3
  5. IBM – Deepseek Architecture
  6. WaveSpeed – Deepseek V4 1M Token Context
  7. Macaron– Enterprise Security, Data Retention
  8. Labelbox – Complex Reasoning Leaderboard
  9. Skywork – APIFree AI Access
  10. Anthropic – Detecting and preventing distillation attacks (Feb 2026)
  11. Lawsuits / News – Anthropic, OpenAI, Futurism, Financial Times, Register, Italy
  12. Strategy – Huawei, Cambricon, Hygon , Cambricon meteoric rise

About the author

Viren Mantri is a cybersecurity advisor and former senior technology leader across Standard Chartered, UBS, McAfee, and KPMG. With 30 years of navigating the intersection of technology, risk, and regulations, he now helps organisations cut through complexity and make better security decisions.

CC-BY Viren Mantri, 2026, licensed under a Creative Commons Attribution 4.0 International License.

Disclaimer: All views expressed here are entirely mine.