Valuation & Funding
DeepSeek has not disclosed any external funding rounds as of March 2026. Since its founding in 2023, the company has been funded by High-Flyer Quant, the quantitative hedge fund established by DeepSeek founder Liang Wenfeng, which has served as its primary source of research and development capital.
In early 2025, reports circulated that DeepSeek was considering its first external financing round. Those reports were subsequently denied by people close to the company, and no external raise has been publicly confirmed. There are no third-party investors, no disclosed valuation, and no externally raised capital on record.
Product
DeepSeek is an AI model lab whose output reaches users through three surfaces: a free consumer chat app available on web and mobile, a self-serve developer API, and a library of open-weight models that anyone can download and run independently.
The consumer product works like a general-purpose AI assistant. A user opens the web interface or mobile app, types a question or uploads a file, and gets a response.
The product accepts text, images, files, and voice as inputs, and it can optionally pull in real-time search results when the query calls for current information. The experience is free with no ads or in-app purchases.
What makes DeepSeek's product distinct from a standard chatbot is the two-gear system built into its flagship model. In non-thinking mode, the model answers quickly and directly.
In thinking mode—exposed in the API as deepseek-reasoner—the model works through a problem step by step internally before producing a final answer, which makes it substantially better at math, multi-step logic, code generation, and self-verification tasks. The current flagship, DeepSeek-V3.2, integrates both modes into a single model, with deepseek-chat mapping to the non-thinking mode and deepseek-reasoner to the thinking mode, both on a 128K context window.
For developers, the API is designed to slot into existing workflows with minimal friction. It is OpenAI-compatible, meaning a developer can often switch from OpenAI to DeepSeek by changing a base URL and model name while keeping the same SDK. DeepSeek also supports the Anthropic API format, including compatibility with Claude Code, so it can be dropped into a second major ecosystem of developer tooling without rewriting integration code.
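A minimal sketch of what that migration looks like in practice, using the OpenAI Python SDK against DeepSeek's documented endpoint. The helper names here (`pick_model`, `ask`) are illustrative, not part of either API; the substantive points are the base URL swap and the `deepseek-chat` / `deepseek-reasoner` model names from DeepSeek's own documentation.

```python
import os

# The only changes from a stock OpenAI-SDK setup are the base URL and the key.
DEEPSEEK_BASE_URL = "https://api.deepseek.com"

def pick_model(thinking: bool) -> str:
    """deepseek-chat maps to non-thinking mode; deepseek-reasoner to thinking mode."""
    return "deepseek-reasoner" if thinking else "deepseek-chat"

def ask(prompt: str, thinking: bool = False) -> str:
    # Imported here so the sketch loads even without the openai package installed.
    from openai import OpenAI
    client = OpenAI(
        api_key=os.environ["DEEPSEEK_API_KEY"],
        base_url=DEEPSEEK_BASE_URL,
    )
    resp = client.chat.completions.create(
        model=pick_model(thinking),
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content
```

The same SDK, request shape, and response parsing carry over unchanged, which is what makes the switching cost close to zero.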
The API supports JSON output for machine-readable structured responses, tool calls for agentic workflows, fill-in-the-middle (FIM) completion for code insertion tasks, and prefix completion, which lets the developer fix the opening of the model's response and have it continue from there.
The tool-call workflow is the most important of these for production use: a developer defines a set of functions in JSON schema format, the model decides when to invoke one, the developer's code executes it and returns the result, and the model incorporates that result into its final answer. In thinking mode, the model can alternate between internal reasoning and tool calls across multiple turns before settling on a response—which makes it suitable for coding agents, search agents, and multi-step enterprise automations.
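The four-step loop above can be sketched in the OpenAI-compatible message format DeepSeek's API uses. The weather tool and the stubbed model turn are hypothetical, for illustration only; in a real integration the model's tool-call decision would come back from `client.chat.completions.create(..., tools=TOOLS)` rather than a hard-coded stub.

```python
import json

# 1. The developer defines the available functions in JSON-schema format.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

# 2. Local implementations the developer's code actually runs (stubbed here).
def get_weather(city: str) -> str:
    return f"22C and clear in {city}"

DISPATCH = {"get_weather": get_weather}

def run_tool_calls(tool_calls: list) -> list:
    """Execute each tool call the model requested, and build the 'tool'-role
    messages that are sent back so the model can produce its final answer."""
    results = []
    for call in tool_calls:
        fn = DISPATCH[call["function"]["name"]]
        args = json.loads(call["function"]["arguments"])
        results.append({
            "role": "tool",
            "tool_call_id": call["id"],
            "content": fn(**args),
        })
    return results

# 3. A stubbed model turn in which the model decided to invoke get_weather.
stub_model_turn = [{
    "id": "call_0",
    "function": {"name": "get_weather", "arguments": '{"city": "Hangzhou"}'},
}]
print(run_tool_calls(stub_model_turn))
```

In thinking mode this loop can repeat across several turns, with the model reasoning internally between tool invocations before it commits to a response.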
DeepSeek-V3.2 is explicitly positioned as a reasoning-first model built for agents, trained on data synthesized across more than 1,800 environments and 85,000-plus complex instructions. That framing reflects a deliberate product shift from answering questions to orchestrating multi-step tasks.
Business Model
DeepSeek operates as a hybrid of open research lab, free consumer distribution channel, and usage-priced developer infrastructure business. It creates value by building frontier-class open models efficiently, distributes that value through free chat and open-source releases, and captures value primarily through API consumption.
The go-to-market is mixed across B2C and B2B. Consumers access the product directly through the web and app at no cost.
Developers and companies access it through a self-serve B2B API, topping up a prepaid balance and paying per token consumed. Downstream software makers who build customer-facing products on top of DeepSeek's API create a B2B2C layer where DeepSeek's model powers end-user experiences without a direct relationship with those end users.
Monetization is usage-based and structured around a cache-hit/cache-miss pricing split that does more than just charge for compute.
By pricing repeated prompt prefixes at roughly one-tenth the cost of fresh input tokens, DeepSeek aligns its pricing with its infrastructure architecture: customers who design their applications around reusable long contexts get substantially lower bills, while DeepSeek shifts more of the serving workload from expensive GPU compute to cheaper disk-backed cache reads. This is both a cost-shaping mechanism and a product differentiator.
The cost structure is built around efficiency-first frontier modeling. DeepSeek's V3 architecture uses a mixture-of-experts design that activates only 37 billion of 671 billion parameters per token, which reduces the compute cost per inference call relative to a dense model of equivalent quality.
FP8 training, sparse attention for long-context workloads, and multi-token prediction further compress the training and serving cost profile. The company has claimed full V3 training required roughly 2.788 million H800 GPU hours—a figure that, if accurate, is dramatically lower than comparable closed-model training runs.
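A back-of-envelope check on the figures above: the fraction of parameters active per token under the V3 mixture-of-experts design, and the company-claimed training budget converted to GPU-years. Both inputs come directly from the text; the conversion itself is simple arithmetic.

```python
TOTAL_PARAMS_B = 671   # total parameters, billions
ACTIVE_PARAMS_B = 37   # parameters activated per token, billions
H800_HOURS = 2.788e6   # company-claimed V3 training cost in H800 GPU hours

active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
gpu_years = H800_HOURS / (24 * 365)

print(f"{active_fraction:.1%} of parameters active per token")  # ~5.5%
print(f"~{gpu_years:.0f} H800 GPU-years of training compute")
```

Only about one in eighteen parameters does work on any given token, which is the mechanical source of the per-call compute saving versus a dense model of the same total size.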
That efficiency story is what makes the pricing strategy viable. DeepSeek has repeatedly cut API prices—most recently by more than 50% in September 2025—while maintaining a cost structure that still implies strong unit economics at scale. The March 2025 operating disclosure showing a theoretical 545% cost-profit ratio on a single day's traffic suggests the gross margin profile on API revenue is unusually high for an infrastructure business, even after accounting for the gap between theoretical and realized revenue.
The open-source strategy is integral to the business model rather than in tension with it. By releasing weights and technical reports under permissive terms—including explicit permission to use API outputs for fine-tuning and distillation—DeepSeek expands its install base, encourages community tooling, and positions its model family as a de facto standard in the open-model ecosystem.
The short-term tradeoff is that anyone can self-host a DeepSeek-derived model without paying DeepSeek directly. The long-term logic is that open distribution builds brand power, developer goodwill, and ecosystem influence that converts into API demand from customers who want the convenience of hosted inference without the operational overhead of running their own cluster.
Competition
Ecosystem owners in China
DeepSeek's most structurally dangerous rivals are not other model labs—they are companies that own the distribution surfaces through which AI reaches users and enterprises.
ByteDance's Doubao is the clearest example. By August 2025, Doubao had surpassed DeepSeek as China's most-used AI app, reaching 157 million monthly active users, with a meaningful share of users who left DeepSeek migrating directly to Doubao.
ByteDance pairs its model development with Volcengine as the enterprise and API arm, and bundles the model layer with Coze, TRAE, vector databases, knowledge tooling, search plugins, and AI media generation. Its coding subscription plan even bundles access to DeepSeek-V3.2 alongside its own Doubao-Seed-Code and rivals like GLM and Kimi—a deliberate strategy to commoditize the underlying model and own the developer relationship instead. The more AI buying shifts toward integrated agent platforms and coding seats rather than raw API access, the more ByteDance's full-stack posture becomes a direct threat to DeepSeek's revenue capture.
Alibaba's Qwen competes with DeepSeek at every layer simultaneously: model quality, API pricing, open-weight releases, and consumer chat via Qwen Chat. Unlike DeepSeek, Alibaba can subsidize model distribution through Alibaba Cloud, bundle models into broader enterprise infrastructure, and use tiered pricing to segment customers across experimentation and production workloads. In large enterprise accounts where procurement prefers a single vendor for compute, storage, governance, and models, Alibaba's vertical integration gives it a structural advantage that DeepSeek's model economics alone cannot overcome.
Tencent's Hunyuan is a different kind of threat. Even when Tencent integrates DeepSeek into WeChat—which it has done—that coopetition is strategically double-edged: DeepSeek gains reach, but Tencent controls the user relationship, learns demand patterns, and can swap in Hunyuan where margins or control are better. If the market shifts from best standalone model to best model inside a super-app workflow, Tencent's distribution moat matters more than benchmark gaps.
Specialized Chinese model labs
Moonshot's Kimi and Zhipu's GLM attack DeepSeek from a different angle: specialization rather than ecosystem breadth.
Kimi K2 and K2 Thinking target developers who care specifically about long-context reasoning, coding, and agent-like execution, with 256K context and explicit positioning around agentic workflows. Moonshot is not trying to win the mass-market chatbot battle; it is trying to own the highest-value developer use cases where benchmark wins convert into repeat usage. That is a direct threat to DeepSeek's strongest wedge.
Zhipu's GLM family overlaps with DeepSeek in the segment most likely to compress margins: open-ish, developer-friendly models with strong coding and reasoning narratives, laddered pricing from free flash tiers to premium reasoning tiers, and built-in web search and tool support. Zhipu does not need to outperform DeepSeek everywhere—it just needs to be good enough, cheap enough, and domestic enough to capture customers at every budget band.
Baidu's ERNIE remains relevant primarily through search distribution, enterprise cloud relationships, and domestic regulatory familiarity. Baidu has also shown willingness to integrate DeepSeek into its own products, which validates DeepSeek technologically but risks reducing it to a component supplier rather than the system-of-record brand for end users.
Global frontier labs and open-model alternatives
OpenAI and Anthropic set the global benchmark ceiling for premium enterprise accounts. For DeepSeek, these firms are less of a price threat than a trust and workflow threat. Multinationals, regulated industries, and Western enterprises may choose OpenAI or Anthropic because of procurement familiarity, security review processes, or board-level risk tolerance around data jurisdiction. DeepSeek's OpenAI-compatible API helps on interoperability but does not erase buyer concerns around governance or vendor support depth.
Google Gemini adds a search-grounding and enterprise productivity dimension that DeepSeek does not currently match. As customers increasingly want search-grounded agents and workspace integrations rather than raw model output, Google can win on workflow completeness even where DeepSeek wins on raw reasoning economics.
In the open-model ecosystem, Meta's Llama family, Mistral, and Alibaba's open Qwen releases all normalize the idea that frontier-class capability should be portable, self-hostable, and cheaper than closed APIs.
DeepSeek benefits from that trend but also loses exclusivity from it. Platforms like Fireworks AI and Together AI list DeepSeek alongside these alternatives, and enterprise buyers—including companies like Hebbia—often want DeepSeek-class models without sending data to DeepSeek directly, routing through inference hosts instead.
That pattern expands DeepSeek's ecosystem footprint but reduces its direct capture of end-customer value. Character.AI has similarly shifted toward open alternatives including DeepSeek, and Poolside references DeepSeek-Coder as a self-hosted option—both patterns that reflect DeepSeek's role as a widely adopted model family rather than a destination platform.
TAM Expansion
Agent platform and developer tooling
DeepSeek's clearest expansion path is from a low-cost reasoning model vendor into a broader agent platform.
The current API already supports tool calls, structured outputs, multi-turn reasoning, and search agent behavior, and DeepSeek-V3.2 is explicitly positioned as a reasoning-first model built for agents.
The company synthesized agent-training data across more than 1,800 environments and 85,000-plus complex instructions for V3.2, signaling that agent capability is a deliberate product investment rather than a byproduct of general model improvement.
The practical opportunity is to become the default reasoning backend for coding agents, browser agents, enterprise search, and automation layers, not just a chat API.
DeepSeek's Chinese release notes already highlight support for Claude Code in thinking mode, which is a concrete go-to-market move: instead of building a competing IDE, DeepSeek inserts itself as the low-cost reasoning engine inside tooling developers already use. Deepening integrations with agent frameworks, IDEs, and workflow orchestration platforms is the highest-ROI near-term growth initiative.
Customer base and self-hosted enterprise deployments
DeepSeek's current paying customer base skews toward cost-sensitive startups, researchers, and API brokers. The larger opportunity is in enterprises and governments that want AI capability without vendor dependence on a closed Western platform.
Because DeepSeek publishes model weights openly on GitHub and Hugging Face under permissive licensing, it can serve buyers who prefer local deployment, customization, or sovereign control over their AI stack.
That broadens the addressable market beyond API subscriptions into a larger ecosystem of integrators, cloud hosts, managed-service providers, and on-premises deployments built around DeepSeek-compatible models.
Thinking Machines and OpenPipe already support DeepSeek in managed fine-tuning APIs, which illustrates how the open-weight strategy converts into enterprise deployment demand even without a direct sales relationship.
The OpenAI-compatible and Anthropic-compatible API formats are the primary customer acquisition lever here. They reduce migration friction to near zero for the large installed base of developers already using those platforms, which shortens adoption cycles and broadens the range of potential customers without requiring a dedicated enterprise sales motion.
Geographic expansion into price-sensitive markets
DeepSeek's open-source distribution and aggressive API pricing make it structurally well-suited to markets where Western frontier model pricing is prohibitive and local AI infrastructure is underdeveloped.
In parts of Asia, Africa, Latin America, and the Middle East, low cost and open deployment flexibility matter more than premium brand preference or enterprise support depth. DeepSeek's model family is already widely available through inference hosts globally, and its permissive licensing means regional cloud providers and telecom operators can build managed services on top of DeepSeek weights without a direct commercial relationship.
That creates a path to geographic TAM expansion that does not require DeepSeek to build a local sales presence in each market.
The domestic Chinese market also has a distinct expansion vector tied to hardware. DeepSeek's efficiency-focused architecture and open inference tooling—including published components like FlashMLA and DeepGEMM—position it as a natural reference software layer for Chinese AI infrastructure built on domestic accelerators rather than NVIDIA hardware.
As enterprises and government buyers seek alternatives less exposed to U.S. export controls, DeepSeek's combination of model quality, open deployment, and inference optimization work gives it a credible role in the domestic hardware stack beyond just model provision.
Risks
Regulatory exclusion: DeepSeek's international expansion is structurally vulnerable to privacy and data-sovereignty enforcement actions that have nothing to do with model quality. Italy blocked the app in January 2025, South Korea paused local downloads in February 2025, and Germany's data protection authority subsequently asked Apple and Google to consider blocking the app over unlawful data transfer concerns. These actions create a real risk that DeepSeek becomes strong in some markets while being structurally shut out of European enterprise accounts, government procurement, and regulated industries—precisely the customer segments with the highest long-term contract value.
Compute chokepoints: DeepSeek's cost leadership depends on sustaining very low-cost, high-availability inference at scale, but that position is exposed to hardware procurement constraints driven by U.S. export controls on advanced chips. U.S. officials alleged in mid-2025 that DeepSeek sought access to restricted advanced semiconductors, and the company's own infrastructure has shown service incidents and partial outages into early 2026. If hardware access tightens or reliability lags demand growth, the efficiency advantage that underpins DeepSeek's pricing strategy and gross margin profile could erode faster than the model roadmap can compensate.
Platform disintermediation: DeepSeek's open-weight strategy and OpenAI-compatible API minimize switching costs, which is a powerful adoption lever but also a structural ceiling on customer retention. As ByteDance, Alibaba, and other ecosystem owners bundle DeepSeek-class models into subscription coding seats, agent platforms, and enterprise cloud stacks, DeepSeek risks becoming an interchangeable model supplier inside brokered interfaces rather than the vendor that owns the customer relationship—capturing a shrinking share of the value it creates as the market shifts from single-model competition to ecosystem competition.
DISCLAIMERS
This report is for information purposes only and is not to be used or considered as an offer or the solicitation of an offer to sell or to buy or subscribe for securities or other financial instruments. Nothing in this report constitutes investment, legal, accounting or tax advice or a representation that any investment or strategy is suitable or appropriate to your individual circumstances or otherwise constitutes a personal trade recommendation to you.
This research report has been prepared solely by Sacra and should not be considered a product of any person or entity that makes such report available, if any.
Information and opinions presented in the sections of the report were obtained or derived from sources Sacra believes are reliable, but Sacra makes no representation as to their accuracy or completeness. Past performance should not be taken as an indication or guarantee of future performance, and no representation or warranty, express or implied, is made regarding future performance. Information, opinions and estimates contained in this report reflect a determination at its original date of publication by Sacra and are subject to change without notice.
Sacra accepts no liability for loss arising from the use of the material presented in this report, except that this exclusion of liability does not apply to the extent that liability arises under specific statutes or regulations applicable to Sacra. Sacra may have issued, and may in the future issue, other reports that are inconsistent with, and reach different conclusions from, the information presented in this report. Those reports reflect different assumptions, views and analytical methods of the analysts who prepared them and Sacra is under no obligation to ensure that such other reports are brought to the attention of any recipient of this report.
All rights reserved. All material presented in this report, unless specifically indicated otherwise is under copyright to Sacra. Sacra reserves any and all intellectual property rights in the report. All trademarks, service marks and logos used in this report are trademarks or service marks or registered trademarks or service marks of Sacra. Any modification, copying, displaying, distributing, transmitting, publishing, licensing, creating derivative works from, or selling any report is strictly prohibited. None of the material, nor its content, nor any copy of it, may be altered in any way, transmitted to, copied or distributed to any other party, without the prior express written permission of Sacra. Any unauthorized duplication, redistribution or disclosure of this report will result in prosecution.