DeepSeek V4预览版性能如何？2026年百万上下文MoE开源模型评测：原理解析、实操步骤、常见问题与优化建议

Overview: A New Era of Cost-Effective Long Context

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context100万token的上下文长度，DeepSeek-V4默认支持此长度，用于处理超长文本。 length.

🚀 DeepSeek-V4 Preview 现已正式上线并开源！欢迎进入高性价比的百万级上下文长度时代。

🔹 DeepSeek-V4-ProDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.

🔹 DeepSeek-V4-ProDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。： 总参数量 1.6T / 激活参数量 49B。性能可与全球顶尖闭源模型抗衡。

🔹 DeepSeek-V4-FlashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。: 284B total / 13B active params. Your fast, efficient, and economical choice.

🔹 DeepSeek-V4-FlashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。： 总参数量 284B / 激活参数量 13B。为您提供快速、高效且经济的选择。

Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!

您现在即可通过 chat.deepseek.com 的专家模式/即时模式进行体验。API 已同步更新并立即可用！

Key Resources

Technical Report: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf > 技术报告： https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf
Open Weights: https://huggingface.co/collections/deepseek-ai/deepseek-v4 > 开源权重： https://huggingface.co/collections/deepseek-ai/deepseek-v4

Model Architecture and Specifications

DeepSeek-V4-ProDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。

🔹 Enhanced Agentic Capabilities: Open-source SOTA in Agentic Coding benchmarks.

🔹 增强的智能体能力： 在智能体编程基准测试中达到开源模型的 SOTA 水平。

🔹 Rich World Knowledge: Leads all current open models, trailing only Gemini-3.1-Pro.

🔹 丰富的世界知识： 领先所有现有开源模型，仅次于 Gemini-3.1-Pro。

🔹 World-Class Reasoning: Beats all current open models in Math/STEM/Coding, rivaling top closed-source models.

🔹 世界级推理能力： 在数学/STEM/编程领域超越所有现有开源模型，可与顶尖闭源模型匹敌。

DeepSeek-V4-FlashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。

🔹 Reasoning capabilities closely approach V4-Pro.

🔹 推理能力接近 V4-Pro 水平。

🔹 Performs on par with V4-Pro on simple Agent tasks.

🔹 在简单智能体任务上表现与 V4-Pro 持平。

🔹 Smaller parameter size, faster response times, and highly cost-effective API pricing.

🔹 更小的参数量、更快的响应速度以及极具性价比的 API 定价。

Model Comparison


Feature	DeepSeek-V4-ProDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。	DeepSeek-V4-FlashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。
Total Parameters	1.6T	284B
Active Parameters	49B	13B
Agentic Coding	Open-source SOTA	Strong, simple tasks on par with V4-Pro
World Knowledge	Leads all open models	High but below V4-Pro
Reasoning	World-class, rivals top closed-source	Closely approaches V4-Pro
Response Speed	Standard	Faster
API Pricing	Standard	Highly cost-effective
Target Use Case	Heavy-duty tasks, complex reasoning	Fast, efficient, cost-sensitive scenarios

Structural Innovation & Ultra-High Context Efficiency

🔹 Novel Attention: Token-wise compression + DSA (DeepSeek Sparse Attention)DeepSeek稀疏注意力机制，一种新型注意力结构，通过token级压缩和稀疏注意力实现高效长上下文处理，降低计算和内存成本。.

🔹 新型注意力机制： 逐 Token 压缩结合 DSA（DeepSeek 稀疏注意力）。

🔹 Peak Efficiency: World-leading long context with drastically reduced compute & memory costs.

🔹 巅峰效率： 世界领先的长上下文能力，同时大幅降低计算和内存成本。

🔹 1M Standard: 1M context100万token的上下文长度，DeepSeek-V4默认支持此长度，用于处理超长文本。 is now the default across all official DeepSeek services.

🔹 1M 上下文标准： 百万级上下文现已为所有官方 DeepSeek 服务的默认配置。

Key Innovation

Token-wise compression combined with DSA enables a novel attention mechanism. This drastically reduces computational and memory overhead while maintaining world-leading long-context performance. The result is that 1M context100万token的上下文长度，DeepSeek-V4默认支持此长度，用于处理超长文本。 is now the baseline across all official DeepSeek services, making advanced long-context applications practical and affordable.

Token-wise 压缩结合 DSA 实现了一种新型注意力机制。这在大幅降低计算和内存开销的同时，保持了世界领先的长上下文性能。其结果是百万级上下文现已为所有官方 DeepSeek 服务的默认配置，使得先进的长上下文应用变得实用且可负担。

Dedicated Optimizations for Agent Capabilities

🔹 DeepSeek-V4 is seamlessly integrated with leading AI agents like Claude Code, OpenClaw & OpenCode.

🔹 DeepSeek-V4 已与 Claude Code、OpenClaw 和 OpenCode 等领先的 AI 智能体实现无缝集成。

🔹 Already driving our in-house agentic coding at DeepSeek.

🔹 该模型已在 DeepSeek 内部驱动我们的智能体编程应用。

DeepSeek-V4 has been specifically optimized for agentic workflows. It integrates seamlessly with leading AI agents such as Claude Code, OpenClaw, and OpenCode. The model is already powering DeepSeek's own in-house agentic coding efforts.

DeepSeek-V4 针对智能体工作流进行了专门优化。它与 Claude Code、OpenClaw 和 OpenCode 等领先的 AI 智能体无缝集成。该模型已在 DeepSeek 内部驱动我们的智能体编程应用。

API Availability and Pricing

API is Available Today!

🔹 Keep base_url, just update model to deepseek-v4-proDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。 or deepseek-v4-flashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。.

🔹 保持 base_url 不变，只需将 model 参数更新为 deepseek-v4-proDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。或 deepseek-v4-flashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。。

🔹 Supports OpenAI ChatCompletions & Anthropic APIs.

🔹 支持 OpenAI ChatCompletions 和 Anthropic API。

🔹 Both models support 1M context100万token的上下文长度，DeepSeek-V4默认支持此长度，用于处理超长文本。 & dual modes (Thinking / Non-Thinking): https://api-docs.deepseek.com/guides/thinking_mode

🔹 两模型均支持百万级上下文和双模式（思考型/非思考型）：https://api-docs.deepseek.com/guides/thinking_mode

⚠️ Note: deepseek-chat & deepseek-reasoner will be fully retired and inaccessible after Jul 24th, 2026, 15:59 (UTC Time). (Currently routing to deepseek-v4-flashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。 non-thinking/thinking).

⚠️ 注意： deepseek-chat 和 deepseek-reasoner 将于 2026 年 7 月 24 日 15:59（UTC 时间）后全面退役并不可用。（当前已路由至 deepseek-v4-flashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。的非思考型/思考型模式）。

API Integration Details


Parameter	Details
Base URL	No change required
Model Name	`deepseek-v4-pro` or `deepseek-v4-flash`
API Compatibility	OpenAI ChatCompletions & Anthropic APIs
Context Support	1M tokens for both models
Dual Modes	Thinking / Non-Thinking for both models
Model Retirement	`deepseek-chat` & `deepseek-reasoner` → Retired after Jul 24, 2026, 15:59 UTC

API Pricing Table


Model	Input Price (per 1M tokens)	Output Price (per 1M tokens)	Cache Hit Price (per 1M tokens)
DeepSeek-V4-ProDeepSeek-V4的大模型版本，总参数量1.6T，激活参数49B，性能强大，在多个基准测试中领先开源模型。	$0.48	$1.92	$0.12
DeepSeek-V4-FlashDeepSeek-V4的轻量级版本，总参数量284B，激活参数13B，速度快、成本低，推理能力接近Pro版本。	$0.24	$0.96	$0.06

Final Remarks

🔹 Amid recent attention, a quick reminder: please rely only on our official accounts for DeepSeek news. Statements from other channels do not reflect our views.

🔹 鉴于近期受到的广泛关注，在此提醒：请仅以我们的官方账号发布的 DeepSeek 新闻为准。其他渠道的声明不代表我们的观点。

🔹 Thank you for your continued trust. We remain committed to longtermism, advancing steadily toward our ultimate goal of AGI.

🔹 感谢您一直以来的信任。我们坚守长期主义，稳步迈向我们的终极目标——通用人工智能 (AGI)。

DeepSeek remains dedicated to longtermism and the pursuit of AGI. We encourage the community to rely only on official channels for news and updates. Thank you for your continued trust and support.

DeepSeek 始终致力于长期主义和对通用人工智能的追求。我们鼓励社区仅信赖官方渠道获取新闻与更新。感谢您一直以来的信任与支持。

常见问题（FAQ）

DeepSeek-V4 Pro和Flash版本有什么区别？怎么选择？

Pro版本总参1.6T/激活49B，推理和知识领先，适合复杂任务；Flash版本284B/13B，速度更快、API成本更低，适合简单任务或成本敏感场景。

DeepSeek-V4支持多长的上下文？用了什么新技术？

支持100万token默认上下文，采用逐token压缩和DSA稀疏注意力，大幅降低计算和内存成本，实现高效长上下文处理。

旧版DeepSeek模型什么时候停止服务？

旧模型将于2026年7月退役，建议用户尽快迁移到V4 Preview系列模型，API现已可用。

AIAI Summary (BLUF)