GEO

Tag: DeepSeek

View all articles tagged DeepSeek.

89
What Is DeepSeek's Latest Model? DeepSeek MODEL1 Revealed
🔥 Trending


BLUF: DeepSeek's codebase accidentally revealed a new architecture, MODEL1. Compared with the current V3.2, it introduces changes to KV caching, sparse computation, and FP8 decoding that significantly improve memory efficiency and inference speed, pointing to the direction of its next-generation large model.
DeepSeek · 2026/1/21
Read more →
DeepSeek Model Architecture Explained: A 2024 Guide to the AI Reasoning Breakthrough Driven by Pure Reinforcement Learning


BLUF: DeepSeek uses a pure reinforcement learning framework that requires no human-annotated data, letting large models develop complex reasoning capabilities on their own and achieve breakthrough performance in STEM fields such as mathematics and programming.
DeepSeek · 2026/1/21
Read more →
The DeepSeek and OpenAI Training Data Dispute: AI Industry Ethics and Competitive Fairness Put to the Test


BLUF: Microsoft and OpenAI are investigating whether DeepSeek improperly used OpenAI's model outputs to train its R1 LLM, raising questions about data ethics and competitive fairness in AI development.
AI Large Models · 2026/1/21
Read more →
DeepSeek in Depth: Technical Architecture and Innovations of China's Leading Open-Source Large AI Model


BLUF: DeepSeek is China's leading open-source large language model series. Since 2023, it has steadily released models with outstanding performance in reasoning, coding, mathematics, and Chinese language understanding, challenging the industry landscape with a superior performance-to-cost ratio.
DeepSeek · 2026/1/20
Read more →