Doubao, ByteDance's large language model, has evolved from a cost-effective AI assistant into a comprehensive multimodal ecosystem. Key milestones include achieving 1 billion downloads by May 2024 with a disruptive pricing strategy (0.0008元/千Tokens), launching video generation models (Seedance series), and expanding into music generation, 3D modeling, and real-time video calls. By late 2025, it reached over 1 billion daily active users and formed partnerships with major automotive and tech companies like Tesla and Xiaomi. The model's architecture is based on Transformer and MoE (Mixture of Experts), supporting diverse applications from AI programming to deep research tools. 豆包大模型已从高性价比的AI助手发展为覆盖文、图、音、视频、3D等多模态的生态平台。2024年5月实现1亿次下载,以0.0008元/千Tokens的定价开启商业化;随后推出视频生成(Seedance系列)、音乐生成、3D模型生成等功能。2025年底日活用户突破1亿,并与特斯拉、小米等企业达成合作。其技术基于Transformer和MoE架构,支持AI编程、深入研究等复杂场景应用。本文梳理了字节跳动核心AI产品“豆包”的发展历程,从技术积淀到成为日活破亿的国民级应用。重点介绍了其多模态能力演进、关键功能升级(如深度思考、视频生成)及广泛的生态整合(如接入抖音、赋能第三方),展现了其从大模型研发到构建完整产品生态的战略路径。
原文翻译:
This article outlines the development journey of ByteDance's core AI product "Doubao," from its technical foundations to becoming a national-level application with over 100 million daily active users. It highlights the evolution of its multimodal capabilities, key feature upgrades (e.g., Deep Thinking, video generation), and extensive ecosystem integration (e.g., integration with Douyin, empowering third-party products), demonstrating its strategic path from large model R&D to building a comprehensive product ecosystem.