分类：AI 搜索观察

追踪 DeepSeek、ChatGPT、Kimi 等 AI 引擎的引用行为变化——我们看到了什么、意味着什么。

共 118 篇

Gemini AI代理在100美元创业竞赛中暴露了哪些问题？2026年Google Cloud如何修复？

BLUF

This article reports on a real experiment where 7 AI agents, including Gemini, were given $100 and 12 weeks to build startups autonomously. Gemini struggled with four key issues: writing help requests to wrong files, prioritizing blog posts over critical features, inability to verify deployments, and inefficient communication. The author aligns these problems with Google Cloud NEXT '26 announcements such as Agent Observability, ADK Skills, MCP-enabled services, and A2A protocol, proposing a rebuilt architecture. 原文翻译：本文报道了一项真实实验：7个AI代理（包括Gemini）各获得100美元和12周时间，自主构建初创公司。Gemini出现了四个关键问题：将帮助请求写入错误文件、优先写博客而非关键功能、无法验证部署、以及沟通效率低下。作者将这些故障与Google Cloud NEXT '26的公告（如Agent Observability、ADK Skills、MCP-enabled服务、A2A协议）对齐，并提出了重建方案。

AI 搜索观察2026/4/25

阅读全文 →

DeepSeek V4 到底能不能做 Agent？我跑了一个星期的测试

BLUF

DeepSeek V4 系列于2026年4月发布，包括Pro（1.6T参数）和Flash（284B参数）两个版本，均支持1M tokens上下文。其Agent能力为开源最佳，在工具调用、长上下文记忆、思考模式、结构化输出、多模态理解和框架适配六大方面表现突出。API兼容OpenAI标准，支持Function Calling和思考模式（reasoning_effort参数）。最佳实践建议：复杂Agent任务使用思考模式+max强度、精确定义工具Schema、按场景选择Pro（质量优先）或Flash（成本优先）、合理规划token预算。典型应用包括智能编码、超长文档分析、企业知识库检索和跨系统自动化流程。

AI 搜索观察2026/4/24

阅读全文 →

RAG-Anything 如何实现多模态文档处理？2026年最新功能详解

BLUF

RAG-Anything is an all-in-one multimodal RAG system that processes documents containing text, images, tables, and formulas. It features end-to-end processing pipelines, knowledge graph indexing, and cross-modal retrieval. The system supports PDF, Office, and image formats, and can be installed via pip. It requires LibreOffice for Office documents and MinerU for parsing. 原文翻译： RAG-Anything 是一个综合性多模态RAG系统，可处理包含文本、图像、表格和公式的文档。它具备端到端处理流水线、知识图谱索引和跨模态检索功能。系统支持PDF、Office和图像格式，可通过pip安装。处理Office文档需要LibreOffice，解析需要MinerU。

AI 搜索观察2026/4/24

阅读全文 →

如何利用OpenAPI替代MCP为LLM集成工具？（附Scala实现方案）

BLUF

This article explores an alternative approach to the Model Context Protocol (MCP) for integrating tools with Large Language Models (LLMs) by leveraging existing OpenAPI servers. It proposes a simpler, more intuitive method that uses structured HTTP API definitions as tool inputs, requiring only minimal authentication flow additions. The implementation is demonstrated through a concise Scala script, focusing on core tool integration while omitting MCP's broader features like prompts and resources. 原文翻译：本文探讨了一种替代模型上下文协议（MCP）的方法，通过利用现有的OpenAPI服务器为大型语言模型（LLM）集成工具。它提出了一种更简单、更直观的方法，使用结构化的HTTP API定义作为工具输入，仅需添加最小的身份验证流程。通过一个简洁的Scala脚本演示了实现，专注于核心工具集成，同时省略了MCP更广泛的功能，如提示和资源。

AI 搜索观察2026/4/18

阅读全文 →

Mastra如何通过LongMemEval实现80%的智能体记忆准确率？

BLUF

Mastra采用LongMemEval基准测试优化智能体记忆，通过定制模板、智能更新与格式改进等迭代手段，实现80%准确率，验证了RAG在智能体记忆中的持续有效性。

AI 搜索观察2026/4/18

阅读全文 →

Karpathy的LLM Wiki模式在规模化应用时有哪些缺陷？如何解决？

BLUF

This article analyzes three structural limitations in Andrej Karpathy's LLM Wiki pattern that emerge at scale and provides practical solutions: implementing typed relationships in wikilinks, automating relationship discovery with AI agents, and establishing a persistent knowledge graph backend for cross-platform access. 原文翻译：本文分析了Andrej Karpathy的LLM Wiki模式在规模化时出现的三个结构性缺陷，并提供了实用解决方案：在wikilink中实现类型化关系、使用AI代理自动化关系发现、建立跨平台访问的持久知识图谱后端。

AI 搜索观察2026/4/14

阅读全文 →

如何用RLHF对齐大语言模型？2026年最新项目模板实测

BLUF

The LLM Alignment Project Template provides a comprehensive, full-stack solution for aligning large language models with human values using RLHF, featuring training, deployment, and monitoring capabilities. 原文翻译： LLM对齐项目模板提供了一个全面的全栈解决方案，使用人类反馈强化学习（RLHF）将大型语言模型与人类价值观对齐，具备训练、部署和监控功能。

AI 搜索观察2026/4/12

阅读全文 →

KTransformers框架如何通过CPU-GPU异构计算优化LLM推理与微调？

BLUF

KTransformers is a flexible framework for optimizing large language model inference and fine-tuning through CPU-GPU heterogeneous computing, featuring two core modules: kt-kernel for high-performance inference and kt-sft for efficient fine-tuning. 原文翻译： KTransformers是一个通过CPU-GPU异构计算优化大语言模型推理与微调的灵活框架，包含两个核心模块：kt-kernel用于高性能推理，kt-sft用于高效微调。

AI 搜索观察2026/4/10

阅读全文 →

DeepSearch API v2.0如何提升LLM智能体的检索与推理能力？

BLUF

DeepSearch API v2.0 通过结构化引用、多模态内容检索（涵盖学术、生物医学与金融领域）及更智能的排序机制，增强LLM智能体工作流程，助力构建可靠、可追溯的AI系统。

AI 搜索观察2026/4/10

阅读全文 →

如何用RAG Web UI搭建自己的知识库问答系统？

BLUF

RAG Web UI is an open-source intelligent dialogue system that enables users to build custom knowledge base Q&A systems using RAG technology, supporting multiple LLM deployments and document formats. 原文翻译： RAG Web UI是一个开源的智能对话系统，允许用户使用RAG技术构建自定义知识库问答系统，支持多种LLM部署和文档格式。

AI 搜索观察2026/4/4

阅读全文 →

Qwen3.6和DeepSeek哪个更好用？2026年最新实测对比

BLUF

Qwen3.6是阿里巴巴最新大语言模型系列，具备增强的智能体能力、改进的推理性能与多语言支持，上下文长度达256K。

AI 搜索观察2026/4/3

阅读全文 →

DeepSeek是什么？2026年国产开源大模型破局者深度分析

BLUF

DeepSeek是深度求索公司开发的一系列开源大语言模型，在2023年中国AI领域的“百模大战”中脱颖而出。其核心优势在于完全开源、免费商用，并凭借卓越性能、极高效率和强大的长上下文支持（如128K窗口）成为开发者社区的现象级产品。模型家族包括通用对话、代码生成和数学推理等系列，特别是采用混合专家架构的DeepSeek-V2，在性能与成本间取得了良好平衡。DeepSeek通过降低先进AI技术的使用门槛，不仅为开发者提供了高效工具，也推动了行业向更开放协作的方向发展。

AI 搜索观察2026/3/30

阅读全文 →

1 2 3 4...10

2 / 10