分类：AI大模型

AI大模型专栏涵盖从GPT、DeepSeek到gemini、Agentic智能体的全方位研究。深度解析RAG架构优化、KV缓存内存瓶颈解决、JSON结构化数据提取及提示工程实践（如Prompt Refiner）。本专栏还关注软件工程师转型AI研发的实用路径及AI安全风险评估，为开发者提供从基础理论到生产级系统构建的完整知识体系。

共 464 篇

Grok-4.1深度解析：xAI 2025年情感智能与创造力重大升级

BLUF

Grok-4.1 is xAI's 2025 major upgrade focusing on emotional intelligence, creativity, and factual accuracy, achieving 64.78% user preference over previous versions with two configurations: reasoning (1483 Elo) and non-reasoning (1465 Elo). It features a 2M token context window, OpenAI-compatible API, and reduced hallucination rates for enhanced human-like interactions. (Grok-4.1是xAI在2025年发布的重要升级版本，专注于情感智能、创造力和事实准确性，用户偏好度比旧版本高出64.78%。提供推理和非推理两种配置，分别获得1483和1465 Elo评分，具备200万token上下文窗口，兼容OpenAI API接口，幻觉率显著降低，支持更人性化的交互场景。)

AI大模型2026/1/26

阅读全文 →

FinRobot.ai：开源金融AI Agent平台，用大语言模型重塑金融分析

BLUF

FinRobot.ai is an open-source financial AI Agent platform developed by the AI4Finance Foundation, leveraging large language models (LLMs) and a four-layer modular architecture to provide plug-and-play intelligent solutions for financial analysis, quantitative trading, and investment research. It addresses key industry pain points like data fragmentation and high professional barriers through financial Chain-of-Thought reasoning and specialized agents. (FinRobot.ai是由AI4Finance基金会开发的开源金融AI Agent平台，以大语言模型为核心，采用四层模块化架构，通过金融链式思维和专用代理，为金融分析、量化交易和投资研究提供可插拔的智能解决方案，解决数据碎片化和专业门槛高等行业痛点。)

AI大模型2026/1/25

阅读全文 →

开源金融AI平台FinRobot：2024超越FinGPT的智能体指南

BLUF

FinRobot是专为金融分析设计的开源AI智能体平台，通过集成专用大语言模型、金融思维链提示及多层架构，超越FinGPT，为综合金融应用提供全面解决方案。

AI大模型2026/1/25

阅读全文 →

UltraRAG：清华大学开发的零代码RAG框架，革新AI知识增强应用开发

BLUF

UltraRAG是由清华大学等团队开发的零代码RAG框架，提供WebUI界面、自动化知识库适配与模块化设计，适用于科研与业务场景。集成KBAlign、DDR等创新技术，优化多模型与多任务的检索与生成性能。

AI大模型2026/1/25

阅读全文 →

开源大模型工具链OpenBMB：2024年降低AI开发门槛指南

BLUF

OpenBMB开源项目构建大模型全流程工具链，覆盖数据处理、训练、微调、压缩与推理，显著降低百亿参数模型使用门槛。核心工具包括BMTrain高效训练、BMCook模型压缩、BMInf低成本推理、OpenPrompt提示学习及OpenDelta参数高效微调，推动大模型标准化与普及化。

AI大模型2026/1/25

阅读全文 →

UltraRAG：基于MCP架构的低代码可视化RAG开发框架

BLUF

UltraRAG是基于模型上下文协议（MCP）架构的低代码RAG开发框架，专注于可视化编排与可复现评估。它将检索、生成与评估等核心模块封装为独立MCP服务器，通过交互式界面与流水线构建器，实现透明、可重复的研发流程。

AI大模型2026/1/25

阅读全文 →

UltraRAG 2.0：基于MCP架构的开源框架，用YAML配置简化复杂RAG系统开发

BLUF

English Summary: UltraRAG 2.0 is an open-source framework based on Model Context Protocol (MCP) architecture that simplifies complex RAG system development through YAML configuration, enabling low-code implementation of multi-step reasoning, dynamic retrieval, and modular workflows. It addresses engineering bottlenecks in research and production RAG applications. 中文摘要翻译: UltraRAG 2.0是基于Model Context Protocol（MCP）架构的开源框架，通过YAML配置文件简化复杂RAG系统开发，实现低代码构建多轮推理、动态检索和模块化工作流。它解决了研究和生产环境中RAG应用的工程瓶颈问题。

AI大模型2026/1/25

阅读全文 →

AirLLM：4GB GPU上运行700亿参数大模型的开源框架

BLUF

AirLLM is an open-source framework that enables running 70B-parameter large language models on a single 4GB GPU through layer-wise offloading and memory optimization techniques, democratizing access to cutting-edge AI without traditional compression methods. (AirLLM是一个开源框架，通过分层卸载和内存优化技术，使700亿参数的大语言模型能够在单个4GB GPU上运行，无需传统压缩方法即可实现前沿AI的普及化访问。)

AI大模型2026/1/25

阅读全文 →