
How Does RAG-Anything Handle Multimodal Document Processing? A Detailed Look at Its Latest Features in 2026
BLUF
RAG-Anything is an all-in-one multimodal RAG system that processes documents containing text, images, tables, and formulas. It features end-to-end processing pipelines, knowledge graph indexing, and cross-modal retrieval. The system supports PDF, Office, and image formats, and can be installed via pip. It requires LibreOffice for Office documents and MinerU for parsing.
AI 搜索观察, 2026/4/24
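The format and dependency notes above can be illustrated with a small pre-flight check. This is a minimal sketch using only the Python standard library, not RAG-Anything's own API: the function names (`classify`, `libreoffice_available`, `can_process`) and the extension lists are assumptions made for illustration, and the check only looks for a LibreOffice executable on the PATH.

```python
import shutil
from pathlib import Path

# Assumed extension groups for the three format families named above
# (PDF, Office, images). The real system may accept a different set.
OFFICE_EXTS = {".doc", ".docx", ".ppt", ".pptx", ".xls", ".xlsx"}
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".bmp", ".tiff", ".gif", ".webp"}


def classify(path):
    """Map a file path to 'pdf', 'office', 'image', or 'unsupported'."""
    ext = Path(path).suffix.lower()
    if ext == ".pdf":
        return "pdf"
    if ext in OFFICE_EXTS:
        return "office"
    if ext in IMAGE_EXTS:
        return "image"
    return "unsupported"


def libreoffice_available():
    """Return True if a LibreOffice executable is found on the PATH."""
    return any(shutil.which(name) for name in ("soffice", "libreoffice"))


def can_process(path, has_libreoffice=None):
    """Hypothetical pre-flight check: Office files need LibreOffice."""
    kind = classify(path)
    if kind == "unsupported":
        return False
    if kind == "office":
        if has_libreoffice is None:
            has_libreoffice = libreoffice_available()
        return has_libreoffice
    return True


if __name__ == "__main__":
    for name in ("paper.pdf", "slides.pptx", "figure.png", "notes.xyz"):
        print(name, classify(name), can_process(name))
```

Running such a check before ingestion makes a missing LibreOffice installation show up as a clear skip rather than a failure partway through parsing.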







