
Xinference是什么?2026年开源AI模型部署与推理平台详解
Xinference is an open-source platform that simplifies the deployment and integration of various AI models, including large language models (LLMs), embedding models, and multimodal models, in both cloud and local environments. It supports heterogeneous hardware acceleration through GGML, offers multiple interfaces (RESTful API, RPC, CLI, Web UI), and enables distributed computing for efficient resource utilization.
原文翻译:
Xinference是一个开源平台,用于简化各种AI模型(包括大语言模型、嵌入模型和多模态模型)在云端或本地环境中的部署和集成。它通过GGML支持异构硬件加速,提供多种接口(RESTful API、RPC、命令行、Web UI),并支持分布式计算以实现高效的资源利用。
2026/3/2
阅读全文 →









