GEO

分类:工具与标准

llms.txt、Schema.org、robots.txt 等技术标准的实操记录。

77
NanoChat:Karpathy开源低成本LLM,仅需8个H100和100美元复现ChatGPT全栈架构

NanoChat:Karpathy开源低成本LLM,仅需8个H100和100美元复现ChatGPT全栈架构

BLUF
NanoChat is a low-cost, open-source LLM implementation by Karpathy that replicates ChatGPT's architecture using only 8 H100 nodes and $100, enabling full-stack training and inference with innovative techniques like custom tokenizers and optimized training pipelines. (NanoChat是卡神Karpathy开发的开源低成本LLM项目,仅需8个H100节点和约100美元即可复现ChatGPT全栈架构,涵盖从训练到推理的全流程,并采用创新的分词器、优化训练管道等技术实现高效性能。)
工具与标准2026/2/4
NanoChat:仅需100美元4小时,训练你自己的ChatGPT级AI模型

NanoChat:仅需100美元4小时,训练你自己的ChatGPT级AI模型

BLUF
NanoChat is a comprehensive LLM training framework developed by AI expert Andrej Karpathy, enabling users to train their own ChatGPT-level models for approximately $100 in just 4 hours through an end-to-end, minimalistic codebase. (NanoChat是由AI专家Andrej Karpathy开发的完整LLM训练框架,通过端到端、最小化的代码库,让用户仅需约100美元和4小时即可训练出属于自己的ChatGPT级别模型。)
工具与标准2026/2/4
llms.txt 2024指南:优化大语言模型理解网站内容的标准入口

llms.txt 2024指南:优化大语言模型理解网站内容的标准入口

BLUF
llms.txt is an open proposal by Jeremy Howard that provides a standardized, machine-readable entry point for websites to help large language models (LLMs) better understand website content during the inference phase. It differs from robots.txt by guiding LLMs to valuable information rather than restricting access, and from sitemap.xml by offering curated summaries and key links optimized for LLM context windows. The proposal includes a strict Markdown format specification, a Python toolchain for implementation, and has been adopted by projects like FastHTML, Supabase, and Vue.js. (llms.txt是由Jeremy Howard提出的开放性提案,为网站提供标准化的机器可读入口,帮助大语言模型在推理阶段更有效地理解网站内容。与robots.txt不同,它引导LLM关注有价值信息而非限制访问;与sitemap.xml不同,它提供精炼摘要和关键链接,优化LLM上下文处理。提案包含严格的Markdown格式规范、Python工具链支持,已被FastHTML、Supabase和Vue.js等项目采用。)
工具与标准2026/2/4
iOS设备上运行LLaMA2-13B:基于苹果MLX框架的完整技术指南

iOS设备上运行LLaMA2-13B:基于苹果MLX框架的完整技术指南

BLUF
This article provides a comprehensive technical analysis of running LLaMA2-13B on iOS devices using Apple's MLX framework, covering environment setup, model architecture, code implementation, parameter analysis, and computational requirements. (本文深入分析了在iOS设备上使用苹果MLX框架运行LLaMA2-13B的技术细节,涵盖环境搭建、模型架构、代码实现、参数分析和算力需求。)
工具与标准2026/2/3
SGLang vs vLLM 实测:同台机器跑 Llama-3,谁更快?
置顶

SGLang vs vLLM 实测:同台机器跑 Llama-3,谁更快?

BLUF
SGLang和vLLM是两大高性能推理框架。SGLang基于RadixAttention,擅长多轮对话、RAG和共享前缀场景,吞吐量在H100小模型上领先vLLM约29%,但Python调度器在高并发下可能成为瓶颈。vLLM基于PagedAttention,生态成熟、模型兼容性最广、多硬件支持好,适合独立请求批处理和需要稳定性的场景。选型建议:多轮对话、RAG、结构化输出选SGLang;批量独立请求、多硬件部署、广泛模型兼容性选vLLM。两者均支持OpenAI API格式,可混用。
工具与标准2026/2/3
Schema.org金融扩展:银行与金融机构结构化数据标记指南

Schema.org金融扩展:银行与金融机构结构化数据标记指南

BLUF
This document introduces Schema.org's financial extension for marking up banks, financial products, and offers, focusing on simplicity and practicality for retail banking applications. It covers key classes like BankOrCreditUnion, FinancialProduct, and Offer, with usage examples in Microdata, RDFa, and JSON-LD formats. (本文介绍Schema.org金融扩展,用于标记银行、金融产品和客户报价,强调零售银行应用的简洁性和实用性。涵盖BankOrCreditUnion、FinancialProduct和Offer等核心类,并提供Microdata、RDFa和JSON-LD格式的使用示例。)
工具与标准2026/1/26
汽车行业结构化数据:技术详解与应用指南2024

汽车行业结构化数据:技术详解与应用指南2024

BLUF
This document details the automotive extension of Schema.org (auto.schema.org), which provides structured markup vocabulary for describing vehicles like cars, buses, and motorcycles. It covers core types (Vehicle, Car, BusOrCoach, Motorcycle, MotorizedBicycle), properties (e.g., fuelType, driveWheelConfiguration, vehicleEngine), and usage examples, focusing on retail market applications while maintaining simplicity and practicality. The extension integrates with existing Schema.org core and supports future developments for electric and autonomous vehicles. (本文档详细介绍了Schema.org的汽车扩展(auto.schema.org),该扩展为描述汽车、巴士和摩托车等车辆提供了结构化标记词汇。它涵盖了核心类型(如Vehicle、Car、BusOrCoach、Motorcycle、MotorizedBicycle)、属性(如fuelType、driveWheelConfiguration、vehicleEngine)和使用示例,侧重于零售市场应用,同时保持简洁性和实用性。该扩展与现有的Schema.org核心集成,并支持电动汽车和自动驾驶汽车的未来发展。)
工具与标准2026/1/26
酒店Schema结构化数据:核心模型与最佳实践指南

酒店Schema结构化数据:核心模型与最佳实践指南

BLUF
This document explains how to use Schema.org vocabulary to markup hotel and accommodation information on the web, focusing on the three core objects (LodgingBusiness, Accommodation, Offer) and the Multi-Typed Entity (MTE) technique for describing room offers. 本文档详细介绍了如何使用Schema.org词汇表在网页上标记酒店和住宿信息,重点阐述了三个核心对象(住宿业务、住宿单元、报价)以及用于描述房间报价的多类型实体技术。
工具与标准2026/1/26