LangChain implements a streaming system to surface real-time updates. Streaming is crucial for enhancing the responsiveness of LLM-based applications. By displaying output progressively, even before a complete response is ready, streaming significantly improves the user experience (UX), particularly when dealing with LLM latency.

Overview

LangChain's streaming system lets you surface real-time feedback from agent runs in your application. What can you stream with LangChain?
  • Stream agent progress: get state updates after each agent step.
  • Stream LLM tokens: stream language model tokens as they are generated.
  • Stream custom updates: emit user-defined signals (e.g., "Fetched 10/100 records").
  • Stream multiple modes: choose from updates (agent progress), messages (LLM tokens + metadata), or custom (arbitrary user data).

Agent progress

To stream agent progress, use the [stream()](https://langchain-ai.github.io/langgraphjs/reference/classes/langgraph.CompiledStateGraph.html#stream) method with streamMode: "updates". This emits an event after each agent step. For example, if you have an agent that calls a tool once, you should see the following updates:
  • LLM node: AIMessage with tool call requests
  • Tool node: ToolMessage with the execution result
  • LLM node: final AI response
import z from "zod";
import { createAgent, tool } from "langchain";

const getWeather = tool(
    async ({ city }) => {
        return `The weather in ${city} is always sunny!`;
    },
    {
        name: "get_weather",
        description: "Get weather for a given city.",
        schema: z.object({
            city: z.string(),
        }),
    }
);

const agent = createAgent({
    model: "openai:gpt-5-nano",
    tools: [getWeather],
});

for await (const chunk of await agent.stream(
    { messages: [{ role: "user", content: "what is the weather in sf" }] },
    { streamMode: "updates" }
)) {
    const [step, content] = Object.entries(chunk)[0];
    console.log(`step: ${step}`);
    console.log(`content: ${JSON.stringify(content, null, 2)}`);
}
/**
 * step: model
 * content: {
 *   "messages": [
 *     {
 *       "kwargs": {
 *         // ...
 *         "tool_calls": [
 *           {
 *             "name": "get_weather",
 *             "args": {
 *               "city": "San Francisco"
 *             },
 *             "type": "tool_call",
 *             "id": "call_0qLS2Jp3MCmaKJ5MAYtr4jJd"
 *           }
 *         ],
 *         // ...
 *       }
 *     }
 *   ]
 * }
 * step: tools
 * content: {
 *   "messages": [
 *     {
 *       "kwargs": {
 *         "content": "The weather in San Francisco is always sunny!",
 *         "name": "get_weather",
 *         // ...
 *       }
 *     }
 *   ]
 * }
 * step: model
 * content: {
 *   "messages": [
 *     {
 *       "kwargs": {
 *         "content": "The latest update says: The weather in San Francisco is always sunny!\n\nIf you'd like real-time details (current temperature, humidity, wind, and today's forecast), I can pull the latest data for you. Want me to fetch that?",
 *         // ...
 *       }
 *     }
 *   ]
 * }
 */

LLM tokens

To stream tokens as they are generated by the LLM, use streamMode: "messages":
import z from "zod";
import { createAgent, tool } from "langchain";

const getWeather = tool(
    async ({ city }) => {
        return `The weather in ${city} is always sunny!`;
    },
    {
        name: "get_weather",
        description: "Get weather for a given city.",
        schema: z.object({
            city: z.string(),
        }),
    }
);

const agent = createAgent({
    model: "openai:gpt-4o-mini",
    tools: [getWeather],
});

for await (const [token, metadata] of await agent.stream(
    { messages: [{ role: "user", content: "what is the weather in sf" }] },
    { streamMode: "messages" }
)) {
    console.log(`node: ${metadata.langgraph_node}`);
    console.log(`content: ${JSON.stringify(token.contentBlocks, null, 2)}`);
}
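The stream yields token chunks paired with metadata that identifies the node emitting them. Below is a minimal sketch of accumulating those tokens into the final answer text; the chunk shapes are illustrative stand-ins for this sketch, not actual LangChain objects (real tokens are AIMessageChunk instances):

```typescript
// Illustrative stand-ins for the [token, metadata] pairs yielded by
// streamMode: "messages". We model just the text and the node name.
type TokenChunk = [{ text: string }, { langgraph_node: string }];

const chunks: TokenChunk[] = [
  [{ text: "The weather" }, { langgraph_node: "model" }],
  [{ text: " in sf" }, { langgraph_node: "model" }],
  [{ text: " is always sunny!" }, { langgraph_node: "model" }],
];

// Keep only tokens emitted by the model node and concatenate them in
// arrival order, appending each fragment to the visible answer.
let answer = "";
for (const [token, metadata] of chunks) {
  if (metadata.langgraph_node === "model") {
    answer += token.text;
  }
}
console.log(answer); // The weather in sf is always sunny!
```

Filtering on langgraph_node is how you keep tool-node output out of the text shown to the user while still rendering model tokens as they arrive.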

Custom updates

To surface updates from a tool as it executes, use the writer parameter from the config.
import z from "zod";
import { tool, createAgent } from "langchain";
import { LangGraphRunnableConfig } from "@langchain/langgraph";

const getWeather = tool(
    async (input, config: LangGraphRunnableConfig) => {
        // Stream any arbitrary data
        config.writer?.(`Looking up data for city: ${input.city}`);
        // ... fetch city data
        config.writer?.(`Acquired data for city: ${input.city}`);
        return `It's always sunny in ${input.city}!`;
    },
    {
        name: "get_weather",
        description: "Get weather for a given city.",
        schema: z.object({
            city: z.string().describe("The city to get weather for."),
        }),
    }
);

const agent = createAgent({
    model: "openai:gpt-4o-mini",
    tools: [getWeather],
});

for await (const chunk of await agent.stream(
    { messages: [{ role: "user", content: "what is the weather in sf" }] },
    { streamMode: "custom" }
)) {
    console.log(chunk);
}
Output
Looking up data for city: San Francisco
Acquired data for city: San Francisco
If you add the writer parameter to your tool, you won't be able to invoke the tool outside of a LangGraph execution context without providing a writer function.
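Because the tool reads the writer off its config, one workaround when exercising writer-dependent logic outside the graph is to supply a stub writer yourself. The collecting writer below is an illustrative stand-in, not a LangChain API:

```typescript
// A stand-in writer that records custom updates instead of streaming them.
// This mirrors the `config.writer?.(...)` calls inside the tool body above.
const updates: string[] = [];
const writer = (chunk: string) => updates.push(chunk);

// Simulate the tool body's use of config.writer with a hand-built config.
const config: { writer?: (chunk: string) => void } = { writer };
config.writer?.("Looking up data for city: San Francisco");
config.writer?.("Acquired data for city: San Francisco");

// Optional chaining also makes the calls no-ops when no writer is present.
const bare: { writer?: (chunk: string) => void } = {};
bare.writer?.("never recorded");

console.log(updates.length); // 2
```

This is also why the tool bodies above guard every call with `?.`: the same tool then degrades gracefully in contexts where no writer is supplied.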

Stream multiple modes

You can specify multiple streaming modes by passing an array as streamMode: streamMode: ["updates", "messages", "custom"]
import z from "zod";
import { tool, createAgent } from "langchain";
import { LangGraphRunnableConfig } from "@langchain/langgraph";

const getWeather = tool(
    async (input, config: LangGraphRunnableConfig) => {
        // Stream any arbitrary data
        config.writer?.(`Looking up data for city: ${input.city}`);
        // ... fetch city data
        config.writer?.(`Acquired data for city: ${input.city}`);
        return `It's always sunny in ${input.city}!`;
    },
    {
        name: "get_weather",
        description: "Get weather for a given city.",
        schema: z.object({
            city: z.string().describe("The city to get weather for."),
        }),
    }
);

const agent = createAgent({
    model: "openai:gpt-4o-mini",
    tools: [getWeather],
});

for await (const [streamMode, chunk] of await agent.stream(
    { messages: [{ role: "user", content: "what is the weather in sf" }] },
    { streamMode: ["updates", "messages", "custom"] }
)) {
    console.log(`${streamMode}: ${JSON.stringify(chunk, null, 2)}`);
}
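When streaming multiple modes, each yielded item is a [streamMode, chunk] pair, so a consumer typically dispatches on the mode tag. A minimal sketch with simplified placeholder payloads (not real LangChain chunk shapes):

```typescript
// Illustrative [mode, chunk] pairs like those yielded when streamMode is
// an array. The chunk payloads here are simplified placeholders.
type StreamItem = ["updates" | "messages" | "custom", unknown];

const items: StreamItem[] = [
  ["custom", "Looking up data for city: San Francisco"],
  ["updates", { tools: { messages: [] } }],
  ["custom", "Acquired data for city: San Francisco"],
];

// Route each chunk to the appropriate handler based on its mode tag.
const customUpdates: string[] = [];
for (const [mode, chunk] of items) {
  switch (mode) {
    case "custom":
      customUpdates.push(chunk as string); // surface tool progress messages
      break;
    case "updates":
      // e.g., refresh progress indicators from agent state
      break;
    case "messages":
      // e.g., append token text to the visible answer
      break;
  }
}
console.log(customUpdates.length); // 2
```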

Disable streaming

In some applications, you may want to disable the streaming of individual tokens for a given model. This is useful in multi-agent systems to control which agents stream their output. See the models guide to learn how to disable streaming.