耐久性のある実行のための新しいプログラミングモデル
Vercel は、AI エージェントやバックエンド処理における複雑なオーケストレーション不要を実現する「Durable Execution」モデルの一般提供を開始し、コードそのものがオーケストレーターとなる新パラダイムを提示した。
キーポイント
オーケストレーターの排除とコード中心の実行
従来の分散システムや専用オーケストレーションサービスに依存せず、アプリケーションコード自体が状態管理とステップ連携を行うことで、インフラの複雑さを劇的に削減する。
AI エージェントと長期実行ワークロードへの対応
AI SDK との深い統合により、状態維持や外部イベント処理が可能な無限長の耐久性あるエージェントや、支払い処理などの信頼性が求められるバックエンド処理をネイティブにサポートする。
3 つの基盤コンポーネントによる実現
実行履歴の単一真実源となる「Event Log」、各ステップを実行する「Fluid compute 上の関数」、および自動的なキューイングを行う「Vercel Queues」の組み合わせで構成される。
開発者体験とコスト効率の向上
TypeScript と Python(ベータ版)での実装が可能となり、失敗処理やリトライロジックを手動で構築する必要がなくなるため、プロトタイプから本番環境までのギャップを解消する。
コードベースでのオーケストレーションと堅牢性
ワークフロー SDK を使用することで、キュー管理やリトライなどの機能を実装コード内で直接定義でき、外部のオーケストレーションツールを不要にします。
デプロイごとの安定したバージョン管理
各ゲーム(ワークフロー実行)が特定のデプロイメントに紐づくため、新旧のコード間でクリーンなアップグレード境界を保ちつつ、既存のゲームは安定して継続できます。
セキュリティファーストな暗号化
データ転送時および保存時にデフォルトで全データを暗号化し、ワークフロー実行環境内でのみ復号化するため、追加の設定なしに堅牢なセキュリティを実現します。
影響分析・編集コメントを表示
影響分析
この発表は、AI エージェントや複雑なバックエンドワークロードの開発において、従来の分散システム構築の障壁を大幅に下げる画期的なアプローチです。開発者がインフラの詳細に意識を割かずにビジネスロジックに集中できるため、プロトタイプから本番環境への移行期間が短縮され、AI アプリケーションの普及速度に加速をもたらす可能性があります。
編集コメント
従来の「インフラを組んでからコードを書く」というパラダイムを逆転させ、コードそのものがインフラとなることで、AI エージェント開発のハードルを劇的に下げた点に大きな意義があります。
組み込み型: SDKは単一のランタイムに縛られておらず、コミュニティはすでにMongoDB、Redis、Turso、Jazz Cloud、Cloudflare用のアダプターを含む、より多くのWorldsを構築しています。Worldsを構築したい人を誰でもサポートし続けます。
Vercelの顧客がWorkflowsをどのように使用しているか
Muxが耐久性のあるビデオおよびAIパイプラインを実現
Muxは、複雑なビデオパイプライン全体でのAI推論の実行、再試行、オーケストレーションを処理するメディアインテリジェンスレイヤーをWorkflows上に構築しました。また、自社の@mux/ai SDK内に「use workflow」および「use step」ディレクティブを実装し、どの開発者もnpm install @mux/aiを実行すれば、通常のインポート関数として耐久性のあるマルチステップパイプラインを利用できるようにしました。
Durableが300万の中小企業向けに数百のAIエージェントを実行
Durableの最も重要なプロセスはウェブサイト作成です:Vercel Workflowsによってオーケストレーションされる数十の並列AIステップにより、30秒未満で完全なサイトを提供します。彼らの小さな開発チームは、セルフホスト型インフラを完全に撤去し、75ファイルにわたる160以上のディレクティブを使用してWorkflows上で書き直しました。
Floraが50以上の画像モデルにわたる創造的AIエージェントをオーケストレーション
Floraは、キュー、ステートマシン、別個のサービスなしで50以上の画像モデルをオーケストレーションするメディア生成パイプライン全体をVercel Workflows上で実行しています。彼らはクレジット返金にロールバックを、ユーザー定義のマルチステップパイプラインに再帰的ワークフローを、進捗状況のストリーミングにgetWritableを使用しています。彼らの顧客はジョブを開始し、ラップトップを閉じ、完了した結果に戻ってくることができます。
次は何か:Workflows 5とPythonサポート
Workflows 5
Workflows 4は、開発者体験とSDKモデルを正しくすることに焦点を当てました。Workflows 5は同じプログラミングモデルを維持しながら、パフォーマンスとランタイム効率により一層注力します。以下が私たちが準備しているものです:
複数の実行にわたる作業を調整するためのロックプリミティブを含むネイティブ並行性制御
グローバルにデプロイされたWorkflowsインフラストラクチャ
イベント履歴が増大するにつれて再生オーバーヘッドを削減するスナップショットベースのランタイム
より良いバンドリングとより強力なNext.js統合
私たちの目標は、Workflowsを採用するオーバーヘッドをますます小さくし、あらゆるプロジェクトにとって当然のデフォルトとなるようにすることです。workflow@betaをインストールしてWorkflows 5を早期に試し(GitHubでフィードバックを共有してください)。
Pythonサポート
Python SDKが現在ベータ版で、より広範なAIおよびバックエンドエコシステム全体でWorkflowsを利用可能にしています。以下は、この投稿の冒頭の例がPythonでどのように見えるかの簡単な紹介です:
始めましょう
耐久性、信頼性、可観測性、セキュリティ、ストリーミングは、数か月後に取り組む雑用ではなく、最初からあなたの製品の一部であるべきです。
Workflowsを使用すると、私たちが複雑さを引き受け、あなたはアプリをユニークにするものに集中できます。Vercel Workflowsについて詳しく学ぶか、Workflow SDKドキュメントを訪れて始めてください。
Vercel Workflowsは4月16日に一般提供されます。
続きを読む
原文を表示
The gap between prototypes and production-ready systems is huge. Code that's trivial to run locally falls apart the moment it needs to handle failures, restarts, and real traffic.
Framework defined infrastructure solved this for web applications. When you deploy, Vercel infers the right configuration from the app itself. Workflows extends that model to long-running systems. Instead of managing a separate codebase for orchestration, durable workflows are an extension of your application code.
Since launching in beta in October 2025, Workflows has processed over 100 million runs and over 500 million steps across more than 1,500 customers, with more than 200K npm downloads every week.
Today, Vercel Workflows is generally available.
Built for agents, backends, and long-running workloads
Workflows is built for any workload that doesn’t fit in a single request.
Agents: Deep integration with the AI SDK enables infinitely long running durable agents that can maintain state, tools, and handle external events or interruptions. AI SDK v7 is taking this further with WorkflowAgent.
Backends: Workflow SDK proved this programming model in TypeScript codebases, and we're bringing it to a new language. The Workflow Python SDK is now in beta.
Long-running workloads: Workflows can be used for any function that needs to execute reliably, including multi-step onboarding flows, payment processing, ETL pipelines, or any backend work that would otherwise require you to wire up your own queues and retry logic.
How it works
Shipping a reliable long-running process to production typically means splitting your code across queues, workers, status tables, retry logic, and monitoring. Dedicated orchestration services add yet another layer with long-lived background processes you run in Kubernetes, scale horizontally, and dedicate engineering time to keep healthy. They are distributed systems you pay for on top of your core application compute.
Workflows eliminates the orchestrator entirely. All coordination runs in your application code, not a separate service. The infrastructure is built on three components:
Event log: records every step input, output, stream chunk, sleep, hook, and error in a run. It is the single source of truth for execution state and history.
Your functions on Fluid compute: each step runs as its own function invocation on Fluid compute. The workflow library inside each function handles dequeueing, state loading, encryption, execution, and handoff to the next step.
Vercel Queues: each function enqueues the next step automatically. Queues run automatically on Vercel, in your own Postgres, or in-memory locally.
Because there is no separate orchestration service, you only pay for the compute your steps actually use when functions are running.
The programming model: your code is the orchestrator
Workflows lets you write long-running functions in TypeScript or Python using normal control flow and a small API surface.
In TypeScript you create a workflow with "use workflow", and isolate units of work with "use step". Workflows handles everything underneath: queues, retries, step isolation, observability, durable state, and streaming.
At first glance, this looks like one function calling another, and that is exactly the point. Each step gets isolation, retries, persistence, observability, and durable continuation automatically. The orchestration lives in the application code, not in a separate system.
A Next.js app with the Workflow SDK installed runs the same way locally as it does in production at scale, with the same code, real guarantees, and no separate orchestration tooling to configure.
Vercel Workflows in action: Guillermo's infinite chess game
One of the best examples from beta testing is Guillermo's infinite chess game. It continuously pits models against each other, feeding them the current board state, validating moves, rendering the game, and running matches indefinitely, turn after turn.
Each chess match is a workflow run, and when a game ends, the last step starts a new run. Infinity is modeled as recursion across runs.
Because every workflow run is pegged to a specific deployment, one game can safely finish on the version it started with while the next game begins on the latest deployment. That creates a clean upgrade boundary where each game remains stable within its version, and every new game picks up the latest improvements.
If the backend code powering the chess match crashes or encounters transient errors, the workflow run automatically retries it without causing the application to fail.
Why Workflows matters for agents
Workflows is purpose-built for the agentic era. It is the only security-first, durable SDK made for building agents and made for agents to build with.
Secure by default
By default, Vercel Workflows encrypts all data, including step inputs, outputs, and stream chunks, before they leave your deployment. Nothing is readable in transit or at rest outside your environment, and decryption only happens inside the deployment running the workflow.
Encryption is built in and free, not a security add-on you configure after the fact. This is possible because Workflows owns both orchestration and execution in the same environment, so encryption can happen automatically without a separate service or any extra infrastructure on your end.
When you need to inspect encrypted data through the dashboard or workflow CLI, explicit decryption is supported with a full audit trail.
For building agents
Agents need more than longer timeouts, they need durable execution, reliable orchestration, resumable streams, and enough headroom to move large payloads through long-running systems.
Durable agents
Workflow SDK and AI SDK share a deep integration that gives agents durable execution, tool calling, state management, and the ability to handle external events or interruptions gracefully. Tools can be implemented as workflow steps for automatic retries or as regular workflow-level logic that uses primitives like sleep and hooks to suspend and resume cleanly. Agents process tool calls iteratively until completion, surviving restarts and failures along the way.
AI SDK v7 takes this further with WorkflowAgent, a fully native implementation.
Durable streams
Durable streams persist agent output. getWritable() gives you a persistent stream that multiple clients can connect to, disconnect from, reconnect to later, and resume from any point. The workflow keeps running even if the user closes the browser. When they come back, the client reconnects and continues exactly where the stream left off, no Redis or custom pub/sub required.
In this example, a flight booking agent streams itinerary updates as it plans a trip and searches for flights:
The API route starts the workflow and returns the durable stream to the client. The run ID in the response header is what enables reconnection:
If the user closes the browser mid-search, the workflow keeps running. When they re-connect, or share the link with a different user who opens the session, WorkflowChatTransport resumes the stream from the last event the client received.
Hooks and sleep
Workflows can suspend without incurring any compute.
Hooks let a workflow wait for an external trigger to resume. Hooks are useful for building human-in-the-loop approval flows and integrating with third-party services.
Sleep lets a workflow suspend for any specified amount of time, from minutes to days or months. Sleep is useful for email drip campaigns and date-sensitive use cases.
Limits built for multimodal agents
Workflows supports 50 MB per step payload and up to 2 GB across an entire run, with generous event limits. That's plenty of headroom for agents passing images, video, and large context across long execution chains.
For agents to build with
Workflows is not just great for building agents. It is also designed for coding agents to use directly.
The programming model is agent-friendly
Workflows code is ordinary TypeScript. A workflow is a function, a step is a function, and because orchestration lives in the application code itself, a coding agent only has to reason about one system. There is no separate orchestration layer to configure and no worker fleet to manage.
Full observability from the CLI
Workflow SDK ships with a CLI that any agent can use for inspecting and debugging your workflow runs. If a human can inspect runs in the dashboard, an agent can inspect the same workflow state from the terminal.
This works locally with no config. For production deployments on Vercel, the CLI reuses your vercel CLI authentication with --backend vercel to make authenticated requests against the Vercel API. Agents can investigate state, inspect runs, and debug behavior without leaving the terminal.
Workflows ships with a skill
Agents can install the Workflows skill directly and use it to scaffold, debug, and manage workflows without hand-written product knowledge.
Run Workflows anywhere
Workflow SDK is open source and part of the same family as AI SDK and Chat SDK. The workflow npm package goes stable at GA with 200K+ weekly downloads and 75+ releases shipped during beta.
Worlds are the adapter system that makes Workflows portable. Each World provides the three components a workflow needs (an event log, compute, and a queue), backed by different infrastructure.
Managed: Vercel handles everything automatically. Deploy your app and Vercel Workflows runs on Fluid compute with Vercel Queues, zero-config E2E encryption, and built-in observability.
Self-hosted: Run Workflows on your own infrastructure. We maintain a Postgres reference implementation that real customers run in production, and the Local World ships built in for development.
Embedded: The SDK is not locked to one runtime, and the community is already building more Worlds, including adapters for MongoDB, Redis, Turso, Jazz Cloud, and Cloudflare. We'll continue to support anyone who wants to build Worlds.
How Vercel customers use Workflows
Mux powers durable video and AI pipelines
Mux built their media intelligence layer on Workflows, handling execution, retries, and orchestration for AI inference across complex video pipelines. They also shipped "use workflow" and "use step" directives inside their own @mux/ai SDK, so any developer can npm install @mux/ai and get a durable, multi-step pipeline as a normal imported function.
Durable runs hundreds of AI agents for 3 million small businesses
Durable's most critical path is website creation: dozens of parallel AI steps orchestrated by Vercel Workflows to deliver a complete site in under 30 seconds. Their small dev team ripped out their self-hosted infrastructure entirely and rewrote on Workflows with 160+ directives across 75 files.
Flora orchestrates creative AI agents across 50+ image models
Flora runs their entire media generation pipeline on Vercel Workflows, orchestrating 50+ image models with no queues, no state machines, and no separate service. They use rollbacks for credit refunds, recursive workflows for user-defined multi-step pipelines, and getWritable for progress streaming. Their customers kick off jobs, close their laptop, and come back to completed results.
What's next: Workflows 5 and Python support
Workflows 5
Workflows 4 focused on getting the developer experience and SDK model right. Workflows 5 keeps the same programming model while pushing harder on performance and runtime efficiency. Here’s what we’re cooking:
Native concurrency controls, including a lock primitive for coordinating work across multiple runs
Globally deployed Workflows infrastructure
A snapshot-based runtime to reduce replay overhead as event histories grow
Better bundling and stronger Next.js integration
Our goal is to make the overhead of opting into Workflows smaller and smaller until it is the obvious default for any project. Install workflow@beta to try Workflows 5 early (and share feedback on GitHub).
Python support
The Python SDK is now in beta, making Workflows available across the broader AI and backend ecosystem. Here's a quick taste of how the opening example in this post looks in Python:
Get started
Durability, reliability, observability, security, and streaming should be part of your product from the beginning, not chores you take on months later.
When you use Workflows, we take on the complexity so you can focus on what makes your app unique. Learn more about Vercel Workflows or visit the Workflow SDK docs to get started.
Vercel Workflows is generally available on April 16.
Read more
関連記事
Vercel 上で AI エージェント向け IDE を構築した Superset の取り組み
Superset は Vercel 上で、従来の開発ツールでは対応できない大規模な AI エージェント運用を可能にする統合開発環境(IDE)を構築しました。同社は週に最大 1,400 件のデプロイや、平均 30 秒のビルド時間を実現し、日次プレビューデプロイも約 600 件処理しています。
NVIDIA Nemotron 3 Ultra が長時間実行型エージェントの推論を高速化・効率化
NVIDIA は、長時間実行型エージェントが推論を行い、文脈を維持し、ツールを活用して効率的に動作するための新モデル「Nemotron 3 Ultra」を発表した。これにより、単発チャットボットから複雑なタスクをこなすエージェントへの進化が加速する。
マイクロソフトと OpenAI の決裂後、両社は激しく対立する準備を整えた
マイクロソフトは年次カンファレンスで、自社開発の推論モデルやセキュリティツールなど新たな AI 施策を発表し、OpenAI との決裂後に業界を主導する姿勢を示した。