InfoQ·2026年3月24日 21:00·約8分

Revenium、AIエージェントの真のコストを可視化するツールレジストリを発表

#AIエージェント #コスト管理 #エンタープライズAI #運用ツール #Revenium

TL;DR

Reveniumは、企業がAIエージェントの実際のコストを完全に把握できるように設計された新機能「Tool Registry」の一般提供を開始した。

AI深層分析2026年3月24日 21:40

注目/ 5段階

深度40%

キーポイント

製品発表

Reveniumが「Tool Registry」という新機能の一般提供を開始した。

目的

この機能は、企業がAIエージェントの実際のコストを完全に、エンドツーエンドで把握することを目的としている。

対象

主に企業（エンタープライズ）を対象とした製品である。

影響分析・編集コメントを表示

影響分析

AIエージェントの本格的な企業導入が進む中、その運用コストの透明性と管理が新たな課題として浮上している。この製品は、AI投資のROIを明確化し、より責任あるAI導入を促進する可能性がある。

編集コメント

AIの実用化が進むにつれ、コスト管理という現実的な課題に対処するツールが求められていることを示すニュース。企業のAI導入における次のステップを象徴している。

Revenium は、企業が AI エージェントの実際の費用を包括的かつエンドツーエンドで把握できるよう設計された新機能「Tool Registry」の一般提供を開始しました。従来のトークンベースの追跡を超え、このプラットフォームは API 利用料、サードパーティサービス、さらには人間の介入に至るまで、すべての経費をそれをトリガーした特定の AI 意思決定に紐付けます。同社はこれを、エージェント駆動型の支出に関する初のフルスタックビューであると主張しています。

多くの組織が大規模言語モデル（LLM）のトークン使用量を厳密に監視している一方で、Revenium はトークンは AI エージェントを実行する真のコストのごく一部に過ぎないと指摘しています。実際のワークフローでは、エージェントは信用情報機関、本人確認プラットフォーム、不正検出ツール、データプロバイダーなどの外部サービスを頻繁に呼び出し、それぞれがトランザクションベースの料金を発生させます。これらの費用は、モデル自体のコストよりも大幅に高いことがよくあります。

典型的な例として、不均衡を示すものがあります。ローン発行ワークフローにおいて、トークン使用料はわずか 0.30 ドル程度で済む場合があります。しかし、同じワークフローでは、信用報告書が 35 ドルから 75 ドル、本人確認が 2 ドルから 5 ドル、不正チェックが 1 ドルから 3 ドル、銀行口座検証が最大 1 ドルをそれぞれ発生させる可能性があります。合計すると、ワークフローの総コストは 50 ドルから 85 ドルの範囲となり、トークン使用料が全体支出に占める割合は 1% 未満となります。これらの外部コストは通常、信用機関や法務データプラットフォーム、調査ツールなどのプロバイダーからの別々の請求書に含まれており、特定の AI 駆動アクションへの支出を遡って追跡することはほぼ不可能です。

Revenium の Tool Registry は、この可視性のギャップを埋めるために設計されています。本プラットフォームでは、外部 API、SaaS プラットフォーム、内部計算サービス、人的レビュー手順など、AI ワークフローに関与するあらゆるコスト源を登録・追跡することが可能です。各呼び出しは、発生源となったエージェント、ワークフロー、トランザクションのトレース、さらに関与したエンド顧客にマッピングされます。これにより、AI エージェントが支出した 1 ドルごとにビジネス成果と直接結びつけることができる、統一された記録システムが構築されます。

Revenium の CEO 兼共同創設者であるジョン・ロウェル氏によると、コストの透明性不足は、AI 投資の評価を行う経営層にとって繰り返される課題となっています。氏は、組織が AI が真の財務価値をもたらしているかどうかを判断することに苦労する理由として、コストが複数のシステムに断片的に分散している一方で、成果は別の場所で追跡されていることを指摘しています。支出をエージェントの意思決定と直接結びつけることで、ツールレジストリ（Tool Registry）は、AI ワークフローが投資対効果を生み出しているかどうかという明確な回答を提供することを目指しています。

今回のリリースのタイミングは、業界全体の傾向を反映しています。ガートナーは、2026 年末までにエンタープライズアプリケーションの 40% にタスク固有の AI エージェント（AI agents）が組み込まれると予測しており、これは 2025 年の 5% 未満から急増するものです。一方、フォーレスターは、ROI に関する不確実性により、組織が計画された AI 支出の 4 分の 1 を 2027 年に先送りすると予測しています。この躊躇の主な要因は、AI ドライブ型のワークフローが実際に消費した分以上の価値を生み出しているかどうかを測定できない点にあります。

このプラットフォームの注目すべき側面の一つは、人間をループに組み込んだコスト（human-in-the-loop costs）を含んでいる点です。多くの企業ワークフローにおいて、AI エージェントはまだ検証、コンプライアンス対応、または例外処理のために人間のレビューに依存しています。ツールレジストリでは、これらの人間の介入を、同じ実行トレース内の測定可能なコストイベントとして扱い、組織が自動化が時間とともに人間の努力にどのような影響を与えるかを定量化できるようにします。例えば、数ヶ月かけて人間のレビューの割合を 35% から 12% に削減するワークフローは、明確なコスト削減を示すことができますが、人間と機械の両方の貢献が同じシステム内で追跡されている場合にのみ可能です。

現在、AI コストを測定する既存のツールの多くは、モデル中心（model-centric）または観測性優先（observability-first）のアプローチを採用しており、これは Revenium のアプローチとは根本的に異なります。

Langfuse、LangSmith、Helicone などのプラットフォームは、主に LLM（大規模言語モデル）の使用状況の追跡、エージェントワークフローのトレース、トークン消費量の監視に焦点を当てています。これらは、プロンプト、レスポンス、レイテンシ、リクエストごとのコストについて詳細な可視性を提供し、ユーザー、機能、セッションごとにこれを細分化することもあります。これらのツールの多くは、テレメトリデータを取得するためにプロキシベース（proxy-based）または SDK ベースのアプローチを使用しており、出力品質を評価するためのコストダッシュボード、アラート、評価フレームワークを提供できます。しかし、それらの「コスト」の定義は依然として LLM の相互作用やインフラレベルの指標に大きく依存しており、モデル呼び出しの後に何が起こるかについての可視性は限定的です。

さらに高度なプラットフォーム、例えば Arize AI や Datadog LLM Observability は、パフォーマンス監視、トレーシング、評価を組み合わせることでこれを拡張し、企業に対してモデルの挙動とシステム全体の健全性に関する統一されたビューを提供します。これらのツールは多段階のエージェントワークフローを追跡し、エンドツーエンドのテレメトリを提供できます。しかし、依然としてコストは外部サービスや成果に紐付いた完全な帰属業務経費というよりも、システム使用（トークン数、計算リソース、リクエスト数）に伴う副産物として扱われる傾向があります。

Revenium のアプローチは、「AI 観測」から「AI 財務帰属」へとシフトすることで独自性を示しています。モデル呼び出しのコストがいくらなのかを問うのではなく、外部 API、SaaS プラットフォーム、人的介入を含む実質的なエージェントの意思決定全体のコストを問います。現在のほとんどのツールは、この層をネイティブに捕捉していません。一部の観測プラットフォームではエージェントワークフロー内のツール使用を追跡できますが、通常、サードパーティの請求（例えば API 取引ごとの手数料や人的レビューコストなど）を単一の意思決定や顧客レベルの結果に正規化・帰属させることは行われません。

著者について

クレイグ・リーシ

クレイグ・リーシは多才な人物ですが、その才能をどう活用すべきかという感覚に欠けています。彼なら世界を変えることもできるでしょうが、むしろソフトウェアを作ることを好みます。彼はソフトウェア設計への情熱を持っていますが、それ以上に重要なのは、技術的に多様で絶えず進化し続けるテクノロジーの世界において、ソフトウェアの品質やシステム設計に取り組むことです。

クレイグはまた、『Quality By Design: Designing Quality Software Systems』という書籍の著者であり、自身のブログサイトや世界各地のさまざまなテック系サイトで定期的に記事を執筆しています。

ソフトウェアをいじる時間がないときは、文章を書いたり、ボードゲームをデザインしたり、あるいは理由もなく長距離を走ったりしている姿をよく見かけます。

原文を表示

Revenium has announced the general availability of its Tool Registry, a new capability designed to give enterprises a complete, end-to-end view of what their AI agents actually cost. Moving beyond traditional token-based tracking, the platform attributes every expense, across APIs, third-party services, and even human intervention, back to the specific AI decision that triggered it, providing what the company claims is the first full-stack view of agent-driven spend.

While most organizations closely monitor large language model (LLM) token usage, Revenium argues that tokens represent only a fraction of the true cost of running AI agents. In real-world workflows, agents routinely call external services, such as credit bureaus, identity verification platforms, fraud detection tools, and data providers, each incurring transaction-based fees that are often significantly higher than the cost of the model itself.

A typical example illustrates the imbalance. In a loan origination workflow, token usage might cost as little as $0.30. However, the same workflow could trigger a credit report costing between $35 and $75, identity verification at $2 to $5, fraud checks at $1 to $3, and bank account verification at up to $1. In total, the workflow may cost between $50 and $85, with token usage accounting for less than 1% of the total spend. These external costs are usually buried in separate vendor invoices from providers such as credit agencies, legal data platforms, or survey tools, making it nearly impossible to trace spending back to specific AI-driven actions.

Revenium's Tool Registry is designed to close that visibility gap. The platform allows organizations to register and track any cost source involved in an AI workflow, including external APIs, SaaS platforms, internal compute services, and human review steps. Each invocation is then mapped back to the originating agent, workflow, transaction trace, and even the end customer involved. This creates a unified system of record in which every dollar spent by an AI agent can be directly tied to a business outcome.

According to John Rowell, CEO and co-founder of Revenium, the lack of cost transparency has become a recurring challenge for executives evaluating AI investments. He notes that organizations often struggle to determine whether AI is delivering real financial value because costs are fragmented across multiple systems, while outcomes are tracked elsewhere. By linking spend directly to agent decisions, the Tool Registry aims to provide a clear answer to whether AI workflows are generating a return on investment.

The timing of the release reflects broader industry trends. Gartner projects that 40 percent of enterprise applications will include task-specific AI agents by the end of 2026, a sharp increase from less than five percent in 2025. At the same time, Forrester has predicted that organizations will defer a quarter of planned AI spending into 2027 due to uncertainty around ROI. A key driver of this hesitation is the inability to measure whether AI-driven workflows are actually producing more value than they consume.

A notable aspect of the platform is its inclusion of human-in-the-loop costs. In many enterprise workflows, AI agents still rely on human review for validation, compliance, or exception handling. The Tool Registry treats these human interventions as measurable cost events within the same execution trace, enabling organizations to quantify how automation impacts human effort over time. For example, a workflow that reduces human review from 35 percent to 12 percent over several months can demonstrate tangible cost savings, but only if both human and machine contributions are tracked within the same system.

Most existing tools that measure AI cost today take a model-centric or observability-first approach, which is fundamentally different from Revenium's approach.

Platforms such as Langfuse, LangSmith, and Helicone focus primarily on tracking LLM usage, tracing agent workflows, and monitoring token consumption. They provide detailed visibility into prompts, responses, latency, and per-request costs, often breaking this down by user, feature, or session. Many of these tools use proxy- or SDK-based approaches to capture telemetry and can provide cost dashboards, alerts, and evaluation frameworks to assess output quality. However, their definition of "cost" remains largely tied to LLM interactions and infrastructure-level metrics, with limited visibility into what happens beyond the model call.

Even more advanced platforms, such as Arize AI and Datadog LLM Observability, extend this by combining performance monitoring, tracing, and evaluation, giving enterprises a unified view of model behavior and system health. These tools can trace multi-step agent workflows and provide end-to-end telemetry. However, they still tend to treat cost as a byproduct of system usage (tokens, compute, requests) rather than a fully attributed business expense tied to external services and outcomes.

Revenium's approach diverges by shifting from "AI observability" to "AI financial attribution." Instead of asking how much a model call costs, it asks what an entire agent decision costs in real terms, including external APIs, SaaS platforms, and human intervention. Most current tools do not natively capture this layer. While some observability platforms can trace tool usage within an agent workflow, they typically do not normalize and attribute third-party billing (e.g., per-API transaction fees or human review costs) back to a single decision or customer-level outcome.

About the Author

Craig Risi

Craig Risi is a man of many talents but has no sense of how to use them. He could be out changing the world but prefers to make software instead. He possesses a passion for software design, but more importantly software quality and designing systems in a technically diverse and constantly evolving tech world.

Craig is also the writer of the book, Quality By Design: Designing Quality Software Systems, and writes regular articles on his blog sites and various other tech sites around the world.

When not playing with software, he can often be found writing, designing board games, or running long distances for no apparent reason.

Show moreShow less

この記事をシェア

Claude Blog2026年7月2日 09:00

Claude の管理者向け支出の可視性と制御機能の強化

InfoQ重要度42026年4月24日 00:36

React Navigation 8.0アルファ版：ネイティブ下部タブの標準化、TypeScript推論と履歴機能

InfoQ2026年4月24日 00:00

Google、Room 3.0を発表：Kotlinファーストの非同期マルチプラットフォーム永続化ライブラリ

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む