AI News·2026年6月23日 01:11·約9分で読める

Sakana AI の「Fugu」がベンダーロックインを緩和するマルチエージェントモデルとして登場

#マルチエージェントシステム #LLM オーケストレーション #AI ソーブリン #Sakana AI #Fugu

TL;DR

Sakana AI は、単一ベンダー依存や地政学的リスクを回避するため、複数のモデルを動的に連携させる「Fugu」マルチエージェントシステムを発表し、企業向けの高耐性な AI インフラを提供した。

AI深層分析2026年6月23日 02:02

重要/ 5段階

深度40%

キーポイント

ベンダーロックインと地政学リスクの緩和

単一 API への依存による供給途絶リスクに対し、多様なモデルプールを動的にルーティングするアーキテクチャで、AI ソーブリンを実現する。

OpenAI 互換エンドポイントと内部オーケストレーション

ユーザーは単一のインターフェースからアクセスし、システムが内部でモデルの選択、委任、検証、合成を自動処理して複雑なタスクを解決する。

用途別 2 つのデプロイメントティア

低遅延を重視した標準版と、学術論文再現や特許分析など高精度が求められる Ultra バリアントが用意され、データガバナンスへの対応も可能。

サイバーセキュリティ分野での実証

ベータテストでは 500 名以上の早期ユーザーが長期の計算ワークフローを処理し、完全なセキュリティ評価サイクルの自動化に成功した。

厳格なコンプライアンスと安全なペネトレーションテスト

マルチエージェントのルーティングにより、攻撃範囲を厳守し、破壊的なアクションを行わずに複雑なセキュリティチェックを実行できることが実証された。

コードレビューにおける高精度なバグ検出

従来のモノリス型ツールと比較して、Fugu Ultra は論理欠陥やセキュリティ脆弱性の検出率が著しく高く、見落としを防ぐ能力を示した。

長時間セッションでの人格安定性

単一呼び出しモデルが抱える文脈の劣化やアイデンティティの漂移（ドリフト）に比べ、Fugu Ultra は長期の自律的な研究タスクにおいて高い人格安定性を維持する。

影響分析・編集コメントを表示

影響分析

この発表は、企業が特定の AI ベンダーや地政学的状況に縛られずにインフラを構築できる新たなパラダイムを示しており、AI ソーブリンの実現に向けた重要な一歩となる。特に、マルチエージェント技術を用いてリスク分散と性能向上を同時に達成するアプローチは、今後_enterprise_ AI 導入の標準的な設計思想として定着する可能性が高い。

編集コメント

単一モデル依存のリスクを解消する「Fugu」のアプローチは、地政学的な不確実性が高まる現代において、企業にとって極めて戦略的な意義を持つ。マルチエージェント技術の実用化と、それをビジネス継続性の担保に結びつけた点は注目すべき進展である。

Sakana AI は、Fugu を立ち上げ、エンタープライズ展開における単一ベンダー依存リスクを緩和し、マルチエージェント運用をオーケストレーションします。

企業は、モノリス型 AI API に完全に依存することで運用上の脆弱性に直面しています。日本の AI 企業である Sakana AI は、これらの集中リスクへの対応として、多様なモデルのプールを呼び出して多段階タスクを完了させるオーケストレーション言語モデル（orchestration language model）Fugu を設計しました。

ユーザーはこのエコシステムに、単一の OpenAI 互換エンドポイントを通じてアクセスします。Fugu はクエリを内部でルーティングし、プロンプトを直接解決するか、より深い分析のために専門家のモデルからなる調整されたチームを組み立てるかを判断します。このシステムは、モデルの選択、委任、検証、合成を内部で処理します。エンジニアリングチームは一つのモデルと対話しているように見えますが、実際の計算を実行するのは背後にある専門家システムの群れです。

Sakana AI は、AI ソーシングに伴う地政学的および規制リスクを対象としています。Fable や Mythos といった Anthropic モデルに影響を与える最近の輸出管理は、特定の基盤アーキテクチャへのアクセスが外交政策の決定に基づいて消失し得ることを示しました。

Fugu は、これらの突発的なサプライチェーン断絶に対するヘッジとして機能します。このプラットフォームは、完全に交換可能なエージェントプールに依存しています。Fugu は、制限されたまたは劣化したプロバイダーを迂回してトラフィックを動的にルーティングすることで、サービスの継続性を維持します。Sakana AI によれば、この機能は AI の主権に必要なレジリエントなアーキテクチャを提供するものです。

Fugu デプロイメントティア

2 つのティアが用意されており、異なる運用上のレイテンシ要件に対応しています。

標準的な Fugu モデルは、日常業務における低レイテンシを優先し、Codex などの標準的な開発者ツールに統合され、ライブコーディングやコードレビューに活用されます。厳格なデータガバナンスまたはプライバシーの要請に直面している組織では、手動で特定の基盤モデルを標準的な Fugu ルーティングプールから除外することも可能です。

Fugu Ultra は、最大限の精度が求められる複雑な多段階分析問題を対象としています。Ultra バリアントは、学術論文の再現、文献調査、特許分析といった集中的なタスクのために、より深い専門家エージェントのプールを調整します。

Sakana AI によると、Fugu Ultra は科学、工学、推論ベンチマークにおいて、Fable 5 や Mythos Preview などの主要なクローズドモデルと競合する性能を発揮しています:

このオーケストレーション手法により、企業はクローズドモデルに内在するベンダー集中リスクや輸出規制への曝露を伴うことなく、最高レベルのコンピューティング能力にアクセスできます。

サイバーセキュリティにおける実装

約 500 名の初期ユーザーが、長期にわたるベータプログラムを通じて、長大で多段階の計算ワークフローに焦点を当てたシステムをテストしました。Claude Mythos などのモデルにおいてサイバーセキュリティが主要な焦点となっている中、エンジニアリングチームは Fugu Ultra を導入し、完全なセキュリティ評価サイクルの自動化を実現しました。

人間オペレーターがスコープ限定の指示を出し、オーケストレーションエンジンが偵察フェーズ全体を実行しました。モデルはクロスサイトスクリプティング（Cross-Site Scripting）および SQL インジェクション（SQL Injection）チェックを成功裡に実施するとともに、徹底的な認証レビューも完了させました。

参加したサイバーセキュリティエンジニアは、モデルが運用パラメータ内に厳格に留まり、標的インフラに対して破壊的なアクションを開始しないことを確認しました。Fugu は、検証証拠と人間のリメディエーションチームによる再テスト手順を明記したクリーンな脆弱性レポートを生成することで、自動化されたエンゲージメントを終了させました。

この実装により、マルチエージェントルーティングが複雑なペネトレーションテストシーケンスを実行しながらも、厳格なコンプライアンス境界を維持できることが示されました。

ソフトウェア開発チームはまた、Fugu Ultra を主要なコードレビューパイプラインに統合し、既存のモノリス型ツールとの欠陥検出率を比較しました。オーケストレーションエンジンは、複雑なエンタープライズコードベース内のロジック欠陥やセキュリティ脆弱性を特定する際、常にベースラインモデルを上回る性能を発揮しました。

「コードレビューにおいては、Fugu Ultra は GPT-5.5 よりもはるかに優れています。包括的な回答を提供し、他で見逃されるバグも発見します」と、ベータ展開に関与したソフトウェアエンジニアは報告しています。「他のツールが約 3 つの問題を指摘するのに対し、Fugu は 20 件以上を浮き彫りにしました。今や私がすべてのレビューに使用するモデルとなっています。」

自動化された研究とペルソナの安定性

データサイエンス部門は、ほぼ完全自動化された研究モードでこのシステムを展開しました。Fugu Ultra は数学的仮説の探索、実験コードの実行、失敗状態の解釈、そして最小限の人間的介入で長期間にわたる進捗を維持するための自らのアプローチの修正を成功裏に行いました。この機能は、論理エラーからの回復のために絶え間ない人間の指示を必要とする単一呼び出しモデルの運用上の限界に直接対応するものです。

ある無名のエンタープライズプラットフォーム企業のリーダーシップ層は、これらの長期セッションにおいて、長期的なペルソナの安定性を主要な利点として特定しました。従来のモノリス型アーキテクチャでは、広範な会話履歴を処理する際に文脈の劣化やアイデンティティの漂移（ドリフト）に陥る傾向があります。

「生出力の品質は最先端モデルと同等ですが、Fugu は長期セッション全体で非常に強いペルソナ安定性を示し、他のモデルが漂移する中でそのアイデンティティを維持しました」と執行役員は述べています。「エージェント製品においては、これは純粋なベンチマークスコアよりも重要である可能性があります。」

長期にわたるベンチマーク検証

Sakana AI は、学習されたモデルのオーケストレーションに関する広範な研究に基づき、内部ルーティングロジックを構築しました。この製品の技術的基盤は、同社が ICLR 2026 で発表した論文、特に Trinity および Conductor フレームワークにおける知見に由来しています。

これらの学術的基盤により、Fugu はタスクに委譲が必要か直接解決が必要かを正確に理解してリクエストを処理できます。内部の言語モデルが個々のエージェント間の通信プロトコルを規定し、それらの個別の計算出力の最終的な統合を構造化します。

最先端 AI 競合他社に対する検証テストでは、金融時系列予測から機械設計に至るまで、複雑でオープンエンドな分野が網羅されました。Fugu はまた、ルービックキューブの解決や日本語筆跡分析など、ニッチな物理ロジックテストや視覚解釈タスクにおいても高い熟練度を示しました。定量的な金融モデリングと定性的な画像処理の両方で卓越する能力は、マルチエージェントオーケストレーションアプローチの有効性を裏付けています。

Sakana AI は、より広範な AI ハードウェアおよびソフトウェア市場が成熟するにつれてシステムを有機的に拡張できるように設計しました。製品が固定された運用ルールセットではなく、学習されたオーケストレーションロジックに完全に依存しているため、サードパーティのイノベーションから自動的に恩恵を受けます。Sakana AI は、利用可能なエキスパートエージェントのプールを継続的に拡大する計画です。

エンジニアリングチームは、新たにリリースされたオープンソースツールや独自開発の Sakana AI モデルが利用可能になった次第に、ルーティングプールに取り込んでいきます。標準版 Fugu および Fugu Ultra モデルは、本日より企業クライアント向けに提供されています。

関連記事：SAP と Google Cloud がエージェント型コマースアーキテクチャを展開

業界のリーダーから AI とビッグデータについてさらに学びたいですか？アムステルダム、カリフォルニア、ロンドンで開催される「AI & Big Data Expo」をご覧ください。この包括的なイベントは TechEx の一部であり、サイバーセキュリティ＆クラウドエキスポなど他の主要なテクノロジーイベントと併催されています。詳細についてはこちらをクリックしてください。

AI News は TechForge Media によって運営されています。その他の今後のエンタープライズ向けテクノロジーイベントやウェビナーはこちらからご覧ください。

本記事「Sakana AI の Fugu マルチエージェントモデルによるベンダーロックインの緩和」は、元々 AI News に掲載されたものです。

原文を表示

Sakana AI launched Fugu to orchestrate multi-agent operations and mitigate single-vendor dependency risks in enterprise deployments.

Enterprises face operational vulnerabilities when relying entirely on monolithic AI APIs. Japanese AI firm Sakana AI designed Fugu as a response to these concentration risks by creating an orchestration language model that calls upon a pool of varied models to complete multi-step tasks.

Users access this ecosystem through a single OpenAI-compatible endpoint. Fugu routes queries internally, deciding whether to resolve a prompt directly or to assemble a coordinated team of expert models for deeper analysis. The system handles model selection, delegation, verification, and synthesis internally. Engineering teams interact with what appears to be one model while a background system of specialists executes the actual computation.

Sakana AI targets the geopolitical and regulatory risks associated with AI sourcing. Recent export controls affecting Anthropic models like Fable and Mythos demonstrated that access to specific foundational architectures can vanish based on foreign policy decisions.

Fugu functions as a hedge against these sudden supply chain disruptions. The platform relies on a completely swappable agent pool. Fugu dynamically routes traffic around any restricted or degraded provider to maintain service continuity. Sakana AI states this capability provides the resilient architecture required for AI sovereignty.

Fugu deployment tiers

Two tiers are available to accommodate different operational latency requirements.

The standard Fugu model prioritises low latency for daily tasks, integrating into standard developer tools like Codex for live coding and code review. Organisations subject to strict data governance or privacy mandates can manually opt specific underlying models out of the standard Fugu routing pool.

Fugu Ultra targets complex, multi-step analytical problems that demand maximum accuracy. The Ultra variant coordinates a deeper pool of expert agents for intensive tasks such as academic paper reproduction, literature investigations, and patent analysis.

Sakana AI reports that Fugu Ultra performs competitively against leading closed models like Fable 5 and Mythos Preview across scientific, engineering, and reasoning benchmarks:

The orchestration method ensures companies can access top-tier computing capabilities without carrying the vendor concentration risk or export control exposure inherent to those closed models.

Implementation in cybersecurity

Almost 500 early users tested the system during an extended beta program focused on lengthy, multi-step computational workflows. With cybersecurity such a focus for models like Claude Mythos, engineering teams deployed Fugu Ultra to automate complete security assessment cycles.

Human operators issued one scoped instruction, and the orchestration engine executed the entire reconnaissance phase. The model successfully conducted cross-site scripting and SQL injection checks alongside thorough authentication reviews.

A participating cybersecurity engineer confirmed the model stayed strictly within its operational parameters and avoided initiating destructive actions against the target infrastructure. Fugu concluded the automated engagement by generating a clean vulnerability report complete with verifying evidence and exact retest steps for human remediation teams.

The implementation demonstrated that multi-agent routing maintains strict compliance boundaries while executing complex penetration testing sequences.

Software development teams also integrated Fugu Ultra into their primary code review pipelines to compare defect detection rates against established monolithic tools. The orchestration engine consistently outperformed baseline models in identifying logic flaws and security vulnerabilities within complex enterprise codebases.

“For code review, Fugu Ultra is significantly better than GPT-5.5. It gives comprehensive answers and finds the bugs others miss,” reported a software engineer involved in the beta deployment. “Where other tools flag about three issues, Fugu surfaced more than twenty. It’s become the model I run all my reviews through.”

Automated research and persona stability

Data science units deployed the system in an almost fully-automated research mode. Fugu Ultra successfully explored mathematical hypotheses, executed experimental code runs, interpreted failure states, and revised its own approaches to sustain progress over extended periods with minimal human intervention. This capability directly addresses the operational limitations of single-call models that require constant human prompting to recover from logic errors.

Leadership at an unnamed enterprise platform company identified long-term persona stability as a primary advantage during these extended sessions. Conventional monolithic architectures often suffer from context degradation and identity drift when processing extensive conversational histories.

“Raw output quality is on par with top frontier models, but Fugu showed unusually strong persona stability across long sessions, holding its identity where other models drift,” the executive stated. “For agent products, that may matter more than raw benchmark scores.”

Extended benchmark validation

Sakana AI built the internal routing logic upon extensive research into learned model orchestration. The technical foundation for the product stems from findings published in the company’s ICLR 2026 papers, specifically the Trinity and Conductor frameworks.

These academic foundations allow Fugu to process requests by understanding precisely when a task requires delegation versus direct resolution. The internal language model dictates communication protocols between the individual agents and structures the final synthesis of their separate computational outputs.

Validation testing against frontier AI competitors covered complex, open-ended disciplines ranging from financial time series prediction to mechanical design. Fugu also demonstrated high proficiency in niche physical logic tests and visual interpretation tasks, including solving the Rubik’s Cube and performing Japanese handwriting analysis. The capacity to excel in both quantitative financial modelling and qualitative image processing confirms the efficacy of the multi-agent orchestration approach.

Sakana AI designed the system to scale organically as the broader AI hardware and software market matures. Because the product relies entirely on learned orchestration logic rather than fixed operational rulesets, it automatically benefits from third-party innovations. Sakana AI plans to continuously expand the available pool of expert agents.

The engineering team will fold newly-released open-source tools and proprietary Sakana AI models into the routing pool as they become available. Both the standard Fugu and Fugu Ultra models are available to enterprise clients today.

See also: SAP and Google Cloud deploy agentic commerce architecture

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo. Click here for more information.

AI News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

The post Mitigating vendor lock-in with Sakana AI Fugu multi-agent models appeared first on AI News.

この記事をシェア

Sakana AI★42026年4月27日 00:00

自然言語によるエージェントのオーケストレーション学習：Conductor モデル

研究者らは強化学習を用いて Conductor モデルを訓練し、この AI が複数の大規模言語モデルに対して効果的なカスタム指示を自然言語で自動生成する能力を獲得したことを発見しました。これにより、人間が行っていたプロンプトエンジニアリングの役割を AI が代替可能になることが示されました。

AWS Machine Learning Blog★42026年5月27日 02:41

Amazon Bedrock AgentCore を用いた AWS 上の高スケーラブルなサーバーレス LangGraph マルチエージェントシステムの構築

AWS は、生成 AI の本番環境運用における遅延や拡張性などの課題に対処するため、Amazon Bedrock AgentCore と LangGraph を組み合わせた高スケーラブルなサーバーレス型マルチエージェントシステム構築の手法を公開した。

TechCrunch AI★32026年6月24日 08:30

インドのMoEngage、マーケティングの未来は数百万のAIエージェントだと確信

インドの企業MoEngageが、マーケティングの将来は数百万台のAIエージェントによって支えられると主張している。同社はこの戦略に賭けている。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む