エージェントAIが置き換え不可能なシステムを修復する方法
AmazonのAGI Labは、レガシーシステムの動作をシミュレーションで学習したエージェントを「ユニバーサルAPI」として活用し、置き換えが不可能な既存インフラの運用知識を保存・統合する手法を提示している。
キーポイント
レガシーシステムの現代社会における不可欠性と置き換えの困難性
金融、航空、行政などの基幹システムは1960-70年代に構築され、Web層でラップされたまま数十年間運用されており、ダウンタイムや完全リプレイスが現実的に不可能な状態にある。
高忠実度シミュレーションを用いたエージェントの動作学習手法
理想化されたインターフェースではなく、既存システムの癖、遅延、エラー状態、隠れた依存関係を正確に捉える高忠実度シミュレーション上でエージェントを訓練するアプローチを採用している。
エージェントを「ユニバーサルAPI」として機能させ、属人化知識の継承
複雑な既存システムの詳細を背後で管理することで、エージェントは単一の統一インターフェースとして振る舞い、長年人間に受け継がれてきた属人化トリビア知識をAIが継承・標準化する。
社会インフラの安定運用を損なわない統合的インターフェース提供
システムを再構築したりオフラインにしたりすることなく、既存ワークフローの中心にある脆弱なインフラに対して、安全かつ統一された操作環境を提供する。
要点タイトル
1-2文の説明
システム欠陥を含む完全な動作学習
エージェントはシステムの不具合や遅延、履歴を学習することで、表面的な正しさを超え、エラー発生時の回復とワークフローの推論が可能になる。
レガシーシステム向けの「合成API」構築
APIが存在しない既存インターフェースを深く学習することで、エージェントは安定したプログラムmaticな操作面を提供し、システム間の連携や段階的な近代化を可能にする。
影響分析・編集コメントを表示
影響分析
レガシーシステムへの依存が社会インフラの根幹をなす中、AIエージェントを用いた「置き換え而非リプレイスメント」のアプローチは、企業DXの現実的な道筋を示す。これにより、技術的負債を抱える大規模組織でも、属人化リスクを軽減しつつシステム統合を進められる。
編集コメント
レガシーシステムのリプレイスメントに注目が集まりがちな現状において、「既存システムの習熟を通じて統合インターフェースを構築する」という発想は、実務レベルのDX推進に極めて現実的かつ効果的な戦略である。
タイトル: エージェントAIが置き換えられないシステムをいかに修復するか(続き 2/2)
エージェントは、異なる形の継続性を提供します。失われた文書ではなく、システムそのものから動作を学習することで、エージェントは、そうでなければ失われてしまう運用ロジックを保存できます。安全に書き換えられる人がいないコードの上に成り立つワークフローを安定化させ、そうでなければ労働力の陳腐化と共に失われる制度的知識を継承することができます。
この意味で、今後必要とされる作業は二つあります。一つは、これらの環境が求める信頼性を満たすエージェントを構築する技術的作業です。もう一つは、人々がもろいインターフェースに縛られなくなったときに初めて可能になる、人間による作業です。どのフィールドを二回入力しなければならないかを暗記する作業ではなく、判断、調整、共感、設計に基づく作業です。
エージェントは私たちのデジタル世界の基盤を再構築しません。しかし、別のものを再構築するかもしれません。すなわち、「イノベーションは置き換えのみから生まれる」という私たちの概念です。もろいシステムを安定したプラットフォームに変えることで、エージェントは新たな進歩のモデルを提供します。すでに機能しているものから成長するモデルです。
研究分野: 会話型AI
タグ: エージェントAI, 強化学習, 生成AI
原文を表示
How agentic AI helps heal the systems we cant replace
By learning the idiosyncrasies of accumulated layers of legacy systems, AI agents can preserve institutional knowledge and provide a unified interface to a range of services.
Conversational AI
Staff writer March 16, 09:00 AM March 16, 09:00 AM Many of the worlds most important systems the ones that move money, book flights, issue licenses, and process claims are slow, brittle, and deeply outdated. Built decades ago and extended repeatedly, they now sit at the center of workflows too vital to pause, take offline, rebuild, or replace.
Inside Amazons Artificial General Intelligence (AGI) Lab, teams train agents not on idealized interfaces but on high-fidelity simulations of such legacy systems. Learning the real behaviors of these systems the quirks, delays, error states, and invisible dependencies makes possible a different kind of innovation, one that grows from the systems we have instead of requiring their replacement. And by managing the idiosyncrasies of legacy systems behind the scenes, the agent effectively becomes a universal API a single interface that the customer can use to perform a wide range of special-purpose tasks.
image Over time, modernization settled into layers: a mainframe instruction set at the bottom; a 1990s database above it; a 2000s portal above that; and a modern web interface masking everything beneath. The legacy systems that power everyday life
Step behind the scenes of any large institution a bank, an insurer, a hospital, a state agency and youll find the same thing: an invisible layer of human labor holding software together. People know which buttons must be clicked in which order, which warnings can be ignored, which fields must be entered twice, and which screens must never be refreshed. The institutional knowledge required to navigate these eccentricities is passed down like the oral traditions of legacy systems.
Much of the infrastructure beneath these workflows is older than the people managing it. The software backbone of modern finance, insurance, travel, scientific research, and public services took shape in the 1960s and 70s, built on mainframe architectures and written in languages like COBOL and FORTRAN designed for stability rather than adaptability.
When the web arrived, institutions didnt rebuild. They wrapped. Web forms fed mainframe jobs, middleware translated modern inputs into decades-old formats, and enterprise portals accumulated atop business rules that were never rewritten. Over time, modernization settled into layers: a mainframe instruction set at the bottom; a 1990s database above it; a 2000s portal above that; and a modern web interface masking everything beneath. A single transaction today might pass through all these layers scripts, connectors, and integrations holding them together in ways no one fully understands.
Attempts to replace these systems routinely stall. Dependencies surface no one knew existed, migrations fail, budgets spiral, and public-sector modernization efforts collapse under their own complexity. These systems cannot be taken offline, which means institutions must keep operating them no matter how brittle they become. For Amazon, this is one of the most compelling applications of agentic AI navigating not the polished surfaces of web-era consumer apps but the deep, slow-moving architectures that keep institutions running.
image Many state agency workflows require entering the same information twice once for the UI layer and once for the backend batch job that processes it. The agent trains on these odd redundancies: fields that reject data until another field is saved, warnings that must be dismissed before real progress begins, and confirmation steps that look identical but encode different logic. Each repetition teaches the agent the rules humans pass down like folklore. Learning the bad to heal the bad
The hardest part of training an AI agent is not teaching it what a successful workflow looks like; its teaching it why workflows fail. The logic behind legacy systems reveals itself most clearly through friction: the modal (mandatory) window that appears late because it encodes a sequencing rule; the field that refuses input until another value is saved; the form that resets because a backend job restarted midflow. These behaviors arent glitches. They are the real semantics of the system.
Researchers at Amazons AGI Labs seek this friction out. To surface failure modes safely and repeatedly, Amazon trains agents inside reinforcement learning (RL) gyms synthetic environments designed to reproduce the quirks, delays, and ordering rules embedded in real workflows. These include synthetic web environments that simulate systems like state agencies, airline bookings, and specialized tax- and benefits-processing, among hundreds of others.
Jason Laster, an AGI software engineer who works on agent training and replay systems, puts it plainly: I want to push our RL training gyms to have all of the warts, all of the issues.
This is what learning the bad to heal the bad means: training an agent on the full spectrum of a systems true behavior, including flaws, inconsistencies, delays, and all the embedded histories humans have quietly adapted to. By exposing agents to the same brokenness people navigate every day, Amazon trains them to move beyond surface correctness and understand the deeper logic beneath the interface.
image A common problem with state agency systems is pages that submit forms, spin, and then return to their original states with no explanation. In the gym, the agent learns to recognize this pattern, revalidate the system state, re-enter only whats necessary, and attempt the workflow again without corrupting anything. What looks like stubbornness is actually sensitivity to the systems real semantics. Agents as a new interface layer
Once an agent can reliably navigate the idiosyncrasies of legacy interfaces, something more interesting begins to happen. Researchers have observed agents inferring not just what to click next but why the latent workflow the interface expresses. In one simulated benefits application environment, an agent that realized it had added only one dependent was able to navigate back, correct the omission, and resume the flow without restarting an early sign of understanding the nature of the system.
For lab members, this marks an architectural turning point. Many institutional systems simply dont expose APIs that reflect how real workflows behave; the only faithful expression of the logic is the interface itself. As Laster puts it, the UI was designed to be discoverable, learnable even if its bad. When agents learn that layer deeply enough to predict outcomes and recover from failures, they begin to function as a kind of synthetic API a stable, programmatic surface over infrastructure that cant be changed. That shift enables new architectural possibilities:
Stable semantics over unstable UIs: Agents turn inconsistent behaviors delays, re-renders, partial saves into predictable patterns.
Cross-system abstraction: Because the agent reasons about the workflow rather than the application, it can bridge systems never designed to interoperate.
Incremental modernization: Institutions can update components gradually without breaking workflows; the agent absorbs transitional fragility.
Preservation of institutional logic: Agents retain operational knowledge otherwise stored only in human memory rules, sequences, dependencies no one has documented.
This is not workflow automation. It is a new interface layer for the worlds oldest systems an upgrade path that doesnt require turning anything off.
The work ahead
Agentic AI will not replace the administrative tasks that structure daily life booking vacations, renewing licenses, scheduling medical appointments but it can help make them more efficient by allowing the evolution of systems once too fragile to change.
image Adding a pet to an existing flight reservation looks simple, but the workflow exposes how many layers sit beneath a modern booking portal. In this workout, the agent learns to detect whether the system has truly registered the pet entry or silently dropped it. It must revalidate the itinerary without duplicating actions, re-enter only whats necessary, and recover when the workflow jumps backward without warning. Mastering this drill means learning the real logic beneath the interface not the version the UI pretends to show. That fragility is becoming more acute. The programmers who built the institutional backbone of the 1960s and 70s COBOL batch jobs, FORTRAN routines, mainframe integrations are retiring. Few younger developers learn these languages, and the knowledge embedded in those systems grows harder to access each year. Critical workflows now run atop software whose inner workings fewer and fewer people understand.
Agents offer a different form of continuity. By learning how these systems behave not from lost documentation but from the systems themselves they can preserve operational logic that would otherwise disappear. They can stabilize workflows sitting atop code no one can safely rewrite and carry forward institutional knowledge that would otherwise age out of the workforce.
In that sense, the work ahead is twofold. There is the technical work of building agents that can meet the reliability these environments demand. And there is the human work that becomes newly possible when people are no longer trapped inside brittle interfaces work grounded in judgment, coordination, empathy, and design rather than memorizing which field must be entered twice.
Agents will not rebuild the foundations of our digital world. But they may rebuild something else: our notion that innovation comes only from replacement. By turning brittle systems into stable platforms, agents offer a new model of progress one that grows from what already works.
Research areas: Conversational AI
Tags: Agentic AI, Reinforcement learning, Generative AI
関連記事
エンタープライズにおけるエージェンシックAI パート2:ペルソナによるガイダンス
Amazon SageMaker Unified StudioとSageMaker Catalogを使用したオフライン機能ストアの構築
WorkhumanがAmazon QuickSight埋め込みダッシュボードで構築したマルチテナント・セルフサービス型レポーティング
今日のまとめ
AI日報で今日の重要ニュースをまとめ読み