Anthropic Research·2026年2月18日 09:00·約3分

2026年2月18日社会的影響実践におけるAIエージェントの自律性測定

#AIエージェント #自律性 #AI安全性 #人間-AI協調

TL;DR

2026年2月18日に、AIエージェントの自律性を実践的に測定する方法とその社会的影響について考察された。

AI深層分析2026年2月24日 08:40

重要/ 5段階

キーポイント

実運用におけるAIエージェントの自律性を大規模データで初めて計測した研究

ユーザー経験と共に自律稼働時間が伸び、自動承認率が上昇する傾向を確認

医療・金融・サイバーセキュリティなど高リスク領域での使用が萌芽的に出現

効果的な監視には新たなポストデプロイメント監視インフラが必要と結論

影響分析・編集コメントを表示

影響分析

AIエージェントの実世界での使用実態を初めて大規模に可視化した研究は、自律性の管理とリスク評価のための新たな枠組み構築の必要性を示唆している。特に、ユーザーが経験を積むほど自律性を委ねる傾向は、AIと人間の協働における新たなインタラクションパラダイムの設計が急務であることを示している。

編集コメント

「AIは危険だから止める」ではなく「どう使われているかを理解し、安全に使う仕組みを作る」という実践的な安全性研究の重要な一歩。業界全体が注視すべきデータドリブンなアプローチ。

AIエージェントの自律性を実践的に測定：利用実態とリスク管理への示唆

AIエージェントは既に現実に導入され、メールの仕分けからサイバー諜報活動まで、結果の重大さが大きく異なる多様な文脈で展開されている。AIを安全に導入するためには、この多様性を理解することが極めて重要だが、人々が実際にエージェントをどのように使用しているかについては、驚くほど知見が少ない。

本研究では、プライバシー保護ツールを用いて、Claude Codeおよび公開APIにおける数百万件の人間とエージェントのインタラクションを分析し、以下の問いを探った：人々はエージェントにどれだけの自律性を与えているか？それは経験の蓄積とともにどう変化するか？エージェントはどの領域で動作しているか？エージェントが行う行動にリスクはあるか？

主な発見は以下の通りである。

Claude Codeの自律動作時間の長期化：最も長時間動作するセッションにおいて、Claude Codeが停止前に動作する時間は、3ヶ月で25分未満から45分以上へとほぼ倍増した。この増加はモデル更新に関わらず滑らかに進行しており、単なる能力向上の結果ではなく、既存のモデルも実践では発揮されていないより高い自律性の潜在能力を持つことを示唆している。

経験豊富なユーザーの行動パターン：Claude Codeのユーザーは経験を積むにつれ、各行動を逐一確認するのをやめ、必要時にのみ介入しながらエージェントを自律動作させる傾向が強まる。新規ユーザーの約20%のセッションが完全自動承認を使用するのに対し、経験を積むとその割合は40%以上に増加する。

エージェント自身による確認の頻発：人間が中断する以外に、エージェント自身が停止して明確化を求めることも、実運用システムにおける重要な監督形態である。最も複雑なタスクでは、Claude Codeが確認のために停止する頻度は、人間がそれを中断する頻度の2倍以上に上った。

リスクの高い領域での使用は限定的：公開API上のエージェント行動の大半は、リスクが低く可逆的であった。エージェント活動の約50%はソフトウェア工学が占めたが、医療、金融、サイバーセキュリティといった領域での使用も萌芽的に確認された。

本研究の中心的な結論は、エージェントを効果的に監督するためには、導入後の監視インフラの新たな形態と、人間とAIが共同で自律性とリスクを管理することを助ける新たな人間-AIインタラクションのパラダイムが必要であるという点である。

AIエージェントの研究は、定義の不一致、急速な進化、データ収集の難しさといった課題に直面している。本研究は、人々がどのようにエージェントを導入・使用しているかを実証的に理解するための小さなだが重要な第一歩と位置づけられる。エージェントの普及が進む中、手法の改良と知見の発信を継続していくことが肝要である。

原文を表示

Societal ImpactsMeasuring AI agent autonomy in practice

AI agents are here, and already they’re being deployed across contexts that vary widely in consequence, from email triage to cyber espionage. Understanding this spectrum is critical for deploying AI safely, yet we know surprisingly little about how people actually use agents in the real world.

We analyzed millions of human-agent interactions across both Claude Code and our public API using our privacy-preserving tool, to ask: How much autonomy do people grant agents? How does that change as people gain experience? Which domains are agents operating in? And are the actions taken by agents risky?

Claude Code is working autonomously for longer. Among the longest-running sessions, the length of time Claude Code works before stopping has nearly doubled in three months, from under 25 minutes to over 45 minutes. This increase is smooth across model releases, which suggests it isn’t purely a result of increased capabilities, and that existing models are capable of more autonomy than they exercise in practice.

Experienced users in Claude Code auto-approve more frequently, but interrupt more often. As users gain experience with Claude Code, they tend to stop reviewing each action and instead let Claude run autonomously, intervening only when needed. Among new users, roughly 20% of sessions use full auto-approve, which increases to over 40% as users gain experience.

Claude Code pauses for clarification more often than humans interrupt it. In addition to human-initiated stops, agent-initiated stops are also an important form of oversight in deployed systems. On the most complex tasks, Claude Code stops to ask for clarification more than twice as often as humans interrupt it.

Agents are used in risky domains, but not yet at scale. Most agent actions on our public API are low-risk and reversible. Software engineering accounted for nearly 50% of agentic activity, but we saw emerging usage in healthcare, finance, and cybersecurity.

Below, we present our methodology and findings in more detail, and end with recommendations for model developers, product developers, and policymakers. Our central conclusion is that effective oversight of agents will require new forms of post-deployment monitoring infrastructure and new human-AI interaction paradigms that help both the human and the AI manage autonomy and risk together.

We view our research as a small but important first step towards empirically understanding how people deploy and use agents. We will continue to iterate on our methods and communicate our findings as agents are adopted more widely.

Agents are difficult to study empirically. First, there is no agreed-upon definition of what an agent is. Second, agents are evolving quickly. Last year, many of the most sophisticated agents—including Claude Code—involved a single conversational thread, but today there are multi-agent systems that operate autonomously for hours. Finally, model providers have limited visibility into the architecture of their customers’ agents. For example, we have no reliable way to associate independent requests to our API into “sessions” of agentic activity. (We discuss this challenge in more detail at the end of this post.)

In light of these challenges, how can we study agents empirically?

To start, for this study we adopted a definition of agents that is conceptually grounded and operationalizable: an agent is an AI system equipped with tools that allow it to take actions, like running code, calling external APIs, and sending messages to other agents.1 Studying the tools that agents use tells us a great deal about what they are doing in the world.

Next, we developed a collection of metrics that draw on data from both agentic uses of our public API and Claude Code, our own coding agent. These offer a tradeoff between breadth and depth:

Our public API gives us broad visibility into agentic deployments across thousands of different customers. Rather than attempting to infer our customers’ agent architectures, we instead perform our analysis at the level of individual tool calls.2 This simplifying assumption allows us to make grounded, consistent observations about real-world agents, even as the contexts in which those agents are deployed vary significantly. The limitation of this approach is that we must analyze actions in isolation, and cannot reconstruct how individual actions compose into longer sequences of behavior over time.

Claude Code offers the opposite tradeoff. Because Claude Code is our own product, we can link requests across sessions and understand entire agent workflows from start to finish. This makes Claude Code especially useful for studying autonomy—for example, how long agents run without human intervention, what triggers interruptions, and how users maintain oversight over Claude as they develop experience. However, because Claude Code is only one product, it does not provide the same diversity of insight into

この記事をシェア

Anthropic Research重要度42026年4月22日 09:00

2026年4月22日経済研究：AIの経済効果について8万1000人の調査結果

Anthropic Research2026年4月22日 09:00

2026年4月22日経済研究：Anthropic経済指数調査の発表

Anthropic Research重要度42026年4月14日 09:00

自動化アライメント研究者：大規模言語モデルを用いてスケーラブルな監視を拡張

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む