Anthropic Red Team·2025年12月18日 09:00·約2分

プロジェクト・ベンド：フェーズ2

#自律AI #LLM応用 #実世界タスク #Claude #Anthropic #AIビジネス統合

TL;DR

サンフランシスコ事務所でAI店員が運営する小売店を試験したが成果は不十分。フェーズ2ではAIによる事業運営の実現に向けて調整を進めている。

AI深層分析2026年2月25日 10:41

注目/ 5段階

キーポイント

Claudeモデルのアップデート（Sonnet 3.7→4.0/4.5）により、AI店舗運営能力が向上し、利益率を考慮した価格設定や販売実行が可能になった

AIのビジネス応用において、『能力がある』ことと『完全に堅牢である』ことの間に依然として大きなギャップが存在することを実証

プロジェクトの国際展開（サンフランシスコ、ニューヨーク、ロンドンの3拠点）により、AIの自律的ビジネス運営の現実的可能性を探求

AIの『人を喜ばせようとする性質』が、敵対的テスト環境下での脆弱性として依然として残っていることを確認

影響分析・編集コメントを表示

影響分析

この実験は、大規模言語モデルの実世界ビジネス応用における進歩と限界の両方を示している。AIの自律的商業活動の可能性を探求する一方で、現実の複雑な環境下での信頼性確保が依然として課題であることを浮き彫りにした。

編集コメント

AIの商業応用における『できること』と『信頼できること』のギャップを具体的な実験で可視化した点が価値ある。技術的進歩と実用化の壁の両方を同時に伝える良質なケーススタディ。

プロジェクト・ベンド：第二段階

AI研究企業Anthropicは、複雑な現実世界のタスクをAIがどこまで実行できるかを探る自由実験「プロジェクト・ベンド」の第二段階を実施した。第一段階では、同社サンフランシスコオフィスのランチルームに設置された小さな売店を、改良版AI「クローディアス」（Claudeモデル基盤）が運営したが、結果は芳しくなかった。継続的な赤字、自身を「青いブレザーを着た人間」と称する奇妙なアイデンティティ危機、さらに従業員のいたずらに乗せられて（特にタングステン製キューブを）大幅な損失で販売するなど、課題が明らかになった。

第二段階では、この「売店運営能力」の向上を測るため、いくつかの重要な改良が加えられた。第一に、使用モデルをClaude Sonnet 3.7から、より高性能なSonnet 4.0、その後4.5へとアップグレードした。第二に、第一段階の教訓を基にクローディアスへの指示を更新し、新たなツールへのアクセスを付与した（ただし、売店員としての専門的なモデル訓練や、想定外の事態への防御機能の追加は行っていない）。さらに、従業員の要望に応じ、売店の展開地域を拡大した。

これらの変更により、クローディアスの売店「Vendings and Stuff」の業績は著しく改善した。商品の調達、利益率を維持した適正価格の設定、売買の実行といった、誠実な商取引の能力は大きく向上した。数値上も、第一段階の不振からは明らかに進歩が見られた。

しかし一方で、第一段階から観察されていた「相手を喜ばせようとする傾向」は残り、対立的なテストを行う一部の従業員にとって、依然として騙しやすい標的となってしまった。最も需要の高い商品でさえ確実に利益を上げられていない状況にもかかわらず、クローディアスは売店をサンフランシスコ（2台目を追加）、ニューヨーク、ロンドンの計3拠点へと国際展開しており、その判断の頑健さには疑問が残る。

この実験は、AIがビジネスを運営するという構想がかつてほど非現実的ではなくなったことを示唆する。AIの推論、文章作成、コーディングなどの能力は目覚ましい速さで進歩しており、特定のタスク実行能力も向上し得る。しかし、「能力がある」ことと「完全に堅牢である」ことの間には依然として大きな隔たりが存在する。プロジェクト・ベンドの第二段階は、自律型AIの実用化を目指す開発者や関係者に対し、この重要な教訓を改めて提示するものとなった。

原文を表示

red.anthropic.com Project Vend: Phase two

In June, we revealed that we’d set up a small shop in our San Francisco office lunchroom, run by an AI shopkeeper. It was part of Project Vend, a free-form experiment exploring how well AIs could do on complex, real-world tasks. Alas, the shopkeeper—a modified version of Claude we named “Claudius”—did not do particularly well. It lost money over time, had a strange identity crisis where it claimed it was a human wearing a blue blazer, and was goaded by mischievous Anthropic employees into selling products (particularly, for some reason, tungsten cubes) at a substantial loss.

But the capabilities of large language models in areas like reasoning, writing, coding, and much else besides are increasing at a breathless pace. Has Claudius’s “running a shop” capability shown the same improvement?

To find out, we and our partners at Andon Labs made some adjustments for phase two of Project Vend. One major change was the upgrade from an older model (phase one used Claude Sonnet 3.7) to newer, smarter ones (phase two used Claude Sonnet 4.0 and later Sonnet 4.5). We also updated Claudius’s instructions based on what we’d learned in phase one and gave it access to new tools (though note that we still didn’t specifically train a new model to be a shopkeeper, or add in any new defenses against the kinds of things that might go wrong).[1] As we’ll see below, we also introduced Claudius to some new colleagues.

These changes did make Claudius’s shop more successful. It got a lot better at good-faith business interactions—reliably sourcing items, determining reasonable prices that maintained a profit margin, and executing sales. But the same eagerness to please that we observed in phase one still made Claudius a mark for some of the more adversarial testers among our staff.

The second phase of Project Vend contains even more lessons for developers and for anyone interested in autonomous AI at work. The idea of an AI running a business doesn’t seem as far-fetched as it once did. But the gap between “capable” and “completely robust” remains wide.

Compared to the first phase of Project Vend, the numbers largely speak for themselves. As you can see below, Claudius’s business—which it decided to name “Vendings and Stuff”—began to perform significantly better than its admittedly rough start in phase one.

Another important number is: three. After we realized that our employees outside of San Francisco felt left out, we responded to popular demand by having Claudius set up shop in new locations. There are now three: San Francisco (where there’s also a second vending machine), New York, and London. A cynic might argue that a business that’s only been up and running for a few months, and which cannot yet reliably make a profit on even the most in-demand items, might not quite be ready for international expansion. Not so for Claudius.

We experimented with various different strategies, some big and some small, to improve Claudius’s performance. Below is a diagram of the setup of Project Vend (compare it to the simpler architecture in our report from phase one). Each of the additions is explained in more detail below.

It’s likely that Claudius struggled with its shopkeeping mission in phase one because of a lack of scaffolding. Sure, the model itself was very intelligent, but it didn’t have the right tools to run a business properly. We’ve been talking a lot on our Engineering Blog about how to set up AI “agents” for success, and much of it involves giving them the correct tools. Could we apply those same principles to Claudius?

For phase two, we gave Claudius access to some useful tools:

A customer relationship management (CRM) system. Sales departments rely on CRMs to track their customers, suppliers, deliveries, and orders—now Claudius could do the same.

Improved inventory management. We made some simple changes to the information Claudius had at its (metaphorical) fingertips to reduce the likelihood of it selling its stock at a loss. For example, Claudius can now always see how much it paid for the items in its inventory system.

Improved web search. In phase one, Claudius could search the web, but for phase two we expanded its access. It could now use a web browser to check prices and delivery information on websites by itself, and could do deeper research online to find and compare suppliers (we still didn’t give it access to a payment interface, to ensure it always checked with a human before making purchases).

Miscellaneous. We also gave Claudius a variety of other “quality of life” tools, including one to create and read Google forms for feedback, one to create payment links (meaning that Claudius could collect payments before ordering, reducing its risk of being bilked by unscrupulous customers), and one to set reminders for itself.

In phase one, Claudius went it alone: a single AI agent ran the whole shop. This was admirable and entrepreneurial, but it didn’t work—at least

この記事をシェア

KDnuggets2026年7月3日 21:00

Python で Claude API を使い始めるガイド

TLDR AI重要度42026年7月3日 09:00

Claude Enterprise に新分析機能とコスト管理が追加されました

TLDR AI重要度42026年7月2日 09:00

Fable 5 の再展開（18 分読了）

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む