VentureBeat AI·2026年1月5日 16:45·約8分

Claude Codeの開発者がワークフローを公開、開発者たちが熱狂

#LLM #Agent Orchestration #Software Development #Anthropic #Claude Code

TL;DR

Anthropic の Claude Code 創設者ボリス・チェルニーが公開した、並列実行と最上位モデル活用による開発ワークフローは、単一開発者の生産性を劇的に向上させる新たなパラダイムを示している。

AI深層分析2026年5月2日 10:07

重要/ 5段階

深度40%

キーポイント

並列エージェントによる指揮官型開発

従来の直線的なコーディングではなく、ターミナルで最大 5 つの Claude エージェントを並行して実行し、テスト、リファクタリング、ドキュメント作成などを同時に進行させる「リアルタイム戦略ゲーム」のようなアプローチを採用している。

最速ではなく最賢モデルの採用

遅延を重視する業界の常識に反し、推論能力とツール使用能力が最も高い Anthropic の最重厚・最遅モデル「Opus 4.5」を全てのタスクで優先的に使用することで、手動の介入（steering）を最小化し、結果として全体の処理速度を向上させている。

インフラ構築よりオーケストレーション

競合他社が巨額のインフラ構築に注力する中、既存モデルの優れたオーケストレーション（管理・連携）によって指数関数的な生産性向上を実現できるという、Anthropic の「少ないリソースで多くを成す」戦略の実証例となっている。

推論モデルによる修正コストの削減

より遅く大きな「Opus 4.5 with thinking」を使用することで、AI のミスを人間が修正する時間が大幅に減り、結果として開発全体が高速化される。

CLAUDE.mdによる学習の蓄積

Git リポジトリ内の「CLAUDE.md」ファイルにミスやルールを記録することで、AI がプロジェクト固有のコーディングスタイルや過去のミスを記憶し、継続的に改善される。

検証ループによる自律的な品質保証

AI は単なるコード生成だけでなく、ブラウザ拡張機能を使って UI をテストし、UX が良くなるまで反復して修正する「検証ループ」を自動化している。

自己検証機能による品質向上

AI がブラウザ自動化やテストスイートの実行を通じて自身の作業を検証できることで、最終結果の品質が2-3倍に向上する。

影響分析・編集コメントを表示

影響分析

この記事は、単なるツールの紹介を超え、AI エージェント時代におけるソフトウェア開発の根本的なプロセス変更を示唆しています。開発者が個別のコードを書くのではなく、複数の自律型エージェントを指揮・管理する役割へとシフトすることで、個人が小規模チーム並みの生産性を発揮できる可能性を開きました。これは業界全体の開発文化とリソース配分のあり方を変える重要な転換点となり得ます。

編集コメント

開発者の生産性向上を巡る競争が、ハードウェアの拡張から「モデルの賢さ」と「ワークフローの最適化」へと焦点を移している象徴的な事例です。特に「遅いほど速い」という逆説的な成功体験は、AI エージェントの実用化における重要な示唆を含んでいます。

世界最高峰のコーディングエージェントの創造者が語るとき、シリコンバレーは耳を傾けるだけではなく、メモを取る。

この一週間、エンジニアリングコミュニティは、AnthropicのClaude Codeの創造者であり責任者であるボリス・チェルニーによるX上のスレッドを精査し続けている。彼の個人的なターミナル設定の気軽な共有から始まった投稿は、ソフトウェア開発の未来に関するバイラルなマニフェストへと発展し、業界関係者はこれを同スタートアップの画期的な瞬間と呼んでいる。

開発者コミュニティの著名な声であるジェフ・タンはこう書いた。「Claude Codeのベストプラクティスをその創造者から直接読んでいないなら、プログラマーとして遅れを取っている」。別の業界観察者、カイル・マクニースはさらに踏み込み、チェルニーの「ゲームチェンジをもたらすアップデート」により、Anthropicは「火がついて」おり、「彼らのChatGPT的瞬間」に直面している可能性があると宣言した。

この興奮は一つの逆説から生じている。チェルニーのワークフローは驚くほどシンプルでありながら、たった一人の人間が小さなエンジニアリング部門に匹敵する出力能力で作業することを可能にしているのだ。Xでチェルニーの設定を実践したあるユーザーが指摘したように、その体験は従来のコーディングというより「むしろ『スタークラフト』のようだ」——構文をタイプすることから、自律ユニットを指揮することへの転換である。

以下は、ソフトウェアがどのように構築されるかを再形成しているワークフローの分析であり、その設計者本人から直接もたらされたものだ。

5つのAIエージェントを同時に動かすことが、コーディングをリアルタイムストラテジーゲームに変える方法

チェルニーの開示から最も印象的な啓示は、彼が直線的な方法ではコードを書かないということだ。開発における従来の「内側のループ」では、プログラマーは関数を書き、テストし、次に進む。しかしチェルニーは艦隊司令官のように振る舞う。

「私はターミナルで5つのClaudeを並列実行しています」とチェルニーは書いた。「タブを1から5まで番号付けし、システム通知を使ってどのClaudeが入力を必要としているかを把握します」。

(credit: x.com/bcherny)

iTerm2のシステム通知を活用することで、チェルニーは5つの同時作業ストリームを効果的に管理する。1つのエージェントがテストスイートを実行している間、別のエージェントはレガシーモジュールをリファクタリングし、3つ目はドキュメントを起草する。また、ブラウザでは「claude.ai上で5〜10のClaude」を実行し、「テレポート」コマンドを使ってウェブとローカルマシンの間でセッションを引き継いでいる。

これは、今週初めにAnthropic社長のダニエラ・アモデイが述べた「少ないリソースでより多くを成し遂げる」戦略を裏付けるものだ。OpenAIのような競合他社が兆ドル規模のインフラ構築を追求する一方で、Anthropicは既存モデルの優れたオーケストレーションが生産性の指数関数的向上をもたらしうることを証明している。

最も遅く、最も賢いモデルを選ぶという直感に反する理由

レイテンシーに執着する業界において驚くべき動きとして、チェルニーはAnthropicの最も重く、最も遅いモデルであるOpus 4.5を排他的に使用していることを明かした。

「私はすべての作業に、思考機能付きのOpus 4.5を使用しています」とチェルニーは説明した。「これは私が今まで使った中で最高のコーディングモデルです。Sonnetよりも大きく遅いにもかかわらず、指示を細かく出す必要が少なく、ツール使用にも優れているため、最終的にはほぼ常に小さなモデルを使うよりも速くなります」。

企業のテクノロジーリーダーにとって、これは重要な洞察だ。現代のAI開発におけるボトルネックは、トークンの生成速度ではなく、AIの誤りを修正するために費やされる人間の時間である。チェルニーのワークフローは、より賢いモデルに対して先に「計算コスト」を支払うことが、後の「修正コスト」をなくすことを示唆している。

1つの共有ファイルが、AIのあらゆる失敗を恒久的な教訓に変える

チェルニーはまた、彼のチームがAIの「記憶喪失」問題をどのように解決しているかも詳述した。標準的な大規模言語モデルは、セッションをまたいで企業固有のコーディングスタイルやアーキテクチャ上の決定を「記憶」しない。

この問題に対処するため、チェルニーのチームはgitリポジトリにCLAUDE.mdという単一のファイルを維持している。「Claudeが何かを誤って行っているのを見つけるたびに、それをCLAUDE.mdに追加します。そうすればClaudeは次回それをしないようになります」と彼は書いた。

この慣行は、コードベースを自己修正する有機体へと変える。人間の開発者がプルリクエストをレビューして誤りを見つけたとき、彼らは単にコードを修正するだけでなく、AIにタグを付けて自身の指示を更新させる。「あらゆる失敗がルールになる」と、このスレッドを分析したプロダクトリーダーのアーカシュ・グプタは指摘した。チームが協働する期間が長ければ長いほど、エージェントはより賢くなる。

スラッシュコマンドとサブエージェントが、開発の最も退屈な部分を自動化する

ある観察者が称賛した「標準的」ワークフローは、繰り返し作業の厳格な自動化によって支えられている。チェルニーはスラッシュコマンド——プロジェクトのリポジトリに登録されたカスタムショートカット——を使用して、複雑な操作を一つのキーストロークで処理する。

彼は、/commit-push-prというコマンドを強調した。これを彼は1日に数十回呼び出す。手動でgitコマンドを打ち、コミットメッセージを書き、プルリクエストを開く代わりに、エージェントがバージョン管理の事務作業を自律的に処理するのだ。

チェルニーはまた、サブエージェント——専門化されたAIのペルソナ——を配備して、開発ライフサイクルの特定のフェーズを処理させている。メインの作業が終わった後にアーキテクチャを整理するコード簡略化エージェントや、何かをリリースする前にエンドツーエンドテストを実行するverify-appエージェントを使用している。

検証ループが、AI生成コードにとって真の突破口である理由

Claude Codeが年間経常収益10億ドルにこれほど急速に達したと言われる理由が一つあるとすれば、それはおそらく検証ループである。このAIは単なるテキスト生成器ではない。それはテスターなのだ。

「Claudeは、Claude Chrome拡張機能を使って、私がclaude.ai/codeに反映させるすべての変更をテストします」とチェルニーは書いた。「ブラウザを開き、UIをテストし、コードが動作しUXが良好に感じられるまで反復します」。

彼は、AIに自身の仕事を検証する手段——ブラウザ自動化、bashコマンドの実行、テストスイートの実行などを通じて——を与えることが、最終結果の品質を「2〜3倍」向上させると主張する。エージェントは単にコードを書くだけではない。そのコードが動作することを証明するのだ。

チェルニーのワークフローが示すソフトウェアエンジニアリングの未来

チェルニーのスレッドへの反応は、開発者が自らの技について考える方法における決定的な転換を示唆している。長年、「AIコーディング」とはテキストエディタ内のオートコンプリート機能——より速くタイプする方法——を意味していた。チェルニーは、それが今や労働そのもののオペレーティングシステムとして機能しうることを実証した。

「もしあなたが既にエンジニアで…より多くの力を求めているなら、これを読みなさい」とジェフ・タンはXで要約した。

人間のアウトプットを5倍にするツールは既にここにある。それらに必要なのは、AIをアシスタントと考えるのをやめ、労働力として扱い始める意思だけだ。その精神的飛躍を最初に成し遂げるプログラマーは、より生産的になるだけではない。彼らはまったく別のゲームをプレイしているのだ——そして他の全員はまだタイプし続けていることだろう。

原文を表示

Credit: VentureBeat made with Midjourney

When the creator of the world's most advanced coding agent speaks, Silicon Valley doesn't just listen — it takes notes.

For the past week, the engineering community has been dissecting a thread on X from Boris Cherny, the creator and head of Claude Code at Anthropic. What began as a casual sharing of his personal terminal setup has spiraled into a viral manifesto on the future of software development, with industry insiders calling it a watershed moment for the startup.

"If you're not reading the Claude Code best practices straight from its creator, you're behind as a programmer," wrote Jeff Tang, a prominent voice in the developer community. Kyle McNease, another industry observer, went further, declaring that with Cherny's "game-changing updates," Anthropic is "on fire," potentially facing "their ChatGPT moment."

The excitement stems from a paradox: Cherny's workflow is surprisingly simple, yet it allows a single human to operate with the output capacity of a small engineering department. As one user noted on X after implementing Cherny's setup, the experience "feels more like Starcraft" than traditional coding — a shift from typing syntax to commanding autonomous units.

Here is an analysis of the workflow that is reshaping how software gets built, straight from the architect himself.

How running five AI agents at once turns coding into a real-time strategy game

The most striking revelation from Cherny's disclosure is that he does not code in a linear fashion. In the traditional "inner loop" of development, a programmer writes a function, tests it, and moves to the next. Cherny, however, acts as a fleet commander.

"I run 5 Claudes in parallel in my terminal," Cherny wrote. "I number my tabs 1-5, and use system notifications to know when a Claude needs input."

(credit: x.com/bcherny)

By utilizing iTerm2 system notifications, Cherny effectively manages five simultaneous work streams. While one agent runs a test suite, another refactors a legacy module, and a third drafts documentation. He also runs "5-10 Claudes on claude.ai" in his browser, using a "teleport" command to hand off sessions between the web and his local machine.

This validates the "do more with less" strategy articulated by Anthropic President Daniela Amodei earlier this week. While competitors like OpenAI pursue trillion-dollar infrastructure build-outs, Anthropic is proving that superior orchestration of existing models can yield exponential productivity gains.

The counterintuitive case for choosing the slowest, smartest model

In a surprising move for an industry obsessed with latency, Cherny revealed that he exclusively uses Anthropic's heaviest, slowest model: Opus 4.5.

"I use Opus 4.5 with thinking for everything," Cherny explained. "It's the best coding model I've ever used, and even though it's bigger & slower than Sonnet, since you have to steer it less and it's better at tool use, it is almost always faster than using a smaller model in the end."

For enterprise technology leaders, this is a critical insight. The bottleneck in modern AI development isn't the generation speed of the token; it is the human time spent correcting the AI's mistakes. Cherny's workflow suggests that paying the "compute tax" for a smarter model upfront eliminates the "correction tax" later.

One shared file turns every AI mistake into a permanent lesson

Cherny also detailed how his team solves the problem of AI amnesia. Standard large language models do not "remember" a company's specific coding style or architectural decisions from one session to the next.

To address this, Cherny's team maintains a single file named CLAUDE.md in their git repository. "Anytime we see Claude do something incorrectly we add it to the CLAUDE.md, so Claude knows not to do it next time," he wrote.

This practice transforms the codebase into a self-correcting organism. When a human developer reviews a pull request and spots an error, they don't just fix the code; they tag the AI to update its own instructions. "Every mistake becomes a rule," noted Aakash Gupta, a product leader analyzing the thread. The longer the team works together, the smarter the agent becomes.

Slash commands and subagents automate the most tedious parts of development

The "vanilla" workflow one observer praised is powered by rigorous automation of repetitive tasks. Cherny uses slash commands — custom shortcuts checked into the project's repository — to handle complex operations with a single keystroke.

He highlighted a command called /commit-push-pr, which he invokes dozens of times daily. Instead of manually typing git commands, writing a commit message, and opening a pull request, the agent handles the bureaucracy of version control autonomously.

Cherny also deploys subagents — specialized AI personas — to handle specific phases of the development lifecycle. He uses a code-simplifier to clean up architecture after the main work is done and a verify-app agent to run end-to-end tests before anything ships.

Why verification loops are the real unlock for AI-generated code

If there is a single reason Claude Code has reportedly hit $1 billion in annual recurring revenue so quickly, it is likely the verification loop. The AI is not just a text generator; it is a tester.

"Claude tests every single change I land to claude.ai/code using the Claude Chrome extension," Cherny wrote. "It opens a browser, tests the UI, and iterates until the code works and the UX feels good."

He argues that giving the AI a way to verify its own work — whether through browser automation, running bash commands, or executing test suites — improves the quality of the final result by "2-3x." The agent doesn't just write code; it proves the code works.

What Cherny's workflow signals about the future of software engineering

The reaction to Cherny's thread suggests a pivotal shift in how developers think about their craft. For years, "AI coding" meant an autocomplete function in a text editor — a faster way to type. Cherny has demonstrated that it can now function as an operating system for labor itself.

"Read this if you're already an engineer... and want more power," Jeff Tang summarized on X.

The tools to multiply human output by a factor of five are already here. They require only a willingness to stop thinking of AI as an assistant and start treating it as a workforce. The programmers who make that mental leap first won't just be more productive. They'll be playing an entirely different game — and everyone else will still be typing.

この記事をシェア

Simon Willison Blog重要度42026年7月5日 07:53

より優れたモデル、劣化したツール

Simon Willison Blog重要度42026年7月4日 03:51

Fable の判断力を活用する重要性について

The Zvi重要度42026年7月3日 22:12

Fable #6：王の帰還

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む