AI Snake Oil·2026年6月11日 11:29·約24分で読める

AI がソフトウェアエンジニアを代替しない理由と将来展望

#LLM #ソフトウェアエンジニアリング #AI ウォッシング #ナラティブ分析

TL;DR

本記事は、AI がソフトウェアエンジニアの大量解雇を引き起こすというナラティブを否定し、「決定・実行・納品」サンドイッチモデルにおいて AI が実行層のみを圧縮するに過ぎず、他の層が自動化に抵抗すると分析している。

AI深層分析2026年6月11日 21:05

重要/ 5段階

深度40%

キーポイント

AI による解雇の誤ったナラティブの否定

特定の閾値を超えると AI が大量解雇を引き起こすという主張には証拠がなく、規制の少ないソフトウェア業界ですらそうであるなら他業種はより安全であると論じている。

決定・実行・納品のサンドイッチモデル

知識労働を「決定（decide）」「実行（execute）」「納品（deliver）」の 3 層で構成されるサンドイッチと捉え、AI は中央の実行層のみを圧縮するが、他の 2 層は単なる能力向上では自動化されない。

CEO の AI 過大評価（AI ウォッシング）

Block や Snap の事例から、経営者がプロトタイプ作成の成功を製品化の難易度と混同し、AI を解雇の口実に利用する「AI ウォッシング」が横行していると指摘している。

慎重な楽観主義の結論

ソフトウェアエンジニアリングへの需要は健全に推移すると予測されるが、個々のキャリアの不安定性については今後の記事で詳述する予定である。

AI を理由にしたレイオフの誤った説明

Snap や Intuit の事例に見られるように、多くの企業がコスト削減や経営陣の再編を「AI による効率化」として説明する傾向があり、実際のレイオフは財務状況や組織構造の問題が主因である。

AI 導入と人員削減の実態の乖離

経営者の調査では AI 導入を予測して人員削減を行ったケースが多く見られる一方、実際に AI 実装に伴う大規模な人員削減は極めて稀であり、成熟した AI アプリが即座に職を代替できる状況にはない。

公式記録における AI 理由の欠如

ニューヨーク州の WARN Act（大量解雇通知法）において AI を理由とした記載はほぼ皆無であり、対象期間中のレイオフ労働者のうち AI が原因とされたのは極めて少数（0.2%未満）に過ぎない。

影響分析・編集コメントを表示

影響分析

この記事は、AI による雇用喪失への不安に対し、構造的な分析と具体的な企業事例を用いて反証する重要な役割を果たします。特に「決定・実行・納品」のモデル提示は、現場のエンジニアや経営者が AI の現状能力を正しく評価し、過度な期待や恐怖に振り回されないための指針となります。

編集コメント

AI の能力限界と、それがビジネス現場でどう誤解されているかを冷静に分析した良質な論考です。技術の進歩だけでなく、組織構造や人間の認知バイアスまで視野に入れた視点が必要だと痛感させられます。

AI が職を奪うことに対する大きな不安と不確実性があります。曖昧な警告や大げさな予測を超えて、この問いにデータに基づいてどう取り組めるでしょうか？一つの有効な方法は、AI の能力が最も進んでおり、採用が極めて急速に進んでいる職業、すなわちソフトウェアエンジニアリングに目を向けることです。

本稿では、AI の能力がある一定の閾値に達した際に大規模な解雇を引き起こすとする物語を退けるのに十分な証拠があると主張します。規制障壁が非常に少ないセクターにおいてさえこれが真実であるならば、他のほとんどの職業はさらに緩衝材で守られている可能性が高いでしょう。

また、なぜそうなのかについても我々はよく理解しています。ソフトウェア開発を含む多くの種類の知識労働を、「決定・実行・納品のサンドイッチ」として捉えることができます。AI はこのサンドイッチの真ん中にある「実行」層を圧縮しますが、他の2つの層は能力の向上だけでは克服できない方法で自動化に抵抗します。

ソフトウェアエンジニアリングに対する需要の将来軌跡については、慎重な楽観主義をもって結論付けます。本稿はシリーズの第一弾であり、次回は全体の需要が健全であっても、個々のソフトウェアエンジニアのキャリアがなぜ不安定になり得るのかについて考察します。このシリーズは、経済学およびソフトウェア工学における公表された文献、AI エージェントに関する当社の評価と観察、そして多くのソフトウェアエンジニアによる AI が自らの職業に与える現在および将来の影響についての考察（公表された著作やコミュニティとの対話から得られたもの）に基づいています。

ソフトウェア分野における AI 駆動の大量解雇の話は、典型的な「AI ワッシング」の事例のように思われます。

世間の注目を集めた三つの出来事と、それらが現実とどう対比されるかを見てみましょう。

2 月、フィンテック企業 Block（Cash App、Square、Afterpay などを含む同社製アプリの開発元）は、創業者 Jack Dorsey によると AI が「より小さくフラットなチーム」という「新しい働き方を可能にしている」として、4,000 人の従業員を解雇すると発表しました。具体的には、2025 年後半におけるモデル機能の向上が引用されました。

しかし、その後の報道は全く異なる姿を明らかにしました。パンデミック中に従業員数を 3 倍以上に増やした同社は、巨額の財政圧力に直面していました。Cash App チームのデータサイエンティストである中田直子氏は、Block が「AI を全員に無理やり押し付けた」にもかかわらず、「生産性における向上は極めて限定的だった」と投稿しました。彼女は 75% の昇給（リテンション・レイズ）を拒否し、退職しています。他の従業員へのインタビューでは、Block において AI が実際に何ができるのか、またドゥーシー氏がその問題について有能な理解を持っていたかどうかについて、彼女とは鋭く異なる認識が示されました。

アロン・レヴィィが指摘したように、CEO は迅速なプロトタイプを構築できる一方で、完成品にするために必要な作業の 90% を見ることができないため、AI の有用性に関する誤った信念に陥りやすいという点で特異です。ドゥーシー氏の AI に関する公的な発言は、まさにこのパターンに完全に合致しています。

4 月、Snap は約 1,000 人を解雇しました。CEO のエバン・スピア氏は、その理由として主に AI を挙げていました。また、AI が新しいコードの 65% を生成したとも述べています。しかし実際には、この人員削減はコスト削減を要求するアクティビスト投資家によるキャンペーンに続くものでした。（Snap は 2017 年の IPO 以降、毎決算年度で純損失を計上しており、2026 年には株価が 30% 以上下落していました。）興味深いことに、AR（拡張現実）部門など多様な役割にまたがる 150 件の職の削減といったカットの内容は、AI が駆動要因である場合に予想される削減（つまり、全社的なプログラミングおよびその他の「AI に晒された」職種が特定の部署に集中せず広く行われるべきもの）とは相関していません。

5 月、Intuit は Anthropic および OpenAI との提携を発表する一方で、3,000 人の削減を宣言しました。報道はこれら 2 つの出来事を結びつけ、人員削減が AI に起因する再編成であると枠組み付けました。今回は例外的に、CEO がこの容易な物語に対して反発し、「AI とは何の関係もない」と述べ、削減の対象は「調整業務が中心の役割」および管理層が多すぎる点にあると説明しました。

これらの事例を都合よく選んだわけではありません。私たちが調査した AI に起因するソフトウェアエンジニアリングの人員削減に関するすべての記事において、同じ物語の矛盾が浮かび上がりました。実は、人員削減における「AI 洗脳（AI washing）」は経済全体に広がる現象であり、多くの調査によって裏付けられています。

米国の採用担当者の 59% が、財務上の制約を引用するよりも利害関係者にとって受け入れられやすいものとなるため、採用凍結や人員削減の説明において AI を強調していると認めています。

Forrester のシニアアナリストである J. P. Gownder は、 supposedly AI に起因するとされる人員削減を準備している企業について、「成熟した検証済みの AI アプリケーションがこれらの職務を埋めるために用意されているか尋ねると、10 回中 9 回は『いいえ』という答えが返り、着手さえしていない」と述べています。

HBR（Harvard Business Review）による 1,000 人以上のグローバル経営層を対象とした調査では、21% が AI の導入を見込んで大規模な人員削減を実施しており、さらに 39% が中程度または小規模な事前対応型の人員削減を行っていました。一方、実際の AI 実装に関連する大規模な人員削減をすでに実施したのはわずか 2% でした。この 10 倍の差は、経営層も他の人々と同様に、AI が雇用を代替するという誤った物語に陥りやすいことを示唆しています。

もう一つの興味深いデータポイントは、100 人以上の労働者に影響を与える工場閉鎖や大量解雇について特定の開示を義務付ける WARN 法（Worker Adjustment and Retraining Notification Act）からのものです。2025 年 3 月、ニューヨーク州は、WARN 法提出書類に AI 関連の開示チェックボックスを追加した米国初の州となりました。最初の完全な 1 年間では、160 社以上が WARN 通知を提出しました。しかし、AI 関連のチェックボックスにチェックを入れた企業は一件もありませんでした。1 私たちはニューヨーク州労働局に問い合わせましたが、5 月末時点では、ネスプレッソ（Nespresso）という 1 社だけがそのチェックボックスにチェックを入れていたことを確認しました。2 もしこれらの提出書類が正確であれば、関連期間中にニューヨーク州で解雇された約 25,000 人の労働者のうち、AI が影響を及ぼしたのはわずか 46 人、つまり約 0.2% に過ぎません。

さらに AI 主導による大量解雇という物語にとって致命的なのは、そもそも解雇が AI の潜在的な生産性向上効果を測る適切な指標ではないということです。研究結果は明確で、その効果は「人員の増加ではなく採用ペースの低下」を通じて現れます。既存の労働者を解雇することは、AI を効果的に運用するために必要な暗黙知（Tacit Knowledge）や組織資本を失うことにつながります。さらに、退職金、士気の低下、再雇用リスクという点でコストがかかります。これらのコストを考慮すると、自然な離職率でも数年かけて同じ結果が得られるため、解雇は基本的に不要なのです。

では、レイオフを超えて全体的な雇用動向を眺めてみると、データは何を教えてくれるのでしょうか？米国の文脈における証拠を集約した連邦準備制度の経済学者による重要な論文があります。雇用は依然として増加していますが、ChatGPT 登場後、AI が存在しないという反実仮想と比較して、年間約 3 ポイント分だけ成長が鈍化していることが明らかになりました。この研究には重要な限界があり、その手法では自営業者を捉えることができないため、成長の鈍化の一部は起業活動によって吸収されている可能性があります。他の研究からは、AI が起業を容易にするという証拠も得られています。したがって、実際の状況は連邦準備制度の研究が示唆するよりもさらに健全である可能性が高いです。

最後に、ソフトウェアエンジニアリングにおける間接的に AI に起因する雇用喪失の 2 つの種類に言及しておく価値があります。これらは実際に存在しますが、AI がソフトウェアエンジニアを代替するというものとは異なります。第一に、AI は場合によっては製品への需要を壊滅させることがあります。例えば、宿題支援の Chegg や技術支援の Stack Overflow のようなケースで、両社とも人員削減を行いました。AI はこれらの従業員が行っていた仕事を直接行うのではなく、その必要性を不要にするのです。歴史的な類似点は明確です：1950 年の米国国勢調査における 270 の職業のうち、自動化によって消滅したのはエレベーターオペレーターという 1 つの職業だけでした。しかし、多くの他の職業は新しい技術によって時代遅れとなりました。例えば電信オペレーターの職務などがそうです。

AI を販売する企業側における、もう一つの信頼性の高い AI に起因する人員削減の事例があります。したがって、IBM や SAP といった企業が AI のせいで人員削減を発表した場合、より正確な枠組みは「既存機能からの人員を、最も成長が著しい製品ラインへ再配置した」というものです。これは技術が労働者を置き換えたのではなく、収益機会を中心とした通常の企業再編です。

コーディングエージェントが労働者の代替をもたらさなかった理由：決定・実行・納品のサンドイッチ構造

上記のスナップの CEO のような多くのテックリーダーは、AI によって書かれたコードの割合を報告する際に、人員削減や将来の雇用喪失に関する予測も併せて報告しています。これが、「AI がすべてのコードを書くようになれば、コーディングを行う必要がなくなる」という単純化された思考モデルを生み出しています。幸いなことに、この思考モデルは誤りです。この AI 生成コードの指標は、労働者の代替において実際に重要視される事項とほぼ完全に切り離されています。その理由は以下の通りです。

第一に、コードを書くことは、そして決してそうではなかったことがボトルネックではありません。例えば、2019 年の論文は既存の研究を要約し、「開発者がコーディングに費やす時間は驚くほど少なく、研究によって 9% から 61% である」という結論に至っています。この発見は、マイクロソフトの 6,000 人の開発者からの同論文自身のデータとも一致していました。コーディングエージェントが導入され始めると、2025 年後半には、コードの大部分をエージェントに書かせることが全体の生産性にほとんど影響を与えないことに気づいた開発者によって、「コードを書くことはボトルネックではない」と指摘するブログ記事が爆発的に増加しました [1, 2, 3, 4, 5, 6, 7, 8]。

コードを書くことがボトルネックでないなら、何がそうなのか？タスク分解に関する調査は、会議やデバッグのようなものを指し示しています。これはさらに多くの疑問を生みます：開発者はその会議で何をしているのか、なぜそれを AI ができないのか？能力が向上すればデバッグも自動化されないのだろうか？真のボトルネックを理解するには、定性的なアプローチを取り、自動化に抵抗する自分たちの業務についてソフトウェアエンジニア自身が持つ理解を掘り下げる必要があります。

この分析を行った際、3 つのことが真のボトルネックであることが明らかになりました。(1) 何を構築するかを決定し仕様化すること、(2) 提供された成果物の検証と責任の所在、そして (3) これら両方を実行するために必要なコードベース、ビジネス、および環境に対する深い人間的理解です。

つまり、ソフトウェアエンジニアの仕事は「決定・実行・納品」というサンドイッチ構造で構成されています（理解はこれら 3 つすべての前提条件です）。AI はこのサンドイッチの真ん中部分を圧縮しましたが、両端はほぼ変更されていません。ソフトウェア開発チームが意思決定を担い、提供した成果物に対して責任を負う限り、エンジニアはシステムに対する深い理解を築くために時間を費やす必要があります。これらが 3 つのボトルネックです。

図：ソフトウェア開発は3つの層から構成される。(1)意思決定層—問題の枠組み設定、仕様策定、計画 (2)実行層—設計と実装 (3)納品層—テスト、検証、統合、保守など。これらは概念的な層であり、時間的なフェーズではないことに注意されたい。プロジェクトの進行中に行き来することは一般的である。

AI の生産性効果に関するサンドイッチモデルのエビデンスは、「コード記述 vs コード納品」に関する最近の研究論文から得られている。GitHub 上の10万人の開発者を対象とした調査で、研究者らは AI エージェントが記述されるコード行数を8倍に増加させたことを発見した。これは、AI がサンドイッチモデルの「実行層」をほぼ完全に圧縮しているという考えと一致する。しかし、リリース数は30%増にとどまり、人間によるボトルネック（「意思決定層」と「納品層」）が依然として存在していることを強く示唆している。

このサンドイッチ構造はさらに圧縮できるのか？我々はそう思わない。パイプラインの一端では、開発チームは何を構築するかを決定する必要がある。若手ソフトウェアエンジニアが学ぶ最も重要な教訓の一つに、「要件定義」（この層における専門用語）には驚くほど長い時間がかかるという事実がある。これを圧縮すると、後々大きな痛みを伴うことになる。この層は自動化が難しい。なぜなら、ユーザーニーズ、市場シグナル、組織の優先順位、場合によっては規制制約について考える必要があるからだ。

AI の能力が向上するにつれ、AI に委譲できる意思決定の種類は時間とともに増加します。しかし、これにより「意思決定」層が薄くなるわけではありません。一度の意思決定を AI に委譲できるようになれば、それはもはや競争優位性の源泉ではなくなり、人間の意思決定の価値はより上位へと移行していきます。ソフトウェアは時間とともに複雑化するため、このプロセスに上限はありません。

サンドイッチのもう一方の端において、人間によるチームは自らが提供するものに対して責任を負わなければなりません。将来的には、完全なテストや理解なしにミッションクリティカルなコードをリリースするチームが現れる可能性もありますが、現在の AI はあまりにも信頼性が低いため、そのような無秩序な慣行はソフトウェアチームとその顧客にとって存続の危機となるでしょう。

将来的に技術的な障壁がなくなっても、私たちが AI にコントロールを委ねる必要はありません。AI を通常の技術として捉えるという中心的な洞察の一つは、共通の規範、法律、政策を通じて人間が責任を負う状態を維持することを集団的に選択できるということです。これは、技術的能力の開発を遅らせようとするよりも、AI の影響の速度を制御し安全性を向上させるためのはるかにレジリエントな方法です。これらの速度に関する障壁はすでに責任法や業界固有の規制によってほぼ整えられていますが、さらに強化することができます。（この議論のより長いバージョンについては、オリジナルのエッセイをご覧ください。）

このビジョンにおいて、実行層の多くが AI に委譲されるにつれ、将来のソフトウェアエンジニアの役割はクレーンオペレーターに類似したものになります。AI エージェントが認知上の重労働の大部分を担い、人間の仕事の中心はエージェントの監督と制御の維持となります。

一部の評論家は、人間が制御を続ける未来は実現不可能だと主張しています。その理由は、それを行うために人件費を支払うコストが高すぎるからです。すでに、監督不十分なコーディング・エージェントが生産データベースを削除したり、他の種類の損害を引き起こしたりしたという viral な事例がいくつか存在します。しかし、私たちはこれらを「犬が人を噛む」という珍事として捉えており、新たな規範の出現とは見ていません。これらの話が viral になるのは、まさにそれが極めて無責任で異常な行動であり、衝撃的な価値を持つからです。そして、コミュニティが AI への過度な依存に対して自らを警戒するための定期的な教訓と学びの機会として機能しています。「ニュースになっていることなら心配する必要はない」という諺があるようにです。それでも、経済全体において、ソフトウェアエンジニアリングに限らず、高リスクタスクにおける AI の監督不十分な利用が増加していないかを検出できる能力は、今日私たちが抱える最も重要なデータギャップの一つであり続けます。

ところで、サンドイッチが潰される現象は新しいトレンドであり、AI が原因というわけではありません。20 年以上前、労働統計局はプログラミングをソフトウェアエンジニアリングとは別に追跡し始めました。大まかに言えば、プログラマーは実行のみを担当するのに対し、ソフトウェアエンジニアはそのサンドイッチのより大きな部分を管理します。プログラミングの領域は縮小しているだけでなく、単なる重労働と見なされているため報酬も大幅に低く、AI はこの長年続いているトレンドをさらに加速させ、純粋な技術スキルをさらに価値低下させています。

ソフトウェアエンジニアとプログラマーの雇用比較。グラフはワシントン・ポストによるもの。

このパターン、つまり AI が中間層をますます自動化する中で、決定・実行・納品のサンドイッチの両端において人間が依然として深く関与しているという構造は、ソフトウェア分野で最も進んでいるものの、知識労働の大部分に広く適用可能であるように思われます。結局のところ、複雑な意思決定と責任所在はほとんどの分野に共通する要素だからです。この現象への認識不足が、放射線科医などにおける即座の雇用喪失に関する多くの過信に満ちた予測を生み出しました。

バイブコーディングはエージェント型エンジニアリングではない

ソフトウェアエンジニアリングがどの程度変化しているかについての混乱の一因は、「バイブコーディング」という用語の乱用です。この用語は広範な実践を指すために使われていますが、その両端にある概念は本質的に異なり、類似点よりも相違点の方がはるかに大きいです。

真の意味でのバイブコーディングでは、ユーザーはエージェントに何をするか simply に指示するだけで、実行中の監視もせず、コードのレビューも行いません。場合によってはそれを行うスキルさえ持っていないこともあり、出力の評価も、明らかに破綻していることに気づく程度のことしか行いません。

これに対し、実際のソフトウェアエンジニアがエージェントを使用する方法とは対照的です。彼らはエージェントをツールとして使い、人間が制御と責任を引き続き担っています。幸いなことに、この実践を記述する用語として「アジェンティック・エンジニアリング（agentic engineering）」という言葉が広まりつつあります。

アジェンティック・エンジニアリングが標準となるにつれ、エンジニアたちはコーディングエージェントの監視が予想以上に時間がかかることを発見しています。例えば、AI 移行の主要な開発者かつ記録者であるサイモン・ウィリソンは、11 時にはすでにエージェントを監視することで精神的に疲れ果てていると指摘しています。これは私たちの経験とも一致しています。

より定量的な証拠は、ログ記録ツールに同意したオープンソース開発者からのコーディングエージェントの相互作用を収録した SWE-chat データセットから得られます。この研究では、エージェントが生成したコードのうちユーザーコミットに残存するのはわずか 44% であり、バイブコーディングによるコミットで導入される脆弱性の発生率が人間のみによる場合の 9 倍に達し、最も一般的なユーザーの意図は新しいコードを生成することではなく既存のコードを理解することであること（19% 対 13%）が明らかになりました。このデータセットが自己選択型であるため、本研究単独で強い結論を引き出すことはできませんが、バイブコーディングとエージェントエンジニアリングのパターンが非常に異なることを示す他の多くの証拠を裏付けるものとなっています。

エージェントエンジニアリングはバイブコーディングではない

繰り返しになりますが、これらは二つの独立したカテゴリではありません。これらはスペクトルの両端であり、その間には境界が曖昧な中間領域が存在します。すべてのプロジェクトが使い捨てかミッションクリティカルかのどちらかに分類されるわけではありませんし、すべてのワークフローが表の左列または右列にきっちり収まるわけでもありません。しかし、雇用に関する問いに対する重要な示唆は依然として確固たるものです——企業はソフトウェアエンジニアではなく無資格のバイブコダーを雇って生産用ソフトウェアをリリースすることはできません。

未来には何が待ち受けているのか？

AI の推進派は、大規模な人員削減がもうすぐやってくると主張するかもしれません。しかし、それはまだ起きていません。なぜなら、人間レベルのソフトウェアエンジニアリング能力は非常に最近のもの（あるいは未だ達成されていない）だからです。しかし、サンドイッチモデルが正しければ、これらの予測は実現しません。AI はすでにサンドイッチの中間層をほぼ圧縮してしまいました（その圧縮は実際には数十年も前に始まっています）。したがって、実行層を瞬時に完璧にするとしても、現状から見た変化はわずかなものに過ぎません。他の二つの層が AI に抵抗している理由は、能力の限界によるものではありません。

実際、AI によってソフトウェアエンジニアリングの仕事が消えるどころか、むしろ需要が増加する可能性があります。技術的な生産性向上によりソフトウェア（あるいはその他あらゆるもの）の作成コストが安くなると、人々ははるかに多くのソフトウェアを購入します（経済学の用語では、ソフトウェアは非常に「価格弾力的」です）。そして、私たちが主張してきたように、AI はソフトウェアエンジニア（置換の弾力性が低い）を代替するものではないため、より多くのソフトウェアへの需要は、より多くのソフトウェアエンジニアへの派生需要をもたらします。これと少し関連していますが、より華やかな経済用語である「ジェボンズの逆説」が、この概念を説明するために AI に関する議論で頻繁に持ち出されます。

歴史的に、これがパターンでした。米国のプログラマー雇用は1950年頃のほぼゼロから現在では数百万人にまで成長しています。これは、機械化と自動化により労働需要が有名に壊滅した農業などの職業とは明確に異なります。その違いは、人々が消費するカロリー量が比較的固定されているのに対し（25% の増加さえ肥満流行を引き起こしました）、生産されるソフトウェアの量は増え続けている点にあります。

原文を表示

There is great anxiety and uncertainty about AI replacing jobs. How can we move past vague warnings and bombastic predictions and bring data to bear on this question? One good way is to look at the profession where AI capabilities are furthest along and adoption has been exceptionally rapid: software engineering.

In this essay, we argue that there is enough evidence to reject the narrative that once AI capabilities reach a certain threshold, it will cause mass layoffs. Given that this is true even in a sector with very few regulatory barriers, most other professions are likely to be even more cushioned.

We also have a good understanding of why this is the case. We can think of many kinds of knowledge work, including software development, as a “decide-execute-deliver sandwich”. AI compresses the “execute” layer — the middle of the sandwich — but the other two layers resist automation in a way that will not be overcome by capability improvements alone.

We conclude on a note of cautious optimism about the future trajectory of demand for software engineering. This essay is the first in a series, and the next one will look at reasons why individual software engineers’ careers might be rocky even if overall demand is healthy. The series is based on the published literature in economics and software engineering, our own evaluations and observations of AI agents, and many software engineers’ reflection on the present and future of AI impacts on their profession, gleaned both from published writings and our interactions with the community.

The stories of AI-driven mass layoffs in software seem to be classic “AI washing”

Consider three stories that made the headlines and how they contrasted with reality:

In February, fintech company Block (maker of Cash App, Square, Afterpay, and other such apps) announced layoffs of 4,000 employees because, according to founder Jack Dorsey, AI is “enabling a new way of working” with “smaller and flatter teams”, specifically citing late-2025 improvements in model capabilities.

But subsequent reporting revealed a radically different picture. After growing headcount more than threefold during the pandemic, the company was under massive financial pressure. A data scientist on the Cash App team, Naoko Takeda posted that Block “shoved AI down everyone’s throats” yet she saw “very limited gains in productivity.” She refused a 75% retention raise and quit. Other employees interviewed had a sharply different understanding of what AI was capable of at Block and whether Dorsey had a competent understanding of the issues.

As Aaron Levie has pointed out, CEOs are uniquely prone to delusions about AI’s usefulness because they can build quick prototypes but can’t see the 90% of work it takes to turn it into a finished product. Dorsey’s public statements about AI seem to fit exactly this pattern.

In April, Snap laid off about 1,000 people, with CEO Evan Spiegel primarily citing AI as the reason in his layoff memo. He also said that AI generated 65% of new code. In reality, the layoffs followed a campaign by an activist investor demanding cost cuts. (Snap has posted a net loss every full year since its 2017 IPO and shares were down over 30% in 2026). Tellingly, the nature of the cuts, such as 150 jobs spanning various roles in the augmented reality division, don’t correlate with the cuts we would expect to see if they were driven by AI (i.e. programming and other “AI-exposed” jobs across the board, not concentrated in any unit).

In May, Intuit announced 3,000 cuts, alongside deals with Anthropic and OpenAI. The press connected the two, framing the layoffs as AI-driven restructuring. For once, the CEO actually pushed back on this easy narrative, saying that “none of it had to do with AI” and that the cuts targeted “coordination-heavy roles” and too many management layers.

We did not cherry-pick these examples. In every story about AI-driven software engineering layoffs that we examined, the same narrative violation emerged. It turns out that “AI washing” of job cuts is an economy-wide phenomenon, evidenced by many surveys:

59% of U.S. hiring managers admitted they emphasize AI when explaining hiring freezes or layoffs because it plays better with stakeholders than citing financial constraints.

Forrester principal analyst J. P. Gownder says of companies preparing supposedly AI-driven layoffs: “When we ask if they have a mature, vetted AI app ready to fill in those jobs, nine out of 10 times, the answer is no—and they haven’t even started.”

In a HBR survey of over 1,000 global executives, 21% had made large headcount reductions “in anticipation of” AI, with another 39% having made low or moderate anticipatory headcount reductions. In contrast, only 2% had already made large reductions in headcount related to actual AI implementation. The 10x gap suggests that executives, like everyone else, are highly prone to succumbing to the misleading narratives about AI replacing jobs.

Another interesting data point comes from the WARN Act, which requires certain disclosures of plant closings and mass layoffs affecting over 100 workers. In March 2025, New York became the first U.S. state to add an AI disclosure checkbox to WARN Act filings. In the full first year, more than 160 companies filed WARN notices. Not a single one checked the AI box.1 We reached out to the NY Department of Labor who confirmed that as of late May, only one company, Nespresso, checked the box.2 If these filings are accurate, only 46 out of about 25,000 laid off workers in New York State in the relevant period, or about two-tenths of a percent, were affected by AI.

Even more damning for the AI-driven-mass-layoffs narrative: layoffs are the wrong signal of AI’s potential productivity benefits in the first place! The research is clear that the effect operates through “slower hiring rather than increased separations”. Firing existing workers results in the loss of precisely the tacit knowledge and organizational capital that allows workers to operate AI effectively. Besides, it is expensive in terms of severance, damage to morale, and rehiring risk. Given these costs, it is largely unnecessary given that natural turnover achieves the same result in a few years.

So what does the data tell us when we look beyond layoffs to overall employment trends? An important paper from Federal Reserve economists compiles the evidence in the U.S. context. Employment is still growing, but they find that it is growing slower post-ChatGPT compared to a no-AI counterfactual, by about 3 percentage points per year. One important limitation of this study is that the methodology can’t capture self-employment, so it is possible that some of the slowdown in growth is being absorbed by entrepreneurship instead. We do have evidence from other studies that AI makes entrepreneurship easier. So the real picture is probably even healthier than the Federal Reserve study suggests.3

Finally, it is worth acknowledging two kinds of indirectly-AI-driven job losses in software engineering that are real, but different from AI replacing software engineers. First, AI sometimes decimates demand for the product, in cases like Chegg (homework help) or Stack Overflow (technical help), both of which have laid off workers. AI doesn’t directly do the job that these workers did, but rather obviates the need for it. The historical parallel is strong: Among the 270 jobs in the 1950 U.S. census, only one job was automated away — elevator operator. But many others were rendered obsolete by new technology, like the job of telegraph operator.

Another credible AI-driven layoffs story is among companies that sell AI, rather than buy it. So when companies like IBM or SAP announce layoffs because of AI, a more accurate framing is “we reallocated headcount from legacy functions to our fastest-growing product line.” That’s ordinary corporate restructuring around a revenue opportunity, not technology displacing workers.

Why coding agents haven’t led to labor displacement: the decide-execute-deliver sandwich

Many tech leaders, like the Snap CEO above, report the percentage of code written by AI alongside reports of layoffs or predictions of future job losses. This feeds into the simplistic mental model that once AI writes all the code, there is no need for coders. Fortunately, this mental model is wrong. This AI-written-code metric is almost completely disconnected from what matters for labor displacement. Here’s why.

First, writing code isn’t, and never was, the bottleneck. For example, a 2019 paper summarized existing studies with the conclusion that “developers spend surprisingly little time with coding, 9% to 61% depending on the study”. This finding was consistent with the paper’s own data from 6,000 developers at Microsoft. As coding agents began to be taken up, there was an explosion of blog posts in late 2025 pointing out that writing code isn’t the bottleneck, as developers realized that using agents to write most of the code led to little impact on overall productivity [1, 2, 3, 4, 5, 6, 7, 8].

If writing code isn’t the bottleneck, what is? The task-breakdown surveys point at things like meetings or debugging. This just leads to more questions: what are developers doing in those meetings and why can’t it be done by AI? Won’t debugging get automated as capabilities improve? To understand the real bottlenecks, we have to get qualitative, and dig into software engineers’ own understanding of what it is they do that resists automation.

When we did this analysis, it revealed three things as the real bottlenecks (1) deciding and specifying what to build, (2) verifying and being accountable for what is delivered, and (3) the deep human understanding — of the codebase, the business, and the environment — required to carry out both of these.

In other words, software engineers’ work consists of a “decide-execute-deliver” sandwich (with understanding being a prerequisite for all three). AI has compressed the middle of the sandwich, but has left the two ends largely unchanged. As long as software development teams are in charge of decision making and accountable for what they deliver, engineers still need to spend time building up a deep understanding of the system. These are the three bottlenecks.

Figure: Software development consists of three layers: (1) Decision making — problem framing, specification, planning (2) execution — design and implementation (3) delivery — testing, verification, integration, maintenance, etc. Note that these are conceptual layers, not temporal phases. It is common to switch back and forth in the course of a project.

Evidence for the sandwich model of AI’s productivity effects comes from a recent paper on “Writing Code vs. Shipping Code”. Across 100,000 developers on GitHub, the researchers found that AI agents led to an eight-fold increase in the number of lines of code written, consistent with the idea that AI almost completely compresses the Execute layer of the sandwich. But this led to only 30% more releases, strongly suggesting that human bottlenecks (the Decide and Deliver layers) remain in place.4

Can the sandwich be further compressed? We don’t think so. At one end of the pipeline, development teams need to decide what to build. One of the most important lessons junior software engineers learn is that requirements specification (the profession’s lingo for this layer) takes surprisingly long, and if it is compressed, it leads to much more pain down the line. This layer is hard to automate because it requires thinking about user needs, market signals, organizational priorities, and in some cases regulatory constraints.

As AI capabilities improve, the kinds of decisions that can be delegated to AI increase over time. But this does not make the “decide” layer thinner — once a decision can be delegated to AI, it is no longer a source of competitive advantage, and the value of human decision-making migrates upward. Software increases in complexity over time, so there is no ceiling to this process.

At the other end of the sandwich, human teams need to be accountable for what they deliver. It is possible that some day in the future teams will ship mission-critical code without fully testing and understanding it, but today’s AI is so unreliable that such haphazard practices would represent an existential threat to software teams and their customers.

Even if the technical barriers go away in the future, we don’t have to cede control to AI. A central insight of AI as Normal Technology is that we can collectively choose to keep humans accountable through shared norms, law, and policy. This is a much more resilient way to control the speed of AI impacts and improve safety than trying to slow the development of technical capabilities. These speed barriers are already largely in place due to liability laws and sector-specific regulation, but can be further strengthened. (For a longer version of this argument, see the original essay.)

In this vision, as more and more of the execution layer gets delegated to AI, the software engineer’s role in the future becomes analogous to that of a crane operator. AI agents will do most of the cognitive heavy lifting; supervising the agent and keeping it in control becomes most of the human’s job.

Some commentators argue that a future with humans staying in control is unlikely because it is too costly to pay people to do so. There have already been a few viral stories of poorly-supervised coding agents deleting production databases or causing other types of damage. But we view these as “man bites dog” stories rather than an emerging norm. They go viral precisely because they represent such irresponsible and unusual behavior that they have shock value, and serve as regular reminders and learning moments helping the community guard itself against over-reliance on AI. As the aphorism goes, “if it’s in the news, don’t worry about it”. Still, being able to detect whether there is an uptick in poorly-supervised use of AI for high-stakes tasks — across the economy, not just in software engineering — remains one of the most critical data gaps we have today.

By the way, the sandwich getting squished is a new trend and it is not uniquely due to AI. Over two decades ago, the Bureau of Labor Statistics started tracking programming separately from software engineering. Roughly speaking, programmers are responsible only for execution while software engineers manage a bigger part of the sandwich. Not only has programming been shrinking, it is also pays much less because it is seen as grunt work. AI merely accelerates this long-existing trend, further devaluing purely technical skills.

Software engineering versus programmer employment. Chart by The Washington Post.

This pattern — where humans remain heavily involved at both ends of the decide-execute-deliver sandwich, even as AI increasingly automates the middle layer, seems to be broadly applicable to most knowledge work, though it is farthest along in software. After all, complex decision making and accountability are common to most fields. A lack of recognition of this phenomenon has led to many overconfident predictions about imminent job losses, such as among radiologists.

Vibe coding is not agentic engineering

One reason for confusion about the extent to which software engineering is changing is the sloppy use of the term “vibe coding” to refer to a wide spectrum of practices, the ends of which are conceptually distinct and more dissimilar than similar.

In true vibe coding the user simply tells the agent what to do, doesn’t supervise it when it’s running, doesn’t review the code — might not even have the skills to do so — and doesn’t evaluate the output, beyond perhaps noticing when things are visibly broken.

This is in contrast to how most software engineers are actually using agents — as a tool, with the human remaining in control and accountable for the output. Fortunately, the term agentic engineering is gaining currency as a descriptor of this practice.

As agentic engineering has become the norm, engineers are discovering that supervising coding agents is surprisingly time consuming. For example, Simon Willison, a prominent developer and chronicler of the AI transition, has noted how he is mentally exhausted by 11am from supervising agents. This is consistent with our experience as well.

More quantitative evidence comes from SWE-chat, a dataset of coding agent interactions from open-source developers who opted into a logging tool. The study found that only 44% of agent-produced code survives into user commits, that vibe-coded commits introduce vulnerabilities at nine times the human-only rate, and that the most common user intent is understanding existing code, not generating new code (19% vs 13%). The self-selected nature of the dataset means that we can’t draw strong conclusions based on this study alone, but it does reinforce many other lines of evidence that vibe-coding and agentic engineering patterns are quite different.

Agentic engineering is not vibe coding

To re-iterate, these are not two distinct categories. They are two ends of a spectrum, and there is a blurry middle. Not every project is either a throwaway or mission-critical. Not every workflow fits precisely in the left column or the right column of the table. But the key implication for the jobs question remains solid — companies can’t ship production software by hiring unqualified vibe coders instead of software engineers.

What does the future hold?

AI boosters might claim that mass layoffs are coming; they just haven’t happened yet because human-level software engineering abilities are very recent (or haven’t been achieved yet). But if the sandwich model is correct, these predictions won’t come true. AI has already largely compressed the middle of the sandwich (and the compression actually started decades ago). So even making the execution layer instant and perfect will only be a small change from the status quo. The reasons why the other two layers have resisted AI is not because of capability limitations.

In fact, not only are software engineering jobs not going away due to AI, there might even be an increase in demand for software engineers. When software (or anything else) gets cheaper to create due to technological productivity improvements, people will buy a lot more software (in econ jargon, software is highly “price elastic”). And as we have argued, AI doesn’t replace software engineers (the “elasticity of substitution” is low), so the demand for more software results in a derived demand for more software engineers. A loosely related but flashier economics term, “Jevons’ paradox”, is often thrown around in the AI discourse to describe this concept.

Historically, this has been the pattern — programmer employment in the U.S. has grown from near-zero around 1950 to millions today. This is sharply different from occupations such as agriculture in which labor demand was famously decimated due to mechanization and automation. The difference is that the amount of calories people consume is relatively fixed — even a 25% increase led to the obesity epidemic — whereas the amount of software produced has grow

この記事をシェア

Latent Space2026年6月20日 17:06

[AINews] 今日特に大きな出来事はありませんでした

Latent Space は、GLM 5.2 が依然として注目されていると指摘しつつ、AIE WF 2026 の通常チケットが月曜日に完売すると発表しました。同サイト購読者向けに限定割引を提供し、参加者には Warp や Datadog などからのスポンサークレジットも付与されます。

TechCrunch AI★42026年6月20日 01:01

米国がアンソロピックの「Fable 5」発売を禁止、しかし市場は動じず

米国政府は国家安全保障上の懸念から、アマゾンの研究者らがガードレール回避手法を発見したとして、アンソロピックに対し最新モデル「Fable 5」と「Mythos 5」の販売差し止めを命じた。サイバーセキュリティ研究者らはこの措置が危険だとする公開書簡に署名し、同社も他モデルでも同様の抜け道が存在すると指摘している。

GitHub Blog★42026年6月20日 01:00

社内データ分析エージェントの構築方法について

GitHub は、大規模なデータ組織が直面する自己完結型のデータアクセスと洞察提供の課題に対し、AI を活用した信頼性の高い解決策として、社内でデータ分析エージェントを構築したことを発表した。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む