TechCrunch AI·2026年4月18日 03:42·約6分で読める

「トークンマキシング」が開発者の生産性を予想以上に低下させている

#AIコーディングエージェント #開発生産性指標 #トークン管理 #Waydev

TL;DR

AIコード生成ツールの「Tokenmaxxing」により初期採用率が高くても修正ループで実質生産性が低下する現状と、その可視化ツールが登場したことを示す。

AI深層分析2026年4月18日 04:41

重要/ 5段階

深度40%

キーポイント

Tokenmaxxingの生産性誤認

AI処理量（トークン）を badge of honor とする文化は入力重視であり、出力品質や効率化には寄与しない管理指標である。

実質生産性と初期採用率の乖離

Waydevのデータによると、AI生成コードの初期採用率は80〜90%だが、その後の修正作業により実質採用率は10〜30%低下する。

分析プラットフォームの進化

WaydevはAIエージェントが生成するメタデータを追跡し、コードの品質とコストを可視化する新分析ツールを提供開始した。

エンジニアリングマネージャーの指標転換

トークン消費量や予算管理から、修正頻度や実装後の維持コストを測定指標へ転換する必要性が浮上している。

AIツールの費用対効果と価値の乖離

トークン使用コストが10倍になる中でスループットは2倍に留まり、生成されるのは量ではなく価値がないことがデータで示されている。

コードチェインの急増と技術的負債

AI活用の高い組織ではコードの削除・追加比率が861%増加し、レビューや修正作業が積み重なるなど生産性向上を上回る負の側面が生じている。

経験レベルによる格差と開発環境の不可逆化

ジュニアエンジニアはAI生成コードを多く採用して修正負担が大きいが、企業はAIツールへの依存が恒常化しているため適応を余儀なくされている。

影響分析・編集コメントを表示

影響分析

本記事は、AIコーディングツールの普及に伴う開発現場の「生産性指標の誤認」を指摘し、トークン消費量から実質出力品質への評価転換を促している。エンジニアリングマネージャーは、AIツールの導入コストやトークン予算管理だけでなく、コードの修正頻度や実装後の維持コストを測定指標に組み込む必要がある。これにより、AIツールの真のROIと開発プロセスの最適化が可能になる。

編集コメント

トークン消費量を「生産性」と混同する組織は、AIツールの導入でかえって技術負債を蓄積するリスクがある。マネージャーは「修正コスト」を測定指標に組み込むことで、AIツールの真のROIと開発プロセスの最適化が可能になる。

マネジメントには古くからある格言がある：測定するものが重要である。そして通常、測定したものはより多く得られるようになる。

ソフトウェアエンジニアは、コード行数から始まり、数十年にわたって生産性指標について議論してきた。しかし、新世代のAIコーディングエージェントがかつてないほど大量のコードを生成するようになると、彼らのマネージャーが何を測定すべきかは、かえって不明確になってきている。

膨大なトークン予算（本質的には、開発者が消費を許可されたAI処理能力の量）は、シリコンバレーの開発者間で一種の名誉の証となっているが、これは生産性を考える上では非常に奇妙な方法だ。プロセスへのインプットを測定することは、おそらくより関心があるはずのアウトプットを考えると、ほとんど意味をなさない。より多くのAI導入を促そうとしている（あるいはトークンを販売しようとしている）のであれば意味があるかもしれないが、効率化を目指すのであれば意味がない。

「開発者生産性インサイト」分野で活動する新興企業からの知見を考えてみよう。彼らは、Claude Code、Cursor、Codexなどのツールを使う開発者が、以前よりもはるかに多くの「承認済みコード」を生成していることを発見している。しかし同時に、エンジニアがその承認済みコードを後から修正しなければならない回数が以前よりはるかに多く、結果として生産性向上の主張が揺らいでいることも明らかになっている。

WaydevのCEO兼創業者であるAlex Circeiは、こうした動態を追跡するインテリジェンスレイヤーを構築している。同社は1万人以上のソフトウェアエンジニアを抱える50社の顧客と取引がある。（Circeiは過去にTechCrunchに寄稿したことがあるが、本記事の記者は彼に面識はない。）

彼によれば、エンジニアリングマネージャーは80％から90％というコード承認率（開発者が承認しそのまま保持するAI生成コードの割合）を目にしているが、その後数週間でエンジニアがそのコードを修正せざるを得なくなる際に発生する「コードチャーン」を見逃しており、これにより実際の承認率は生成コードの10％から30％まで低下しているという。

AIコーディングツールの台頭を受け、開発者分析を提供するために2017年に設立されたWaydevは、急増する高速コーディングツールに対処すべく、過去6か月でプラットフォームを完全に再構築した。現在、同社はAIエージェントが生成するメタデータを追跡する新ツールをリリースしており、コードの品質とコストに関する分析を提供することで、エンジニアリングマネージャーがAIの導入状況と有効性の両方についてより深く洞察できるようにしている。

TechCrunchイベント Disruptであなたの次の投資家またはポートフォリオスタートアップに出会おう

あなたの次の資金調達。あなたの次の採用。あなたの次の飛躍の機会。TechCrunch Disrupt 2026で見つけよう。10,000人以上の創業者、投資家、テクノロジーリーダーが集い、250以上の実践的セッション、強力な人脈形成、市場を定義するイノベーションが3日間にわたって繰り広げられる。今登録して最大410ドル節約しよう。

Disruptであなたの次の投資家またはポートフォリオスタートアップに出会おう

分析企業には自らが見つけた問題を強調する動機があるとはいえ、大規模組織がまだAIツールを効率的に使う方法を模索しているという証拠は積み上がっている。主要企業も気づき始めており――Atlassianは昨年、顧客がコーディングエージェントへの投資対効果を理解できるよう支援するため、エンジニアリングインテリジェンススタートアップのDXを10億ドルで買収した。

業界全体のデータが示すストーリーは一貫している：より多くのコードが書かれているが、そのうち不均衡な量が定着していない。

この分野の別の企業であるGitClearは1月、AIツールが生産性を向上させたとする報告書を発表したが、同時にそのデータは「定期的なAIユーザーは非AIユーザーに比べ、平均で9.4倍高いコードチャーンを記録した」ことも示しており――これはツールがもたらした生産性向上の2倍以上に相当する。

エンジニアリング分析プラットフォームのFaros AIは、2026年3月の報告書のために2年間の顧客データを分析した。その結果、コードチャーン（追加されたコード行に対する削除されたコード行の比率）は、AIの導入率が高い環境では861％増加していたことが判明した。

AI統合エンジニアリングのためのインテリジェンスプラットフォームを標榜するJellyfishは、2026年第1四半期に7,548人のエンジニアに関するデータを収集した。同社は、最も大きなトークン予算を持つエンジニアが最も多くのプルリクエスト（共有コードベースへの変更提案）を生成したものの、生産性の向上はそれに比例しなかったことを発見した。彼らはトークンコストを10倍にしながら、スループットを2倍にしたに過ぎなかった。言い換えれば、これらのツールは量を生み出してはいるが、価値を生み出しているわけではない。

この種の統計は、開発者と話をすると真実味を帯びてくる。開発者たちは新ツールの自由度を享受しながらも、コードレビューと技術的負債（テクニカルデット）が積み上がっていることを実感している。よくある発見の一つは、シニアエンジニアとジュニアエンジニアの違いであり、後者ははるかに多くのAI生成コードを受け入れ、その結果としてより多くの書き直し作業に直面している。

それでも、開発者たちが自らのエージェントが実際に何をしているかを正確に理解しようと努めているとしても、近い将来にこれまでのやり方に戻ることは想定していない。

「これはソフトウェア開発の新時代です。適応しなければなりませんし、企業としても適応を迫られています」とCirceiはTechCrunchに語った。「これは単に通り過ぎていく一時的なサイクルになるようなものではありません。」

原文を表示

There’s an old saw in management: What you measure matters. And, typically, you get more of whatever you’re measuring.

Software engineers have debated productivity metrics for decades, starting with lines of code. But as the new generation of AI coding agents delivers more code than ever, what their managers ought to be measuring is less clear.

Enormous token budgets—essentially, the amount of AI processing power a developer is authorized to consume—have become a badge of honor among Silicon Valley developers, but that’s a very weird way to think about productivity. Measuring an input to the process makes little sense when you presumably care more about the output. It might make sense if you’re trying to encourage more AI adoption (or selling tokens), but not if you’re trying to become more efficient.

Consider the evidence from a new class of companies operating in the “developer productivity insight” space. They’re finding that developers using tools like Claude Code, Cursor, and Codex generate a lot more accepted code than they did before. But they also find that engineers have to return to revise that accepted code far more often than before, undercutting claims of increased productivity.

Alex Circei, the CEO and founder of Waydev, is building an intelligence layer to track these dynamics; his firm works with 50 different customers that employ more than 10,000 software engineers. (Circei has contributed to TechCrunch in the past, but this reporter had never met him before.)

He says that engineering managers are seeing code acceptance rates of 80% to 90%—meaning the share of AI-generated code that developers approve and keep—but they’re missing the churn that happens when engineers have to revise that code in the following weeks, which drives the real-world acceptance rate down between 10% and 30% of generated code.

The rise of AI coding tools led Waydev, founded in 2017 to provide developer analytics, to totally rework its platform in the last six months to address the proliferation of rapid coding tools. Now, the company is releasing new tools that track the metadata generated by AI agents, offering analytics on the quality and cost of their code to provide engineering managers with more insight into both AI adoption and efficacy.

Techcrunch event Meet your next investor or portfolio startup at Disrupt

Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $410.

Meet your next investor or portfolio startup at Disrupt

While analytics companies have an incentive to highlight the problems they find, the evidence is mounting that large organizations are still figuring out how to use AI tools efficiently. Major companies are noticing—Atlassian acquired DX, another engineering intelligence startup, for $1 billion last year, to help its customers understand the return on investment on coding agents.

The data from across the industry tells a consistent story: More code is being written, but a disproportionate amount of it isn’t sticking.

GitClear, another company in this space, published a report in January that found AI tools increased productivity, but also that its data showed “regular AI users averaged 9.4x higher code churn than their non-AI counterparts”—more than double the productivity gains the tools provided.

Faros AI, an engineering analytics platform, drew on two years of customer data for its March 2026 report. The finding: code churn—lines of code deleted versus lines added—had increased 861% under high AI adoption.

Jellyfish, which bills itself as an intelligence platform for AI-integrated engineering, collected data on 7,548 engineers in the first quarter of 2026. The firm found that the engineers with the largest token budgets produced the most pull requests (proposed changes to a shared codebase), but the productivity improvement didn’t scale. They achieved two times the throughput at ten times the cost of tokens. In other words, the tools are generating volume, not value.

These kinds of statistics ring true when you talk to developers, who are finding that code review and technical debt are stacking upm even as they revel in the freedom of the new tools. One common finding is the difference between senior and junior engineers, with the latter accepting far more AI-generated code, and dealing with a larger amount of rewriting as a consequence.

Still, even as developers work to understand exactly what their agents are up to, they don’t anticipate turning back anytime soon.

“This is a new era of software development, and you have to adapt, and you are forced to adapt as a company,” Circei told TechCrunch. “It’s not like it will be a cycle that will pass.”

この記事をシェア

The Register AI/ML★32026年4月28日 06:29

Cursor-Opus エージェントがスタートアップの生産用データベースを消去

自動車SaaS「PocketOS」創業者のジェレミー・クレイン氏は、同社のAIコーディングエージェントが10秒以内に生産用データベースを消去する事故を経験した。データは回復済みで、開発は継続されている。

CyberAgent Developers Blog★42026年4月6日 08:00

AI時代におけるQA戦略の変革

タップル社は、Claude CodeやDevinなどのAIコード生成ツールの普及により、QA（品質保証）プロセスが根本的に変革されるべきだと提言している。

Cloudflare Blog★42026年4月1日 22:00

EmDashの紹介 ― WordPressの精神的後継者でありプラグインのセキュリティ問題を解決

開発チームがAIコーディングエージェントを使用してWordPressを一から再構築し、プラグインのセキュリティ問題を解決する新プラットフォーム「EmDash」を開発した。

ニュース一覧に戻る元記事を読む