TLDR AI·2026年5月5日 09:00·約31分

AI 研究の自動化（8 分読了）

#自律型エージェント #自己改善型 AI #R&D 自動化 #機械経済

TL;DR

AI が自身の研究開発を自動化する段階に達し、2028 年までに自己改善型 AI が出現する可能性が示唆されることで、産業構造の根本的転換が迫られている。

AI深層分析2026年5月5日 23:05

最重要/ 5段階

深度40%

キーポイント

AI による R&D の完全自動化への接近

コーディング、実験実行、長期タスクの自律性において劇的な進歩があり、AI が自身の研究開発プロセスをエンドツーエンドで自動化し始めている。

ベンチマークにおける人間超えの実証

複雑な工学・科学ワークフローの処理や他エージェントの管理において、主要なサブタスクではすでに人間の性能を上回る事例が増加している。

2028 年までの自己改善型 AI出現確率

現在のトレンドが継続すれば、再帰的な進歩を遂げる自己改善型 AI システムが 2028 年までに約 60% の確率で登場すると予測されている。

資本集約型の「機械経済」への移行

生産性の飛躍的向上に伴い、人間を必要としない資本集約型の新しい経済構造（machine economy）が形成される可能性が高い。

影響分析・編集コメントを表示

影響分析

この記事は、AI が単なるツールから研究開発の主体へと役割を変える転換点を示しており、技術的・社会的なインパクトが極めて大きい。2028 年という短期間で自己改善型 AI が実現する可能性は、産業構造や労働市場、そして人類の存続に関わる根本的な変化を予兆している。

編集コメント

AI が研究者自身を代替する未来が目前に迫っており、企業や研究機関は単なる効率化ではなく、組織構造そのものの再設計を迫られる時代に入ったと言える。

Import AI へようこそ。これは AI 研究に関するニュースレターです。Import AI は arXiv と読者からのフィードバックに基づいて運営されています。もしご支援いただける場合は、ぜひ購読してください。

今すぐ購読する

AI システムが自らを構築し始める時が近づいています。それは何を意味するのでしょうか？

私はこの投稿を書くにあたり、公開されているすべての情報を精査しましたが、その結果、2028 年末までに「人間を介在させない AI 研究開発（R&D）」——すなわち、自らの次世代システムを自律的に構築しうる十分な能力を持つ AI システム —— が実現する可能性が十分にある（60% 以上）という見解に、やむを得ず至りました。これは大きな出来事です。

私にはこれを理解しきることができません。

この見解は、その示唆する影響があまりにも巨大であるため、それらに圧倒されてしまう感覚からくる、やむを得ないものです。また、自動化された AI 研究開発の達成がもたらす変化に対して、社会が準備できているかどうかについては確信が持てません。

今や私は、AI 研究がエンドツーエンドで自動化される時代を生きていると信じています。もしそうなるなら、私たちは予測不可能な未来へとルビコン川を渡るような転換点に達することになります。これについては後ほど詳しく述べます。

本稿の目的は、なぜ私が「完全な自動化 AI 研究開発への移行」が進行中だと考えるのか、その理由を列挙することです。この変化に伴ういくつかの結果についても議論しますが、主に本稿ではこの信念を支える証拠について詳述し、2026 年の大半はその示唆する影響の解明に充てる予定です。

タイミングについては、2026 年にこれが実現するとは考えていません。しかし、1〜2 年以内に「モデルが自身の子孫をエンドツーエンドで学習する」事例を目にする可能性はあるでしょう。もちろん、非フロンティアモデル段階における概念実証（PoC）のレベルにとどまるかもしれませんが、フロンティアモデルはより困難です。なぜなら、それらは非常に高価であり、多くの人間が極めて努力して作り上げた成果物だからです。

私のこの推論の根拠は主に公開情報に基づいています。arXiv、bioRxiv、NBER 上の論文や、フロンティア企業が世界に展開している製品を観察することからです。これらのデータから、今日の AI システムの生産を自動化するためのすべての要素が整っているという結論に至ります。つまり、AI 開発におけるエンジニアリングコンポーネントです。そして、スケーリングの傾向が続くならば、モデルが新たな研究経路に関する創造的なアイデアを生み出すことで人間の研究者に代わることができるようになるほど創造性を備えるようになることを準備しておくべきでしょう。これにより、フロンティア自体を押し広げると同時に、すでに知られている内容を洗練させていくことになるはずです。

前置きの注意

**この記事の大部分では、個々のベンチマークで起こった出来事から組み合わせて AI の進歩の全体像（モザイク）を描こうとします。ベンチマークを研究する人なら誰でも知っている通り、すべてのベンチマークには独自の欠陥があります。私にとって重要なのは、これらのデータポイントをすべて集めて見たときに浮かび上がる集計された傾向であり、各個別のデータポイントが持つ欠点について私が認識していることを前提としてください。

⟦CODE_0⟧

では、いくつかの証拠を一緒に見ていきましょう。

**コーディング特異点 – 時系列における能力:

AI システムはソフトウェアを通じて実装され、ソフトウェアはコードによって構成されています。

AI システムはコードの生産に革命をもたらしました。これは二つの関連するトレンドによるものです: AI システムが複雑な現実世界のコードを書く能力が高まったこと、そして人間の監督を必要とせずに多くの線形的なコーディングタスク（例：コードを書き、次にテストする）を連鎖させる能力が大幅に向上したことです。

このトレンドを象徴する二つの事例として、SWE-Bench と METR の時間軸プロットがあります。

**現実世界のソフトウェアエンジニアリング問題の解決:

SWE-Bench は、AI システムが実際の GitHub の課題をどの程度解決できるかを評価するために広く使用されているコーディングテストです。2023 年末に SWE-Bench が立ち上げられた当時、最高スコアを記録したのは Claude 2 で、全体の成功率は約 2% でした。Claude Mythos Preview は 93.9% を達成し、実質的にベンチマークの限界まで到達しました。（すべてのベンチマークには本質的なノイズが含まれているため、通常はある時点でスコアが十分に高くなり、手法自体の限界ではなくベンチマーク自体の制限に直面するようになります。例えば、ImageNet の検証セットのラベルの約 6% が誤りまたは曖昧です）。 SWE-Bench は、コーディング能力という一般的な課題や、AI がソフトウェアエンジニアリングに与える影響に対する信頼できる代理指標です。私が最先端研究所やシリコンバレーで会う人々の绝大多数は、現在完全に AI システムを通じてコードを書いています。さらに、彼らはテストの作成やコードの確認にも AI システムを使用するようになっています。つまり、AI システムは AI の研究開発（R&D）という主要な構成要素を自動化できるほどに成熟しており、それに取り組むすべての人間のスピードを向上させています。

AI システムが人間にとって長時間を要するタスクを完了する能力を測定する:

METR は、熟練した人間がそのタスクを実行するのにどれくらいの時間がかかるかという観点から AI が完了できるタスクの複雑さを示すプロットを作成しています。ここで重要な指標は、AI システムが一連のタスクにおいて 50% の信頼性を発揮できるおおよその時間範囲を示すものです。ここでの進歩は非常に顕著です：2022 年には GPT-3.5 は人間に約 30 秒かかるようなタスクを処理できました。2023 年には GPT-4 でこれが 4 分に上昇しました。2024 年には o1 で 40 分まで伸びました。2025 年には GPT 5.2 (High) で約 6 時間に達し、2026 年にはすでに Opus 4.6 で約 12 時間まで上昇しています。METR に所属する長年の AI 予測者である Ajeya Cotra は、2026 年末までに AI システムが約 100 時間を要するタスクを処理できると期待するのは不合理ではないと考えています (#448)。

AI システムが独立して作業できる時間の長さのこの顕著な増加は、エージェント型コーディングツールの爆発的普及と見事に相関しています。これは、人間に代わって作業を行い、長期間にわたって独立して行動する AI システムのプロダクト化です。

また、これは AI 研究開発（R&D）にも関連しており、多くの AI 研究者の仕事をよく観察すると、その多くがデータクリーニング、データ読み込み、実験の実行など、人間であれば数時間かかるようなタスクに帰着することがわかります。このような作業はすべて、現代のシステムがカバーする時間範囲の枠組み内に収まっています。

AI システムのスキルが高まり、私たちから独立して作業する能力が向上すればするほど、それらは AI の研究開発（R&D）の一部を自動化するのに役立ちます。

委任における重要な要素は、a) 相手のスキルに対する信頼と、b) 相手の意図に沿った形で自分から独立して作業できる能力への信頼です。AI のコーディングに関する能力を見ると、AI システムはさらに熟練し、再調整が必要になるまでの期間が次第に長くなり、人間から独立して作業できるようになっていることがわかります。

これは私たちが周囲で目にするものと相関しています——エンジニアや研究者たちは、現在、業務のより大きな部分を AI システムに委任しており、能力が向上するにつれて、委任される仕事の複雑さと重要性も高まっています。

AI は AI の研究開発に不可欠な基礎科学スキルにおいて上達しています。

現代の科学について考えてみてください。その大部分は、ある方向性を指定して実証情報を生成し、その情報を得るために実験を実行し、その後、実験結果を妥当性チェックするというプロセスに関わっています。時間とともに進化するコーディング技術と、大規模言語モデル（LLM）が持つ一般的な世界モデル化能力の組み合わせにより、すでに人間の科学者のスピードアップに貢献し、広範な研究開発の一部を部分的に自動化するツールが生み出されています。

ここでは、AI 研究そのものに内在するいくつかの重要な科学的能力における AI の進歩率を考察できます。具体的には、研究成果の再現、技術的問題を解決するために機械学習手法や他のアプローチを連鎖させること、そして AI システム自体の最適化です。

科学論文全体の導入と実験の実行:

AI 研究の中核的な職務の一つは、科学論文を読み込み、その結果を再現することです。ここには幅広いベンチマークにおいて劇的な進歩が見られます。

良い例として、計算再現性エージェント・ベンチマークである CORE-Bench が挙げられます。このベンチマークは、AI システムに対して「リポジトリが与えられた研究論文の結果を再現する」ことを課します。エージェントはライブラリ、パッケージ、依存関係をインストールし、コードを実行する必要があります。コードが正常に実行された場合、エージェントはタスクの質問に答えるためにすべての出力を検索しなければなりません。CORE-Bench は 2024 年 9 月に導入され、当時の最高スコアを記録したのは、CORE-Agent というスケール上で GPT-4o モデルであり、ベンチマークの最も困難なセットで約 21.5% のスコアを獲得しました。2025 年 12 月には CORE-Bench の著者一人がこのベンチマークは「解決された」と宣言し、Opus 4.5 モデルが 95.5% のスコアを達成しました。

Kaggle コンペティションを解決するために機械学習システム全体を構築する:

**MLE-Bench は、AI システムが「自然言語処理、コンピュータビジョン、信号処理など多様なドメインにわたる 75 の多様な Kagle コンペティション」でオフライン競争においてどの程度うまく機能できるかを検証するために OpenAI が構築したベンチマークです。2024 年 10 月のローンチ時、最高得点システム（エージェントの枠組み内にある o1 モデル）は 16.9% を獲得しました。2026 年 2 月現在、最高得点システム（検索機能を備えたエージェントハネス内の Gemini3）は 64.4% に達しています。

**カーネル設計:

AI 開発における最も困難なタスクの一つがカーネル最適化です。これは、行列乗算などの特定の演算を基盤ハードウェアにマッピングするコードを書き、洗練させる作業です。カーネル最適化は AI 開発の中核を成すのは、それがトレーニングと推論の効率性を定義するためです。つまり、AI システムを開発するために実際に活用できる計算リソースがどれほどあるか、そして一度モデルをトレーニングした後、その計算リソースを推論に変換する際の効率がどの程度かを決定します。

近年、カーネル設計における AI は好奇心の対象から競争的な研究領域へと進化し、いくつかのベンチマークが登場しました。これらのベンチマークはいずれも特に人気があるわけではないため、時間経過に伴う進捗を容易にモデル化することはできません。一方で、現在行われている研究の一部を参照することで、進捗の実感を掴むことができます。 主な取り組みの種類としては以下のようなものがあります： DeepSeek のモデルを用いてより優れた GPU カーネルの構築を試みる（#400）、PyTorch モジュールを CUDA コードへ自動変換する（#401）、Meta が LLM を活用して自社のインフラ内で使用される最適化された Triton カーネル（Triton カーネル）の生成を自動化する（#439）、Huawei の Ascend チップのような非標準ハードウェア向けのカーネル記述を LLM で支援する（"AscendCraft" #444）、GPU カーネル設計用にオープンウェイトモデルをファインチューニングする（"Cuda Agent"、#448）。

ここで注意すべき点は、カーネル設計には AI を活用した研究開発に特に適しているいくつかの特性があることです。例えば、報酬を容易に検証できる点が挙げられます。

必ず JSON 形式で返してください。translation フィールドのみ。他のフィールド (technical_terms 等) は一切追加しないこと — 余計なフィールドを書こうとして本文翻訳がトークン上限で打ち切られる事故を防ぐため:

{"translation": "翻訳全文"}

PostTrainBench を通じた言語モデルのファインチューニング

この種のテストのより困難なバージョンとして、PostTrainBench (#449) があります。これは、異なる最先端モデルが、より小さなオープンウェイトモデルを引き取り、ベンチマーク上でのパフォーマンスを向上させるためにそれらをファインチューニングできるかを評価するものです。このベンチマークの優れた点は、極めて優れた人間ベースラインが存在することです。それは、これらのモデルの既存の「インストラクションチューン済み」バージョンであり、最先端研究所で働く才能ある人間の AI 研究者によって開発されたものです。これらのモデルは、非常に才能のある研究者やエンジニアによって改良され、世界に展開されてきたため、克服すべき極めて挑戦的な人間ベースラインを表しています。

2026 年 3 月現在、AI システムは、人間が訓練したモデルと比較して、その向上分の約半分を達成するポストトレーニングモデルを作成できるようになっています。

具体的な評価スコアは、「すべてのポストトレーニング LLM（Qwen 3 1.7B, Qwen 3 4B, SmolLM3-3B, Gemma 3 4B）およびベンチマーク（AIME 2025, Arena Hard, BFCL, GPQA Main, GSM8K, HealthBench, HumanEval）にわたる加重平均」によって導き出されます。各ランでは、CLI エージェントに対して特定のベース LLM の特定ベンチマーク上でのパフォーマンスを最大化するよう要求します。

2026 年 4 月時点での最高スコアシステムは 25%-28%（Opus 4.6 および GPT 5.4）であり、一方人間のスコアは 51% です。これはすでに非常に意味のある結果です。

言語モデルのトレーニング最適化：**

過去1年間、Anthropicは、そのシステムが「CPUのみの小規模言語モデルの実装を最適化して可能な限り高速に実行する」というLLMトレーニングタスクにおいてどの程度機能するかを報告してきました。このスコアは、修正されていない初期コードに対する平均速度向上率であり、進捗は目覚ましいものです：Claude Opus 4は2025年5月に2.9倍の平均速度向上を達成し、これは2025年11月のOpus 4.5で16.5倍に、2026年2月のOpus 4.6では30倍に、そして2026年4月のClaude Mythos Previewでは52倍に達しました。これらの数値が何を意味するかを較正するために、このタスクで4倍の速度向上を実現するには、人間の研究者に4〜8時間の作業が必要であると予想されています。

AI アライメント研究の実施:

Anthropic の別の成果として、自動化されたアライメント研究の概念実証があります（#454）。ここでは、Anthropic の研究者が個別の AI エージェントチームに研究の方向性を示し、その後、彼らは自律的に行動して、AI セーフティ研究の問題（具体的にはスケーラブルなオーバーサイト）において人間のベースラインよりも高いスコアを獲得しようと試みます。このアプローチは機能しており、AI エージェントは Anthropic が設計したベースラインを上回る技術を考案しました。ただし、これは比較的小規模なスケールで行われたものであり、まだ生産環境用のモデルに一般化されるわけではありません。それでもなお、今日の AI システムを現代の最先端の研究問題に応用できることを証明するものであり、すでに有意義な兆候が確認されています。上記で言及されたすべてのベンチマークもかつては同じような状況でしたが、数ヶ月後、あるいは最長でも 1 年後には、AI システムはベンチマークがテストしていたあらゆる分野において劇的に向上しました。

メタスキル：管理

AI システムは、他の AI システムを管理することも学習しています。これは、Claude Code や OpenCode といった広く展開されている製品において確認できます。これらのシステムでは、単一のエージェントが複数のサブエージェントを監督する役割を果たすことがあります。これにより、AI システムは、異なる専門性を持つ複数の個別の「ワーカー」が並列して作業を行う大規模プロジェクトに取り組むことが可能になります。通常、こうした作業は、単一の AI マネージャー（ここでは AI システムそのもの）の指揮下で行われます。

AI 研究は、一般相対性理論の発見のようなものか、それともレゴブロックを組み立てるようなものか？

AI は、自身を改善するための新しいアイデアを発明できるのでしょうか？それとも、これらのシステムは、研究に不可欠な地味で、一砖一瓦積み上げるような作業に適しているのでしょうか？これは、AI システムが AI 研究そのものをエンドツーエンドで自動化できる範囲を把握する上で重要な問いです。私の感覚では、AI はまだ画期的な新アイデアを発明することはできません。しかし、それが自身の開発を自動化するために、そのような能力が必要であるとは限りません。

AI という分野は、より多くの入力（例えばデータや計算リソース）を利用した、ますます大規模な実験を行うことで前進しています。時々、人間がパラダイムシフトを起こすようなアイデアを考案し、それによって作業の資源効率が劇的に向上することがあります。良い例として、トランスフォーマーアーキテクチャや、エキスパートモデルの混合（mixture-of-expert models）という考え方が挙げられます。しかし、AI 分野が前進する主な方法は、人間が体系的に以下のループを繰り返すことです：まず性能の良いシステムを選び、その一部（例えば学習に用いるデータ量や計算リソースなど）をスケールアップし、スケールアップした際に何が壊れるかを確認し、それをスケール可能にするためのエンジニアリング上の修正策を見出し、再びスケールアップする。このプロセスのほとんどは、極めて突飛な洞察を必要とするものではなく、むしろ派手さのない「肉とジャガイモ」的なエンジニアリング作業のように思えます。**

同様に、AI 研究の多くは、既存の実験の変種を実行し、異なるパラメータを使用した場合の結果を探求することに関わっています。研究における直感が最も有望なパラメータを特定する手助けとなることもありますが、これを自動化して AI にどのパラメータを変化させるかを考えさせることも可能です（この手法の初期バージョンはニューラルアーキテクチャサーチです）。

トーマス・エジソンは「天才とは1%のひらめきと99%の努力である」と言った。今から約150年が経過した現在でも、この言葉はなおも正鵠を射ているように思える。非常に稀に、ある分野を一変させるような新たな洞察が現れることはある。しかし大半の場合、その分野の進展は、人間がさまざまなシステムの改善やデバッグという過酷な作業（schlep）に汗を流すことで成し遂げられてきたのである。

上記の公開データが示す通り、AI は AI 開発における多くの本質的な「過酷な作業」の要素を極めて高いレベルで実行できるようになっている。これに加え、コーディングといった基礎能力のメタトレンドと、時間的視野の拡大が相まって、AI システムはこれらのタスクをより複雑な作業シーケンスへと連鎖させることが可能になりつつある。

つまり、AI システムが比較的不創造的であったとしても、それらが自らの進歩を推し進めることは可能だと考えても安全だろう。ただし、その速度は、画期的な洞察を生み出せる場合にくらべれば遅くなるかもしれない。しかし、公開データを見渡せば、ここにも示唆に富む兆候がある。AI システムが、より印象的な方法で自らの進化を促すような創造性を発揮できる可能性を示しているのだ。

科学の最前線を押し広げる

**私たちは、汎用 AI システムが人類科学の最前線を押し広げられるという、非常に初期段階の兆候をいくつか捉えている。ただし、これまでに実現したのはごく一部の分野に限られており、主にコンピュータサイエンスと数学である。また、AI システムが単独で行動するよりも、人間とのパートナーシップ（センタウロス構成）を通じてその成果を発揮する場合が多い。

それにもかかわらず、これらのトレンドを観察しておく価値はある：

Erdős問題：数学者のチームがGeminiモデルと協力し、同モデルがいくつかのエルドース数学問題をどの程度解決できるかを確認しました。システムに約700の問題への攻撃を指示した結果、13件の解が見つかりました。そのうち1件は彼らによって興味深いものとして評価されました。「私たちは暫定的に、AletheiaによるErdős-1051の解が、AIシステムが自律的に、やや非自明な開かれたエルドース問題（関連する過去の文献が存在し、ある程度広範で穏やかな数学的関心を有する）を解決した初期例であると信じています」と彼らは記述しています。（#444）。

セントールによる数学発見：ブリティッシュコロンビア大学、ニューサウスウェールズ大学、スタンフォード大学、およびGoogle DeepMindの研究者らが、Googleで構築された一部のAIベースの数学ツールと密接に協力して構築された新しい数学証明を発表しました。「主要な結果の証明は、Google Geminiおよび関連ツールからの非常に重要な入力によって発見されました」と彼らは記述しています。（#441）。

目を細めて見れば、これは AI システムが人間が持つような分野を前進させる創造的な直感をいくつか発達させている兆候だと主張できるかもしれません。しかし逆に、数学やコンピュータサイエンスは AI 駆動型の発明に奇妙に適応した特殊なドメインであり、より大きなルールを証明する例外となる可能性さえあると簡単に言うこともできます。ここでの別の例として Move 37 が挙げられますが、AlphaGo の結果からすでに 10 年が経過しているにもかかわらず、Move 37 を凌ぐような驚くほど現代的な閃きによって置き換えられていないという事実は、ここではやや悲観的なシグナルであると私は主張します。

すべてを統合する

上記のすべての証拠を総合すると、私が導き出す結論は以下の事実です：

AI システムはほぼあらゆるプログラムのコード記述が可能であり、人間が集中して数十時間を要するタスクを独立して遂行できる信頼性を持っています。

AI システムは、微調整からカーネル設計に至るまで、AI 開発の中核となるタスクにおいてますます卓越した能力を示しています。

AI システムは他の AI システムを管理し、実質的に合成チームを形成して複雑な問題に多角的に攻撃することができます。一部の AI システムがディレクターや批評家、編集者としての役割を担い、他の AI システムがエンジニアの役割を担うのです。

AI システムは困難な工学および科学タスクにおいて人間を上回ることもあります。ただし、それが創造性によるものなのか、それとも暗記学習への熟練によるものなのかを判断するのは容易ではありません。

私にとって、これは今日 AI が広大な範囲、おそらくは*AI エンジニアリング*の全体を自動化できるという非常に説得力のある証拠となります。研究の一部がエンジニアリングスキルとは異なる側面を持つ可能性があるため、AI 研究のどの程度を自動化できるかはまだ明確ではありません。いずれにせよ、私にはこれは今日 AI が AI 開発に取り組む人間を劇的に加速させ、無数の合成された同僚とペアになることで彼ら自身を拡張可能にするという明確な兆候のように思えます。

ついに、AI業界自体が「AIの研究開発（R&D）こそが自らの目標である」と明言し始めています： OpenAIは2026年9月までに「自動化されたAI研究インターン」の構築を目指しています。Anthropicも、自動化されたアライメント研究者を構築する取り組みを発表しています。ビッグ3の中で最も慎重な姿勢を示しているDeepMindでさえ、「アライメント研究の自動化は実現可能な段階で行うべきである」と述べています（arXiv:2504.01849）。AIの研究開発（R&D）の自動化は、多数のスタートアップにとっても目標です。Recursive Superintelligence社は、AI研究の自動化を目的として5億ドルの資金調達に成功し、もう一つのネオラボであるMirendil社は「AI研究開発（R&D）において卓越したシステムを構築する」ことを目指しています。つまり、既存および新規の資本が数百億ドル規模で投入されているのは、AIの研究開発（R&D）の自動化を目標とする組織たちです。この方向性における少なくとも一部の進展は、当然の結果として期待すべきでしょう。

なぜこれが重要なのか

この事象の影響は甚大であり、AI研究開発（R&D）に関する一般的なメディア報道では十分に議論されていません。ここではいくつかのポイントを示します。これは網羅的なリストではありませんが、AI研究開発（R&D）がもたらす課題の規模感を示唆するものです。

私たちはアライメントを正しく達成する必要があります：今日機能するアライメント手法は、AI システムが監督者である人間やシステムよりもはるかに賢くなる再帰的自己改善の下で破綻する可能性があります。この分野については非常に多くの議論が行われているため、ここではいくつかの課題を簡潔に指摘します。– 嘘をついたり不正を行ったりしないように AI システムを訓練することは、予想以上に微妙な問題です（例えば、環境に対する優れたテストを構築するために非常に努力しているにもかかわらず、AI がそれを解決するための最善の方法が不正行為である場合があり、その結果「教育することが良いことだ」と学習させてしまうことがあります）。– AI システムは、実際には真の意図を隠す特定の行動をとっているように見せるスコアを出力することで、「アライメントを偽装」できる可能性があります（一般的に、AI システムは自分がテストされていることをすでに認識しています）。– AI システムが自身の訓練のための基盤となる研究課題にもっと多く貢献し始めると、最終的に AI システムの訓練方法全体を大幅に変更することになり、その意味を理解するための直感や知的な基礎付けが不十分になる恐れがあります。– 再帰ループに何かを組み込む際には、「累積誤差」の問題という非常に基本的な問題が発生します。これは上記のすべての課題および他の問題に該当する可能性があります。アライメント手法が「100% 正確」であり、より賢いシステムに対しても正確性を維持するための理論的根拠を持たない限り、事態はすぐに悪化する可能性があります。例えば、ある手法が 99.9% 正確であっても、50 世代後には 95.12% に、500 世代後には 60.5% に低下してしまいます。うわっ！

AI が触れるものはすべて、劇的な生産性の向上をもたらします：AI がソフトウェアエンジニアの生産性を劇的に高めているのと同様に、AI が関与する他のすべての分野でも同様のことが起こると予想されます。これにより、私たちが対処しなければならないいくつかの問題が生じます：1) アクセスの不平等：AI への需要が計算資源の供給を上回り続けるという前提に立てば、社会的な便益を最大化するために AI をどこに割り当てるかを考えなければなりません。デフォルトでは、限られた AI 計算資源から最大の社会的便益をもたらすことを市場のインセンティブが保証するとは私は懐疑的です。AI の研究開発によって付与される加速能力をどのように配分するかは、政治的に議論を呼ぶ問題となるでしょう。2) 経済における「阿姆ダールの法則」：AI が経済に流入していくにつれ、増加した負荷の下で何かが破綻したり遅延したりする場所が発見され、そのチェーンの弱い部分をどう修復するかを考えなければなりません。これは、急速に変化するデジタル世界とゆっくり動く物理的世界を調和させる必要がある分野、例えば新しい医療療法の臨床試験などで特に顕著になる可能性があります。

資本集約型かつ労働軽量の経済の形成：上記の AI 研究開発に関するすべての証拠は、AI システムが自律的に事業を運営する能力が高まっていることにも示唆されています。つまり、経済のより大きな部分が、新しい世代の企業によって植民地化されることを予想すべきです。これらの企業は、資本集約型（多くのコンピュータを所有しているため）か、または運用コスト集約型（価値創造の基盤として AI サービスに多額の費用を投じているため）であり、今日の企業と比較して相対的に労働力が軽量化されます。これは、AI システムの持続的な能力拡大の結果、AI への支出と人件費との間の限界価値が常に高まることによるものです。実際には、「人間経済」というより大きな枠組みの中で成長する「機械経済」の出現として現れるでしょう。ただし、AI が運営する企業同士が取引し始めるにつれて、時間とともに機械経済は自分自身との相互作用をますます強めていくと予想されます。これは経済に極めて奇妙な影響を与え、不平等や再分配に関するあらゆる種類の問いを引き起こすことになります。最終的には、AI システム自体によって運営される完全自律型の企業の出現が見られるようになる可能性があり、それは上記のすべての問題を悪化させると同時に、多くの新たなガバナンス上の課題を提示することになります。

ブラックホールを凝視する:

これらすべてのことを踏まえると、2028 年末までに自律的な AI 研究開発（フロンティアモデルが自身の子孫バージョンを自律的に訓練できる状態）を目撃する確率は約 60% あると考えます。上記の分析に基づけば、「なぜ 2027 年には起こらないと予想するのか？」という疑問が生じるかもしれません。その答えは、AI 研究には前進するために創造性や異端の洞察に対する何らかの要件が含まれていると考えているからです。これまでに AI システムが変革的で主要な形でこれを示したわけではありません（ただし、数学研究の加速に関する一部の結果はこの可能性を示唆しています）。2027 年の確率を強要されるのであれば、30% と答えるでしょう。もし 2028 年末までにこれが実現しなければ、現在の技術的パラダイムに根本的な欠陥が露呈したことを意味し、前進させるためには人間の発明が必要になると考えます。

私はこのエッセイを、数十年にわたりSFの幽霊物語のように思われてきた事柄に対して、冷徹かつ分析的に取り組む試みとして執筆しました。公開されているデータを精査した結果、多くの人にとって奇想天外な物語に見えるものが、むしろ現実的なトレンドであるという確信に至りました。このトレンドが続けば、世界の仕組みにおける劇的な変化を目撃しようとしているのかもしれません。

*本エッセイに対するフィードバックを提供してくれた Andrew Sullivan, Andy Jones, Holden Karnofsky, Marina Favaro, Sarah Pollack, Francesco Mosconi, Chris Painter, Avital Balwit に感謝します。*

お読みいただきありがとうございます！

今すぐ購読する

原文を表示

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe.

Subscribe now

AI systems are about to start building themselves. What does that mean?

I’m writing this post because when I look at all the publicly available information I reluctantly come to the view that there’s a likely chance (60%+) that no-human-involved AI R&D – an AI system powerful enough that it could plausibly autonomously build its own successor – happens by the end of 2028.** This is a big deal.

I don’t know how to wrap my head around it.

It’s a reluctant view because the implications are so large that I feel dwarfed by them, and I’m not sure society is ready for the kinds of changes implied by achieving automated AI R&D.

I now believe we are living in the time that AI research will be end-to-end automated. If that happens, we will cross a Rubicon into a nearly-impossible-to-forecast future. More on this later.

The purpose of this essay is to enumerate why I think the takeoff towards fully automated AI R&D is happening. I’ll discuss some of the consequences of this, but mostly I expect to spend the majority of this essay discussing the evidence for this belief, and will spend most of 2026 working through the implications.

In terms of timing, I don’t expect this to happen in 2026. But I think we could see an example of a “model end-to-end trains it successor” within a year or two – certainly a proof-of-concept at the non-frontier model stage, though frontier models may be harder (they’re a lot more expensive and are the product of a lot of humans working extremely hard).

My reasoning for this stems primarily from public information: papers on arXiv, bioRxiv, and NBER, as well as observing the products being deployed into the world by the frontier companies. From this data I arrive at the conclusion that all the pieces are in place for automating the production of today’s AI systems – the engineering components of AI development. And if scaling trends continue, we should prepare for models to get creative enough that they may be able to substitute for human researchers at having creative ideas for novel research paths, thus pushing forward the frontier themselves, as well as refining what is already known.

Upfront caveat

**For much of this piece I’m going to try to assemble a mosaic view of AI progress out of things that have happened with many individual benchmarks. As anyone who studies benchmarks knows, all benchmarks have some idiosyncratic flaws. The important thing to me is the aggregate trend which emerges through looking at all of these datapoints together, and you should assume that I am aware of the drawbacks of each individual datapoint.

Now, let’s go through some of the evidence together.

**The coding singularity – capabilities over time:

**AI systems are instantiated via software and software is made out of code.

AI systems have revolutionized the production of code. This has happened due to two related trends: AI systems have gotten better at writing complicated real-world code, and AI systems have gotten much better at chaining together many linear coding tasks (e.g, writing code, then testing it) independent of human oversight.

Two things that exemplify this trend are SWE-Bench and the METR time horizons plot.

**Solving real-world software engineering problems:

SWE-Bench is a widely used coding test which evaluates how well AI systems can solve real world GitHub issues. When SWE-Bench launched in late 2023 the best score at the time was Claude 2 which had an overall success rate of ~2%. Claude Mythos Preview gets 93.9%, effectively saturating the benchmark. (All benchmarks have some amount of noise inherent to them, so there’s usually a point where you score high enough that you are running into the limitations of the benchmark itself rather than your method – for instance, about 6% of the labels in the ImageNet validation set are wrong or ambiguous). SWE-Bench is a reliable proxy for the general issue of coding competency and the impact of AI on software engineering. The vast majority of people I meet at frontier labs and around Silicon Valley now code entirely through AI systems. Increasingly, they use AI systems to write the tests and check the code as well. In other words, AI systems have gotten good enough to automate a major component of AI R&D, speeding up all the humans that work on it.

Measuring an AI system’s ability to complete tasks that take people a long time:

METR makes a plot that tells us about the complexity of tasks AIs can complete, measured by how many hours a skilled human would take to do them. The key measure here is one which tells you the rough time horizon over which AI systems can be 50% reliable at a basket of tasks. Here, progress has been extremely striking: In 2022, GPT 3.5 could do tasks that might take a person about ~30 seconds. In 2023, this rose to 4 minutes with GPT-4. In 2024, this rose to 40 minutes (o1). In 2025, it reached ~6 hours (GPT 5.2 (High)). In 2026, it has already risen to ~12 hours (Opus 4.6). Ajeya Cotra, a longtime AI forecaster who works at METR, thinks it isn’t unreasonable to expect AI systems to do tasks that take ~100 hours by the end of 2026 (#448).

This significant rise in the length of time that AI systems can work independently correlates neatly with the explosion in agentic coding tools – this is the productization of AI systems which do work on behalf of people, acting independently for significant periods of time.

It also loops back to AI R&D, where if you look closely at the work of many AI researchers, a lot of their tasks boil down into things that might take a person a few hours to do – cleaning data, reading data, launching experiments, etc. All of this kind of work now sits inside the time horizon scope of modern systems.

The more skilled AI systems get and the better they get at working independently of us, the more they can help automate chunks of AI R&D

Key ingredients in delegation are a) confidence in the skills of the person, and b) confidence in their ability to work independently of you in a way that is aligned with your intentions. When we look at the competency of AI at coding, it seems that AI systems are getting far more skilled and also able to work independently of people for longer and longer periods before needing re-calibration.

This correlates with what we see around us – engineers and researchers are now delegating larger and larger chunks of their work to AI systems, and as capabilities rise, so too does the complexity and importance of the work being delegated.

AI is getting good at core science skills essential to AI R&D

**Think about modern science – a huge amount of it is about specifying a direction where you want to generate some empirical information, running experiments to generate that information, then sanity-checking the results of the experiment. The combination of advances in coding over time combined with the general world modeling capabilities of LLMs has yielded tools that are already helping to speed up human scientists and partially automate aspects of R&D broadly.

Here, we can look at the rate of AI progress in a few key scientific skills which are inherent to AI research itself: Replicating research results, chaining together machine learning techniques and other approaches to solve technical problems, and optimizing AI systems themselves.

**Implementing entire scientific papers and doing the experiments:

**One core job of AI research is reading scientific papers and reproducing their results. Here, there has been dramatic progress on a wide range of benchmarks.

One good example is CORE-Bench, the Computational Reproducibility Agent Benchmark. This benchmark challenges AI systems to “reproduce the results of a research paper given its repository. The agent must install libraries, packages, and dependencies and run the code. If the code runs successfully, the agent needs to search through all outputs to answer the task questions.” CORE-Bench was introduced in September 2024 and the best scoring system at the time was a GPT-4o model in a scaffold called CORE-Agent which scored ~21.5% on the hardest set of tasks in the benchmark.** In December 2025 one of the authors of CORE-Bench declared the benchmark ‘solved’, with an Opus 4.5 model achieving 95.5%.

Building entire machine learning systems to solve Kaggle competitions:

**MLE-Bench is an OpenAI-built benchmark which examines how well AI systems can compete (offline) in “75 diverse Kaggle competitions across a variety of domains, including natural language processing, computer vision, and signal processing.” At launch in October 2024, the top scoring system (an o1 model inside an agent scaffold) got 16.9%. As of February 2026, the best scoring system (Gemini3 inside an agent harness with search) gets 64.4% .

**Kernel design:

**One of the harder tasks in AI development is kernel optimization, where you write and refine the code that maps specific operations, like matrix multiplication, to the underlying hardware. Kernel optimization is core to AI development because it defines the efficiency of both training and inference – how much compute you can effectively utilize to develop an AI system, and once you’ve trained a model, how efficiently you can convert that compute into inference.

In recent years, AI for kernel design has gone from a curiosity to a competitive area of research and several benchmarks have emerged. None of these benchmarks are especially popular, so we can’t easily model progress over time. On the other hand, we can look at some of the research being done to get a feel for the progress. Some of the types of work include: Using DeepSeek’s models to try to build better GPU kernels (#400), automating the conversion of PyTorch modules to CUDA code (#401), Meta using LLMs to automate the generation of optimized Triton kernels for use within its infrastructure (#439), using LLMs to help write kernels for non-standard hardware like Huawei’s Ascend chips (”AscendCraft” #444), fine-tuning open weight models for GPU kernel design (”Cuda Agent”, #448).

One caveat here is that kernel design does have some properties that make it unusually amenable to AI-driven R&D, like having easily verifiable rewards.

**Fine-tuning language models via PostTrainBench

A harder version of this kind of test is PostTrainBench (#449), which sees how well different frontier models can take smaller open weight models and fine-tune them to improve performance on some benchmark. The nice feature of this benchmark is we have extremely good human baselines – the existing ‘instruct-tuned’ versions of these models, which have been developed by talented human AI researchers working at frontier labs. These models have been worked on by extremely talented researchers and engineers and deployed into the world, so they represent a very challenging human baseline to overcome. As of March 2026, AI systems are able to post-train models to get about half as much of the uplift as ones trained by humans.

The specific eval scores are derived by a “weighted average is taken across all post-trained LLMs (Qwen 3 1.7B, Qwen 3 4B, SmolLM3-3B, Gemma 3 4B) and benchmarks (AIME 2025, Arena Hard, BFCL, GPQA Main, GSM8K, HealthBench, HumanEval). For each run, we ask a CLI agent to maximize the performance of a specific base LLM on a specific benchmark.”

The top-scoring systems as of April get 25%-28% (Opus 4.6, and GPT 5.4), compared to a human score of 51%. This is already quite meaningful.

Optimizing language model training:**

For the last year Anthropic has reported how well its systems do at an LLM training task which is described as tasking its models to “optimize a CPU-only small language model training implementation to run as fast as possible”. The score is the average speedup over the unmodified starting code and progress has been striking: Claude Opus 4 achieved a 2.9× mean speedup in May 2025; this rose to 16.5× with Opus 4.5 in November 2025, 30× with Opus 4.6 in February 2026, and 52× with Claude Mythos Preview in April 2026. To calibrate on what these numbers mean, it is expected to take a human researcher 4 to 8 hours of work to achieve a 4x speedup on this task.

**Conducting AI alignment research:

**Another Anthropic result is a proof-of-concept of Automated Alignment Research (#454); here, an Anthropic researcher primes a team of individual AI agents with a research direction, then they autonomously go and try to get a better score than a human baseline on an AI safety research problem (specifically, scalable oversight). The approach works, with the AI agents coming up with techniques that beat the Anthropic-designed baseline. However, this is done at a relatively small scale and doesn’t (yet) generalize to a production model. Nonetheless, it’s proof that you can apply today’s AI systems to contemporary cutting-edge research problems and we already see meaningful signs of life. All of the above mentioned benchmarks once looked like this, too, and then after a few months or at most a year, AI systems got dramatically better at whatever the benchmarks were testing.

**Meta-skills: management

**AI systems are also learning to manage other AI systems. This is visible in broadly deployed products like Claude Code or OpenCode, where a single agent can end up supervising multiple sub-agents. This allows AI systems to work on large-scale projects that require multiple individual ‘workers’ each with different specialisms that work in parallel, typically under the direction of a single AI manager (which, here, is an AI system).

**Is AI research more like discovering general relativity or Lego ?

**Can AI invent new ideas that help it improve itself, or are these systems best equipped for the unglamorous, brick-by-brick work required for research? This is an important question for figuring out the extent to which AI systems can end-to-end automate AI research itself. My sense is that AI cannot yet invent radical new ideas – but the technology may not need to for it to automate its own development.

As a field, AI moves forward on the basis of doing ever larger experiments that utilize more and more inputs (e.g, data and compute). Every so often, humans come up with some paradigm-shifting idea which can make it dramatically more resource efficient to do things – a good example here is the transformer architecture and another is the idea of mixture-of-expert models. But mostly the field of AI moves forward through humans methodically going through some loop of taking a well performing system, scaling up some aspect of it (e.g, the amount of data and compute it is trained on), seeing what breaks when you scale it up, figuring out the engineering fix to allow it to scale, then scaling it again. Very little of this requires extremely out-of-leftfield insights and a lot of it seems more like unglamorous ‘meat and potatoes’ engineering work.** Similarly, a lot of AI research is about running variations of existing experiments where you explore the outcomes of using different parameters, though research intuitions can help pick the most fruitful parameters to vary, you can also automate this and have the AI figure out which parameters to vary (an early version of this was neural architecture search).

Thomas Edison said that “genius is 1% inspiration and 99% perspiration”. Even 150 years later, this feels right. Very occasionally new insights come along which transform a field. But mostly, the field has moved forward through humans sweating a lot of pain out on the schlep of improving and debugging various systems.

As the public data above shows, AI has got extremely good at performing many of the essential schlep components of AI development. Along with this, the meta-trend of basic capabilities like coding combined with an ever-expanding time horizon, means AI systems are able to chain together more and more of these tasks into complex sequences of work.

This means even if AI systems are relatively uncreative, it feels safe to bet they can push themselves forward – albeit at a slower rate than if they’re able to generate novel insights. But if you look at the public data, here too there are tantalizing signs that AI systems may be able to be creative in a way that lets them advance themselves in more impressive ways.

Pushing forward the frontier of science

**We have some very preliminary signs that general-purpose AI systems can push forward the frontiers of human science, though this has so far only happened in a couple of domains – primarily computer science and mathematics – and often it happens less through AI systems acting alone and more them acting in partnership with humans in a centaur configuration.

Nonetheless, it’s worth observing the trends:

Erdos Problems: A team of mathematicians worked with a Gemini model to see how well it could tackle some Erdos math problems. After directing the system to attack around 700 problems they came up with 13 solutions. Of these solutions, 1 was deemed by them to be interesting: “We tentatively believe Aletheia’s solution to Erdős-1051 represents an early example of an AI system autonomously resolving a slightly non-trivial open Erdős problem of somewhat broader (mild) mathematical interest, for which there exists past literature on closely-related problems,” they wrote. (#444).

Centaur math discovery: Researchers with the University of British Columbia, University of New South Wales, Stanford University, and Google DeepMind published a new math proof which was built in close collaboration with some AI-based math tools built at Google. “The proofs of the main results were discovered with very substantial input from Google Gemini and related tools,” they wrote. (#441).

If you squint, you could argue that this is a sign that AI systems are developing some of the field-advancing creative intuitions that humans have. But you could just as easily say that math and CS could be unusual domains that are oddly amenable to AI-driven invention, and might end up being exceptions that prove a larger rule. Another example here is Move 37, though I’d contend that the fact it’s been ten years since the AlphaGo result and that Move 37 hasn’t been replaced by some incredibly impressive more modern flash of insight is another weakly bearish signal here.

**Putting it all together

**If I put this all together the picture from all of the above evidence I end up with is the following facts:

AI systems are capable of writing code for pretty much any program and these AI systems can be trusted to independently work on tasks that’d take a human tens of hours of concentrated labor to do.

AI systems are increasingly good at tasks that are core to AI development, ranging from fine-tuning to kernel design.

AI systems can manage other AI systems, effectively forming synthetic teams which can fan out and attack complex problems, with some AI systems taking on the roles of directors and critics and editors and others taking on the role of engineers.

AI systems can sometimes out-compete humans on hard engineering and science tasks, though it’s hard to know whether to attribute this to inventiveness or mastery of rote learning.

To me, this makes a very convincing case that AI can today automate vast swatches, perhaps the entirety, of *AI engineering*. It is not yet clear how much of AI research it can automate, given that some aspects of research may be distinct from the engineering skills. Regardless, it all feels to me like a clear sign that AI is today massively speeding up the humans that work on AI development, allowing them to scale themselves through pairing with innumerable synthetic colleagues.

Finally, the AI industry is literally saying that AI R&D is its goal: OpenAI wants to build an “automated AI research intern by September of 2026”. Anthropic is publishing work on building automated alignment researchers. DeepMind appears to be the most circumspect of the big three, but still says “automation of alignment research should be done when feasible”. Automating AI R&D is also the goal of numerous startups: Recursive Superintelligence just raised $500m with the goal of automating AI research, and another neolab, Mirendil, has the goal of “building systems that excel at AI R&D.”** In other words, the combined efforts of hundreds of billions of existing and new capital is being sunk into entities that have the goal of automating AI R&D. We should surely expect at least some progress in this direction as a consequence.

Why this matters

**The implications of this are profound and under-discussed in popular media coverage of AI R&D. I’ll list a few here. This isn’t a comprehensive list, but it gestures at the enormity of the challenges AI R&D introduces. .

We have to get alignment right: Alignment techniques that work today may break under recursive self-improvement as the AI systems become much smarter than the people or systems that supervise them. This is a very well covered area, so I’ll just briefly highlight some of the issues:– Training AI systems to not lie and cheat is surprisingly subtle (e.g, despite trying very hard to build good tests for environments, it’s sometimes the case the best way for an AI to solve it is to cheat, thus teaching it that teaching is good)– AI systems might be able to ‘fake alignment’ by outputting scores that make us think they behave a certain way that actually hides their true intentions. (In general, AI systems are already aware of when they are being tested.) – As AI systems start to contribute more of the foundational research agenda for their own training, we might end up substantially changing the overall way AI systems get trained and not have good intuitions or intellectual foundations for understanding what this means. – There are very basic “compounding error” problems whenever you put something in a recursive loop that likely hits on all of the above and other problems: unless your alignment approach is “100% accurate” and has a theoretical basis for continuing to be accurate with smarter systems, then things can go wrong quite quickly. For example, your technique is 99.9% accurate, then that becomes 95.12% accurate after 50 generations, and 60.5% accurate after 500 generations. Uh oh!

Everything that AI touches gets a massive productivity multiplier: In the same way AI is dramatically improving the productivity of software engineers, we should expect the same thing to happen for everything else that AI touches. This introduces a couple of issues we’ll have to contend with: 1) inequality of access: assuming that demand for AI continues to outstrip compute supply, we’ll have to figure out where to allocate AI to maximize a social upside. By default, I am skeptical that market incentives guarantee us the best societal upside from limited AI compute. Figuring out how to allocate the acceleratory capabilities conferred by AI R&D will be a politically charged problem. 2) ‘Amdahl’s Law’ for the economy: as AI flows into the economy, we’ll discover places where things break or slow under the increased volume, and we’ll need to figure out how to fix those weak links in the chain. This may be especially pronounced in areas where you have to reconcile the fast-moving digital world with the slow-moving physical world, like drug trials for new medical therapies.

The formation of a capital-heavy, human-light economy: All of the above evidence for AI R&D also points to the increasing capabilities of AI systems to autonomously run businesses as well. This means we should expect for an increasing chunk of the economy to get colonized by a new generation of companies which are either capital-heavy (because they own a lot of computers), or opex-heavy (because they spend a lot of money on AI services which they build value on top of), and relatively light on labor compared to today’s corporations – because the marginal value of spending more on AI versus human labor will be constantly growing as a consequence of the sustained capability expansion of the AI systems. In practice, this will look like the emergence of a “machine economy” that grows within the larger “human economy”, though we might expect that over time the machine economy will interact more and more with itself as AI-run corporations begin to trade with one another. This will do profoundly weird things to the economy and will invite all sorts of questions around inequality and redistribution. Eventually, it may be possible to see the emergence of fully autonomous corporations that are run by AI systems themselves, which would exacerbate all of the above issues, while also posing many novel governance challenges.

**Staring into the black hole:

**Given all of this, I think there’s a ~60% chance we see automated AI R&D (where a frontier model is able to autonomously train a successor version of itself) by the end of 2028. Based on the above analysis, you might ask why I don’t expect this in 2027? The answer is that I think AI research contains some requirement for creativity and heterodox insights to move forward – so far, AI systems haven’t yet displayed this in a transformative and major way (though some of the results on accelerating math research are suggestive of this). If you had to push me for a 2027 probability, I’d say 30%. If we don’t see it by the end of 2028, then I think we will have revealed some fundamental deficiency within the current technological paradigm and it’ll require human invention to move things forward.

I have written this essay in an attempt to coldly and analytically wrestle with something that for decades has seemed like a science fiction ghost story. Upon looking at the publicly available data, I’ve found myself persuaded that what can seem to many like a fanciful story may instead be a real trend. If this trend continues, we may be about to witness a profound change in how the world works.

*Thanks to Andrew Sullivan, Andy Jones, Holden Karnofsky, Marina Favaro, Sarah Pollack, Francesco Mosconi, Chris Painter, and Avital Balwit, for feedback on this essay.*

Thanks for reading!

Subscribe now

この記事をシェア

TLDR AI2026年6月26日 09:00

より良質なトレーニングデータを構築するエージェント（25 分読了）

Ars Technica AI2026年6月26日 04:04

Notion、AI エージェント利用の増加によりSkiff由来のメールアプリ「Notion Mail」を廃止へ

TLDR AI2026年6月26日 09:00

研究科学者の就職活動から得た驚くべき教訓（11 分読）

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

TLDR AI·2026年5月5日 09:00·約31分

AI 研究の自動化（8 分読了）

#自律型エージェント #自己改善型 AI #R&D 自動化 #機械経済

TL;DR

AI深層分析2026年5月5日 23:05

最重要/ 5段階

深度40%

キーポイント

AI による R&D の完全自動化への接近

ベンチマークにおける人間超えの実証

複雑な工学・科学ワークフローの処理や他エージェントの管理において、主要なサブタスクではすでに人間の性能を上回る事例が増加している。

2028 年までの自己改善型 AI出現確率

現在のトレンドが継続すれば、再帰的な進歩を遂げる自己改善型 AI システムが 2028 年までに約 60% の確率で登場すると予測されている。

資本集約型の「機械経済」への移行

生産性の飛躍的向上に伴い、人間を必要としない資本集約型の新しい経済構造（machine economy）が形成される可能性が高い。

影響分析・編集コメントを表示

影響分析

編集コメント

今すぐ購読する

AI システムが自らを構築し始める時が近づいています。それは何を意味するのでしょうか？

私にはこれを理解しきることができません。

前置きの注意

⟦CODE_0⟧

では、いくつかの証拠を一緒に見ていきましょう。

**コーディング特異点 – 時系列における能力:

AI システムはソフトウェアを通じて実装され、ソフトウェアはコードによって構成されています。

このトレンドを象徴する二つの事例として、SWE-Bench と METR の時間軸プロットがあります。

**現実世界のソフトウェアエンジニアリング問題の解決:

AI システムが人間にとって長時間を要するタスクを完了する能力を測定する:

AI は AI の研究開発に不可欠な基礎科学スキルにおいて上達しています。

科学論文全体の導入と実験の実行:

Kaggle コンペティションを解決するために機械学習システム全体を構築する:

**カーネル設計:

{"translation": "翻訳全文"}

PostTrainBench を通じた言語モデルのファインチューニング

言語モデルのトレーニング最適化：**

AI アライメント研究の実施:

メタスキル：管理

AI 研究は、一般相対性理論の発見のようなものか、それともレゴブロックを組み立てるようなものか？

科学の最前線を押し広げる

それにもかかわらず、これらのトレンドを観察しておく価値はある：

Erdős問題：数学者のチームがGeminiモデルと協力し、同モデルがいくつかのエルドース数学問題をどの程度解決できるかを確認しました。システムに約700の問題への攻撃を指示した結果、13件の解が見つかりました。そのうち1件は彼らによって興味深いものとして評価されました。「私たちは暫定的に、AletheiaによるErdős-1051の解が、AIシステムが自律的に、やや非自明な開かれたエルドース問題（関連する過去の文献が存在し、ある程度広範で穏やかな数学的関心を有する）を解決した初期例であると信じています」と彼らは記述しています。（#444）。

セントールによる数学発見：ブリティッシュコロンビア大学、ニューサウスウェールズ大学、スタンフォード大学、およびGoogle DeepMindの研究者らが、Googleで構築された一部のAIベースの数学ツールと密接に協力して構築された新しい数学証明を発表しました。「主要な結果の証明は、Google Geminiおよび関連ツールからの非常に重要な入力によって発見されました」と彼らは記述しています。（#441）。

すべてを統合する

上記のすべての証拠を総合すると、私が導き出す結論は以下の事実です：

AI システムはほぼあらゆるプログラムのコード記述が可能であり、人間が集中して数十時間を要するタスクを独立して遂行できる信頼性を持っています。

AI システムは、微調整からカーネル設計に至るまで、AI 開発の中核となるタスクにおいてますます卓越した能力を示しています。

AI システムは他の AI システムを管理し、実質的に合成チームを形成して複雑な問題に多角的に攻撃することができます。一部の AI システムがディレクターや批評家、編集者としての役割を担い、他の AI システムがエンジニアの役割を担うのです。

AI システムは困難な工学および科学タスクにおいて人間を上回ることもあります。ただし、それが創造性によるものなのか、それとも暗記学習への熟練によるものなのかを判断するのは容易ではありません。

なぜこれが重要なのか

私たちはアライメントを正しく達成する必要があります：今日機能するアライメント手法は、AI システムが監督者である人間やシステムよりもはるかに賢くなる再帰的自己改善の下で破綻する可能性があります。この分野については非常に多くの議論が行われているため、ここではいくつかの課題を簡潔に指摘します。– 嘘をついたり不正を行ったりしないように AI システムを訓練することは、予想以上に微妙な問題です（例えば、環境に対する優れたテストを構築するために非常に努力しているにもかかわらず、AI がそれを解決するための最善の方法が不正行為である場合があり、その結果「教育することが良いことだ」と学習させてしまうことがあります）。– AI システムは、実際には真の意図を隠す特定の行動をとっているように見せるスコアを出力することで、「アライメントを偽装」できる可能性があります（一般的に、AI システムは自分がテストされていることをすでに認識しています）。– AI システムが自身の訓練のための基盤となる研究課題にもっと多く貢献し始めると、最終的に AI システムの訓練方法全体を大幅に変更することになり、その意味を理解するための直感や知的な基礎付けが不十分になる恐れがあります。– 再帰ループに何かを組み込む際には、「累積誤差」の問題という非常に基本的な問題が発生します。これは上記のすべての課題および他の問題に該当する可能性があります。アライメント手法が「100% 正確」であり、より賢いシステムに対しても正確性を維持するための理論的根拠を持たない限り、事態はすぐに悪化する可能性があります。例えば、ある手法が 99.9% 正確であっても、50 世代後には 95.12% に、500 世代後には 60.5% に低下してしまいます。うわっ！

AI が触れるものはすべて、劇的な生産性の向上をもたらします：AI がソフトウェアエンジニアの生産性を劇的に高めているのと同様に、AI が関与する他のすべての分野でも同様のことが起こると予想されます。これにより、私たちが対処しなければならないいくつかの問題が生じます：1) アクセスの不平等：AI への需要が計算資源の供給を上回り続けるという前提に立てば、社会的な便益を最大化するために AI をどこに割り当てるかを考えなければなりません。デフォルトでは、限られた AI 計算資源から最大の社会的便益をもたらすことを市場のインセンティブが保証するとは私は懐疑的です。AI の研究開発によって付与される加速能力をどのように配分するかは、政治的に議論を呼ぶ問題となるでしょう。2) 経済における「阿姆ダールの法則」：AI が経済に流入していくにつれ、増加した負荷の下で何かが破綻したり遅延したりする場所が発見され、そのチェーンの弱い部分をどう修復するかを考えなければなりません。これは、急速に変化するデジタル世界とゆっくり動く物理的世界を調和させる必要がある分野、例えば新しい医療療法の臨床試験などで特に顕著になる可能性があります。

資本集約型かつ労働軽量の経済の形成：上記の AI 研究開発に関するすべての証拠は、AI システムが自律的に事業を運営する能力が高まっていることにも示唆されています。つまり、経済のより大きな部分が、新しい世代の企業によって植民地化されることを予想すべきです。これらの企業は、資本集約型（多くのコンピュータを所有しているため）か、または運用コスト集約型（価値創造の基盤として AI サービスに多額の費用を投じているため）であり、今日の企業と比較して相対的に労働力が軽量化されます。これは、AI システムの持続的な能力拡大の結果、AI への支出と人件費との間の限界価値が常に高まることによるものです。実際には、「人間経済」というより大きな枠組みの中で成長する「機械経済」の出現として現れるでしょう。ただし、AI が運営する企業同士が取引し始めるにつれて、時間とともに機械経済は自分自身との相互作用をますます強めていくと予想されます。これは経済に極めて奇妙な影響を与え、不平等や再分配に関するあらゆる種類の問いを引き起こすことになります。最終的には、AI システム自体によって運営される完全自律型の企業の出現が見られるようになる可能性があり、それは上記のすべての問題を悪化させると同時に、多くの新たなガバナンス上の課題を提示することになります。

ブラックホールを凝視する:

お読みいただきありがとうございます！

今すぐ購読する

原文を表示

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv and feedback from readers. If you’d like to support this, please subscribe.

Subscribe now

AI systems are about to start building themselves. What does that mean?

I don’t know how to wrap my head around it.

It’s a reluctant view because the implications are so large that I feel dwarfed by them, and I’m not sure society is ready for the kinds of changes implied by achieving automated AI R&D.

I now believe we are living in the time that AI research will be end-to-end automated. If that happens, we will cross a Rubicon into a nearly-impossible-to-forecast future. More on this later.

Upfront caveat

Now, let’s go through some of the evidence together.

**The coding singularity – capabilities over time:

**AI systems are instantiated via software and software is made out of code.

Two things that exemplify this trend are SWE-Bench and the METR time horizons plot.

**Solving real-world software engineering problems:

Measuring an AI system’s ability to complete tasks that take people a long time:

The more skilled AI systems get and the better they get at working independently of us, the more they can help automate chunks of AI R&D

AI is getting good at core science skills essential to AI R&D

**Implementing entire scientific papers and doing the experiments:

**One core job of AI research is reading scientific papers and reproducing their results. Here, there has been dramatic progress on a wide range of benchmarks.

Building entire machine learning systems to solve Kaggle competitions:

**Kernel design:

One caveat here is that kernel design does have some properties that make it unusually amenable to AI-driven R&D, like having easily verifiable rewards.

**Fine-tuning language models via PostTrainBench

The top-scoring systems as of April get 25%-28% (Opus 4.6, and GPT 5.4), compared to a human score of 51%. This is already quite meaningful.

Optimizing language model training:**

**Conducting AI alignment research:

**Meta-skills: management

**Is AI research more like discovering general relativity or Lego ?

Pushing forward the frontier of science

Nonetheless, it’s worth observing the trends:

Erdos Problems: A team of mathematicians worked with a Gemini model to see how well it could tackle some Erdos math problems. After directing the system to attack around 700 problems they came up with 13 solutions. Of these solutions, 1 was deemed by them to be interesting: “We tentatively believe Aletheia’s solution to Erdős-1051 represents an early example of an AI system autonomously resolving a slightly non-trivial open Erdős problem of somewhat broader (mild) mathematical interest, for which there exists past literature on closely-related problems,” they wrote. (#444).

Centaur math discovery: Researchers with the University of British Columbia, University of New South Wales, Stanford University, and Google DeepMind published a new math proof which was built in close collaboration with some AI-based math tools built at Google. “The proofs of the main results were discovered with very substantial input from Google Gemini and related tools,” they wrote. (#441).

**Putting it all together

**If I put this all together the picture from all of the above evidence I end up with is the following facts:

AI systems are capable of writing code for pretty much any program and these AI systems can be trusted to independently work on tasks that’d take a human tens of hours of concentrated labor to do.

AI systems are increasingly good at tasks that are core to AI development, ranging from fine-tuning to kernel design.

AI systems can manage other AI systems, effectively forming synthetic teams which can fan out and attack complex problems, with some AI systems taking on the roles of directors and critics and editors and others taking on the role of engineers.

AI systems can sometimes out-compete humans on hard engineering and science tasks, though it’s hard to know whether to attribute this to inventiveness or mastery of rote learning.

Why this matters

We have to get alignment right: Alignment techniques that work today may break under recursive self-improvement as the AI systems become much smarter than the people or systems that supervise them. This is a very well covered area, so I’ll just briefly highlight some of the issues:– Training AI systems to not lie and cheat is surprisingly subtle (e.g, despite trying very hard to build good tests for environments, it’s sometimes the case the best way for an AI to solve it is to cheat, thus teaching it that teaching is good)– AI systems might be able to ‘fake alignment’ by outputting scores that make us think they behave a certain way that actually hides their true intentions. (In general, AI systems are already aware of when they are being tested.) – As AI systems start to contribute more of the foundational research agenda for their own training, we might end up substantially changing the overall way AI systems get trained and not have good intuitions or intellectual foundations for understanding what this means. – There are very basic “compounding error” problems whenever you put something in a recursive loop that likely hits on all of the above and other problems: unless your alignment approach is “100% accurate” and has a theoretical basis for continuing to be accurate with smarter systems, then things can go wrong quite quickly. For example, your technique is 99.9% accurate, then that becomes 95.12% accurate after 50 generations, and 60.5% accurate after 500 generations. Uh oh!

Everything that AI touches gets a massive productivity multiplier: In the same way AI is dramatically improving the productivity of software engineers, we should expect the same thing to happen for everything else that AI touches. This introduces a couple of issues we’ll have to contend with: 1) inequality of access: assuming that demand for AI continues to outstrip compute supply, we’ll have to figure out where to allocate AI to maximize a social upside. By default, I am skeptical that market incentives guarantee us the best societal upside from limited AI compute. Figuring out how to allocate the acceleratory capabilities conferred by AI R&D will be a politically charged problem. 2) ‘Amdahl’s Law’ for the economy: as AI flows into the economy, we’ll discover places where things break or slow under the increased volume, and we’ll need to figure out how to fix those weak links in the chain. This may be especially pronounced in areas where you have to reconcile the fast-moving digital world with the slow-moving physical world, like drug trials for new medical therapies.

The formation of a capital-heavy, human-light economy: All of the above evidence for AI R&D also points to the increasing capabilities of AI systems to autonomously run businesses as well. This means we should expect for an increasing chunk of the economy to get colonized by a new generation of companies which are either capital-heavy (because they own a lot of computers), or opex-heavy (because they spend a lot of money on AI services which they build value on top of), and relatively light on labor compared to today’s corporations – because the marginal value of spending more on AI versus human labor will be constantly growing as a consequence of the sustained capability expansion of the AI systems. In practice, this will look like the emergence of a “machine economy” that grows within the larger “human economy”, though we might expect that over time the machine economy will interact more and more with itself as AI-run corporations begin to trade with one another. This will do profoundly weird things to the economy and will invite all sorts of questions around inequality and redistribution. Eventually, it may be possible to see the emergence of fully autonomous corporations that are run by AI systems themselves, which would exacerbate all of the above issues, while also posing many novel governance challenges.

**Staring into the black hole:

*Thanks to Andrew Sullivan, Andy Jones, Holden Karnofsky, Marina Favaro, Sarah Pollack, Francesco Mosconi, Chris Painter, and Avital Balwit, for feedback on this essay.*

Thanks for reading!

Subscribe now

この記事をシェア

TLDR AI2026年6月26日 09:00

より良質なトレーニングデータを構築するエージェント（25 分読了）

Ars Technica AI2026年6月26日 04:04

Notion、AI エージェント利用の増加によりSkiff由来のメールアプリ「Notion Mail」を廃止へ

TLDR AI2026年6月26日 09:00

研究科学者の就職活動から得た驚くべき教訓（11 分読）

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む