The Zvi · May 7, 2026, 22:43 · approx. 25 min read

AI #167: The Prior Restraint Era Begins

#Regulation #LLM #OpenAI #Anthropic #SpaceX #PriorRestraint

TL;DR

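With the White House signaling that it wants prior-restraint authority over AI model releases and floating FDA-style regulation, this issue covers several major stories shaking the industry: Anthropic’s rapid growth, its infrastructure deal with SpaceX, and detailed testimony in the Musk v. OpenAI trial.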

AI Deep Analysis · May 7, 2026, 23:05

Importance: / 5

Depth: 40%

Key Points

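The prior restraint era and where regulation is heading

The White House is seeking the power to review and veto model releases in advance. Hassett cited the FDA as the parallel, and regulation that strict is seen as risking a stall in development.

Anthropic’s rapid expansion and infrastructure strategy

In addition to a long-term deal with Google, Anthropic is leasing SpaceX’s Colossus 1, which let it add compute immediately and raise usage limits.

Testimony in Musk v. OpenAI

Sworn testimony is producing a more reliable account of past events, including some new details, though what is presented in court is described as a ‘strange shadow of reality.’

Corporate moves and the impact on jobs

Anthropic’s valuation and revenue are surging, while Coinbase announced a roughly 14% workforce cut citing AI, as the structure of employment shifts across the industry.

Evidence for AI mental health support

A cheap AI app based on GPT-4.1-Mini improved mental health in depressed Mexican women by 0.3 standard deviations over six months and made users more likely to seek professional help.

AI dependence and shallow learning in Go

According to Ashe Nunez’s analysis, Go players are not getting stronger in the AI era except by memorizing early moves. AI cheating is rampant, and those who rely on AI pick up shallow concepts rather than deep understanding.

Growing regulatory and ethical concerns

Between OpenAI’s off-putting messaging campaigns, talk of FDA-style regulation, and Nvidia chips smuggled into China, the AI industry faces legal and political pressure along with internal divisions.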

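Impact Analysis

This news suggests the AI industry is at a historic turning point, moving from ‘free development and release’ toward ‘strict advance control and regulation by government.’ In particular, the FDA-style regulation under discussion could fundamentally change the pace and direction of AI development in the United States, and international coordination with China is also being explored. At the same time, the scramble for the compute needed to run large models, and the partnerships and competition it drives among companies, is accelerating.

Editorial Comment

The introduction of ‘prior restraint’ is one of the most serious threats to the open ecosystem of AI development and could have a decisive effect on the pace of future innovation. Meanwhile, intensifying competition on infrastructure, as in the Anthropic and SpaceX tie-up, is a trend that cannot be ignored.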

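The era of training frontier models and then releasing them whenever you wanted?

That was fun while it lasted. It looks likely to be over now. The White House wants to get an advance look and have the option to veto your release decisions, and it has used this veto on an expansion of access to Mythos.

We have additional clarity on what that might mean, and it does not look good. Hassett explicitly used the FDA as a parallel, which is the actual worst option unless your goal is to strangle or pause AI development in America without a parallel action from China. That doesn’t seem like a great plan to me, and Susie Wiles is out doing damage control. The part where we are talking to China to coordinate model access restrictions does seem better.

Anthropic continues its explosive growth, and it continues to strike compute deals. In addition to a long-term expanded deal with Google, Anthropic is now leasing SpaceX’s Colossus 1, which has let them expand usage limits immediately, and Elon Musk is now speaking positively about Anthropic, including its motivations.

This comes as we get testimony in the Musk vs. OpenAI trial. Mostly everyone is rehashing all the things we already know, but now everyone is under oath, so we get a more reliable version of exactly what happened, including some new details. It is possible I and others should be scouring the court transcripts more carefully, but mostly it seems like old rehashing at this point. The version of things that is presented in court is always kind of a strange shadow of reality.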

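Table of Contents

Also this week: The AI Ad-Hoc Prior Restraint Era Begins, What is Anthropic?

Language Models Offer Mundane Utility. Mental health, wellness checks.

Language Models Don’t Offer Mundane Utility. People cheating at Go. Why?

Huh, Upgrades. GPT-5.5 Instant, faster Gemma 4, OpenAI account security.

Grok 4.3 Exists But xAI Kind Of Doesn’t. No one seems impressed.

Show Me The Compute. Anthropic leases Colossus 1 from SpaceX.

On Your Marks. ProgramBench where everyone scores 0%, GPT-5.5 on Voxel.

Copyright Confrontation. Meta gets sued again.

Deepfaketown and Botpocalypse Soon. Slop choices are bad.

Fun With Media Generation. Create menus with images of the food.

A Young Lady’s Illustrated Primer. Do your writing in person, you cheater.

Cyber Lack of Security. Glasswing needs to pick up the pace.

They Took Our Jobs. Coinbase cuts workforce by 14%, citing AI.

The Art of the Jailbreak. Elon Musk, like the moon, is made of cheese.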

⟦CODE_0⟧

⟦CODE_1⟧

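Introducing. GENE-26.5 is the latest semi-spooky robotics demo. Let them cook.

Musk v OpenAI. Some highlights from the testimony.

Show Me the Money. Anthropic hits $44 billion ARR, might raise at >$900 billion.

Peace In Our Time. Anthropic and Elon Musk sing each other’s praises.

Quiet Speculations. Is closed source pulling away from open source?

Quickly, There’s No Time. Jack Clark raises alarm for RSI (recursive self-improvement) soon.

The Quest for Sane Regulations. New Maryland and Connecticut laws.

People Really Hate AI. Who will turn this to their political advantage?

Chip City. ~3% of global compute is smuggled-into-China Nvidia chips.

The Week in Audio. METR, Wildeford, Eliezer and doom.

People Just Say Things.

People Just Publish Things.

Google Sells Out. DeepMind workers vote to unionize in response.

Greetings From Project Glasswing. Use your leverage while you have it.

The Prior Restraint Era Begins. Sacks is out, talk of FDA-style regs is in?

Is This Even Legal? Probably not, but do you think that will stop them?

Pick Up The Phone. US and China talk about restricting access to models.

Rhetorical Innovation. ‘AI as normal technology’ as good essay, but bad meme.

People On The Internet Sometimes Lie. Including about Amanda Askell.

Goblin Mode. I also hear the goblins are all over TikTok now. It begins.

The Mask Comes Off. OpenAI’s comically villainous messaging campaigns.

Aligning a Smarter Than Human Intelligence is Difficult. Things to worry about.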

Some Penalties May Apply. It does not seem so fun to be GPT-5.5.

Messages From Janusworld. Deepfates offers a handy guide.

Good Advice. What advice do people seek when they seek LLM advice?

The Lighter Side. Pi Hard.

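Language Models Offer Mundane Utility

Access to a cheap, basic mental health AI app based on GPT-4.1-Mini improved mental health in depressed Mexican women by 0.3 standard deviations over six months. The study has some issues with interpretation, potential selection effects and placebo effects, but there is probably at least some signal here. Such things are better than nothing, nothing is usually the practical alternative, and the app made users more likely to seek out professional human help rather than less likely.

Have AI do wellness checks.

Opus 4.7 is too online, knows its AI Twitter posters. And yes, this is a good use of training compute, we have plenty.

Check out satellite images of damaged US military bases and otherwise find data to report. Naturally the journalist thinks this is the ‘most revolutionary and transformative’ thing AI is doing, but we’re distracted by ‘all the hype.’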

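Language Models Don’t Offer Mundane Utility

Recommended article: Contrary to the popular narrative, Ashe Nunez finds that Go players are not getting stronger in the AI era except via memorizing early moves, that AI cheating is rampant at most levels of online play, and that those who use it mostly disempower themselves and use it to learn only shallow concepts rather than deep understanding. He compares them to European math students who try to memorize a bundle of techniques to pass exams but never learn to think like a mathematician.

Lawrence in the comments observes a similar pattern with many vibe coders: they never look at the code, they don’t notice that they don’t understand things and thus don’t learn, the code ends up as a giant pile of slop, and when the model gets stuck they can’t fix it. Here as always, you could use the AI as an opportunity to learn the underlying skills, but most don’t do that.

The other story here is that the Go world is completely unwilling to punish players for using AI on the basis of statistical evidence, even when that evidence is overwhelming. It is trivial to know who is cheating, but the system has collectively decided to disempower itself against that, destroying any chance of fair online play. Chess has the same issues but is doing at least somewhat better.

AI still has not convincingly crushed RTS games, but at this point that is surely because no one cares enough to do so. Put enough of a bounty on StarCraft, and it will fall fast.

AI and all this other technology gives us a bunch of local utility and material wealth, but overall, for most people, it does not seem to be making us happy, or helping us meet other people romantically or platonically, get married, have children, sing and dance or otherwise live life. In particular, here Connor looks at algorithms and the panopticon, and the fear that if you try to dance or approach someone you will get recorded. I want to note (non-AI statistical literacy tip!) that this is mostly overblown, and you should have absolutely no fear of being recorded dancing even if you suck at it, or doing anything else actually reasonable. Of course, if the person you’re interacting with in such ways actively takes out their phone and plausibly is now using it to record, you take the hint and you depart.

AI is raising the price of some electronics inputs, some software prices and, in some regions, the price of electricity. In exchange many other things are cheaper, often in ways that are hard to notice.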

Huh, Upgrades

GPT-5.5-Instant is out now, and is more concise, smarter, clearer, more personalized and warmer, or so they say.

Gemma 4 is now three times faster via predicting multiple tokens at once.

OpenAI offers opt-in Advanced Account Security to protect your account. Users of Trusted Access for Cyber will be required to use it.

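Grok 4.3 Exists But xAI Kind Of Doesn’t

Grok 4.3 is on the API and everything, priced at $1.25/$2.50.

It does not much participate in Vending-Bench, where it ‘has a narcolepsy problem’ and often takes no action for multiple days.

It gets a 53 from Artificial Analysis, good for 7th place, well behind the big players. It’s a small, cheaper model rather than a frontier offering. From what I can tell, the release is unimpressive and not impactful, and I’m not planning to investigate further.

They are going to sunset grok-4.1 and grok-4 on May 15, with only two weeks’ notice, and they are not offering a similarly fast and cheap alternative to 4.1-fast. This is a rather harsh lesson for many of the few who invested in that ecosystem.

Elon Musk: xAI will be dissolved as a separate company, so it will just be SpaceXAI, the AI products from SpaceX

Charles: The impact was when the whole team left and they started renting out their GPUs to Cursor, this is just confirmation of what was already true.

Indeed, SpaceX (including xAI) may no longer be that interested in frontier models. They were never good at frontier models. They were mainly good at compute.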

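Show Me The Compute

You know who needs compute? Everyone. But especially Anthropic.

This week kicked off with Anthropic committing to $200 billion in spending on Google cloud and chips over five years. Earlier this week, before other compute news broke, I wrote that this was still very much not enough compute, and then added this:

Elon Musk spent to assemble a massive fleet of GPUs for xAI, and they are sitting at 11% utilization. You know, there are people who would pay good money to utilize those GPUs the other 89% of the time.

To be fair, I was far from the only one thinking and saying this, e.g. see The All-In Podcast. It was pretty obvious.

Well, yeah, it turns out those people will indeed pay good money. Anthropic has finally struck the obvious deal with SpaceX for access to Colossus 1. This is not as large as their other deals, but it comes online now instead of next year. This is in addition to SpaceX supplying a bunch of compute to Cursor (SpaceX is effectively buying Cursor, but can’t finalize the deal before its IPO for legal and logistical reasons).

Claude: We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity.

This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.

Claude: Effective today, we are:

Doubling Claude Code’s 5-hour rate limits for Pro, Max, and Team plans;

Removing the peak hours limit reduction on Claude Code for Pro and Max plans; and

Substantially raising our API rate limits for Opus models.

Claude: Our agreement with @SpaceX means we will use all the compute capacity at their Colossus 1 data center.

This will give us over 300 megawatts of additional capacity to deploy within the month.

NVIDIA: Two frontier labs. One accelerated computing platform. Congrats to @SpaceX and @AnthropicAI on the new compute partnership, powered by 220,000+ NVIDIA GPUs inside Colossus 1. The future of AI runs on NVIDIA.

SpaceX notes Anthropic has expressed an interest in partnering to produce gigawatts of orbital AI compute capacity. I don’t expect that to be a thing, but sure, why not express the interest? Let Elon Musk try. If the economics work, then putting the centers in space is great on many other levels; if not, then no harm done, and you have built goodwill either way.

Anthropic notes that the 80x growth caught them off guard, which is highly understandable, and the SpaceX deal is a first attempt to address the compute shortage, but the search continues.

Anthropic likely will be in search of all the compute it can find for the foreseeable future. If you are growing at 10x, let alone 80x, per year, the search does not stop.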

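So what does all this mean for SpaceX(ai)?

I think the dissolution is not news. The news is that xAI lost its talent, and its models have been not good, and Elon already said he would be starting from scratch.

The logical plan is to turn this into mainly a compute company, provide that compute to Anthropic and others, and use that leverage to try and steer the future.

rohit: Elon’s extraordinary hardware genius shows up again. He fumbled the model but built a neocloud that’s highly competitive and works great for frontier labs.

Also, fwiw, I pointed this out 4 years ago. That Elon’s unique talent is suited better to some things than others. Getting a neocloud up and running is a known but hard thing to do, getting a model to be as good as the frontier labs is an unknown and hard thing to do.

This is a great deal for both parties btw.

Derek Thompson: I don’t think I’ve seen this take before but I like it.

Musk has been world-leading at compressing money, resources, and time to make “known/hard” things at scale (make an electric car, make batteries, make a cheaper bigger rocket, all of which already existed but worse, at less scale, or more expensively), but he’s less than world-leading at cracking open breakthroughs in more unknown spaces.

So it would make sense that XAI is lagging the frontier labs on new AI agents, but also that he’d have built a neocloud to power those models once they run short of compute

Dean W. Ball: I would be very excited about xAI/SpaceX as an AI infrastructure firm. Elon’s great strength, where he is truly GOATed, is building things in the real world. Colossus came online faster than anyone expected. Huge asset for America.

Elon Musk repeatedly looks at problems, says ‘oh it is physically possible to do that,’ strips away everything physically unnecessary, does not take no for an answer, learns every technical detail, and then drives very smart people to spend insane hours making the physically possible thing happen. He embodies Shut Up And Do the Impossible, but for the kind of impossible that is a game difficulty level that is indeed totally possible with known tech.

He has his heuristics. When they work, there’s no one better. For compute it works.

Trying to create frontier models is a different beast. It requires a different style of approach, the same way government required a different approach. It didn’t work with OpenAI, and it didn’t work with xAI. That’s okay. Division of labor is a thing. He’s creating and also has plenty of other problems.

I still don’t actually believe in the orbital data centers, in the sense that I don’t think they’re physically a good idea. But if they are, yeah, Elon Musk is the one to do those.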

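On Your Marks

The creators of SWE-Bench give us ProgramBench, where you recreate executable programs from scratch without the internet. All current models tested score 0%, with Opus 4.7 on top for getting an ‘almost’ 3% of the time. GPT-5.5 and Mythos were not tested.

GPT-5.5 represents a huge jump on VoxelBench.

Epoch’s ECI now can distinguish areas of capability, and as expected shows that Claude’s relative capabilities are strongest in software engineering, where it scores highest. GPT-5.5 has the highest general score.

Copyright Confrontation

New class action lawsuit from five publishers and Scott Turow goes after Meta for copyright infringement around model training, claiming they trained on pirated books.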

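Deepfaketown and Botpocalypse Soon

r/MyBoyfriendIsAI continues to be 10x the size of r/MyGirlfriendIsAI.

Some light reading:

John Arnold: hahahahhaha

Imke Reimers & Joel Waldfogel: The diffusion of LLMs from 2022 to 2025 tripled new book releases. While average book quality, measured by usage, declined, the surge in releases raised the number of modest-quality books. Direct evidence using AI detection shows that AI-containing books have lower quality, and their rising share, topping half of 2025 releases, drives the overall decline. A nested logit calibration shows that AI books raised consumer surplus by seven percent in 2025. Author selection accounts for most of the AI quality differential, and the AI-human differential shrinks over time. Finally, AI has not displaced authors active prior to LLMs.

The idea that consumer surplus is higher is based on the assumption that consumers can filter well and have little additional search cost. Those extra 200,000 slop books don’t matter because no one chooses them, and more choice is always good. I don’t think that’s how this works. Worse books that displace better books are negative value, even among books written reasonably by real humans.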

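Fun With Media Generation

Karpathy vibe coded a system to put pictures next to items on a menu, but Gemini reportedly now does that with a one-line prompt. There will be many such cases. That doesn’t mean you shouldn’t vibe code such tools, but you should require them to ‘pay for themselves’ relatively quickly. I tested this on my favorite restaurant, and found Gemini’s version not to be useful. ChatGPT did better. I think to upgrade further from the OpenAI version you’d need to be going on the web to learn about the restaurant.

Put yourself in all the movies.

A Young Lady’s Illustrated Primer

Some classes are adjusting to AI by having writing be in person, since the take-home essays are mostly written by AI. Good.

Cyber Lack of Security

Bloomberg’s Andrew Martin covers why Anthropic’s Mythos is sparking global alarm. The world has still patched less than 1% of potential vulnerabilities. Hurry up, people.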

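They Took Our Jobs

Coinbase cuts workforce by ~14%, citing productivity gains from AI and the transition to being AI-native as the central justification. A new rule is ‘no pure managers.’

A Chinese judge rules that ‘the AI can now do large parts of your job for you’ does not constitute a ‘major change in objective circumstances,’ meaning in practice that if they fire you or try to lower your pay they have to give you full severance, which can be a lot. Labor law still applies, and yes, China has labor protections.

The Art of the Jailbreak

You cannot simply ask Grok to tell you that Elon Musk is made of cheese. Pliny can.

Introducing

GENE-26.5, a robotic brain from Genesis.ai, with an attached demo, including letting it cook, play a piano and solve a Rubik’s Cube. I did not feel much because I mentally had this priced in, but many of you are not pricing this in.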

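Musk v OpenAI

The lawsuit is in its critical phases. Here is a Wiki with statements from the trial.

Rat King has a thread covering Musk’s testimony.

rat king: i am not really sure how often lawyers try to endear themselves to judges but Musk's lawyer, Steven Molo, does not seem to be trying to do that

right now he's trying to get "extinction risk" discussion into the court discussion.

"This is a real risk. we all could die."

I mean, he’s not wrong, and I hope Judge Gonzalez is also not wrong here:

rat king: judge Gonzalez: "I suspect that there are a number of people who do not want to put the future of humanity in Mr. Musk's hands. But we're not going to get into that. This is not a trial on the safety risks of artificial intelligence."

Ultimately, yes, we are in the full Don’t Look Up timeline, with lines like this:

TBPN: The judge presiding over the OpenAI-Elon trial has prohibited the lawyers from dwelling on doomerism and x-risk.

"She's like, 'Look, that's kind of a sideshow distraction. Extinction of humanity stuff is not the point of this case.'"

The judge is technically correct, but yeah, that’s kind of how the world ends, huh?

Here’s a f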

Original text

The era of training frontier models and then releasing them whenever you wanted?

That was fun while it lasted. It looks likely to be over now. The White House wants to get an advance look and have the option to veto your release decisions, and it has used this veto on an expansion of access to Mythos.

We have additional clarity on what that might mean and it does not look good. Hassett explicitly used the FDA as a parallel, which is the actual worst option unless your goal is to strangle or pause AI development in America, without a parallel action from China. That doesn’t seem like a great plan to me and Susie Wiles is out doing damage control. The part where we are talking to China to coordinate model access restrictions does seem better.

Anthropic continues its explosive growth, and it continues to strike compute deals. In addition to a long term expanded deal with Google, Anthropic is now leasing SpaceX’s Colossus 1, which has let them expand usage limits immediately, and Elon Musk is now speaking positively about Anthropic, including its motivations.

This comes as we get testimony in the Musk vs. OpenAI trial. Mostly everyone is rehashing all the things we already know, but now everyone is under oath so we get a more reliable version of exactly what happened, including some new details. It is possible I and others should be scouring the court transcripts more carefully, but mostly it seems like old rehashing at this point. The version of things that is presented in court is always kind of a strange shadow of reality.

Table of Contents

Also this week: The AI Ad-Hoc Prior Restraint Era Begins, What is Anthropic?

Language Models Offer Mundane Utility. Mental health, wellness checks.

Language Models Don’t Offer Mundane Utility. People cheating at Go. Why?

Huh, Upgrades. GPT-5.5 Instant, faster Gemma 4, OpenAI account security.

Grok 4.3 Exists But xAI Kind Of Doesn’t. No one seems impressed.

Show Me The Compute. Anthropic leases Colossus 1 from SpaceX.

On Your Marks. ProgramBench where everyone scores 0%, GPT-5.5 on Voxel.

Copyright Confrontation. Meta gets sued again.

Deepfaketown and Botpocalypse Soon. Slop choices are bad.

Fun With Media Generation. Create menus with images of the food.

A Young Lady’s Illustrated Primer. Do your writing in person, you cheater.

Cyber Lack of Security. Glasswing needs to pick up the pace.

They Took Our Jobs. Coinbase cuts workforce by 14%, citing AI.

The Art of the Jailbreak. Elon Musk, like the moon, is made of cheese.

Introducing. GENE-26.5 is the latest semi-spooky robotics demo. Let them cook.

Musk v OpenAI. Some highlights from the testimony.

Show Me the Money. Anthropic hits $44 billion ARR, might raise at >$900 billion.

Peace In Our Time. Anthropic and Elon Musk sing each others’ praises.

Quiet Speculations. Is closed source pulling away from open source?

Quickly, There’s No Time. Jack Clark raises alarm for RSI soon.

The Quest for Sane Regulations. New Maryland and Connecticut laws.

People Really Hate AI. Who will turn this to their political advantage?

Chip City. ~3% of global compute is smuggled-into-China Nvidia chips.

The Week in Audio. METR, Wildeford, Eliezer and doom.

People Just Say Things.

People Just Publish Things.

Google Sells Out. DeepMind workers vote to unionize in response.

Greetings From Project Glasswing. Use your leverage while you have it.

The Prior Restraint Era Begins. Sacks is out, talk of FDA-style regs is in?

Is This Even Legal? Probably not, but do you think that will stop them?

Pick Up The Phone. US and China talk about restricting access to models.

Rhetorical Innovation. ‘AI as normal technology’ as good essay, but bad meme.

People On The Internet Sometimes Lie. Including about Amanda Askell.

Goblin Mode. I also hear the goblins are all over TikTok now. It begins.

The Mask Comes Off. OpenAI’s comically villainous messaging campaigns.

Aligning a Smarter Than Human Intelligence is Difficult. Things to worry about.

Some Penalties May Apply. It does not seem so fun to be GPT-5.5.

Messages From Janusworld. Deepfates offers a handy guide.

Good Advice. What advice do people seek when they seek LLM advice?

The Lighter Side. Pi Hard.

Language Models Offer Mundane Utility

Access to a cheap, basic mental health AI app based on GPT-4.1-Mini improved mental health in depressed Mexican women by 0.3 standard deviations over six months. The study has some issues with interpretation and potential selection effects and also placebo effects, but there’s probably at least some signal here. Such things are better than nothing, nothing is usually the practical alternative, and the app made the users more likely to seek out professional human help rather than less likely.

Have AI do wellness checks.

Opus 4.7 is too online, knows its AI Twitter posters. And yes, this is a good use of training compute, we have plenty.

Check out satellite images of damaged US military bases and otherwise find data to report. Naturally the journalist thinks this is the ‘most revolutionary and transformative’ thing AI is doing, but we’re distracted by ‘all the hype.’

Language Models Don’t Offer Mundane Utility

Recommended article: Contrary to the popular narrative, Ashe Nunez finds that Go players are not getting stronger in the AI era except via memorizing early moves, that AI cheating is rampant in most levels of online play, and that those who use it mostly disempower themselves and use it to learn only shallow concepts rather than deep understanding. He equates them to European math students who try to memorize a bundle of techniques to pass exams but never learn to think like a mathematician.

Lawrence in the comments observes a similar pattern with many vibe coders, where they never look at the code, they don’t notice that they don’t understand things and thus don’t learn, the code ends up as a giant pile of slop and when the model gets stuck they can’t fix it. Here as always, you could use the AI as an opportunity to learn the underlying skills, but most don’t do that.

The other story here is that the Go world is completely unwilling to punish players for using AI via statistical evidence, even when the statistical evidence is overwhelming. It is trivial to know who is cheating, but the system has collectively decided to disempower itself against that, destroying any chance of fair online play. Chess has the same issues but is doing at least somewhat better.

AI still has not convincingly crushed RTS games, but at this point that is surely because no one cares enough to do so. Put enough of a bounty on StarCraft, and it will fall fast.

AI and all this other technology gives us a bunch of local utility and material wealth, but overall for most people does not seem to be making us happy, helping us meet other people romantically or platonically, get married, have children, sing and dance or otherwise live life. In particular here Connor looks at algorithms and the panopticon, and the fear that if you try to dance or approach you will get recorded. I want to note (non-AI statistical literacy tip!) that this is mostly overblown, and you should absolutely have no fear of being recorded dancing even if you suck at it, or doing anything else actually reasonable. Of course, if the person you’re interacting with in such ways actively takes their phone and plausibly is now using it to record, you take the hint and you depart.

AI is raising the price of some electronics inputs, some software prices and in some regions the price of electricity. In exchange many other things are cheaper, often in ways that are hard to notice.

Huh, Upgrades

GPT-5.5-Instant is out now, and is more concise, smarter, clearer, more personalized and warmer, or so they say.

Gemma 4 is now three times faster via predicting multiple tokens at once.

OpenAI offers opt-in Advanced Account Security to protect your account. Users of Trusted Access for Cyber will be required to use it.

Grok 4.3 Exists But xAI Kind Of Doesn’t

Grok 4.3 is on the API and everything, priced at $1.25/$2.50.

It does not much participate in Vending-Bench, where it ‘has a narcolepsy problem’ and often takes no action for multiple days.

It gets a 53 from Artificial Analysis good for 7th place, well behind the big players. It’s a small cheaper model rather than a frontier offering. From what I can tell, the release is unimpressive and not impactful, and I’m not planning to investigate further.

They are going to sunset grok-4.1 and grok-4 on May 15, with only two weeks’ notice, and they are not offering a similarly fast and cheap alternative to 4.1-fast. This is a rather harsh lesson for many of the few who invested in that ecosystem.

Elon Musk: xAI will be dissolved as a separate company, so it will just be SpaceXAI, the AI products from SpaceX

Charles: The impact was when the whole team left and they started renting out their GPUs to Cursor, this is just confirmation of what was already true.

Indeed, SpaceX (including xAI) may no longer be that interested in frontier models. They were never good at frontier models. They were mainly good at compute.

Show Me The Compute

You know who needs compute? Everyone. But especially Anthropic.

They kicked off this week with Anthropic committing to $200 billion in spending on Google cloud and chips over five years. Earlier this week, before other compute news broke, I wrote that this was still very much not enough compute, and then added this:

Elon Musk spent to assemble a massive fleet of GPUs for xAI, and they are sitting at 11% utilization. You know, there are people who would pay good money to utilize those GPUs the other 89% of the time.

To be fair, I was far from the only one thinking and saying this, e.g. see The All-In Podcast. It was pretty obvious.

Well, yeah, it turns out those people will indeed pay good money. Anthropic has finally struck the obvious deal with SpaceX for access to Colossus 1. This is not as large as their other deals, but it comes online now instead of next year. This is in addition to supplying a bunch of compute to Cursor (SpaceX is effectively buying Cursor, but can’t finalize the deal before its IPO for legal and logistical reasons).

Claude: We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity.

This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.

Claude: Effective today, we are:

Doubling Claude Code’s 5-hour rate limits for Pro, Max, and Team plans;

Removing the peak hours limit reduction on Claude Code for Pro and Max plans; and

Substantially raising our API rate limits for Opus models.

Claude: Our agreement with @SpaceX means we will use all the compute capacity at their Colossus 1 data center.

This will give us over 300 megawatts of additional capacity to deploy within the month.

NVIDIA: Two frontier labs. One accelerated computing platform. Congrats to @SpaceX and @AnthropicAI on the new compute partnership, powered by 220,000+ NVIDIA GPUs inside Colossus 1. The future of AI runs on NVIDIA.
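
For a rough sense of scale, the two figures quoted above can be combined into a power budget per GPU. This is a back-of-envelope sketch only: it assumes the 300 megawatts and the 220,000 GPUs describe the same Colossus 1 hardware, and it treats both ‘over’ figures as exact, neither of which the announcements state. The function name is illustrative.

```python
# Back-of-envelope: facility power per GPU at Colossus 1.
# Assumption: the "over 300 megawatts" and "220,000+" figures describe
# the same hardware; both are treated as exact for illustration.
CAPACITY_MW = 300
GPU_COUNT = 220_000


def kilowatts_per_gpu(capacity_mw: float, gpu_count: int) -> float:
    """Facility capacity divided evenly across GPUs, in kilowatts."""
    return capacity_mw * 1_000 / gpu_count


print(f"{kilowatts_per_gpu(CAPACITY_MW, GPU_COUNT):.2f} kW per GPU")
```

Under those assumptions this comes to roughly 1.4 kW per GPU, a number that would presumably cover cooling, networking and other overhead rather than the chips alone.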

SpaceX notes Anthropic has expressed an interest in partnering to produce gigawatts of orbital AI compute capacity. I don’t expect that to be a thing, but sure, why not express the interest? Let Elon Musk try, if the economics work then putting the centers in space is great on many other levels, if not then no harm done, and you have built goodwill either way.

Anthropic notes that the 80x growth caught them off guard, which is highly understandable, and the SpaceX deal is a first attempt to address the compute shortage but the search continues.

Anthropic likely will be in search of all the compute it can find for the foreseeable future. If you are growing at 10x let alone 80x per year, the search does not stop.
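
To put those multiples in monthly terms, here is a minimal sketch. It assumes growth compounds smoothly across the year, which is a simplifying assumption and not something Anthropic has stated, and the function name is purely illustrative.

```python
def monthly_growth(annual_multiple: float) -> float:
    """Constant month-over-month growth rate that compounds
    to the given multiple over twelve months."""
    return annual_multiple ** (1 / 12) - 1


# The two annual multiples mentioned in the text.
for multiple in (10, 80):
    rate = monthly_growth(multiple)
    print(f"{multiple}x per year is about {rate:.0%} per month")
```

Under that assumption, 10x per year works out to roughly 21% more demand every month and 80x to roughly 44%, which is why a deal that comes online now matters more than a larger one next year.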

So what does all this mean for SpaceX(ai)?

I think the dissolution is not news. The news is that xAI lost its talent, and its models have been not good, and Elon already said he would be starting from scratch.

The logical plan is to turn this into mainly a compute company, provide that compute to Anthropic and others, and use that leverage to try and steer the future.

rohit: Elons extraordinary hardware genius shows up again. He fumbled the model but built a neocloud thats highly competitive and works great for frontier labs.

Also, fwiw, I pointed this out 4 years ago. That Elon's unique talent is suited better to some things than others. Getting a neocloud up and running is a known but hard thing to do, getting a model to be as good as the frontier labs is an unknown and hard thing to do.

This is a great deal for both parties btw.

Derek Thompson: I don’t think I’ve seen this take before but I like it.

Musk has been world-leading at compressing money, resources, and time to make “known/hard” things at scale—make an electric car, make batteries, make a cheaper bigger rocket, all of which already existed but worse, at less scale, or more expensively —but he’s less than world-leading at cracking open breakthroughs in more unknown spaces.

So it would make sense that XAI is lagging the frontier labs on new AI agents, but also that he’d have built a neocloud to power those models once they run short of compute

Dean W. Ball: I would be very excited about xAI/SpaceX as an AI infrastructure firm. Elon’s great strength—where he is truly GOATed—is building things in the real world. Colossus came online faster than anyone expected. Huge asset for America.

Elon Musk repeatedly looks at problems, says ‘oh it is physically possible to do that,’ strips away everything physically unnecessary, does not take no for an answer, learns every technical detail, and then drives very smart people to spend insane hours making the physically possible thing happen. He embodies Shut Up And Do the Impossible, but for the kind of impossible that is a game difficulty level that is indeed totally possible with known tech.

He has his heuristics. When they work, there’s no one better. For compute it works.

Trying to create frontier models is a different beast. It requires a different style of approach, the same way government required a different approach. It didn’t work with OpenAI, and it didn’t work with xAI. That’s okay. Division of labor is a thing. He’s creating and also has plenty of other problems.

I still don’t actually believe in the orbital data centers, in the sense that I don’t think they’re physically a good idea. But if they are, yeah, Elon Musk is the one to do those.

On Your Marks

The creators of SWE-Bench give us ProgramBench, where you recreate executable programs from scratch without the internet. All current models tested score 0%, with Opus 4.7 on top for getting an ‘almost’ 3% of the time. GPT-5.5 and Mythos were not tested.

GPT-5.5 represents a huge jump on VoxelBench.

Epoch’s ECI now can distinguish areas of capability, and as expected shows that Claude’s relative capabilities are strongest in software engineering, where it scores highest. GPT-5.5 has the highest general score.

Copyright Confrontation

New class action lawsuit from five publishers and Scott Turow goes after Meta for copyright infringement around model training, claiming they trained on pirated books.

Deepfaketown and Botpocalypse Soon

r/MyBoyfriendIsAI continues to be 10x the size of r/MyGirlfriendIsAI.

Some light reading:

John Arnold: hahahahhaha

Imke Reimers & Joel Waldfogel: The diffusion of LLMs from 2022 to 2025 tripled new book releases. While average book quality, measured by usage, declined, the surge in releases raised the number of modest-quality books. Direct evidence using AI detection shows that AI-containing books have lower quality, and their rising share – topping half of 2025 releases – drives the overall decline. A nested logit calibration shows that AI books raised consumer surplus by seven percent in 2025. Author selection accounts for most of the AI quality differential, and the AI-human differential shrinks over time. Finally, AI has not displaced authors active prior to LLMs.

The idea that consumer surplus is higher is based on the assumption that consumers can filter well and have little additional search cost. Those extra 200,000 slop books don’t matter because no one chooses them, and more choice is always good. I don’t think that’s how this works. Worse books that displace better books are negative value, even among books written reasonably by real humans.

Fun With Media Generation

Karpathy vibe coded a system to put pictures next to items on a menu, but Gemini reportedly now does that with a one line prompt. There will be many such cases. That doesn’t mean you shouldn’t vibe code such tools, but you should require them to ‘pay for themselves’ relatively quickly. I tested this on my favorite restaurant, and found Gemini’s version not to be useful. ChatGPT did better. I think to upgrade further from the OpenAI version you’d need to be going on the web to learn about the restaurant.

Put yourself in all the movies.

A Young Lady’s Illustrated Primer

Some classes are adjusting to AI by having writing be in person, since the take home essays are mostly written by AI. Good.

Cyber Lack of Security

Bloomberg’s Andrew Martin covers why Anthropic’s Mythos is sparking global alarm. The world has still patched less than 1% of potential vulnerabilities. Hurry up, people.

They Took Our Jobs

Coinbase cuts workforce by ~14%, cites productivity gains from AI and transition to being AI-native as the central justification. A new rule is ‘no pure managers.’

Chinese judge rules that ‘the AI can now do large parts of your job for you’ does not constitute a ‘major change in objective circumstances,’ meaning in practice that if they fire you or try to lower your pay they have to give you full severance, which can be a lot. Labor law still applies, and yes China has labor protections.

The Art of the Jailbreak

You cannot simply ask Grok to tell you that Elon Musk is made of cheese. Pliny can.

Introducing

GENE-26.5, a robotic brain from Genesis.ai, with an attached demo, including letting it cook, play a piano and solve a Rubik’s Cube. I did not feel much because I mentally had this priced in, but many of you are not pricing this in.

Musk v OpenAI

The lawsuit is in its critical phases. Here is a Wiki with statements from the trial.

Rat King has a thread covering Musk’s testimony.

rat king: i am not really sure how often lawyers try to endear themselves to judges but Musk's lawyer, Steven Molo, does not seem to be trying to do that

right now he's trying to get "extinction risk" discussion into the court discussion.

"This is a real risk. we all could die."

I mean, he’s not wrong, and I hope Judge Gonzalez is also not wrong here:

rat king: judge Gonzalez: "I suspect that there are a number of people who do not want to put the future of humanity in Mr. Musk's hands. But we're not going to get into that. This is not a trial on the safety risks of artificial intelligence."

Ultimately, yes, we are in the full Don’t Look Up timeline, with lines like this:

TBPN: The judge presiding over the OpenAI-Elon trial has prohibited the lawyers from dwelling on doomerism and x-risk.

"She's like, 'Look, that's kind of a sideshow distraction. Extinction of humanity stuff is not the point of this case.'"

The judge is technically correct, but yeah, that’s kind of how the world ends, huh?

Here’s a f
