The Zvi·2026年6月25日 05:03·約24分で読める

「かつてありし未来の寓話」第4回：Claude Code の新バージョンが Fable 5 の復活を示唆

#LLM #規制 #Claude Code #Amazon Bedrock #輸出規制

TL;DR

Claude Code の更新や Amazon Bedrock への再登場により Fable 5 の復旧確率が上昇している一方、米政府による AI 輸出規制と企業への強制的なレビュー要請という政策的混乱が業界全体に深刻な影響を与えている。

AI深層分析2026年6月24日 22:04

重要/ 5段階

深度40%

キーポイント

Fable 5 の復旧兆候と利用制限の定着

Claude Code v2.1.190 の更新や Amazon Bedrock への再登場により、7 月までの Fable 5 復旧確率が 60% から 88% に上昇。ただし、今後はサブスクリプションに組み込まれた定額利用枠（クォータ）での提供が予想される。

米政府による AI 規制と「強制」の実態

トランプ政権下で AI 輸出規制やモデルの政府レビューが事実上義務化されており、OpenAI や Anthropic は同意したが Meta が唯一の抵抗勢力となっている。この政策は「任意」というより実質的な強制である。

NSA のアクセス喪失と信頼性の危機

これまでの政策的混乱により、米国家安全保障局（NSA）が重要な AI システム「Mythos」へのアクセスを失う事態が発生し、アメリカの AI スタック全体の信頼性が問われている。

政策の逆転思考による批判

著者は、もしカマラ・ハリス大統領が理由もなく Grok を輸出規制していた場合の業界反応を想像させることで、現在の AI 規制政策がいかに不合理で混乱を招くものであるかを指摘している。

開発の継続と内部化

一般公開を制限してもAIの開発は止まらず、むしろリソースが解放されて加速する可能性があり、最良のモデルが規制を避けるために社内に留め置かれるリスクが高まっている。

競争とビジネスモデルの維持

オープンソースや競合他社との競争に勝つため、企業は事業モデルを守るために常により高度なシステムを訓練し続ける必要があり、規制の不確実性は開発を最大限に進める動機を弱めない。

政府規制の予測困難性

米国政府による輸出管理や制裁への反応は予測不可能であり、展開の遅延や不確実性が事業上のメリットを多少減らすとしても、開発を止めるには至らない。

影響分析・編集コメントを表示

影響分析

この記事は、AI モデルの可用性に関する技術的な動向（Fable 5 の復旧）と、それを覆う政治的・規制的な不確実性（輸出規制と政府レビューの強制）が同時に進行している現状を浮き彫りにしています。特に、米国の主要 AI エコシステムにおける信頼性の低下や、企業に対する事実上の強制的なコンプライアンス要求は、開発者や利用者が直面するリスク環境を根本的に変える可能性があり、業界全体の方針転換を迫る重要な示唆を含んでいます。

編集コメント

技術的な復旧の兆候が見える一方で、米政府による強硬な規制介入が業界の信頼性を損ないつつあるという深刻な二重構造が描かれています。特に「任意」とされる規制の実態が強制である点は、今後の AI ガバナンスを考える上で重要な示唆となります。

実際、かなり良く見えますね。

確率が大幅に低下した後、再び好調な様子です。7 月 1 日までに復元される可能性は 60%、7 月 31 日には 88% と見られています。これは、各地で基盤整備が進んでいるように見える状況を受けてのものです:

leo: BREAKING: Claude Code v2.1.190 は、Fable 5 の復帰に向けた準備を示唆するいくつかの文字列変更を導入しました。これにより、Fable 5 は毎週の利用制限付きでサブスクリプションに恒久的に組み込まれることになります。

「今週の利用枠をすべて使用しました」という文字列が追加され、「プランとは別に購入する必要があります」という文言は削除されました。

leo: UPDATE: Fable 5 は now Amazon Bedrock にも再び登場したと報じられています。

上記の情報に基づいた更新のみを根拠とする場合、新しい確率評価は過信であると考えられます。復元への確信がなくても、その瞬間に備えるためにこれらの措置を取ることは合理的です。

これはまた、サブスクリプションユーザー向けに Fable の恒久的な利用枠が設定される可能性も示唆しています。たとえ少量の割り当てであっても、これは大きな意味を持ちます。なぜなら、わずかな割り当てでも、サブスクリプション内でコーディング以外のタスクや軽微なコーディングタスクに使用できるからです。

この見通しを踏まえると、これまで何が起こり、今後何が起きる可能性が高いのかという理解を更新し、全体をどう捉えるべきかを考える時期が来たようです。

非常にひどい政策

アクセスが来週に予定通り回復したとしても、これは大失敗でした。

ディーン・ボール氏と私は、皆さんにこの状況を逆転させて想像していただきたいと思います。

デーン・W・ボール：カマラ・ハリス大統領が、理由を一切開示せず、grok に対して無期限の輸出管理を実施した場合、さまざまな評論家や業界関係者が何と言ったかを想像してみてください。

選ばれた実装はあまりにも大惨事であり、同盟国を怒らせただけでなく、「アメリカ製 AI ストック」全体の信頼性に疑問を投げかけただけでなく、NSA が Mythos にアクセスできなくさせてしまいました。

この文章を読んでいる誰かが、新しい AI 大統領令が実質的に「任意（voluntary）」であると考えているとは決して思わないでしょう。ジェシカ・ティリッップマンは、連邦政府の調達が一連の方法であり、その規則が下請け業者にも適用されると示しています。実際にはそれよりもさらに任意性はありません。なぜなら、政府は必要に応じてあなたに圧力をかけるからです。現在、孤立した抵抗者である Meta に対して行っているようにです。

アンドリュー・カラン：トランプ政権は、META に自社のモデルを政府による任意の審査に提出するよう同意させるよう圧力をかけています。Meta は現在唯一の抵抗者であり、OpenAI、Anthropic、Google、xAI、Microsoft はいずれもこれらの条件に合意しています。ニューヨーク・タイムズ（NYT）による報道。

Meta のスポークスフランシス・ブレナン氏によると、同社は「詳細を調整中で、近々契約に署名する予定」と述べています。

国民の声が聞こえた

つまり、私たち（ほぼ全員）がそう言っているのです。

朗報は、これらの決定が Twitter における公衆投票やその他の公衆投票によるものではないという点です。一般大衆はこの問題について理解していません。一般大衆は何を話しているのか全く分かっておらず、その判断は正解とはほとんど無関係なものでしょう。

Thank You, Next

一般大衆は、さらに通知があるまで Fable 5 を利用することはできません。

それは、研究所がより能力の高い AI モデルの開発を停止するということではありません。

Andrew Curran: 訓練から、より能力の高い Mythos の新バージョンが出現しました。これが Mythos 5.1 と呼ばれるのか、Mythos 6 と呼ばれるのか、あるいは Anthropic がさらなる開発を加速させるためにこれを内部に留めるのかは分かりませんが、それは既に到着しています。

Fable 5 や Mythos 5 のようなモデルを一般大衆に提供することを停止しても、開発のスピードを遅らせることにはなりません。実際、リソースを解放することで、わずかに開発が加速する可能性があります。また、現在のモデルが embargo（供給制限）下にある間も研究所が能力の向上を続けることを防ぐ規則はなく、発表するまで進捗を静かに保つことも可能です。どの研究所も停止したり、開発を遅らせたりすることはできません。

これの証明として、GLM-5.2 の能力の高さを単に見れば十分です。自らのビジネスモデルを守るために、最先端の研究機関はオープンソースや互いの他社に先んじるため、より高度なシステムを継続的に訓練し続けなければなりません。氷の下では現在の流れが激しく吹き荒れ、私たちは目的地に向かって競い続けています。

₿AX: 出典は？

Andrew Curran: 私です。

Phil: つまり、あなたはでっち上げたのですか？

Andrew Curran: いいえ。

Andrew Curran: オリジナルの『Mythos』のトレーニング完了は2月7日でした。十分な時間があります。

Anthropicが内部でMythos 5を使用して次のMythosを訓練できず、より劣るモデルに頼らざるを得ないことで一時的にやや足踏みした時期がありました。その期間はすでに終わっているかもしれません。Curranが出典なしででっち上げるとは思いませんが、仮に彼が事前の知識（priors）のみに基づいて主張していたとしても、その予測はそれほど外れてはいないでしょう。

もし最良のモデルを公開しようとした結果、それが制限される可能性があれば、AnthropicやOpenAIはそれを内部留保する判断を下すかもしれません。ここが最大のリスクとなる場所です。

Law360 のこのレポートをご覧ください。「Anthropicの輸出管理措置が予期せぬ制裁への恐怖を煽る」。今後、米国政府がどのように反応するかを確実に予測することは不可能です。

展開できないこと、あるいは展開に関する不確実性や遅延は、開発を推進するビジネス上の合理性をわずかに低下させますが、最大限の努力を払う必要性まで消し去るほどではありません。他に選択肢はなく、企業の評価額が関連コストを十分に賄える水準に保たれているからです。

非常に静かにせよ

この予測がどの程度成立するか、私たちは見ていくことになります。

Peter Wildeford（6 月 21 日 7:07pm）：主要な AI ニュースが整整八日間全くないという事実は、ちょっと奇妙だよね

Dean W. Ball：アメリカの AI 業界全体は、USG が彼らが迷い込んだ「Fable」事態を解決するまで、新たな公開リリースから事実上凍結されていると私は推測します。

Kevin A. Bryan：もちろん、リリースからは凍結されていても、開発からは凍結されていないのです！

Dean W. Ball：その通り。むしろ開発はもっと速くなるかもしれないね

これらのベビーたちが何ができ、何ができないか

これはもはやニュースではありませんが、 stakes（賭け事）を思い出させるために：Mythos の完全版は、十分な情報、時間、文脈を与えられれば、複雑な対象であれば基本的に何でもハッキングできます。

それができないのは、外部攻撃者が持たない様々な他の利点を与えられない限り、数時間ですべての機密ネットワークに侵入することです。

実際に起きたことは、物理的にエアギャップされたシステムへのアクセス権限を与えられた専門の承認されたレッドチーム（敵対的シミュレーションを行うテストグループ）が、数時間以内に私たちの機密システム内の問題点を特定したという事実です。

Dustin Volz と Julian E. Barnes：実際には、このテストは「レッドチーム」である N.S.A. 分析官たちによって行われ、彼らは敵対者が再現することは極めて unlikely（ありえない）環境で Mythos を使用していたと当局者は述べています。レッドチームは、特定のコンピューターからのみアクセス可能であり、広範なインターネットから完全に遮断されたように設計された機密 N.S.A. システム内でテストを開始しました。

明確にしておくと、これは依然として印象的であり、期待を上回るものでした。

Mythos は本物です。

ダスティン・ヴォルツとジュリアン・E・バーンズ：NSA の庁舎内においても、この統制されたテストの結果は印象的でした。ワシントンの外に位置する NSA は、外国の敵対者に対するデジタル諜報技術の開発や、米国のネットワークをサイバー攻撃から守ることを専門とする秘密の要塞です。

…それでもなお、NSA が一部の者が懸念していた終末的な事態を経験しなかったにもかかわらず、諜報機関の分析官たちは、Mythos が統制されたテスト環境においてどれほど有能であるかに驚かされました。これはすでに高い水準にあった期待さえも上回るものでした。

一連の誤解が、「Mythos は数時間で NSA のほぼすべての機密システムに侵入した」という引用文を生み出すことになりました。

Mythos は大きな反響を呼び、最初の結果は今でも大きな意味を持ちます。

しかし、政府内部を含む一部の者たちのように、過剰な反応は慎むべきです。

ブル理論：NSA 自身の長官は、Mythos が読隊演習中に数時間で機密システムのほぼすべてに侵入したと述べています。

『エコノミスト』紙によると、上院情報委員会の副委員長であるマーク・ワーナー上院議員は、NSA とペンタゴンのサイバーコマンドを率いるジョシュア・ラッド将軍がこれを直接彼に語ったと述べました。

これは 6 月 11 日、つまり Amazon が Anthropic のモデルで別の脱獄（ Jailbreak）を発見したと報じられた同じ日に明らかになりました。数時間後、トランプは [フェイルをオフラインにする] と命令しました。

IRIS C2：この物語の背景を理解するお手伝いをしましょう：ペンタゴン、サイバーコマンド、そして NSA にはすべてレッドチームが存在します。

これらのレッドチームのフルタイムの仕事は、機密システムへのハッキングを試みることです。これらのレッドチームで働く人々（その多くを私は知っています）は、途中で遭遇する可能性のある機密情報を見る許可を受けています。

ほぼすべての演習において、レッドチームはまず、エアギャップされた機密ネットワークに対する何らかの初期アクセスから始めます。

Rudd 将軍の発言は、Mythos にアクセスできる誰でもが、外部世界からこれらのネットワークへの初期アクセスを獲得できることを意味するものではありません。

彼が言っているのはこうです：米政府のレッドチームは、Cobalt Strike やその他諸々を使用していた状態から、Cobalt Strike やその他諸々に加えて、検閲されていないバージョンの Mythos を使用できるようになり、そして驚くべきことに、Mythos にアクセスできるようになると、より成功し、より迅速に運用できるようになったのです。

現実世界の例を挙げましょう：1 月、NSA のレッドチームは、機密ネットワーク内の Cisco ルーターが最近の CVE の対象であることを知っているかもしれません。それが、レッドチームがそれに対処する信頼性の高い n-day エクスプロイトを作成できる立場にあることを意味しますか？いいえ。通常、そのような作業に時間や予算を割く余裕はありません。また、ルーター向けに高機能な Lua 埋め込みプログラム（Lua implant）を作成できることを意味しますか？これもいいえです。

しかし今では、両方とも可能です。そして、Mythos のおかげで、比較的迅速に行うことができます。

Shashank Joshi: シャシャンク・ジョーシ氏：現在広く流通しているこの主張は、私が先週記した一文に基づいています。私は上院情報委員会の副議長であるマーク・ワーナー氏が、「NSA 長官が私に、Mythos は数週間ではなく数時間で『ほぼすべての機密システムに侵入した』と伝えた」と正確に引用しました。しかし、これを文字通り受け取ることは誤りだと考えます。おそらく Mythos を非常に特定の条件下で他のツールと共に使用した場合に限られるでしょう。私は Mythos の威力を伝えるためにこの引用を行いました。ただし、注意点を加えなかったのは間違いでした。

追記：米国の当局者が私に語ったところによると、この件においてワーナー上院議員は NSA 長官の Rudd 将軍の発言を誤解していたとのことです。Rudd 氏は確かに「数週間ではなく数時間」という表現を使用しましたが、この文脈における Mythos の使用は、広く想定されている通り、レッドチーム演習（red-teaming effort）、すなわち内部ネットワークのセキュリティを検証するテストの一部でした。当局者はまた、同機関のレッドチームが now Mythos にアクセスできなくなったと伝えました。その理由は、Mythos へのアクセス権限が「グラスウィング・プロジェクト（Project Glasswing）」の下にあったためです。

Ed Tarnowski: たとえ真実だとしても、パニックを起こす理由にはなりません。

すでにいくつかのオープンソースモデルは、OpenBSD に関する Mythos の発見と同様の成果を上げる能力を実証しています。

バグは引き続き存在し続けます。防御側に最先端モデルへのアクセス権を与えることで、彼らは現在それらを見つけ、パッチを適用できるようになります。

実際に何が起きたのかを整理するのに数日かかりました。この引用は、政治家によって繰り返し引用されるゾンビ化した誤った物語の一つとして残り続ける可能性が高く、Fable をオフラインにするためにそしてそれを維持するために実質的に影響があったと考えられます。

Bull Theory は残念ながら、他のいくつかの誤解も繰り返しています。例えば、Anthropic が外国人へのアクセスを切断する代わりに Fable をオフラインにしたという考えです。しかし、Bessent からの明確な確認により、目的はモデルをオフラインに強制することだったことがわかっています。

Connor Leahy: 最近の Mythos/NSA の警告射撃 [引用部分のこと] は、私が「警告射撃」がどのようなものだと予想していたかのように見え始めています。

全く信じられないような出来事が起こり、その後に FUD（恐怖・不確実性・疑念）マシーンが作動し、「文脈」という言葉が無限大の量だけ追加され、もはや何が真実かわからなくなり、誰もが都合の良いことを信じるようになります。

これもまた、「警告射撃を待っていれば、皆が突然合理的になる」という変化の理論がなぜ欠陥があるのかを示しています。

状況を明確にし、人々を合理的にするためには、多くの作業が必要です。

最悪の場合どうなるか？

Teortaxes: Zvi からの非常に美味しいアップデートパッケージです。そしてはい…彼らは理解し始めています。「スーパーハンマーハッカー」のまさにこのことが、ハッキングに関する話題を完全に終わらせるための全目的なのです。

私は、もし誰もが同じく十分に高度な超人的 AI ハッカーを持っていた場合、火力を特定の標的に集中させる能力があるにもかかわらず、これが攻撃よりも防御に有利になるとする理論がなぜ理解できないのか、ずっと不思議に思ってきました。もちろん、「しばらくの間はコードの多くはまだ超人的ではない」という問題については無視しています。

そこで私は Twitter で質問しました。

Teortaxes 氏や他の人々は形式検証（formal verification）を指摘しました。この概念が存在することは私も認識していますが、同時に社会的エンジニアリング、詐欺、なりすましも存在しますし、システムをハックする他の形式的枠組み外の手法も潜在的に存在する可能性があります。また、より大きなシステムにとって重要な部分の形式検証は非常に困難であり、効率的なコードにはなりません。

Teortaxes 氏：これは確かに非常に深刻な質問です。はい、私は形式検証が存在するため、そのように信じています。これは脆弱性検索とは少し異なりますが、概念実証として、MathArena で anyhow 首位を争う汎用 10T LLM が存在します。Fable または Fable+ は*壊れない*ソフトウェアを書くことができるでしょう。

すべての Web 公開ソフトウェアを保護することは明らかに超人的なタスクですが、主な問題は労働コストです。最終局面では、攻撃されたシステムにおける最も弱いリンクはポスト量子暗号（post-quantum cryptography）であり、どこかへ進むためには現代数学の否定が必要になります。

Glasswing はそれではありません。Glasswing の存在意義は、GLM 5.4 がすべての MRI スキャナと iPhone にランサムウェアを仕掛けることで米国が崩壊しないようにすることです。現在のエコシステムは脆くゴミのようなものです。

どのようにしてそこに到達し始めるのかについては、[The Great Refactor from IFP in 2025] をお読みください。

entirelyuseless: 非超人間的なプログラマーたちも、100 倍の努力を費やしても破ることができなかったソフトウェア（広く使われている暗号化システムなど）をすでに作成しています。したがって、一般的に可能であることは明白です。

singularvessel: これは明らかに単純なコードであれば可能ですよね？問題は、必要な労力がコードの複雑さにどのように比例して増加するかという点だけです。

NondescriptTransfer: はい、間違いなくそうです。チェスに似ていますね。勝利とは、相手が完璧なプレイから逸脱した隙を突くことです。だからこそ、上位レベルになるほど引き分けが増えるのです。

セキュリティ研究も同じです。ただし、ここでは防衛側が引き分けを獲得します。能力が向上するにつれて、私たちは安全なコードの漸近線に近づいていきます。また業界の動向を見れば、全体的なコードはかつてよりもはるかに安全になっていることがわかります。AI はこの傾向をさらに加速させるでしょう。

しかし、結局のところすべてのハッキングは、どこかの時点で誰かが愚かな行動をとったことに遡り、そのハッカーが最初にそれを見つけ出したという点に帰着します。AI がより賢くなり、愚かな行動をする頻度が減り、コードに対する監視の目（AI を実行する防衛側から）が増えるにつれて、そのような事例はさらに減少していくでしょう。

チェスの面白い側面の一つは、プレイレベルに関わらず人々がほぼ同じ数の『愚かなこと』を犯すという点です。そのプレイレベルの視点から見れば、プレイが上達するほど多くのことが『愚か』としてカウントされるようになるからです。理論的には完璧なチェスは可能ですが、AI が私が完璧なプレイの可能性と考えていた領域を超えてさらに進化し続ける様子を見続けています。私のセキュリティに関するモデルもこれと似ています。

私の理解では、コードが非常に明確に定義された単純なタスクを実行したい場合、意味のある形式的検証は達成可能な範囲内です。しかし、多くの事柄はそれほど単純ではありません。

MRI スキャナーや iPhone のすべてにランサムウェアが存在することを確実に避ける必要があります。それはかなり悪い事態になり得ます。

Arthur Tellis (QTing Teortaxes): はい、しかし私はこれらのトレードオフを緩和する方法があると考えています：例えば、Fable の脆弱性調査/セキュリティ工学への利用を、特定のユーザーのために生成されたソフトウェア、または特定のユーザーに属すると検証できるソフトウェアに限定することができます。これは持続可能な均衡状態ではないかもしれませんが、短期的にはこれらのシステムの用途が脆弱性調査ではなくハードニング（堅牢化）へと偏向させることになります。

私はこれを成し遂げられるとは思いませんが、仮に成し遂げたとしてもそれは非常に厄介な手間となり、Fable を最も価値のあるソフトウェアプロジェクトの広範な範囲で使用することを排除することになります。

しかし一般的には、この特定の機能バグのリスクを過大評価している我认为思います。Glasswing 構造内において、IT メジャーがゲートなしモデルを持ち、トークン予算が高く、任意のコードベースについてより明瞭かつ暗黙的な文脈を持っている場合、Fable アクセスを持つ脆弱性研究者よりも迅速に脆弱性を特定できるはずです。さらに、セキュリティ工学/脆弱性研究における Fable の利用に対する KYC（本人確認）と分類器駆動型の観測性は、攻撃的研究者にとって少なくとも中程度の参入障壁として機能するはずです。

ここで最も強力な論拠は、データ保持であり、ある程度の KYC（本人確認）と分類器駆動型の観測可能性が、真の防御手段となることです。

つまり、Anthropic が Fable トークンを大量に使用して主要なサイバーセキュリティ関連のオペレーションを行っているユーザーを特定したい場合、それは確実に可能であり、その行動をユーザーの評判と紐付けることもできます。

これは完全無欠なものではなく、「ジールブレイク」を完全に「修正」するものではありませんが、攻撃側のオペレーションにおける実質的な難易度、リスク、コストを大幅に引き上げる、多層防御の一環としての障害の一部です。

Anthropic も行ったと予想されるように、そのような対策を十分に講じれば、Fable の影響のバランスは変化し、リリースされた状態の方が安全であるという結論になります。

しかし、真のポイントは、IT 経済におけるコンピュータネットワークオペレーション（CNO）やその危険性において、0 日脆弱性の悪用が過大評価されているということです。サイバーオペレーションの绝大多数は 0 日脆弱性の悪用に依存しておらず、これは効果的な監視や情報ドメインの分離などを通じて一般的に対処可能です。

重要なのは「効果的な悪用」です。これには悪用の連鎖が必要です。Mythos が特別だったのは、理解する限りにおいて、それが独自にそのような連鎖を見つけ、完了できる能力を持っていたからであり、しばしば予期せぬ方法でそれを実現した点にあります。

一方、世界中におけるサイバーエスパーション/攻撃のコストは、IT 経済が生み出す総価値の単一ベーシスポイント未満であり、民主化された機械知能が生み出す価値の単一パーセント未満である可能性が高い。私たちは、このオムニユース技術がプログラム解析およびセキュリティエンジニアリング・脆弱性研究において持つ莫大な能力を根拠に、AI モデルのリリースに対する過度なセキュリティ化が行われているのを目撃している。

ここがアーサーが私を失う点だ。サイバーセキュリティの価値は「すべての IT 価値の単一ベーシスポイント未満」ではない。

サイバーセキュリティの価値とは、要するにすべての IT の価値そのものである。なぜなら、あなたのサイバーセキュリティが完全に失敗すれば、もはや IT は存在しないか、それよりもさらに悪い状況になるからだ。

テールリスク管理は確かに時として最優先されなければならない。しかし、それが「AI が全員を殺すかもしれない」という主張を必要とするわけではない。これは、「パンデミック対策は国家予算のごく一部に過ぎないから」という理由で生物兵器について心配する必要がないと言うのと同じだ。生物兵器に対する適切な懸念レベルが何であれ、その議論は生物兵器を心配しないための非常に悪い理由である。

問われるべきは、サイバーセキュリティにおいて通常どれほど多くのことが間違っているかではない。それは、サイバーセキュリティが崩壊した世界で何が間違うのかという問いだ。もし国家セキュリティシステムに侵入されたらどうなるか？ハッカーがほぼあらゆる企業ネットワークに侵入したらどうなるか？病院やその他の重要インフラがオンライン状態を維持できなくなったらどうなるか？もはや安全ではないため、オンラインでさまざまなことができなくなった場合どうなるか？

政策立案者は再調整を行う必要があります。AI が私たちの時代の革新的技術であるのは、科学生産、ソフトウェア開発、労働の増強におけるその価値によるものであり、この価値はパッチ逆エンジニアリングが提示するリスクを完全に上回ります。パッチ逆エンジニアリング自体が蔓延する病気に過ぎないからです。

原文を表示

It does look good, actually.

After the odds had dropped quite a bit, they’re looking good again, with a 60% chance of restoration by July 1 and 88% by July 31, in the wake of groundwork looking like it is being laid in various places:

leo: BREAKING: Claude Code v2.1.190 introduces several string changes that hint at preparations for a Fable 5 return, with it being permanently included in subscriptions with weekly usage.

The string "You've used your Fable 5 usage for this week" has been added, and "purchased separately from your plan" has been removed

leo: UPDATE: Fable 5 has now reportedly also reappeared in Amazon Bedrock

If the update is based purely on the above info I would treat the new odds as overconfident. These moves seem reasonable to make even if you have no confidence in the restoration, in order to be ready if that moment arrives.

This also suggests a potential permanent quota for Fable for subscribers. Even a modest amount is a big game here, since even a modest allocation means you can use it for non-coding tasks or minor coding tasks within the subscription.

With that on the horizon, it seems time to update our understanding of what has happened and is otherwise likely to happen, and how to think about all this.

A Rather Terrible Policy

Even if access is restored in the coming week as expected, this was a fiasco.

Dean Ball and I both invite you to imagine this situation in reverse.

Dean W. Ball: I invite you to imagine what various commentators and industry actors would have said if president Kamala Harris had export controlled grok indefinitely and for no disclosed reason.

The chosen implementation was such a train wreck that not only did we piss off our allies and call into question the reliability of the entire ‘American AI stack,’ we also caused the NSA to lose access to Mythos.

Hopefully no one reading this thinks the new AI executive order is meaningfully ‘voluntary.’ Jessica Tillipman illustrates that Federal procurement is one set of ways they will be imposing the new licensing mandate, with the rules flowing down to subcontractors. It’s actually a lot less voluntary than that, because the government will lean on you as needed, as it is currently leaning on lone holdout Meta.

Andrew Curran: The Trump administration is pressuring META to agree to submit its models to the government for voluntary review. META is now the only holdout, as OpenAI, Anthropic, Google, xAI and Microsoft have all agreed to these terms. Reporting by the NYT.

Meta says they are ‘working through the details and hope to sign an agreement soon’ as per Meta spokesperson Francis Brennan.

The People Have Spoken

So say we (almost) all.

The good news is, these decisions are not up to a public vote, either of Twitter or otherwise. The public does not understand these issues. The public has no idea what it is talking about, and its decisions will be mostly uncorrelated with the right answer.

Thank You, Next

The public does not get to use Fable 5 until further notice.

That does not mean the labs are going to stop developing more capable AI models.

Andrew Curran: A new, more capable version of Mythos has emerged from training. I don't know whether it will be called Mythos 5.1 or Mythos 6, or if Anthropic will keep it internal to accelerate further development - but it has arrived.

Stopping models like Fable 5 or Mythos 5 from being served to the public does nothing to slow down development. In fact, it probably speeds it up slightly by freeing up resources. There are also no rules preventing the labs from continuing to advance capabilities while any current model is under embargo - or from keeping progress quiet until they choose to release it. None of them can afford to pause or slow down.

We need only look at how capable GLM-5.2 is as proof of this. To protect their business models, the frontier labs must continually train increasingly capable systems to stay ahead of open source, and each other. The current continues to rage beneath the ice, and we continue to race toward our destination.

₿AX: source?

Andrew Curran: Me.

Phil: So you made it up?

Andrew Curran: No.

Andrew Curran: Original Mythos finished training February 7th. They have had plenty of time.

There was a brief period where Anthropic will have been somewhat slowed down by inability to use Mythos 5, internally, to train the next Mythos, forcing it to fall back on worse models. That period may already be over. I don’t think Curran would make this up without a source, but even if he claimed it purely on priors, those priors are not going to be that far off.

And if trying to release their best model gets it potentially restricted, Anthropic or OpenAI might well decide to keep it internal. Which is where the biggest risks are.

See this Law360 report, Anthropic Export Controls Stir Fear of Unforseen Sanctions. It is impossible to reliably predict how the US government will react going forward.

Inability to deploy, or uncertainty and delays around deployment, do modestly reduce the business case for pushing development, but not enough that you wouldn’t push maximally hard. There is no alternative, and the valuations of the companies make the related costs highly affordable.

Be Very Very Quiet

We will see if this prediction holds up.

Peter Wildeford (June 21, 7:07pm): The fact that we've gone eight whole days without any major AI news is kinda weird

Dean W. Ball: I’d assume the whole AI industry in America is effectively frozen from new public releases until USG resolves the Fable situation they have stumbled into.

Kevin A. Bryan: Frozen from releases but not from development, of course!

Dean W. Ball: correct. if anything development can maybe be faster

What These Babies Can And Cannot Do

It’s not news, but to remind us of the stakes: The full version of Mythos can hack basically anything complex that it wants to hack, if given sufficient information, time and context.

What it cannot do is hack into all the classified networks in a matter of hours without being given various other affordances an outside attacker would not have.

What actually happened was that a professional authorized red team, given physical access to air gapped systems, identified issues in our classified systems within hours.

Dustin Volz and Julian E. Barnes: In reality, the tests involved “red teams” of N.S.A. analysts who were using Mythos in a highly tailored environment that would be extremely unlikely for an adversary to replicate, officials said. The red teams began their tests within classified N.S.A. systems designed to be accessible only from certain computers and completely cut off from the broader internet.

To be clear, this was still impressive and exceeded expectations.

Mythos is the real deal.

Dustin Volz and Julian E. Barnes: The controlled tests proved impressive even within the halls of the N.S.A., a secretive fortress outside Washington that specializes in developing digital espionage techniques against foreign adversaries and protecting U.S. networks from cyberattacks.

… Still, even though the N.S.A. did not experience the doomsday scenario some had feared, analysts at the spy agency were stunned by how capable Mythos appeared to be in controlled test settings, which exceeded already lofty expectations.

A series of misunderstandings created quite the pull quote that Mythos ‘broke into almost all NSA classified systems in hours.’

Mythos provided a lot of uplift, and that first result is still a big deal.

But let’s not get carried away, as some did, including inside the government.

Bull Theory: The NSA's own director says Mythos broke into almost all of its classified systems in hours [during a read team exercise].

Per The Economist, Senator Mark Warner, vice chair of the Senate Intelligence Committee, said General Joshua Rudd, who runs the NSA and the Pentagon's Cyber Command, told him this directly.

This came out on June 11, the same day Amazon reportedly found a separate jailbreak in Anthropic's models. Within hours, Trump ordered [Fable taken offline.]

IRIS C2: Let me help everyone put this story into perspective: The Pentagon, CyberCommand and NSA all have red teams.

The full time job of these red teams is to attempt to hack into classified systems. The people who work on these red teams (many of whom I know), are cleared to see the classified information that they might bump into along the way.

In the case of almost every exercise, the red teams begin with some kind of initial access to the air-gapped classified network.

The quote from General Rudd does NOT imply that any random Joe with access to Mythos could, from the outside world, gain initial access to these networks.

What he is saying is this: USG red teams went from having Cobalt Strike, et al to having Cobalt Strike et al + the uncensored version of Mythos—and, low and behold, they were more successful, and able to operate more quickly, once they had access to Mythos.

Here's a real world example: In January, an NSA Red Team might know that the Cisco router within classified network was the subject of a recent CVE. Does that mean that the red team was in a position to write a reliable n-day exploit to target it? No. They usually would not have the time/budget to mess around with that. Does it mean that they could actually craft a high end Lua implant for the router? Also no.

Now they can do both. And they can do so relatively quickly. Thanks to Mythos.

Shashank Joshi: Shashank Joshi: This now widely circulated claim is based on a line I wrote last week. I accurately quoted Mark Warner, vice chair of the Senate intelligence committee, saying that the NSA chief had told him that Mythos “broke into almost all of our classified systems, not in weeks, but in hours”. But it would be a mistake to read that literally, I think. It surely depends on using Mythos alongside other tools under very particular conditions. I quoted it to give a sense of Mythos’ potency. But it was a mistake not to have added caveats.

An update. A US official tells me that Sen. Warner misunderstood the NSA director Gen. Rudd in this case. Rudd did use the 'hours, not weeks' wording, but the use of Mythos in this context was—as widely assumed—part of a red-teaming effort, i.e. testing the security of internal networks. The official also told me that the agency's red-teams no longer have access to Mythos, because their authority for accessing it was under Project Glasswing.

Ed Tarnowski: Even if true, it’s not reason for panic.

Several open-source models have already proven capable, for example, of accomplishing the same Mythos discovery re: OpenBSD.

Bugs will continue to persist. Giving defenders access to frontier models lets them find and patch them now.

It took several days for us to sort out what actually happened here. The pull quote will probably remain one of those zombie debunked stories that keeps being cited by politicians, and that plausibly mattered in taking and keeping Fable offline.

Bull Theory unfortunately also repeats several other misconceptions, including the idea that Anthropic took Fable offline ‘instead of’ cutting off access to foreigners, when we have explicit confirmation from Bessent that the point was to force the models offline.

Connor Leahy: The recent Mythos/NSA warning shot [as in the pull quote] is looking more and more like how I expected a "warning shot" to look like.

An absolutely insane thing happens, and then the FUD machine kicks into action and adds infinity gazillion """context""" until no one knows what is true anymore and everyone believes what is most convenient.

This, once again, shows why the theory of change of "we just wait for a wArNiNg ShOt and then everyone will suddenly be reasonable!" is flawed.

We need to do a lot of work to make the situation legible, and people reasonable!

What’s The Worst That Could Happen?

Teortaxes: A very tasty update package from Zvi

and yes… they’re beginning to understand. This is the whole damn point of “superhuman hackers”. To close the whole topic of hacking for good.

I have never understood the theory that if everyone had the same sufficiently advanced superhuman AI hacker, that this would favor defense over offense, given the ability to concentrate firepower on a given target, even ignoring the ‘most code will still not be superhuman for a while’ issue.

So I asked Twitter.

Teortaxes and others pointed to formal verification. I realize this exists, but so do social engineering, fraud and impersonation, and so potentially do other outside-the-formal-box forms of hacking the system, and formal verification of what matters for the larger system seems very hard and does not result in efficient code.

Teortaxes: This is indeed a very serious Q

Yes, I believe that, because formal verification exists. This is a bit different from vulnerability search, but the proof of concept is a general purpose 10T LLM that tops MathArena anyway. Fable or Fable+ could write *unbreakable* software.

securing even all web-exposed software is a superhuman task, obviously. but the main issue is labor cost. The endgame is that the weakest link in the attacked system is post-quantum cryptography and you'd need to disprove contemporary mathematics to get anywhere.

Glasswing isn't that btw. Glasswing exists to make sure the US isn't brought low by GLM 5.4 putting ransomware in every MRI scanner and iPhone. The current ecosystem is flimsy trash.

For how we even begin getting to that, read [The Great Refactor from IFP in 2025].

entirelyuseless: Non-superhuman coders have already written software they were unable to break with 100x effort (widely used cryptographic systems.) So evidently possible in general.

singularvessel: This is clearly doable for simple code, right? The question is just how the effort required scales with the complexity of the code.

NondescriptTransfer: Yes definitely. It's kind of like chess, a win is you taking advantage of the opponents detour from perfect play. Which is why you see more and more draws at higher levels.

Security research is the same way, except the defender wins draws. As capabilities improve we'll approach an asymptote of secure code. Also if you look at industry trends, code in general is far more secure than it used to be. AI will only accelerate this.

But yeah it all comes down to every hack is traced back to someone doing something dumb at some point and the hacker being the first one to catch it. As AI gets smarter and does dumb stuff less often, and there are more eyes on the code (from defender running AI) you'll see fewer

A fun aspect of chess is that people mostly do about as many ‘dumb things’ no matter what level of play, from the perspective of that level of play, because as you improve your play more things count as dumb. In theory perfect chess is possible, but we keep seeing the AIs get better beyond where I thought perfect play might be. My model of security is similar.

My understanding is that if your code wants to do very well-defined simple things, meaningful formal verification is within reach. But many things are not so simple.

You do distinctly have to avoid there being ransomware in every MRI scanner and iPhone. That sounds quite bad.

Arthur Tellis (QTing Teortaxes): Yes, but I think there are ways to soften these tradeoffs: for example, one could limit Fable's use for vuln research/security engineering to only software that it generates for a given user or can verify as belonging to a given user. This probably isn't a sustainable equilibrium, but it biases these systems' use toward hardening rather than vulnerability research in the short run.

I do not think you could pull this off, but even if you did it would be a horrendous pain in the ass, and would rule out use of Fable on a wide variety of the most valuable software projects.

In general though, I think that we are overrating the risk of this particular feature-bug. If, within the Glasswing construct, IT majors have ungated models, higher token budgets, and more legible/tacit context about any given codebase, they should be able to identify vulnerabilities faster than vulnerability researchers with Fable access. Moreover, KYC and classifier-driven observability of Fable use for security engineering/vulnerability research should act as at least modest barriers to entry for offensive researchers.

The strongest argument here is data retention, some amount of KYC plus classifier-driven observability as the real defense.

As in, if Anthropic wants to figure out who is using a lot of Fable tokens for major cybersecurity related operations, that is something they can definitely do, and they can associate it with your reputation.

This is not foolproof, it does not fully ‘fix’ the ‘jailbreak,’ but it is part of a defense-in-depth series of obstacles that greatly raises the de facto difficulty, risks and costs of offensive operations.

Do enough of that, as I expect Anthropic did, and the balance of impact of Fable shifts so that you’re safer with it released than not released.

But the real point is that we're overrating the importance of 0-day exploitation in computer network operations and the peril of computer network operations in general within the IT economy. The vast majority of cyber operations do not rely on 0-day exploitation, which can be mitigated against in general through effective monitoring, information domain separation, etc.

The name of the game is effective exploitation. That requires a chain of exploitation. Mythos was special, as I understand it, in large part because it could on its own find and complete such chains, often in unexpected ways.

The cost of cyber espionage/attack worldwide, meanwhile, is less than a single basis point of the total value produced by the IT economy and likely less than a single percent of the value produced by democratized machine intelligence. We are witnessing excessive securitization of AI model release on the basis of this omni-use technology's immense capability in program analysis and security engineering-vulnerability research.

This is where Arthur loses me. The value of cybersecurity is not ‘less than a basis point of all IT value.’

The value of cybersecurity is all of the value of all IT, period, in the sense that if your cybersecurity fully fails then you have no IT, or might be even worse off than that.

Tail risk management does indeed have to take priority sometimes. That does not need to involve ‘AI might get everyone killed.’ This is the same as saying ‘pandemic defense is a small percentage of the national budget’ as a reason not to worry about bioweapons. No matter the right level of worry about bioweapons, that argument is a very bad reason to not worry about bioweapons.

The question is not how much typically goes wrong in cybersecurity. It is what goes wrong in a world where cybersecurity collapses. What happens if they breach our national security systems? If hackers penetrate almost any corporate network? If our hospitals and other critical infrastructure can’t stay online? If you can’t do a variety of things online anymore because they are no longer secure?

Policymakers need to recalibrate. If AI is the transformative technology of our age, it is because its value in scientific production, software development, and labor augmentation; this value absolutely swamps the risk presented by patch reverse engineering, which is itself a malady swa

この記事をシェア

404 Media★32026年6月25日 22:47

データセンター会議での発言が長すぎたとして警察に逮捕された男性のボディカメラ映像が公開される

オクラホマ州クレアモアの警察は、2月に開かれたデータセンターに関する地域会議で発言時間が長すぎた農夫ダレン・ブランチャードを不法侵入罪で逮捕した。ブランチャード氏は罰金刑に反発し、この逮捕の経緯を示すボディカメラ映像をメディアに初めて公開して対抗する方針を表明している。

Vercel Blog★42026年6月25日 22:00

AI SDK 7 の発表

Vercel は、週に 1600 万回のダウンロードがある TypeScript 製 AI SDK の新バージョン「7」を発表した。このアップデートにより、推論制御やツール承認機能など、エージェント開発の生産性を高める機能が強化された。

KDnuggets★32026年6月25日 21:00

2026 年に AI エンジニアになるためのロードマップ

KDnuggets が、2026 年までに AI エンジニアとして活躍するための学習ロードマップを提示している。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む