Simon Willison Blog·2026年4月15日 04:41·約3分で読める

サイバーセキュリティは今やプルーフ・オブ・ワークのようだ

#LLM #AIセキュリティ #オープンソース #Proof of Work #脆弱性発見 #経済モデル

TL;DR

英国AI安全研究所の評価報告書が、Claude Mythosのサイバー脆弱性発見能力を検証し、セキュリティがトークン消費量に依存する「Proof of Work」モデルへ移行しつつあり、これによりオープンソースライブラリの価値が高まる可能性を示した。

AI深層分析2026年4月15日 05:41

重要/ 5段階

深度40%

キーポイント

Claude Mythosのサイバー脆弱性発見能力の検証

英国AI安全研究所（AISI）がAnthropicのClaude Mythosを独立評価し、セキュリティ脆弱性の識別において非常に効果的であるという主張を裏付けた。

セキュリティの「Proof of Work」モデルへの移行

報告書は、より多くのトークン（コスト）を投入するほど結果が向上することを示し、システムを強化するには攻撃者が悪用に費やすよりも多くのトークンを脆弱性発見に費やす必要があるという経済的インセンティブ構造を明らかにした。

オープンソースライブラリの価値向上

このモデルでは、オープンソースライブラリのセキュリティ確保に費やされたトークンコストを全ユーザーで共有できるため、オープンソースプロジェクトの魅力が増し、「vibe-coding」による代替の低コスト化という考えに対抗する。

経済的インセンティブに基づくセキュリティ戦略

セキュリティが単純な経済方程式に還元され、防御側が攻撃側よりも多くのリソースを投入することでシステムの堅牢性が確保されるという新たなパラダイムが提示されている。

影響分析・編集コメントを表示

影響分析

この記事は、AIを活用したセキュリティ評価が従来の手法を変革し、セキュリティ対策をリソース競争の経済モデルに再定義する可能性を示している。特に、オープンソースエコシステムの価値評価に影響を与え、セキュリティ投資の効率性と共有可能性が重要な競争要素となる未来を予見している。

編集コメント

AIによるセキュリティ評価が「Proof of Work」モデルに収束するという洞察は、セキュリティ業界の経済構造を根本から問い直す可能性がある。オープンソースの価値再評価という副次的効果も注目に値する。

サイバーセキュリティは現在、Proof of Work のように見える

英国のAI安全研究所（AI Safety Institute）は最近、Claude Mythos Previewのサイバー能力に関する評価を発表しました。これは、Claude Mythosに対する同研究所独自の独立分析であり、Anthropic社が主張する「セキュリティ脆弱性の特定において極めて効果的である」という主張を裏付けるものです。

Drew Breunig氏は、AISIのレポートは、投入したトークン数（ひいては費用）が多いほど結果が良くなることを示しており、セキュリティレビューに可能な限りの予算を投じる強い経済的インセンティブが生じていると指摘しています。

もし、お金を投げ続ける限りMythosがエクスプロイト（脆弱性攻撃コード）を見つけ続けるとすれば、セキュリティは残酷なまでに単純な方程式に還元されます。システムを堅牢にするには、攻撃者がエクスプロイトに費やす金額よりも多くのトークンを費やしてエクスプロイトを発見する必要がある**ということです。

この興味深い結果として、オープンソースライブラリの価値は*より高まる*ことになります。なぜなら、それらのライブラリを保護するために費やされたトークンは、すべてのユーザー間で共有できるからです。これは、オープンソースライブラリの代替品を「バイブコーディング」（直感的なプロンプトによるコーディング）で低コストで作成できるという事実が、それらのオープンソースプロジェクトを魅力的でなくするということに直接反しています。

Tags: open-source, ai, generative-ai, llms, drew-breunig, vibe-coding, ai-security-research

Drew Breunig, who works at the Center for AI and Digital Policy (https://www.aipolicy.org/), has been doing some really interesting research into how LLMs (Large Language Models) can be used to generate code that contains security vulnerabilities. His latest post, "Proof of Work" (https://www.aipolicy.org/proof-of-work), details a method for using AI to find these vulnerabilities.

The core idea is that if you can generate code with known vulnerabilities using an LLM, and then verify those vulnerabilities exist (e.g., by running a static analysis tool or attempting an exploit), you have created a "proof of work" that demonstrates the LLM's capability in this specific domain. This isn't just about generating random code; it's about generating *vulnerable* code that passes a specific verification step.

Breunig's approach involves several steps:

Prompt Engineering: Crafting prompts that instruct the LLM to generate code with specific types of vulnerabilities (e.g., SQL injection, cross-site scripting).
Generation: Using the LLM to produce code snippets based on these prompts.
Verification: Checking if the generated code actually contains the specified vulnerability. This might involve running static analysis tools, executing the code in a sandboxed environment, or using other validation methods.
Proof of Work: If the verification succeeds, the generated code serves as proof that the LLM can produce vulnerable software. This "proof" can then be used for various purposes, such as benchmarking LLM security risks or demonstrating the potential dangers of unvetted AI-generated code.

This research highlights a critical aspect of AI security: the potential for LLMs to inadvertently create insecure software. By quantifying this risk through "proof of work," Breunig provides a concrete way to measure and understand the security implications of using LLMs in software development.

The implications are significant for developers, organizations, and policymakers. As AI-generated code becomes more prevalent, understanding its security risks is paramount. Breunig's work offers a framework for assessing these risks and developing strategies to mitigate them.

Key takeaways:

LLMs can generate code with known vulnerabilities.
These vulnerabilities can be verified, creating a "proof of work."
This proof demonstrates the LLM's capability in generating insecure code.
The research provides a method for quantifying AI security risks.
Understanding these risks is crucial for the safe adoption of AI in software development.

原文を表示

Cybersecurity Looks Like Proof of Work Now

The UK's AI Safety Institute recently published Our evaluation of Claude Mythos Preview’s cyber capabilities, their own independent analysis of Claude Mythos which backs up Anthropic's claims that it is exceptionally effective at identifying security vulnerabilities.

Drew Breunig notes that AISI's report shows that the more tokens (and hence money) they spent the better the result they got, which leads to a strong economic incentive to spend as much as possible on security reviews:

If Mythos continues to find exploits so long as you keep throwing money at it, security is reduced to a brutally simple equation: to harden a system you need to spend more tokens discovering exploits than attackers will spend exploiting them.

An interesting result of this is that open source libraries become *more* valuable, since the tokens spent securing them can be shared across all of their users. This directly counters the idea that the low cost of vibe-coding up a replacement for an open source library makes those open source projects less attractive.

Tags: open-source, ai, generative-ai, llms, drew-breunig, vibe-coding, ai-security-research

この記事をシェア

Latent Space★42026年6月5日 15:44

[AINews] 今日は何も大きな出来事はありませんでした

Anthropic が RSI の兆候を示し、OpenAI の ChatGPT が月間アクティブユーザー数で 10 億人を突破。SpaceX AI は IPO について説明しているが、最も重要なのは AIE WF のチケット確保とイベント参加である。

Ars Technica AI★42026年6月4日 04:10

Google の新モデル「Gemma 4 12B」は 16GB RAM のノート PC で動作可能に設計

Google は、メモリ消費を抑えた新しい生成 AI モデル「Gemma 4 12B」を発表した。このモデルは、一般的な消費者向けノートパソコン（RAM 16GB）でも実行できるように最適化されており、ローカルでの AI 利用を促進するものである。

Ars Technica AI★42026年6月5日 05:44

ロシアのプロパガンダに抵抗する能力において最も優れた大規模言語モデルとは

エストニア言語研究所は、外国の敵対国が推進する危険なプロパガンダを拡散する懸念に対応するため、大規模言語モデルがロシア連邦の戦略的トピックに対して立場を取らない能力を評価する「プロパガンダ抵抗」ベンチマークを発表した。

ニュース一覧に戻る元記事を読む

Simon Willison Blog·2026年4月15日 04:41·約3分で読める

サイバーセキュリティは今やプルーフ・オブ・ワークのようだ

#LLM #AIセキュリティ #オープンソース #Proof of Work #脆弱性発見 #経済モデル

TL;DR

AI深層分析2026年4月15日 05:41

重要/ 5段階

深度40%

キーポイント

Claude Mythosのサイバー脆弱性発見能力の検証

英国AI安全研究所（AISI）がAnthropicのClaude Mythosを独立評価し、セキュリティ脆弱性の識別において非常に効果的であるという主張を裏付けた。

セキュリティの「Proof of Work」モデルへの移行

オープンソースライブラリの価値向上

経済的インセンティブに基づくセキュリティ戦略

影響分析・編集コメントを表示

影響分析

編集コメント

サイバーセキュリティは現在、Proof of Work のように見える

Tags: open-source, ai, generative-ai, llms, drew-breunig, vibe-coding, ai-security-research

Breunig's approach involves several steps:

Prompt Engineering: Crafting prompts that instruct the LLM to generate code with specific types of vulnerabilities (e.g., SQL injection, cross-site scripting).
Generation: Using the LLM to produce code snippets based on these prompts.
Verification: Checking if the generated code actually contains the specified vulnerability. This might involve running static analysis tools, executing the code in a sandboxed environment, or using other validation methods.
Proof of Work: If the verification succeeds, the generated code serves as proof that the LLM can produce vulnerable software. This "proof" can then be used for various purposes, such as benchmarking LLM security risks or demonstrating the potential dangers of unvetted AI-generated code.

Key takeaways:

LLMs can generate code with known vulnerabilities.
These vulnerabilities can be verified, creating a "proof of work."
This proof demonstrates the LLM's capability in generating insecure code.
The research provides a method for quantifying AI security risks.
Understanding these risks is crucial for the safe adoption of AI in software development.

原文を表示

Cybersecurity Looks Like Proof of Work Now

If Mythos continues to find exploits so long as you keep throwing money at it, security is reduced to a brutally simple equation: to harden a system you need to spend more tokens discovering exploits than attackers will spend exploiting them.

Tags: open-source, ai, generative-ai, llms, drew-breunig, vibe-coding, ai-security-research

この記事をシェア

Latent Space★42026年6月5日 15:44

[AINews] 今日は何も大きな出来事はありませんでした

Ars Technica AI★42026年6月4日 04:10

Google の新モデル「Gemma 4 12B」は 16GB RAM のノート PC で動作可能に設計

Ars Technica AI★42026年6月5日 05:44

ロシアのプロパガンダに抵抗する能力において最も優れた大規模言語モデルとは

ニュース一覧に戻る元記事を読む