AIニュース最前線
最新ニュースAI日報Hacker日報週報動画AIツールトレンド企業

AIニュース最前線

世界中のAI最新情報を日本語で毎時更新

最新ニュース日報トレンド企業プレミアムRSS
© 2026 ainew.jp特定商取引法に基づく表記
ニュース一覧元記事を開く
The Decoder·2026年4月17日 00:52·約1分で読める

AnthropicのClaude Opus 4.7、コーディングで大きな進歩を達成、一方でサイバーセキュリティ機能を意図的に縮小

#LLM#AI安全性#コード生成#Anthropic#責任あるAI#モデル開発
TL;DR

Anthropicの新フラッグシップモデルClaude Opus 4.7はコーディングタスクで大きな改善を提供する一方、トレーニング中に特定のサイバーセキュリティ能力を意図的に低減させた。

AI深層分析2026年4月17日 09:42
3
注目/ 5段階
深度40%
2
関連度30%
4
実用性20%
3
革新性10%
3

キーポイント

1

コーディング能力の大幅向上

Claude Opus 4.7はコーディングタスクにおいて主要な改善を提供する新フラッグシップモデルとして発表された。

2

サイバーセキュリティ能力の意図的制限

トレーニングプロセスにおいて、企業は特定のサイバーセキュリティ関連能力を意図的に低減させる取り組みを行った。

3

Anthropicの責任あるAI開発アプローチ

能力向上と安全性のバランスを取るための意図的な設計選択が示されており、AI開発における倫理的配慮が反映されている。

影響分析・編集コメントを表示

影響分析

この発表は、AIモデルの能力向上と安全性のバランスを取るための業界の動向を示している。コーディング支援ツールとしての実用性が高まる一方、潜在的な悪用リスクを軽減するための意図的な設計が、責任あるAI開発の重要な事例となる可能性がある。

編集コメント

AI能力の向上と安全性のトレードオフを具体的に示す事例として注目される。技術進歩と倫理的配慮の両立が、今後のAI開発の重要な課題であることを再認識させる内容だ。

image
image

Anthropicの新たなフラッグシップモデル「Claude Opus 4.7」は、コーディングタスクにおいて大幅な改善を実現しました。トレーニングの過程で、同社は特定のサイバーセキュリティ能力を意図的に低減させています。

この記事「Anthropic's Claude Opus 4.7 makes a big leap in coding, while deliberately scaling back cyber capabilities」は、The Decoderで最初に公開されました。

原文を表示

Anthropic's new flagship model Claude Opus 4.7 delivers major improvements in coding tasks. During training, the company deliberately tried to reduce certain cybersecurity capabilities.

Anthropic has released Claude Opus 4.7, a direct upgrade to its predecessor, Opus 4.6. The company positions the model primarily as a step forward in autonomous coding. On the SWE-bench Pro coding benchmark, Opus 4.7 scores 64.3 percent, up from 53.4 percent for its predecessor and ahead of OpenAI's GPT-5.4 at 57.7 percent. Anthropic's own top model, Claude Mythos Preview, still leads by a wide margin at 77.8 percent.

Anthropic says Opus 4.7 follows instructions more precisely than its predecessor. The company notes that prompts written for older models may now produce unexpected results, as Opus 4.7 interprets instructions more literally than Opus 4.6, which sometimes loosely interpreted or skipped parts of them entirely.

Image: Anthropic

Image resolution triples for better visual understanding

Opus 4.7 processes images at up to 2,576 pixels on the long edge, which Anthropic says works out to roughly 3.75 megapixels, more than three times what earlier Claude models could handle. This isn't an API setting but a model-level change: images are automatically processed at higher resolution, though they consume more tokens as a result. Users who don't need the extra detail can downscale images before sending them.

Anthropic sees this as a major advantage for computer-use agents that need to read dense screenshots and for extracting data from complex diagrams. On the Document Reasoning benchmark (OfficeQA Pro), the company reports 80.6 percent accuracy, up from 57.1 percent with Opus 4.6. The benchmarks also show significant gains in biomolecular reasoning and visual navigation (ScreenSpot-Pro).

Anthropic deliberately throttles cyber capabilities

One of the more notable aspects of this release is how Anthropic handles the model's cybersecurity capabilities. The company says it experimentally tried to reduce certain cyber capabilities differentially during training. New safeguards are designed to automatically detect and block requests that suggest prohibited or high-risk cybersecurity use.

The background here is the recently announced Project Glasswing, in which Anthropic addressed the risks and benefits of AI models for cybersecurity. The company had explained that it would restrict the release of the more capable Mythos Preview and first test new safeguards on less capable models. Opus 4.7 is the first test case for this strategy.

Security researchers who want to use the model for penetration testing or red-teaming can sign up for a new Cyber Verification Program.

Hallucinations drop but don't disappear

According to the system card, Anthropic distinguishes between two types of hallucinations: factual hallucinations - wrong claims about the world, like fabricated quotes or incorrect data - and input hallucinations, where the model acts as if it has access to a tool or attachment that doesn't actually exist.

For factual hallucinations, Opus 4.7 performs better than or on par with Opus 4.6 across four benchmarks but falls short of Mythos Preview. Anthropic says the gap comes mainly from Mythos Preview's higher hit rate on obscure facts, not from a higher error rate in Opus 4.7.

For input hallucinations, Opus 4.7 achieves the lowest hallucination rate of all tested models when users request a tool that isn't available. When context information is missing, it comes close to Mythos Preview and sits well ahead of older models. Anthropic acknowledges, however, that the test cases for the tool set were tailored to Opus 4.6's weaknesses, which skews that model's results.

When dealing with questions based on made-up facts Opus 4.7 performs on par with Opus 4.6 and below Mythos Preview. Under pressure, such as when users or system prompts push the model to contradict its own assessment, Opus 4.7 is more honest than Opus 4.6 but less firm than Mythos Preview.

Alignment results are a mixed bag

Overall, Anthropic describes Opus 4.7's safety profile as similar to Opus 4.6, with low rates of deception, sycophancy, and cooperation with misuse. The model is more resistant to prompt injection attacks.

A known issue from earlier Claude models partially persists: refusing to help with legitimate AI safety research. According to the system card, Opus 4.7 still refuses to assist in 33 percent of simulated safety research tasks. That's a significant drop from 88 percent with Opus 4.6, but still a substantial share.

Same per-token prices, potentially much higher real-world costs

Pricing stays at $5 per million input tokens and $25 per million output tokens. However, Opus 4.7 uses a new tokenizer that can map the same text to up to 1.35 times as many tokens. The model also generates more output tokens at higher effort levels. In practice, the cost per request can rise significantly even though the per-token prices remain unchanged.

Image: Anthropic

A new effort level called "xhigh" slots in between "high" and "max." Claude Code also gets a new "/ultrareview" command for dedicated code reviews and an expanded "Auto Mode" for Max users, where Claude makes decisions on its own. Opus 4.7 is available through the Claude API, Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry.

More details and tips are available in the migration guide for Opus 4.7.

この記事をシェア

関連記事

Anthropic Research★32026年3月6日 09:00

2026年3月6日 Frontier Red TeamによるClaudeのCVE-2026-2796エクスプロイトのリバースエンジニアリング

Frontier Red Teamが、Claudeの脆弱性CVE-2026-2796を悪用するエクスプロイトをリバースエンジニアリングした。

Anthropic Research★32026年3月6日 09:00

フロンティア・レッドチーム、Firefoxのセキュリティ向上のためにMozillaと提携

フロンティア・レッドチームは、Firefoxのセキュリティを向上させるため、Mozillaと提携した。

宝玉的分享★42026年2月17日 09:00

59%のユーザーがより安価なモデルを選択:Sonnet 4.6の詳細解説

Anthropic社がClaude Sonnet 4.6をリリースし、Claude Codeテストで70%のユーザーが前世代モデルより好み、59%がフラッグシップモデルOpus 4.5よりも選択した。コーディング、コンピュータ利用、100万トークンコンテキストなど6次元で全面アップグレードされ、価格は据え置き。

ニュース一覧に戻る元記事を読む