Amazon Science·2026年6月11日 00:00·約5分で読める

Graviton5 の改良設計が速度とエネルギー効率を向上 - ムーアの法則を超える

#AI 推論 #AWS Graviton #カスタムシリコン #チップレットアーキテクチャ #メモリ帯域

TL;DR

AWS は、コア数倍増と 3 ナノメートルプロセスを採用した新 CPU「Graviton5」を搭載した EC2 インスタンス M9g/M9gd の一般提供を開始し、メモリ帯域やキャッシュ容量を大幅に強化して AI 推論を含む実ワークロードの性能向上を実現した。

AI深層分析2026年6月11日 01:03

重要/ 5段階

深度40%

キーポイント

Graviton5 のハードウェア革新

コア数を前世代比で倍増（192 コア）し、3 ナノメートルプロセスへ移行することで回路密度とクロック速度を向上させ、各コアの性能も 25% 改善された。

メモリ帯域とキャッシュの劇的強化

DDR5-8800 メモリと PCIe Gen6 をサポートし、L3 キャッシュ容量を前世代比で 5 倍以上（192MB）に拡大してメモリアクセス待ち時間を大幅に削減した。

アーキテクチャの再設計

チップレット構成を変更し、4 つのチップレットにコアとコントローラを分散配置することで、チップ間接続帯域を 420GB/s に高め、大規模な実アプリケーションでの分岐予測精度を向上させた。

AI・実ワークロードへの最適化

マイクロベンチマークではなくデータベースや AI 推論などの複雑な実アプリケーションを想定し、分岐予測機能の強化により最大 30% の性能向上を実現した。

影響分析・編集コメントを表示

影響分析

この発表は、クラウドプロバイダが独自チップの設計・製造能力を極限まで高めたことを示す象徴的な出来事であり、特にメモリ帯域とキャッシュ容量の拡大により、従来の CPU ベンチマークでは測れない大規模データ処理や AI 推論ワークロードのパフォーマンスボトルネックを解消する可能性が高い。業界全体として、Moore の法則が限界を迎える中で、アーキテクチャレベルでの最適化（チップレット、キャッシュ増強）とプロセス微細化の組み合わせが、次世代クラウドコンピューティングの標準となることを示唆している。

編集コメント

単なるスペックの向上にとどまらず、分岐予測やキャッシュ設計といったアーキテクチャレベルでの抜本的な見直しにより、実世界の複雑な AI ワークロードに対するパフォーマンスを劇的に改善した点は評価に値します。クラウド利用者が直面するメモリボトルネック解消への貢献は極めて大きいです。

AWS Graviton プロセッサは世代ごとに着実に進化を遂げ、各イテレーションで計算性能、価格対性能、エネルギー効率、メモリ容量の向上を実現してきました。本日、Amazon は一般用途ワークロード向けに、Elastic Compute Cloud (EC2) の新しい M9g および M9gd インスタンスの一般提供を開始したと発表しました。これらは、Amazon の CPU である最新世代の Graviton5 で動作する最初の Amazon プロダクトです。カスタムシリコンにおける五代間の開発と、8 年間にわたる継続的な投資を経て、Graviton は現在、ウェブアプリケーション、マイクロサービス、分析、データベース、機械学習推論、電子設計自動化、ゲーム、動画エンコーディング、そしてエージェント AI を含む多様なワークロードに適した 350 種類以上のインスタンスタイプを駆動しています。Graviton5 は Graviton4 に比べてコア数を倍増させ、96 から 192 に増加し、DDR5-8800 メモリと最新の PCIe gen6 インターコネクトをサポートします。私たちは主要な DRAM メーカーと緊密に協力して DDR5-8800 の性能レベルを実現しており、AWS Graviton インスタンスはクラウド上のプロセッサインスタンスの中で最も高速なメモリを提供しています。Graviton5 では、Amazon はさらに 3 ナノメートルプロセスを採用し、より高い回路密度とオンチップ通信の高速化を可能にしました。Graviton5 は Graviton4 よりも多くのコアを搭載しているだけでなく、各コアのパフォーマンスも 25% 向上しています。私たちはこれまで、マイクロベンチマークが実際の大きなワークロードとは大きく異なることについて議論してきましたが、私たちが設計するのは顧客の実際のワークロードです。小さなループではなく、データベースのような実際のアプリケーションのすべてのコードと複雑さを対象としています。

コードを高速に実行するために、現代のプロセッサはプログラム内の制御フローから生じる分岐を予測し、予測されたパスを先行的に実行します。Graviton5 に採用されている Neoverse V3 コア（Arm と Amazon の Annapurna Labs によって共同設計）は、CPU の分岐予測能力を大幅に向上させ、その結果、データベースのような実際のアプリケーションの実行性能が最大 30% 向上します。CPU の DRAM は約 100 ナノ秒の距離にある可能性があります。これは一見短く思えますが、3.3 ギガヘルツで動作する CPU において、1 回のメモリアクセスには 330 サイクルを要します。CPU はキャッシュを使用してデータを CPU に近づけ、リクエストがこれらのキャッシュのいずれかで満たされる場合、CPU はフル DRAM のレイテンシを待つ必要がありません。Graviton5 は、64 キロバイトの第 1 レベルキャッシュ、2 メガバイトの第 2 レベルキャッシュ、そして 192 メガバイトの第 3 レベルキャッシュを搭載しており、これは前世代の Graviton の 5 倍以上です。

Graviton3 は、コア、DRAM コントローラ、PCIe コントローラの間に 7 つのダイを使用するチップレットアーキテクチャを採用した最初の Graviton CPU でした。Graviton4 も同様のアーキテクチャを踏襲し、いくつかの改良を加えました。しかし、Graviton5 では大幅に変更を加えています：Graviton5 の 192 コアは 4 つのチップレットに分割され、各チップレットには DRAM コントローラ、PCIe コントローラ、および 48 コアが含まれています。また、チップレット間の帯域幅が最大 420 ギガバイト/秒に達するカスタムダイ間接続により、メッシュ内のコア間のレイテンシを最小化しています。これにより、個別の I/O ダイや個別の DRAM コントローラダイは不要となりました。この構成により、チップごとに 2 つまたは 4 つの非一様メモリアクセス (NUMA) リージョンを設定し、CPU で実行されている仮想マシン (VM) のサイズに合わせて L3 キャッシュのサイズを分割することが可能になり、48 コア以下の VM に対するメモリアクセスレイテンシを低減できます。

これらの強化により、Graviton5 は Graviton4 ベースのインスタンスと比較して最大 25% の計算性能向上を実現し、ウェブアプリケーションでは最大 35%、機械学習推論では最大 35%、データベースでは最大 30% の高速化を提供します。Graviton5 で動作する M9g および M9gd インスタンスは、Nitro Isolation Engine（Nitro イソレーションエンジン）の導入により、セキュリティの基準をさらに引き上げています。Nitro Isolation Engine は、インスタンスの分離を強制し、形式検証 (formal verification) を活用して数学的な精度で分離を保証する Nitro System の強化機能です。Nitro Isolation Engine は、VM 間の分離を強制するために特別に設計されたコンポーネントであり、最小限の API セットを通じて VM メモリ、CPU レジスタ状態、および I/O デバイスへのすべてのアクセスを仲介する責任を負っています。

Nitro Isolation Engine は形式検証という技術を活用しており、これはハードウェアやソフトウェアが特定のテストケースだけでなく、意図した通りに動作することを数学的に証明するための手法です。この集中的な検証により、Nitro は初めて形式検証されたクラウドハイパーバイザとなり、数学的に証明されたクラウドセキュリティの新たな基準を先駆けて確立しました。Nitro Isolation Engine について詳しく知りたい場合は、Amazon Science のブログ記事をご覧ください。

原文を表示

AWS Graviton processors have improved steadily across generations, with each iteration delivering advances in computational performance, price performance, energy efficiency, and memory capacity. Today, Amazon announced the general availability of the new M9g and M9gd instances of its Elastic Compute Cloud (EC2), for general-purpose workloads. These are the first Amazon products powered by Graviton5, the latest generation of Amazon’s CPU. After five generations of custom silicon and eight years of continuous investment, Graviton powers over 350 instance types that are suitable for workloads including web applications, microservices, analytics, databases, machine learning inference, electronic design automation, gaming, video encoding, and agentic AI. Graviton5 doubles the number of cores from Graviton4, from 96 to 192, and it supports DDR5-8800 memory and the latest PCIe gen6 interconnects. We’ve worked closely with leading DRAM manufactures to meet the DDR5-8800 level of performance, and AWS Graviton instances deliver the fastest memory of any processor instances in the cloud. With Graviton5, Amazon also moved to a three-nanometer process, enabling greater circuit density and faster on-chip communication. Not only does Graviton5 pack in more cores than Graviton4, but each of those cores offers 25% better performance. We've talked for a while about how micro benchmarks are very different from big, real-life workloads, and we design for our customers’ actual workloads — not small loops but all the code and complexity of a real application like a database. To execute code quickly, modern processors predict branches that come from control flow in programs and speculatively execute the predicted paths. The Neoverse V3 core used in Graviton5, codefined by Arm and Amazon’s Annapurna Labs, substantially improves the branch prediction capability of the CPU, and that in turn makes it able to execute real applications like databases up to 30% better. The DRAM of a CPU can be about 100 nanoseconds away. That doesn’t sound like a lot, but for a CPU that runs at 3.3 gigahertz, one memory access takes 330 cycles. CPUs use caches to bring data closer to the CPU, and when a request can be fulfilled from one of these caches, the CPU doesn’t have to wait for the full DRAM latency. Graviton5 has 64-kilobyte first-level caches, two-megabyte second-level caches, and 192 megabytes of level-three cache — more than five times as much as the previous generation of Graviton. Graviton3 was the first Graviton CPU to adopt a chiplet architecture, using seven dies across cores, DRAM controllers, and PCIe controllers. Graviton4 followed the same architecture as Graviton3, with a few refinements. However, in Graviton5, we’ve changed it substantially: the 192 cores in Graviton5 are split across four chiplets, with each chiplet containing DRAM controllers, PCIe controllers, and 48 cores, with custom die-to-die connectivity that provides up to 420 gigabytes per second of bandwidth between chiplets, minimizing latencies between cores in the mesh. There is no longer a separate I/O die nor a separate DRAM controller die. This organization allows us to configure two or four nonuniform-memory-access (NUMA) regions per chip and partition the size of the L3 cache to the size of the virtual machines (VMs) running on the CPU while reducing memory latency for VMs that are 48 cores or smaller. With these enhancements, Graviton5 oﬀers up to 25% better computational performance than Graviton4-based instances, with up to 35% faster performance for web applications, up to 35% for machine learning inference, and up to 30% for databases. The M9g and M9gd instances that are powered by Graviton5 are also raising the bar on security even further with the introduction of the Nitro Isolation Engine. The Nitro Isolation Engine is an enhancement to the Nitro System, which enforces isolation of instances and harnesses formal verification to provide assurances of isolation with mathematical precision. The Nitro Isolation Engine is a purpose-built component that is responsible for enforcing isolation between VMs, including mediation of all access to VM memory, CPU register state, and I/O devices through a minimal set of APIs. The Nitro Isolation Engine leverages formal verification, a technique for mathematically demonstrating that hardware or software behaves as intended, and not just in specific test cases. This intensive verification establishes Nitro as the first formally verified cloud hypervisor, pioneering a new standard for mathematically proven cloud security. To learn more about the Nitro Isolation Engine, read the Amazon Science blog post.

この記事をシェア

TechCrunch AI★42026年6月9日 21:00

e スクーター創業者が宇宙データセンター構築に 500 万ドルを調達

e スクーターの創業者が、宇宙空間でのデータセンター建設プロジェクトのために 500 万ドルの資金調達を実現した。

The Register AI/ML★52026年4月30日 08:47

Amazon の半導体事業が年間200億ドル規模に成長し、主要プレイヤーへ

アンディ・ジャシーCEOは、同社の半導体部門が世界有数のデータセンター用チップ企業となり、年間売上高が200億ドルのペースを超えたと発表した。内部顧客を含めれば500億ドル規模に達する可能性を示した。

The Decoder★42026年4月24日 23:12

Meta、Amazonから数千万個のAWS Graviton 5プロセッサコアを購入

MetaはAmazonから数千万個のAWS Graviton 5プロセッサコアを購入し、Graviton最大の顧客の一つとなる。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み