読み込み中…

Interconnects·2026年2月5日 03:00·約23分

なぜNvidiaはオープンモデルを構築するのか：ブライアン・カタンザロに聞く

#LLM #オープンソース #MoE #Nemotron #NVIDIA

TL;DR

NVIDIA はオープンモデル戦略を強化し、Nemotron 3 シリーズの発表を通じて、自社ハードウェアとの最適化による持続可能な競争優位性を確立している。

AI深層分析2026年4月29日 20:06

重要/ 5段階

深度40%

キーポイント

Nemotron 3 シリーズの展開とハイブリッドアーキテクチャ

2025 年 12 月に発表された Nano モデルに加え、Super/Ultra バリアントも登場し、Mamba-Transformer ハイブリッドや MoE（混合専門家）アーキテクチャを採用して推論効率とコンテキスト長を大幅に向上させた。

オープンモデル戦略のビジネス的合理性

NVIDIA はオープンソースモデルを構築・公開することで、開発者コミュニティの生態系を形成し、最終的に自社の GPU ハードウェアへの依存と採用を促進するという独自の収益モデルを確立している。

合成データパイプラインとトレーニング手法の革新

20T トークン規模での FP8 訓練や、高品質な合成データ生成パイプライン（HelpSteer2 など）を活用し、オープンソースコミュニティに対して最先端のトレーニングレシピとデータを公開している。

Megatron-LM と NeMo フレームワークの進化

モデル並列化フレームワーク「Megatron-LM」やエンドツーエンド LLM スタック「NeMo」が再注目されており、オープンソーストレーニングソフトウェアの標準基盤としての地位を強化している。

Nemotron 3 の新世代モデルとデータ公開戦略

2026 年上半期にハイブリッド MoE アーキテクチャを採用した Nemotron 3 シリーズ（Nano/Super/Ultra）のリリースを予定しており、最大 1M トークンのコンテキスト長をサポートする。また、高品質な事前学習データセットや SFT/RL データを公開することで、業界全体のエコシステムを支援している。

大規模組織における「管理」ではなく「招待」による運営

500 人以上の研究者を抱える組織において、トップダウンでの統制ではなく、メンバーを「招待」する文化によって自律的な研究環境を構築している。

モデル公開の目的はシステム R&D とエコシステムの両立

NVIDIA がオープンモデルを公開する主な理由は、単なる技術発表ではなく、ハードウェアとソフトウェアの統合におけるシステム R&D を推進し、同時に開発者コミュニティのエコシステムを強化するためである。

重要な引用

Nvidia understands their unique roll as a company that and both build and directly capture the value they get from building open language models, giving them a uniquely sustainable advantage.

Bryan has a beautiful analogy for open models this early in AI's development, and how they are a process of creating 'potential energy' for AI's future applications.

Nemotron's two jobs: systems R&D + ecosystem support

Organizing 500+ people with 'invitation, not control'

We think that technology generally works better when there's openness to the infrastructure so that people can build things in different ways.

AI is a form of infrastructure... it's kind of a foundation.

影響分析・編集コメントを表示

影響分析

この記事は、NVIDIA がオープンソースコミュニティにおいて単なるベンダーから生態系の構築者へ役割を変えつつあることを示しており、業界全体が「クローズド vs オープン」の対立軸から「ハードウェア最適化によるオープンソースの標準化」という新たなフェーズへと移行していることを意味します。NVIDIA が公開するモデルとトレーニング手法は、今後数年間の AI 開発インフラの事実上の基準となる可能性が高く、他社ベンダーや研究者への影響は甚大です。

編集コメント

NVIDIA のオープンソース戦略が、単なる技術公開を超えて自社のハードウェアビジネスを強化する「持続可能な競争優位性」の源泉となっている点は、業界全体のパラダイムシフトを示唆しています。

2025 年、私にとっての大きなトピックの一つは、Nvidia がオープンモデルプログラムを大幅に強化したことです。より多くのリリース、高品質なモデルの提供、データセットを公開する数少ない企業の仲間入りを果たすなどです。このインタビューでは、500 人以上の技術スタッフを率いる 3 人の VP の一人である Bryan Catanzaro と座談し、以下の点について議論しました。

2025 年 12 月にリリースされた非常に印象的な「Nemotron 3 Nano」モデルと、まもなく登場する大規模な Super および Ultra バリアント、

なぜ Nvidia のビジネスはオープンモデルを構築することによって明確に恩恵を受けるのか、

より優れたモデルを追求するためにどのように Nemotron チームの文化が築かれたか、

Megatron-LM とオープンソーストレーニングソフトウェアの現状、

キャリアに関する振り返りと AI 研究への道筋、

その他のトピック。

このインタビューから私が得た最大の教訓は、Nvidia が自社で構築したオープン言語モデルから価値を直接獲得・キャプチャする企業としての独自の役割を理解している点です。これにより、彼らには独自かつ持続可能な優位性が生まれています。

Bryan は、AI 開発のこの初期段階におけるオープンモデルについて、非常に美しい比喩を持っています。それは、AI の将来のアプリケーションのための「ポテンシャルエネルギー」を創出するプロセスであるという考え方です。

ぜひお楽しみください！

ゲスト：NVIDIA 応用深層学習研究 VP（Applied Deep Learning Research: ADLR）ブライアン・カタンザロ。X: @ctnzr、LinkedIn、Google Scholar。

Apple Podcasts、Spotify、YouTube、およびポッドキャストを入手できるあらゆるプラットフォームで聴取可能です。他の Interconnects インタビューについては、こちらをご覧ください。

Nemotron モデルタイムライン

2019–2022 — 基盤となる研究

Megatron-LM（最近再び非常に人気となったモデル並列化フレームワーク；代替案：DeepSpeed、PyTorch FSDP）。

NeMo Framework（NVIDIA のエンドツーエンド LLM スタック：トレーニングレシピ、データパイプライン、評価、デプロイメント）。

2023 年 11 月 — Nemotron-3 8B: エンタープライズ対応の NeMo モデル。モデル：ベース、チャット SFT、チャット RLHF、コレクション。ブログ。

2024 年 2 月 — Nemotron-4 15B: 8T トークンでトレーニングされた多言語 LLM。論文。

2024 年 6 月 — Nemotron-4 340B: 合成データパイプラインを詳細に記述した主要なオープンリリース。論文、ブログ。モデル：インストラクション、リワード。

2024 年 7 月〜9 月 — Minitron / Nemotron-Mini: 15B から剪定された彼らの最初の剪定済みモデル。Minitron-4B（ベースモデル）、Nemotron-Mini-4B-Instruct。論文、コード。

2024 年 10 月 — Llama-3.1-Nemotron-70B: Llama 3.1 70B に対する強力なポストトレーニング。モデル、コレクション。主要データセット — HelpSteer2、論文。

2025 年 3 月〜6 月 — Nemotron-H: 推論効率のための最初のハイブリッド Mamba-Transformer モデル。論文、研究ページ、ブログ。モデル：8B、47B、4B-128K。

2025 年 5 月 — Llama-Nemotron: Llama（依然として！）の上に構築された効率的な推論モデル。論文。

2025 年 9 月 — Nemotron Nano 2: 推論のための 9B ハイブリッド、パフォーマンスの継続的な向上中。20T トークンでトレーニングされた 12B ベース（FP8 トレーニング）をポストトレーニング用に 9B に剪定。レポート、V2 コレクション。

2025 年 11 月 — Nemotron Nano V2 VL: 12B の VLM。レポート。

2025 年 12 月 — Nemotron 3：Nano/Super/Ultra ファミリー、ハイブリッド MoE（Mixture of Experts）、最大 100 万トークンのコンテキスト。Super/Ultra は 2026 年上半期にリリース予定。

Nano: 25T トークンで学習済み、総パラメータ数 316 億 / アクティブパラメータ約 32 億。レシピ、コード、データセットを公開。論文：ホワイトペーパー、技術報告書。モデル：Nano-30B-BF16、Base、FP8。

Nemotron の最近のデータセット

NVIDIA は 2025 年より大幅に多くのデータを公開し始めました。これには事前学習用データセットも含まれており、大規模な高品質な事前学習データを公開する数少ない組織の一つとなっています（ただし、無視できない法的リスクを伴います）。

事前学習データ

収集元 — CC-v2, CC-v2.1, CC-Code-v1, Code-v2, Specialized-v1, CC-Math-v1。数学論文：arXiv:2508.15096。

事後学習データ

コアとなる事後学習ダンプ（SFT/RL の組み合わせ）:

Llama Nemotron 事後学習 v1.1 (2025 年 4 月)

Nemotron 事後学習 v1 (2025 年 7 月)

Nemotron 事後学習 v2 (2025 年 8 月)

2025 年の推論/コード SFT コーパス:

OpenMathReasoning (2025 年 4 月)

OpenCodeReasoning (2025 年 4 月)、OpenCodeReasoning-2 (2025 年 5 月)

AceReason-1.1-SFT (2025 年 6 月)

Nemotron-Math-HumanReasoning (2025 年 6 月)、Nemotron-PrismMath (2025 年 4 月)

NeMo Gym RLVR データセット：コレクション

Nemotron v3 事後学習 (2025 年 12 月)：コレクション

HelpSteer（人間フィードバック/選好）:

HelpSteer (2023 年 11 月)

HelpSteer2 (2024 年 6 月)

HelpSteer3 (2025 年 3 月)

その他、ここではリンクされていないものも多数あります。

チャプター

00:00:00 イントロダクションと NVIDIA がオープンモデルを公開する理由

00:05:17 Nemotron の二つの役割：システム R&D とエコシステムの支援

00:15:23 モデルだけでなくデータセットも公開する

00:22:25 「支配」ではなく「招待」で 500 人以上を組織する

0:37:29 Nemotron のスケーリングと Megatron の進化

00:48:26 キャリアの振り返り：SVM から DLSS へ

00:54:12 バイトン・シリコンバレー AI ラボからの教訓

00:57:25 Jensen Huang と共に応用研究ラボを構築する

01:00:44 研究者へのアドバイスと 2026 年の予測

トランスクリプト

00:00:06 ネイサン・ランバート：はい、バイアン。Nemotron について話せるのをとても楽しみにしています。私は、誰もが話題にする中国における明白な出来事以外でも、オープンモデルの 25 年における最も大きな進化の物語の一つだと考えています。多くの注目を集めていますので、ぜひお越しいただきありがとうございます。

00:00:22 バイアン・カタンザロ：ああ、いいえ、光栄です。

00:00:23 ネイサン・ランバート：では、質問を始めたいと思います。これらの質問の一部は、正直に言って、ファンとしての私の好奇心を満たすものです。例えば、なぜ NVIDIA は基本的なレベルで Nemotron をオープンモデルとしてリリースするのでしょうか？

00:00:39 ブライアン・カタンザロ：はい、AI が成長するたびに NVIDIA の市場拡大の機会があることは私たちが知っていますし、AI を前進させようとする多くの開発者や研究者にとって、オープンな AI モデルへのアクセスが非常に重要であることも承知しています。業界内の他の企業による、オープンに開発された AI を推進する取り組みには、私たちも大変興奮していました。もちろん、Meta は Llama で素晴らしい成果を上げましたし、OpenAI も GPT OSS をリリースしており、これも非常に刺激的な出来事でした。また、アレン研究所は当然のことながら、研究、特にオープンリサーチの先頭に立っており、Marin プロジェクトや OpenAthena といった取り組みも牽引しています。私たちが常に発展を期待しているようなプロジェクトがいくつもあるのです。

そして、AI が今後どこへ向かうかを考える際、NVIDIA は AI をインフラストラクチャの一形態と信じています。適用された際には非常に有用な技術ですが、それ単体では基盤でありインフラストラクチャそのものなのです。人々が異なる方法で何かを構築できるようにするためには、インフラストラクチャに対してオープンであることが、技術がより良く機能するために重要だと考えています。インターネットが世界経済のあらゆる側面を変革した方法を思い浮かべてみてください。それは非常に深い影響を与えたものであり、私たちはまだ終わっていません。

例えば、小売業がインターネットを利用する方法と、医療業界がインターネットを利用する方法は異なります。そして、経済の異なるセクターが、それぞれのビジネスの心臓部にインターネットを組み込む方法を独自のやり方で見つけられたという事実は、インターネットが人々がさまざまな試みを行えるオープンな技術の上に構築されていたからこそ可能でした。私たちは、AI も同様に進化していくと考えています。世界のあらゆる経済セクターに属する組織が、AI に対して新しい、驚くべき、楽しい、そして重要な用途を見出し、カスタマイズして自らの業務に直接組み込む能力を持っていれば、それらをより効果的に実現できるでしょう。

なお、これは、多くの主要な研究所から提供されている API のように、極めて卓越した能力を持つ「クローズド」なアプローチを貶める意図ではありません。それらについても私たちは非常に興奮しています。

NVIDIA はあらゆる形態の AI を支援することを好んでいますが、現在、AI を展開するためのクローズドなアプローチは概ね順調に進んでいる一方で、オープンに開発された AI エコシステムには、さらに多くのエネルギーが必要だと感じています。そのため、私たちは過去 1 年間、この分野により多くの努力を注いできました。

00:03:42 ネイサン・ランバート：はい。この点については詳しく掘り下げていきます。実際に私も確認しているからです。私たちは今、1 月 26 日に録音を行っており、これはこれらの Nemotron 3 モデルの展開真っ只中です。ナノ版はすでに秋にリリースされており、おそらく組織が成し遂げた最も大きな成果の一つだったでしょう。そして皆、スーパーやウルトラ大規模なバリアントを熱心に待っています。

さて、この Nemotron プラットフォームをどこまで推し進めるつもりなのか、その限界はどこにあるのか。つまり、これは単にユーザーの反応や採用状況、エコシステムに依存しているだけなのでしょうか？それとも、組織全体を通じて一貫して掲げられるべき「北極星」のような明確な指針があるのでしょうか？あるいは、他のオープンラボの多くを聞くと、「私たちはオープン AGI を構築したい」という声が聞こえてきます。これは必ずしも現実的な基盤に基づいているとは限りませんが、非常に包括的なビジョンが存在しているように感じられます。

組織全体を通じてトーンを設定しようとするような試みはありますか？AI についても同様で—

00:04:31 ブライアン・カタンザロ：私の北極星について言えば—

00:04:32 ネイサン・ランバート：…学術界のことは非常に—

00:04:34 ブライアン・カタンザロ：Nemotron については。

00:04:36 ネイサン・ランバート：はい、続けてください。

00:04:37 ブライアン・カタンザロ：ああ、すみません。どうぞ。

00:04:39 ネイサン・ランバート：私はちょうど、AI についても比較しようとしていました。私たちは非常に明確なビジョンを持っています。あまりにもオープンであることについて言えば、研究が極めて必要であり、信頼性の高い研究に基づいて構築できるレシピがほとんどないと感じています。つまり、研究インフラストラクチャが必要なのです。そして、Llama のようなものが登場しましたが、これはザッカーバーグ氏のビジョンの上に築かれたものでした。しかし彼は考えを変え、私は彼のビジョン、特にオープンモデルの必要性を説く彼の表現方法が非常に優れていると思っていたのですが、それが次第に薄れていきました。では、組織全体に浸透し、人々を惹きつけ、興奮させるようなビジョンを設定する方法はあるのでしょうか？

00:05:17 ブライアン・カタンザロ：その通りです。私たちが Nemotron を構築したのには主に 2 つの理由があります。第一に、私たちの主力製品ラインのために必要だからです。では、私が何を意味しているのかというと？

さて、アクセラレーテッドコンピューティング（加速計算）とは、NVIDIA が行っていることであり、私たちは高速なコンピュータを構築しています。しかし、高速なコンピュータを構築する目的は、人々が新しいことをできるように支援することです。実際には、すべての高速コンピュータは同時に低速なコンピュータでもあります。「コンピュータがもっと速く、より多くのことができるようになればいい」という観察は古くから存在しており、コンピューティングの黎明期からあることです。では、アクセラレーテッドコンピューティングと標準的なコンピューティングの違いは何でしょうか？それは、特定のワークロードを加速することに優先順位をつけ、焦点を当て、決定を下す点にあります。残りのすべてのワークロードのうち約 99% に相当する他のワークロードについては、他者に任せることにします。

つまり、NVIDIA のシステムを購入して汎用計算を行うわけではありません。特定の目的のために購入するのです。現在では、それはすべて AI（人工知能）に関わるものです。しかし、AI に関連する計算ワークロードについて考えると、非常に多様性があり、また私たちが AI 向けに構築する必要のあるシステムに極めて大きな影響を与える多くの重要なパラメータやハイパーパラメータ、あるいはアルゴリズム的アプローチが存在します。

数値精度や MoE アーキテクチャといった要素は、もちろんネットワーク設計に影響を与えます。私たちはスパース性について夢見ています。ご存知の通り、GPU におけるスパースニューラルネットワークのアクセラレーション機能は Ampere アーキテクチャから搭載されていますが、まだ十分に活用されているとは言い難いです。では、どのようにしてその機能を効果的に活用できるかを解明すべきでしょうか。これらの要素は NVIDIA の主力製品ラインの未来に多大な影響を及ぼすものであり、私たちが何を構築していくべきかを知るためには、これらの問いに対する答えを深く理解する必要があります。

顧客に直接出向いてアンケートを行い、「例えば、先ほど話題に出たメタ社のように、NVIDIA の今後の製品ラインアップでどのような機能を望みますか？」と尋ねることはできません。もちろん、メタ社は可能な限り私たちを支援しようとしていますが、彼らが私たちに伝えられることには限界があります。なぜなら、これらのシステムの設計に影響を与える情報の多くは、その導出に非常に高額なコストがかかるため、極めて厳重に管理されているからです。したがって、どのようなシステムを構築すべきか、AI のどの部分を加速させるべきで、どの部分については気にする必要がないかを理解するためには、こうした問いに対して深く洞察する必要があります。つまり、Nemotron モデルの最初の役割は、NVIDIA が企業として存続し続けることを可能にすることです。コミュニティがこの理由を知っておくことが重要だと私は考えます。なぜなら、NVIDIA が Nemotron への投資を行っているのは、それが当社の未来にとって不可欠であると信じているからです。したがってこれは——もちろん、「NVIDIA はオープンに開発された AI を支持しています」と言うのは心地よいことですが、私たちは慈善活動家だからです、というわけではありません。実際には、これはビジネス上の判断なのです。

00:08:34 ネイサン・ランバート：賢明な判断だ

00:08:34 ブライアン・カタンザロ：…例えば、NVIDIA にとっては、AI について非常に深く理解することがビジネス上必要です。したがって、NVIDIA の継続的な事業を維持するために正当化される投資額は、大きいと考えます。これが Nemotron の最初の任務です。次に、Nemotron の2番目の任務は、NVIDIA 以外のより広いエコシステムをサポートすることです。AI 環境において NVIDIA は特別な立場にあります。私が考えるすべての大手 AI 企業のうち、他社との連携が最も多いのは私たちだと考えています。私たちは、小規模企業から大企業まで、AI ネイティブな企業から古くからの確立された企業まで、あらゆる企業を支援しています。

私たちはハイパースケラーとも協力し、小さなスタートアップとも協力し、世界中の国々とも協力しています。そのため、私たちはこの独自の立場を持ち、同時に、ユニークな責任、そしておそらくはユニークな機会も担っていると考えます。AI がどのような方向へ、あるいはどのような能力において成長しようとも、それは私たちが事業を拡大する機会となります。もちろん、それが自動的に起こるわけではありません。AI 市場は多様であり、さらに多様化しており、そうあるべきです。なぜなら、これは人類の歴史で最も重要な市場だからです。私たちはそのことを認識しつつも、同時に、AI エコシステムを発展させることが私たちの利益になるとも知っています。AI を構築し、発明し、展開する人々が増えれば増えるほど、企業としての機会も大きくなるのです。

これが Nemotron の2番目の任務です。

00:10:17 ネイサン・ランバート：はい、そのように率直におっしゃっていただき、本当に感謝しています。というのも、私たちはこれまで……私たちが昨夏に立ち上げた「アダム・プロジェクト」は、米国におけるオープンモデルへの投資を促進しようとするものですが、オープンモデルに対して明確なビジネスモデルを持っているのは NVIDIA のような企業だけだと言えます。なぜなら、オープンモデルと研究エコシステムが CUDA 上で円滑に連携するように保証する必要があるからです。そうすることで、実際に進行中の研究に非常に近づくことができるのです。そうでない場合、つまり、AMD などと比較して主に GPU で研究が行われることには大きな利点があるためです。

00:10:49 ブライアン・カタンザロ：ええ、私たちは競争を阻止する方法について考えているわけではありません。競争は歓迎します。すでに多くの競争があります。この分野ではさらに多くの競争が必要ですが、コミュニティとの関わりを維持することには非常に自らの利益があると考えています。

ご存知の通り、これは非常に重要です。CUDA の話についてはあまり記憶に残っていない方もいらっしゃるかもしれませんが、実は CUDA は当初、NVIDIA が学術界や産業界に対して積極的に働きかけ、「新しい計算方法がありますので、ぜひこれを使って何ができるか試してみてください」と呼びかけたことから始まりました。

実際、私は 2006 年にバークレーの大学院生だったときに CUDA を使い始めました。当時 NVIDIA のチーフサイエンティストだったデビッド・カーク氏がバークレーを訪れ、「新しい GPU をリリースしました。この新しいプログラミングモデルは CUDA と呼ばれますので、ぜひ試してみてください」とおっしゃったのです。その時私は FPGAs 上で機械学習に取り組んでおり、特にサポートベクターマシン（SVM）のトレーニングの一部を FPGA で実装していました。そこで、その小さな部分を CUDA で書き直してみることにしたところ、わずか 15 分ほどで完成し、実行すると、シングルスレッドの CPU コードと比較して約 200 倍もの速度向上がありました。

「おや、これは以前よりもずっと簡単だ！これならすぐにでも取り組める」と思ったものです。

さて、私の CUDA や NVIDIA への個人的な関与は、CUDA の開始当初から NVIDIA が行っていたこのアウトリーチ活動に起因するものです。ご存知の通り、これは NVIDIA に多くの素晴らしい成果をもたらしました。その一つが AlexNet です。これもまた学術プロジェクトであり、Alex Krizhevsky と Ilya Sutskever は「より大量のデータを用いて大規模なニューラルネットワークをどうやって訓練するか？GPU をこれまでとは異なる革新的で賢い方法で使用して GPU コードを書き込み、より優れた画像分類モデルを訓練しよう」と考えていました。そして、その結果は驚くべきものでした。これはコミュニティ全体にとって深層学習の時代を切り開くきっかけとなったのです。また、これもトップダウンで成し得たことではありません。これは、NVIDIA が並列計算と人工知能におけるオープンな開発および再研究を支援した結果として生まれたものです。私たちはこのことを忘れず、2026 年にはどのような姿になるかを考えています。つまり、どこかの研究室に所属する大学院生である未来の Alex Krizhevsky を支援し、世界を変える次世代技術を発明させるにはどうすればよいかということです。Nemotron や、現在公開されている他のオープン開発 AI プロジェクトのようなものなしでは、そのようなことは非常に難しいように思われます。なお、Nemotron は唯一のプロジェクトになろうとしているわけではありません。

私たちはコミュニティの一員です。オープンに開発された AI で素晴らしい成果を上げている他の人々を愛しています。他の人々の取り組みから学び、コミュニティをサポートしようとしています。それは私たちの利益にもなるからです。もちろん、他の人々が貢献しているのも非常に嬉しく思っています。

00:13:57 ネイサン・ランバート：はい、私が質問したかった点に移らせていただきます。Nemotron 25 が成熟（maturing）という言葉を使いたくないのですが、組織内でどのように感じられているかについてお聞きしたいのです。しかし、出力がコミュニティやモデルを構築する人々によってより注目されるレベルに達したことは事実です。これには多くの要因がありますが、私の専門分野のコミュニティでは、Nemotron のデータセットをよく利用しています。例えば、ポストトレーニング（post-training）レシピを見直す際、参考にするのは NVIDIA がリリースした高品質でオープンライセンスのポストトレーニングデータだけです。今年もまた、AI2 において大きな注目を集めた事前トレーニング（pre-training）データのいくつかをリリースし始めました。これは何でしょうか？Nemotron 内の明確な転換点と言えるのでしょうか？

それは長年やりたかったことで、ようやく実行したのでしょうか？でも、それはまさにゼロからイチへの瞬間のようなもので、事前トレーニングデータを公開することはどの企業にとっても法的リスクを伴うものですが、それを敢行する人はあまりいません。私の側の世界では、通常、最適な事前トレーニングデータセットが何かを言うのは比較的簡単で、長年にわたり Hugging Face、AI2、DCLM といった選択肢の間で揺れ動いており、実質的に選択肢はたった二つか三つしかなかったのです。したがって、基礎研究の観点からは、組織としてコミュニティをサポートし、ある程度のリスクを引き受けることは大きな一歩だと考えます。もしお話しできるエピソードがあれば、あるいは単に「感謝しています」と言っていただければ結構です。それ以上は特にありません。

00:15:23 ブライアン・カタンザロ：そうですね。つまり、Nemotron は単なるモデルではないことをより多くの人々に理解してほしいと思います。私たちが Nemotron で目指しているのは、オープンに開発された AI の支援です。これもまた、私たちの大きな機会だからです。現在、モデルを構築するようインセンティブを与えられている組織は数多くあり、そのモデルが事業の根幹をなすケースも少なくありません。

しかし NVIDIA においては、事業を動かしているのはモデルではなくシステムなのです。

原文を表示

One of the big stories of 2025 for me was how Nvidia massively stepped up their open model program — more releases, higher quality models, joining a small handful of companies releasing datasets, etc. In this interview, I sat down with one of the 3 VP’s leading the effort of 500+ technical staff, Bryan Catanzaro, to discuss:

Their very impressive Nemotron 3 Nano model released in Dec. 2025, and the bigger Super and Ultra variants coming soon,

Why Nvidia’s business clearly benefits from them building open models,

How the Nemotron team culture was crafted in pursuit of better models,

Megatron-LM and the current state of open-source training software,

Career reflections and paths into AI research,

And other topics.

The biggest takeaway I had from this interview is how Nvidia understands their unique roll as a company that and both build and directly capture the value they get from building open language models, giving them a uniquely sustainable advantage.

Bryan has a beautiful analogy for open models this early in AI’s development, and how they are a process of creating “potential energy” for AI’s future applications.

I hope you enjoy it!

Guest: Bryan Catanzaro, VP Applied Deep Learning Research (ADLR), NVIDIA. X: @ctnzr, LinkedIn, Google Scholar.

Listen on Apple Podcasts, Spotify, YouTube, and where ever you get your podcasts. For other Interconnects interviews, go here.

Nemotron Model Timeline

2019–2022 — Foundational Work

Megatron-LM (model parallelism framework that has become very popular again recently; alternatives: DeepSpeed, PyTorch FSDP).

NeMo Framework (NVIDIA’s end-to-end LLM stack: training recipes, data pipelines, evaluation, deployment).

Nov 2023 — Nemotron-3 8B: Enterprise-ready NeMo models. Models: base, chat-sft, chat-rlhf, collection. Blog.

Feb 2024 — Nemotron-4 15B: Multilingual LLM trained to 8T tokens. Paper.

Jun 2024 — Nemotron-4 340B: Major open release detailing their synthetic data pipeline. Paper, blog. Models: Instruct, Reward.

Jul–Sep 2024 — Minitron / Nemotron-Mini: First of their pruned models, pruned from 15B. Minitron-4B (base model), Nemotron-Mini-4B-Instruct. Paper, code.

Oct 2024 — Llama-3.1-Nemotron-70B: Strong post-training on Llama 3.1 70B. Model, collection. Key dataset — HelpSteer2, paper.

Mar–Jun 2025 — Nemotron-H: First hybrid Mamba-Transformer models for inference efficiency. Paper, research page, blog. Models: 8B, 47B, 4B-128K.

May 2025 — Llama-Nemotron: Efficient reasoning models built ontop of Llama (still!). Paper.

Sep 2025 — Nemotron Nano 2: 9B hybrid for reasoning, continuing to improve in performance. 12B base on 20T tokens (FP8 training) pruned to 9B for post-training. Report, V2 collection.

Nov 2025 — Nemotron Nano V2 VL: 12B VLM. Report.

Dec 2025 — Nemotron 3: Nano/Super/Ultra family, hybrid MoE, up to 1M context. Super/Ultra H1 2026.

Nano: 25T tokens, 31.6B total / ~3.2B active, releases recipes + code + datasets. Papers: White Paper, Technical Report. Models: Nano-30B-BF16, Base, FP8.

Nemotron’s Recent Datasets

NVIDIA began releasing substantially more data in 2025, including pretraining datasets — making them one of few organizations releasing high-quality pretraining data at scale (which comes with non-negligible legal risk).

Pretraining Data

Collection — CC-v2, CC-v2.1, CC-Code-v1, Code-v2, Specialized-v1, CC-Math-v1. Math paper: arXiv:2508.15096.

Post-Training Data

Core post-training dumps (SFT/RL blends):

Llama Nemotron Post-Training v1.1 (Apr 2025)

Nemotron Post-Training v1 (Jul 2025)

Nemotron Post-Training v2 (Aug 2025)

2025 reasoning/code SFT corpora:

OpenMathReasoning (Apr 2025)

OpenCodeReasoning (Apr 2025), OpenCodeReasoning-2 (May 2025)

AceReason-1.1-SFT (Jun 2025)

Nemotron-Math-HumanReasoning (Jun 2025), Nemotron-PrismMath (Apr 2025)

NeMo Gym RLVR datasets: Collection

Nemotron v3 post-training (Dec 2025): Collection

HelpSteer (human feedback/preference):

HelpSteer (Nov 2023)

HelpSteer2 (Jun 2024)

HelpSteer3 (Mar 2025)

And others, not linked here.

Chapters

00:00:00 Intro & Why NVIDIA Releases Open Models

00:05:17 Nemotron’s two jobs: systems R&D + ecosystem support

00:15:23 Releasing datasets, not just models

00:22:25 Organizing 500+ people with “invitation, not control”

0:37:29 Scaling Nemotron & The Evolution of Megatron

00:48:26 Career Reflections: From SVMs to DLSS

00:54:12 Lessons from the Baidu Silicon Valley AI Lab

00:57:25 Building an Applied Research Lab with Jensen Huang

01:00:44 Advice for Researchers & Predictions for 2026

Transcript

00:00:06 Nathan Lambert: Okay. Hey, Bryan. I’m very excited to talk about Nemotron. I think low-key, one of the biggest evolving stories in twenty-five of open models, outside the obvious things in China that everybody talks about, that gets a ton of attention. So th- thanks for coming on the pod.

00:00:22 Bryan Catanzaro: Oh, yeah, it’s my honor.

00:00:23 Nathan Lambert: So I wanted to start, and some of these questions are honestly fulfilling my curiosity as a fan. As like, why does NVIDIA, at a basic level, release Nemotron as open models?

00:00:39 Bryan Catanzaro: Well, we know that it’s an opportunity for NVIDIA to grow our market whenever AI grows, and we know that having access to open AI models is really important for a lot of developers and researchers that are trying to push AI forward. you know, we were really excited by efforts from some other companies around the industry to push openly developed AI forward. You know, Meta did some amazing work, obviously, with Llama and you know OpenAI released GPT OSS, which was exciting. And the Allen Institute, of course, has been, you know, really leading the charge for research, open research and, you know, also things like the Marin Project and OpenAthena. You know, like there’s, there’s a bunch of things that we’re always excited to see develop.

And, you know, as we think about where AI is gonna go, you know, NVIDIA believes that AI is a form of infrastructure. it’s.. AI is a very useful technology when it’s applied, but on its own you know, it’s kind of a foundation and infrastructure. We think that technology generally works better when there’s openness to the infrastructure so that people can build things in different ways. You know, you think about the way that the internet transformed every aspect of the world economy is pretty profound, and we’re not done yet.

But the way that, for example, retail uses the internet is different from the way that healthcare uses the internet. And the fact that you know, different sectors of the economy were able to figure out how to incorporate the internet into the beating heart of their businesses in different ways was possible because the internet was built on open technologies that, you know, allowed people to try different things. And we think AI is gonna evolve in a similar way, that organizations across every sector of the world economy are gonna find new and surprising and fun, and important things to do with AI, and they’ll be able to do that better if they have the ability to customize AI and incorporate it directly into the work that they do. and so -- and by the way, this is not to detract from any of the you know, more closed approaches to AI, you know, the APIs that we see from a number of leading labs that, you know, are just extraordinary and have amazing capabilities. We’re excited about those, too.

You know, NVIDIA loves to support AI in all of its manifestations, but we feel like right now the sort of closed approaches to deploying AI are doing pretty well but we, you know, could use some more energy in the openly developed AI ecosystem, and so that’s why we’ve been putting more effort into it this past year.

00:03:42 Nathan Lambert: Yeah. So I’m definitely gonna dig into this a lot ‘cause I have seen this. We’re sitting here recording in January twenty-six, which is in the midst of the rollout of these Nemotron three models. There’s the-- I think the Nano has released in the fall, which was probably one of the biggest splashes the org has made, and everybody’s eagerly awaiting these super and ultra-larger variants.

And it’s like how far are you, how far are you willing to push this Nemotron platform? Like, is it just depending on the users and the uptake and the ecosystem? Like, like, what is the-- is there a North Star in this? Or you hear a lot of.. if you listen to a lot of other open labs, they’re like: “We want to build open AGI,” which is like, I don’t necessarily think grounded, but there’s like a very unifying vision.

Is there something that you try to set the tone for it that goes through the organization? I mean, AI too, it’s like-

00:04:31 Bryan Catanzaro: You know, my North-

00:04:32 Nathan Lambert: .. academics is so-

00:04:34 Bryan Catanzaro: For Nemotron.

00:04:36 Nathan Lambert: Okay, go ahead.

00:04:37 Bryan Catanzaro: Oh, sorry. Go ahead.

00:04:39 Nathan Lambert: I was just, like, gonna compare to, like, AI too, where we can have such a-- like, we have a very specific vision, being so open that it’s like, I think, like, research is so needed, and there’s so little recipes to build on, like, with really credible research. So there’s, like, a research infrastructure, and then when you have something like Llama, it was, like, built on Zuckerberg’s vision, and he changed his mind, which I actually thought his vision was ex- was excellent, the way he articulated the need for open models, and it kind of faded. So it’s like, is there a way to set a vision for an org that, like, permeates every- everyone and is really compelling and exciting?

00:05:17 Bryan Catanzaro: Right. Well, we built Nemotron for two main reasons. The first is because we need to for our main product line. So what I mean by that?

Well, accelerated computing, what NVIDIA does, we build fast computers, right? But the point of building fast computers is to help people do new things. and actually every fast computer is also a slow computer. you know, the observation that it would be nice if computers were faster and could do more things isn’t new. that’s been around since the beginning of computing. So what makes accelerated computing different from standard computing is that we’re prioritizing, you know, we’re focusing, we’re deciding we’re gonna accelerate this workload. This other workload, which is like ninety-nine percent of all of the workloads, we’re gonna let somebody else do that, right?

So, like, you do not buy NVIDIA systems to do any general purpose computation. You buy them for a purpose, right? Which is these days, all about AI. But when you think about the workload, the compute workloads involved in AI there’s a, there’s a lot of diversity and there’s a lot of really important -.. parameters, hyperparameters, or algorithmic approaches that all have enormous imp- impacts on the systems that we need to build for AI.

So things like numeric precision MoE architecture, which of course, influence net-- it influences network design. you know, we’re dreaming about sparsity. We, you know, we’ve had, we’ve had sparse neural network acceleration in the GPU since Ampere. I don’t think that it’s being used enough. you know, so how do we, how do we figure out how to use that? These, these sorts of things have an enormous impact on the future of NVIDIA’s main product line, and we have to understand the answers to those questions deeply ourselves in order to know what we’re going to build.

We can’t just go to our customers and do a survey and say, “Hey “ you know, Meta, for example, since we were just talking about them, “what would you like to see in a future product line from NVIDIA?” Of course, Meta’s always trying to help us as much as they can, but there’s limits to what they can tell us because, you know a lot of the information that influences the design of these systems, it’s very expensive to derive, and so therefore, it’s, it’s very closely held. And so we need to be able to understand these questions very deeply in order to understand what kind of systems to build, in order to understand what we’re accelerating in AI and what we’re not gonna worry about. and so that’s kind of the first job for Nemotron models, is to make it possible for NVIDIA to continue to exist as a company. And I think it’s important that the community knows that because that’s the reason why NVIDIA is making the investments in Nemotron, is because we believe it’s essential for the future of our company. and so this isn’t-- and although as much, as much as it feels good to say, you know, NVIDIA believes in open openly developed AI because you know, we’re so charitable, but actually, that’s not the case. This is actually a business decision-

00:08:34 Nathan Lambert: It’s smart

00:08:34 Bryan Catanzaro: .. like, for NVIDIA, our business needs us to know about AI very deeply. And and so, you know, the amount of investment that is justified to carry on NVIDIA’s ongoing business, I think, is large. and so that’s that’s job number one for Nemotron. Now job number two for Nemotron is to support the ecosystem more broadly outside of NVIDIA. and, you know, NVIDIA has a special position in the AI landscape. of all of the big AI companies I think we’re the one that works with the most other companies. We support every company small and large, AI native company to old established enterprise.

We work with hyperscalers, we work with tiny little startups, we work with countries around the world. so we have this unique position and I think also a uni- unique responsibility and al- maybe also a unique opportunity, that whenever AI is able to grow in any sort of direction, in any capability, then you know, that’s an opportunity for us to grow our business. Obviously, it’s not automatic, right? you know, the AI market is diverse, and it’s getting more diverse, and it should be, ‘cause it’s the most important market in the history of humanity. So so we acknowledge that, and at the same time, we know that it’s in our interest to develop the AI ecosystem. The more people that are building, inventing, and deploying AI, the more opportunity that we have as a company.

So that’s job number two for Nemotron.

00:10:17 Nathan Lambert: Yeah. I really appreciate you saying it so directly ‘cause it’s like we’ve worked.. We- I launched this thing, the Adam Project, last summer, which is trying to get more investment in the US open models, and it’s like the only company that has an obvious business model for open models is something like NVIDIA, where you need to make sure that the open models and the research ecosystem plays nicely on CUDA, because then you’re gonna be able to be one-- You’re so many steps closer to research that’s happening. If not, like, if it like- There’s such an advantage to have research happen mostly on GPUs relative to AMD or anything like this, so.

00:10:49 Bryan Catanzaro: Well, you know, we are-- we’re, we’re not thinking about how to prevent competition. You know, we welcome competition. There’s lots of competition. There should be more competition in this space, but we are very self-interested in staying engaged with the community.

You know, it’s very important. You know, CUDA not many people remember this because it happened so long ago, but you know, CUDA started out with a lot of outreach from NVIDIA to the academic and industrial community saying, “Hey, we have this new way of doing computing. we’d love to see what you can do with it.” In fact, you know, I started using CUDA in 2006 when I was a grad student at Berkeley because David Kirk, who was the chief scientist of NVIDIA at the time, came over to Berkeley and said, “Hey we just released this new GPU, and it has this new programming model called CUDA. You should give it a try.” And I was-- at the time, I was working on machine learning on FPGAs, and I had been working on this one particular piece of support vector machine training on the FPGA, and I decided to take that little piece and write it in CUDA, and it took me like fifteen minutes, and then I ran it, and it was like two hundred times faster than my single-threaded CPU code, and I was like: “Whoa, that was way easier than what I was doing before. I’m just gonna go do that,” right?

So, like, my own personal involvement with CUDA and NVIDIA came about because of this outreach that NVIDIA conducted right from the beginning of CUDA. you know, of course, that led to a lot of great things for NVIDIA, including AlexNet, which was another academic project, you know, where Alex Krizhevsky and Ilya Sutskever were thinking about: “How do we train larger neural networks on more data? we’re gonna go write a bunch of GPU code that uses the GPU in a, in a kinda new and clever way, so that we can train a better image classification model.” And, you know, that had such astonishing results, it kicked off the deep learning era for the whole community. and again, not something that-.. could have been done top-down. That was a, that was a very much a result of NVIDIA supporting open development and re- research in parallel computing and artificial intelligence. And so we remember that, and we’re thinking about in twenty-six, what does it look like to help, you know, the Alex Krizhevsky of the future, who’s, who’s a grad student in a lab somewhere, invent the next technology that changes the world? It seems really difficult to do that without something like Nemotron or, or the other openly developed AI projects out there. yeah, I also wanna say in regards to this Nemotron is not trying to be the only project out there.

We’re part of the community. We love other people doing great work in openly developed AI. We learn from things that other people do and you know, so we’re, we’re trying to support the community because it’s in our interest, but we you know, we’re very happy to see other people contributing as well.

00:13:57 Nathan Lambert: Yeah, I mean, I can transition into something I wanted to ask about is like, I see multiple ways, twenty-five Nemotron mat-- in, I don’t wanna use the word maturing ‘cause I wanna ask you about how it feels in the org, but just like the output reached levels that were more noticed by the community and people building with models. And there’s a lot of ways that can happen, but one of them is like, in my niche community, I’ve been using Nemotron datasets a lot. Like we-- when we redo our post-training recipe, one of the only people we look at is like, okay, NVIDIA, Nemotron has released a lot of high-quality, openly licensed post-training data. this year, you also started releasing some pre-training data, which among AI2 got a lot of notice. Like, what is that? is that like a distinct shift within Nemotron?

Is that something that you’ve wanted to do for a while and finally just did? But it’s ‘cause it’s like-- it is just like a zero to one moment where releasing pre-training data comes with legal risk for any company, but so few people do it, where on my side of the world, it’s like pretty easy to normally say what the best pre-training dataset is, and it had, for a long time, oscillated between like Hugging Face, AI2, DCLM, and there was like literally only two or three options. So in terms of fundamental research, like I think that’s a big step from an org to support the community and take on some risk. So if you have any story you can tell and or just say like, I appreciate it, that’s, that’s all.. that’s all I got.

00:15:23 Bryan Catanzaro: Well, yeah. I mean, so I think it’d be great if more people could understand that Nemotron is not just a model, right? Like, what we’re trying to do with Nemotron is to support openly developed AI, because, again, that’s our big opportunity, right? Now, there’s a lot of organizations that are incentivized to build a model, and the model is maybe the thing that runs their business, right?

But at NVIDIA, the model is not the thing that runs our business, it’s the systems. So

この記事をシェア

TLDR AI重要度42026年7月24日 09:00

DeepSeek創業者が語るビジョンと戦略

Ars Technica AI重要度42026年7月25日 06:05

Anthropic、Opus 5 は効率重視で能力飛躍なし

NVIDIA Developer Blog重要度42026年7月25日 01:45

NVIDIA、モデル転送を光速化「ModelExpress」発表

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む