NVIDIA Developer Blog·2026年6月27日 04:00·約12分

Oracle Cloud Infrastructure上で本番対応のNVIDIA AI-Qブループリントをデプロイする方法

#AI Agents #LangChain #NVIDIA NeMo #OCI #Kubernetes

TL;DR

NVIDIA は Oracle Cloud Infrastructure (OCI) 上で AI-Q ブループリンを Terraform と Helm を用いてデプロイする完全な手順と、マルチエージェントアーキテクチャの実装方法を公開した。

AI深層分析2026年6月26日 21:03

重要/ 5段階

深度40%

キーポイント

AI-Q の進化とアーキテクチャ

単一質問対応から長期的計画やサブエージェント間の役割分担を行う「ロング・ホライズン」な AI エージェントへと進化し、LangChain と NeMo Agent Toolkit を基盤としている。

OCI 上での自動化デプロイ

Terraform で OCI リソース（OKE, Block Volume, Load Balancer など）を構築し、Helm でワークロードをインストールする一連の自動化手順が提供されている。

具体的な要件と構成要素

PostgreSQL 用のブロックボリュームや Vault によるシークレット管理など、本番環境向けに必要なインフラ要件と API キーの設定方法が明記されている。

必要なツールと前提知識

Terraform、kubectl、Helm、OCI CLIなどのローカルツールの準備に加え、KubernetesやLangChainの基礎知識があると有利です。

AI-Q のアーキテクチャ設計

本システムはマルチエージェントデザインを採用しており、インテントルーターがユーザークエリを解析して適切なワークフローへ振り分けます。

AI-Q のマルチエージェントアーキテクチャ

意図ルーターがクエリを分析し、単純な検索には「浅い研究エージェント」、複雑なタスクには計画と調査サブエージェントを持つ「深いエージェント」を動的に割り当てる。

安全かつ強力な実行環境

データ分析や画像処理などのスキルは、ファイルシステム（メモリストアなど）を共有しつつも、それぞれが独立したサンドボックス内で実行されることで安全性と分離性が確保されている。

影響分析・編集コメントを表示

影響分析

この記事は、NVIDIA の最先端 AI エージェント技術を実際のクラウドインフラ上で即座に運用可能にするための具体的なロードマップを提供しており、開発者やプラットフォームエンジニアにとって実装の障壁を大幅に下げる重要な役割を果たします。特に OCI との統合によるスケーラブルなデプロイ手法は、企業レベルでの AI エージェント導入を加速させる契機となるでしょう。

編集コメント

単なる機能紹介に留まらず、Terraform や Helm を用いた具体的なデプロイ手順まで公開されている点は、実務レベルでの導入を想定した非常に実践的な内容です。OCI 環境で本番運用を目指すエンジニアにとって即戦力となるドキュメントと言えます。

AI エージェントは過去 2 年間で大きく進化しました。最初のエージェントは一度に一つの質問しか答えられませんでした。次に、セッション全体を通じて文脈を保持できるマルチターンチャットが登場しました。そして今日では、長期の目標を持つエージェントが現れています。これらは多くのステップを計画し、作業をサブエージェント間に分担させ、長いタスク全体で文脈を維持し、安全なサンドボックス内でツールを実行するシステムです。

NVIDIA AI-Q Blueprint は、このようなエージェントのためのオープンソースリファレンスです。これは LangChain Deep Agents と NVIDIA NeMo Agent Toolkit を基盤に構築されています。引用された回答を素早く得るためにも、出典付きの長期的な調査レポートを作成するためにも使用できます。

この記事では、Terraform を用いて OCI リソースを作成し、Helm を用いて OKE 上にワークロードをインストールすることで、Oracle Cloud Infrastructure (OCI) 上で AI-Q 2.0 をデプロイする方法を示します。完了すれば、独自の OCI テナント内で動作する AI-Q エンドポイントが利用可能になり、作業終了時にすべてのリソースを削除するための単一のコマンドも用意されます。

対象者: Kubernetes、Terraform、およびシェル操作に慣れ親しんでおり、ラップトップではなく OCI 上で AI-Q を実行したい開発者やプラットフォームエンジニア向けです。

学習内容: AI-Q のマルチエージェントアーキテクチャが OCI サービスにどのようにマッピングされるか、そしてプロビジョニングからデプロイ、ブループリントの起動までを最初から最後まで行うための正確なコマンドについて学びます。

マルチエージェントアーキテクチャ（意図ルーター、浅層調査エージェント、深層エージェント、計画サブエージェント、研究者サブエージェントなど）に関する詳細は、AI-Q プロダクトページおよび NeMo Agent Toolkit ドキュメントをご覧ください。

前提条件

以下の準備が整っていることを確認してください：

OCI テナントへのアクセス権限があり、デプロイ可能なコンパートメントと、以下のサービス制限を十分に満たしていること：

OKE: 拡張クラスタ 1 つとノードプール 1 つ

ブロックボリューム：少なくとも 10 GB（クラスタ内の PostgreSQL 用に OKE CSI ドライバーによって動的にプロビジョニングされます）

ロードバランサー：フレキシブルロードバランサー 1 つ

Vault: バルト 1 つとシークレット

API キー：

build.nvidia.com から取得した NGC API キー（形式 nvapi-…）。これは NVIDIA 推論キーとして使用され、NGC コンテナレジストリ（nvcr.io）への認証にも利用されます。

tavily.com から取得した Tavily API キー（形式 tvly-…）

ローカルツール：terraform 1.5 以降、kubectl 1.28 以降、helm 3.x 以降。oci CLI は API 署名キーを設定済みであること。

Kubernetes、Helm チャート、Terraform、シェルに関する基本的な知識。LangChain や NeMo Agent Toolkit の経験があると望ましいですが、必須ではありません。

アーキテクチャ概要

AI-Q はマルチエージェント設計を採用しています。意図ルーターが各ユーザークエリを読み取り、適切なワークフローへ転送します。

image*図 1. AI-Q マルチエージェントアーキテクチャ。意図ルーターは、クエリをシャロー・リサーチ・エージェント（高速で制限されたツール拡張検索）またはディープ・エージェント（ファイルシステム層を共有し、サンドボックス内でスキルを実行するプランニングサブエージェントとリサーチャーサブエージェント）のいずれかに送信します*

このブループリントは拡張可能に設計されています。すべてのレイヤー（モデル、ツール、RAG バックエンド、サブエージェント、評価器）は、YAML 設定または NeMo Agent Toolkit プラグインシステムを通じて交換可能です。本シリーズの第 2 部および第 3 部では、この拡張性を活用します。

OCI デプロイメントアーキテクチャ

デプロイメントには、OCI リソースに対してTerraformを、Kubernetes ワークロードに対してHelmを使用します。これにより、インフラストラクチャとアプリケーションの明確な分離が実現され、後で Terraform destroy コマンドを実行するだけですべてのリソースを削除できます。

image*図 2. OCI 上の AI-Q デプロイメント。Terraform は VCN、OKE クラスター、ロードバランサー、Vault を作成します。Helm は OKE 上に AI-Q バックエンド、フロントエンド、および PostgreSQL ワークロードをインストールします*

リソースTerraform モジュール目的

VCN、サブネット、ゲートウェイ、NSGsnetworkネットワークの分離（パブリックサブネットと OKE サブネット）

OKE クラスター + ノードプールokeKubernetes ランタイム（拡張クラスター、VCN ネイティブ CNI）

OCI ロードバランサーloadbalancerポート 80 でのパブリック HTTP 取り込み、NodePort 30080 への転送

OCI Vault + secretsvault：API キーや認証情報のための AES-256 暗号化ストレージ

*表 1. deploy/terraform の Terraform モジュールによって作成される OCI リソース。

Helm チャートは OKE（Oracle Kubernetes Engine）上に 3 つのワークロードをインストールします：

Backend (aiq-backend): AI-Q ワークフローを実行する FastAPI ベースのエージェントサーバー。

Frontend (aiq-frontend): NodePort 30080 を介して公開される next.js ウェブ UI。

PostgreSQL (aiq-postgres): ジョブストア、チェックポイント、およびサマリー用のクラスター内データベース。

デプロイ手順

git clone https://github.com/oracle-samples/ai-q.git

cd ai-q/oke-samples/aiq-2.0

所要時間：約 20〜25 分。完全な参照情報は aiq-2.0/README.md に記載されています。

手順 1. Terraform 変数の設定

例ファイルをコピーし、テナントの詳細を編集してください：

cd deploy/terraform

cp terraform.tfvars.example terraform.tfvars

terraform.tfvars には少なくとも以下の設定が必要です：

tenancy_ocid, compartment_id, region（例：us-chicago-1）

user_ocid, fingerprint, private_key_path（~/.oci/config と同じ値）

db_admin_password：クラスター内 PostgreSQL をブートストラップするために使用され、OCI Vault に保存されます。

nvidia_api_key：build.nvidia.com からの NVIDIA NGC キー。推論および nvcr.io からのコンテナイメージのプルに使用します。

tavily_api_key：tavily.com からの Tavily キー。ウェブ検索に使用します。

手順 2. インフラストラクチャの作成

プロバイダーを初期化し、プランを確認して適用してください：

terraform init

terraform plan

terraform apply

これには約 10〜15 分かかります。Terraform は VCN、OKE クラスター、ロードバランサー、および NGC および Tavily API キーを保存時に暗号化された Vault を作成します。

確認: terraform output で oke_cluster_id と lb_public_ip の値が表示されるはずです。いずれかが空の場合、再度 terraform apply を実行してください – 適用は安全に繰り返し実行できます。

次のステップで必要な 2 つの値をキャプチャします:

export OKE_CLUSTER_ID="$(terraform output -raw oke_cluster_id)"

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

ステップ 3. NGC Helm チャートから AI-Q をインストールする

このチャートとコンテナイメージは NGC に公開されているため、ローカルでビルドする必要はありません。kubectl を新しい OKE クラスターに指し、チャートが使用するシークレットを作成した後、helm pull と helm install を実行します。

3a. kubectl を OKE クラスター用に設定する

oci ce cluster create-kubeconfig \

--cluster-id "$OKE_CLUSTER_ID" \

--file ~/.kube/config \

--region us-ashburn-1 \

--token-version 2.0.0 \

--kube-endpoint PUBLIC_ENDPOINT

kubectl get nodes

3b. API キーのエクスポート

terraform.tfvars に設定したのと同じ NGC および Tavily キーを再利用します。NGC キーは二重の役割を果たします。推論用キーであると同時に、nvcr.io へのプル認証情報としても機能します。

export NGC_API_KEY="nvapi-..."

export TAVILY_API_KEY="tvly-..."

export DB_USER_PASSWORD="<ステップ 1 の db_admin_password と同じ値>"

3c. ネームスペースとシークレットの作成

kubectl create namespace ns-aiq --dry-run=client -o yaml | kubectl apply -f -

kubectl create secret generic aiq-credentials -n ns-aiq \

--from-literal=NVIDIA_API_KEY="$NGC_API_KEY" \

--from-literal=TAVILY_API_KEY="$TAVILY_API_KEY" \

--from-literal=DB_USER_NAME="aiq" \

--from-literal=DB_USER_PASSWORD="$DB_USER_PASSWORD"

kubectl create secret docker-registry ngc-secret -n ns-aiq \

--docker-server=nvcr.io \

--docker-username='$oauthtoken' \

--docker-password="$NGC_API_KEY"

3d. NGC からチャートをプルしてインストールする

cd ../helm

helm pull https://helm.ngc.nvidia.com/nvidia/blueprint/charts/aiq2-web-2.0.0.tgz \

--username='$oauthtoken' \

--password="$NGC_API_KEY"

helm upgrade --install aiq aiq2-web-2.0.0.tgz \

-n ns-aiq \

--wait --timeout 10m \

-f values-oci-ngc.yaml

OCI オーバーレイ（values-oci-ngc.yaml）は意図的に非常に小さく、フロントエンドサービスを NodePort 30080（OCI ロードバランサーのヘルスチェックに使用されるポート）に固定し、ngc-secret イメージプルシークレットの名前を指定するだけです。イメージリポジトリ、Postgres の初期化 SQL、動的プロビジョニングされた 10 Gi ブロックボリューム PVC はすべて、チャート自体のデフォルト値から取得されます。

確認: kubectl get pods -n ns-aiq を実行すると、3〜5 分後に aiq-backend、aiq-frontend、aiq-postgres の各ポッドが Running 状態になっているはずです。

ステップ 4. AI-Q を開く

LB IP はすでにステップ 2 でシェルの環境変数として設定されています:

それ以降に新しいシェルを開いた場合は、Terraform から再エクスポートしてください:

cd ../terraform

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

ブラウザで http://<lb_public_ip> を開いてください。AI-Q のフロントエンドが表示されるはずです。

まず簡単な質問、例えば「NeMo Agent Toolkit とは何か？」を試してルーティングが機能しているか確認してください。その後、「ベンチマークスコアとコストで上位 3 つのオープンソース深層調査エージェントを比較する」などより深い質問を試して、ディープエージェントが実際に動作する様子を確認してください。

トラブルシューティング

terraform apply が OKE の作成時にクォータエラーで失敗します。コンパートメントの「クラスタ数」と「ノード数」に関するサービス制限を確認し、必要に応じてクォータの増額を依頼してください。

ポッドが ImagePullBackOff 状態で停止しています。イメージプルシークレットが作成されているか (kubectl get secret -n ns-aiq)、およびステップ 3c で kubectl create secret docker-registry ngc-secret コマンドを実行する際に NGC_API_KEY が正しく設定されていたかを確認してください。回転させるには、シークレットを削除して再作成し、その後 kubectl rollout restart deployment -n ns-aiq aiq-backend aiq-frontend を実行してください。

postgres ポッドが 2 分以上 Pending 状態のままです。Block Volume の PVC が動的にプロビジョニングされませんでした。kubectl describe pvc -n ns-aiq を実行してください。一般的な原因としては、OKE CSI ドライバーが実行されていない、デフォルトの StorageClass が存在しない、または Block Volume のクォータが不足していることが挙げられます。kubectl get sc でストレージクラスを確認し、コンパートメントの Block Volume サービス制限も確認してください。

Load Balancer IP が null として返されます。OCI では Terraform 実行後、ロードバランサーの完了に数分かかることがあります。terraform refresh を実行した後、再度 terraform output lb_public_ip を実行してください。

フロントエンドは読み込まれますが、クエリを実行すると 500 エラーが発生します。kubectl logs -n ns-aiq deploy/aiq-backend を実行してログを確認してください。最も一般的な原因は、ステップ 3c で作成した aiq-credentials シークレット内に NVIDIA_API_KEY または TAVILY_API_KEY が誤っているか欠落していることです。

さらに詳しく学ぶ

これで OCI 上で動作する AI-Q 2.0 のデプロイが完了し、不要になった際に terraform destroy コマンド一つでクリーンに削除することも可能です。さらに進める際に留意すべき点は以下の通りです:

コスト: OKE ノードプールとロードバランサーは稼働している間も継続して課金されます。実験の間にはスタックを破棄するか、ノードプールをゼロまでスケールダウンしてください。

シークレット: Terraform はプロビジョニング時に NGC と Tavily のキーを OCI Vault に保存します（監査および災害復旧のため）。ただし、実行中のポッドはステップ 3c で作成した aiq-credentials Kubernetes シークレットからこれらの値を読み取ります。ローテーションを行うには、新しい値でそのシークレットを削除して再作成し、その後 kubectl rollout restart deployment -n ns-aiq aiq-backend を実行してください。terraform.tfvars のみを編集してもポッドに反映されません。

拡張性: 今回デプロイしたものはすべて YAML と NeMo Agent Toolkit プラグインシステムによって駆動されています。LLM の差し替え、サブエージェントの追加、新しい RAG バックエンドの接続は、書き直しではなく設定変更で実現できます。

AI-Q in OCI repo をクローンし、構築したソリューションと解決した課題を NVIDIA Developer Forum で共有してください。

原文を表示

AI agents have changed a lot in the last two years. The first could only answer one question at a time. Then came multi-turn chat, where the model could keep some context across a session. Today, we have long-horizon agents. Systems that plan many steps, split work between sub-agents, keep context across a long task, and run tools in a safe sandbox.

The NVIDIA AI-Q Blueprint is an open source reference for this kind of agent. It is built on LangChain Deep Agents and the NVIDIA NeMo Agent Toolkit. You can use it for quick cited answers, or for longer research reports with sources.

This post shows you how to deploy AI-Q 2.0 on Oracle Cloud Infrastructure (OCI) using Terraform to create the OCI resources and Helm to install the workloads on OKE. By the end, you will have a working AI-Q endpoint in your own OCI tenancy, and one command to take it all down when you are done.

Who this is for: Developers and platform engineers comfortable with Kubernetes, Terraform, and the shell, and who want to run AI-Q on OCI rather than on a laptop.

What you’ll learn: How AI-Q’s multi-agent architecture maps to OCI services, plus the exact commands to provision, deploy, and open the blueprint from start to finish.

More background on the multi-agent architecture (such as intent router, shallow research agent, deep agent, planning sub-agent, researcher sub-agent), is on the AI-Q product page and the NeMo Agent Toolkit docs.

Prerequisites

Make sure you have:

OCI tenancy access with a compartment you can deploy into, and enough service limits for:

OKE: One enhanced cluster and one node pool

Block Volume: At least 10 GB (dynamically provisioned by the OKE CSI driver for the in-cluster PostgreSQL)

Load Balancer: One flexible

Vault: One vault plus secrets

API keys:

NGC API key from build.nvidia.com, format nvapi-… used both as the NVIDIA inference key and to authenticate to the NGC container registry (nvcr.io).

Tavily API key from tavily.com, format tvly-…

Local tools: terraform 1.5 or later, kubectl 1.28 or later, helm 3.x or later, the oci CLI set up with your API signing key

Some basic knowledge of Kubernetes, Helm charts, Terraform, and the shell. LangChain or NeMo Agent Toolkit experience is nice to have, but not required.

Architecture overview

AI-Q uses a multi-agent design. An intent router reads each user query and sends it to the right workflow.

Figure 1. The AI-Q multi-agent architecture. The intent router sends queries either to the Shallow Research Agent (fast, bounded tool-augmented search) or to the Deep Agent (a Planning sub-agent and a Researcher sub-agent that share a Filesystem layer and run skills in sandboxes)

The blueprint is built to be extensible. Every layer (models, tools, RAG backends, sub-agents, evaluators) can be swapped through YAML config or through the NeMo Agent Toolkit plugin system. We will use that extensibility in Parts 2 and 3 of this series.

OCI deployment architecture

The deployment uses Terraform for the OCI resources and Helm for the Kubernetes workloads. This gives a clean split between infrastructure and application, and one terraform destroy is enough to remove everything later.

Figure 2. The AI-Q deployment on OCI. Terraform creates the VCN, OKE cluster, Load Balancer, and Vault. Helm installs the AI-Q backend, frontend, and PostgreSQL workloads on OKE.

The Helm chart installs three workloads on OKE:

Backend (aiq-backend): A FastAPI-based agent server that runs the AI-Q workflow.

Frontend (aiq-frontend): A next.js web UI exposed over NodePort 30080.

PostgreSQL (aiq-postgres): An in-cluster database for the job store, checkpoints, and summaries.

Deployment steps

git clone https://github.com/oracle-samples/ai-q.git

cd ai-q/oke-samples/aiq-2.0

Total time: around 20 to 25 minutes. The full reference is in aiq-2.0/README.md.

Step 1. Configure Terraform variables

Copy the example file and edit it with your tenancy details:

cd deploy/terraform

cp terraform.tfvars.example terraform.tfvars

At minimum, set these variables in terraform.tfvars:

tenancy_ocid, compartment_id, region (for example us-chicago-1)

user_ocid, fingerprint, private_key_path (same values as your ~/.oci/config)

db_admin_password, used to bootstrap the in-cluster PostgreSQL, stored in OCI Vault.

nvidia_api_key, your NVIDIA NGC key from build.nvidia.com. Used for inference and to pull container images from nvcr.io.

tavily_api_key, your Tavily key from tavily.com, for web search.

Step 2. Create the infrastructure

Initialize the providers, check the plan, and apply:

terraform init

terraform plan

terraform apply

This takes about 10 to 15 minutes. Terraform creates the VCN, OKE cluster, Load Balancer, and the Vault with the NGC and Tavily API keys encrypted at rest.

Check: terraform output should show values for oke_cluster_id and lb_public_ip. If either is empty, run terraform apply again – the apply is safe to repeat.

Capture the two values you’ll need in the next step:

export OKE_CLUSTER_ID="$(terraform output -raw oke_cluster_id)"

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

Step 3. Install AI-Q from the NGC Helm chart

The chart and container images are published on NGC, so there’s nothing to build locally. We point kubectl at the new OKE cluster, create the secrets the chart consumes, then helm pull and helm install.

3a. Configure kubectl for the OKE cluster

oci ce cluster create-kubeconfig \

--cluster-id "$OKE_CLUSTER_ID" \

--file ~/.kube/config \

--region us-ashburn-1 \

--token-version 2.0.0 \

--kube-endpoint PUBLIC_ENDPOINT

kubectl get nodes

3b. Export the API keys

Reuse the same NGC and Tavily keys you put in terraform.tfvars. The NGC key does double duty. It’s both the inference key and the nvcr.io pull credential.

export NGC_API_KEY="nvapi-..."

export TAVILY_API_KEY="tvly-..."

export DB_USER_PASSWORD="<same value as db_admin_password in Step 1>"

3c. Create the namespace and secrets

kubectl create namespace ns-aiq --dry-run=client -o yaml | kubectl apply -f -

kubectl create secret generic aiq-credentials -n ns-aiq \

--from-literal=NVIDIA_API_KEY="$NGC_API_KEY" \

--from-literal=TAVILY_API_KEY="$TAVILY_API_KEY" \

--from-literal=DB_USER_NAME="aiq" \

--from-literal=DB_USER_PASSWORD="$DB_USER_PASSWORD"

kubectl create secret docker-registry ngc-secret -n ns-aiq \

--docker-server=nvcr.io \

--docker-username='$oauthtoken' \

--docker-password="$NGC_API_KEY"

3d. Pull and install the chart from NGC

cd ../helm

helm pull https://helm.ngc.nvidia.com/nvidia/blueprint/charts/aiq2-web-2.0.0.tgz \

--username='$oauthtoken' \

--password="$NGC_API_KEY"

helm upgrade --install aiq aiq2-web-2.0.0.tgz \

-n ns-aiq \

--wait --timeout 10m \

-f values-oci-ngc.yaml

The OCI overlay (values-oci-ngc.yaml) is intentionally tiny — it only pins the frontend service to NodePort 30080 (the port the OCI Load Balancer health-checks) and names the ngc-secret image-pull secret. Image repositories, the Postgres init SQL, and the dynamically provisioned 10 Gi Block Volume PVC all come from the chart’s own defaults.

Check: kubectl get pods -n ns-aiq should show aiq-backend, aiq-frontend, and aiq-postgres pods in Running state after 3 to 5 minutes.

Step 4. Open AI-Q

The LB IP is already in your shell from Step 2:

If you opened a new shell since then, re-export it from Terraform:

cd ../terraform

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

Open http://<lb_public_ip> in your browser. You should see the AI-Q frontend.

Try a simple question first, for example, “What is the NeMo Agent Toolkit?”, to confirm the routing works. Then try a deeper one, for example, “Compare the top three open-source deep-research agents by benchmark score and cost”, to see the deep agent in action.

Troubleshooting

terraform apply fails on OKE creation with a quota error. Check the service limits for your compartment for “Cluster count” and “Node count”, and ask for more quota if needed.

Pods stuck in ImagePullBackOff. Check that the image-pull secret was created (kubectl get secret -n ns-aiq) and that your NGC_API_KEY was correct when you ran the kubectl create secret docker-registry ngc-secret command in Step 3c. To rotate, delete the secret and re-create it, then kubectl rollout restart deployment -n ns-aiq aiq-backend aiq-frontend.

postgres pod stays in Pending for more than 2 minutes. The Block Volume PVC didn’t get dynamically provisioned. Run kubectl describe pvc -n ns-aiq. Typical causes are the OKE CSI driver not running, the default StorageClass missing, or insufficient Block Volume quota. Check the storage class with kubectl get sc and your compartment’s Block Volume service limit.

Load Balancer IP comes back as null. OCI can take a minute or two after Terraform to finish the LB. Run terraform refresh and then terraform output lb_public_ip again.

Frontend loads but queries return 500. Look at kubectl logs -n ns-aiq deploy/aiq-backend. The most common cause is a wrong or missing NVIDIA_API_KEY or TAVILY_API_KEY in the aiq-credentials secret you created in Step 3c.

Learn more

You now have a working AI-Q 2.0 deployment on OCI, and one command (terraform destroy) to remove it cleanly when you are done. A few things to keep in mind as you go further:

Cost: The OKE node pool and the Load Balancer keep costing you while they run. Destroy the stack between experiments, or scale the node pool down to zero.

Secrets: Terraform stores the NGC and Tavily keys in OCI Vault at provision time (for audit and disaster recovery), but the running pods read them from the aiq-credentials Kubernetes secret you created in Step 3c. To rotate, delete and re-create that secret with the new values, then kubectl rollout restart deployment -n ns-aiq aiq-backend. Editing terraform.tfvars alone won’t reach the pods.

Extensibility: Everything you just deployed is driven by YAML and by the NeMo Agent Toolkit plugin system. Swapping an LLM, adding a sub-agent, or plugging a new RAG backend is a configuration change, not a rewrite.

Clone the AI-Q in OCI repo and share on the NVIDIA Developer Forum the solution you built and what problem you solved.

この記事をシェア

LangChain Blog重要度42026年6月27日 02:13

Deep Agents との連携によるプロンプトキャッシング

AWS Machine Learning Blog重要度42026年6月26日 23:38

Stripe の金融コンプライアンス向け本番級 AI エージェント：AWS ベッドロックでの構築教訓

LangChain Blog2026年6月26日 00:04

SmithDB の全文検索用逆インデックス構築の仕組み

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

NVIDIA Developer Blog·2026年6月27日 04:00·約12分

Oracle Cloud Infrastructure上で本番対応のNVIDIA AI-Qブループリントをデプロイする方法

#AI Agents #LangChain #NVIDIA NeMo #OCI #Kubernetes

TL;DR

AI深層分析2026年6月26日 21:03

重要/ 5段階

深度40%

キーポイント

AI-Q の進化とアーキテクチャ

OCI 上での自動化デプロイ

Terraform で OCI リソース（OKE, Block Volume, Load Balancer など）を構築し、Helm でワークロードをインストールする一連の自動化手順が提供されている。

具体的な要件と構成要素

必要なツールと前提知識

Terraform、kubectl、Helm、OCI CLIなどのローカルツールの準備に加え、KubernetesやLangChainの基礎知識があると有利です。

AI-Q のアーキテクチャ設計

本システムはマルチエージェントデザインを採用しており、インテントルーターがユーザークエリを解析して適切なワークフローへ振り分けます。

AI-Q のマルチエージェントアーキテクチャ

安全かつ強力な実行環境

影響分析・編集コメントを表示

影響分析

編集コメント

前提条件

以下の準備が整っていることを確認してください：

OCI テナントへのアクセス権限があり、デプロイ可能なコンパートメントと、以下のサービス制限を十分に満たしていること：

OKE: 拡張クラスタ 1 つとノードプール 1 つ

ブロックボリューム：少なくとも 10 GB（クラスタ内の PostgreSQL 用に OKE CSI ドライバーによって動的にプロビジョニングされます）

ロードバランサー：フレキシブルロードバランサー 1 つ

Vault: バルト 1 つとシークレット

API キー：

tavily.com から取得した Tavily API キー（形式 tvly-…）

ローカルツール：terraform 1.5 以降、kubectl 1.28 以降、helm 3.x 以降。oci CLI は API 署名キーを設定済みであること。

Kubernetes、Helm チャート、Terraform、シェルに関する基本的な知識。LangChain や NeMo Agent Toolkit の経験があると望ましいですが、必須ではありません。

アーキテクチャ概要

AI-Q はマルチエージェント設計を採用しています。意図ルーターが各ユーザークエリを読み取り、適切なワークフローへ転送します。

OCI デプロイメントアーキテクチャ

リソースTerraform モジュール目的

VCN、サブネット、ゲートウェイ、NSGsnetworkネットワークの分離（パブリックサブネットと OKE サブネット）

OKE クラスター + ノードプールokeKubernetes ランタイム（拡張クラスター、VCN ネイティブ CNI）

OCI ロードバランサーloadbalancerポート 80 でのパブリック HTTP 取り込み、NodePort 30080 への転送

OCI Vault + secretsvault：API キーや認証情報のための AES-256 暗号化ストレージ

*表 1. deploy/terraform の Terraform モジュールによって作成される OCI リソース。

Helm チャートは OKE（Oracle Kubernetes Engine）上に 3 つのワークロードをインストールします：

Backend (aiq-backend): AI-Q ワークフローを実行する FastAPI ベースのエージェントサーバー。

Frontend (aiq-frontend): NodePort 30080 を介して公開される next.js ウェブ UI。

PostgreSQL (aiq-postgres): ジョブストア、チェックポイント、およびサマリー用のクラスター内データベース。

デプロイ手順

git clone https://github.com/oracle-samples/ai-q.git

cd ai-q/oke-samples/aiq-2.0

所要時間：約 20〜25 分。完全な参照情報は aiq-2.0/README.md に記載されています。

手順 1. Terraform 変数の設定

例ファイルをコピーし、テナントの詳細を編集してください：

cd deploy/terraform

cp terraform.tfvars.example terraform.tfvars

terraform.tfvars には少なくとも以下の設定が必要です：

tenancy_ocid, compartment_id, region（例：us-chicago-1）

user_ocid, fingerprint, private_key_path（~/.oci/config と同じ値）

db_admin_password：クラスター内 PostgreSQL をブートストラップするために使用され、OCI Vault に保存されます。

nvidia_api_key：build.nvidia.com からの NVIDIA NGC キー。推論および nvcr.io からのコンテナイメージのプルに使用します。

tavily_api_key：tavily.com からの Tavily キー。ウェブ検索に使用します。

手順 2. インフラストラクチャの作成

プロバイダーを初期化し、プランを確認して適用してください：

terraform init

terraform plan

terraform apply

次のステップで必要な 2 つの値をキャプチャします:

export OKE_CLUSTER_ID="$(terraform output -raw oke_cluster_id)"

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

ステップ 3. NGC Helm チャートから AI-Q をインストールする

3a. kubectl を OKE クラスター用に設定する

oci ce cluster create-kubeconfig \

--cluster-id "$OKE_CLUSTER_ID" \

--file ~/.kube/config \

--region us-ashburn-1 \

--token-version 2.0.0 \

--kube-endpoint PUBLIC_ENDPOINT

kubectl get nodes

3b. API キーのエクスポート

export NGC_API_KEY="nvapi-..."

export TAVILY_API_KEY="tvly-..."

export DB_USER_PASSWORD="<ステップ 1 の db_admin_password と同じ値>"

3c. ネームスペースとシークレットの作成

kubectl create namespace ns-aiq --dry-run=client -o yaml | kubectl apply -f -

kubectl create secret generic aiq-credentials -n ns-aiq \

--from-literal=NVIDIA_API_KEY="$NGC_API_KEY" \

--from-literal=TAVILY_API_KEY="$TAVILY_API_KEY" \

--from-literal=DB_USER_NAME="aiq" \

--from-literal=DB_USER_PASSWORD="$DB_USER_PASSWORD"

kubectl create secret docker-registry ngc-secret -n ns-aiq \

--docker-server=nvcr.io \

--docker-username='$oauthtoken' \

--docker-password="$NGC_API_KEY"

3d. NGC からチャートをプルしてインストールする

cd ../helm

helm pull https://helm.ngc.nvidia.com/nvidia/blueprint/charts/aiq2-web-2.0.0.tgz \

--username='$oauthtoken' \

--password="$NGC_API_KEY"

helm upgrade --install aiq aiq2-web-2.0.0.tgz \

-n ns-aiq \

--wait --timeout 10m \

-f values-oci-ngc.yaml

確認: kubectl get pods -n ns-aiq を実行すると、3〜5 分後に aiq-backend、aiq-frontend、aiq-postgres の各ポッドが Running 状態になっているはずです。

ステップ 4. AI-Q を開く

LB IP はすでにステップ 2 でシェルの環境変数として設定されています:

それ以降に新しいシェルを開いた場合は、Terraform から再エクスポートしてください:

cd ../terraform

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

ブラウザで http://<lb_public_ip> を開いてください。AI-Q のフロントエンドが表示されるはずです。

トラブルシューティング

terraform apply が OKE の作成時にクォータエラーで失敗します。コンパートメントの「クラスタ数」と「ノード数」に関するサービス制限を確認し、必要に応じてクォータの増額を依頼してください。

ポッドが ImagePullBackOff 状態で停止しています。イメージプルシークレットが作成されているか (kubectl get secret -n ns-aiq)、およびステップ 3c で kubectl create secret docker-registry ngc-secret コマンドを実行する際に NGC_API_KEY が正しく設定されていたかを確認してください。回転させるには、シークレットを削除して再作成し、その後 kubectl rollout restart deployment -n ns-aiq aiq-backend aiq-frontend を実行してください。

postgres ポッドが 2 分以上 Pending 状態のままです。Block Volume の PVC が動的にプロビジョニングされませんでした。kubectl describe pvc -n ns-aiq を実行してください。一般的な原因としては、OKE CSI ドライバーが実行されていない、デフォルトの StorageClass が存在しない、または Block Volume のクォータが不足していることが挙げられます。kubectl get sc でストレージクラスを確認し、コンパートメントの Block Volume サービス制限も確認してください。

Load Balancer IP が null として返されます。OCI では Terraform 実行後、ロードバランサーの完了に数分かかることがあります。terraform refresh を実行した後、再度 terraform output lb_public_ip を実行してください。

フロントエンドは読み込まれますが、クエリを実行すると 500 エラーが発生します。kubectl logs -n ns-aiq deploy/aiq-backend を実行してログを確認してください。最も一般的な原因は、ステップ 3c で作成した aiq-credentials シークレット内に NVIDIA_API_KEY または TAVILY_API_KEY が誤っているか欠落していることです。

さらに詳しく学ぶ

コスト: OKE ノードプールとロードバランサーは稼働している間も継続して課金されます。実験の間にはスタックを破棄するか、ノードプールをゼロまでスケールダウンしてください。

シークレット: Terraform はプロビジョニング時に NGC と Tavily のキーを OCI Vault に保存します（監査および災害復旧のため）。ただし、実行中のポッドはステップ 3c で作成した aiq-credentials Kubernetes シークレットからこれらの値を読み取ります。ローテーションを行うには、新しい値でそのシークレットを削除して再作成し、その後 kubectl rollout restart deployment -n ns-aiq aiq-backend を実行してください。terraform.tfvars のみを編集してもポッドに反映されません。

拡張性: 今回デプロイしたものはすべて YAML と NeMo Agent Toolkit プラグインシステムによって駆動されています。LLM の差し替え、サブエージェントの追加、新しい RAG バックエンドの接続は、書き直しではなく設定変更で実現できます。

AI-Q in OCI repo をクローンし、構築したソリューションと解決した課題を NVIDIA Developer Forum で共有してください。

原文を表示

Who this is for: Developers and platform engineers comfortable with Kubernetes, Terraform, and the shell, and who want to run AI-Q on OCI rather than on a laptop.

What you’ll learn: How AI-Q’s multi-agent architecture maps to OCI services, plus the exact commands to provision, deploy, and open the blueprint from start to finish.

Prerequisites

Make sure you have:

OCI tenancy access with a compartment you can deploy into, and enough service limits for:

OKE: One enhanced cluster and one node pool

Block Volume: At least 10 GB (dynamically provisioned by the OKE CSI driver for the in-cluster PostgreSQL)

Load Balancer: One flexible

Vault: One vault plus secrets

API keys:

NGC API key from build.nvidia.com, format nvapi-… used both as the NVIDIA inference key and to authenticate to the NGC container registry (nvcr.io).

Tavily API key from tavily.com, format tvly-…

Local tools: terraform 1.5 or later, kubectl 1.28 or later, helm 3.x or later, the oci CLI set up with your API signing key

Some basic knowledge of Kubernetes, Helm charts, Terraform, and the shell. LangChain or NeMo Agent Toolkit experience is nice to have, but not required.

Architecture overview

AI-Q uses a multi-agent design. An intent router reads each user query and sends it to the right workflow.

OCI deployment architecture

The Helm chart installs three workloads on OKE:

Backend (aiq-backend): A FastAPI-based agent server that runs the AI-Q workflow.

Frontend (aiq-frontend): A next.js web UI exposed over NodePort 30080.

PostgreSQL (aiq-postgres): An in-cluster database for the job store, checkpoints, and summaries.

Deployment steps

git clone https://github.com/oracle-samples/ai-q.git

cd ai-q/oke-samples/aiq-2.0

Total time: around 20 to 25 minutes. The full reference is in aiq-2.0/README.md.

Step 1. Configure Terraform variables

Copy the example file and edit it with your tenancy details:

cd deploy/terraform

cp terraform.tfvars.example terraform.tfvars

At minimum, set these variables in terraform.tfvars:

tenancy_ocid, compartment_id, region (for example us-chicago-1)

user_ocid, fingerprint, private_key_path (same values as your ~/.oci/config)

db_admin_password, used to bootstrap the in-cluster PostgreSQL, stored in OCI Vault.

nvidia_api_key, your NVIDIA NGC key from build.nvidia.com. Used for inference and to pull container images from nvcr.io.

tavily_api_key, your Tavily key from tavily.com, for web search.

Step 2. Create the infrastructure

Initialize the providers, check the plan, and apply:

terraform init

terraform plan

terraform apply

This takes about 10 to 15 minutes. Terraform creates the VCN, OKE cluster, Load Balancer, and the Vault with the NGC and Tavily API keys encrypted at rest.

Check: terraform output should show values for oke_cluster_id and lb_public_ip. If either is empty, run terraform apply again – the apply is safe to repeat.

Capture the two values you’ll need in the next step:

export OKE_CLUSTER_ID="$(terraform output -raw oke_cluster_id)"

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

Step 3. Install AI-Q from the NGC Helm chart

3a. Configure kubectl for the OKE cluster

oci ce cluster create-kubeconfig \

--cluster-id "$OKE_CLUSTER_ID" \

--file ~/.kube/config \

--region us-ashburn-1 \

--token-version 2.0.0 \

--kube-endpoint PUBLIC_ENDPOINT

kubectl get nodes

3b. Export the API keys

Reuse the same NGC and Tavily keys you put in terraform.tfvars. The NGC key does double duty. It’s both the inference key and the nvcr.io pull credential.

export NGC_API_KEY="nvapi-..."

export TAVILY_API_KEY="tvly-..."

export DB_USER_PASSWORD="<same value as db_admin_password in Step 1>"

3c. Create the namespace and secrets

kubectl create namespace ns-aiq --dry-run=client -o yaml | kubectl apply -f -

kubectl create secret generic aiq-credentials -n ns-aiq \

--from-literal=NVIDIA_API_KEY="$NGC_API_KEY" \

--from-literal=TAVILY_API_KEY="$TAVILY_API_KEY" \

--from-literal=DB_USER_NAME="aiq" \

--from-literal=DB_USER_PASSWORD="$DB_USER_PASSWORD"

kubectl create secret docker-registry ngc-secret -n ns-aiq \

--docker-server=nvcr.io \

--docker-username='$oauthtoken' \

--docker-password="$NGC_API_KEY"

3d. Pull and install the chart from NGC

cd ../helm

helm pull https://helm.ngc.nvidia.com/nvidia/blueprint/charts/aiq2-web-2.0.0.tgz \

--username='$oauthtoken' \

--password="$NGC_API_KEY"

helm upgrade --install aiq aiq2-web-2.0.0.tgz \

-n ns-aiq \

--wait --timeout 10m \

-f values-oci-ngc.yaml

Check: kubectl get pods -n ns-aiq should show aiq-backend, aiq-frontend, and aiq-postgres pods in Running state after 3 to 5 minutes.

Step 4. Open AI-Q

The LB IP is already in your shell from Step 2:

If you opened a new shell since then, re-export it from Terraform:

cd ../terraform

export LB_PUBLIC_IP="$(terraform output -raw lb_public_ip)"

Open http://<lb_public_ip> in your browser. You should see the AI-Q frontend.

Troubleshooting

terraform apply fails on OKE creation with a quota error. Check the service limits for your compartment for “Cluster count” and “Node count”, and ask for more quota if needed.

Pods stuck in ImagePullBackOff. Check that the image-pull secret was created (kubectl get secret -n ns-aiq) and that your NGC_API_KEY was correct when you ran the kubectl create secret docker-registry ngc-secret command in Step 3c. To rotate, delete the secret and re-create it, then kubectl rollout restart deployment -n ns-aiq aiq-backend aiq-frontend.

postgres pod stays in Pending for more than 2 minutes. The Block Volume PVC didn’t get dynamically provisioned. Run kubectl describe pvc -n ns-aiq. Typical causes are the OKE CSI driver not running, the default StorageClass missing, or insufficient Block Volume quota. Check the storage class with kubectl get sc and your compartment’s Block Volume service limit.

Load Balancer IP comes back as null. OCI can take a minute or two after Terraform to finish the LB. Run terraform refresh and then terraform output lb_public_ip again.

Frontend loads but queries return 500. Look at kubectl logs -n ns-aiq deploy/aiq-backend. The most common cause is a wrong or missing NVIDIA_API_KEY or TAVILY_API_KEY in the aiq-credentials secret you created in Step 3c.

Learn more

You now have a working AI-Q 2.0 deployment on OCI, and one command (terraform destroy) to remove it cleanly when you are done. A few things to keep in mind as you go further:

Cost: The OKE node pool and the Load Balancer keep costing you while they run. Destroy the stack between experiments, or scale the node pool down to zero.

Secrets: Terraform stores the NGC and Tavily keys in OCI Vault at provision time (for audit and disaster recovery), but the running pods read them from the aiq-credentials Kubernetes secret you created in Step 3c. To rotate, delete and re-create that secret with the new values, then kubectl rollout restart deployment -n ns-aiq aiq-backend. Editing terraform.tfvars alone won’t reach the pods.

Extensibility: Everything you just deployed is driven by YAML and by the NeMo Agent Toolkit plugin system. Swapping an LLM, adding a sub-agent, or plugging a new RAG backend is a configuration change, not a rewrite.

Clone the AI-Q in OCI repo and share on the NVIDIA Developer Forum the solution you built and what problem you solved.

この記事をシェア

LangChain Blog重要度42026年6月27日 02:13

Deep Agents との連携によるプロンプトキャッシング

AWS Machine Learning Blog重要度42026年6月26日 23:38

Stripe の金融コンプライアンス向け本番級 AI エージェント：AWS ベッドロックでの構築教訓

LangChain Blog2026年6月26日 00:04

SmithDB の全文検索用逆インデックス構築の仕組み

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

Oracle Cloud Infrastructure上で本番対応のNVIDIA AI-Qブループリントをデプロイする方法

キーポイント

影響分析

編集コメント

アーキテクチャ概要

OCI デプロイメントアーキテクチャ

デプロイ手順

トラブルシューティング

さらに詳しく学ぶ

Architecture overview

OCI deployment architecture

Deployment steps

Troubleshooting

Learn more

関連記事

Oracle Cloud Infrastructure上で本番対応のNVIDIA AI-Qブループリントをデプロイする方法

キーポイント

影響分析

編集コメント

アーキテクチャ概要

OCI デプロイメントアーキテクチャ

デプロイ手順

トラブルシューティング

さらに詳しく学ぶ

Architecture overview

OCI deployment architecture

Deployment steps

Troubleshooting

Learn more

関連記事