NVIDIA Developer Blog·2026年2月28日 02:30·約1分

NVIDIA GPUアクセラレーテッドエンドポイントを使用したQwen3.5 VLMによるネイティブマルチモーダルエージェントの開発

#マルチモーダルAI #大規模言語モデル #オープンソース #GPU最適化 #Alibaba #NVIDIA

TL;DR

NVIDIAの開発者ブログは、Alibabaが公開した約400Bパラメータのオープンソースマルチモーダルエージェント「Qwen3.5」シリーズを、NVIDIA GPUアクセラレーテッドエンドポイントで開発できることを紹介している。

AI深層分析2026年2月28日 03:41

重要/ 5段階

深度40%

キーポイント

Qwen3.5シリーズの公開

Alibabaがネイティブマルチモーダルエージェント向けに構築した新しいオープンソースモデルシリーズ「Qwen3.5」を発表した。

大規模パラメータモデル

シリーズ最初のモデルは約400Bパラメータのネイティブマルチモーダルモデルである。

NVIDIA GPUとの連携

このモデルはNVIDIA GPUアクセラレーテッドエンドポイントを使用して開発できることが紹介されている。

開発者向けリソース

NVIDIA開発者ブログを通じて、開発者向けに技術情報と実装方法が提供されている。

影響分析・編集コメントを表示

影響分析

この発表は、大規模マルチモーダルAIモデルのオープンソース化とハードウェア最適化の両面で業界に影響を与える。Alibabaの技術公開は競争を促進し、NVIDIAとの連携は実用化の加速につながる可能性が高い。

編集コメント

技術詳細が限定的だが、主要企業の連携とオープンソース化の動向として注目すべき発表。実装例やベンチマークが追加されれば評価はさらに高まる。

imageAlibabaは、ネイティブマルチモーダルエージェント向けに構築された新しいオープンソースのQwen3.5シリーズを発表しました。このシリーズの最初のモデルは、約400Bパラメータのネイティブ...

原文を表示

Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this series is a ~400B parameter native vision-language model (VLM) with reasoning built with a hybrid architecture of mixture of experts (MoE) and Gated Delta Networks. Qwen3.5 can understand and navigate user interfaces, which improves on the previous generation of VLMs.

Qwen3.5 is ideal for a variety of use cases, including:

Coding, including web development

Visual reasoning, including mobile and web interfaces

Chat applications

Complex search

Build with NVIDIA endpoints

You can start building with Qwen3.5 today with free access to GPU-accelerated endpoints on build.nvidia.com, powered by NVIDIA Blackwell GPUs. As part of the NVIDIA Developer Program, you can explore quickly in the browser, experiment with prompts, and even test the model with your own data to evaluate real-world performance.

You can also use the NVIDIA-hosted model through the API, free with registration in the NVIDIA Developer Program.

import requests

headers = {

"Authorization": "Bearer $NVIDIA_API_KEY",

"Accept": "application/json",

}

payload = {

"messages": [

{

"role": "user",

"content": ""

}

],

"model": "qwen/qwen3.5-397b-a17b",

"chat_template_kwargs": {

"thinking": True

},

"frequency_penalty": 0,

"max_tokens": 16384,

"presence_penalty": 0,

"stream": True,

"temperature": 1,

"top_p": 1

}

# re-use connections

session = requests.Session()

response = session.post(invoke_url, headers=headers, json=payload)

response.raise_for_status()

response_body = response.json()

print(response_body)

To take advantage of tool calling, simply define an array of OpenAI compatible tools to add to the chat completions tools parameter.

NVIDIA NIM makes it easy to take Qwen3.5 from development into production. Available as optimized, containerized inference microservices, NIM packages the model with the performance tuning, standardized APIs, and deployment flexibility enterprises need. Download and run it anywhere; on-premises, in the cloud, or across hybrid environments.

Customize with NVIDIA NeMo

While Qwen3.5 offers impressive “out-of-the-box” multimodal capabilities, the NVIDIA NeMo framework provides the essential tools to adapt it for specialized domain needs. Using the NeMo Automodel library, developers can fine-tune the Qwen3.5 397B-parameter architecture with high-throughput efficiency.

NeMo Automodel is a PyTorch-native training library that offers Day 0 Hugging Face support, enabling direct training on existing checkpoints without tedious model conversions. This facilitates rapid experimentation, whether performing full supervised fine-tuning (SFT) or using memory-efficient methods such as LoRA.

As a reference implementation guide, developers can leverage thetechnical tutorial on Medical Visual QA, which details how to fine-tune Qwen3.5 on radiological datasets. For massive scale, NeMo supports multinode Slurm and Kubernetes deployments, ensuring that even the largest MoE models are optimized for domain-specific reasoning and complex agentic workflows with minimal latency.

Get started with Qwen3.5

From data center deployments on NVIDIA Blackwell to NVIDIA NIM microservice for containerized deployment anywhere, NVIDIA offers solutions for your integration of Qwen3.5. To get started, check out the Qwen3.5 model page on Hugging Face and test Qwen3.5 on build.nvidia.com.

About the Authors

この記事をシェア

Simon Willison Blog2026年7月5日 10:00

sqlite-utils 4.0rc2、主にClaude Fable（約149.25ドル分）が執筆

MarkTechPost重要度42026年7月4日 15:32

NVIDIA AI が自己改善型ロボットフレームワーク「ASPIRE」を発表、LIBERO-Pro の長期タスクでゼロショット成功率 31% を達成

MarkTechPost重要度42026年7月3日 05:51

アリババのページエージェント：DOM を介して自然言語で Web インターフェースを制御する JavaScript 内蔵 GUI エージェント

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

NVIDIA Developer Blog·2026年2月28日 02:30·約1分

NVIDIA GPUアクセラレーテッドエンドポイントを使用したQwen3.5 VLMによるネイティブマルチモーダルエージェントの開発

#マルチモーダルAI #大規模言語モデル #オープンソース #GPU最適化 #Alibaba #NVIDIA

TL;DR

AI深層分析2026年2月28日 03:41

重要/ 5段階

深度40%

キーポイント

Qwen3.5シリーズの公開

Alibabaがネイティブマルチモーダルエージェント向けに構築した新しいオープンソースモデルシリーズ「Qwen3.5」を発表した。

大規模パラメータモデル

シリーズ最初のモデルは約400Bパラメータのネイティブマルチモーダルモデルである。

NVIDIA GPUとの連携

このモデルはNVIDIA GPUアクセラレーテッドエンドポイントを使用して開発できることが紹介されている。

開発者向けリソース

NVIDIA開発者ブログを通じて、開発者向けに技術情報と実装方法が提供されている。

影響分析・編集コメントを表示

影響分析

編集コメント

原文を表示

Qwen3.5 is ideal for a variety of use cases, including:

Coding, including web development

Visual reasoning, including mobile and web interfaces

Chat applications

Complex search

Build with NVIDIA endpoints

You can also use the NVIDIA-hosted model through the API, free with registration in the NVIDIA Developer Program.

import requests

headers = {

"Authorization": "Bearer $NVIDIA_API_KEY",

"Accept": "application/json",

}

payload = {

"messages": [

{

"role": "user",

"content": ""

}

],

"model": "qwen/qwen3.5-397b-a17b",

"chat_template_kwargs": {

"thinking": True

},

"frequency_penalty": 0,

"max_tokens": 16384,

"presence_penalty": 0,

"stream": True,

"temperature": 1,

"top_p": 1

}

# re-use connections

session = requests.Session()

response = session.post(invoke_url, headers=headers, json=payload)

response.raise_for_status()

response_body = response.json()

print(response_body)

To take advantage of tool calling, simply define an array of OpenAI compatible tools to add to the chat completions tools parameter.

Customize with NVIDIA NeMo

Get started with Qwen3.5

About the Authors

この記事をシェア

Simon Willison Blog2026年7月5日 10:00

sqlite-utils 4.0rc2、主にClaude Fable（約149.25ドル分）が執筆

MarkTechPost重要度42026年7月4日 15:32

NVIDIA AI が自己改善型ロボットフレームワーク「ASPIRE」を発表、LIBERO-Pro の長期タスクでゼロショット成功率 31% を達成

MarkTechPost重要度42026年7月3日 05:51

アリババのページエージェント：DOM を介して自然言語で Web インターフェースを制御する JavaScript 内蔵 GUI エージェント

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

NVIDIA GPUアクセラレーテッドエンドポイントを使用したQwen3.5 VLMによるネイティブマルチモーダルエージェントの開発

キーポイント

影響分析

編集コメント

Build with NVIDIA endpoints

Customize with NVIDIA NeMo

Get started with Qwen3.5

About the Authors

関連記事

NVIDIA GPUアクセラレーテッドエンドポイントを使用したQwen3.5 VLMによるネイティブマルチモーダルエージェントの開発

キーポイント

影響分析

編集コメント

Build with NVIDIA endpoints

Customize with NVIDIA NeMo

Get started with Qwen3.5

About the Authors

関連記事