Qwen3.7 プレビュー版が Arena に登場（1 分読了）

Oct 4, 2025

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking が登場！

より小型ながら、同じく強力なパフォーマンス 💪—Qwen3-VL のすべての機能を搭載しています。

🔧 アクティブパラメータはわずか 3B ですが、STEM（科学・技術・工学・数学）、VQA（視覚質問応答）、OCR（光学文字認識）、Video（動画）、Agent（エージェント）タスクなどにおいて GPT-5-Mini や Claude4-Sonnet に匹敵し、多くの場合それらを上回ります。

さらに詳しくはこちら

モバイルアプリの収益化

スレッド共有プラットフォーム

モバイルアプリ開発

Twitter マーケティングガイド

X コンテンツツール

ツイートスケジュール管理ツール

マイクロブログing

アプリストア最適化

そしてこれだけではありません：FP8 バージョンも公開します。また、大規模モデルである Qwen3-VL-235B-A22B の FP8 バージョンもリリースいたします！

ぜひお試しください。マルチモーダル AI アプリケーションの動作をさらに高速化しましょう！🧠🖼️

Github&Cookbooks： github.com/QwenLM/Qwen3-V…

Qwen3-VL-30B-A3B-Thinking のパフォーマンス

純粋なテキスト性能

ツイート 4 つを読む

link#goto">

2025 年 9 月 22 日

🚀 Qwen3-Omni のご紹介 — テキスト、画像、音声、動画を一つのモデルで統合し、モダリティ間のトレードオフを不要とした、世界初のネイティブなエンドツーエンド・オムニモーダル AI です！

🏆 22/36 の音声および AV ベンチマークで SOTA（State of the Art）達成

🌍 テキスト 119L / 入力音声 19L / 出力音声 10L

⚡ レイテンシ 211ms | 🎧 30 分間の音声理解

🎨 システムプロンプトによる完全なカスタマイズ可能

🔗 組み込みツール呼び出し機能

🎤 オープンソースのキャプショニングモデル（低ハルシネーション！）

🌟 オープンソース化されたものとは？

開発者が指示従事から創造的タスクまで多様なアプリケーションを探求できるよう、Qwen3-Omni-30B-A3B-Instruct、Qwen3-Omni-30B-A3B-Thinking、および Qwen3-Omni-30B-A3B-Captioner をオープンソース化しました。

今すぐお試しください 👇

💬 Qwen Chat: chat.qwen.ai/?models=qwen3-…

💻 GitHub: github.com/QwenLM/Qwen3-O…

🤗 HF Models: huggingface.co/collections/Qw…

🤖 MS Models:

🎬 デモ：huggingface.co/spaces/Qwen/Qw…

Qwen Chat の音声チャットおよびビデオチャット機能を使用して、Qwen3-Omni モデルを体験してください。

パフォーマンス

4 つのツイートを読む

link#goto">

2025 年 9 月 11 日

🚀 Qwen3-Next-80B-A3B の紹介 — 効率的な大規模言語モデル（LLM: Large Language Model）の未来がここに！

🔹 パラメータ数は 80B ですが、トークンごとに活性化されるのはわずか 3B → Qwen3-32B と比較してトレーニングコストは約 1/10、推論速度は約 10 倍（特に 32K 以上のコンテキスト長において！）

🔹 ハイブリッドアーキテクチャ：ゲート付きデルタネット（Gated DeltaNet）＋ゲート付きアテンション（Gated Attention）→ 高速性と想起性能の両方を最適化

🔹 超スパースな MoE（Mixture of Experts: エキスパート混合モデル）：512 のエキスパート、そのうち 10 がルーティングされ、1 つが共有される構成

🔹 マルチトークン予測 → 推測デコーディング（speculative decoding）をターボチャージ

🔹 パフォーマンスでは Qwen3-32B を上回り、推論能力と長文コンテキスト処理においては Qwen3-235B に匹敵

🧠 Qwen3-Next-80B-A3B-Instruct は、当社の 235B フラグシップモデルに迫る性能を発揮。

🧠 Qwen3-Next-80B-A3B-Thinking は Gemini-2.5-Flash-Thinking を上回るパフォーマンスを示します。

今すぐお試しください：chat.qwen.ai

ブログ：qwen.ai/blog?id=4074cc…

Huggingface：huggingface.co/collections/Qw…

ModelScope：modelscope.cn/collections/Qw…

Alibaba Cloud API: alibabacloud.com/help/en/model-…

事前トレーニング効率と推論速度

プリフィル段階：コンテキスト長が 4K の場合、スループットは Qwen3-32B と比較して約 7 倍高いです。32K を超えると、10 倍以上高速になります。

9 つのツイートを読む

原文を表示

More from @Alibaba_Qwen

link#goto">

Oct 4, 2025

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking are here!

Smaller size, same powerhouse performance 💪—packed with all the capabilities of Qwen3-VL!

🔧 With just 3B active params, it’s rivaling GPT-5-Mini & Claude4-Sonnet — and often beating them across STEM, VQA, OCR, Video, Agent tasks, and more.

Discover more

Mobile app monetization

Thread sharing platform

Mobile app development

Twitter marketing guide

X content tools

Tweet scheduling tool

Microblogging

App store optimization

And that’s not all: we’re also releasing an FP8 version, plus the FP8 of the massive Qwen3-VL-235B-A22B!

Try it out and make your multimodal AI applications run faster!🧠🖼️

Github&Cookbooks： github.com/QwenLM/Qwen3-V…

Performance of Qwen3-VL-30B-A3B-Thinking

Pure text performance

Read 4 tweets

link#goto">

Sep 22, 2025

🚀 Introducing Qwen3-Omni — the first natively end-to-end omni-modal AI unifying text, image, audio & video in one model — no modality trade-offs!

🏆 SOTA on 22/36 audio & AV benchmarks

🌍 119L text / 19L speech in / 10L speech out

⚡ 211ms latency | 🎧 30-min audio understanding

🎨 Fully customizable via system prompts

🔗 Built-in tool calling

🎤 Open-source Captioner model (low-hallucination!)

🌟 What’s Open-Sourced?

We’ve open-sourced Qwen3-Omni-30B-A3B-Instruct, Qwen3-Omni-30B-A3B-Thinking, and Qwen3-Omni-30B-A3B-Captioner, to empower developers to explore a variety of applications from instruction-following to creative tasks.

Try it now 👇

💬 Qwen Chat: chat.qwen.ai/?models=qwen3-…

💻 GitHub: github.com/QwenLM/Qwen3-O…

🤗 HF Models: huggingface.co/collections/Qw…

🤖 MS Models:

🎬 Demo: huggingface.co/spaces/Qwen/Qw…

Use the voice chat and video chat features on Qwen Chat to experience the Qwen3-Omni model.

Performance

Read 4 tweets

link#goto">

Sep 11, 2025

🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here!

🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!)

🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed & recall

🔹 Ultra-sparse MoE: 512 experts, 10 routed + 1 shared

🔹 Multi-Token Prediction → turbo-charged speculative decoding

🔹 Beats Qwen3-32B in perf, rivals Qwen3-235B in reasoning & long-context

🧠 Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship.

🧠 Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking.

Try it now: chat.qwen.ai

Blog: qwen.ai/blog?id=4074cc…

Huggingface: huggingface.co/collections/Qw…

Alibaba Cloud API: alibabacloud.com/help/en/model-…

Pretraining Efficiency & Inference Speed

Prefill Stage: At 4K context length, throughput is nearly 7x higher than Qwen3-32B. Beyond 32K, it’s over 10x faster.

Read 9 tweets

この記事をシェア

TLDR AI重要度42026年7月3日 09:00

ハードウェアのクーデター：なぜAIハードウェアが永遠に変化したのか（3分読了）

Seed2.0 モデルカード（72 分間の読了）

#Qwen3-VL #マルチモーダル LLM #FP8 量化 #軽量モデル #アリババクラウド

AI 向けラマヌジャン・チャレンジ（1 分読了）

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

TLDR AI·2026年5月19日 09:00·約4分

Qwen3.7 プレビュー版が Arena に登場（1 分読了）

TL;DR

AI深層分析2026年7月5日 01:11

重要/ 5段階

深度40%

キーポイント

高性能な軽量マルチモーダルモデルの登場

FP8 量化版の同時公開

大規模モデルである Qwen3-VL-235B-A22B の FP8 版も同時にリリースされ、マルチモーダル AI アプリケーションの実行速度と効率性が大幅に向上する。

広範なアクセス経路の提供

Qwen Chat、GitHub Cookbooks、Alibaba Cloud API、Blog、ModelScope、HuggingFace など、開発者がすぐに試せる多様なプラットフォームが用意されている。

Qwen3-Omni の発表

テキスト、画像、音声、動画を一つのモデルで統合する初のネイティブエンドツーエンドのオムニモーダル AI「Qwen3-Omni」が紹介されました。

オープンソース化されたモデル

指示従事からクリエイティブタスクまでを可能にするため、Qwen3-Omni-30B-A3B-Instruct、Thinking、Captioner の 3 つのモデルがオープンソース化されました。

高性能な評価と機能

Qwen3-Omni のマルチモーダル機能

Qwen Chat を介して音声チャットおよびビデオチャット機能を体験でき、Qwen3-Omni モデルのリアルタイム性を確認できる。

影響分析・編集コメントを表示

影響分析

編集コメント

@Alibaba_Qwen からのさらなる情報

link#goto">

Oct 4, 2025

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking が登場！

より小型ながら、同じく強力なパフォーマンス 💪—Qwen3-VL のすべての機能を搭載しています。

さらに詳しくはこちら

モバイルアプリの収益化

スレッド共有プラットフォーム

モバイルアプリ開発

Twitter マーケティングガイド

X コンテンツツール

ツイートスケジュール管理ツール

マイクロブログing

アプリストア最適化

そしてこれだけではありません：FP8 バージョンも公開します。また、大規模モデルである Qwen3-VL-235B-A22B の FP8 バージョンもリリースいたします！

ぜひお試しください。マルチモーダル AI アプリケーションの動作をさらに高速化しましょう！🧠🖼️

Github&Cookbooks： github.com/QwenLM/Qwen3-V…

Qwen3-VL-30B-A3B-Thinking のパフォーマンス

純粋なテキスト性能

ツイート 4 つを読む

link#goto">

2025 年 9 月 22 日

🏆 22/36 の音声および AV ベンチマークで SOTA（State of the Art）達成

🌍 テキスト 119L / 入力音声 19L / 出力音声 10L

⚡ レイテンシ 211ms | 🎧 30 分間の音声理解

🎨 システムプロンプトによる完全なカスタマイズ可能

🔗 組み込みツール呼び出し機能

🎤 オープンソースのキャプショニングモデル（低ハルシネーション！）

🌟 オープンソース化されたものとは？

今すぐお試しください 👇

💬 Qwen Chat: chat.qwen.ai/?models=qwen3-…

💻 GitHub: github.com/QwenLM/Qwen3-O…

🤗 HF Models: huggingface.co/collections/Qw…

🤖 MS Models:

🎬 デモ：huggingface.co/spaces/Qwen/Qw…

Qwen Chat の音声チャットおよびビデオチャット機能を使用して、Qwen3-Omni モデルを体験してください。

パフォーマンス

4 つのツイートを読む

link#goto">

2025 年 9 月 11 日

🚀 Qwen3-Next-80B-A3B の紹介 — 効率的な大規模言語モデル（LLM: Large Language Model）の未来がここに！

🔹 超スパースな MoE（Mixture of Experts: エキスパート混合モデル）：512 のエキスパート、そのうち 10 がルーティングされ、1 つが共有される構成

🔹 マルチトークン予測 → 推測デコーディング（speculative decoding）をターボチャージ

🔹 パフォーマンスでは Qwen3-32B を上回り、推論能力と長文コンテキスト処理においては Qwen3-235B に匹敵

🧠 Qwen3-Next-80B-A3B-Instruct は、当社の 235B フラグシップモデルに迫る性能を発揮。

🧠 Qwen3-Next-80B-A3B-Thinking は Gemini-2.5-Flash-Thinking を上回るパフォーマンスを示します。

今すぐお試しください：chat.qwen.ai

ブログ：qwen.ai/blog?id=4074cc…

Huggingface：huggingface.co/collections/Qw…

ModelScope：modelscope.cn/collections/Qw…

Alibaba Cloud API: alibabacloud.com/help/en/model-…

事前トレーニング効率と推論速度

プリフィル段階：コンテキスト長が 4K の場合、スループットは Qwen3-32B と比較して約 7 倍高いです。32K を超えると、10 倍以上高速になります。

9 つのツイートを読む

原文を表示

More from @Alibaba_Qwen

link#goto">

Oct 4, 2025

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking are here!

Smaller size, same powerhouse performance 💪—packed with all the capabilities of Qwen3-VL!

🔧 With just 3B active params, it’s rivaling GPT-5-Mini & Claude4-Sonnet — and often beating them across STEM, VQA, OCR, Video, Agent tasks, and more.

Discover more

Mobile app monetization

Thread sharing platform

Mobile app development

Twitter marketing guide

X content tools

Tweet scheduling tool

Microblogging

App store optimization

And that’s not all: we’re also releasing an FP8 version, plus the FP8 of the massive Qwen3-VL-235B-A22B!

Try it out and make your multimodal AI applications run faster!🧠🖼️

Github&Cookbooks： github.com/QwenLM/Qwen3-V…

Performance of Qwen3-VL-30B-A3B-Thinking

Pure text performance

Read 4 tweets

link#goto">

Sep 22, 2025

🚀 Introducing Qwen3-Omni — the first natively end-to-end omni-modal AI unifying text, image, audio & video in one model — no modality trade-offs!

🏆 SOTA on 22/36 audio & AV benchmarks

🌍 119L text / 19L speech in / 10L speech out

⚡ 211ms latency | 🎧 30-min audio understanding

🎨 Fully customizable via system prompts

🔗 Built-in tool calling

🎤 Open-source Captioner model (low-hallucination!)

🌟 What’s Open-Sourced?

Try it now 👇

💬 Qwen Chat: chat.qwen.ai/?models=qwen3-…

💻 GitHub: github.com/QwenLM/Qwen3-O…

🤗 HF Models: huggingface.co/collections/Qw…

🤖 MS Models:

🎬 Demo: huggingface.co/spaces/Qwen/Qw…

Use the voice chat and video chat features on Qwen Chat to experience the Qwen3-Omni model.

Performance

Read 4 tweets

link#goto">

Sep 11, 2025

🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here!

🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!)

🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed & recall

🔹 Ultra-sparse MoE: 512 experts, 10 routed + 1 shared

🔹 Multi-Token Prediction → turbo-charged speculative decoding

🔹 Beats Qwen3-32B in perf, rivals Qwen3-235B in reasoning & long-context

🧠 Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship.

🧠 Qwen3-Next-80B-A3B-Thinking outperforms Gemini-2.5-Flash-Thinking.

Try it now: chat.qwen.ai

Blog: qwen.ai/blog?id=4074cc…

Huggingface: huggingface.co/collections/Qw…

Alibaba Cloud API: alibabacloud.com/help/en/model-…

Pretraining Efficiency & Inference Speed

Prefill Stage: At 4K context length, throughput is nearly 7x higher than Qwen3-32B. Beyond 32K, it’s over 10x faster.

Read 9 tweets

この記事をシェア

TLDR AI重要度42026年7月3日 09:00

ハードウェアのクーデター：なぜAIハードウェアが永遠に変化したのか（3分読了）

Seed2.0 モデルカード（72 分間の読了）