Smol AI News·2026年5月18日 14:44·約13分

Google I/O 2026：Gemini 3.5 Flash、Omni、および Google のエージェントスタックを発表

#LLM #エージェント AI #マルチモーダル生成 #Gemini #Antigravity #Google DeepMind

TL;DR

Google I/O 2026 で発表された Gemini 3.5 Flash、Omni、および Antigravity エージェントスタックは、AI の処理規模と実用性を飛躍的に高め、業界の標準を再定義する重大な進展である。

AI深層分析2026年5月20日 10:02

最重要/ 5段階

深度40%

キーポイント

Gemini 3.5 Flash の本格稼働

コーディングやエージェントタスクに特化した新モデルで、100 万トークンのコンテキスト、65k の最大出力、4 つの思考レベルを即時 GA（一般利用）として提供。

Gemini Omni とマルチモーダル生成

推論能力とメディア生成を統合した新ファミリー「Omni」を発表し、テキスト・画像・動画・音声を入力として動画編集や生成を行う機能を Gemini アプリや API で展開。

Antigravity エージェントスタックの拡充

デスクトップ、CLI、SDK、API を跨ぐ包括的な「Antigravity 2.0」および管理型エージェント（Managed Agents）を公開し、開発者向けの統合環境を強化。

圧倒的なスケールと利用状況

月間処理トークン数が 3.2 兆を超え前年比 7 倍増となり、Gemini アプリの月間アクティブユーザーが 9 億人以上に達し、230 カ国以上で展開されていることを発表。

影響分析・編集コメントを表示

影響分析

この発表は、AI モデルが単なるチャットボットから、複雑なタスクを実行する自律型エージェントへと進化することを明確に示しており、開発現場におけるワークフローの根本的な変革を促すでしょう。特に動画生成と推論の統合や、大規模なトークン処理能力は、クリエイティブ産業とエンタープライズアプリケーションの両面で新たな基準を確立し、競合他社に対する Google の優位性を再確認させる内容です。

編集コメント

2026 年という未来の時点での発表内容ですが、Google が「AI エージェント」という概念を具体的な製品スタックとして実装し、スケール面で他社を圧倒している点は極めて示唆に富んでいます。特に動画生成と推論能力の融合は、次世代のクリエイティブツール市場を再定義する可能性を秘めています。

Google は再び戻ってきた！

2026 年 5 月 18 日〜19 日の AI ニュース。私たちは 12 のサブレッド、544 の Twitter を確認し、Discord はさらに調査していませんでした。AINews のウェブサイトでは過去のすべての号を検索できます。念のためお知らせしますが、AINews は現在 Latent Space のセクションの一部となっています。メールの頻度についてはオプトイン・オプトアウト可能です！

AI Twitter リキャップ

トップストーリー：Google I/O の振り返りと新しい Gemini の技術詳細**

何が起きたか

Google は I/O を通じて、Gemini を消費者向け AI サーフェスと開発者/エージェントプラットフォームの両方として再位置づけし、3 つの中核的な技術発表を行いました。それは、高速なエージェント処理やコーディングワークロード向けの「Gemini 3.5 Flash」、動画から始まるマルチモーダル生成・編集機能を持つ「Gemini Omni」、そしてデスクトップ、CLI、SDK、API にわたる広範な「Antigravity エージェントスタック」です。公式投稿では規模の拡大が強調され、Google は現在、月間 3.2 兆トークン以上を処理していると述べており、これは前年比で7 倍増（月間 480Tから）となっています。また、Gemini アプリは月間利用者数が 9 億人以上に達し、230 カ国以上、70 以上の言語で利用可能になっています (Google, Google, GeminiApp)。最も技術的に実質的な発表は「Gemini 3.5 Flash」で、Google はこれをこれまでにない最強のエージェント処理・コーディングモデルとして位置づけ、即時一般提供（GA）を開始しました。このモデルは100 万トークンのコンテキスト長、最大 65,000 トークンの出力制限、4 つの思考レベル（「最小/低/中/高」）、およびターン間の「思考の保持」機能を備えています (GoogleDeepMind, Google, _philschmid)。Google はこれに、Gemini の推論能力と生成メディア機能を組み合わせた新ファミリー「Gemini Omni」を追加しました。これは当初「Omni Flash」として提供され、テキスト・画像・動画・音声の入力を受け取り、Gemini、Flow、Shorts 内で動画編集や生成を行い、将来的には API でも利用可能になります (GoogleDeepMind, Google, GeminiApp)。これらのモデルを中心に、Google は「Antigravity 2.0」デスクトップ版のローンチまたは拡張、CLI、SDK、Gemini API 内のマネージドエージェント、検索ネイティブな生成 UI/コーディング機能、クラウド VM 上のバックグラウンドエージェントである「Gemini Spark」、そして Gemini アプリ、Workspace、コマース、メディアとの多数の統合を次々と発表・展開しました (Google, Google, Google)。

事実と意見

公式または第三者のベンチマークソースによって直接主張されている事実

Google は、現在月間 3.2 兆トクンを処理しており、1 年前の 480 兆トークンから増加したと述べています（Google）。

Google は、Gemini の月間ユーザー数が 9 億人以上に達していると述べています（Google）。

Google は、Gemini 3.5 Flash が本日、Gemini アプリ、Search AI モード、Gemini API、AI Studio、Antigravity、Android Studio、およびエンタープライズ向けサービスにおいて一般利用開始（GA）になったと述べています（Google, GeminiApp）。

Google は、Gemini 3.5 Flash が 100 万トークンのコンテキスト長を持ち、最大出力は 65k トークン、思考レベルが 4 つあり、ターン間での「思考の保持」機能も備えていると述べています（_philschmid）。

Google は、3.5 Flash が Terminal-Bench 2.1、GDPval-AA、および MCP Atlas において Gemini 3.1 Pro を上回っていると述べています（GoogleDeepMind, Google）。

Google は、3.5 Flash は同等の最先端モデルと比較して 4 倍高速で動作し、Antigravity では最大 12 倍高速であると述べています（Google, JeffDean）。

独立したベンチマーキング機関である Artificial Analysis は、Gemini 3.5 Flash がそのインテリジェンス指数で 55 のスコアを獲得し、Gemini 3 Flash より +9 向上しており、出力速度は秒間 280 トークン以上であると報告しています。また、MMMU-Pro で 84%、GDPval-AA Elo は 1656 を記録し、価格設定は入力/出力トークン 100 万あたりそれぞれ 1.50 ドル/9.00 ドルです。さらに、同社のスイートにおいて Gemini 3 Flash よりも 5.5 倍、Gemini 3.1 Pro よりも 75% コストがかかることも報告しています（ArtificialAnlys）。

Arena によると、Gemini 3.5 Flash は Text Arena で総合 9 位、Code Arena: Frontend でも 9 位を獲得し、スコアは 1507 です。これは Gemini 3 Flash より +70 の大幅な向上であり、その価格帯では最高スコアとなりました（arena）。

Google は、Gemini Omni Flash が現在有料ユーザー向けに Gemini/Flow で利用可能であり、今週から無料ユーザー向けに Shorts/Create で提供され、今後数週間以内に API 経由でも利用できるようになると述べています (Google)。

Google は、Spark は専用 Google Cloud 仮想マシン上で動作し、ユーザー端末がオフの状態で長時間実行可能なタスクを処理できると述べています (Google)。

Google は、Antigravity と Gemini 3.5 Flash を組み合わせたデモで、93 の並列サブエージェント、15,000 件以上のモデルリクエスト、26 億トークン、API クレジット 1,000 ドル未満を使用して、12 時間で機能する OS を構築したと主張しています (Google)。

Google は、Search が Antigravity と 3.5 Flash を活用して、その場でカスタムビジュアルツールやシミュレーションを生成すると述べています (Google)。

意見・解釈・懐疑論

肯定的な見方：「Google の復活」、「Flash モデルとしては驚異的な評価結果」、「AGI に向けた世界モデル」、Search と Antigravity は「息を呑むほど素晴らしい」など (kimmonismus, Kseniase_, demishassabis)。

中立的な慎重論：一部の投稿者は、自己報告されたベンチマークに基づき過度な期待を抱かないよう注意し、価格設定やパフォーマンスに関する懸念を指摘しています (scaling01, simonw)。

否定的・懐疑的な見方は主に以下に焦点を当てています：

以前の Flash モデルと比較して価格が上昇している点 (enricoros)。

GPT-5.5-medium の方が、エンドツーエンドでより賢く、安価で高速であるという比較 (scaling01, scaling01)。

ベンチマークにおける注意点として、TerminalBench-Hard が弱すぎたり、MRCR や ARC-AGI-2 が中程度であったり、Kimi/GLM との比較で一部のセグメントで明確に上回っていない点など (scaling01, teortaxesTex, scaling01)。

Gemini CLI と Antigravity CLI を巡る製品名付けや UX の混乱、およびより広範なインターフェース設計への批判（zachtratar, kchonyc, teortaxesTex）。

Gemini 3.5 Flash: メインの技術的リリース

公式な位置づけ

Google と DeepMind は繰り返し、Gemini 3.5 Flash を同社の絶対的なフラッグシップ知能モデルではなく、エージェントおよびコーディングに特化した同社最強のモデルとして位置づけています。これはパレートフロンティア（Pareto frontier）における高速・高実用性の領域に位置し、Google の製品と開発者のワークロードの両方を支えることを意図しています (GoogleDeepMind, Google, SundarPichai)。

技術詳細と数値指標

Google および関連投稿より：

GA（一般提供）は現在利用可能（Google）

1M トークンのコンテキストウィンドウ

最大出力トークン数 65k

思考レベル：最小、低、中（新しいデフォルト）、高

マルチターン会話における思考の保持

テキスト出力

入力モダリティ：テキスト、画像、動画、音声（Artificial Analysis による _philschmid, ArtificialAnlys）

プライシング：1M トークンあたり入力 $1.50、出力 $9.00、キャッシュされた入力は 90% オフ（scaling01, ArtificialAnlys）

公式ベンチマーク主張：

Terminal-Bench 2.1: 76.2%

GDPval-AA: 1656 Elo

MCP Atlas: 83.6%

Google が引用したマルチモーダル結果：エンジニアの投稿では MMMU-Pro で 83.6%、Artificial Analysis は 84% と報告し、同環境で記録された最高値（koraykv, ArtificialAnlys）

速度に関する主張：

Google のマーケティング主張：同等の最先端モデルと比較して 4 倍高速（Google）

Antigravity において、Google は最大で 12 倍高速であると述べています（JeffDean, scaling01）

Artificial Analysis では出力トークン速度が 280 トークン/秒を超えると観測されました

一部の議論では、Antigravity 固有の最適化されたサービングにおいて約 867 トークン/秒が報告されています（scaling01, scaling01）

サードパーティによる評価：

Artificial Analysis は、3.5 Flash が知能と速度のパレートフロンティアにおけるリーダーであると述べていますが、経済性は以前の Flash よりも明らかに劣っています。

知能指数 55

Gemini 3 Flash より +9 向上

ハルシネーション（幻覚）発生率が 61% に低下し、Gemini 3 Flash の全知設定と比較して 31 ポイント減少

GDPval-AA 1656 Elo

ベンチマークスイートでの実行コストは Gemini 3 Flash よりも 5.5 倍高価

同じスイートにおいて Gemini 3.1 Pro よりも 75% 高価（ArtificialAnlys）

アリーナ：

Text Arena で #9

Code Arena: Frontend で #9

スコアは 1507、Gemini-3 Flash より +70 向上

フロントエンドコーディング評価において、カテゴリ全体で Gemini 3.1 Pro を上回る（arena, arena）

示唆される影響

顕著な変化は、Google が「Flash」というラベルを、以前のサイクルでは単なる安価な軽量ティアではなく、デプロイメントに最適化されたハイエンド製品モデルとして記述されていたようなモデルに対して使用している点です。複数の投稿者がこれを直接指摘し、Flash はより高価になりつつあり、かつての Pro の領域を吸収しつつある可能性があると論じています（enricoros、simonw）。

最も強力な技術的シグナルは「絶対的なベンチマークで最高のモデル」ではなく、以下の点にあります。

実質的なエージェント機能の向上

極端な高速提供

プロダクト表面面への深い統合

サブエージェントと長期実行を軸としたツールリング

これにより、一部の競合他社が特定の第三者比較において調整済み知能の面で依然として勝っていても、3.5 Flash は戦略的に極めて重要となります。

Gemini Omni: 「あらゆる入力から何でも作成」を可能にする多モーダル生成・編集

Google が発表した内容

Google はGemini Omniを新ファミリーとして発表しました。これは Gemini の推論機能と世界知識を、Google の生成メディアスタックと融合させたもので、まずは動画の作成と編集から始まります。公式メッセージでは「あらゆる入力から何でも作成できる」と説明されていましたが、現在のロールアウト範囲はより限定的です。

入力：テキスト、画像、オーディオ、動画

初期出力の重点：動画

プロダクト利用可能先：Gemini アプリ、Flow、YouTube Shorts/Create、将来的には API

現在提供中のモデル：Gemini Omni Flash (GoogleDeepMind, Google, Google)

Google/DeepMind が主張する点:

より優れた世界理解力

より堅牢な物理法則の理解

シーンやキャラクターの一貫性を維持した複数回の編集機能

会話形式の編集でユーザーの動画素材を「再想像」できる能力 (Google, Google)

ロールアウトの詳細:

有料 Gemini ユーザー向けに、アプリおよび Flow で世界的に「本日」提供開始

YouTube Shorts/Create では、「今週から」無料で展開中

開発者・企業向けの API は今後数週間以内に提供予定 (Google, GeminiApp)

見解

サポート的見解：ユーザーと Google の従業員は、Omni を特に動画編集や一貫性において大きな品質の向上として捉え、高く評価しました（joshwoodward, fofrAI, osanseviero）。

戦略的解釈：複数の投稿者は、Omni がテキストやコードの競争だけでなく、世界モデルや具身的・物理的な事前知識への Google の投資を示す証拠であると位置づけました（demishassabis, jparkerholder, kimmonismus）。

-懐疑論：一部の UI や出力例は、「B ランクのビデオゲームインターフェース」のように見える、あるいは過度に磨き上げられたテンプレート風であるとして批判されました（teortaxesTex, shlomifruchter）。

コンテキスト

Omni は単なる「新たな動画モデル」としてよりも、Google が目指す統合の試みとして重要です：

多モーダル理解、

メディア編集、

世界との接地（グラウンディング）、

エージェントインターフェース、

そして最終的にはあらゆる入力・出力に対応する生成。

これは DeepMind の長年にわたる世界モデルのアジェンダと、Google の製品配布における優位性と整合しています。

反重力：コーディングアシスタントを超えた Google のエージェント OS

注目すべき I/O の主要テーマの一つは、Google がもはやエージェントをチャットモデルの薄いラッパーとして提示していないことです。Antigravity は実行基盤へと進化しています。

発表・拡大された機能

Antigravity 2.0 デスクトップアプリ：コア会話、アーティファクト、マルチエージェントオーケストレーションを備えたエージェントファーストのデスクトップ（Google, Google）

Antigravity CLI（Google, Google）

Antigravity SDK（Google）

Gemini API におけるマネージドエージェント：単一の API コールで、エージェントとホストされた Linux サンドボックス（サンドボックス環境）が提供されます。Bash/Python/Node への対応、ファイル操作、ブラウジング機能、カスタム Markdown で定義されたスキル、リポジトリ/GCS マウント（Google, GoogleAIStudio, _philschmid）をサポートしています。

AI Studio、Android、Firebase、Workspace、Web との統合（Google, Google）

AI Studio から Antigravity へのワンクリックエクスポート機能（Google）

AI Studio におけるネイティブ Android アプリ生成 / Antigravity での Android サポート対応（Google, AndroidDev）

技術的シグナル

Google 自身のデモは、並列サブエージェント、ホスト型実行環境、高頻度の反復ループ、そしてアーティファクト指向のワークフローに焦点を当てたものでした。Jeff Dean は 3.5 Flash を、「協力して動作するサブエージェントを展開し、高頻度の反復ループを実行し、大規模な現実世界の問題を解決するための強力なエンジン」として明確に説明しました（<a href="https://x.com/JeffDean/）

原文を表示

Google is so back!

AI News for 5/18/2026-5/19/2026. We checked 12 subreddits, 544 Twitters and no further Discords. AINews' website lets you search all past issues. As a reminder, AINews is now a section of Latent Space. You can opt in/out of email frequencies!

AI Twitter Recap

Top Story: Google I/O recap and new Gemini technical details

What happened

Google used I/O to reposition Gemini as both a consumer AI surface and a developer/agent platform, with three core technical announcements: Gemini 3.5 Flash for fast agentic/coding workloads, Gemini Omni for multimodal generation/editing starting with video, and a broader Antigravity agent stack spanning desktop/CLI/SDK/API. Official posts emphasized scale — Google says it now processes over 3.2 quadrillion tokens/month, up 7x YoY from 480T/month, while the Gemini app has 900M+ monthly users and is available in 230+ countries and 70+ languages (Google, Google, GeminiApp). The most technically substantive release was Gemini 3.5 Flash, framed by Google as its strongest agentic/coding model yet, GA immediately, with 1M-token context, 65k max output, 4 thinking levels (“minimal/low/medium/high”), and “thought preservation” across turns (GoogleDeepMind, Google, _philschmid). Google paired that with Gemini Omni, a new family combining Gemini reasoning with generative media, initially via Omni Flash, capable of taking text/image/video/audio inputs and producing video edits/generation in Gemini, Flow, Shorts, and later APIs (GoogleDeepMind, Google, GeminiApp). Around those models, Google launched or expanded Antigravity 2.0 desktop, CLI, SDK, Managed Agents in the Gemini API, Search-native generative UI/coding, Gemini Spark background agents on cloud VMs, and a long list of Gemini-app/Workspace/commerce/media integrations (Google, Google, Google).

Facts vs. opinions

Facts / directly claimed by official or third-party benchmark sources

Google says it now processes 3.2 quadrillion tokens/month, up from 480 trillion a year earlier (Google).

Google says Gemini has 900M+ monthly users (Google).

Google says Gemini 3.5 Flash is GA today across Gemini app, Search AI Mode, Gemini API, AI Studio, Antigravity, Android Studio, and enterprise surfaces (Google, GeminiApp).

Google says Gemini 3.5 Flash has 1M context, 65k max output, 4 thinking levels, and “thought preservation” across turns ( _philschmid).

Google says 3.5 Flash beats Gemini 3.1 Pro on Terminal-Bench 2.1, GDPval-AA, and MCP Atlas (GoogleDeepMind, Google).

Google says 3.5 Flash runs 4x faster than comparable frontier models, and up to 12x faster in Antigravity (Google, JeffDean).

Independent benchmarker Artificial Analysis reports Gemini 3.5 Flash scores 55 on its Intelligence Index, +9 vs Gemini 3 Flash, at >280 output tok/s, with MMMU-Pro 84%, GDPval-AA Elo 1656, and pricing of $1.50 / $9.00 per 1M input/output tokens; it also reports the model is 5.5x costlier to run than Gemini 3 Flash on its suite and 75% costlier than Gemini 3.1 Pro (ArtificialAnlys).

Arena reports Gemini 3.5 Flash reached #9 overall in Text Arena and #9 in Code Arena: Frontend, scoring 1507, a +70 jump over Gemini 3 Flash, and becoming the top score in its price tier (arena).

Google says Gemini Omni Flash is available in Gemini/Flow today for paid users, in Shorts/Create starting this week for free, and via APIs in coming weeks (Google).

Google says Spark runs on dedicated Google Cloud virtual machines, allowing long-running tasks while user devices are closed (Google).

Google claims an Antigravity + Gemini 3.5 Flash demo built a functioning OS in 12 hours using 93 parallel sub-agents, 15k+ model requests, 2.6B tokens, and < $1K API credits (Google).

Google says Search will use Antigravity + 3.5 Flash to generate custom visual tools/simulations on the fly (Google).

Opinions / interpretations / skepticism

Positive takes: “Google is back,” “insane evals for a Flash model,” “world model towards AGI,” “mind blowing” for Search + Antigravity, etc. (kimmonismus, Kseniase_, demishassabis).

Neutral caution: some posters explicitly avoided overhyping due to self-reported benchmarks and noted pricing/perf concerns (scaling01, simonw).

Negative/skeptical takes focused on:

Price inflation relative to earlier Flash models (enricoros).

Comparisons where GPT-5.5-medium may be smarter/cheaper/faster end-to-end (scaling01, scaling01).

Benchmark caveats such as weak TerminalBench-Hard, mediocre MRCR / ARC-AGI-2, or not clearly beating Kimi/GLM on some slices (scaling01, teortaxesTex, scaling01).

Product naming/UX confusion around Gemini CLI vs Antigravity CLI and broader interface design criticism (zachtratar, kchonyc, teortaxesTex).

Gemini 3.5 Flash: the main technical release

Official positioning

Google/DeepMind repeatedly described Gemini 3.5 Flash as the company’s strongest model yet for agents and coding, not its absolute flagship intelligence model. It’s meant to sit on the high-speed, high-utility part of the Pareto frontier, powering both Google products and developer workloads (GoogleDeepMind, Google, SundarPichai).

Technical details and metrics

From Google and affiliated posts:

GA availability now (Google)

1M token context window

65k max output tokens

Thinking levels: minimal, low, medium (new default), high

Thought preservation across multi-turn conversations

Text output

Input modalities: text, image, video, speech per Artificial Analysis ( _philschmid, ArtificialAnlys)

Pricing: $1.50 / 1M input, $9.00 / 1M output, 90% discount on cached input (scaling01, ArtificialAnlys)

Official benchmark claims:

Terminal-Bench 2.1: 76.2%

GDPval-AA: 1656 Elo

MCP Atlas: 83.6%

Google-quoted multimodal result: MMMU-Pro 83.6% in one engineer post; Artificial Analysis reports 84%, highest recorded on its setup (koraykv, ArtificialAnlys)

Speed claims:

Google marketing claim: 4x faster than comparable frontier models (Google)

In Antigravity, Google says it is up to 12x faster (JeffDean, scaling01)

Artificial Analysis observed >280 output tok/s

Some discussion cited ~867 tok/s in Antigravity-specific optimized serving (scaling01, scaling01)

Third-party evaluation:

Artificial Analysis says 3.5 Flash is the leader on the intelligence-vs-speed Pareto frontier, but the economics are notably worse than prior Flash:

Intelligence Index 55

+9 over Gemini 3 Flash

Hallucination rate reduced to 61%, a 31-point drop vs Gemini 3 Flash on its omniscience setup

GDPval-AA 1656 Elo

5.5x costlier than Gemini 3 Flash to run on its benchmark suite

75% costlier than Gemini 3.1 Pro on the same suite (ArtificialAnlys)

Arena:

#9 Text Arena

#9 Code Arena: Frontend

1507 score, +70 over Gemini-3 Flash

Better than Gemini 3.1 Pro across categories in its frontend coding eval (arena, arena)

Implications

The notable shift is that Google appears to be using a “Flash” label for a model that, in prior cycles, would have been described more like a high-end product model optimized for deployment rather than simply a cheap lightweight tier. Several posters called this out directly, arguing Flash is becoming more expensive and possibly absorbing former Pro territory (enricoros, simonw).

The strongest technical signal is not “best absolute benchmark model,” but:

material agentic gains

extreme serving speed

deep integration into product surfaces

tooling built around subagents and long-horizon execution

That makes 3.5 Flash strategically important even if some competitors still win on raw price-adjusted intelligence in certain third-party comparisons.

Gemini Omni: multimodal generation/editing as “create anything from any input”

What Google announced

Google introduced Gemini Omni as a new family merging Gemini reasoning/world knowledge with Google’s generative media stack, starting with video creation and editing. Official messaging described it as “create anything from any input,” but current rollout is narrower:

Inputs: text, images, audio, video

Initial output emphasis: video

Product availability: Gemini app, Flow, YouTube Shorts/Create, later APIs

Current shipping model: Gemini Omni Flash (GoogleDeepMind, Google, Google)

Google/DeepMind claims:

Better world understanding

More robust physics

Multi-turn editing where scene/character consistency is retained

Ability to “reimagine” user video footage with conversational edits (Google, Google)

Rollout specifics:

Paid Gemini users globally in app/Flow “today”

YouTube Shorts/Create rolling out “starting this week” at no cost

APIs for developers/enterprise in coming weeks (Google, GeminiApp)

Perspectives

Supportive: users and Google employees described Omni as a major quality step, especially for video editing and consistency (joshwoodward, fofrAI, osanseviero).

Strategic interpretation: several posters framed Omni as evidence Google is investing in world models and embodied/physical priors, not just text/code competition (demishassabis, jparkerholder, kimmonismus).

Skepticism: some UI/output examples drew criticism for looking like “B-tier video game interface” or too polished/template-like (teortaxesTex, shlomifruchter).

Context

Omni matters less as “yet another video model” and more as Google’s attempt to unify:

multimodal understanding,

media editing,

world grounding,

agent interfaces,

and eventually any-input/any-output generation.

This aligns with DeepMind’s long-running world-model agenda and Google’s product distribution advantage.

Antigravity: Google’s agent OS, not just a coding assistant

A major underappreciated I/O theme was that Google is no longer presenting agents as a thin wrapper around a chat model. Antigravity is becoming the execution substrate.

What launched / expanded

Antigravity 2.0 desktop app: agent-first desktop with core conversations, artifacts, multi-agent orchestration (Google, Google)

Antigravity CLI (Google, Google)

Antigravity SDK (Google)

Managed Agents in Gemini API: single API call gives an agent plus hosted Linux sandbox; supports Bash/Python/Node, files, browsing, custom markdown-defined skills, repo/GCS mounts (Google, GoogleAIStudio, _philschmid)

Integrations with AI Studio, Android, Firebase, Workspace, web (Google, Google)

One-click export from AI Studio to Antigravity (Google)

Native Android app generation in AI Studio / Android support in Antigravity (Google, AndroidDev)

Technical signaling

Google’s own demos centered on parallel sub-agents, hosted execution, high-frequency iterative loops, and artifact-oriented workflows. Jeff Dean explicitly described 3.5 Flash as a strong engine for “deploy sub-agents that collaborate, run high-frequency iterative loops, and solve real-world problems at scale” (<a href="https://x.com/JeffDean/

この記事をシェア

GitHub Changelog重要度42026年7月3日 08:07

GitHub Copilot における Gemini 2.5 Pro および Gemini 3 Flash の利用終了発表

Simon Willison Blog2026年7月5日 10:00

sqlite-utils 4.0rc2、主にClaude Fable（約149.25ドル分）が執筆

TechCrunch AI2026年7月5日 00:51

ミストラル AI とは？OpenAI の競合企業に関する全知識

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

Smol AI News·2026年5月18日 14:44·約13分

Google I/O 2026：Gemini 3.5 Flash、Omni、および Google のエージェントスタックを発表

#LLM #エージェント AI #マルチモーダル生成 #Gemini #Antigravity #Google DeepMind

TL;DR

AI深層分析2026年5月20日 10:02

最重要/ 5段階

深度40%

キーポイント

Gemini 3.5 Flash の本格稼働

Gemini Omni とマルチモーダル生成

Antigravity エージェントスタックの拡充

デスクトップ、CLI、SDK、API を跨ぐ包括的な「Antigravity 2.0」および管理型エージェント（Managed Agents）を公開し、開発者向けの統合環境を強化。

圧倒的なスケールと利用状況

影響分析・編集コメントを表示

影響分析

編集コメント

Google は再び戻ってきた！

AI Twitter リキャップ

トップストーリー：Google I/O の振り返りと新しい Gemini の技術詳細**

何が起きたか

事実と意見

公式または第三者のベンチマークソースによって直接主張されている事実

Google は、現在月間 3.2 兆トクンを処理しており、1 年前の 480 兆トークンから増加したと述べています（Google）。

Google は、Gemini の月間ユーザー数が 9 億人以上に達していると述べています（Google）。

Google は、Gemini 3.5 Flash が本日、Gemini アプリ、Search AI モード、Gemini API、AI Studio、Antigravity、Android Studio、およびエンタープライズ向けサービスにおいて一般利用開始（GA）になったと述べています（Google, GeminiApp）。

Google は、Gemini 3.5 Flash が 100 万トークンのコンテキスト長を持ち、最大出力は 65k トークン、思考レベルが 4 つあり、ターン間での「思考の保持」機能も備えていると述べています（_philschmid）。

Google は、3.5 Flash が Terminal-Bench 2.1、GDPval-AA、および MCP Atlas において Gemini 3.1 Pro を上回っていると述べています（GoogleDeepMind, Google）。

Google は、3.5 Flash は同等の最先端モデルと比較して 4 倍高速で動作し、Antigravity では最大 12 倍高速であると述べています（Google, JeffDean）。

独立したベンチマーキング機関である Artificial Analysis は、Gemini 3.5 Flash がそのインテリジェンス指数で 55 のスコアを獲得し、Gemini 3 Flash より +9 向上しており、出力速度は秒間 280 トークン以上であると報告しています。また、MMMU-Pro で 84%、GDPval-AA Elo は 1656 を記録し、価格設定は入力/出力トークン 100 万あたりそれぞれ 1.50 ドル/9.00 ドルです。さらに、同社のスイートにおいて Gemini 3 Flash よりも 5.5 倍、Gemini 3.1 Pro よりも 75% コストがかかることも報告しています（ArtificialAnlys）。

Arena によると、Gemini 3.5 Flash は Text Arena で総合 9 位、Code Arena: Frontend でも 9 位を獲得し、スコアは 1507 です。これは Gemini 3 Flash より +70 の大幅な向上であり、その価格帯では最高スコアとなりました（arena）。

Google は、Gemini Omni Flash が現在有料ユーザー向けに Gemini/Flow で利用可能であり、今週から無料ユーザー向けに Shorts/Create で提供され、今後数週間以内に API 経由でも利用できるようになると述べています (Google)。

Google は、Spark は専用 Google Cloud 仮想マシン上で動作し、ユーザー端末がオフの状態で長時間実行可能なタスクを処理できると述べています (Google)。

Google は、Antigravity と Gemini 3.5 Flash を組み合わせたデモで、93 の並列サブエージェント、15,000 件以上のモデルリクエスト、26 億トークン、API クレジット 1,000 ドル未満を使用して、12 時間で機能する OS を構築したと主張しています (Google)。

Google は、Search が Antigravity と 3.5 Flash を活用して、その場でカスタムビジュアルツールやシミュレーションを生成すると述べています (Google)。

意見・解釈・懐疑論

肯定的な見方：「Google の復活」、「Flash モデルとしては驚異的な評価結果」、「AGI に向けた世界モデル」、Search と Antigravity は「息を呑むほど素晴らしい」など (kimmonismus, Kseniase_, demishassabis)。

中立的な慎重論：一部の投稿者は、自己報告されたベンチマークに基づき過度な期待を抱かないよう注意し、価格設定やパフォーマンスに関する懸念を指摘しています (scaling01, simonw)。

否定的・懐疑的な見方は主に以下に焦点を当てています：

以前の Flash モデルと比較して価格が上昇している点 (enricoros)。

GPT-5.5-medium の方が、エンドツーエンドでより賢く、安価で高速であるという比較 (scaling01, scaling01)。

ベンチマークにおける注意点として、TerminalBench-Hard が弱すぎたり、MRCR や ARC-AGI-2 が中程度であったり、Kimi/GLM との比較で一部のセグメントで明確に上回っていない点など (scaling01, teortaxesTex, scaling01)。

Gemini CLI と Antigravity CLI を巡る製品名付けや UX の混乱、およびより広範なインターフェース設計への批判（zachtratar, kchonyc, teortaxesTex）。

Gemini 3.5 Flash: メインの技術的リリース

公式な位置づけ

技術詳細と数値指標

Google および関連投稿より：

GA（一般提供）は現在利用可能（Google）

1M トークンのコンテキストウィンドウ

最大出力トークン数 65k

思考レベル：最小、低、中（新しいデフォルト）、高

マルチターン会話における思考の保持

テキスト出力

入力モダリティ：テキスト、画像、動画、音声（Artificial Analysis による _philschmid, ArtificialAnlys）

プライシング：1M トークンあたり入力 $1.50、出力 $9.00、キャッシュされた入力は 90% オフ（scaling01, ArtificialAnlys）

公式ベンチマーク主張：

Terminal-Bench 2.1: 76.2%

GDPval-AA: 1656 Elo

MCP Atlas: 83.6%

Google が引用したマルチモーダル結果：エンジニアの投稿では MMMU-Pro で 83.6%、Artificial Analysis は 84% と報告し、同環境で記録された最高値（koraykv, ArtificialAnlys）

速度に関する主張：

Google のマーケティング主張：同等の最先端モデルと比較して 4 倍高速（Google）

Antigravity において、Google は最大で 12 倍高速であると述べています（JeffDean, scaling01）

Artificial Analysis では出力トークン速度が 280 トークン/秒を超えると観測されました

一部の議論では、Antigravity 固有の最適化されたサービングにおいて約 867 トークン/秒が報告されています（scaling01, scaling01）

サードパーティによる評価：

Artificial Analysis は、3.5 Flash が知能と速度のパレートフロンティアにおけるリーダーであると述べていますが、経済性は以前の Flash よりも明らかに劣っています。

知能指数 55

Gemini 3 Flash より +9 向上

ハルシネーション（幻覚）発生率が 61% に低下し、Gemini 3 Flash の全知設定と比較して 31 ポイント減少

GDPval-AA 1656 Elo

ベンチマークスイートでの実行コストは Gemini 3 Flash よりも 5.5 倍高価

同じスイートにおいて Gemini 3.1 Pro よりも 75% 高価（ArtificialAnlys）

アリーナ：

Text Arena で #9

Code Arena: Frontend で #9

スコアは 1507、Gemini-3 Flash より +70 向上

フロントエンドコーディング評価において、カテゴリ全体で Gemini 3.1 Pro を上回る（arena, arena）

示唆される影響

最も強力な技術的シグナルは「絶対的なベンチマークで最高のモデル」ではなく、以下の点にあります。

実質的なエージェント機能の向上

極端な高速提供

プロダクト表面面への深い統合

サブエージェントと長期実行を軸としたツールリング

これにより、一部の競合他社が特定の第三者比較において調整済み知能の面で依然として勝っていても、3.5 Flash は戦略的に極めて重要となります。

Gemini Omni: 「あらゆる入力から何でも作成」を可能にする多モーダル生成・編集

Google が発表した内容

入力：テキスト、画像、オーディオ、動画

初期出力の重点：動画

プロダクト利用可能先：Gemini アプリ、Flow、YouTube Shorts/Create、将来的には API

現在提供中のモデル：Gemini Omni Flash (GoogleDeepMind, Google, Google)

Google/DeepMind が主張する点:

より優れた世界理解力

より堅牢な物理法則の理解

シーンやキャラクターの一貫性を維持した複数回の編集機能

会話形式の編集でユーザーの動画素材を「再想像」できる能力 (Google, Google)

ロールアウトの詳細:

有料 Gemini ユーザー向けに、アプリおよび Flow で世界的に「本日」提供開始

YouTube Shorts/Create では、「今週から」無料で展開中

開発者・企業向けの API は今後数週間以内に提供予定 (Google, GeminiApp)

見解

サポート的見解：ユーザーと Google の従業員は、Omni を特に動画編集や一貫性において大きな品質の向上として捉え、高く評価しました（joshwoodward, fofrAI, osanseviero）。

戦略的解釈：複数の投稿者は、Omni がテキストやコードの競争だけでなく、世界モデルや具身的・物理的な事前知識への Google の投資を示す証拠であると位置づけました（demishassabis, jparkerholder, kimmonismus）。

コンテキスト

Omni は単なる「新たな動画モデル」としてよりも、Google が目指す統合の試みとして重要です：

多モーダル理解、

メディア編集、

世界との接地（グラウンディング）、

エージェントインターフェース、

そして最終的にはあらゆる入力・出力に対応する生成。

これは DeepMind の長年にわたる世界モデルのアジェンダと、Google の製品配布における優位性と整合しています。

反重力：コーディングアシスタントを超えた Google のエージェント OS

発表・拡大された機能

Antigravity 2.0 デスクトップアプリ：コア会話、アーティファクト、マルチエージェントオーケストレーションを備えたエージェントファーストのデスクトップ（Google, Google）

Antigravity CLI（Google, Google）

Antigravity SDK（Google）

Gemini API におけるマネージドエージェント：単一の API コールで、エージェントとホストされた Linux サンドボックス（サンドボックス環境）が提供されます。Bash/Python/Node への対応、ファイル操作、ブラウジング機能、カスタム Markdown で定義されたスキル、リポジトリ/GCS マウント（Google, GoogleAIStudio, _philschmid）をサポートしています。

AI Studio、Android、Firebase、Workspace、Web との統合（Google, Google）

AI Studio から Antigravity へのワンクリックエクスポート機能（Google）

AI Studio におけるネイティブ Android アプリ生成 / Antigravity での Android サポート対応（Google, AndroidDev）

技術的シグナル

原文を表示

Google is so back!

AI News for 5/18/2026-5/19/2026. We checked 12 subreddits, 544 Twitters and no further Discords. AINews' website lets you search all past issues. As a reminder, AINews is now a section of Latent Space. You can opt in/out of email frequencies!

AI Twitter Recap

Top Story: Google I/O recap and new Gemini technical details

What happened

Facts vs. opinions

Facts / directly claimed by official or third-party benchmark sources

Google says it now processes 3.2 quadrillion tokens/month, up from 480 trillion a year earlier (Google).

Google says Gemini has 900M+ monthly users (Google).

Google says Gemini 3.5 Flash is GA today across Gemini app, Search AI Mode, Gemini API, AI Studio, Antigravity, Android Studio, and enterprise surfaces (Google, GeminiApp).

Google says Gemini 3.5 Flash has 1M context, 65k max output, 4 thinking levels, and “thought preservation” across turns ( _philschmid).

Google says 3.5 Flash beats Gemini 3.1 Pro on Terminal-Bench 2.1, GDPval-AA, and MCP Atlas (GoogleDeepMind, Google).

Google says 3.5 Flash runs 4x faster than comparable frontier models, and up to 12x faster in Antigravity (Google, JeffDean).

Independent benchmarker Artificial Analysis reports Gemini 3.5 Flash scores 55 on its Intelligence Index, +9 vs Gemini 3 Flash, at >280 output tok/s, with MMMU-Pro 84%, GDPval-AA Elo 1656, and pricing of $1.50 / $9.00 per 1M input/output tokens; it also reports the model is 5.5x costlier to run than Gemini 3 Flash on its suite and 75% costlier than Gemini 3.1 Pro (ArtificialAnlys).

Arena reports Gemini 3.5 Flash reached #9 overall in Text Arena and #9 in Code Arena: Frontend, scoring 1507, a +70 jump over Gemini 3 Flash, and becoming the top score in its price tier (arena).

Google says Gemini Omni Flash is available in Gemini/Flow today for paid users, in Shorts/Create starting this week for free, and via APIs in coming weeks (Google).

Google says Spark runs on dedicated Google Cloud virtual machines, allowing long-running tasks while user devices are closed (Google).

Google claims an Antigravity + Gemini 3.5 Flash demo built a functioning OS in 12 hours using 93 parallel sub-agents, 15k+ model requests, 2.6B tokens, and < $1K API credits (Google).

Google says Search will use Antigravity + 3.5 Flash to generate custom visual tools/simulations on the fly (Google).

Opinions / interpretations / skepticism

Positive takes: “Google is back,” “insane evals for a Flash model,” “world model towards AGI,” “mind blowing” for Search + Antigravity, etc. (kimmonismus, Kseniase_, demishassabis).

Neutral caution: some posters explicitly avoided overhyping due to self-reported benchmarks and noted pricing/perf concerns (scaling01, simonw).

Negative/skeptical takes focused on:

Price inflation relative to earlier Flash models (enricoros).

Comparisons where GPT-5.5-medium may be smarter/cheaper/faster end-to-end (scaling01, scaling01).

Benchmark caveats such as weak TerminalBench-Hard, mediocre MRCR / ARC-AGI-2, or not clearly beating Kimi/GLM on some slices (scaling01, teortaxesTex, scaling01).

Product naming/UX confusion around Gemini CLI vs Antigravity CLI and broader interface design criticism (zachtratar, kchonyc, teortaxesTex).

Gemini 3.5 Flash: the main technical release

Official positioning

Technical details and metrics

From Google and affiliated posts:

GA availability now (Google)

1M token context window

65k max output tokens

Thinking levels: minimal, low, medium (new default), high

Thought preservation across multi-turn conversations

Text output

Input modalities: text, image, video, speech per Artificial Analysis ( _philschmid, ArtificialAnlys)

Pricing: $1.50 / 1M input, $9.00 / 1M output, 90% discount on cached input (scaling01, ArtificialAnlys)

Official benchmark claims:

Terminal-Bench 2.1: 76.2%

GDPval-AA: 1656 Elo

MCP Atlas: 83.6%

Google-quoted multimodal result: MMMU-Pro 83.6% in one engineer post; Artificial Analysis reports 84%, highest recorded on its setup (koraykv, ArtificialAnlys)

Speed claims:

Google marketing claim: 4x faster than comparable frontier models (Google)

In Antigravity, Google says it is up to 12x faster (JeffDean, scaling01)

Artificial Analysis observed >280 output tok/s

Some discussion cited ~867 tok/s in Antigravity-specific optimized serving (scaling01, scaling01)

Third-party evaluation:

Artificial Analysis says 3.5 Flash is the leader on the intelligence-vs-speed Pareto frontier, but the economics are notably worse than prior Flash:

Intelligence Index 55

+9 over Gemini 3 Flash

Hallucination rate reduced to 61%, a 31-point drop vs Gemini 3 Flash on its omniscience setup

GDPval-AA 1656 Elo

5.5x costlier than Gemini 3 Flash to run on its benchmark suite

75% costlier than Gemini 3.1 Pro on the same suite (ArtificialAnlys)

Arena:

#9 Text Arena

#9 Code Arena: Frontend

1507 score, +70 over Gemini-3 Flash

Better than Gemini 3.1 Pro across categories in its frontend coding eval (arena, arena)

Implications

The strongest technical signal is not “best absolute benchmark model,” but:

material agentic gains

extreme serving speed

deep integration into product surfaces

tooling built around subagents and long-horizon execution

That makes 3.5 Flash strategically important even if some competitors still win on raw price-adjusted intelligence in certain third-party comparisons.

Gemini Omni: multimodal generation/editing as “create anything from any input”

What Google announced

Inputs: text, images, audio, video

Initial output emphasis: video

Product availability: Gemini app, Flow, YouTube Shorts/Create, later APIs

Current shipping model: Gemini Omni Flash (GoogleDeepMind, Google, Google)

Google/DeepMind claims:

Better world understanding

More robust physics

Multi-turn editing where scene/character consistency is retained

Ability to “reimagine” user video footage with conversational edits (Google, Google)

Rollout specifics:

Paid Gemini users globally in app/Flow “today”

YouTube Shorts/Create rolling out “starting this week” at no cost

APIs for developers/enterprise in coming weeks (Google, GeminiApp)

Perspectives

Supportive: users and Google employees described Omni as a major quality step, especially for video editing and consistency (joshwoodward, fofrAI, osanseviero).

Strategic interpretation: several posters framed Omni as evidence Google is investing in world models and embodied/physical priors, not just text/code competition (demishassabis, jparkerholder, kimmonismus).

Skepticism: some UI/output examples drew criticism for looking like “B-tier video game interface” or too polished/template-like (teortaxesTex, shlomifruchter).

Context

Omni matters less as “yet another video model” and more as Google’s attempt to unify:

multimodal understanding,

media editing,

world grounding,

agent interfaces,

and eventually any-input/any-output generation.

This aligns with DeepMind’s long-running world-model agenda and Google’s product distribution advantage.

Antigravity: Google’s agent OS, not just a coding assistant

A major underappreciated I/O theme was that Google is no longer presenting agents as a thin wrapper around a chat model. Antigravity is becoming the execution substrate.

What launched / expanded

Antigravity 2.0 desktop app: agent-first desktop with core conversations, artifacts, multi-agent orchestration (Google, Google)

Antigravity CLI (Google, Google)

Antigravity SDK (Google)

Managed Agents in Gemini API: single API call gives an agent plus hosted Linux sandbox; supports Bash/Python/Node, files, browsing, custom markdown-defined skills, repo/GCS mounts (Google, GoogleAIStudio, _philschmid)

Integrations with AI Studio, Android, Firebase, Workspace, web (Google, Google)

One-click export from AI Studio to Antigravity (Google)

Native Android app generation in AI Studio / Android support in Antigravity (Google, AndroidDev)

Technical signaling

この記事をシェア

GitHub Changelog重要度42026年7月3日 08:07

GitHub Copilot における Gemini 2.5 Pro および Gemini 3 Flash の利用終了発表

Simon Willison Blog2026年7月5日 10:00

sqlite-utils 4.0rc2、主にClaude Fable（約149.25ドル分）が執筆

TechCrunch AI2026年7月5日 00:51

ミストラル AI とは？OpenAI の競合企業に関する全知識

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

キーポイント

影響分析

編集コメント

AI Twitter リキャップ

何が起きたか

事実と意見

公式または第三者のベンチマークソースによって直接主張されている事実

意見・解釈・懐疑論

Gemini 3.5 Flash: メインの技術的リリース

公式な位置づけ

技術詳細と数値指標

示唆される影響

Gemini Omni: 「あらゆる入力から何でも作成」を可能にする多モーダル生成・編集

Google が発表した内容

見解

コンテキスト

反重力：コーディングアシスタントを超えた Google のエージェント OS

発表・拡大された機能

技術的シグナル

AI Twitter Recap

What happened

Facts vs. opinions

Facts / directly claimed by official or third-party benchmark sources

Opinions / interpretations / skepticism

Gemini 3.5 Flash: the main technical release

Official positioning

Technical details and metrics

Implications

Gemini Omni: multimodal generation/editing as “create anything from any input”

What Google announced

Perspectives

Context

Antigravity: Google’s agent OS, not just a coding assistant

What launched / expanded

Technical signaling

関連記事

キーポイント

影響分析

編集コメント

AI Twitter リキャップ

何が起きたか

事実と意見

公式または第三者のベンチマークソースによって直接主張されている事実

意見・解釈・懐疑論

Gemini 3.5 Flash: メインの技術的リリース

公式な位置づけ

技術詳細と数値指標

示唆される影響

Gemini Omni: 「あらゆる入力から何でも作成」を可能にする多モーダル生成・編集

Google が発表した内容

見解

コンテキスト

反重力：コーディングアシスタントを超えた Google のエージェント OS

発表・拡大された機能

技術的シグナル

AI Twitter Recap

What happened

Facts vs. opinions

Facts / directly claimed by official or third-party benchmark sources

Opinions / interpretations / skepticism

Gemini 3.5 Flash: the main technical release

Official positioning

Technical details and metrics

Implications

Gemini Omni: multimodal generation/editing as “create anything from any input”

What Google announced

Perspectives

Context

Antigravity: Google’s agent OS, not just a coding assistant

What launched / expanded

Technical signaling

関連記事