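Luma AI's Uni-1 could be the first real challenger to Google's Nano Banana image dominance
Luma AI has announced Uni-1, a model that unifies image understanding and generation in a single architecture and reasons through prompts as it generates, mounting the first serious challenge to the image dominance of Google's Nano Banana.
Key points
Uni-1's innovative architecture
Luma AI announced a new model that unifies image understanding and generation in a single architecture and generates images while reasoning through the prompt.
A challenge to Google's Nano Banana
Uni-1 is regarded as potentially the first serious competitor in the image generation field dominated by Google's Nano Banana.
Taking on OpenAI and Google
With Uni-1, Luma AI is positioning itself in direct competition with the AI giants OpenAI and Google.
Progress in technical integration
By unifying image understanding and generation, which have traditionally tended to be separate, the model achieves more advanced, context-aware image generation.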
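Impact analysis
This announcement brings new competition to the image generation market dominated by Google and could accelerate the trend toward integrated, multi-capability AI models. Unifying understanding and generation in a single architecture may set a new standard for more intuitive, context-aware AI image generation.
Editorial comment
Important news that signals intensifying competition in the image generation market. Unifying understanding and generation in a single architecture is technically interesting and is likely to influence future model development.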

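Luma AI is taking on OpenAI and Google with Uni-1, a model that unifies image understanding and image generation in a single architecture. The model reasons over the prompt as it generates.
This article, "Luma AI's Uni-1 could be the first real challenger to Google's Nano Banana image dominance," first appeared on The Decoder.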
Mar 23, 2026
Uni-1 prompted by THE DECODER
Luma AI's new Uni-1 image model tops Nano Banana 2 and GPT Image 1.5 on logic-based benchmarks
Update March 23, 2026:
Uni-1 (see below) is now available. In human preference tests (Elo rating), Uni-1 takes first place in the overall, style/editing, and reference-based generation categories, according to Luma Labs. For pure text-to-image generation, it ranks second behind Google's Nano Banana.
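For readers unfamiliar with Elo ratings, here is a minimal sketch of how pairwise human preference votes can be turned into a ranking. It uses the standard Elo formula; the starting rating of 1000, the K-factor of 32, and the model names are illustrative assumptions, and the article does not describe how Luma Labs actually computed its scores.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A is preferred over B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, a_wins: bool, k: float = 32.0):
    """Return the updated (rating_a, rating_b) after one pairwise vote.

    k is an assumed K-factor; larger values make ratings move faster.
    """
    exp_a = expected_score(rating_a, rating_b)
    score_a = 1.0 if a_wins else 0.0
    delta = k * (score_a - exp_a)
    # Elo is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # Hypothetical models and votes, for illustration only.
    ratings = {"model_a": 1000.0, "model_b": 1000.0}
    votes = [("model_a", "model_b", True), ("model_a", "model_b", False)]
    for left, right, left_wins in votes:
        ratings[left], ratings[right] = update_elo(
            ratings[left], ratings[right], left_wins
        )
    print(ratings)
```

A win against an equally rated opponent moves each rating by half the K-factor, while a win against a much weaker opponent moves it very little, which is why rankings built this way depend on who was compared with whom.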
The model nails my benchmark prompt, on par with Nano Banana Pro, possibly even better. That's a noticeable step up from the new Midjourney v8, which struggled with the same prompt. One caveat: the generated image went through a Luma image generation agent, so the results might differ slightly from the upcoming API (see below). You can try Uni-1 for free at Luma Labs.
Prompt: A hyper-realistic DSLR photo. A monkey holding a pink banana is sitting on a tiger in the foreground. In the background, a HORSE is RIDING AN ASTRONAUT. The astronaut is underneath like a living "spacesuit horse saddle," and the HORSE is clearly on top, in control, as the rider. Make it 100% unambiguous: the HORSE is the rider and the ASTRONAUT is being ridden, NOT the other way around. High-resolution, sharp focus, realistic lighting. (Best image out of three attempts, but all three were good…)
Overall, Uni-1 gets close to Google's flagship image model while coming in cheaper at comparable resolution: at 2K, the average cost through the upcoming API lands at about $0.09 per image, depending on how many reference images you feed it.
| Feature | Uni-1 | Nano Banana 2 | Nano Banana Pro |
| --- | --- | --- | --- |
| Text to Image (2048px) | $0.0909 | $0.101 | $0.134 |
| Image edit / i2i (2048px) | $0.0933 | $0.101 | $0.134 |
| Multi-ref, 1 img (2048px) | $0.0933 | $0.101 | $0.134 |
| Multi-ref, 2 imgs (2048px) | $0.0957 | $0.101 | $0.134 |
| Multi-ref, 8 imgs (2048px) | $0.1101 | $0.101 | $0.134 |
Nano Banana 2 does offer lower resolutions at cheaper prices, though: a 0.5K image costs about $0.045, and a 1K image runs about $0.067.
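The Uni-1 prices in the table above are consistent with a simple linear pattern: the $0.0909 text-to-image price plus $0.0024 for each reference image. The sketch below encodes that pattern and compares it with Nano Banana 2's flat 2K price. The linear formula is inferred from the listed data points, not an official Luma pricing rule, and values between two and eight reference images are interpolated.

```python
UNI1_BASE_USD = 0.0909     # text-to-image at 2048px, from the table
UNI1_PER_REF_USD = 0.0024  # inferred: 0.0933 - 0.0909 (assumption, not official)
NANO_BANANA_2_USD = 0.101  # flat 2048px price, from the table


def uni1_cost(num_refs: int) -> float:
    """Estimated Uni-1 price per 2048px image with num_refs reference images."""
    if num_refs < 0:
        raise ValueError("num_refs must be non-negative")
    return round(UNI1_BASE_USD + UNI1_PER_REF_USD * num_refs, 4)


def first_refs_costlier_than(flat_price: float, max_refs: int = 8):
    """Smallest reference-image count at which Uni-1 exceeds flat_price.

    Returns None if that never happens within max_refs (the table's range).
    """
    for n in range(max_refs + 1):
        if uni1_cost(n) > flat_price:
            return n
    return None


if __name__ == "__main__":
    for n in (0, 1, 2, 8):
        print(n, uni1_cost(n))
    print(first_refs_costlier_than(NANO_BANANA_2_USD))
```

Under this fit, Uni-1 stays below Nano Banana 2's $0.101 with up to four reference images (about $0.1005) and becomes the more expensive option from five onward (about $0.1029), which matches the eight-image row where Uni-1 is listed at $0.1101.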
Original article from March 8, 2026:
Luma AI introduces Uni-1, its first model to combine image understanding and image generation in a single architecture.
Like Google's Nano Banana Pro and GPT Image 1.5, Uni-1 is built on an autoregressive transformer, an AI model that generates content token by token in sequence, instead of pulling images out of noise the way traditional diffusion models do. Text and images share the same processing pipeline.
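To make "token by token" concrete, the toy sketch below shows the shape of an autoregressive sampling loop: each new token is drawn from a distribution that depends on everything generated so far, and prompt tokens and image tokens live in the same sequence. The vocabulary size, sequence length, and the `toy_next_token_weights` stand-in for the transformer are all hypothetical; nothing here reflects Uni-1's actual tokenizer or architecture.

```python
import random

VOCAB_SIZE = 16  # hypothetical codebook of discrete image tokens
SEQ_LEN = 8      # hypothetical number of tokens per generated image


def toy_next_token_weights(prefix):
    """Stand-in for a transformer: sampling weights for the next token.

    A real model would attend over the whole prefix; this toy version only
    favors tokens close to the most recent one to mimic context dependence.
    """
    last = prefix[-1] if prefix else 0
    return [1.0 / (1 + abs(tok - last)) for tok in range(VOCAB_SIZE)]


def generate(prompt_tokens, seq_len=SEQ_LEN, seed=0):
    """Sample seq_len new tokens one at a time, conditioned on the prompt."""
    rng = random.Random(seed)
    sequence = list(prompt_tokens)  # prompt and output share one sequence
    for _ in range(seq_len):
        weights = toy_next_token_weights(sequence)
        next_token = rng.choices(range(VOCAB_SIZE), weights=weights, k=1)[0]
        sequence.append(next_token)  # the new token becomes context
    return sequence[len(prompt_tokens):]


if __name__ == "__main__":
    print(generate([3, 5]))
```

A diffusion model, by contrast, starts from a full image of noise and refines all positions together over several denoising steps, so there is no left-to-right sequence for intermediate reasoning to be interleaved with.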
Luma says the model can reason through prompts before and during generation, breaking down complex instructions and planning out scenes. This approach typically leads to much more accurate prompt following, and Uni-1 is no exception. It can, for example, take several photos and merge them into an entirely new composition.
Multiple ordinary pet photos were combined into the scene above. Prompt: "Combine the black and white curly-haired dog with pink bandana, the Boston Terrier in plaid harness, and the black-and-white cat into a single scene where they are dressed in academic regalia, standing before a whiteboard filled with scientific diagrams and text, with the Luma AI logo placed in the top-left corner." | Image: Luma
Beyond basic generation, Luma says Uni-1 can refine subjects across multiple conversation turns while keeping context intact, convert images into over 76 art styles, accept sketches and visual instructions as input, and transfer identities, poses, and compositions into new images from reference photos. In one demo, the model generated an entire sequence from a single reference image, gradually aging a pianist from childhood to old age.
From a single reference image, Uni-1 generates a sequence showing a pianist aging from childhood to old age - keeping the same camera angle and consistent scene throughout. | Image: Luma AI
According to Luma, Uni-1 scores highest on the RISEBench test for logic-based image processing, narrowly beating both Nano Banana 2 and GPT Image 1.5. The image generation capability also boosts the model's visual understanding. In object recognition, for instance, it nearly matches Google's Gemini 3 Pro. The model supports multiple languages.
Uni-1 tops the overall RISEBench ranking, just ahead of Nano Banana 2 and GPT Image 1.5, the current image model powering ChatGPT. | Image: Luma AI
Uni-1 will soon be available through Luma Agents, a newly launched creative assistant, and the Luma API. No pricing has been announced yet.