Cursorが中国のオープンソースKimi K2.5を基盤に新コーディングモデルを静かに構築
Cursor社は、中国のオープンソースモデルKimi K2.5を基盤として開発した次世代AIコーディングモデル「Composer 2」を発表し、AnthropicやOpenAIの主要モデルと同等の性能を低コストで実現することを目指している。
キーポイント
新モデル「Composer 2」の発表
Cursor社はソフトウェア開発向けAIモデルの第2世代「Composer 2」をリリースした。
中国オープンソースモデルを基盤
新モデルは中国のオープンソースAIモデル「Kimi K2.5」を基盤として構築されている。
コスト効率を重視した設計
AnthropicやOpenAIの主要コーディングモデルと同等の性能を、大幅に低いコストで実現することを目指している。
開発手法の特徴
記事はこの開発が「静かに(quietly)」行われたと表現しており、注目を集めずに進められたことを示唆している。
影響分析・編集コメントを表示
影響分析
この記事は、AIコーディングツール市場における競争激化と、中国発のオープンソース技術がグローバルなAI開発基盤として活用され始めていることを示している。特に、高価な専有モデルに依存しないコスト効率の良い開発アプローチが注目される。
編集コメント
中国のオープンソース技術を基盤としたグローバルAI製品の開発事例として注目されるが、記事の情報量が限定的で技術的詳細に乏しいため、より深い分析には追加情報が必要。

Cursorは、ソフトウェア開発向け自社AIモデルの第2世代となる「Composer 2」をリリースしました。このモデルは、AnthropicとOpenAIの主要なコーディングモデルに匹敵する性能を、大幅に低いコストで実現するように設計されています。
本記事「Cursor、新コーディングモデルを中国オープンソースKimi K2.5上に構築」は、The Decoderで最初に公開されました。
原文を表示
Update from March 21, 2026:
Cursor's new AI model is built on top of the Chinese open-source model Kimi K2.5. Cursor employee Lee Robinson says roughly a quarter of the pretraining comes from the base model, with Cursor doing the rest through fine-tuning and continued training. Because of that additional training, the model's benchmark results differ from the original Kimi K2.5. The commercial license runs through inference partner Fireworks.
Cursor never disclosed any of this, drawing criticism. It only came out after Kimi employees dug into the model themselves. Cursor co-founder Aman Sanger owned up to the mistake: "It was a miss to not mention the Kimi base in our blog from the start. We'll fix that for the next model."
The bigger question is why Cursor kept quiet in the first place. The most likely answer: admitting it would mean conceding that, unlike Anthropic and OpenAI, Cursor can't build its own frontier model. Both competitors pour billions into proprietary base models; Cursor simply can't play at that level.
Cursor could have pitched open-source fine-tuning as a billion-dollar shortcut
There's nothing wrong with taking a strong open-source model and fine-tuning it for a specific use case, though. It's common practice and often the smarter path, especially for a company whose real strength isn't pretraining massive language models but building a coding editor. The problem is shipping someone else's base model under your own brand without saying so.
There's another way to look at this, though. If Cursor's fine-tuned model can genuinely compete with billion-dollar proprietary efforts, that's an uncomfortable question for the frontier labs: how much is a proprietary base model actually worth when a small team with smart fine-tuning can get similar results?
Cursor would have been much better off owning the open-source angle from the start, positioning comparable results from a fine-tuned open-source model as proof that billion-dollar proprietary development isn't the only path forward. That framing would have put the pressure on OpenAI and Anthropic instead of leaving Cursor on the defensive.
Original article from March 19, 2026:
Cursor takes on OpenAI and Anthropic with Composer 2, a code-only model built to match rivals at a fraction of the cost
Cursor releases Composer 2, the second generation of its own AI model for software development. The model aims to match the leading coding models from Anthropic and OpenAI at a fraction of the cost.
The model is now available in Cursor and in the early alpha of the new "Glass" interface. Pricing starts at 0.50 dollars per million input tokens and 2.50 dollars per million output tokens. A faster variant that Cursor says delivers identical intelligence costs 1.50 and 7.50 dollars per million tokens respectively, and ships as the default.
Model
Price per 1 million tokens, input / output
Hint
Composer 2
0.50 / 2.50 dollars
Standard version
Composer 2 Fast
1.50 / 7.50 dollars
Faster version with the same intelligence according to the cursor
Claude Opus 4.6
5.00 / 25.00 dollars
API price according to Anthropic, valid for any context length
GPT-5.4
2.50 / 15.00 dollars, short context; 5.00 / 22.50 dollars, long context
OpenAI price depending on context length
On pure API pricing, Cursor Composer 2 comes in well below both Claude Opus 4.6 and GPT-5.4. Even the faster variant still undercuts both competitors by a wide margin on token costs.
Co-founder Aman Sanger told Bloomberg the model was trained exclusively on code data. That narrow focus made it possible to build a smaller, more cost-effective model. "It won't help you do your taxes. It won't be able to write poems," Sanger said.
Reinforcement learning on long coding tasks drives quality gains
According to Cursor, the quality improvements over its predecessor come down to a stronger first pass of continued pretraining, which provides a better foundation for the reinforcement learning that follows. Training runs on so-called long-horizon coding tasks, programming challenges that require hundreds of individual actions to complete.
The numbers Cursor published show a major jump, especially compared to earlier Composer versions. On CursorBench, the company's internal benchmark for coding tasks, Composer 2 climbs from 44.2 (Composer 1.5) to 61.3. The model also posts gains on Terminal Bench 2.0, a benchmark for agent-based tasks in the terminal, and on SWE-bench Multilingual, which tests software engineering tasks across multiple programming languages.
Model
CursorBench
Terminal Bench 2.0
Terminal Bench 2.0 optimized
SWE-bench Multilingual
Composer 2
61.3
61.7
73.7
Composer 1.5
44.2
47.9
65.9
Composer 1
38.0
40.0
56.9
Claude Opus 4.6
58.2
58.0
65.4
77.8
GPT 5.4 Thinking
63.9
75.1
N/A
Terminal Bench 2.0 scores aren't directly comparable across the board, since results also depend on the agent, harness, and settings. For Claude Opus 4.6, 58.0 is the Claude Code value; 65.4 is an additional optimized value published by Anthropic. For GPT-5.4 Thinking, only a single published Terminal Bench value is available.
Building its own model is about survival, not just performance
Cursor competes directly with Anthropic and OpenAI, both of which are shipping increasingly powerful AI models for software development. According to Bloomberg, Cursor now has more than one million daily users and around 50,000 enterprise customers. The company is also in talks about a new funding round at a valuation of roughly 50 billion dollars.
At the same time, Cursor faces a structural dilemma. The platform still supports models from OpenAI and Anthropic, which means it's competing with the very providers whose technology it has relied on. As long as Cursor buys third-party models, its pricing, performance, and margins all depend on companies that sell directly to the same customers.
Anthropic, in particular, is making aggressive moves in the coding market with Claude Code. Cursor reportedly estimates that a single Claude Code subscription at 200 dollars a month can rack up around 5,000 dollars in actual compute costs. That highlights the structural problem: when you build on someone else's model, you're paying full price for compute that the model provider can heavily subsidize in its own product.
That doesn't leave Cursor much room. According to the report, consumer subscriptions are already running at negative margins, with enterprise contracts carrying the business. And the longer-term risk may be even bigger—as AI coding agents get more capable, users might skip the IDE entirely and work with these systems straight from the model provider.
AI News Without the Hype – Curated by Humans
Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.
Subscribe now
関連記事
今日のまとめ
AI日報で今日の重要ニュースをまとめ読み