The Decoder·2026年3月28日 03:37·約1分

Cohere、音声認識ベンチマークでトップのオープンソースモデルをリリース

#音声認識 #オープンソース #ベンチマーク #Cohere #生成AI #マルチモーダル

TL;DR

Cohereが発表したオープンソースの音声認識モデルは、ベンチマークでOpenAIのWhisperを含む全ての競合モデルを上回る性能を示した。

AI深層分析2026年3月28日 04:40

重要/ 5段階

深度40%

キーポイント

オープンソースモデルのリリース

Cohereが音声認識モデルをオープンソースとして公開した。

ベンチマークでの優位性

公開されたモデルはベンチマークテストにおいて、既存の競合モデル全てを性能で上回った。

主要競合の明確化

記事では特にOpenAIのWhisperモデルを主要な競合として言及している。

業界への影響

オープンソースで高性能な音声認識モデルの登場は、AI業界の競争環境を変化させる可能性がある。

影響分析・編集コメントを表示

影響分析

この発表は、音声認識分野においてオープンソースモデルが商用モデルと同等以上の性能を達成できることを示しており、業界の競争構造に影響を与える可能性がある。特にOpenAIのWhisperに対する直接的な挑戦は、AI基盤モデル市場の多様化を促進する重要な出来事と言える。

編集コメント

オープンソースモデルが商用モデルをベンチマークで上回るという事実は、AI業界の民主化と競争激化の重要なマイルストーンとなる。今後の展開に注目したい。

image

Cohereは、ベンチマークテストにおいて、OpenAIのWhisperを含む全ての競合モデルを凌駕するオープンソースの音声認識モデルを発表しました。

この記事「Cohere releases open source model that tops speech recognition benchmarks」は、The Decoderで最初に公開されました。

原文を表示

Mar 27, 2026

Canadian AI company Cohere has released "Transcribe," a new open-source model for automatic speech recognition. The company says it claims the top spot on the Hugging Face Open ASR Leaderboard with an average word error rate of just 5.42 percent, beating out competitors like OpenAI's Whisper Large v3, ElevenLabs Scribe v2, and Qwen3-ASR-1.7B. Cohere says Transcribe also delivers the best throughput among similarly sized models.

Cohere Transcribe compared with seven other speech recognition models. Models closer to the upper left corner perform best, meaning faster throughput and lower word error rates. | Image: CohereThe 2 billion parameter model supports 14 languages, including English, German, French, and Japanese. It's available for download under the Apache 2.0 license on Hugging Face and can also be accessed through Cohere's API and the Model Vault platform. Cohere plans to integrate Transcribe into its AI agent platform North in the future.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

Subscribe now

この記事をシェア

TechCrunch AI2026年3月26日 22:30

Cohereが文字起こし専用のオープンソース音声モデルを発表

AI Business2026年3月27日 05:43

Cohereがエッジデバイス向けオープンソース音声モデルを発表

TLDR AI2026年7月3日 09:00

メタの「Watermelon」が GPT-5.5 ベンチマークに匹敵

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む