Ai2's computer-use agent can now carry out actions online
The open source computer-use agent developed by Ai2 can perform online tasks on a user's behalf, but it has several limitations.
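Key points
Open source computer-use agent
MolmoWeb, the agent developed by Ai2, can carry out a range of online tasks on a user's behalf.
User assistance
The agent is a visual web agent that automates browser tasks, taking concrete actions for the user.
Known limitations
The article notes that the agent can be thrown off track by actions such as scrolling before a web page has finished loading, has not been trained on tasks that require financial login, and performs worse with ambiguous instructions.
Practicality and challenges
The agent has potential for real-world use, but its limitations mean fully autonomous operation remains a challenge.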
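Impact analysis
The article marks a step toward practical applications in which AI agents operate computers on behalf of people. Because the agent is released as open source, the research community and developers may be able to improve and extend it, but its limitations suggest that fully autonomous operation is not yet within reach.
Editorial comment
This news marks a step toward practical open source autonomous agents, though the limitations are described only in broad terms. Future development is worth watching.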
The open source agent can perform tasks for users, but it has limits.
With more enterprises interested in using AI agents that are local to their computers and on their devices, AI research lab Ai2 on Tuesday released its own open source web agent called MolmoWeb, a day after Anthropic introduced an update that gives Claude access to personal computers.
MolmoWeb is a visual web agent that automates browser tasks using multimodal AI. It is built on Ai2's model family, Molmo 2, and is available in two sizes, 4B and 8B parameters. Ai2 also released its training data set, MolmoWebMix, evaluation tools, and an inference library, so developers and researchers can self-host, fine-tune and improve the system.
MolmoWeb is similar to Anthropic's computer use capability, which the AI lab introduced in 2024, in that it allows the AI agent to act on behalf of users. Using computer vision, the AI agent can perceive what is happening on a user's computer and reason through a sequence of actions to achieve the user's goals.
Anthropic on March 24 revealed that Claude Cowork and Claude Code users can now allow Claude to complete tasks. Anthropic said that Claude can point, click and navigate what is on a user's screen to perform tasks. It can open files, use the browser, and automatically run dev tools. Users can also use Dispatch, a feature within Anthropic's Cowork, to assign Claude tasks from their phones. The feature is available in research preview to Claude Pro and Max subscribers.
MolmoWeb an Open Option
Both MolmoWeb and the Claude update highlight a trend in the AI market in which the AI agent is becoming more personal, and users are focusing on ways to put AI agents to use on their local computers.
The difference between what Anthropic does and what Ai2 has produced with MolmoWeb is that one is open to the community, and the other is not.
"MolmoWeb is an innovation paradigm of computer use agents similar to the proprietary frontier model providers, but with an open approach to data sets and agents," said Arun Chandrasekaran, an analyst at Gartner. "It lowers the barrier to entry for studying agentic behavior and understanding agent decision-making processes that are otherwise opaque."
"This is critical for building safe systems in the future," Chandrasekaran continued.
MolmoWeb is also an option for enterprises considering open source technology to explore AI agents, said Chris Callison-Burch, a professor of computer and information science at the University of Pennsylvania. Callison-Burch was a visiting research scientist at Ai2 from 2023 to 2024.
"The cost to develop models is quite high, but adopting open source models is potentially a buy-in strategy for a lot of businesses," Callison-Burch said. He added that Ai2 generated the synthetic data set MolmoWebMix, which enables the AI research lab and developers using MolmoWeb to train agents.
Challenges to Computer Use
While MolmoWeb seems like a good alternative to proprietary AI agents, it also poses challenges, especially because the agent's computer vision technology means it sees and perceives what the human does. The research lab acknowledged that the AI agent can be thrown off track by actions such as scrolling before a web page has finished loading. It has also not been trained in tasks that require financial login, and its performance degrades with ambiguous instructions.
However, Ai2's approach of making the data openly available will let researchers and enterprises work through the limitations and overcome them, Callison-Burch said.
MolmoWeb is available on Hugging Face and GitHub.
About the Author
News Writer, AI Business
Esther Shittu brings four years of expertise covering artificial intelligence technologies and industry trends. As co-host of the "Targeting AI" podcast, she talks to thought leaders and practitioners exploring critical AI developments. Previous to AI Business, she wrote for several publications including the New York Daily News, Bklyner and the Brooklyn Daily Eagle. When she's not diving deep into the world of AI, she spends her time on passion projects and raising her three daughters.