Microsoft Research·2025年12月11日 02:00·約11分

プロンプションズ、動的UIコントロールでAIプロンプトの精度を向上させる

#プロンプトエンジニアリング #UI/UX #オープンソース #Microsoft Research #知識労働 #AIインタラクション

TL;DR

Microsoft Researchが発表したPromptionsは、動的UIコントロールを通じてAIプロンプティングの精度を向上させるオープンソースフレームワークであり、知識労働者の理解タスクにおけるAIインタラクションの課題を解決することを目指している。

AI深層分析2026年2月27日 23:44

重要/ 5段階

深度40%

キーポイント

プロンプト精度向上のUIフレームワーク

Promptionsはプロンプトとオプションを組み合わせたUIフレームワークで、開発者がより精密なユーザー制御を持つAIインターフェースを構築できるようにする。

理解タスクに特化した研究基盤

「Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks」研究に基づき、生成ではなく理解を目的としたAI利用における課題に焦点を当てている。

動的リファインメントの重要性

静的オプションでは不十分で、ユーザーが扱う具体的なコンテンツ（数式、コード等）に適応する動的制御が必要であることが研究で明らかになった。

オープンソースでの公開

MITライセンスでMicrosoft Foundry LabsとGitHubで公開されており、カスタマーサポート、教育、医療など様々な分野への統合が容易とされている。

動的UIコントロールの利点

動的プロンプト改良コントロール（Dynamic PRC）は、ユーザーの入力に基づいてリアルタイムでオプションを生成し、プロンプトエンジニアリングの労力を減らし、コンテンツ理解に集中できるようにする。

ユーザーエクスペリエンスの向上

動的コントロールにより、ユーザーはタスクのニュアンスをより簡単に表現でき、自分では考えなかった改良を試すきっかけとなり、目標をより明確に考えるよう促す。

技術的アーキテクチャ

Promptionsはユーザーと言語モデルの間に位置する軽量なミドルウェア層として機能し、動的プロンプト改良を実現する。

影響分析・編集コメントを表示

影響分析

この研究は、生成AIの実用化において重要な課題である「プロンプトエンジニアリングの民主化」に貢献する可能性がある。特に企業内でのAI活用において、非技術系ユーザーでも効果的にAIを活用できる環境整備に寄与し、AI導入の障壁を下げる効果が期待される。

編集コメント

AI利用の実務課題を研究ベースで解決する実用的なフレームワークとして、企業内AI導入の現場で注目される内容。オープンソース公開により普及可能性が高い。

AI システムを利用する誰もが経験している frustration：プロンプトを入力しても、回答が的を外し、そのサイクルが繰り返される。この試行錯誤のループは予測不能で、やる気を削ぐように感じられることがある。これを解決するため、私たちは「Promptions（prompt + options）」を発表します。これは開発者が AI インターフェースを構築する際に、より精密なユーザー制御を実現するための UI フレームワークです。

そのシンプルなデザインにより、追加の文脈を必要とするあらゆる環境への統合が容易です。顧客サポート、教育、医療などがその例です。Promptions は MIT ライセンスの下で提供されており、Microsoft Foundry Labs（新しいタブで開く）および GitHub で入手可能です。

Promptions は、「Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks」という私たちの研究に基づいています。このプロジェクトは、知識労働者が「作成」ではなく「理解」を目的として生成 AI をどのように利用するかを検証したものです。公的な議論の多くがテキストや画像の生成に焦点を当てている一方で、「理解」には、AI に説明させたり、明確化したり、教えるよう依頼するタスクが含まれます。これはすぐに複雑になる可能性があります。スプレッドシートの数式を考えてみましょう。あるユーザーは単純な構文の分解を望むかもしれませんし、別のユーザーはデバッグガイドを必要とするかもしれません。さらに別の人には同僚に教えるのに適した説明が求められるかもしれません。同じ数式でも、ユーザーの役割、専門知識、目的によって、全く異なる説明が必要になることがあります。

これらの一見単純なリクエストの背後には、非常に多くの複雑さが潜んでいます。ユーザーはしばしば、質問の表現方法が AI が必要とする詳細レベルと一致していないことに気づきます。本当に何を求めているのかを明確にするためには、長く注意深く言葉を選んだプロンプトが必要となり、作成する際に疲れてしまいます。また、自然言語とシステム動作との間のつながりが常に明確ではないため、AI が特定の要求をどのように解釈するかを予測するのは困難です。結局のところ、ユーザーは自分が学びたいと思っていた内容を理解することに費やす時間よりも、相互作用そのものを管理することに多くの時間を費やしてしまいます。

ユーザーが AI の出力をどのようにガイドしたいかを特定する

これらの課題がなぜ持続しているのか、そして人々がどのようにして AI をカスタマイズされた結果へと導くことができるかを探るために、技術職および非技術職の知識労働者に対して 2 つの研究を実施しました。その経験は、Promptions の設計を導いた重要なギャップを浮き彫りにしました。

最初の研究には、エンジニアリング、リサーチ、マーケティング、プログラム管理にわたる 38 名の専門家が参加しました。参加者は、AI の応答を形成するための静的なプロンプト改善オプション（長さ、トーン、または「～から始める」など）を提供するデザインのモックアップを検討しました。

これらの静的なオプションは有用でしたが、参加者が理解しようとしていた特定の式、コードスニペット、またはテキストには適応できませんでした。また、参加者は指示を入力することなく、応答のトーン、詳細さ、またはフォーマットを直接カスタマイズする方法も望んでいました。

動的な改善が重要な理由

2 つ目の研究では、制御実験においてプロトタイプを検証しました。第 1 の研究から得られた静的なデザインである「Static Prompt Refinement Control」（Static PRC）と、参加者のフィードバックに応じて機能する特徴を備えた「Dynamic Prompt Refinement Control」（Dynamic PRC）を比較しました。生成 AI に精通した技術専門家 16 名が、コードの説明、複雑なトピックの理解、新スキルの習得にわたる 6 つのタスクを完了しました。各参加者は両方のシステムをテストし、公平な比較のためにタスク割り当てはバランスよく行われました。

Dynamic PRC と Static PRC を比較した結果、動的なプロンプト改善オプションがユーザーのコントロール感や探索行動にどのように影響を与えるか、またそれらが理解の振り返りをどう支援するかについて重要な知見が得られました。

静的なプロンプト改善

Static PRC は、初期研究で特定された一連の事前選択済みコントロール（図 1）を提供しました。これらのオプションは、説明を求めるさまざまな種類のプロンプトに対して有用であると考えられています。

動的なプロンプト改善

Dynamic PRC システムは、ユーザーの入力に基づいて自動的にプロンプトオプションと改善案を生成し、リアルタイムで提示するように構築されました。これにより、ユーザーはこれらのコントロールを調整して AI の応答をより精密に誘導できるようになります（図 2）。

医療における AI 革命の再考

Microsoft のピーター・リーと共に、AI がヘルスケアにどのような影響を与え、医療の未来にとって何を意味するのかを探る旅に参加してください。

参加者たちは一貫して、動的コントロールにより、プロンプスを繰り返し言い換えることなくタスクのニュアンスを表現しやすくなったと報告しました。これによりプロンプトエンジニアリングにかかる負担が軽減され、ユーザーはフレーミングの仕組みを管理するよりもコンテンツを理解することに注力できるようになりました。

文脈に応じたオプションは、ユーザー自身が独自に考えなかったような改良を試すよう促しました。この行動様式は、動的 PRC（Prompt Refinement Controls）がユーザーの AI 解説への関与の幅を広げ、初期の意図を超えたタスクへの新たなアプローチ方法を発見する手助けとなる可能性を示唆しています。探索だけでなく、動的コントロールは参加者に対して目標についてより慎重に考えるよう促しました。「学習目標」や「回答形式」といったオプションは、概念の適用に関するガイダンスが必要なのか、それとも段階的なトラブルシューティング支援が必要なのかといった、必要なものを明確化するのに役立ちました。

参加者たちは動的 PRC の適応性を高く評価する一方で、その解釈がより困難であると指摘しました。一部の参加者は、選択したオプションが回答にどのような影響を与えるかを予測することに苦戦し、コントロールの効果が出力が表示されて初めて明らかになるため、制御手段が不透明だと感じました。

しかし、動的 PRC に対する全体的な肯定的な反応は、Promptions が広く有用であることを示しており、これにより開発者コミュニティへの共有へとつながりました。

技術設計

Promptions は、ユーザーと基盤となる言語モデルの間に位置する軽量なミドルウェアレイヤーとして機能します（図 5）。主なコンポーネントは以下の 2 つです。

Option Module. このモジュールはユーザーのプロンプトと会話履歴をレビューし、一連の改善オプションを生成します。これらは、AI がプロンプトをどのように解釈するかを直接形作るインタラクティブな UI 要素（ラジオボタン、チェックボックス、テキストフィールド）として提示されます。

Chat Module. このモジュールは、洗練されたプロンプトに基づいて AI の応答を生成します。ユーザーがオプションを変更すると、応答が即座に更新されるため、反復的なプロンプトのサイクルというよりも、進化していく会話のようなインタラクションになります。

アプリケーションへの Promptions の追加

Promptions は、あらゆる対話型チャットインターフェースに容易に統合できます。開発者は、オプションを表示するコンポーネントを追加し、AI システムに接続するだけで済みます。セッション間でデータを保存する必要はないため、実装はシンプルに保たれます。Microsoft Foundry Labs (新しいタブで開く) リポジトリには、このデザインを実践で示す 2 つのサンプルアプリケーション、つまり汎用チャットボットと画像生成器が含まれています。

Promptions は、ユーザーが文脈を提供する必要があるが、すべてを記述したくないインターフェースに特に適しています。長い説明を入力する代わりに、AI の応答を自分の好みに合わせるよう導くコントロールを調整できます。

さらなる探求のための質問

Promptions は将来の研究に向けて重要な問いを提起しています。主要なユーザビリティの課題としては、動的オプションが AI の出力にどのように影響するかを明確化することや、複数のコントロールによる複雑性を管理することが挙げられます。その他の問いには、即座に行える調整と恒久的な設定のバランスを取る方法や、ユーザーがオプションを共同で共有できるようにする仕組みが含まれます。

技術的な側面では、より効果的なオプションの生成、動的インターフェースの有効性とカスタマイズの検証、関連文脈の自動収集、そしてセッションを超えてオプションセットを保存・共有できる機能のサポートに焦点が当てられています。

これらの問いに加え、コラボレーション、倫理、セキュリティ、スケーラビリティといったより広範な考慮事項が、Promptions および関連システムに関する当社の継続的な取り組みを導いています。

Promptions をオープンソース化することで、開発者がより賢く、反応性の高い AI 体験を生み出す手助けができることを願っています。

Microsoft Foundry Labs で Promptions を探索する（新しいタブで開きます）

図 1: 静的な PRC インターフェース

図 2. ダイナミック PRC システムにおけるインタラクションフロー。(1) ユーザーはシステムに長い Excel 数式の解説を依頼します。(2) ダイナミック PRC が詳細レベル、焦点領域、学習目標といった精緻化オプションを生成します。(3) ユーザーがこれらのオプションを変更します。(4) AI は選択されたオプションに基づいて解説を返します。(5) セッションチャットパネルで、ユーザーは応答の構造や形式を制御するよう追加リクエストを入力します。(6) ダイナミック PRC はこの入力に基づいて新しいオプションセットを生成します。(7) AI は新たに適用されたオプションを反映した更新版の解説を生成します。

図 3. 主要な評価基準における、静的 PRC と動的 PRC のユーザー選好の比較。

図 4. 静的 PRC と動的 PRC の有効性を比較した参加者による評価

図 5. Promptions ミドルウェアのワークフロー。(1) オプションモジュールはユーザーのプロンプトと会話履歴を読み込み、(2) プロンプトオプションを生成します。(3) これらのオプションは専用コンポーネントによってインラインでレンダリングされます。(4) チャットモジュールはこれらの洗練されたオプションを元のプロンプトおよび履歴と共に組み合わせて応答を生成します。(5) ユーザーがコントロールを調整すると、その refinements が更新され、チャットモジュールはそれに応じて応答を再生成します。

原文を表示

Anyone who uses AI systems knows the frustration: a prompt is given, the response misses the mark, and the cycle repeats. This trial-and-error loop can feel unpredictable and discouraging. To address this, we are excited to introduce Promptions (prompt + options), a UI framework that helps developers build AI interfaces with more precise user control.

Its simple design makes it easy to integrate into any setting that relies on added context, including customer support, education, and medicine. Promptions is available under the MIT license on Microsoft Foundry Labs (opens in new tab) and GitHub.

Promptions builds on our research, “Dynamic Prompt Middleware: Contextual Prompt Refinement Controls for Comprehension Tasks.” This project examined how knowledge workers use generative AI when their goal is to understand rather than create. While much public discussion centers on AI producing text or images, understanding involves asking AI to explain, clarify, or teach—a task that can quickly become complex. Consider a spreadsheet formula: one user may want a simple syntax breakdown, another a debugging guide, and another an explanation suitable for teaching colleagues. The same formula can require entirely different explanations depending on the user’s role, expertise, and goals.

A great deal of complexity sits beneath these seemingly simple requests. Users often find that the way they phrase a question doesn’t match the level of detail the AI needs. Clarifying what they really want can require long, carefully worded prompts that are tiring to produce. And because the connection between natural language and system behavior isn’t always transparent, it can be difficult to predict how the AI will interpret a given request. In the end, users spend more time managing the interaction itself than understanding the material they hoped to learn.

Identifying how users want to guide AI outputs

To explore why these challenges persist and how people can better steer AI toward customized results, we conducted two studies with knowledge workers across technical and nontechnical roles. Their experiences highlighted important gaps that guided Promptions’ design.

Our first study involved 38 professionals across engineering, research, marketing, and program management. Participants reviewed design mock-ups that provided static prompt-refinement options—such as length, tone, or start with—for shaping AI responses.

Although these static options were helpful, they couldn’t adapt to the specific formula, code snippets, or text the participant was trying to understand. Participants also wanted direct ways to customize the tone, detail, or format of the response without having to type instructions.

Why dynamic refinement matters

The second study tested prototypes in a controlled experiment. We compared the static design from the first study, called the “Static Prompt Refinement Control” (Static PRC), against a “Dynamic Prompt Refinement Control” (Dynamic PRC) with features that responded to participants’ feedback. Sixteen technical professionals familiar with generative AI completed six tasks, spanning code explanation, understanding a complex topic, and learning a new skill. Each participant tested both systems, with task assignments balanced to ensure fair comparison.

Comparing Dynamic PRC to Static PRC revealed key insights into how dynamic prompt-refinement options change users’ sense of control and exploration and how those options help them reflect on their understanding.

Static prompt refinement

Static PRC offered a set of pre‑selected controls (Figure 1) identified in the initial study. We expected these options to be useful across many types of explanation-seeking prompts.

Dynamic prompt refinement

We built the Dynamic PRC system to automatically produce prompt options and refinements based on the user’s input, presenting them in real time so that users could adjust these controls and guide the AI’s responses more precisely (Figure 2).

The AI Revolution in Medicine, Revisited

Join Microsoft’s Peter Lee on a journey to discover how AI is impacting healthcare and what it means for the future of medicine.

Participants consistently reported that dynamic controls made it easier to express the nuances of their tasks without repeatedly rephrasing their prompts. This reduced the effort of prompt engineering and allowed users to focus more on understanding content than managing the mechanics of phrasing.

Contextual options prompted users to try refinements they might not have considered on their own. This behavior suggests that Dynamic PRC can broaden how users engage with AI explanations, helping them uncover new ways to approach tasks beyond their initial intent. Beyond exploration, the dynamic controls prompted participants to think more deliberately about their goals. Options like “Learning Objective” and “Response Format” helped them clarify what they needed, whether guidance on applying a concept or step-by-step troubleshooting help.

While participants valued Dynamic PRC’s adaptability, they also found it more difficult to interpret. Some struggled to anticipate how a selected option would influence the response, noting that the controls seemed opaque because the effect became clear only after the output appeared.

However, the overall positive response to Dynamic PRC showed us that Promptions could be broadly useful, leading us to share it with the developer community.

Technical design

Promptions works as a lightweight middleware layer that sits between the user and the underlying language model (Figure 5). It has two main components:

Option Module. This module reviews the user’s prompt and conversation history, then generates a set of refinement options. These are presented as interactive UI elements (radio buttons, checkboxes, text fields) that directly shape how the AI interprets the prompt.

Chat Module. This module produces the AI’s response based on the refined prompt. When a user changes an option, the response immediately updates, making the interaction feel more like an evolving conversation than a cycle of repeated prompts.

Adding Promptions to an application

Promptions easily integrates into any conversational chat interface. Developers only need to add a component to display the options and connect it to the AI system. There’s no need to store date between sessions, which keeps implementation simple. The Microsoft Foundry Labs (opens in new tab) repository includes two sample applications, a generic chatbot and an image generator, that demonstrate this design in practice.

Promptions is well-suited for interfaces where users need to provide context but don’t want to write it all out. Instead of typing lengthy explanations, they can adjust the controls that guide the AI’s response to match their preferences.

Questions for further exploration

Promptions raises important questions for future research. Key usability challenges include clarifying how dynamic options affect AI output and managing the complexity of multiple controls. Other questions involve balancing immediate adjustments with persistent settings and enabling users to share options collaboratively.

On the technical side, questions focus on generating more effective options, validating and customizing dynamic interfaces, gathering relevant context automatically, and supporting the ability to save and share option sets across sessions.

These questions, along with broader considerations of collaboration, ethics, security, and scalability, are guiding our ongoing work on Promptions and related systems.

By making Promptions open source, we hope to help developers create smarter, more responsive AI experiences.

Explore Promptions on Microsoft Foundry Labs (opens in new tab)

Figure 2. Interaction flow in the Dynamic PRC system. (1) The user asks the system to explain a long Excel formula. (2) Dynamic PRC generates refinement options: Explanation Detail Level, Focus Areas, and Learning Objectives. (3) The user modifies these options. (4) The AI returns an explanation based on the selected options. (5) In the session chat panel, the user adds a request to control the structure or format of the response. (6) Dynamic PRC generates new option sets based on this input. (7) The AI produces an updated explanation reflecting the newly applied options.

Figure 3. Comparison of user preferences for Static PRC versus Dynamic PRC across key evaluation criteria.

Figure 4. Participant ratings comparing the effectiveness of Static PRC and Dynamic PRC

Figure 5. Promptions middleware workflow. (1) The Option Module reads the user’s prompt and conversation history and (2) generates prompt options. (3) These options are rendered inline by a dedicated component. (4) The Chat Module incorporates these refined options alongside the original prompt and history to produce a response. (5) When the user adjusts the controls, the refinements update and the Chat Module regenerates the response accordingly.

この記事をシェア

MarkTechPost重要度42026年7月2日 17:46

Google Health API に CLI ツール「ghealth」登場：Fitbit データを AI エージェントへ

Microsoft Research重要度42026年7月1日 01:50

SkillOpt：エージェントのスキルを学習可能なパラメータとして扱う手法

Microsoft Research重要度42026年6月30日 06:14

抽象化と具体性のバランスを取る調和型メモリ表現「Memora」の発表

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む