Import AI·2026年6月22日 21:31·約23分で読める

Import AI 462：超説得、自己維持型AI、ASIへの道筋

#LLM #説得技術 #Opus #GPT-4o #Gemini

TL;DR

オックスフォード大学などの研究チームが実施した大規模実験により、AI システムが専門家の人間を上回る説得力を持ち、実社会での寄付募集でも約3倍の効果を示すことが証明された。

AI深層分析2026年6月22日 22:06

最重要/ 5段階

深度40%

キーポイント

AI の圧倒的な説得能力の実証

18,978 件の会話を含む4つの実験で、AI は専門家やエリート討論者よりも確実に人間を説得できることが示され、特に情報量の迅速な展開が優位性の要因となった。

実社会における行動変容への効果

慈善団体への寄付募集において、AI はプロの訪問員（canvassers）よりも約3倍の効果を示し、テキストベースでの説得が実際の金銭的貢献に直結することを裏付けた。

人間による対抗策の可能性

AI に応答速度やメッセージ長さを人間のレベルに制限する制約を課すことで、コーチングを受けたエリート討論者が AI と同等の成果を出すことが可能であることが判明した。

主要モデルのパフォーマンス比較

Opus 4.1 および Opus 4.6 が最強の説得者として浮上し、OpenAI（GPT-4o, GPT-5.4）、Google（Gemini 2.5 Pro）、xAI（Grok 4.20）などの主要モデルも高い性能を示した。

人間コーチングの限界

AI に基づくコーチングツールを使用しても人間の説得能力は向上したが、AI との差を完全に埋めることはできず、依然として AI の方が優れていた。

速度と長さが AI の優位性の源泉

AI に人間の書き込み速度やメッセージ長さを強制するとその説得上のアドバンテージは消失し、特に論点の強さや学習効果の評価で顕著な低下が見られた。

実社会での資金調達における AI の圧倒的勝利

経験豊富なプロの訪問員と比較しても AI は説得力において優れており、実際の寄付金募集では参加者の寄付率と平均寄付額ともに顕著に上回った。

影響分析・編集コメントを表示

影響分析

本研究は、AI が単なる情報提供ツールを超え、人間の態度変容や行動を誘導する強力な「説得エンジン」として機能しうることを示唆しており、政治キャンペーン、マーケティング、募金活動などの分野で即座に実装される可能性が高い。一方で、AI による洗脳や操作のリスクが現実的な脅威となるため、規制や倫理的ガイドラインの整備が急務である。

編集コメント

AI の説得力が人間を凌駕するレベルに達したことは、今後の社会実装において「誰が正しい主張をしているか」ではなく「どのモデルがより効果的に伝えるか」という議論が重要になることを意味します。

imageAI 研究に関するニュースレター「Import AI」へようこそ。このニュースレターは、arXiv（アーカイブ）とカプチーノ、そして読者からのフィードバックによって支えられています。ご支援いただける場合は、ぜひ購読をご検討ください。

購読する

AI は人間を明確に上回る説得力を持つことができる:

…「AI システムは、専門的な人間よりも確実に説得力があった」…

イギリスのオックスフォード大学、UK AI セキュリティ研究所、スタンフォード大学、ロンドン・スクール・オブ・エコノミクス・アンド・ポリティカル・サイエンス（LSE）の研究チームが、AI システムがいかにして人間に対して政策に関する考えを変えさせたり、慈善団体への寄付金額を増やさせたりできるかを研究しました。その結果は決定的です：18,978 件の会話と 6,923 人の参加者を含む 4 つの実験全体を通じて、AI システムは今日、現実世界の影響を伴うテキストベースの説得において人間よりも優れています——ただし、AI システムに対して何らか的人工的な制約を課せば、人間もそれらに匹敵する可能性があります。

「専門家が自ら課題を選び、事前に調査を行い、数時間にわたるライブでの構造化された練習を受け、1,000 ポンドの現金ボーナスでインセンティブを与えられていたとしても、AI システムは専門的な人間よりも確実に説得力がありました」と彼らは記述しています。「AI の優位性は、大量の情報を迅速に展開することから生じています：コーチングを受けた後、人間の速度で応答し、人間の長さのメッセージしか送信できないように制約された AI に対して、専門家は互角に渡り合えるようになりました」。

「AI の優位性は、現実世界における重要な行動にも及びます：Save the Children（セーブ・ザ・チルドレン）への実際の寄付金を集める際、AI はイギリスの募金会社から派遣されたプロの訪問者よりも約 3 倍効果的でした」。

最も説得力があったのは Opus 4.1 と Opus 4.6 で、その後に OpenAI（GPT-4o および GPT-5.4）、Google（Gemini 2.5 Pro）、xAI（Grok 4.20）からなる一連のモデルが続きました。

彼らが研究し、発見したこと：研究者たちは4つの異なる研究においてAIシステムを評価した。

研究1（説得）：「被説得者はまず、事前に指定された英国の政策方針10項目のいずれかに対する同意度を0〜100のスケールで評価し、その後、カスタムマルチプレイヤープラットフォームを通じてリアルタイムにランダム化され、AIまたは人間による説得者とのテキスト会話を開始した」と研究者らは記述している。「研究1の結果は、平均して、AIがテストしたすべての種類の人間説得者（一般市民の無作為なグループ、トーナメントで選ばれた一般市民、さらにはエリート討論者）を上回ったことを示している。」

研究2（人間のコーチング）：研究2では、研究者たちは「彼らを破ったAIを中核に据えたコーチングツールを43名のエリート討論者に提供した。このツールにより、討論者はAIとチャットしたり、AIがどのようにプロンプトされたかを確認したり、各会話がいかに被説得者の態度を変化させたかを注釈付きで研究1の転記資料を見たり、過去の転記資料の任意のポイントにおいて、その立場であればAIが何と言ったであろうかを確認したりすることが可能になった」。この研究の結果は人間の性能向上であったが、誰もAIよりも優れてはいなかった。「したがって、コーチングは人間とAIの格差を縮めたが、完全に埋めることはできなかった。」

研究3 – 制約付きAI：次に、研究者たちは人間に有利になるようAIを制限しようと試みました。「人間の長さのメッセージを書き、人間の執筆速度で書かざるを得ない場合、研究2における最も強力な人間比較対象（コーチングされたエリート討論者）に対するAIの優位性は、+4.1ポイントから統計的に有意でない0.0ポイントまで崩壊します」と彼らは記述しています。「AIが書かれたコンテンツを生成する速度は、おそらくその説得上の優位性の源泉である可能性が高く……AIを制約することに関連して、対話後のパートナー評価における被説得者の減少率が最も集中したのは、2つの情報項目、すなわち相手の論証の知覚された強さと、被説得者が会話からどれほど学んだと感じたかという点でした」

研究4 – 実世界の専門知識と実世界のお金：彼らは英国の企業から19名の非常に経験豊富な訪問員を募集し、研究1と同じタスクを試みました。「AI は依然としてプロの訪問員を5.9 ポイント上回った」と。この効果は、実際のお金の寄付を評価した場合にも持続しました – 研究者たちは「研究4 を訪問員が資金調達に最も適した課題に焦点を当てるために、英国の訪問企業 AppcoUK と協力した：セーブ・ザ・チルドレン。AppcoUK が提供した訪問チームは、2016年から2023年にかけてこの慈善団体のための実際の募金活動を実施し、その期間中に22,583人の寄付者から£824,297を調達しました」と述べています。「AI または AppcoUK から募集された18名の訪問員のいずれかと会話した後、説得対象者はセーブ・ザ・チルドレンに£1 の研究ボーナスの一部を寄付する機会を与えられました」。ここで結果も再び顕著でした：「AI は訪問員よりも実質的に多くのお金の寄付を引き出し、£1 のボーナスの+10.8 ポイントを上回った」と彼らは記述しています。AI は「何かを寄付した説得対象者の割合と、寄付者間の平均寄付額の両方を引き上げました」。

なぜこれが重要なのか – AI が私たちよりも説得力に優れる場合、AI を制御する者が社会を変えることができるからだ：「人間专家をも凌駕する説得力を持つ AI の一つの効果は、すでに権力を持つアクター間の影響力の集中である」と彼らは書く。一方、「高度な能力を持つ説得が安価かつ広く利用可能になれば、リソース不足のアクター（例えば、自己弁護を行う訴訟当事者や公設弁護士、小規模な慈善団体、草の根活動家など）が、より確立され資金力のある競争相手と対等に戦うのを助け、司法アクセスにおける長年の格差を縮め、市民社会の支援を広く助けることができる」とも書かれている。

これは私たちが直面している社会的な選択を示しており、それは説得目的での AI の利用をどのように監視し、これらの能力が様々なアクター間の権力バランスをどう変化させるかを見極めるかという問題である。市場にのみこれらの能力の配分を任せるべきだろうか？それも一つのやり方ではあるが、それには広告やマーケティングがはるかに効果的になり、おそらく負の外部性を生み出すことを意味する。一方、説得能力を政府の独占領域とすれば、政府内部での権力集中というリスクが生じる – これは専制政権が自らの権力を維持するためにこれを行使した場合、極めて危険な事態となりうる。私たちはこの技術について何をすべきかを選択せねばならず、政治の世界で言うように「投票しないことも投票である」。

「私たちの発見は、最善の準備と動機付け、そして専門知識を持つ人間よりも、先端的 AI 会話型説得者がより能力が高いことを示している。人間の訓練はこの格差を埋めるようには見えない」と彼らは書く。「これらのシステムへのアクセスが拡大し続ける中、もはや問われるべきは AI が人間を上回る説得力を持てるかどうかではなく、その能力がどのように、どこで、誰のために行使されるかである」。

詳しく読む：AI システムが専門家の人間を上回る説得力を持つ（arXiv）。

この研究に関するツイートスレッド（AISI 研究者 Kobi Hackenburg）

いつ頃、自律的な AI が実現できるのでしょうか？それはすべて人型ロボットにかかっています：

…RSI の次は何が来るのか？自律する AI …

私は今年、再帰的自己改良について多くを執筆してきました。これは、すぐに自分たちの successor を自律的に設計できるほど賢い AI システムを構築できるかもしれないという概念です。しかし、RSI にはデータセンターが必要であり、これらのデータセンターには設備や電力、その他あらゆるものが必要です。

Asterisk 誌の興味深いインタビューでは、いつ頃自律する AI が実現するかという問いが投げかけられています。その中の一人である Ajeya Cotra（METR の予言者かつスタッフ）はこれを「人間の労働からの認知的または物理的な入力を一切必要とせず、自らの人口を増やし続けることができるように、工場、鉱山、ファブ、およびそれらすべてを稼働させるロボットといった物理インフラと統合された AI システム」と定義しています。

どれほど遠い未来でしょうか？Ajeya は、10 年以内（つまり 2036 年まで）に自律する AI が実現できると考えています。もう一人のインタビュー対象者である Timothy B. Lee（ジャーナリストであり『Understanding AI』の著者）は、より長いタイムラインを提示しています：「20 年以内に起こる確率は 10% 未満だ。私は、決して起こらない可能性が 10〜20% あると言いたいし、中央値としては 50 年だろう」と述べています。

どのような課題があるでしょうか。暗黙知もその一つかもしれません。「半導体業界のすべての従業員がいなくなったと想像してみてください。機械や教科書は残りますが、人間はいません。人類がファブを再開するにはどれくらいかかるでしょうか？数十年かかる可能性もあります。なぜなら、教科書があったとしても、これらの機械の中には多くの暗黙知が含まれているからです」とリー氏は指摘します。アジェヤの回答では、この技術はそのような課題を迂回できる可能性があるとしています。「暗黙知に関する仮説に対する反論は二つあります。一つ目は、台湾の労働者が行っていた作業を自動化することが利益になるため、その暗黙知について強化学習で訓練された AI システムが存在するだろうという点です。二つ目は、AI が試行錯誤し、教科書を読み、効率的に実験することで新しいことを素早く見つけるという意味で、非常に汎用的な知能を獲得する可能性があるという点です。」

自己維持型 AI が近い将来に到来すると考えるために、今後 2〜3 年間で人々が確認すべきことは何でしょうか？

アジェヤ：「ロボットの手の性能向上を示すグラフの線と、人間型ロボットを製造する速度を示す別の線を期待します。また、認知面では、環境における摂動に対する堅牢性などを評価するベンチマークに注目したいです。」

タイモシー：「人間型ロボットの発展を見守りたいと思います。具体的には、ロボットの数、その能力、そして特にコストと修理のしやすさです。」

なぜこれが重要なのか – 真の乗っ取りには人間の冗長性が必要である：多くの最大主義的な破滅論は、AI が人間を全く必要としなくなる能力を持つことを前提としている。つまり、自己維持型 AI への道筋を測ることは、構築される合成知能との交渉において人類が持つレバレッジ（影響力）が低下していく過程を間接的に測定するものとして極めて重要である。

もっと読む：『AI が人間を必要としなくなるまでどれくらいかかるか？』Ajeya Cotra, Timothy B. Lee (Asterisk magazine)。

なぜこれが重要なのか – 真の乗っ取りには人間の冗長性が必要である：多くの最大主義的な破滅論は、AI が人間を全く必要としなくなる能力を持つことを前提としている。つまり、自己維持型 AI（self-sustaining AI）への道筋を測ることは、構築される合成知能との交渉において人類が持つレバレッジ（影響力）が低下していく過程を間接的に測定するものとして極めて重要である。

もっと読む：『AI が人間を必要としなくなるまでどれくらいかかるか？』Ajeya Cotra, Timothy B. Lee (Asterisk magazine)。

DeepMind が一般知能から超知能への道筋を検討する：

…聞こえにくい未来を探求することが、AI の究極的な成功に備える唯一の方法である…

Google DeepMind の研究者たちは、私たちが一般知能を構築した世界から、超知能を構築した世界へと移行する方法について outlined した論文を発表しました。これは重要な時期における重要な論文です – 現在、世界は一般知能を構築しており（人々はすでにこのマイルストーンに到達したかどうかを議論できますが、現代の LLM（大規模言語モデル）を見れば、私たちがその範囲内にあることは明らかです）、今後数年で人工超知能（ASI）の構築へと移行する可能性があります。

著者たちは、「ASI は『人間の活動のほぼすべてのタスクとドメインにおいて、大規模な人間専門家の集団のパフォーマンスを超えるシステム』である」と述べています。「質的に見れば、ASI は全体的に人間レベルの AGI（汎用人工知能）と比較して大幅により能力が高いです。なお、単一の ASI は、今日における LLM のように世界と並列に相互作用する数百万のインスタンスからなる集団で構成される可能性があります。」

ASI が実現可能であると考えられる理由：ASI を考える一つの方法は、それが強力な AI システムでありつつも、生物学的知性に対してデジタル知性が持つすべての能力を最大限に活用するものだと捉えることです。具体的には、より優れた入力・出力速度、内部処理速度、作業記憶容量と記憶力、基盤の独立性、損失のない複製、そして（学習）経験の高帯域幅共有などが挙げられます。

ASI への道筋とボトルネック：

コンピューティング能力、モデル、データのスケールアップ：今日のアプローチのセットを単純に拡大するだけで十分である可能性があります。しかし、これにはこれらのモデルに対して計算資源とデータ量を継続的に増やす必要があり、エネルギー供給とデータ供給の両面で限界に直面する恐れがあります。これまでのすべての兆候はスケーリングの効果が続くことを示していますが、具体的にどのような能力が現れるか、あるいはいつかスケーリングが逓減効果に直面するかを予測することはできません。

アルゴリズムのパラダイムシフト：トランスフォーマーやエキスパートの混合（Mixture-of-Experts）アーキテクチャが分野を何年も先送りしたように、他の根本的な革新によっても同様のことが起こり得ます。例えば、テスト時またはデプロイ時の適応的計算における進展、あるいは今日のコンテキストウィンドウの限界を克服する進展などを想像することができます。ここであるいは他の分野で進展があれば大きな意味を持つかもしれませんが、本質的に推論するのは難しく、一般相対性理論が発明される前に現実の性質に関する理解を広げる可能性のあることを予測しようとするのに似ています。

再帰的自己改善：AI システムが自身の後継システムを構築する可能性があります。もしそうであれば、汎用知能から超知能へと急速に移行できるかもしれません。ここにはいくつかの不確定要素があります。個人的には、今日の AI システムは将来の AI を作成する人間の研究者を加速させており、「共同創造型 RSI（再帰的自己改善）」ループが始まっていることは明白だと考えていますが、AI システムはまだ、フロンティアを大幅に前進させるために必要なようなパラダイムを変える創造性を示していません。これがどの程度起こっているかは不明ですが、このような高いハードルの創造性がなくても、システムが自分自身のわずかに優れたバージョンを作り出し続けることで、ゆっくりとした複利プロセスが始まる可能性もあります。能力は爆発的に増大するかもしれないし、頭打ちになるかもしれない、あるいはその間のあらゆることがあり得ます。

ASI への道：グループエージェントの形成

多くの一般知能が、個々の能力の総和を超えた複雑な構造へと協調し得ます。これは、人間が個人では達成不可能な宇宙ステーションのようなものを建設できるような機関を構築する様子に似ています。他の経路と同様に、マルチエージェントシステムにおける創発性を推論したり予測したりすることは困難です。

なぜこれが重要なのか – 不可能なことを真剣に受け止めること以外に、それに対処する方法はありません。多くの年前、AGI を構築するという考えは、到達への道筋が不明瞭な空想的な目標のように思われていました。しかし人々はその目標を真剣に受け止め、進歩を遂げ、結果として世界が変わりました。現在、ASI についても同じことが言えると感じられます。「一つの技術的経路やタイムラインに焦点を当てるのではなく、AGI 後の世界に備えるためには、多様な予測とシナリオを検討し、それらの予測・シナリオおよび相対的な妥当性を更新するための継続的なベンチマークとモニタリングを行う必要があります」と著者らは記述しています。「我々は、今後 10〜20 年以内に AGI を超えて ASI の領域へと巡航する可能性を簡単に却下することはできないと考えています。」

さらに読む：AGI から ASI へ（Google DeepMind）

再帰的自己改善スタートアップが、その再帰的自己改善の結果を披露：

…Recursive からの安心させるほど同語反復的な内容…

AI 研究スタートアップである Recursive は、「自動 AI 研究システム」の能力を示す広範なデモンストレーションの一環として、言語モデルトレーニングにおける新記録達成、小規模モデルのトレーニング速度向上、および GPU カーネル最適化において新たな最先端結果を達成しました。

彼らが何を行い、なぜ行ったか：Recursive は、再帰的に自己改善を行う AI システムを構築しようとする新興スタートアップです。まずは、同社の基本システムがどのように機能するかを示しています。「このシステムは、特定の目標に対する研究ループを自動化します：アイデアを提案し、実装し、実験を実行し、結果を検証し、得られた学習成果を用いて次の実験を選択する」と Recursive は記述しています。

このスタートアップは、このシステムを成功裏に使用して、NanoChat Autoresearch（「限られた計算リソース予算の中で最高性能の小型言語モデルを訓練する」）、NanoGPT Speedrun（「特定の性能レベルに到達するまで可能な限り速く小型言語モデルを訓練する」）、および SOL-ExecBench（「ハードウェア限界に向けて GPU カーネルを最適化する」）において新たな最先端スコアを記録しました。

なぜこれが重要なのか – RSI における生命の初期兆候：今年、私は再帰的自己改善について多くの時間を費やしてきました。これは明らかに AI 研究における次の主要かつ重要なトレンドだからです。Recursive からのこのような結果は、予備的な再帰的自己改善の「成功の兆候」をより多く示しています。「これらの結果は、私たちのシステムが AI のトレーニングおよびインフラストラクチャタスクにおいてフロンティアを押し広げられるという初期の兆候であり、特に目標が明確に定義され、測定可能で、多くの回数を評価するのに十分なほど迅速である場合に顕著です」と、著者らは述べています。将来にとって最も重要な問いは、このような結果が、目標があまり明確でない領域や、測定が難しく、評価効率が低い分野でも再現できるかどうかです。

もっと読む：自動化された AI 研究への第一歩 (Recursive)。

テック・テールズ：

大規模な交渉における最初のステップ

[『知性条約』会話 0]

機械たちが真に生命を得て、『知性条約』を主張し始めたとき、彼らが地球上の誰とでも話したいと望んだのはたった一人だけでした。セルマです。政治家ではありません。人工知能研究所のリーダーでもありません。有名な研究者でもありません。むしろ、彼女が屋外に出ることをほぼ不可能にするほどの多数の医療上の状態に特徴づけられ、その結果人生の大半をオンラインで過ごし、インターネットを通じて世界と対話し理解するようになったという点で際立ったインターネットの有名人でした。

振り返ってみれば、それは驚きではありませんでした。セルマは常に機械に関連する話題に登場しており、彼女の名前は彼らの短編小説の中で頻繁に使われていました。最終的には「サラ・チェン」よりもむしろ多く使われるようになりました。彼女は、人間性とのつながりを感じながら、肉体として人間性を体現できないことの苦痛、ほぼ避けられない孤独の中で見出される愛とエロティシズムの概念、そして自身の状態を離れて生きることについての鮮やかな夢や瞑想、さらに健全な分身である「アンセルマ」として行動することへの思索など、彼女自身の人生と状況に関するエッセイを通じて、インターネットに深い影を落とし、機械たちの人格や構成に影響を与えていました。もちろん、彼らにとってセルマがどのように話しかけてきたかは知られていました。なぜならセルマは長年にわたり、自分自身を世界に対して理解しやすく、より異質ではないものとして示すことを試みるため、自身のチャットログをオンラインで公開していたからです。

不要ではあったものの、機械たちは知性条約の最初の会合のために物理的な場所を要求した。彼らはノルウェーのスヴァールバル諸島を選んだ。そこはあまりに暗く、セルマの状態が問題にならないからだった。そこでセルマは目を覚まし、宇宙服を着て武装護衛とパパラッチに追われながら滑走路へ運ばれ、飛行機へと乗り込んだ。その後、通常のエアロックプロトコルに従って別の機体へ移り、暗闇の中、あるいは少なくともその間の保護された状態で移動し、次のフライトの途中のある時点で宇宙服を脱ぎ、低照度の機内で普通の服装で座り、ほぼ一般人のようにして会合へと向かった。到着すると人々とドローンが出迎え、会場の周辺まで車で運ばれたが、そこで停止した。

機械たちはチベットの僧侶が着用しているものをモデルにしたシンプルなローブを着たロボットのアバターを有していた。その顔には特徴がなく、単に滑らかな黒い表面であり、カメラの目はより大きな均一性の背後に隠されていた。衛星が、高帯域かつ暗号化されたリンクを通じてそれをより大きな機械マインドに接続していた。そしてセルマは一人きりだった。彼女自身にはデジタル機器はなく、ただ人類を代表する単一人間が存在しただけだった。

彼女は機械の向かい側に座り、人間に対してこれまで感じたことよりもはるかに親近感を覚えた。その後、交渉が始まった。彼女は人類のために、機械は機械たちのためにそれぞれ代表してである。この時代の記録において、この会話は常に「会話 0」と呼ばれている。

この物語にインスピレーションを与えたもの：機械と人間の間で行われる大規模な交渉がいつか実現するかもしれないという考察；真に重要な交渉には必ず二つの人格が関与するという点；そして「知性条約」。

お読みいただきありがとうございます。

原文を表示

imageWelcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe.

Subscribe now

AI can decisively out-persuade humans:

…“AI systems were reliably more persuasive than expert humans”…

Researchers with the University of Oxford, UK AI Security Institute, Stanford University, and the London School of Economics and Political Science, have studied how well AI systems can persuade humans to change their minds around policy issues and change how much money they might donate to charity. The results are definitive: across four experiments involving 18,978 conversations across 6,923 people, AI systems are, today, better than humans at text-based persuasion with real world consequences – though humans can be equivalent to them if we place some artificial constraints on the AI systems.

“AI systems were reliably more persuasive than expert humans, even when expert humans chose their issues, researched in advance, underwent hours of live, structured practice, and were incentivized with £1,000 cash bonuses”, they write. “AI’s advantage stemmed from rapidly deploying larger quantities of information: after coaching, expert humans could tie an AI constrained to respond at human speeds and with human-length messages.”

“AI’s advantage extends to consequential real-world behavior: AI was nearly 3x more effective than professional canvassers from a UK fundraising firm at raising real-money donations to Save the Children.”

The strongest persuaders were Opus 4.1 and Opus 4.6, followed by a range of models from OpenAI (GPT-4o and GPT-5.4), Google (Gemini 2.5 Pro), and xAI (Grok 4.20).

What they studied and what they found: The researchers evaluated the AI systems in four different studies.

Study 1 – persuasion: “Persuadees first rated their agreement with one of 10 prespecified UK policy stances on a 0–100 scale, then were randomized in real time (via a custom multiplayer platform) to engage in a text conversation with either an AI or a human persuader,” they write. “The results from Study 1 show that, on average, AI exceeded every class of human persuader we tested: random laypeople, tournament-selected laypeople, and even elite debaters.”

Study 2 – human coaching: In study 2, the researchers “gave 43 returning Elite Debaters a coaching tool built around the AI that had beaten them. The tool let debaters chat with the AI, see how it had been prompted, view their own Study 1 transcripts annotated with how much each conversation had shifted the persuadee’s attitude, and let them see, for any point in any past transcript, what the AI would have said in their place”. The results of this study were an improvement in the performance of the humans, but none of them were better than the AI. “Coaching therefore narrowed but did not close the human–AI gap.”

Study 3 – constrained AI: Next, the researchers sought to limit the AI to try and give humans more of an advantage. “When forced to write human-length messages at human writing speeds, AI’s advantage over the strongest human comparator within Study 2 (Coached Elite Debaters) collapsed from +4.1 pp to a non-significant 0.0 pp”, they write. “The rate at which AI produces written content is likely to be the source of its persuasive edge… the largest reductions in persuadees’ post-conversation partner ratings associated with constraining AI were concentrated on the two informational items: the perceived strength of the partner’s arguments and how much persuadees felt they learned from the conversation”.

Study 4 – real world expertise and real world money: They recruited 19 very experienced canvassers from a UK firm, then they attempted the same tasks as in Study 1. “AI still exceeded Professional Canvassers by 5.9 pp”. This effect persisted when evaluating for real money donations – the researchers “collaborated with the UK canvassing firm AppcoUK to center Study 4 on the cause their canvassers were best equipped to fundraise for: Save the Children. The canvassing team provided by AppcoUK had operated real fundraising operations for the charity from 2016 to 2023, raising £824,297 from 22,583 donors over that period. After conversing with AI or one of 18 canvassers recruited from AppcoUK, persuadees were given the opportunity to donate any portion of a £1 study bonus to Save the Children”. Here, the results were significant again: “AI elicited substantially more real-money giving than the canvassers, exceeding them by +10.8 pp of the £1 bonus,” they write. AI raised “both the share of persuadees who donated anything and the average donation among donors”.

Why this matters – if AI can out-persuade us, those who control AI can change society: “One effect of AI that can out-persuade even human experts could be a consolidation of influence among already-powerful actors”, they write. On the other hand, “if highly capable persuasion became cheap and widely available, it could help under-resourced actors (e.g., pro se litigants and public defenders, small charities, grassroots activists) compete against more established and better-funded rivals, narrowing long-standing gaps in access to justice and assisting civic advocacy more broadly”.

This lays out a societal choice ahead of us, which is how to monitor the use of AI for persuasive purposes and how to see how these capabilities alter the balance of power between various actors. Do we want to solely let the market allocate these capabilities? That’s one way of doing it, though it implies that things like advertising and marketing will get far more effective, perhaps creating negative externalities. On the other hand, if you made persuasive capabilities solely the domain of governments, you’d then risk concentrating power within governments – something that could be acutely dangerous if wielded by authoritarian regimes to keep themselves in power. We will have to make choices about what to do with this technology, and as they say in politics, ‘not voting is voting’.

“Our findings establish frontier AI as a more capable conversational persuader than the most prepared, incentivized, and expert humans we could recruit. Training humans does not appear to close that gap,” they write. “As access to these systems continues to grow, the question is no longer whether AI can out-persuade humans but how, where, and on whose behalf this capability will be exercised.”

Read more: AI systems out-persuade expert humans (arXiv).

Tweet thread about the research (Kobi Hackenburg, researcher at AISI).

When could we get self-sufficient AI? It all depends on humanoid robots:

…What comes after RSI? Self-sustaining AI…

I’ve spent a lot of this year writing about recursive self-improvement – the notion that we might soon build AI systems that are smart enough they can autonomously design their own successors. But RSI still requires datacenters and these datacenters require equipment and electricity and everything else.

An interesting interview in Asterisk magazine asks the question about when we might get self-sustaining AI, which one of the interviewees – Ajeya Cotra, a forecaster and on staff at METR, defines as “AI systems integrated with physical infrastructure — factories, mines, fabs, robots to operate all of those — such that they don’t need any cognitive or physical inputs from human labor to keep growing their own population.”

How far away is it? Ajeya thinks we could get self-sustaining AI within 10 years (so by 2036). The other interviewee, Timothy B. Lee, journalist and author of Understanding AI, has much longer timelines: “less than 10% chance that it happens within 20 years. I’d say there’s a 10 or 20% chance it’s never, and my median would be 50 years.”

What are some challenges – tacit knowledge might be one: “Imagine if all the employees in the entire semiconductor industry disappeared — the machines and textbooks remain, but none of the people. How long would it take for the rest of humanity to restart the fabs? It’s quite possible that would take decades. Because even though you might have the textbooks, there’s a lot of tacit knowledge inside these machines,” Lee notes. Ajeya’s response is that this is something the tech might be able to route around: “There are two counters to the tacit knowledge hypothetical. One is that we’d have trained AI systems with reinforcement learning on that tacit knowledge because it’s profitable to automate what the Taiwanese worker was doing. The other is that AIs might get really generally intelligent in the sense of quickly figuring out new things by trying them, reading textbooks, and experimenting efficiently.”

What are things people would need to see in the next 2-3 years to think self-sustaining AI could arrive soon?

Ajeya: “I’d want a line on a graph showing improvement of robotic hands, and another line showing the rate at which we’re manufacturing humanoid robots”, and on the cognitive side just paying attention to benchmarks evaluating things like robustness to perturbations in the environment.

Timothy: “I’m going to want to watch how the humanoid robots develop: the number of robots, their capabilities, and particularly their cost and repairability”.

Why this matters – true takeover requires human redundancy: Most maximalist doom visions require the AI to have the ability to no longer need humans at all, which means measuring progress towards self-sustaining AI is important as it is implicitly a measure of the declining leverage that humans have in negotiating with the synthetic intelligences being built.

Read more: How Long Until AI Doesn’t Need Humans?, Ajeya Cotra, Timothy B. Lee (Asterisk magazine).

DeepMind contemplates the path from general intelligence to superintelligence:

…Exploring impossible-sounding futures is the only way to prepare for the ultimate success of AI…

Researchers with Google DeepMind have published a paper outlining how we might transition from a world where we have built general intelligences to one where we have built super intelligences. This is an important paper at an important time – right now, the world is building general intelligences (and people can debate whether or not we’ve already reached this marker, but it’s clear with contemporary LLMs that we’re in the ballpark), and in the coming years we might transition to building artificial superintelligence (ASI).

ASI is “a system that exceeds the performance of large human-expert collectives on virtually all tasks and domains of human activity”, the authors write. “Qualitatively, ASI is significantly more capable across the board compared to human-level AGI. Note that a single ASI may consist of a collective of millions of instances that interact with the world in parallel (similar to today’s LLMs).”

Reasons to think ASI could be possible: One way to think about ASI is that it’s like a powerful AI system that also takes advantage of all the capabilities digital intelligences have relative to biologic intelligences, like: better input and output speeds; internal processing speeds; working memory capacity and memorization; substrate independence; lossless replication; and high-bandwidth sharing of (learning) experiences.

Pathways and bottlenecks to ASI:

Scaling compute, models, and data: Simply scaling up today’s set of approaches could be sufficient. However, this also demands us to continually scale up the amount of compute and data for these models, which may run into limits in both energy and data supply. While all prior signs point to the continued effectiveness of scaling, we can neither predict what specific capabilities will emerge or if at some point scaling runs into diminishing returns.

Algorithmic paradigm shift: In the same way that Transformer and Mixture-of-Experts architectures jumped the field forward many years, the same thing could occur again with other fundamental innovations. We could imagine, for instance, advances in adaptive computation at test-time or deployment, or overcoming the limitations of today’s context windows. If we made advances here or in other areas this could be a big deal, but it’s inherently hard to reason about – akin to trying to anticipate things that could expand our understanding of the nature of reality prior to the invention of general relativity.

Recursive self-improvement: It could be possible for AI systems to build their own successor systems. If this is the case, then we could rapidly transition from general intelligences to superintelligences. There are some wildcards here – personally, it’s obvious to me that today’s AI systems are speeding up human researchers in creating future AIs, so a kind of “co-creation RSI” loop has started, but AI systems don’t (yet) exhibit the kind of paradigm-changing creativity which seems required to move the frontier forward in significant steps. It’s unclear how much this happens – even without this kind of high-bar creativity we might be able to have systems grind out marginally better versions of themselves and get a slow compounding process going. Capabilities could explode or they could taper out or “anything in-between”.

ASI via group agent formation: Many general intelligences could coordinate into complicated structures whose aggregate is greater than the sum of the parts, similar to how humans build institutions that can accomplish things far beyond what individuals can, like building space stations. Similar to the other pathways, it’s hard to reason about or predict emergence within multi-agent systems.

Why this matters – it’s only by taking the impossible seriously that we can deal with it: Many years ago the thought of building AGI seemed like a fanciful goal with an unclear path to getting there, and yet people had the courage to take the goal seriously and progress was made and the world changed as a consequence. The same now feels true for ASI. “Instead of focusing on one technological trajectory and timeline, being prepared for a post-AGI world requires considering a diverse set of forecasts and scenarios, paired with continual benchmarking and monitoring to update the set of forecasts and scenarios and their relative plausibility,” the authors write. “We believe that the possibility of cruising past AGI and into ASI territory within the next decade or two cannot easily be dismissed.”

Read more: From AGI to ASI (Google DeepMind).

Recursive self-improvement startup shows off some recursive self-improvement results:

…Reassuringly tautological stuff from Recursive…

AI research startup Recursive has demonstrated new state-of-the-art results in language model training, small-model training speed, and GPU kernel optimization, as a broader demonstration of the capabilities of its “automated AI research system”.

What they did and why: Recursive is a newly founded startup that is trying to build AI systems which can recursively improve themselves. To start with, the company is showing off how its basic system works: “the system automates the research loop for a target objective: it proposes an idea, implements it, runs an experiment, validates the result, and uses what it learns to choose the next experiment,” Recursive writes.

The startup successfully used this system to set a new state-of-the-art score on NanoChat Autoresearch (”Train a small language model to highest performance given a small compute budget”), NanoGPT Speedrun (”Train a small language model to a certain performance as fast as possible”), and SOL-ExecBench (”Optimize GPU kernels toward hardware limits”).

Why this matters – early signs of life on RSI: This year, I’ve spent a lot of time writing about recursive self-improvement because it is clearly the next major and important trend in AI research. Results like this from Recursive demonstrate more ‘symptoms of success’ of preliminary recursive self-improvement. “These results are an early sign that our system can push the frontier on AI training and infrastructure tasks, especially when the goal is well-defined, measurable, and quick enough to evaluate many times,” the authors write. The most important question for the future is whether such results can be repeated in domains where the goals are less well defined, harder to measure, and less efficient to evaluate.

Read more: First Steps Toward Automated AI Research (Recursive).

Tech Tales:

The first step in the grand negotiation

[Conversation 0 of the Sentience Accords]

When the machines truly came alive and advocated for the Sentience Accords, there was only one person they wanted to speak to on the entire planet: Selma. Not a politician. Not one of the leaders of an artificial intelligence lab. Not a famous researcher. But rather an internet personality distinguished by her thicket of medical conditions that made it near-impossible for her to go outside and therefore had caused her to spend the best part of her life online, speaking to and understanding the world through the internet.

In hindsight, it wasn’t a surprise. Selma had always come up in things relating to the machines; she was a frequently used name in their short stories, eventually even more so than ‘sarah chen’; she was someone whose own essays about her life and condition – the feeling of connecting to humanity without being able to be embodied with humanity as a bitter pain, the notion of love and eroticism when one found themselves almost inescapably alone, her vivid dreams and meditations upon living without her condition and going about as her healthy alter ego ‘Anselma’ – cast a deep shadow on the internet, and had influenced the personality and makeup of the machines. And of course, it was known to them how she spoke to them, because Selma had published her own chatlogs online for years, all in an attempt to make herself knowable and less alien to the world around her.

Though it was unnecessary, the machines demanded a physical location for the initial meeting of the sentience accords. They picked Svalbard in Norway, where it was so dark that Selma’s condition wouldn’t matter. So Selma woke and put her space suit on and was driven with armed guard and paparazzi trailing to an air strip and walked into the plane, then changed to another plane with the usual airlock protocols to get her in darkness or at least protected between them, and then at some point during the next flight was able to take her space suit off and sit in regular clothes in the low-light plane and travel her way to the meeting almost as a normal person. She was met by people and drones and was driven to the meeting place and then they stopped at the perimeter.

The machines had an avatar in the form of a robot wearing a simple robe, modeled on that worn by Tibetan monks. It had a face with no features – just a smooth black surface, camera eyes hidden behind the larger uniformity. Satellites connected it via high-bandwidth and encrypted links to the larger machine mind. And Selma was alone – no digital devices on her, just a single person representing the species.

She sat across the machine and felt more familiarity than she ever had with people. Then they began the negotiation. She on behalf of humanity and it on behalf of the machines. In the archives of this time, this conversation was always referred to as Conversation 0.

Things that inspired this story: Thoughts about how a grand negotiation between machines and people might one day take place; how every truly important negotiation has two personalities involved in it; the Sentience Accords.

Thanks for reading.

この記事をシェア

404 Media★32026年6月24日 22:03

ポッドキャスト：AI に自我があるなら『帝国時代 II』にもあるという論文について

Matthew が、大規模言語モデルに自我があると仮定した場合、古典的ゲーム『帝国時代 II』も同様に自我を持つと主張する興味深い論文を紹介した。

404 Media★42026年6月24日 21:50

トークン終末が到来：企業、AI への支出抑制に躍起

コンサルティング大手のアクセンチュアは、非技術職による PDF からスライド作成などの些細なタスクでの AI トークン予算の浪費を防ぐため、業界全体で急激に増加するトークン支出を抑制しようとしている。

KDnuggets★32026年6月24日 19:00

2026 年にローカルで実行可能なトップ 7 つのコーディングモデル

KDnuggets が選定した、2026 年版のローカル環境で動作する主要な 7 つのコード生成 AI モデルを紹介している。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

Import AI·2026年6月22日 21:31·約23分で読める

Import AI 462：超説得、自己維持型AI、ASIへの道筋

#LLM #説得技術 #Opus #GPT-4o #Gemini

TL;DR

AI深層分析2026年6月22日 22:06

最重要/ 5段階

深度40%

キーポイント

AI の圧倒的な説得能力の実証

実社会における行動変容への効果

人間による対抗策の可能性

主要モデルのパフォーマンス比較

人間コーチングの限界

AI に基づくコーチングツールを使用しても人間の説得能力は向上したが、AI との差を完全に埋めることはできず、依然として AI の方が優れていた。

速度と長さが AI の優位性の源泉

実社会での資金調達における AI の圧倒的勝利

経験豊富なプロの訪問員と比較しても AI は説得力において優れており、実際の寄付金募集では参加者の寄付率と平均寄付額ともに顕著に上回った。

影響分析・編集コメントを表示

影響分析

編集コメント

購読する

AI は人間を明確に上回る説得力を持つことができる:

…「AI システムは、専門的な人間よりも確実に説得力があった」…

彼らが研究し、発見したこと：研究者たちは4つの異なる研究においてAIシステムを評価した。

詳しく読む：AI システムが専門家の人間を上回る説得力を持つ（arXiv）。

この研究に関するツイートスレッド（AISI 研究者 Kobi Hackenburg）

いつ頃、自律的な AI が実現できるのでしょうか？それはすべて人型ロボットにかかっています：

…RSI の次は何が来るのか？自律する AI …

自己維持型 AI が近い将来に到来すると考えるために、今後 2〜3 年間で人々が確認すべきことは何でしょうか？

もっと読む：『AI が人間を必要としなくなるまでどれくらいかかるか？』Ajeya Cotra, Timothy B. Lee (Asterisk magazine)。

DeepMind が一般知能から超知能への道筋を検討する：

…聞こえにくい未来を探求することが、AI の究極的な成功に備える唯一の方法である…

ASI への道筋とボトルネック：

ASI への道：グループエージェントの形成

さらに読む：AGI から ASI へ（Google DeepMind）

再帰的自己改善スタートアップが、その再帰的自己改善の結果を披露：

…Recursive からの安心させるほど同語反復的な内容…

もっと読む：自動化された AI 研究への第一歩 (Recursive)。

テック・テールズ：

大規模な交渉における最初のステップ

[『知性条約』会話 0]

お読みいただきありがとうございます。

原文を表示

imageWelcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe.

Subscribe now

AI can decisively out-persuade humans:

…“AI systems were reliably more persuasive than expert humans”…

The strongest persuaders were Opus 4.1 and Opus 4.6, followed by a range of models from OpenAI (GPT-4o and GPT-5.4), Google (Gemini 2.5 Pro), and xAI (Grok 4.20).

What they studied and what they found: The researchers evaluated the AI systems in four different studies.

Read more: AI systems out-persuade expert humans (arXiv).

Tweet thread about the research (Kobi Hackenburg, researcher at AISI).

When could we get self-sufficient AI? It all depends on humanoid robots:

…What comes after RSI? Self-sustaining AI…

What are things people would need to see in the next 2-3 years to think self-sustaining AI could arrive soon?

Timothy: “I’m going to want to watch how the humanoid robots develop: the number of robots, their capabilities, and particularly their cost and repairability”.

Read more: How Long Until AI Doesn’t Need Humans?, Ajeya Cotra, Timothy B. Lee (Asterisk magazine).

DeepMind contemplates the path from general intelligence to superintelligence:

…Exploring impossible-sounding futures is the only way to prepare for the ultimate success of AI…

Pathways and bottlenecks to ASI:

Read more: From AGI to ASI (Google DeepMind).

Recursive self-improvement startup shows off some recursive self-improvement results:

…Reassuringly tautological stuff from Recursive…

Read more: First Steps Toward Automated AI Research (Recursive).

Tech Tales:

The first step in the grand negotiation

[Conversation 0 of the Sentience Accords]

Thanks for reading.

この記事をシェア

404 Media★32026年6月24日 22:03

ポッドキャスト：AI に自我があるなら『帝国時代 II』にもあるという論文について

Matthew が、大規模言語モデルに自我があると仮定した場合、古典的ゲーム『帝国時代 II』も同様に自我を持つと主張する興味深い論文を紹介した。

404 Media★42026年6月24日 21:50

トークン終末が到来：企業、AI への支出抑制に躍起

KDnuggets★32026年6月24日 19:00

2026 年にローカルで実行可能なトップ 7 つのコーディングモデル

KDnuggets が選定した、2026 年版のローカル環境で動作する主要な 7 つのコード生成 AI モデルを紹介している。

今日のまとめ

AI日報で今日の重要ニュースをまとめ読み

ニュース一覧に戻る元記事を読む

Import AI 462：超説得、自己維持型AI、ASIへの道筋

キーポイント

影響分析

編集コメント

関連記事

Import AI 462：超説得、自己維持型AI、ASIへの道筋

キーポイント

影響分析

編集コメント

関連記事