平均場ゲーム理論

平均場ゲーム理論（へいきんばゲームりろん、Mean-field game theory）は、非常に大規模な集団における小さな相互作用エージェントによる戦略的意思決定の研究である。

解説

ゲーム理論と確率分析および制御理論の交差点にある。「平均場」という用語の使用は、個々の粒子がシステムに与える影響がごくわずかである多数の粒子のシステムの挙動を考慮する物理学の平均場理論に触発されている。言い換えると、各エージェントは、他のエージェントの決定を考慮して、最小化または最大化の問題に従って行動し、その母集団が多いため、エージェントの数は無限大へ向かうと仮定でき、代表的なエージェントが存在するとも仮定できる。^[1]

伝統的なゲーム理論では、研究対象は通常、2人のプレイヤーと離散的な時間空間を持つゲームであり、帰納法によって結果をより複雑な状況に拡張する。ただし、連続状態を持つ連続時間のゲーム(差分ゲームまたは確率的差分ゲーム)の場合、動的相互作用が生成する複雑さのために、この戦略は使用できない。一方、MFGでは、平均代表エージェントを介して多数のプレーヤーを処理できると同時に、複雑な状態のダイナミクスを記述できる。

このクラスの問題は、ボヤン・ヨバノビッチとロバート・W・ローゼンタールによる経済学文献^[2]、ミンイ・ファン、ローランド・マルハメ、ピーター・E・ケインズによる工学文献^[3]^[4]^[5] 、そして数学者ジャン・ミッシェル・ラスリーとピエール＝ルイ・リオンによって独立してほぼ同時に検討された^[6]^[7]。

連続時間では、平均場ゲームは通常、個人の最適制御を記述するハミルトン–ヤコビ–ベルマン方程式と、エージェントの集合分布のダイナミクスを記述するフォッカー–プランク方程式で構成される。かなり一般的な仮定の下では、平均場ゲームのクラスが次のようにNプレイヤーのナッシュ均衡の $N\to \infty$ の極限であることを証明できる^[8]。

平均場ゲームに関連する概念は、「平均場型制御」である。この場合、ソーシャルプランナーは状態の分布を制御し、制御戦略を選択する。平均場型制御問題の解は、通常、コルモゴロフ方程式と結合した二重随伴ハミルトン-ヤコビ-ベルマン方程式として表すことができる。平均場型ゲーム理論は、単一エージェント平均場型制御のマルチエージェント一般化である^[9]。

平均場ゲームの一般形式

次の連立方程式を使用して^[10] 、典型的な平均場ゲームをモデル化できる。

${\begin{cases}\partial _{t}u-\nu \Delta u+H(x,m,Du)=0&(1)\\\partial _{t}m-\nu \Delta m-div(D_{p}H(x,m,Du)m)=0&(2)\\m(0)=m_{0}&(3)\\u(x,T)=G(x,m(T))&(4)\end{cases}}$

この一連の方程式の基本的なダイナミクスは、平均的なエージェントの最適制御問題によって説明できる。平均場ゲームでは、平均的なエージェントは、次の方法で移動αを制御して、母集団の全体的な位置に影響を与えることができる。

$dX_{t}=\alpha _{t}d_{t}+{\sqrt {2\nu }}B_{t}$

$\nu$ はパラメータであり、 $B_{t}$ は標準ブラウン運動。エージェントの動きを制御することにより、エージェントは、期間 $[0,T]$ を通じて全体的な予想コスト $C$ を最小限に抑えることを目指している。

$C=\mathbb {E} [\int _{0}^{T}L(X_{s},\alpha _{s},m(s))ds+G(X_{T},m(T))]$

$L(X_{s},\alpha _{s},m(s))$ は時間 $s$ におけるランニングコストで $G(X_{T},m(T))$ は時間 $T$ におけるターミナルコスト。定義により、時間 $t$ と位置 $x$ について、価値関数 $u(t,x)$ は以下のように決定できる。

$u(t,x)=\inf _{\alpha }\mathbb {E} [\int _{t}^{T}L(X_{s},\alpha _{s},m(s))ds+G(X_{T},m(T))]$

価値関数 $u(t,x)$ の定義が与えられると、ハミルトン-ヤコビ方程式 (1) で追跡できる。平均的なプレーヤーの最適なアクション $\alpha ^{*}(x,t)$ はとして求めることができる。すべてのエージェントは比較的小さく、集団のダイナミクスを単独で変更することはできないので、それらは個別に最適な制御を適応させ、人口はそのように移動する。これは、すべてのエージェントが他の特定の戦略のセットに応じて行動するナッシュ均衡に似ている。最適制御解は、コルモゴロフ-フォッカー-プランク方程式(2)につながる。

有限状態ゲーム

平均場の顕著なカテゴリは、有限数の状態と有限数のプレイヤーあたりのアクションを持つゲームである。これらのゲームでは、ハミルトン-ヤコビ-ベルマン方程式の類似物はベルマン方程式であり、フォッカー-プランク方程式の離散バージョンはコルモゴロフ方程式である。具体的には、離散時間モデルの場合、プレイヤーの戦略はコルモゴロフ方程式の確率行列である。連続時間モデルでは、プレイヤーは遷移率行列を制御することができる。

離散平均場ゲームはタプル ${\mathcal {G}}=({\mathcal {E}},{\mathcal {A}},\{Q_{a}\},{\bf {m}}_{0},\{c_{a}\},\beta )$ ,で定義でき、 ${\mathcal {E}}$ は状態空間、 ${\mathcal {A}}$ は作用集合、 $Q_{a}$ は遷移速度行列、 ${\bf {m}}_{0}$ は初期状態、 $\{c_{a}\}$ はコスト関数、 $\beta$ $\in \mathbb {R}$ は割引係数である。さらに、混合戦略は測定可能な関数 $\pi :\mathbb {E} \times \mathbb {R} ^{+}{\xrightarrow[{}]{}}{\mathcal {P(A)}}$ , これは各状態 $i\in {\mathcal {E}}$ と $t\geq 0$ ごとに可能なアクションのセットに対する確率測度 $\pi _{i}(t)\in {\mathcal {P(A)}}$ に関連付ける。したがって、 $\pi _{i,a}(t)$ は、時間 $t$ において、状態 $i$ のプレイヤーが戦略の下で行動 $a$ をとる確率である。さらに、レート行列 $\{Q_{a}({\bf {m}}^{\pi }(t))\}_{a\in {\mathcal {A}}}$ は母集団分布の経時的な進化を定義し、ここで ${\bf {m}}^{\pi }(t)\in {\mathcal {P({\mathcal {E}})}}$ は時刻 $t$ における母集団分布である^[11]。

線形二次ガウスゲーム問題

Caines(2009)から、大規模ゲームの比較的単純なモデルは線形二次ガウスモデルである。個々のエージェントのダイナミクスは、確率微分方程式としてモデル化される。 $dX_{i}=(a_{i}X_{i}+b_{i}u_{i})\,dt+\sigma _{i}\,dW_{i},\quad i=1,\dots ,N,$ $X_{i}$ は $i$ 番目のエージェントの状態で, $u_{i}$ は $i$ 番目のエージェントの制御, $W_{i}$ は独立の $i=1,\dots ,N$ に対するウィーナー過程である。個々のエージェントのコストは、 $J_{i}(u_{i},\nu )=\mathbb {E} \left\{\int _{0}^{\infty }e^{-\rho t}\left[(X_{i}-\nu )^{2}+ru_{i}^{2}\right]\,dt\right\},\quad \nu =\Phi \left({\frac {1}{N}}\sum _{k\neq i}^{N}X_{k}+\eta \right).$ エージェント間の結合はコスト関数で発生する。

一般および応用用途

平均場ゲームのパラダイムは、分散意思決定と確率的モデリングの間の主要なつながりとなっている。確率的制御の文献から始まり、次のようなさまざまなアプリケーションで急速に採用されている。

金融市場。Carmonaは、MFGパラダイムの枠組みの中でキャストして取り組むことができる金融工学と経済学のアプリケーションをレビューしている^[12] 。カルモナは、マクロ経済学、契約理論、金融などのモデルは、より伝統的な離散時間モデルから連続時間への切り替えから大きな恩恵を受けると主張している。彼はレビューの章で、システミックリスク、価格への影響、最適な執行、銀行経営のモデル、高頻度取引、暗号通貨など、連続時間モデルのみを検討している。

群衆の動き。MFGは、個人が特定のコストに関して戦略とパスを最適化しようとする賢いプレーヤーであることを前提としている(合理的期待アプローチとの均衡)。MFGモデルは、予測現象を記述するのに役立つ:前方部分は群衆の進化を記述し、後方部分は予測がどのように構築されるかのプロセスを提供する。さらに、マルチエージェントの微視的モデル計算と比較して、MFGは巨視的シミュレーションの計算コストが低くて済む。一部の研究者は、人口間の相互作用をモデル化し、2つの歩行者グループ間の嫌悪感と渋滞行動^[13]、朝の通勤者の出発時間の選択^[14]、自動運転車の意思決定プロセスなど^[15]、インテリジェントエージェントの意思決定プロセスを研究するためにMFGに目を向けた。

エピデミックの制御と緩和。流行は社会と個人に大きな影響を与えているため、MFGと平均場制御(MFC)は、特にCovid-19パンデミック対応のコンテキストで、根底にある人口動態を研究および理解するための視点を提供する。MFGは、空間効果でSIRタイプのダイナミクスを拡張したり、個人が自分の行動を選択し、病気の蔓延への寄与を制御できるようにするために使用されている。MFCは、空間領域内でのウイルスの拡散を制御し、社会的相互作用を制限する個人の決定を制御し、政府の非医薬品介入をサポートするための最適な戦略を設計するために適用される。^[16]^[17] ^[18]

出典

^ Vasiliadis, Athanasios. "An Introduction to Mean Field Games using probabilistic methods". arXiv:1907.01411 [math.OC]。
^ Jovanovic, Boyan; Rosenthal, Robert W. (1988). “Anonymous Sequential Games”. Journal of Mathematical Economics 17 (1): 77–87. doi:10.1016/0304-4068(88)90029-8.
^ Huang, M. Y.; Malhame, R. P.; Caines, P. E. (2006). “Large Population Stochastic Dynamic Games: Closed-Loop McKean–Vlasov Systems and the Nash Certainty Equivalence Principle”. Communications in Information and Systems 6 (3): 221–252. doi:10.4310/CIS.2006.v6.n3.a5. Zbl 1136.91349.
^ Nourian, M.; Caines, P. E. (2013). “ε–Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents”. SIAM Journal on Control and Optimization 51 (4): 3302–3331. arXiv:1209.5684. doi:10.1137/120889496.
^ Djehiche, Boualem; Tcheukam, Alain; Tembine, Hamidou (2017). “Mean-Field-Type Games in Engineering”. AIMS Electronics and Electrical Engineering 1 (1): 18–73. arXiv:1605.03281. doi:10.3934/ElectrEng.2017.1.18.
^ Lions, Pierre-Louis; Lasry, Jean-Michel (March 2007). “Large investor trading impacts on volatility”. Annales de l'Institut Henri Poincaré C 24 (2): 311–323. Bibcode: 2007AIHPC..24..311L. doi:10.1016/j.anihpc.2005.12.006.
^ Lasry, Jean-Michel; Lions, Pierre-Louis (28 March 2007). “Mean field games”. Japanese Journal of Mathematics 2 (1): 229–260. doi:10.1007/s11537-007-0657-8.
^ Cardaliaguet (September 27, 2013). “Notes on Mean Field Games”. Template:Cite webの呼び出しエラー：引数 accessdate は必須です。
^ Bensoussan, Alain; Frehse, Jens; Yam, Phillip (2013) (英語). Mean Field Games and Mean Field Type Control Theory. Springer Briefs in Mathematics. New York: Springer-Verlag. ISBN 9781461485070 ^{[要ページ番号]}
^ Achdou, Yves (2020). Mean field games : Cetraro, Italy 2019. Pierre Cardaliaguet, F. Delarue, Alessio Porretta, Filippo Santambrogio. Cham. ISBN 978-3-030-59837-2. OCLC 1238206187
^ Doncel, Josu; Gast, Nicolas; Gaujal, Bruno (2019). “Discrete mean field games: Existence of equilibria and convergence”. Journal of Dynamics & Games: 1–19. arXiv:1909.01209. doi:10.3934/jdg.2019016.
^ Carmona, Rene (2020). "Applications of mean field games in financial engineering and economic theory". arXiv:2012.05237 [q-fin.GN]。
^ Lachapelle, Aimé; Wolfram, Marie-Therese (2011). “On a mean field game approach modeling congestion and aversion in pedestrian crowds”. Transportation Research Part B: Methodological 45 (10): 1572–1589. doi:10.1016/j.trb.2011.07.011.
^ Feinstein, Zachary; Sojmark, Andreas (2019). "A dynamic default contagion model: From Eisenberg-Noe to the mean field". arXiv:1912.08695 [q-fin.MF]。
^ Huang, Kuang; Chen, Xu; Di, Xuan; Du, Qiang (2021). “Dynamic driving and routing games for autonomous vehicles on networks: A mean field game approach”. Transportation Research Part C: Emerging Technologies 128: 103189. arXiv:2012.08388. doi:10.1016/j.trc.2021.103189.
^ Lee, Wonjun; Liu, Siting; Tembine, Hamidou; Li, Wuchen; Osher, Stanley (2021). “Controlling propagation of epidemics via mean-field control”. SIAM Journal on Applied Mathematics 81 (1): 190–207. arXiv:2006.01249. doi:10.1137/20M1342690.
^ Aurell, Alexander; Carmona, Rene; Dayanikli, Gokce; Lauriere, Mathieu (2022). “Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach”. SIAM Journal on Control and Optimization 60 (2): S294–S322. arXiv:2011.03105. doi:10.1137/20M1377862.
^ Elie, Romuald; Hubert, Emma; Turinici, Gabriel (2020). “Contact rate epidemic control of COVID-19: an equilibrium view”. Mathematical Modelling of Natural Phenomena 15: 35. doi:10.1051/mmnp/2020022.

外部リンク

Mean Field Stochastic Control (Slides), 2009 IEEE Control Systems Society Bode Prize Lecture by Peter E. Caines
Caines, Peter E. (2013). “Mean Field Games”. Encyclopedia of Systems and Control. pp. 1–6. doi:10.1007/978-1-4471-5102-9_30-1. ISBN 978-1-4471-5102-9
Notes on Mean Field Games, from Pierre-Louis Lions' lectures at Collège de France
(フランス語) Video lectures by Pierre-Louis Lions
Mean field games and applications by Olivier Guéant, Jean-Michel Lasry, and Pierre-Louis Lions

[1] Vasiliadis, Athanasios. "An Introduction to Mean Field Games using probabilistic methods". arXiv:1907.01411 [math.OC]。

[2] Jovanovic, Boyan; Rosenthal, Robert W. (1988). “Anonymous Sequential Games”. Journal of Mathematical Economics 17 (1): 77–87. doi:10.1016/0304-4068(88)90029-8.

[3] Huang, M. Y.; Malhame, R. P.; Caines, P. E. (2006). “Large Population Stochastic Dynamic Games: Closed-Loop McKean–Vlasov Systems and the Nash Certainty Equivalence Principle”. Communications in Information and Systems 6 (3): 221–252. doi:10.4310/CIS.2006.v6.n3.a5. Zbl 1136.91349.

[4] Nourian, M.; Caines, P. E. (2013). “ε–Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents”. SIAM Journal on Control and Optimization 51 (4): 3302–3331. arXiv:1209.5684. doi:10.1137/120889496.

[5] Djehiche, Boualem; Tcheukam, Alain; Tembine, Hamidou (2017). “Mean-Field-Type Games in Engineering”. AIMS Electronics and Electrical Engineering 1 (1): 18–73. arXiv:1605.03281. doi:10.3934/ElectrEng.2017.1.18.

[6] Lions, Pierre-Louis; Lasry, Jean-Michel (March 2007). “Large investor trading impacts on volatility”. Annales de l'Institut Henri Poincaré C 24 (2): 311–323. Bibcode: 2007AIHPC..24..311L. doi:10.1016/j.anihpc.2005.12.006.

[7] Lasry, Jean-Michel; Lions, Pierre-Louis (28 March 2007). “Mean field games”. Japanese Journal of Mathematics 2 (1): 229–260. doi:10.1007/s11537-007-0657-8.

[8] Cardaliaguet (September 27, 2013). “Notes on Mean Field Games”. Template:Cite webの呼び出しエラー：引数 accessdate は必須です。

[9] Bensoussan, Alain; Frehse, Jens; Yam, Phillip (2013) (英語). Mean Field Games and Mean Field Type Control Theory. Springer Briefs in Mathematics. New York: Springer-Verlag. ISBN 9781461485070 ^{[要ページ番号]}

[10] Achdou, Yves (2020). Mean field games : Cetraro, Italy 2019. Pierre Cardaliaguet, F. Delarue, Alessio Porretta, Filippo Santambrogio. Cham. ISBN 978-3-030-59837-2. OCLC 1238206187

[11] Doncel, Josu; Gast, Nicolas; Gaujal, Bruno (2019). “Discrete mean field games: Existence of equilibria and convergence”. Journal of Dynamics & Games: 1–19. arXiv:1909.01209. doi:10.3934/jdg.2019016.

[12] Carmona, Rene (2020). "Applications of mean field games in financial engineering and economic theory". arXiv:2012.05237 [q-fin.GN]。

[13] Lachapelle, Aimé; Wolfram, Marie-Therese (2011). “On a mean field game approach modeling congestion and aversion in pedestrian crowds”. Transportation Research Part B: Methodological 45 (10): 1572–1589. doi:10.1016/j.trb.2011.07.011.

[14] Feinstein, Zachary; Sojmark, Andreas (2019). "A dynamic default contagion model: From Eisenberg-Noe to the mean field". arXiv:1912.08695 [q-fin.MF]。

[15] Huang, Kuang; Chen, Xu; Di, Xuan; Du, Qiang (2021). “Dynamic driving and routing games for autonomous vehicles on networks: A mean field game approach”. Transportation Research Part C: Emerging Technologies 128: 103189. arXiv:2012.08388. doi:10.1016/j.trc.2021.103189.

[16] Lee, Wonjun; Liu, Siting; Tembine, Hamidou; Li, Wuchen; Osher, Stanley (2021). “Controlling propagation of epidemics via mean-field control”. SIAM Journal on Applied Mathematics 81 (1): 190–207. arXiv:2006.01249. doi:10.1137/20M1342690.

[17] Aurell, Alexander; Carmona, Rene; Dayanikli, Gokce; Lauriere, Mathieu (2022). “Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach”. SIAM Journal on Control and Optimization 60 (2): S294–S322. arXiv:2011.03105. doi:10.1137/20M1377862.

[18] Elie, Romuald; Hubert, Emma; Turinici, Gabriel (2020). “Contact rate epidemic control of COVID-19: an equilibrium view”. Mathematical Modelling of Natural Phenomena 15: 35. doi:10.1051/mmnp/2020022.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

表話編歴ゲーム理論
定義	非協力ゲーム協力ゲーム標準型ゲーム展開型ゲームベイジアンゲーム簡潔ゲーム（英語版）情報集合信念の階層選好進化ゲームハイパーゲーム（英語版）行動ゲーム
解概念と精緻化	ナッシュ均衡部分ゲーム完全均衡 Mertens-stable equilibrium（英語版）ベイジアン・ナッシュ均衡完全ベイズ均衡摂動完全均衡プロパー均衡 ε均衡相関均衡（英語版、ドイツ語版）逐次均衡準完全均衡進化的安定戦略リスク支配コアシャープレイ値パレート効率性質的応答均衡自己確証均衡強ナッシュ均衡（英語版、ヘブライ語版）マルコフ完全均衡（英語版）戦略的補完性合理化可能性直観的基準
戦略	支配戦略混合戦略（英語版）しっぺ返し戦略トリガー戦略共謀（英語版）後ろ向き帰納法前向き帰納法マルコフ戦略（英語版）主人と奴隷
ゲームのクラス	対称ゲーム（英語版）完全情報完全情報ゲーム完備情報不完備情報ゲーム確実情報同時手番ゲーム逐次手番ゲーム（英語版）繰り返しゲームシグナリングゲームチープトークゼロ和非ゼロ和メカニズムデザイン交渉問題（英語版）確率ゲーム（英語版）大ポアソンゲーム（英語版）非推移的ゲームグローバルゲーム（英語版）特性関数型ゲーム二人零和有限確定完全情報ゲーム
ゲーム	囚人のジレンマ旅人のジレンマ（英語版）協調ゲーム（英語版）チキンゲームムカデゲーム（英語版）ボランティアのジレンマ（英語版）ドル・オークション（英語版）男女の争い（英語版）スタグハントゲームマッチングペニー（英語版）最後通牒ゲームじゃんけん海賊ゲーム（英語版）独裁者ゲーム（英語版）公共財ゲーム（英語版） Blotto games（英語版）消耗戦（英語版）エルファロル・バー問題公平分割行き詰まり（英語版）割り勘のジレンマ Guess 2/3 of the average（英語版）クーン・ポーカー交渉問題（英語版）スクリーニングゲーム（英語版）囚人と帽子のパズル（英語版） Trust game（英語版） Princess and monster game（英語版）モンティ・ホール問題クールノー競争ベルトラン競争シュタッケルベルグ競争
定理	ミニマックス法ナッシュの定理純化定理フォーク定理顕示原理（英語版）アローの不可能性定理
主要人物	ケネス・アローロバート・オーマンケン・ビンモアサミュエル・ボールズメルヴィン・ドレッシャー（英語版）メリル・フラッド（英語版）ドリュー・フューデンバーグ（英語版）ドナルド・ギリースジョン・ハーサニレオニード・ハーヴィッツデイヴィッド・レヴァイン（英語版）ダニエル・カーネマンハロルド・クーンエリック・マスキンジャン＝フランソワ・メルタン（英語版）ポール・ミルグロムオスカー・モルゲンシュテルンロジャー・マイヤーソンジョン・ナッシュジョン・フォン・ノイマンアリエル・ルービンシュタイントーマス・シェリングラインハルト・ゼルテンハーバート・サイモンロイド・シャープレージョン・メイナード＝スミスジャン・ティロールアルバート・タッカーウィリアム・ヴィックリーロバート・ウィルソンペイトン・ヤング（英語版）
関連項目	コモンズの悲劇 Tyranny of small decisions（英語版） All-pay auction（英語版）ゲーム理論におけるゲームの一覧（英語版） Confrontation analysis（英語版）ゲーム理論家の一覧（英語版）数学経済学進化論集団遺伝学オペレーションズリサーチ社会生物学環境社会学クープマンモデル
カテゴリ