Kenshi Abe

Off-Policy Evaluation in Two-Player Zero-Sum Markov Games
Optimization Approaches for Counterfactual Risk Minimization with Continuous Actions
Deep Counterfactual Regret Minimization
AI for Multiplayer Imperfect-Information Games: Poker and Mahjong as Examples
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
Deep Q-learning from Demonstrations
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Evolved Policy Gradients