Kenshi Abe

Sort by

Latest

Most popular

二人零和マルコフゲームにおけるオフ方策評価

Optimization Approaches for Counterfactual Risk Minimization with Continuous Actions

Deep Counterfactual Regret Minimization

多人数不完全情報ゲームにおけるAI ~ポーカーと麻雀を例として~

Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations

Deep Q-learning from Demonstrations

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

Evolved policy gradients