PDF) Tackling Morpion Solitaire with AlphaZero-likeRanked Reward
Por um escritor misterioso
Descrição
Two-Agent Self-Play
PDF) Potential-based Reward Shaping in Sokoban
PDF] Monte Carlo Q-learning for General Game Playing
Adaptive Warm-Start MCTS in AlphaZero-Like Deep Reinforcement Learning
PDF] Analysis of Hyper-Parameters for Small Games: Iterations or Epochs in Self-Play?
Deep Reinforcement Learning for Morpion Solitaire
AlphaGo tag ·
PDF) Monte Carlo Q-learning for General Game Playing
The Survey of Self-play Method in Computer Games
PDF) Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization
PDF] A0C: Alpha Zero in Continuous Action Space