Figure 1 from Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
By a mysterious writer
Description
Figure 1: Training AlphaZero for 700,000 steps. Elo ratings were computed from evaluation games between different players, with each player given one second per move. (a) Performance of AlphaZero in chess, compared to the 2016 TCEC world-champion program Stockfish. (b) Performance of AlphaZero in shogi, compared to the 2017 CSA world-champion program Elmo. (c) Performance of AlphaZero in Go, compared to AlphaGo Lee and AlphaGo Zero (20 block / 3 day) (29).
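The caption reports playing strength as Elo ratings derived from evaluation games. As a rough illustration of what an Elo gap means, the sketch below uses the standard logistic Elo model with a 400-point scale. This is an assumption: the caption does not state the exact rating procedure used, and the function names expected_score and elo_difference are hypothetical, chosen only for this example.

```python
import math


def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score (win = 1, draw = 0.5, loss = 0) of player A against
    player B under the standard logistic Elo model (assumed, not taken
    from the caption)."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


def elo_difference(score: float) -> float:
    """Rating difference (A minus B) implied by A's average score
    over a set of evaluation games. The score must lie strictly
    between 0 and 1, otherwise the difference is unbounded."""
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return -400.0 * math.log10(1.0 / score - 1.0)
```

Under this model, equal ratings give an expected score of 0.5, and a 400-point advantage corresponds to an expected score of about 0.91. The two functions are inverses of each other, so an observed average score from evaluation games can be converted back into an estimated rating gap.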
Algorithms, Free Full-Text
Electronics, Free Full-Text
Mastering construction heuristics with self-play deep reinforcement learning
Shogi - Wikipedia
[PDF] Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
AlphaZero: The AI from Google which mastered Chess in 4 hours, by University of Toronto Machine Intelligence Team
MuZero figures out chess, rules and all
Figure 1 from Giraffe: Using Deep Reinforcement Learning to Play Chess