Download PDFOpen PDF in browserA Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges and SolutionsEasyChair Preprint 6985, version 226 pages•Date: May 19, 2023AbstractThe Arcade Learning Environment (ALE) is proposed as an evaluation platform for empirically assessing the generality of agents across dozens of Atari 2600 games. ALE offers various challenging problems and has drawn significant attention from the deep reinforcement learning (RL) community. From Deep Q-Networks (DQN) to Agent57, RL agents seem to achieve superhuman performance in ALE. However, is this the case? In this paper, to explore this problem, we first review the current evaluation metrics in the Atari benchmarks and then reveal that the current evaluation criteria of achieving superhuman performance are inappropriate, which underestimated the human performance relative to what is possible. To handle those problems and promote the development of RL research, we propose a novel Atari benchmark based on human world records (HWR), which puts forward higher requirements for RL agents on both final performance and learning efficiency. Keyphrases: Human World Records Benchmark, Reinforcement Learning, Superhuman Agents, The Arcade Learning Environment
|