DQN for Solving a Maze in Less Than 10 Minutes of Training
Is it possible to train a DQN to solve a maze with non-convex obstacles, as a long-horizon navigation task, in 10 minutes of training or less? The rules are:
- You cannot use old data except for the replay buffer.
- The inputs are only the x and y coordinates of the state and the distance from the agent to the goal.
- The step size must not exceed 2% of the total maze size.
- You must start from the same initial state […]
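To make the input and step-size rules concrete, here is a minimal sketch of how the observation and the capped step might be computed. It is not taken from any particular implementation. The unit-square maze (`MAZE_SIZE`), the four-direction action set (`ACTIONS`), and the function names `observation` and `step` are all assumptions made for illustration, and obstacle collisions are deliberately left out.

```python
import math

MAZE_SIZE = 1.0               # assumption: the maze is a unit square
MAX_STEP = 0.02 * MAZE_SIZE   # rule: step size must not exceed 2% of the maze size

# Assumption: a discrete action set of four axis-aligned directions.
ACTIONS = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0)]


def observation(pos, goal):
    """Return the only inputs the rules allow: x, y, and distance to the goal."""
    x, y = pos
    return (x, y, math.hypot(goal[0] - x, goal[1] - y))


def step(pos, action_index, step_size=MAX_STEP):
    """Move in the chosen direction, capping the step length at MAX_STEP.

    The new position is clamped to the maze bounds. Obstacle handling is
    omitted in this sketch.
    """
    size = min(step_size, MAX_STEP)
    dx, dy = ACTIONS[action_index]
    x = min(max(pos[0] + size * dx, 0.0), MAZE_SIZE)
    y = min(max(pos[1] + size * dy, 0.0), MAZE_SIZE)
    return (x, y)
```

With a cap of 2% per step, crossing the maze along one axis takes at least 50 steps even without obstacles, which is part of what makes the task long-horizon.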