Average DQN implementation #206

rbauld · 2018-05-19T00:45:58Z

This is my attempt to implement averaged DQN, based on this paper: https://arxiv.org/pdf/1611.01929.pdf

By default with avg_dqn=1 It should just be regular DQN/DDQN. When avg_dqn> 1 it will average over previous models. So you gain some stability at the cost of some computation time.

In any case, let me know what you think, or what tests should be done. I have already ran the agent tests and the cart pole example, but I think more extensive testing may be needed.

Seemingly the testing branch is 6 months behind, so I guess everyone is just merging to master?

update

RaphaelMeudec · 2018-05-24T10:38:14Z

Hi! Thanks for your contribution, tests are failing since a few days ago, I'll dig into it and then look at your implementation!

rbauld · 2018-05-29T21:45:30Z

I took a look at some of the errors on the travis CI build. They don't really seem related to any of the changes I have made.

Seems other pull requests have had a similar issue.

In any-case, the changes were not too drastic, but I thought they were worthwhile making, since I have found some benefit when using average DQN in particularly noisy environments.

rbauld added 4 commits May 13, 2018 17:07

Merge pull request #1 from keras-rl/master

0479c2d

update

Implemented average DQN functionality

56e3233

Implemented average DQN functionality

5ddcc78

Implemented average DQN functionality

6acf3f6

RaphaelMeudec self-assigned this May 25, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Average DQN implementation #206

Average DQN implementation #206

rbauld commented May 19, 2018

RaphaelMeudec commented May 24, 2018

rbauld commented May 29, 2018

Average DQN implementation #206

Are you sure you want to change the base?

Average DQN implementation #206

Conversation

rbauld commented May 19, 2018

RaphaelMeudec commented May 24, 2018

rbauld commented May 29, 2018