@jackokaiser

Hi Ilya,

Thanks for your open-source implementation of DDPG/NAF in PyTorch.

We spotted a typo in NAF: the discount factor (and the done mask) should multiply next_state_values instead of being added to it.
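
To illustrate the difference, here is a minimal plain-Python sketch of the one-step target. It is not the repository's code: the function names `td_target` and `td_target_buggy`, the scalar arguments, and the convention that `mask` is 0.0 when the episode has ended and 1.0 otherwise are all assumptions made for this example.

```python
def td_target(reward, next_state_value, gamma, mask):
    """One-step target with the discount and done mask MULTIPLYING the next value.

    Assumed convention: mask is 0.0 if the transition ended the episode, else 1.0.
    """
    return reward + gamma * mask * next_state_value


def td_target_buggy(reward, next_state_value, gamma, mask):
    """The typo variant: gamma * mask is ADDED to the next value instead."""
    return reward + (gamma * mask + next_state_value)


if __name__ == "__main__":
    # Terminal transition: the next state's value should not contribute at all.
    print(td_target(1.0, 5.0, 0.99, 0.0))        # correct form
    print(td_target_buggy(1.0, 5.0, 0.99, 0.0))  # typo form still includes the next value
```

With the multiplicative form, a terminal transition (mask of 0.0) yields a target equal to the reward alone, whereas the additive form keeps the full, undiscounted next-state value in the target.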

Cheers,
Jacques
