Skip to content

Conversation

@troybvo
Copy link

@troybvo troybvo commented Aug 13, 2021

There is a bug in the way log_prob is calculated. Last step reduce_sum is supposed to be over the action space, not actions from the batch. Fixed with adding axis=1

There is a bug in the way log_prob is calculated. Last step reduce_sum is supposed to be over the action space, not actions from the batch. Fixed with adding axis=1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

1 participant