Closed
Labels: question (Further information is requested)
Description
Greetings! I'm a PyTorch RL fan, but I previously used baselines and stable-baselines for research. I noticed stable-baselines3 through the original stable-baselines issue.
Recently, many PyTorch RL frameworks have emerged, including rlpyt and tianshou. I went through their code and compared them with stable-baselines3.
| Features | Stable-Baselines3 | rlpyt | tianshou |
|---|---|---|---|
| State of the art RL methods | ✔️ | ✔️ | ✔️ |
| Documentation | ✔️ | ✔️ | ✔️ |
| Custom environments | ✔️ | Just so-so | ✔️ |
| Custom policies | ✔️ | ✔️ | ✔️ |
| Common interface | ✔️ | ✔️ | ✔️ |
| Ipython / Notebook friendly | ✔️ | ✔️ | ✔️ |
| PEP8 code style | ✔️ | ✔️ | ✔️ |
| Custom callback | ✔️ | ❌ | ❌ |
| High code coverage | ✔️ | ❌ | ✔️ |
| Type hints | ✔️ | ❌ | ✔️ |
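To make the "Custom callback" row above concrete (it is the one feature in this table that only SB3 checks off), here is a minimal sketch of an SB3 callback. It assumes the `BaseCallback` interface from `stable_baselines3.common.callbacks` and that the environment is wrapped in a `Monitor` (SB3 does this automatically when you pass an env ID string); the callback name and what it logs are just for illustration.

```python
from stable_baselines3 import PPO
from stable_baselines3.common.callbacks import BaseCallback


class RewardLoggerCallback(BaseCallback):
    """Toy callback that prints the episode return whenever an episode ends."""

    def _on_step(self) -> bool:
        # `infos` is filled in by the vectorized environment at every step;
        # the Monitor wrapper adds an "episode" entry when an episode finishes.
        for info in self.locals.get("infos", []):
            episode = info.get("episode")
            if episode is not None:
                print(f"episode return: {episode['r']:.2f}")
        # Returning False would abort training early.
        return True


model = PPO("MlpPolicy", "CartPole-v1", verbose=0)
model.learn(total_timesteps=10_000, callback=RewardLoggerCallback())
```

The callback hooks into the training loop without touching the algorithm's code, which is what makes this feature hard to replicate with the other two libraries' fixed training loops.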
As for the planned features of stable-baselines3:
| Features | Stable-Baselines3 | rlpyt | tianshou |
|---|---|---|---|
| Tensorboard support | ✔️ | ✔️ | ✔️ |
| DQN extensions | ➖ QR-DQN in SB3 contrib | ✔️ | ✔️ |
| Support for Dict observation spaces | ✔️ | ✔️ | ✔️ |
| Recurrent Policies | ✔️ in contrib | ✔️ | ✔️ |
| TRPO | ✔️ in contrib | ❌ | ✔️ |
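For the "Support for Dict observation spaces" row, a minimal sketch of what this looks like on the SB3 side, assuming SB3 1.x with the old `gym` reset/step API; the toy environment `ToyDictEnv` and its spaces are made up for illustration, and `"MultiInputPolicy"` is the policy alias SB3 uses for Dict observation spaces.

```python
import numpy as np
import gym
from gym import spaces
from stable_baselines3 import PPO


class ToyDictEnv(gym.Env):
    """Toy environment with a Dict observation space (for illustration only)."""

    def __init__(self):
        super().__init__()
        self.observation_space = spaces.Dict(
            {
                "position": spaces.Box(low=-1.0, high=1.0, shape=(2,), dtype=np.float32),
                "goal": spaces.Box(low=-1.0, high=1.0, shape=(2,), dtype=np.float32),
            }
        )
        self.action_space = spaces.Discrete(4)

    def reset(self):
        return self.observation_space.sample()

    def step(self, action):
        # One-step episodes with a dummy reward, just to have something runnable.
        obs = self.observation_space.sample()
        return obs, 0.0, True, {}


# "MultiInputPolicy" builds a feature extractor per Dict key and concatenates them.
model = PPO("MultiInputPolicy", ToyDictEnv(), verbose=0)
model.learn(total_timesteps=2_048)
```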
Also, regarding the most important feature, modularization: from my perspective, tianshou is the best of the three and rlpyt comes second. I hate OpenAI Baselines at this point, but stable-baselines is much better than OpenAI's code.
These are just some of my concerns.