Fix `sb3_contrib/ars/policies.py` type hint #122

qgallouedec · 2022-11-29T08:51:32Z

Description

Requires DLR-RM/stable-baselines3#1188 to be merged

Context

I have raised an issue to propose this change (required)

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist:

Note: we are using a maximum length of 127 characters per line

araffin · 2022-11-29T09:19:14Z

> Requires DLR-RM/stable-baselines3#1188 to be merged

and to be released too

sb3_contrib/ars/policies.py

araffin · 2022-11-29T11:58:08Z

One thing that needs to be checked before merging: does it break pre-trained models? (We might have to do some manual renaming if it does.)

qgallouedec · 2022-11-30T10:58:42Z

All rl-baselines3-zoo/tests/test_trained_agents passed, except CartPole-v1. I'm investigating.

qgallouedec · 2022-11-30T11:05:46Z

Probably because ARSLinearPolicy.action_net is turned from

Linear(in_features=4, out_features=2, bias=False)

to

Sequential(
  (0): Linear(in_features=4, out_features=2, bias=False)
)
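To illustrate why the extra wrapper can matter for saved checkpoints: PyTorch names parameters by module path, so a bare `Linear` stored as `action_net` exposes `action_net.weight`, while the same layer at index 0 of a `Sequential` exposes `action_net.0.weight`. The sketch below is plain Python with a hypothetical helper name (`rename_legacy_action_net_keys`) and placeholder weight values; it shows one way such keys could be remapped and is not necessarily what this PR ends up doing.

```python
from typing import Any, Dict


def rename_legacy_action_net_keys(policy_params: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy where ``action_net.<name>`` becomes ``action_net.0.<name>``.

    Hypothetical helper for illustration only; keys that already use the
    Sequential-style prefix are left untouched.
    """
    renamed = dict(policy_params)
    for name in ("weight", "bias"):
        old_key = f"action_net.{name}"
        if old_key in renamed:
            renamed[f"action_net.0.{name}"] = renamed.pop(old_key)
    return renamed


# Placeholder values standing in for tensors (a 2x4 weight matrix, no bias).
legacy = {"action_net.weight": [[0.0] * 4, [0.0] * 4]}
updated = rename_legacy_action_net_keys(legacy)
print(sorted(updated))
```

Because the helper only touches the old-style keys, applying it to an already converted dictionary returns the same keys unchanged.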

qgallouedec · 2022-11-30T13:35:57Z

from sb3_contrib import ARS
from torch import nn

FOLDER = "rl-trained-agents"
env_id = "CartPole-v1"

# Re-save both checkpoints with action_net wrapped in nn.Sequential,
# so that they match the new ARSLinearPolicy structure.
for model_name in ["best_model", env_id]:
    path = f"{FOLDER}/ars/{env_id}_1/{model_name}.zip"
    model = ARS.load(path)
    model.policy.action_net = nn.Sequential(model.policy.action_net)
    model.save(path)

It seems to have worked. Does that look right to you?

qgallouedec · 2022-11-30T13:49:11Z

For HF, I think this should also work, but I'm not familiar enough with HF Hub to be sure:

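from huggingface_sb3 import load_from_hub, push_to_hub
from torch import nn
from sb3_contrib import ARS

# Download the published checkpoint; load_from_hub returns a local file path.
checkpoint = load_from_hub(
    repo_id="sb3/ars-CartPole-v1",
    filename="ars-CartPole-v1.zip",
)
model = ARS.load(checkpoint)
model.policy.action_net = nn.Sequential(model.policy.action_net)

# Write the patched model to disk before pushing
# (assuming push_to_hub uploads the local file named by `filename`).
model.save("ars-CartPole-v1.zip")

push_to_hub(
    repo_id="sb3/ars-CartPole-v1",
    filename="ars-CartPole-v1.zip",
    commit_message="Update action_net structure",
)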

araffin · 2022-12-06T17:19:53Z

> All rl-baselines3-zoo/tests/test_trained_agents passed, except CartPole-v1. I'm investigating.

I fixed that in my latest commit ;) (the name of the commit is wrong though ^^ read "load" instead of "save")

araffin

LGTM, please check what I did before merging ;)

qgallouedec · 2022-12-12T10:52:00Z

Looks good, I just turned the docstring of ARS.set_parameters into a comment, so that the true docstring is inherited from BaseAlgorithm.
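A small sketch of why this has the described effect: an override without its own docstring has `__doc__` set to `None`, and tools that resolve documentation through `inspect.getdoc` then fall back to the parent class. The class names `Base` and `Child` below are placeholders, not the actual SB3 classes.

```python
import inspect


class Base:
    def set_parameters(self, params):
        """Load parameters from a dictionary."""
        self.params = params


class Child(Base):
    def set_parameters(self, params):
        # Patched override: a comment instead of a docstring,
        # so the parent's documentation is the one that gets picked up.
        super().set_parameters(dict(params))


method = Child().set_parameters
print(method.__doc__)          # the override itself carries no docstring
print(inspect.getdoc(method))  # resolved by walking the class hierarchy
```

Had the override kept its own docstring, `inspect.getdoc` would return that text instead of the parent's.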

qgallouedec added 9 commits November 28, 2022 23:18

Update contribution.md

5be36c4

New loop struct to make mypy happy

5f000da

Update setup.cfg

2d9d780

Update changelog

a46e5cd

fix squash_output = False in ARS policy

18110b5

Add with_bias parameter to ARSPolicy

fc3e981

Make ARSLinearPolicy a special case of ARSPolicy

3531127

Remove ars_policy from mypy exclude

bf99546

Update changelog

12e22a4

Base automatically changed from fix-common-utils-hint to master November 29, 2022 09:24

qgallouedec and others added 3 commits November 29, 2022 10:27

Merge branch 'master' into fix-ars-policies-hint

32322da

Merge branch 'master' into fix-ars-policies-hint

9dbae0c

Update SB3 version

40859cf

araffin reviewed Nov 29, 2022

sb3_contrib/ars/policies.py (resolved)

araffin and others added 2 commits December 6, 2022 17:56

Merge branch 'master' into fix-ars-policies-hint

a2709e7

Fix to save ARS linear policy saved with sb3-contrib < 1.7.0

b9594ec

Fix test

5145e04

araffin approved these changes Dec 9, 2022

araffin and others added 2 commits December 9, 2022 13:29

Merge branch 'master' into fix-ars-policies-hint

d5ef12a

Turn docstring into comment

2bd9235

qgallouedec merged commit 6b23c6c into master Dec 12, 2022

qgallouedec deleted the fix-ars-policies-hint branch December 12, 2022 12:22
