Great work, thank you for sharing it! Can the source policy be trained directly with train.py? When I tried, the reward stayed at 0 throughout training, and I noticed that the task registration name in train.py does not match the registration name of the source policy.