source policy training


Your work is exceptionally well done, truly fantastic! I would like to inquire if the source policy can be trained directly using `train.py`? During the training process, I encountered an issue where the reward function remains at 0, and I noticed that the task registration name in `train.py` does not match the registration name of the source policy.