save success status before reset by yhnsu · Pull Request #119 · DexForce/EmbodiChain

yhnsu · 2026-02-04T03:47:05Z

Description

buf fix: save success status before reseting objects status

Copilot

Pull request overview

This pull request fixes a bug where the task success status was being evaluated after reset_objects_state() was called, resulting in incorrect success detection. The fix saves the success status before resetting objects and uses the saved value during episode initialization.

Changes:

Added is_task_success() method to BaseEnv class for checking task success across environments
Modified reset() to save task success status before calling reset_objects_state()
Updated _initialize_episode() in EmbodiedEnv to use the pre-saved success status instead of calling is_task_success() after reset

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
embodichain/lab/gym/envs/base_env.py	Adds `is_task_success()` method and saves task success status before resetting objects in `reset()` method
embodichain/lab/gym/envs/embodied_env.py	Updates `_initialize_episode()` to use saved `_task_success` attribute instead of calling `is_task_success()` after objects are reset

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

embodichain/lab/gym/envs/base_env.py

embodichain/lab/gym/envs/embodied_env.py

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-05T03:46:57Z

embodichain/lab/gym/envs/embodied_env.py

+                if self._task_success is None:
+                    logger.log_warning("task_success is not defined, nothing to save.")
+                    self._task_success = torch.zeros(
+                        self.num_envs, dtype=torch.bool, device=self.device
+                    )


This None check and fallback initialization is unreachable. Since the base class (BaseEnv) now initializes _task_success as a torch.Tensor in line 136-138 of base_env.py, _task_success will never be None. This dead code should be removed, keeping only the use of self._task_success[env_id].item() on line 395.

Suggested change

if self._task_success is None:

logger.log_warning("task_success is not defined, nothing to save.")

self._task_success = torch.zeros(

self.num_envs, dtype=torch.bool, device=self.device

)

embodichain/lab/scripts/preview_env.py

pyproject.toml

Copilot

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

embodichain/lab/gym/envs/base_env.py

pyproject.toml

embodichain/lab/gym/utils/gym_utils.py

Copilot

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

embodichain/lab/gym/utils/gym_utils.py

embodichain/lab/gym/envs/base_env.py

Copilot · 2026-02-05T09:34:31Z

embodichain/agents/rl/train.py

+    gym_env_cfg = config_to_cfg(
+        gym_config_data, manager_modules=DEFAULT_MANAGER_MODULES
+    )



The num_envs override logic has been removed but not replaced. Previously, if num_envs was specified in the trainer config, it would override the value from the gym config. Now this override is lost, which means the trainer config's num_envs setting is ignored.

The removed code was:

if num_envs is not None: gym_env_cfg.num_envs = num_envs

This should be re-added after the config_to_cfg call to maintain the ability to override num_envs from the training configuration.

Suggested change

# Allow trainer config to override number of environments

num_envs = trainer_cfg.get("num_envs")

if num_envs is not None:

gym_env_cfg.num_envs = num_envs

Copilot

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-05T15:30:07Z

embodichain/lab/gym/envs/base_env.py

+    def is_task_success(self, **kwargs) -> torch.Tensor:
+        """Check if the task is successful for each environment.
+
+        This method should be overridden in subclasses to implement task-specific success criteria.
+
+        Args:
+            **kwargs: Additional keyword arguments.
+
+        Returns:
+            A boolean tensor indicating success for each environment.
+        """
+
+        raise NotImplementedError(
+            "BaseEnv.is_task_success must be implemented in subclasses to define "
+            "task-specific success criteria."
+        )


The is_task_success method added to BaseEnv raises NotImplementedError. This is a breaking change because BaseEnv is now calling this method in the reset() method (line 559). Any environment that inherits from BaseEnv (instead of EmbodiedEnv which provides a default implementation) will now fail when reset() is called. Consider making this method return a default value (like torch.zeros or torch.ones) instead of raising NotImplementedError, or document that this is a required override for BaseEnv subclasses.

save success status before reset

79a2c48

yhnsu requested review from Copilot and yangchen73 February 4, 2026 03:47

Copilot started reviewing on behalf of yhnsu February 4, 2026 03:47 View session

yhnsu requested a review from yuecideng February 4, 2026 03:49

Copilot AI reviewed Feb 4, 2026

View reviewed changes

embodichain/lab/gym/envs/base_env.py Show resolved Hide resolved

embodichain/lab/gym/envs/embodied_env.py Outdated Show resolved Hide resolved

embodichain/lab/gym/envs/embodied_env.py Outdated Show resolved Hide resolved

wip

923032b

yuecideng requested changes Feb 4, 2026

View reviewed changes

embodichain/lab/gym/envs/embodied_env.py Outdated Show resolved Hide resolved

wip

1d34500

Copilot AI review requested due to automatic review settings February 5, 2026 03:42

Copilot started reviewing on behalf of yhnsu February 5, 2026 03:42 View session

Merge branch 'main' into yhn/success_before_reset

3247055

Copilot AI reviewed Feb 5, 2026

View reviewed changes

yuecideng approved these changes Feb 5, 2026

View reviewed changes

yuanhaonan added 2 commits February 5, 2026 15:39

install qpsolvers[osqp]

f2b591a

config_to_cfg modules parsing

da27060

Copilot AI review requested due to automatic review settings February 5, 2026 08:46

Copilot started reviewing on behalf of yhnsu February 5, 2026 08:47 View session

yuecideng requested changes Feb 5, 2026

View reviewed changes

embodichain/lab/scripts/preview_env.py Outdated Show resolved Hide resolved

yuecideng reviewed Feb 5, 2026

View reviewed changes

pyproject.toml Outdated Show resolved Hide resolved

Copilot AI reviewed Feb 5, 2026

View reviewed changes

embodichain/lab/gym/envs/base_env.py Outdated Show resolved Hide resolved

embodichain/lab/gym/envs/base_env.py Outdated Show resolved Hide resolved

pyproject.toml Outdated Show resolved Hide resolved

embodichain/lab/gym/utils/gym_utils.py Show resolved Hide resolved

yuanhaonan added 2 commits February 5, 2026 17:25

wip

15578f1

wip

5ac4aa6

Copilot AI review requested due to automatic review settings February 5, 2026 09:27

Copilot started reviewing on behalf of yhnsu February 5, 2026 09:27 View session

yuecideng approved these changes Feb 5, 2026

View reviewed changes

Copilot AI reviewed Feb 5, 2026

View reviewed changes

yuecideng added 2 commits February 5, 2026 15:13

Merge branch 'main' into yhn/success_before_reset

477badc

Merge branch 'main' into yhn/success_before_reset

4027429

Copilot AI review requested due to automatic review settings February 5, 2026 15:24

Copilot started reviewing on behalf of yuecideng February 5, 2026 15:24 View session

Copilot AI reviewed Feb 5, 2026

View reviewed changes

+    # Allow trainer config to override number of environments
+    num_envs = trainer_cfg.get("num_envs")
+    if num_envs is not None:
+        gym_env_cfg.num_envs = num_envs

Conversation

yhnsu commented Feb 4, 2026

Description

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants