[codex] Add action-labeled bilinear ablation tooling #8

Merged: fei-yang-wu merged 2 commits into main from feature/action-labeling on May 23, 2026

@fei-yang-wu commented May 22, 2026

Summary

  • Add action-labeled Dance102 bilinear ablation tooling, including the labeled manifest, cluster sweep wrappers, policy-label export, offline-pretrain evaluation, and deterministic rollout recording helpers.
  • Add latent goal observation/task support plus VQ-VAE and goal-AE bilinear config variants for the G1 latent tasks.
  • Sync submodule pointers to pushed commits: ImitationLearningTools 2aa832b for NPZ action-label loading and RLOpt 29f749d for bilinear latent pretraining/BC controls and latent learner checkpointing.

Submodule PRs

Validation

  • git diff --check in the parent repo
  • git diff --check in ImitationLearningTools and RLOpt
  • bash -n on updated cluster/experiment shell scripts
  • python3 -m py_compile on the new RLOpt helper scripts
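
The checks above are all syntax or whitespace checks that never execute the code under test. A minimal Python sketch of the same three checks follows; the function names are hypothetical and are not part of this PR, and the paths passed in would be whatever scripts the change touched.

```python
import py_compile
import subprocess
from pathlib import Path


def python_syntax_ok(path: Path) -> bool:
    """Mirror `python3 -m py_compile <path>`: True if the file byte-compiles."""
    try:
        # doraise=True raises PyCompileError on a syntax error instead of
        # printing the error to stderr.
        py_compile.compile(str(path), doraise=True)
        return True
    except py_compile.PyCompileError:
        return False


def shell_syntax_ok(path: Path) -> bool:
    """Mirror `bash -n <path>`: parse the script without running any command."""
    result = subprocess.run(["bash", "-n", str(path)], capture_output=True)
    return result.returncode == 0


def whitespace_ok(repo: Path) -> bool:
    """Mirror `git diff --check` inside `repo` (parent repo or a submodule).

    git exits non-zero when the diff introduces whitespace errors or
    conflict markers.
    """
    result = subprocess.run(
        ["git", "diff", "--check"], cwd=repo, capture_output=True
    )
    return result.returncode == 0
```

None of these checks exercise runtime behavior, so they complement rather than replace an actual training or evaluation run.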

Notes

Generated NPZ label/rollout/debug outputs are intentionally not tracked. The action-label manifest remains as a small checked-in recipe/path contract, and the docs now describe the label NPZ as a local/generated artifact.

@fei-yang-wu force-pushed the feature/action-labeling branch from fb21a1b to 2bdd70d on May 22, 2026 14:26
@fei-yang-wu marked this pull request as ready for review on May 23, 2026 18:59
@fei-yang-wu merged commit 4da2f0c into main on May 23, 2026
2 checks passed
@fei-yang-wu deleted the feature/action-labeling branch on May 23, 2026 18:59
