Pi rollout: load tokenizer-collate flags from .hydra/config.yaml#401
Pi rollout: load tokenizer-collate flags from .hydra/config.yaml#401rl2aloha wants to merge 1 commit into
Conversation
|
Warning This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
This stack of pull requests is managed by Graphite. Learn more about stacking. |
25554ba to
1ce450b
Compare
The trained pi prompt format (`Task: ...; Action: ...`) is governed by
`data.{proprio,embodiment_label,control_mode,state_num_bins}` in the
hydra config. The `.ckpt` only stores the `model:` subtree (see
`trainHydra._build_model_config_tree`), so read the flags from the
run-output snapshot at `<run_dir>/.hydra/config.yaml` instead.
- rollout.py: `build_tokenized_collate` now wired from the run snapshot
rather than hardcoded args. `data["embodiment"]` is set to the
integer id; previously it was the string `"eva_bimanual"`, which
`_embodiment_name`'s `int(eid)` cast would have crashed on.
- hydra_utils.py: new `find_run_snapshot_path` / `load_run_snapshot`
helpers for reading hydra run-output `config.yaml` without
re-resolving runtime interpolations like
`${data.dataset.data_schematic}`.
Debug prints kept (TODO-marked) to ease verification on the rollout
host.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1b82282 to
c044ed8
Compare
1ce450b to
90bf6f8
Compare
Claude Code ReviewPR Review: Pi rollout tokenizer-collate flags from snapshotSummaryReads the pi prompt-formatting flags ( Key concerns
Suggestions
Verdict: Request ChangesMainly for concern #2 (silent defaults are dangerous for prompt format — a mismatched tokenization is exactly the kind of bug that wastes a day of rollout debugging) and #3 ( Reviewed by Claude · Review workflow |
Claude Code ReviewReviewSummaryFixes pi rollout by loading tokenizer-collate flags from the hydra run snapshot (since Key concerns
Suggestions
VerdictRequest Changes — mainly to convert the silent-fallback behavior (concerns #2, #3) into a hard failure, and to demote the per-step debug print. The core fix (snapshot loading + int embodiment id) is correct and addresses a real crash. Reviewed by Claude · Review workflow |

The trained pi prompt format (
Task: ...; Action: ...) is governed bydata.{proprio,embodiment_label,control_mode,state_num_bins}in thehydra config. The
.ckptonly stores themodel:subtree (seetrainHydra._build_model_config_tree), so read the flags from therun-output snapshot at
<run_dir>/.hydra/config.yamlinstead.build_tokenized_collatenow wired from the run snapshotrather than hardcoded args.
data["embodiment"]is set to theinteger id; previously it was the string
"eva_bimanual", which_embodiment_name'sint(eid)cast would have crashed on.find_run_snapshot_path/load_run_snapshothelpers for reading hydra run-output
config.yamlwithoutre-resolving runtime interpolations like
${data.dataset.data_schematic}.Debug prints kept (TODO-marked) to ease verification on the rollout
host.
Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com