pi train human new by RyanPCo · Pull Request #473 · GaTech-RL2/EgoVerse

RyanPCo · 2026-05-24T17:43:39Z

No description provided.

RyanPCo · 2026-05-24T17:43:52Z

remove rotation bounds checks for norm stats #480 : 2 dependent PRs (#481 , #490 )
optional viz videos on train dataset #476
optional flag to generate validation videos for each task #475
pi train human new #473 👈 (View in Graphite)
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

github-actions · 2026-06-11T17:46:37Z

Claude Code Review

PR Review: pi train human new

Summary

Adds Mecka support to the Pi 0.5 training pipeline (keymap/transforms/configs), tightens optimizer/scheduler defaults, broadens MultiDataset retry to be cross-leaf, and adds split xyz/ypr eval metrics. Removes the mecka+scale cotrain config.

Key concerns

default_prompt: "" change in pi0.5_base.yaml — switching from "This is a bad action." to empty string is a meaningful training-behavior change. This silently affects every Pi 0.5 run, not just Mecka-human. Empty prompts may interact poorly with tokenizer / PaliGemma prompt conditioning (some tokenizers produce only BOS/EOS, some warn). Was this validated against existing Aria/Scale runs, or is this intended to become the new default for all experiments?
sample_frac: 1.0 → 0.2 in train_zarr_cartesian.yaml — this is a global default for the Cartesian training config. A 5× reduction in normalization sample fraction will change norm stats for every run using this config, not just this PR's experiments. If you have cached precomputed_norm_path references from prior runs, those become inconsistent with fresh recomputes. Consider overriding per-run instead of changing the base.
dataset_dir path change in paths/default.yaml — switching the default from /coc/flash7/scratch/... (Skynet) to /storage/project/r-dxu345-0/... (PACE) will break anyone training on the old cluster who relies on the default. Consider a cluster-specific override file rather than mutating the shared default.
Filter semantics change in mecka_pi.yaml — the new DatasetFilter with filter_lambdas uses task == 'fold_clothes', but the old (and deleted cotrain) config used task == 'folding_clothes' for mecka. Confirm the actual task string in the SQL DB — one of these is wrong, and if it's the new one, training silently gets zero episodes (or vice versa). The deleted cotrain file used fold_clothes, so this is probably the correct one, but worth double-checking against the DB.
Mecka keymap front_key branching — for cartesian_pi, the front cam is renamed base_0_rgb; for cartesian it stays VIZ_IMAGE_KEY. The viz configs (pi_cartesian.yaml) hardcode image_key: base_0_rgb for mecka, which is consistent with cartesian_pi mode. But get_transform_list(mode="cartesian") calls _build_aria_cartesian_bimanual_transform_list — does that transform expect base_0_rgb or the Mecka VIZ_IMAGE_KEY? If the transform list expects an Aria-style image key but the keymap emits base_0_rgb, you'll get a silent key-miss in the transform. Please verify.
MultiDataset global retry — broadening to range(len(self.index_map)) is the right call to escape a bad leaf, but it changes sampling distribution: a single bad leaf can now redirect its samples to other leaves, biasing the effective mixture ratio away from the configured train_dataloader_params. For small-scale debugging this is fine; for production cotraining this could matter. A warning log of cumulative retry counts per leaf during training would help catch silent rebalancing.
PI_DEBUG_PROMPTS debug logging — fine as gated debug, but self._debug_prompt_count is per-rank under DDP, so under 8 GPUs you'll see up to 24 log lines, not 3. Minor.

Suggestions

Move default_prompt: "" and sample_frac: 0.2 into a Mecka-specific override (e.g., model/pi0.5_mecka.yaml, train_zarr_cartesian_mecka.yaml) instead of mutating shared base configs.
Don't change paths/default.yaml; use a cluster-specific paths override.
Verify the fold_clothes vs folding_clothes task string against the active SQL DB and add a test/assertion that the resolver returns >0 episodes.
Verify get_transform_list(mode="cartesian") for Mecka uses the same image key (base_0_rgb) that the keymap emits in cartesian_pi mode — and ideally pass mode consistently.
The _check_bounds short-circuit assumption in test_multi_retry.py is fragile to refactors; consider adding an embodiment attribute to _DummyLeaf and a real bounds path test, or comment-mark the test as intentionally bypassing bounds.
Rename egomimic/rldb/zarr/test_multi_retry.py → tests/... or test_zarr_dataset_multi.py to match repo test conventions (and avoid pytest picking up a test file inside the importable package).
The _split_mse helper in eval_pi.py is hardcoded to the 12-D bimanual layout — add a comment or assertion mapping it to HumanBimanualCartesianEuler.from32, and consider moving it to a util so unimanual / other layouts can later opt in.

Verdict: Request Changes

Main blockers are (a) mutating shared base configs (default_prompt, sample_frac, dataset_dir) that affect non-Mecka experiments, and (b) confirming the task-string filter actually matches DB rows. Once those are scoped to Mecka-specific overrides or confirmed safe, the rest is good — the retry fix and Mecka keymap extension are nice additions.

Reviewed by Claude · Review workflow

RyanPCo force-pushed the ryanco/pi-train-human-new branch from e6d6607 to c4a5205 Compare May 26, 2026 17:07

This was referenced May 27, 2026

optional flag to generate validation videos for each task #475

Open

optional viz videos on train dataset #476

Open

add custom configs to specify tasks for viz #481

Closed

remove rotation bounds checks for norm stats #480

Open

RyanPCo mentioned this pull request Jun 5, 2026

6D rotation normalization to prevent large changes from euler #490

Open

RyanPCo force-pushed the ryanco/pi-train-human-new branch 2 times, most recently from fd0e414 to 4580ba3 Compare June 7, 2026 19:43

RyanPCo mentioned this pull request Jun 8, 2026

exp short state history memory #492

Draft

RyanPCo force-pushed the ryanco/pi-train-human-new branch from 4580ba3 to 9fcdebd Compare June 8, 2026 20:45

RyanPCo marked this pull request as ready for review June 8, 2026 20:46

RyanPCo mentioned this pull request Jun 8, 2026

local: personal env overrides — Do not merge #493

Draft

pi train human new

c66446a

RyanPCo force-pushed the ryanco/pi-train-human-new branch from 9fcdebd to c66446a Compare June 11, 2026 17:45

This was referenced Jun 11, 2026

data-quality: episode review tool + mecka folding_clothes labels #494

Draft

exp task segment labels #498

Draft

better val metrics #499

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pi train human new#473

pi train human new#473
RyanPCo wants to merge 1 commit into
mainfrom
ryanco/pi-train-human-new

RyanPCo commented May 24, 2026

Uh oh!

RyanPCo commented May 24, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

RyanPCo commented May 24, 2026

Uh oh!

RyanPCo commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 11, 2026

Claude Code Review

PR Review: pi train human new

Summary

Key concerns

Suggestions

Verdict: Request Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

RyanPCo commented May 24, 2026 •

edited

Loading