Conversation
SoheylM
left a comment
Here are the points I identified:
- aes.py: log(s + eta) that can lead to NaN
- aes.py: the .to() override
- lvae_2d.py & plvae_2d.py: hard-coded [:25] that can fail on small datasets during visualization
- lvae_2d.py & plvae_2d.py: possible risk of OOM when encoding the full training set at once
- lvae_2d.py & plvae_2d.py: Encoder and Decoder classes are duplicated. I understand this is done because of the CleanRL mindset, so to be decided whether we consider lvae and plvae close enough to share these.
- lvae_2d.py & plvae_2d.py: th.cuda.empty_cache() is called even when running on CPU/MPS
- lvae_2d.py: small typo (double period) in the URL docstring
- plvae_2d.py: a comment says the tensors won't be used in the predictor, but they are actually passed through; the hack works because a zero-width concat is a no-op
- lvae_2d.py & plvae_2d.py: epoch_report is called with batch=None and pbar=None, but I think this is intended behavior
- Future work: add evaluate scripts for lvae_2d.py and plvae_2d.py
    Scalar volume loss.
    """
    s = z.std(0)
    return torch.exp(torch.log(s + self.eta).mean())
Can eta be zero? In that case any latent dimension with zero standard deviation (if possible) leads to log(0) -> -inf, causing NaN losses. I suggest safeguarding against this by either 1) catching this case and returning whatever value is appropriate or 2) giving eta a non-zero default like 1e-8.
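A minimal sketch of the second suggestion (the function name and the 1e-8 default are illustrative, not from the PR):

```python
import torch

def volume_loss(z: torch.Tensor, eta: float = 1e-8) -> torch.Tensor:
    """Geometric mean of per-dimension std; eta > 0 keeps log finite
    even when a latent dimension has collapsed (std == 0)."""
    s = z.std(0)
    return torch.exp(torch.log(s + eta).mean())

# Latent batch whose second dimension is constant, i.e. std == 0
z = torch.tensor([[1.0, 5.0], [2.0, 5.0], [3.0, 5.0]])
loss = volume_loss(z)  # finite thanks to the eta floor
```

With eta = 0 the same input would produce log(0) = -inf inside the mean, which is exactly the failure mode described above.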
    self._zstd: torch.Tensor | None = None
    self._zmean: torch.Tensor | None = None

    def to(self, device: torch.device | str) -> LeastVolumeAE_DynamicPruning:
If I am not mistaken (and can trust my powerful coding LLM), the register_buffer calls already ensure these tensors move with the model, and this override reassigns them as plain tensors (not buffers), breaking state-dict saving/loading.
Unless I missed why this needs to be specifically defined, I would suggest removing this method, since registered buffers move automatically with nn.Module.to().
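A minimal sketch of the behavior being relied on (the class here is a hypothetical stand-in, only the buffer names mirror the PR); a dtype cast stands in for a device move since a GPU may not be available:

```python
import torch
import torch.nn as nn

class LatentStats(nn.Module):
    """Hypothetical module: buffers registered via register_buffer
    are tracked by state_dict and follow nn.Module.to() automatically,
    so no custom to() override should be needed."""
    def __init__(self):
        super().__init__()
        self.register_buffer("_zstd", torch.ones(4))
        self.register_buffer("_zmean", torch.zeros(4))

m = LatentStats()
# Module.to() moves/casts buffers along with parameters
m.to(torch.float64)
```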
    # Generate interpolated designs
    x_ints = []
    for alpha in [0, 0.25, 0.5, 0.75, 1]:
        z_ = (1 - alpha) * z[:25] + alpha * th.roll(z, -1, 0)[:25]
Hard-coded sample count may fail (and this is not the only place where it occurs in the code).
If the training set has fewer than 25 samples, this silently produces fewer visualizations, which may be fine for visualization alone, but th.roll(z, -1, 0)[:25] will wrap around incorrectly for small datasets.
I would suggest using min(25, len(z)) or parametrizing the sample count.
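A sketch of the suggested fix, with the sample count clamped to the batch size and the roll applied within the selected samples only (the function name is illustrative, not from the PR):

```python
import torch as th

def interpolation_pairs(z: th.Tensor, n_samples: int = 25) -> list[th.Tensor]:
    """Interpolate each selected latent toward its neighbor,
    clamping the sample count to the batch size."""
    n = min(n_samples, len(z))       # suggested guard against small datasets
    z_a = z[:n]
    z_b = th.roll(z[:n], -1, 0)      # roll within the selection, so pairs stay consistent
    return [(1 - alpha) * z_a + alpha * z_b for alpha in [0, 0.25, 0.5, 0.75, 1]]

z = th.randn(7, 8)                   # fewer than 25 samples
z_ints = interpolation_pairs(z)
```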
    # Generate interpolated designs
    x_ints = []
    for alpha in [0, 0.25, 0.5, 0.75, 1]:
        z_ = (1 - alpha) * z[:25] + alpha * th.roll(z, -1, 0)[:25]
See the comment in lvae_2d.py about this.
    with th.no_grad():
        # Encode training designs
        xs = x_train.to(device)
        z = lvae.encode(xs)
Full-dataset encoding may lead to OOM:

    xs = x_train.to(device)
    z = lvae.encode(xs)

I think there is a risk of OOM if the entire dataset is encoded at once during visualization.
Would it be worth encoding in batches to be on the safe side?
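One possible shape for the batched version (the helper name and the toy stand-in for lvae.encode are illustrative, not from the PR):

```python
import torch as th

@th.no_grad()
def encode_in_batches(encode, x: th.Tensor, device, batch_size: int = 256) -> th.Tensor:
    """Encode in chunks, moving only one batch to the device at a time
    and collecting results on CPU to bound peak memory."""
    chunks = []
    for xb in th.split(x, batch_size):
        chunks.append(encode(xb.to(device)).cpu())
    return th.cat(chunks)

def toy_encode(x: th.Tensor) -> th.Tensor:
    # Stand-in for lvae.encode, purely for illustration
    return x.mean(dim=1, keepdim=True)

x_train = th.randn(1000, 16)
z = encode_in_batches(toy_encode, x_train, device="cpu", batch_size=256)
```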
    }
    wandb.log(val_log_dict, commit=True)

    th.cuda.empty_cache()
This is called every epoch regardless of whether CUDA is being used (as opposed to CPU/MPS). Harmless but wasteful.
What about doing this instead:
if th.cuda.is_available(): th.cuda.empty_cache()
    }
    wandb.log(val_log_dict, commit=True)

    th.cuda.empty_cache()
This is called every epoch regardless of whether CUDA is being used (as opposed to CPU/MPS). Harmless but wasteful.
What about doing this instead:
if th.cuda.is_available(): th.cuda.empty_cache()
    @@ -0,0 +1,455 @@
    """LVAE for 2D designs with plummet-based dynamic pruning. Adapted from https://github.com/IDEALLab/Least_Volume_ICLR2024..
Typo: double period at the end of the URL.
    val_vol /= n

    # Trigger pruning check at end of epoch
    lvae.epoch_report(epoch=epoch, callbacks=[], batch=None, loss=losses, pbar=None)
This works because the callbacks list is empty (otherwise pbar and batch would have to be non-None). Checking the rest of the code, I understand this is done on purpose.
    c_train_scaled = th.from_numpy(c_scaler.fit_transform(c_train.numpy())).to(c_train.dtype)
    c_val_scaled = th.from_numpy(c_scaler.transform(c_val.numpy())).to(c_val.dtype)
    else:
        # Dummy tensors when not using conditions (won't be used in predictor)
I would clarify the comment to something like:
"zero-width tensors: concatenating with pz is a no-op, so the predictor sees only the latent"
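A quick sketch of why the hack works: concatenating a zero-width tensor along dim=1 returns a tensor equal to the latent alone (the tensor names here are illustrative):

```python
import torch as th

z = th.randn(8, 4)        # latent batch
c_dummy = th.zeros(8, 0)  # dummy conditions with zero width

# Concatenating a zero-width tensor along dim=1 is a no-op,
# so the predictor input equals the latent exactly
zc = th.cat([z, c_dummy], dim=1)
```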
Hey Matt, just checking in, how is the progress on this PR? SOH-13
Basic lvae_2d and performance MLP plvae_2d implementations