Description:
I am observing a discrepancy in the output representations (token embeddings) when running inference on the exact same sequence under two different execution contexts on an NVIDIA A100.
The Problem:
I am comparing the representations of a specific sequence $S$ in two scenarios:
- Scenario A (Loop): I run a loop to infer $N$ sequences sequentially, then extract the representation for sequence $S$.
- Scenario B (Single Run): I run the inference script for only sequence $S$.
In both scenarios, I strictly ensure that the batch size is 1. Despite the identical input and model weights, the resulting tensors are not identical.
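To quantify the mismatch, the two saved tensors can be compared with `torch.equal`, a max-absolute-difference check, and `torch.allclose`. A minimal sketch (the tensors here are synthetic stand-ins for the two saved embeddings; in practice they would be loaded with `torch.load` from the saved `.pt` files):

```python
import torch

torch.manual_seed(0)

# Synthetic stand-ins for the embeddings from the two runs; the small
# perturbation mimics floating-point-level divergence between kernels.
emb_loop = torch.randn(128, 768)                      # Scenario A output
emb_single = emb_loop + 1e-6 * torch.randn(128, 768)  # Scenario B output

max_abs_diff = (emb_loop - emb_single).abs().max().item()
print("bitwise identical:", torch.equal(emb_loop, emb_single))
print("max |diff|:", max_abs_diff)
print("allclose (atol=1e-5):", torch.allclose(emb_loop, emb_single, atol=1e-5))
```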
Comparison Results:
Minimal Code:
```python
import os

import torch

# E1ForMaskedLM, E1Predictor, E1_CONFIG, and _check_files_exist come from
# the E1 codebase.


def load_e1_model(model_name):
    try:
        model = E1ForMaskedLM.from_pretrained(E1_CONFIG[model_name])
        model.eval()
    except Exception as e:
        raise ValueError(f"Model {model_name} not found") from e
    if torch.cuda.is_available():
        model = model.cuda()
    return model


def compute_E1_embeddings(
    model,
    labels,
    sequences,
    save_dir,
    max_batch_tokens=16384,
):
    predictor = E1Predictor(
        model=model,
        max_batch_tokens=max_batch_tokens,
        fields_to_save=["token_embeddings"],
        use_cache=False,
    )
    embeddings = {}
    for prediction in predictor.predict(
        sequences=sequences, sequence_ids=labels, context_seqs=None
    ):
        label = prediction["id"]
        # Skip sequences whose embeddings were already saved.
        if _check_files_exist(save_dir, [label]):
            continue
        token_embeddings = prediction["token_embeddings"]  # (sequence length, embedding dim)
        embeddings[label] = token_embeddings.cpu().clone()
        save_path = os.path.join(save_dir, label + ".pt")
        torch.save(embeddings[label], save_path)
    return embeddings
```
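For context, a common source of run-to-run differences on A100 is TF32 matmuls and nondeterministic CUDA kernels. The flags below (standard PyTorch settings, not specific to E1) are a diagnostic sketch to test whether they account for the discrepancy, not a confirmed fix:

```python
import os

# Required for deterministic cuBLAS; must be set before cuBLAS is first used.
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"

import torch

# Disable TF32 (A100 uses TF32 for float32 matmuls by default).
torch.backends.cuda.matmul.allow_tf32 = False
torch.backends.cudnn.allow_tf32 = False

# Force deterministic kernels where available; warn instead of erroring
# on ops that have no deterministic implementation.
torch.backends.cudnn.deterministic = True
torch.backends.cudnn.benchmark = False
torch.use_deterministic_algorithms(True, warn_only=True)
```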