Update minimax h100 & h200 vLLM image to v0.18.0 #958
Conversation
Update vllm/vllm-openai image tag from v0.16.0 to v0.18.0 for the minimaxm2.5-fp8-h100-vllm and minimaxm2.5-fp8-h200-vllm configs. Closes #955
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes are similar to the official vLLM recipes and/or the SGLang cookbook. If they are not, please create a PR there first before we can merge your PR into the master branch. Let's ensure that the documentation is first class so that the entire ML community can benefit from your hard work! Thank you
@claude remove --disable-log-requests. vllm: error: unrecognized arguments: --disable-log-requests https://github.com/SemiAnalysisAI/InferenceX/actions/runs/23635447185/job/68843638787?pr=958
Claude finished @functionstackx's task in 2m 0s. Removed.
The --disable-log-requests flag is no longer recognized in vLLM v0.18.0. Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
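Because a flag removal like this breaks any launch script that still passes it, one defensive option is to filter known-removed flags out of the argument list before invoking the server. The helper below is a hypothetical sketch, not part of this repo; the set of removed flags is an assumption for illustration, with only --disable-log-requests taken from the CI error above.

```python
# Hypothetical helper (not part of this repo): filter out CLI flags that a
# newer vLLM release no longer accepts before building the launch command.
REMOVED_FLAGS = {"--disable-log-requests"}

def strip_removed_flags(args: list[str]) -> list[str]:
    """Drop any argument whose flag name (before an optional '=') was removed."""
    return [a for a in args if a.split("=", 1)[0] not in REMOVED_FLAGS]

args = ["--model", "some/model", "--disable-log-requests", "--port", "8000"]
print(strip_removed_flags(args))  # -> ['--model', 'some/model', '--port', '8000']
```

Note this only handles standalone boolean flags; a removed flag that takes a separate value argument would need its value dropped as well.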
LGTM — straightforward version bump from v0.16.0 to v0.18.0; the inline comment flags the placeholder in the changelog entry.
Overview
Two-file change: two single-line image tag bumps in .github/configs/nvidia-master.yaml (h100 and h200 minimax configs: v0.16.0 → v0.18.0) and a new changelog entry in perf-changelog.yaml with a placeholder PR link.
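For concreteness, the bump is a one-token change per config. The sketch below assumes the field layout of nvidia-master.yaml (only the config names and image tags come from the diff):

```yaml
# Sketch only: surrounding structure assumed, not copied from the repo.
minimaxm2.5-fp8-h100-vllm:
  image: vllm/vllm-openai:v0.18.0   # previously v0.16.0
minimaxm2.5-fp8-h200-vllm:
  image: vllm/vllm-openai:v0.18.0   # previously v0.16.0
```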
Security risks
None. This is a container image version update with no auth, crypto, or permission changes.
Level of scrutiny
Low. Version bumps following the established pattern in this repo (matching recent similar PRs like #950 for kimik2.5-int4-mi355x and #936 for kimik2.5-fp4-mi355x) require minimal scrutiny. The only issue is a nit already captured as an inline comment.
Other factors
The placeholder pull/XXX in perf-changelog.yaml is a pre-existing pattern in this file (several other entries also use it), so while not ideal, it is consistent with existing practice. No prior reviews from me on this PR.
```yaml
description:
  - "Update vLLM image from v0.16.0 to v0.18.0 for minimax h100 and h200 configs"
pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
```
🟡 The new changelog entry for minimaxm2.5-fp8-h100-vllm and minimaxm2.5-fp8-h200-vllm uses the placeholder pull/XXX instead of the actual PR number. The pr-link on line 7 should be updated to https://github.com/SemiAnalysisAI/InferenceX/pull/958.
Bug: Placeholder PR link not replaced with actual PR number
The new changelog entry added at the top of perf-changelog.yaml (lines 1-7) contains a placeholder pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX instead of the actual PR number.
How it manifests: The diff clearly shows the new entry was added with the XXX placeholder:
```yaml
- config-keys:
    - minimaxm2.5-fp8-h100-vllm
    - minimaxm2.5-fp8-h200-vllm
  description:
    - "Update vLLM image from v0.16.0 to v0.18.0 for minimax h100 and h200 configs"
  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
```
Why existing code doesn't prevent it: There is no automated validation or CI check enforcing that pr-link values in perf-changelog.yaml reference real PR numbers rather than the XXX placeholder. The file already contains several other pre-existing XXX placeholders (for entries like dsr1-fp8-h200-sglang, glm5-fp8-mi355x-sglang, minimaxm2.5-fp8-h200-vllm, qwen3.5-bf16-mi325x-sglang, and qwen3.5-fp8-mi325x-sglang), so no pattern-match check exists to flag this.
Why this is different from the other XXX entries: Unlike the pre-existing XXX entries where the PR number may have been unknown at the time of submission, this entry was added as part of PR #958 itself. The PR number was already known and available — it is even referenced in the PR description (Closes #955 and this is PR #958). This is a straightforward oversight where the author forgot to replace the placeholder before submitting.
Impact: The pr-link field is changelog metadata that helps trace which PR introduced each configuration change. A broken link reduces traceability and makes it harder to find the associated discussion, review comments, and rationale for the change. While this does not affect any functionality, it degrades the quality of the changelog as a reference document.
Step-by-step proof:
- This PR is "Update minimax h100 & h200 vLLM image to v0.18.0" (#958), as confirmed by the PR metadata
- The PR adds a new entry at the top of perf-changelog.yaml
- That entry has pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
- Visiting the link results in a 404 (or an unrelated PR if XXX is read as a literal string)
- The correct link is https://github.com/SemiAnalysisAI/InferenceX/pull/958
Fix: Replace pull/XXX with pull/958 on line 7 of perf-changelog.yaml.