Claude finished @cquil11's task in 51s — Review of PR #854:
LGTM — no blocking issues found. This is a clean upgrade of the H200 GPT-OSS TensorRT-LLM config from the dev image (…)

Hi @jgangani, performance is wildly worse on the latest TRT-LLM 1.3 RC. Can you take a look?

@functionstackx thanks for the perf diff. Looking into it; will get back to you.

@jgangani any update?

@jgangani we at SemiAnalysis are going to merge this with this tag by Monday.

@cquil11 @functionstackx Sorry for dropping this without an update. The delay is because the TRT-LLM rc5 container's PyTorch update automatically bumped Triton to 3.5.1, which has a regression in the MoE activation kernel. We believe this was fixed in Triton 3.6. We are in the process of releasing a new periodic container with the PyTorch and Triton updates by next week. This is a short work week at NVIDIA (we are off Thursday and Friday). I hope we can hold off merging this until then.
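For anyone hitting this before the new container ships, here is a minimal sketch of a runtime guard against the affected Triton version. The version numbers (3.5 regressed, 3.6 believed fixed) come from the comment above; the function name is hypothetical and not part of TRT-LLM or Triton:

```python
# Sketch: warn if the installed Triton falls in the range reported to have
# the MoE activation kernel regression (3.5.x per this thread, fixed in 3.6).
def is_regressed_triton(ver: str) -> bool:
    """Return True if `ver` (e.g. "3.5.1") is in the reported regressed range."""
    major, minor = (int(x) for x in ver.split(".")[:2])
    return (major, minor) == (3, 5)

print(is_regressed_triton("3.5.1"))  # True: affected
print(is_regressed_triton("3.6.0"))  # False: believed fixed
```

In practice you would feed this the installed version (e.g. `importlib.metadata.version("triton")`) at container startup and either warn or refuse to run benchmarks on an affected build.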


