Skip to content

Update glm-5 container to use SGLang latest#1561

Open
xinli-sw wants to merge 2 commits into
SemiAnalysisAI:mainfrom
xinli-sw:glm-update
Open

Update glm-5 container to use SGLang latest#1561
xinli-sw wants to merge 2 commits into
SemiAnalysisAI:mainfrom
xinli-sw:glm-update

Conversation

@xinli-sw
Copy link
Copy Markdown

@xinli-sw xinli-sw commented May 24, 2026

Note

Medium Risk
Switches benchmark recipes to a nightly SGLang container build, which can change runtime behavior/performance and introduce compatibility regressions despite being config-only.

Overview
Updates the GLM-5 B200 SGLang benchmark configs (FP8/FP4, incl. MTP variants) to use lmsysorg/sglang:nightly-dev-cu13-20260523-c112f762 instead of the pinned v0.5.12-cu130 image.

Adds perf changelog entries documenting the SGLang image bump for the glm5-fp4-b200-sglang and glm5-fp4-b200-sglang-mtp configs.

Reviewed by Cursor Bugbot for commit 8678a8d. Bugbot is set up for automated code reviews on this repo. Configure here.

Copy link
Copy Markdown
Contributor

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

1 participant