Skip to content

dsv4-fp4-b300-sglang: update image to nightly#1506

Open
yhyang201 wants to merge 3 commits into
mainfrom
yyh/update-dsv4-b300-sglang-image
Open

dsv4-fp4-b300-sglang: update image to nightly#1506
yhyang201 wants to merge 3 commits into
mainfrom
yyh/update-dsv4-b300-sglang-image

Conversation

@yhyang201
Copy link
Copy Markdown
Collaborator

Summary

  • Update image from deepseek-v4-b300@sha256:2fec8d... to nightly-dev-cu13-20260518-c67b2870
  • Refactor benchmark script to dispatch by CONC instead of nested DP_ATTENTION/CONC/EP_SIZE
  • Switch high-concurrency profiles (CONC 2048/4096/8192) from --moe-a2a-backend deepep to megamoe
  • Remove env vars deleted from sglang main or redundant with defaults
  • Remove --deepep-config (not needed by megamoe)
  • Fix CONC=512 yaml ep: 4ep: 1 (flashinfer_mxfp4 doesn't set ep=tp)

@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

1 similar comment
@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

@yhyang201 yhyang201 changed the title dsv4-fp4-b300-sglang: update image to nightly, switch to megamoe dsv4-fp4-b300-sglang: update image to nightly May 18, 2026
@yhyang201 yhyang201 force-pushed the yyh/update-dsv4-b300-sglang-image branch from f25519e to cf36b0c Compare May 19, 2026 15:32
@github-actions
Copy link
Copy Markdown
Contributor

@github-actions
Copy link
Copy Markdown
Contributor

@yhyang201 yhyang201 force-pushed the yyh/update-dsv4-b300-sglang-image branch from d8ca8a8 to 09875d7 Compare May 21, 2026 10:52
@github-actions
Copy link
Copy Markdown
Contributor

1 similar comment
@github-actions
Copy link
Copy Markdown
Contributor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

1 participant