Skip to content

[Bug] slime-v0.3.0 版本在跑 qwen3.6 35B A3B 模型的时候,在第二次 rollout 会有乱码。怀疑 镜像&sglang 版本导致 #2091

Description

@gtarcoder

Bug Description

slime-v0.3.0 版本在跑 qwen3.6 35B A3B 模型的时候,在第二次 rollout 会有乱码

  1. 使用slime-v0.3.0 配套的 镜像 slimerl/sglang:v0.5.12.post1-cu129 会出现乱码
  2. 使用 slime-v0.2.4 配套的 镜像 slimerl/sglang:v0.5.9 则结果正常

Steps to Reproduce

分别使用:

  1. slime-v0.3.0 配套的 镜像 slimerl/sglang:v0.5.12.post1-cu129 去跑 Qwen-3.6 MOE 35B A3B RL 任务。
  2. slime-v0.2.4 配套的 镜像 slimerl/sglang:v0.5.9 去跑 去跑 Qwen-3.6 MOE 35B A3B RL 任务。

Expected Behavior

两个任务在第二次 rollout的时候都没有乱码

Actual Behavior

能看到 前者会在 第二次 rollout的时候有乱码,而后者正常

Environment

  • slime version: v0.3.0
  • Python version: 3.12.3
  • PyTorch version: 2.9.1+cu129
  • CUDA/ROCm version: 13.0
  • GPU type and count: H800 x 8
  • OS: ubuntu-24.04
  • SGLang version (if relevant): 镜像自带
  • Megatron-LM version (if relevant):

Logs

Additional Context

No response

Pre-submission Checklist

  • I have read the CONTRIBUTING.md and understand the collaboration scope.
  • I have read the documentation and my issue is not addressed there.
  • I have searched for existing issues and this is not a duplicate.
  • I have provided a minimal, reproducible example.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions