Skip to content

Pull requests: jd-opensource/xllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Modify XLLM_OPS dependency: xllm -> custom_xllm_math
#1568 opened May 26, 2026 by ware2009 Loading…
3 of 11 tasks
bugfix: serialize qwen3 next attention weight transforms
#1567 opened May 26, 2026 by pjgao Contributor Loading…
refactor: add pool name when init thread pool.
#1565 opened May 26, 2026 by XuZhang99 Collaborator Loading…
3 of 17 tasks
feat: add rec qwen num_return_sequences in gpu and npu.
#1563 opened May 26, 2026 by DragonFive Collaborator Loading…
10 of 17 tasks
perf: reduce aclgraph copy operations.
#1562 opened May 26, 2026 by JC-ut0 Contributor Loading…
17 tasks
feat: qwen3.5 support chunked prefill.
#1551 opened May 25, 2026 by maojunx99 Contributor Loading…
5 of 13 tasks
bugfix: add qwen3.5 chat stop token.
#1548 opened May 25, 2026 by yingxudeng Collaborator Loading…
9 of 17 tasks
feat: support qwen3.5 linear-state prefix cache.
#1546 opened May 25, 2026 by yingxudeng Collaborator Loading…
17 tasks
feat: parallelize manual-loader decoder weight merge.
#1542 opened May 25, 2026 by Clement-Wang26 Collaborator Loading…
[WIP]perf: overlap mtp draft extend preparation
#1541 opened May 25, 2026 by pjgao Contributor Loading…
feat: support Cola-DLM on cuda device.
#1539 opened May 24, 2026 by Dragonliu2018 Contributor Draft
feat: update mlu container to 26.04.
#1535 opened May 23, 2026 by phantomlei3 Collaborator Loading…
refactor: refact multimodal processor.
#1530 opened May 22, 2026 by wly-115 Collaborator Loading…
feat: support MiMo-7B-Base on cuda device.
#1523 opened May 22, 2026 by Dragonliu2018 Contributor Loading…
feat: add dcu backend support.
#1522 opened May 22, 2026 by WenQ7 Loading…
feat: expose cached token usage in responses.
#1514 opened May 21, 2026 by zhang-minchao Collaborator Loading…
refactor: remove xattention one-stage decode path.
#1504 opened May 21, 2026 by LMX-xin Collaborator Draft
feat: enable REC XAttention for Qwen3 MoE on cuda device.
#1500 opened May 20, 2026 by LMX-xin Collaborator Loading…
feat: support vae parallel for qwen-image-edit-plus.
#1499 opened May 20, 2026 by shan-chen-feng Collaborator Loading…
feat: support customized multimodal preprocess configs.
#1481 opened May 19, 2026 by xanecdotex Collaborator Loading…
refactor: remove negative condition when choosing decode or prefill
#1475 opened May 18, 2026 by rauletorresc Contributor Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.