Pull requests: jd-opensource/xllm
Modify XLLM_OPS dependency: xllm -> custom_xllm_math
#1568
opened May 26, 2026 by
ware2009
3 of 11 tasks
bugfix: serialize qwen3 next attention weight transforms
#1567
opened May 26, 2026 by
pjgao
Contributor
refactor: add pool name when init thread pool.
#1565
opened May 26, 2026 by
XuZhang99
Collaborator
3 of 17 tasks
feat: add rec qwen num_return_sequences in gpu and npu.
#1563
opened May 26, 2026 by
DragonFive
Collaborator
10 of 17 tasks
perf: reduce aclgraph copy operations.
#1562
opened May 26, 2026 by
JC-ut0
Contributor
17 tasks
feat: qwen3.5 support chunked prefill.
#1551
opened May 25, 2026 by
maojunx99
Contributor
5 of 13 tasks
bugfix: add qwen3.5 chat stop token.
#1548
opened May 25, 2026 by
yingxudeng
Collaborator
9 of 17 tasks
feat: support qwen3.5 linear-state prefix cache.
#1546
opened May 25, 2026 by
yingxudeng
Collaborator
17 tasks
feat: parallelize manual-loader decoder weight merge.
#1542
opened May 25, 2026 by
Clement-Wang26
Collaborator
[WIP]perf: overlap mtp draft extend preparation
#1541
opened May 25, 2026 by
pjgao
Contributor
feat: support Cola-DLM on cuda device.
#1539
opened May 24, 2026 by
Dragonliu2018
Contributor
Draft
[WIP] perf(npu): eliminate redundant Transpose in Qwen3.5 MTP spec verify conv path
#1536
opened May 23, 2026 by
pjgao
Contributor
feat: support interlayer add norm and SplitRmsnormRope operation for qwen3.
#1531
opened May 22, 2026 by
shan-chen-feng
Collaborator
feat: support MiMo-7B-Base on cuda device.
#1523
opened May 22, 2026 by
Dragonliu2018
Contributor
feat: expose cached token usage in responses.
#1514
opened May 21, 2026 by
zhang-minchao
Collaborator
feat: enable REC XAttention for Qwen3 MoE on cuda device.
#1500
opened May 20, 2026 by
LMX-xin
Collaborator
feat: support vae parallel for qwen-image-edit-plus.
#1499
opened May 20, 2026 by
shan-chen-feng
Collaborator
feat: add TileLang chunk_gated_delta_rule_fwd_h kernel.
#1498
opened May 20, 2026 by
fengz72
bugfix: use max_concurrent_requests for single block and linear state allocation.
#1496
opened May 20, 2026 by
pjgao
Contributor
feat: support customized multimodal preprocess configs.
#1481
opened May 19, 2026 by
xanecdotex
Collaborator
refactor: remove negative condition when choosing decode or prefill
#1475
opened May 18, 2026 by
rauletorresc
Contributor