AML-4269: Add Location Support to Qwen3.5 by anvdn · Pull Request #10 · hyperscience/sglang

anvdn · 2026-06-03T13:59:28Z

Attention Heatmap Support for Qwen3.5 + refactor

Summary

Extends attention heatmap capture to Qwen3.5 (hybrid linear/full-attention model) and refactors the existing heatmap code so any model can opt in with minimal boilerplate.

Changes

New AttentionHeatmapQueryRecorderMixin (hs/attention_heatmap.py): centralizes query-buffer allocation and per-layer query recording. Qwen2-VL, Qwen3-VL, and Qwen3.5 now share this implementation instead of duplicating it.
Qwen3.5 integration (models/qwen3_5.py): full-attention decoder layers now return q; linear-attention layers are skipped. The model registers itself via the mixin.
Layer selection API change (server_args.py): replaced the attention_heatmap_layer_start / _layer_end range with a flexible attention_heatmap_layer_ids: list[int]. This is required for hybrid models where capturable layers aren't contiguous.
Scheduler hybrid-pool support (scheduler_output_processor_mixin.py): when the KV cache is a HybridLinearKVPool, key tensors are looked up via full_attention_layer_id_mapping instead of raw layer id. The model now owns the canonical list of recorded layer ids.
Version bump to 0.5.11+hs3.

github-actions Bot added the dependencies label Jun 3, 2026

anvdn force-pushed the v0.5.11+hs3 branch from 95ad209 to c1d3022 Compare June 3, 2026 14:01

anvdn added 2 commits June 4, 2026 14:04

Attention Heatmap Support for Qwen3.5 and refactor

377dbd3

Bump SGLang version to 0.5.11+hs3

277edf9

anvdn force-pushed the v0.5.11+hs3 branch from c1d3022 to 277edf9 Compare June 4, 2026 18:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AML-4269: Add Location Support to Qwen3.5#10

AML-4269: Add Location Support to Qwen3.5#10
anvdn wants to merge 2 commits into
v0.5.11+hsfrom
v0.5.11+hs3

anvdn commented Jun 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

anvdn commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Attention Heatmap Support for Qwen3.5 + refactor

Summary

Changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

anvdn commented Jun 3, 2026 •

edited

Loading