BUG REPORT: Inverse Boolean Logic in "Limit Model Offloading to GPU Dedicated Memory" Toggle affecting MoE Models

### System Environment:

- **LM Studio Version:** 0.4.17
- **OS:** Windows 10
- **Hardware:** AMD Ryzen 7 5700X3D | NVIDIA RTX 5060 Ti (16GB VRAM) | 32GB RAM
- **Model Tested:** Qwen 3.6 35B A3B (GGUF) with a 64K context window

### Description of the Bug & Visual Evidence:

The **"Limit Model Offloading to GPU Dedicated Memory"** toggle appears to have inverted boolean logic when running `Qwen 3.6 35B A3B`. The observed behavior in each state is the opposite of what the label implies:

- **When ON (Toggle Enabled):** VRAM usage caps at ~13.5 GB and the remaining work is offloaded to the CPU, causing CPU usage to spike. Performance drops to **~23 tok/s**.
<img width="2878" height="1080" alt="Image" src="https://github.com/user-attachments/assets/708c439f-57ac-4991-a82e-daf4e1d5bad6" />

- **When OFF (Toggle Disabled):** The backend correctly allocates maximum VRAM (~15.2 GB), shared memory stays at 0 GB, and performance reaches **72+ tok/s** with low CPU usage.
<img width="2878" height="1080" alt="Image" src="https://github.com/user-attachments/assets/729ae2fc-6a34-4270-97df-4e8832f63e11" />
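For anyone trying to reproduce the two states above without relying on screenshots, here is a minimal sketch that reads dedicated VRAM usage from `nvidia-smi` while the model is loaded. It assumes `nvidia-smi` is on the `PATH`; the helper names `parse_memory_csv` and `query_gpu_memory` are hypothetical and not part of LM Studio. As far as I know, `nvidia-smi` reports only dedicated memory, so shared GPU memory still has to be checked in Task Manager.

```python
import subprocess


def parse_memory_csv(text):
    """Parse 'used, total' pairs in MiB (one GPU per line) into (used_gib, total_gib) tuples."""
    readings = []
    for line in text.strip().splitlines():
        used_mib, total_mib = (float(field) for field in line.split(","))
        readings.append((used_mib / 1024, total_mib / 1024))
    return readings


def query_gpu_memory():
    """Ask nvidia-smi for used and total dedicated memory on every GPU."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",  # plain numbers, reported in MiB
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_csv(result.stdout)


if __name__ == "__main__":
    for index, (used, total) in enumerate(query_gpu_memory()):
        print(f"GPU {index}: {used:.1f} / {total:.1f} GiB dedicated memory in use")
```

Running it once with the toggle ON and once with it OFF (after the model has finished loading) should give numbers comparable to the ~13.5 GB and ~15.2 GB figures reported above.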
