fix(utils): call empty_cache() after fp16→fp32 casts in prepare_model_for_kbit_training by umutonuryasar · Pull Request #3293 · huggingface/peft

umutonuryasar · 2026-05-31T10:15:03Z

The bulk param.data = param.data.to(torch.float32) loop creates temporary
tensors that PyTorch's CUDA allocator keeps cached even after they are no
longer referenced, resulting in ~1 GB of reserved-but-unused CUDA memory
on return. This breaks training on 8 GB unified-memory devices.

Fix: add a single torch.cuda.empty_cache() call (guarded by
torch.cuda.is_available()) after the cast loop so the allocator releases
those blocks back to the driver immediately.

Fixes #3265

…_for_kbit_training The bulk param.data = param.data.to(torch.float32) loop creates temporary tensors that PyTorch's CUDA allocator keeps cached even after they are no longer referenced, resulting in ~1 GB of reserved-but-unused CUDA memory on return. This breaks training on 8 GB unified-memory devices. Fix: add a single torch.cuda.empty_cache() call (guarded by torch.cuda.is_available()) after the cast loop so the allocator releases those blocks back to the driver immediately. Fixes huggingface#3265

umutonuryasar mentioned this pull request May 31, 2026

prepare_model_for_kbit_training adds ~1 GB CUDA reserved memory in 500 ms — undocumented cost that breaks memory-constrained training on 8 GB unified-memory devices #3265

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(utils): call empty_cache() after fp16→fp32 casts in prepare_model_for_kbit_training#3293

fix(utils): call empty_cache() after fp16→fp32 casts in prepare_model_for_kbit_training#3293
umutonuryasar wants to merge 1 commit into
huggingface:mainfrom
umutonuryasar:fix/kbit-training-cuda-memory-overhead

umutonuryasar commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

umutonuryasar commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant