FIX: inject_adapter incorrectly propagates inference_mode to active adapters by kiritozc · Pull Request #3290 · huggingface/peft

kiritozc · 2026-05-30T07:58:34Z

Description

When injecting a new adapter via inject_adapter, the housekeeping section calls:

self.set_adapter(self.active_adapters, inference_mode=peft_config.inference_mode)

Here peft_config belongs to the newly injected adapter, but self.active_adapters points to the existing active adapter(s). When the new adapter has inference_mode=True (e.g. during save_pretrained with path_initial_model_for_weight_conversion in PiSSA/OLoRA/CorDA workflows), this erroneously freezes the already-active training adapter, causing grad_norm to become 0 and training to effectively stop.

Fix

Only propagate inference_mode when the new adapter IS the active adapter (first-time injection). For subsequent adapters, set_adapter is called without inference_mode, preserving the existing active adapter's trainability state. The new adapter's own inference_mode is still correctly handled by the existing code below.

Tests

Added test_inject_adapter_inference_mode_does_not_freeze_active_adapter — a regression test covering LoraConfig, LoHaConfig, LoKrConfig, IA3Config, OFTConfig, BOFTConfig
Two existing xfailing tests (switch_inference_mode and add_adapter) remain as xfail since they require decoupling active adapter selection from requires_grad in set_adapter

…ctive adapters When injecting a new adapter via inject_adapter, the housekeeping section called set_adapter(self.active_adapters, inference_mode=peft_config.inference_mode). Here peft_config belongs to the newly injected adapter, but self.active_adapters points to the existing active adapter(s). When the new adapter has inference_mode=True (e.g. during save_pretrained with path_initial_model_for_weight_conversion in PiSSA/OLoRA/CorDA workflows), this erroneously freezes the already-active training adapter, causing grad_norm to become 0 and training to effectively stop. The fix only propagates inference_mode when the new adapter IS the active adapter (first-time injection). For subsequent adapters, set_adapter is called without inference_mode, preserving the existing active adapter's trainability state. The new adapter's own inference_mode is still correctly handled by the existing code that follows. This was a regression introduced in commit 13fa0ae (PR huggingface#2765). A regression test is added that verifies adding an adapter with inference_mode=True does not freeze the existing active adapter.

BenjaminBossan

Thanks for fixing this edge case. This fix looks good but I have two small comments regarding the test, please check.

BenjaminBossan · 2026-06-01T09:35:18Z

        params_with_grad = [n for n, p in model.named_parameters() if p.requires_grad]
        assert all(not p.requires_grad for p in model.parameters())

+    @pytest.mark.parametrize("config_cls", [LoraConfig, LoHaConfig, LoKrConfig, IA3Config, OFTConfig, BOFTConfig])


OFT is failing to initialize with the given arguments. But I think for this test, just checking LoRA is enough (like in the previous tests).

BenjaminBossan · 2026-06-01T09:35:26Z

+        # Regression test for a bug where adding a second adapter with inference_mode=True would incorrectly freeze
+        # the already-active training adapter. This happened because inject_adapter propagated the new adapter's
+        # inference_mode to set_adapter for the existing active adapters.
+        # See PR #XXXX


Update the comment.

Hi, I've addressed both review comments:

Simplified the test to only use LoraConfig

Updated the PR reference in the comment (FIX: inject_adapter incorrectly propagates inference_mode to active adapters #3290)

Please take another look. Thanks!

HuggingFaceDocBuilderDev · 2026-06-01T13:37:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

BenjaminBossan

A couple of tests are failing now. I think this is because they baked in the previous assumption and thus may require updating. Could you please take a look?

kiritozc mentioned this pull request May 30, 2026

Fix: restore requires_grad after _save_converted_model to work around peft inject_adapter side effect kiritozc/ms-swift#1

Merged

kiritozc closed this in kiritozc/ms-swift#1 May 30, 2026

kiritozc reopened this May 30, 2026

kiritozc mentioned this pull request May 30, 2026

Fix: restore requires_grad after _save_converted_model to work around peft inject_adapter side effect modelscope/ms-swift#9452

Merged

BenjaminBossan requested changes Jun 1, 2026

View reviewed changes

Address review: simplify test to only LoRA, fix PR reference

b4a2124

BenjaminBossan reviewed Jun 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: inject_adapter incorrectly propagates inference_mode to active adapters#3290

FIX: inject_adapter incorrectly propagates inference_mode to active adapters#3290
kiritozc wants to merge 2 commits into
huggingface:mainfrom
kiritozc:fix/inject-adapter-inference-mode-propagation

kiritozc commented May 30, 2026

Uh oh!

BenjaminBossan left a comment

Uh oh!

BenjaminBossan Jun 1, 2026

Uh oh!

BenjaminBossan Jun 1, 2026

Uh oh!

kiritozc Jun 1, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Jun 1, 2026

Uh oh!

BenjaminBossan left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

kiritozc commented May 30, 2026

Description

Fix

Tests

Related

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

BenjaminBossan Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

kiritozc Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jun 1, 2026

Uh oh!

BenjaminBossan left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants