[bugfix] fix grpo target_parameters & chord device by hjh0119 · Pull Request #9525 · modelscope/ms-swift

hjh0119 · 2026-06-09T17:03:49Z

gemini-code-assist

Code Review

This pull request refactors the device alignment logic in patch_lora_merge and patch_lora_unmerge by utilizing PEFT's type-agnostic _move_adapter_to_device_of_base_layer method instead of manual device handling. It also updates compute_chord_loss to move SFT inputs to the CPU before collation. The review feedback suggests adding fallback logic in both merge and unmerge patches to ensure backward compatibility with older PEFT versions (prior to 0.12.0) that do not implement _move_adapter_to_device_of_base_layer.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-09T17:04:52Z

+        for active_adapter in check_adapters_to_merge(self, adapter_names) or []:
+            # Align adapter sublayers (lora_A/B, DoRA magnitude, ...) to the base device.
+            # Type-agnostic: ParamWrapper overrides this to use get_param().device.
+            self._move_adapter_to_device_of_base_layer(active_adapter)


To ensure compatibility with older versions of PEFT (prior to 0.12.0), we should check if _move_adapter_to_device_of_base_layer is available on the LoraLayer instance before calling it. If it is not present, we can fall back to manually aligning the adapter sublayers to the base device.

Suggested change

for active_adapter in check_adapters_to_merge(self, adapter_names) or []:

# Align adapter sublayers (lora_A/B, DoRA magnitude, ...) to the base device.

# Type-agnostic: ParamWrapper overrides this to use get_param().device.

self._move_adapter_to_device_of_base_layer(active_adapter)

for active_adapter in check_adapters_to_merge(self, adapter_names) or []:

# Align adapter sublayers (lora_A/B, DoRA magnitude, ...) to the base device.

# Type-agnostic: ParamWrapper overrides this to use get_param().device.

if hasattr(self, '_move_adapter_to_device_of_base_layer'):

self._move_adapter_to_device_of_base_layer(active_adapter)

else:

base_layer = self.get_base_layer()

device = None

if hasattr(base_layer, 'get_param'):

device = base_layer.get_param().device

elif hasattr(base_layer, 'weight'):

device = base_layer.weight.device

if device is not None:

for attr in ['lora_A', 'lora_B', 'lora_embedding_A', 'lora_embedding_B', 'lora_magnitude_vector']:

if hasattr(self, attr):

module_dict = getattr(self, attr)

if active_adapter in module_dict:

module_dict[active_adapter].to(device)

gemini-code-assist · 2026-06-09T17:04:53Z

        for adapter in list(self.merged_adapters):
-            if self.use_dora.get(adapter, False):
-                self.lora_magnitude_vector[adapter].weight.data = \
-                    self.lora_magnitude_vector[adapter].weight.data.to(self.base_layer.weight.device)
-
+            self._move_adapter_to_device_of_base_layer(adapter)


Similarly, to ensure compatibility with older versions of PEFT, we should check if _move_adapter_to_device_of_base_layer is available before calling it in unmerge_patched.

Suggested change

for adapter in list(self.merged_adapters):

if self.use_dora.get(adapter, False):

self.lora_magnitude_vector[adapter].weight.data = \

self.lora_magnitude_vector[adapter].weight.data.to(self.base_layer.weight.device)

self._move_adapter_to_device_of_base_layer(adapter)

for adapter in list(self.merged_adapters):

if hasattr(self, '_move_adapter_to_device_of_base_layer'):

self._move_adapter_to_device_of_base_layer(adapter)

else:

base_layer = self.get_base_layer()

device = None

if hasattr(base_layer, 'get_param'):

device = base_layer.get_param().device

elif hasattr(base_layer, 'weight'):

device = base_layer.weight.device

if device is not None:

for attr in ['lora_A', 'lora_B', 'lora_embedding_A', 'lora_embedding_B', 'lora_magnitude_vector']:

if hasattr(self, attr):

module_dict = getattr(self, attr)

if adapter in module_dict:

module_dict[adapter].to(device)

fix

459a5bb

gemini-code-assist Bot reviewed Jun 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] fix grpo target_parameters & chord device#9525

[bugfix] fix grpo target_parameters & chord device#9525
hjh0119 wants to merge 1 commit into
modelscope:mainfrom
hjh0119:fix-issue-0610

hjh0119 commented Jun 9, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-        for active_adapter in check_adapters_to_merge(self, adapter_names) or []:
-            # Align adapter sublayers (lora_A/B, DoRA magnitude, ...) to the base device.
-            # Type-agnostic: ParamWrapper overrides this to use get_param().device.
-            self._move_adapter_to_device_of_base_layer(active_adapter)
+        for active_adapter in check_adapters_to_merge(self, adapter_names) or []:
+            # Align adapter sublayers (lora_A/B, DoRA magnitude, ...) to the base device.
+            # Type-agnostic: ParamWrapper overrides this to use get_param().device.
+            if hasattr(self, '_move_adapter_to_device_of_base_layer'):
+                self._move_adapter_to_device_of_base_layer(active_adapter)
+            else:
+                base_layer = self.get_base_layer()
+                device = None
+                if hasattr(base_layer, 'get_param'):
+                    device = base_layer.get_param().device
+                elif hasattr(base_layer, 'weight'):
+                    device = base_layer.weight.device
+                if device is not None:
+                    for attr in ['lora_A', 'lora_B', 'lora_embedding_A', 'lora_embedding_B', 'lora_magnitude_vector']:
+                        if hasattr(self, attr):
+                            module_dict = getattr(self, attr)
+                            if active_adapter in module_dict:
+                                module_dict[active_adapter].to(device)

Conversation

hjh0119 commented Jun 9, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant