Per-parameter Learning Rate (PPLR) Implementation, Tensor Decomposition Models, and Tomography Updates by cedriclim1 · Pull Request #213 · electronmicroscopy/quantem

cedriclim1 · 2026-04-24T00:55:57Z

What does this PR do?

First-pass implementation of KPlanes (https://brentyi.github.io/tilted/, https://sarafridov.github.io/K-Planes/). These tensor decomposition models require optimizing multiple parameter groups, which is not handled in OptimizerMixin.

The solution @arthurmccray and I came up with is a per-parameter learning rate (PPLR) abstract class, to be inherited by models that have multiple parameter groups to optimize. This is implemented in core/ml/models/model_base.py. The most important part of this class is the get_params method, which is used for parsing the trainable parameters (see ObjectTensorDecomp's get_optimization_parameters).

class PPLR(ABC):
    """
    Abstract base class for models that require multi-scale parameter optimization.
    """

    @abstractmethod
    def get_params(self) -> Dict[str, list[nn.Parameter]]:
        """
        Return a dictionary of parameters grouped by key.

        For example if your nn.Module has multiple optimizable parameter groups,
        you can return a dictionary with the keys "grids" and "sigma_net"
        (KPlanes example).
        """
        pass
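
As a rough illustration of how a model might satisfy this interface, here is a minimal sketch. ToyPlanes, its layer sizes, and its attribute names are hypothetical and are not the KPlanes implementation in this PR; the PPLR class is repeated only so the snippet runs on its own.

```python
from abc import ABC, abstractmethod

import torch
from torch import nn


class PPLR(ABC):
    """Repeated from the description above so the sketch runs on its own."""

    @abstractmethod
    def get_params(self) -> dict[str, list[nn.Parameter]]:
        """Return trainable parameters grouped by key."""


class ToyPlanes(nn.Module, PPLR):
    """Hypothetical two-group model: feature grids plus a small decoder MLP."""

    def __init__(self, features: int = 4, resolution: int = 8) -> None:
        super().__init__()
        # One 2D feature plane per axis pair (xy, xz, yz).
        self.grids = nn.ParameterList(
            [nn.Parameter(0.1 * torch.randn(features, resolution, resolution)) for _ in range(3)]
        )
        self.sigma_net = nn.Sequential(nn.Linear(features, 16), nn.ReLU(), nn.Linear(16, 1))

    def get_params(self) -> dict[str, list[nn.Parameter]]:
        # Each key becomes one optimizer parameter group downstream.
        return {
            "grids": list(self.grids),
            "sigma_net": list(self.sigma_net.parameters()),
        }


model = ToyPlanes()
group_sizes = {name: len(params) for name, params in model.get_params().items()}
print(group_sizes)
```

The keys returned by get_params are what the optimizer setup can later map to per-group hyperparameters.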

The KPlanes and TILTED implementations are in core/ml/models/kplanes.py, which contains KPlanes, KPlanesTILTED, and CPTilted. Note that CPTilted is used for the two-phase warmup that the paper recommends, which pretrains the SO3 rotations of the planes in a lower representation space. so3params.py also has both quaternion and R9+SVD implementations, which can be swapped via a parameter in KPlanesTILTED.
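
For readers unfamiliar with the two SO3 parameterizations, the sketch below shows the underlying math only. The function names and the (w, x, y, z) quaternion ordering are assumptions made for illustration; they are not taken from so3params.py.

```python
import torch


def quat_to_rotmat(q: torch.Tensor) -> torch.Tensor:
    """Map quaternions (..., 4), assumed (w, x, y, z) order, to rotation matrices (..., 3, 3)."""
    q = q / q.norm(dim=-1, keepdim=True)  # normalize so the result is a proper rotation
    w, x, y, z = q.unbind(dim=-1)
    entries = [
        1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y),
        2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x),
        2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y),
    ]
    return torch.stack(entries, dim=-1).reshape(*q.shape[:-1], 3, 3)


def project_r9_to_so3(r9: torch.Tensor) -> torch.Tensor:
    """Project unconstrained 9-vectors (..., 9) onto the nearest rotation matrix via SVD."""
    m = r9.reshape(*r9.shape[:-1], 3, 3)
    u, _, vh = torch.linalg.svd(m)
    # Flip the last singular direction where needed so that det(R) = +1.
    sign = torch.linalg.det(u @ vh).sign()
    d = torch.stack([torch.ones_like(sign), torch.ones_like(sign), sign], dim=-1)
    return u @ torch.diag_embed(d) @ vh


# A 90 degree rotation about z sends the x axis to the y axis.
half = torch.tensor(0.5).sqrt()
rot_z = quat_to_rotmat(torch.stack([half, torch.tensor(0.0), torch.tensor(0.0), half]))

# Any batch of 9-vectors is mapped to valid rotations.
rot_batch = project_r9_to_so3(torch.randn(5, 9))
```

Both functions are differentiable, so either set of raw parameters (4 or 9 numbers per plane) can be optimized directly.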

In tomography/object_models.py there is a new object that handles the tensor decomposition methods, ObjectTensorDecomp, which inherits from ObjectINR. The following overloads are needed:

optimizer_params property: The setter is changed, since we're now passing a dictionary of parameters into our optimizers per https://docs.pytorch.org/docs/stable/optim.html#per-parameter-options.
reconnect_optimizer_to_parameters: This one is the most annoying thing to implement, since our parameters are now a dictionary.
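
To make the reconnect problem concrete, here is a minimal sketch, assuming that "reconnecting" means re-pointing each optimizer parameter group at the model's current parameter objects after they have been replaced. build_optimizer, reconnect, and the "name" key stored on each group are hypothetical and not the quantem implementation; the learning rates are placeholders.

```python
import torch
from torch import nn


def build_optimizer(
    params: dict[str, list[nn.Parameter]], lrs: dict[str, float]
) -> torch.optim.Optimizer:
    """Create one Adam parameter group per key, tagging each group with its name."""
    groups = [{"params": list(p), "lr": lrs[name], "name": name} for name, p in params.items()]
    return torch.optim.Adam(groups)


def reconnect(optimizer: torch.optim.Optimizer, params: dict[str, list[nn.Parameter]]) -> None:
    """Point every named group at the current parameter objects."""
    for group in optimizer.param_groups:
        group["params"] = list(params[group["name"]])
    # Simplification: per-parameter state (e.g. Adam moments) is keyed by the old tensors, so drop it.
    optimizer.state.clear()


old = {"grids": [nn.Parameter(torch.ones(2))], "sigma_net": [nn.Parameter(torch.ones(2))]}
optimizer = build_optimizer(old, {"grids": 1e-2, "sigma_net": 1e-3})

# Simulate the parameters being re-created, e.g. after reloading a model.
new = {name: [nn.Parameter(p.detach().clone()) for p in ps] for name, ps in old.items()}
reconnect(optimizer, new)

loss = sum((p ** 2).sum() for ps in new.values() for p in ps)
loss.backward()
optimizer.step()  # updates the new tensors, each with its own learning rate
```

Keeping the group name on the optimizer side is one way to do the matching; a real implementation may prefer to carry the optimizer state over rather than clear it.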

There are a few optional overrides that I've also included preemptively in case I want to add additional soft/hard constraints to these methods; as you can see, both apply_soft_constraints and apply_hard_constraints are still there.

Key change to OptimizerMixin: I have changed set_optimizer to handle a dictionary of parameters. I think this is better than my original idea of just implementing this in object_models.py since you would have to overload that function again. I think this is fine and still works for single optimization stuff.
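
A rough sketch of what handling both cases in one function could look like is given below. This is not the OptimizerMixin code: the spec format (a "type" key plus hyperparameters), the OPTIMIZERS table, and the function signature are all assumptions made for illustration.

```python
import torch
from torch import nn

# Hypothetical lookup from a spec's "type" string to an optimizer class.
OPTIMIZERS = {"adam": torch.optim.Adam, "sgd": torch.optim.SGD}


def set_optimizer(params, spec) -> torch.optim.Optimizer:
    """Build an optimizer from a flat parameter list or from a dict of named groups.

    Flat case:  params is an iterable of Parameters, spec is {"type": ..., "lr": ...}.
    Group case: params is {name: [Parameters]}, spec is {name: {"type": ..., "lr": ...}}.
    """
    if isinstance(params, dict):
        types = {spec[name]["type"] for name in params}
        if len(types) != 1:
            raise ValueError(f"all groups must use the same optimizer class, got {sorted(types)}")
        cls = OPTIMIZERS[types.pop()]
        groups = [
            {"params": list(p), **{k: v for k, v in spec[name].items() if k != "type"}}
            for name, p in params.items()
        ]
        return cls(groups)
    kwargs = {k: v for k, v in spec.items() if k != "type"}
    return OPTIMIZERS[spec["type"]](list(params), **kwargs)


net = nn.Linear(3, 1)
single = set_optimizer(net.parameters(), {"type": "adam", "lr": 1e-3})

grouped = set_optimizer(
    {"grids": [nn.Parameter(torch.zeros(4))], "sigma_net": list(net.parameters())},
    {"grids": {"type": "adam", "lr": 1e-2}, "sigma_net": {"type": "adam", "lr": 1e-3}},
)
```

The single-group path is unchanged from the usual PyTorch call, which is why existing workflows should not need to know about the dictionary case.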

There are also small changes to different aspects of the Tomography module, but mostly contained in object_models.py. Listing out the changes and rationale in the tomography files:

tomography.py: I've disabled torch.bfloat16 autocasting, since F.grid_sample throws autograd-not-implemented errors with it. I've also put the TILTED tensor decomposition pretraining check here, which looks at the convergence of the SO3 rotations. I don't like this being here, but I'm not quite sure where else to put it.
tomography_base.py: Type-hinting change for setting up distributed.
tomography_opt.py: Parsing is already handled in object_models.py.
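
For context on the autocast change: this PR turns bfloat16 autocasting off altogether. One possible alternative, shown here only as a sketch, is to disable autocast locally around the F.grid_sample call so that it always runs in float32. sample_fp32 is a hypothetical helper and the tensor shapes are arbitrary.

```python
import torch
import torch.nn.functional as F


def sample_fp32(volume: torch.Tensor, grid: torch.Tensor, device_type: str = "cpu") -> torch.Tensor:
    """Interpolate a 5D volume at grid coordinates with autocast switched off locally."""
    with torch.autocast(device_type=device_type, enabled=False):
        # Inputs are cast explicitly, since autocast no longer does it inside this block.
        return F.grid_sample(volume.float(), grid.float(), mode="bilinear", align_corners=True)


volume = torch.rand(1, 1, 4, 4, 4, requires_grad=True)  # (N, C, D, H, W)
grid = torch.rand(1, 2, 2, 2, 3) * 2 - 1                # sample coordinates in [-1, 1]

# Even inside an outer bfloat16 autocast region, the sampling stays in float32.
with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    out = sample_fp32(volume, grid)

out.sum().backward()  # gradients flow back to the volume
```

Whether this is preferable to disabling autocast globally depends on how much of the forward pass actually benefits from bfloat16.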

What the reviewer should do

Attached is a notebook that tests tensor decomposition methods in tomography: 0415_PPLR_testing.ipynb. The dataset is from quantem-tutorials, so just point the notebook at the directory of that dataset. Runs can be sped up by increasing the batch size, but the learning rates then have to be scaled. Setting the batch size to 4096 seems to be fine, so feel free to change it and scale the learning rates by 2.
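
If it helps when experimenting with batch sizes, scaling every group's learning rate by a common factor can be done directly on the optimizer. This is a minimal sketch; scale_learning_rates is a hypothetical helper and the learning rates below are placeholders, not the values used in the notebook.

```python
import torch
from torch import nn


def scale_learning_rates(optimizer: torch.optim.Optimizer, factor: float) -> None:
    """Multiply the learning rate of every parameter group by the same factor."""
    for group in optimizer.param_groups:
        group["lr"] *= factor


grids = [nn.Parameter(torch.zeros(2))]
sigma_net = [nn.Parameter(torch.zeros(2))]
optimizer = torch.optim.Adam(
    [{"params": grids, "lr": 1e-2}, {"params": sigma_net, "lr": 1e-3}]  # placeholder values
)

scale_learning_rates(optimizer, factor=2.0)
```

The ratio between the groups is preserved, so only the overall step size changes.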

Double-check implementations across the board, especially optimizer_mixin.py and object_models.py.
Check that the organization makes sense for the tensor decomposition stuff. I'm actually not very happy with how core/ml is looking. Where do we draw the line on what lives in models (I think everything model-related should live in there tbh: INR, CNN, etc.)?
Check that reconstruction workflows are still working, e.g., ptychography, given the OptimizerMixin change.

bobleesj · 2026-04-28T00:14:02Z

@cedriclim1 the default VS Code setting has been removed, so please pull from upstream/dev

arthurmccray

overall looks good! I made the necessary few changes to ptycho so it works with this, and overall i'm happy with it. Hopefully we're one step closer to not having to mess with OptimizerMixin ever again...

As to /core/ml organization, yes cnn.py, cnn_dense.py, inr.py, etc. should all be in /ml/models. I don't think it should be that bad to move them actually? An automatic refactor should handle everything in quantem fine, and they'll be accessible at the same level of the namespace (we still want to be able to from quantem.core.ml import CNN2d), so it shouldn't affect the tutorials either really. If you move them (and whatever else you think) we can just re-run all the tutorials and tests to make sure nothing breaks.

cedriclim1 · 2026-05-18T22:03:56Z

@arthurmccray ready for review again - addressed most comments, but some need follow-up.

arthurmccray

Overall looks good, just one or two things I saw. Did I miss anything that you needed my followup on?

arthurmccray · 2026-05-19T23:47:31Z

+        return f"T={self.quats.shape[0]}"
+
+
+class SO3ParamR9SVD(nn.Module):


bump. slight preference to having static methods rotmat_to_r9 and r9_to_rotmat (the latter of which is simply called by as_matrix), but just having it in a classmethod would be fine too.

arthurmccray · 2026-05-20T00:08:48Z

+    Handles all reconstruction parameters to be passed into object models.
+
+    Subclasses will pick whatever parameter they need
+        - Pixelated reads ".volume"


please include here what these actually mean tho. Like volume is pretty self explanatory (presumably it's the full reconstructed volume), and coords would be for INR (is it the coords for the full volume? or just part of it), but pred and all_densities and obj should be explained in a line

arthurmccray · 2026-05-20T00:16:25Z


    @property
-    def optimizer_params(self) -> OptimizerType:
+    def optimizer_params(self) -> OptimizerType | dict[str, OptimizerType]:


it probably would, but it should be easy for me to fix... but on the other hand it feels a little silly for it to be {"params": OptimizerType} for single Optimizers. Up to you!

arthurmccray · 2026-05-20T00:18:05Z

+            # Per-group case: all groups must agree on the optimizer class,
+            # and per-group hyperparameters are already baked into each dict
+            # by get_optimization_parameters().
+            opt_specs = list(self._optimizer_params.values())


in this case i don't think it makes any difference as the optimizer_type property just directly returns _optimizer_type. But in theory it might not, and that's more why I think about using the properties internally. idk if that's officially good or bad practice though--in my comment i was honestly asking it as a question 😅

cedriclim1 and others added 21 commits April 15, 2026 17:06

Added k-planes model

2148679

Added PPLR stuff

c299ca8

object_models optimization setting is working well. Only thing that needs to be overloaded is set_optimizer for PPLR cases

9ac27a9

Optimizing, set_optimizer is just default to Adam now, probably need to do the matching in set_optimizer instead of parsing in optimizer_params maybe?

5f40d5a

KPlanes Tilted claude implementation, need to talk to Corneel. Things to check: Look at object_models.py and see how the optimizer matching should be handled. It seems like set_optimizers doesn't really do what it's supposed to do.

7f51dc5

Added TV loss for PPLR models, I don't like this solution though will probably have to do TV loss computation within the model?

af140be

Merge branch 'electronmicroscopy:dev' into optmixin_pplr

04bd1dc

object_models.py now has tv_loss for both KPlanes and INR architectures. Also overloaded reconnecting optimizers

73c0d30

KPlanes with R9+SVD parameterization, everything seems to be working well. Only things to ask Corneel about is multiscale res since this adds a significant amount of compute. Should I be doing variable num_samples_per_ray?

35157fb

Everything seems to be working; only things to do is to take a look at DDP, clean-up KPlanes, fix up object_models.py since it's insanely cluttered now

1c89ad5

New TV loss function

4215d6e

TV volume -- needs significant refactoring everywhere

aebf801

DDP Fixes for PPLR stuff

2579847

Changes

a59baac

Removed some the _unwrap dependencies. Added ObjectTensorDecomp on top-level Tomography

e33fec1

Revamped model_base.py to cover type-hinting stuff. KPlanes has a set of parameters now that helps with type-setting. The main reason for having model_base.py as is it is right now is if we ever wanted to go do TensoRF or something just to validate

15bdcd4

Final changes prior to draft PR

0c71a5d

Fixed typo

99fdd88

Small change to set_optimizer in OptimizerMixin to allow for dictionary parsing

4a44826

Pretraining warning on ObjectTensorDecomp

ede9d56

cedriclim1 requested a review from arthurmccray April 24, 2026 00:58

cedriclim1 added 6 commits April 27, 2026 10:31

Doing some refactoring, adding reconstruction context to help with type hinting and make PPLR more extensible

dc0b5f0

Working on ObjectPixelated implementing ctx

01456ed

Added pyrightconfig.json to gitignore, PrivateImportUsage error annoying

a061845

ObjectINR implemented

23cfd72

Claude OptimizerMixin changes to account for different optimizable params. reconnect_optimizer_to_parameters, and optimizer_params changes

fb07d14

Moved ReconContext, changed get_optimization_parameters in dataset_models.py

6678c43

Small changes

9a96415

Merge branch 'dev' of github.com:electronmicroscopy/quantem into optmixin_pplr

e444164

cedriclim1 closed this Apr 28, 2026

cedriclim1 reopened this Apr 28, 2026

converting ptycho models to work with new PPLR

8ecd3c7

arthurmccray reviewed May 7, 2026

cedriclim1 added 12 commits May 18, 2026 11:16

Created a BaseContext class - totally optional, not sure if anyone uses this class anyways currently. Type-hinting error fixed in tomography_otp.py

adcc26d

Merge branch 'optmixin_pplr' of github.com:electronmicroscopy/quantem into optmixin_pplr

044f446

Removed import in constraints.py

af33b96

PPLR description changed to multi-parameter optimization

03da610

SO3 Rotations paper citation in SO3params.py

a62295c

Type-hinting fix in So3params

e222512

Load parameters added .to(self.device)

91d6056

Volume added to reconstruction context

f37ca9f

Citations for the kplanes models

8072fa6

Explanation in object_models.py for different types of tv loss.

eafca0c

Refactor type hinting in _unwrap function to use cast for improved type safety

e4c8dbe

Some type-hinting fix for Contexts

136a65f

arthurmccray reviewed May 20, 2026


3 participants
