fix: memoize optimizer factory functions to avoid JIT recompilation (#353) by Muneerali199 · Pull Request #1693 · google-deepmind/optax

Muneerali199 · 2026-06-09T18:24:27Z

Fixes #353

Problem

When GradientTransformation objects are passed as static arguments to jax.jit, JAX recompiles on every call because each call to an optimizer factory creates new closures with different identities, producing different hashes.

Root Cause

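GradientTransformation is a NamedTuple, so its __hash__ and __eq__ are computed from its init and update fields. Those fields are closures, which hash and compare by identity. Since every call to optax.adam(1e-3) creates new closures, each call returns an object with a different hash that never compares equal to the previous one.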
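The identity issue can be reproduced without JAX or optax. The sketch below uses a hypothetical `Transform` NamedTuple and a hypothetical `make_sgd` factory as stand-ins (neither is optax code) to show that two calls with the same argument give unequal objects:

```python
from typing import Callable, NamedTuple


class Transform(NamedTuple):
    """Hypothetical stand-in for a pair of init/update functions."""
    init: Callable
    update: Callable


def make_sgd(learning_rate: float) -> Transform:
    """Hypothetical factory: builds fresh closures on every call."""

    def init(params):
        return ()

    def update(grads, state):
        return [-learning_rate * g for g in grads], state

    return Transform(init, update)


a = make_sgd(1e-3)
b = make_sgd(1e-3)

# Tuples compare field by field, and plain functions compare by identity,
# so the two results are unequal even though the arguments match.
same_object_equal = (a == a)
different_calls_equal = (a == b)
shared_init = a.init is b.init

print(same_object_equal, different_calls_equal, shared_init)
```

Anything that keys a cache on such an object, as jax.jit does for static arguments, would treat `a` and `b` as different keys.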

Solution

Add @functools.lru_cache(maxsize=None) to all optimizer alias factory functions. This ensures that identical arguments always return the exact same GradientTransformation object with a stable hash.
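As a minimal sketch of the memoization idea (using a hypothetical `Transform` tuple and `make_sgd_cached` factory rather than the actual optax functions), decorating the factory makes repeated calls with the same argument return the same object:

```python
import functools
from typing import Callable, NamedTuple


class Transform(NamedTuple):
    """Hypothetical stand-in for a pair of init/update functions."""
    init: Callable
    update: Callable


@functools.lru_cache(maxsize=None)
def make_sgd_cached(learning_rate: float) -> Transform:
    """Hypothetical factory: the cache returns one object per argument set."""

    def init(params):
        return ()

    def update(grads, state):
        return [-learning_rate * g for g in grads], state

    return Transform(init, update)


first = make_sgd_cached(1e-3)
second = make_sgd_cached(1e-3)
other = make_sgd_cached(1e-2)

# Same argument: the cached object is returned, so identity and hash match.
reused = first is second
# Different argument: a separate object is built and cached.
distinct = first is not other

print(reused, distinct, make_sgd_cached.cache_info().hits)
```

Two caveats apply to this sketch: lru_cache requires every argument to be hashable, and it may keep separate entries for the same value passed positionally and by keyword.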

Files changed:

optax/_src/alias.py: Added memoization to all 29 optimizer factory functions
optax/transforms/_combining.py: Added to chain()
optax/_src/base.py: Added to identity(), set_to_zero(), stateless(), stateless_with_tree_map(), with_extra_args_support()
optax/_src/alias_test.py: Added GradientTransformationMemoizationTest with 6 tests

Verification

All memoized factory functions return is-identical objects for identical arguments
Different arguments still produce different objects
JAX jax.jit does not recompile when the same optimizer configuration is reused
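One way to check the last point is to count how often JAX traces a jitted function. The following is a sketch under assumptions, not the test added in the PR: `Transform`, `make_scale`, and `apply_update` are hypothetical names, and the trace counter relies on the fact that Python side effects inside a jitted function run only during tracing.

```python
import functools
from typing import Callable, NamedTuple

import jax
import jax.numpy as jnp


class Transform(NamedTuple):
    """Hypothetical stand-in for a pair of init/update functions."""
    init: Callable
    update: Callable


def make_scale(step_size: float) -> Transform:
    """Hypothetical factory: builds fresh closures on every call."""

    def init(params):
        return ()

    def update(grads, state):
        return -step_size * grads, state

    return Transform(init, update)


# Memoized variant of the same factory.
make_scale_cached = functools.lru_cache(maxsize=None)(make_scale)

trace_log = []


@functools.partial(jax.jit, static_argnums=0)
def apply_update(transform, grads):
    trace_log.append("traced")  # Runs only while JAX traces the function.
    updates, _ = transform.update(grads, ())
    return updates


grads = jnp.ones(3)

# Keep the uncached objects alive so each one is a distinct static argument.
fresh = [make_scale(0.1) for _ in range(3)]
for transform in fresh:
    apply_update(transform, grads)
uncached_traces = len(trace_log)

trace_log.clear()
for _ in range(3):
    apply_update(make_scale_cached(0.1), grads)
cached_traces = len(trace_log)

print(uncached_traces, cached_traces)
```

With the memoized factory the function should be traced once for the three calls, while the uncached factory triggers a new trace for each fresh object.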

…oogle-deepmind#353)

When GradientTransformation objects are passed as static arguments to jax.jit, JAX recompiles on every call because the closures inside each GradientTransformation have different identities, producing different hashes.

Fix: add @functools.lru_cache(maxsize=None) to all 29 optimizer alias factory functions in alias.py, plus chain() in _combining.py, and identity/set_to_zero/stateless/with_extra_args_support in base.py.

Memoization ensures the same arguments always return the exact same GradientTransformation object, with stable identity-based hashing. JAX sees the same static argument and does not recompile.

Closes google-deepmind#353

rdyro · 2026-06-09T20:33:39Z

Thanks, this looks like an interesting direction!

The straightforward application of lru_cache is probably going to break on any dynamic data that is not hashable. What are you thinking as far as solving that problem?

Muneerali199 · 2026-06-10T09:35:18Z

Good point. I'd handle it by wrapping lru_cache to catch the TypeError raised for unhashable args and fall through to an uncached call: no crash, no id() reuse bug. In the common case (hashable args) the result is cached and stable. Want me to update the PR with this?
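A rough sketch of what such a wrapper could look like follows. The helper name `memoize_if_hashable` and the example factory `make_schedule` are hypothetical and not part of the PR. This variant checks hashability up front instead of catching TypeError around the cached call, so a TypeError raised inside the factory itself is not silently swallowed and the factory is never run twice.

```python
import functools
from typing import Any, Callable


def memoize_if_hashable(factory: Callable[..., Any]) -> Callable[..., Any]:
    """Hypothetical helper: cache results, bypassing the cache for unhashable args."""
    cached = functools.lru_cache(maxsize=None)(factory)

    @functools.wraps(factory)
    def wrapper(*args, **kwargs):
        try:
            # lru_cache hashes the positional and keyword values; probe them first.
            hash((args, tuple(kwargs.values())))
        except TypeError:
            # Unhashable input (list, dict, array, ...): build a fresh, uncached object.
            return factory(*args, **kwargs)
        return cached(*args, **kwargs)

    return wrapper


@memoize_if_hashable
def make_schedule(boundaries, scale=1.0):
    """Hypothetical factory returning a closure over its arguments."""
    return lambda step: scale * sum(step >= b for b in boundaries)


# Hashable argument (tuple): the same closure is reused.
s1 = make_schedule((10, 20))
s2 = make_schedule((10, 20))

# Unhashable argument (list): no error, but each call builds a new closure.
u1 = make_schedule([10, 20])
u2 = make_schedule([10, 20])

print(s1 is s2, u1 is u2)
```

The trade-off is that callers passing unhashable arguments keep the original behavior, including recompilation when the result is used as a static argument to jax.jit.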

Muneerali199 wants to merge 1 commit into google-deepmind:main from Muneerali199:fix/memoize-optimizers-for-jit-hash