Pure-PyTorch plastic-memory sequence model: local attention plus bounded differentiable memory, surprise-gated writes, forgetting, synthetic benchmarks, and ablations.
machine-learning transformers pytorch attention ai-research sequence-modeling long-context memory-augmented-networks differentiable-memory synthetic-benchmarks
-
Updated
May 2, 2026 - Python