CLUSTER · TIER 3
Apple researchers introduce MemoryLLM for interpretable feed-forward memory in transformers.
MemoryLLM decouples feed-forward modules from self-attention, enabling study of FFN parameters as context-free token-wise neural retrieval memory and revealing how input tokens access memory locations across downstream tasks.
Sources
1
X mentions
—
First seen
2Hago
Velocity
+102%/6h