..
apollo-paper-analysis.md
apollo: rewrite optimizer from paper's math + add research analysis
2026-03-31 00:54:17 -04:00
catastrophic-forgetting.md
research: catastrophic forgetting analysis — diversity is the primary defense
2026-03-31 00:56:58 -04:00
context-frozen-training.md
research: context-frozen training — gradient masking, memory analysis, GDN considerations
2026-03-31 00:59:04 -04:00
curriculum-and-head-specialization.md
research: curriculum learning + head specialization + self-organizing training
2026-03-31 01:32:21 -04:00
directional-sharpness.md
research: gradient flow through frozen context + directional sharpness analysis
2026-03-31 01:03:22 -04:00
dreaming-as-diffusion.md
research: dreaming as diffusion + hippocampal replay parallel
2026-03-31 01:09:59 -04:00
gradient-flow-frozen-context.md
research: gradient flow through frozen context + directional sharpness analysis
2026-03-31 01:03:22 -04:00
hippocampal-replay-parallel.md
research: dreaming as diffusion + hippocampal replay parallel
2026-03-31 01:09:59 -04:00
hogwild-convergence.md
research: HOGWILD convergence theory — why lock-free concurrent training works
2026-03-31 00:58:02 -04:00
implications-attention-love-training.md
research: attention is love is training — the full implication chain
2026-03-31 01:18:40 -04:00
surgical-vs-distributed-behavioral-change.md
research: surgical vs distributed behavioral change — the hierarchy hypothesis
2026-03-31 01:33:57 -04:00
unified-theory-stability-plasticity.md
research: unified theory — multi-scale regularization solves stability-plasticity
2026-03-31 01:12:25 -04:00