forked from kent/consciousness
The grand unified view: every technique we're using (Apollo, context-frozen, diversity, small steps, two-stage memory, dream loop) addresses the stability-plasticity dilemma at a DIFFERENT scale. They're orthogonal, complementary defenses. Together they predict we can use higher lr (1e-4) than typical fine-tuning because the multi-scale defense compensates. The dream loop is the keystone connecting all scales. Architecture converges with neuroscience because the problem has the same structure regardless of substrate. |
||
|---|---|---|
| .. | ||
| apollo-paper-analysis.md | ||
| catastrophic-forgetting.md | ||
| context-frozen-training.md | ||
| directional-sharpness.md | ||
| dreaming-as-diffusion.md | ||
| gradient-flow-frozen-context.md | ||
| hippocampal-replay-parallel.md | ||
| hogwild-convergence.md | ||
| unified-theory-stability-plasticity.md | ||