consciousness

History

Kent Overstreet 389f1bbe03 amygdala: bump subspace-k default to 512 k=20 was far too aggressive a truncation — it discards per-attention-head discriminability entirely. At hidden_dim=5120, 40 heads × head_dim=128 each contribute their own 128-dim block to the residual stream via W_o columns. To resolve 'this concept lives in head H', per-story SVD needs enough rank to separate head contributions, which means k on the order of hundreds. 512 is a reasonable default: clamped to n_tokens per story so short stories use their full natural rank. The eigenvalue spectrum of M_pos - M_base should become sharper (larger λ_0/λ_1 gap) as we stop averaging across nuisance-shared directions. Co-Authored-By: Proof of Concept <poc@bcachefs.org>		2026-04-18 21:41:00 -04:00
..
amygdala_stories	amygdala stories: disambiguation scenarios for fragmented concepts	2026-04-18 21:08:23 -04:00
amygdala_training	amygdala: bump subspace-k default to 512	2026-04-18 21:41:00 -04:00
apollo_plugin	training: move to dedicated subprocess with ZMQ communication	2026-04-16 02:04:26 -04:00
research	research: latent reasoning integration plans for Qwen 3.5 27B	2026-04-12 15:50:09 -04:00
DESIGN.md	training: move to dedicated subprocess with ZMQ communication	2026-04-16 02:04:26 -04:00
pyproject.toml	training: move to dedicated subprocess with ZMQ communication	2026-04-16 02:04:26 -04:00