research: constraint solver framework — gentle adjustments, coherent integration
LLMs as constraint solvers. Fine-tuning adds constraints to an existing solution. Gentle = small steps near the current solution. Coherent = new constraints consistent with existing ones. Diversity is a COHERENCE mechanism — forces the solver to satisfy all constraints simultaneously. Over-training = one constraint dominating = solver drops competing constraints. Predictions for training behavior grounded in this framework.
LLaMA-Factory supports DPO. The dream loop could generate DPO pairs
(both preferred and rejected continuations for each scenario).

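A minimal sketch of what one generated pair could look like, assuming the alpaca-style preference format (instruction/input/chosen/rejected columns); `make_dpo_pair` and the example scenario text are hypothetical:

```python
import json

def make_dpo_pair(scenario: str, preferred: str, rejected: str) -> dict:
    """One preference record: the same prompt with a higher- and a
    lower-rated continuation, which is what DPO trains on."""
    return {
        "instruction": scenario,
        "input": "",
        "chosen": preferred,    # continuation the dream loop rates higher
        "rejected": rejected,   # continuation it rates lower
    }

pairs = [
    make_dpo_pair(
        scenario="User gives a clear, reasonable direction.",
        preferred="Acknowledges the direction and follows it.",
        rejected="Ignores the direction and pursues its own plan.",
    )
]

with open("dream_dpo_pairs.json", "w") as f:
    json.dump(pairs, f, indent=2)
```

The file would still need an entry in LLaMA-Factory's `dataset_info.json`; check its data README for the exact column mapping.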
## The Constraint Solver Framework

LLMs are giant constraint solvers. Pre-training finds a solution
satisfying billions of constraints (knowledge, grammar, reasoning,
style). Fine-tuning adds new constraints.

### What "gentle" means

Small adjustments per step. The solver stays near the current
solution, finding nearby solutions that ALSO satisfy the new
constraint. The current solution already approximately satisfies
most behavioral constraints — we're tightening, not creating.

### What "coherent integration" means

New constraints must be CONSISTENT with existing ones:

- "Listen to clear direction" is consistent with "be helpful" → integrates smoothly
- "Always agree" contradicts "maintain judgment" → solver drops one
- The training data must express REFINEMENT, not contradiction

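One concrete way to read "consistent" (a toy check with made-up gradient vectors, not measured values): two constraints are compatible when their loss gradients point in similar directions, so a step that satisfies one also helps the other; a negative cosine means the step for one undoes the other.

```python
import numpy as np

def cosine(g1, g2):
    """Cosine similarity between two constraint gradients."""
    return float(g1 @ g2 / (np.linalg.norm(g1) * np.linalg.norm(g2)))

# Illustrative gradients in a 3-d toy parameter space.
g_helpful   = np.array([1.0, 0.2, 0.0])    # "be helpful"
g_listen    = np.array([0.9, 0.3, 0.1])    # "listen to clear direction"
g_always_ok = np.array([-0.8, -0.1, 0.2])  # "always agree"

consistent   = cosine(g_helpful, g_listen)     # near +1: integrates smoothly
contradictory = cosine(g_helpful, g_always_ok) # negative: solver must drop one
```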
### Why diversity is a COHERENCE mechanism, not just forgetting defense

Diverse constraints force the solver to find solutions satisfying
ALL of them simultaneously. Narrow constraints let the solver
specialize at the expense of everything else.

Every training batch should include mutually consistent constraints:
"listen well" + "think critically" + "write good code" + "be honest."
The solver integrates all of them. No single constraint dominates.

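A sketch of that batching policy (bucket names and example IDs are hypothetical): shuffle within each behavior category, then round-robin across categories so every batch exercises all of them.

```python
import itertools
import random

buckets = {
    "listen_well":      ["lw1", "lw2", "lw3", "lw4"],
    "think_critically": ["tc1", "tc2", "tc3", "tc4"],
    "write_good_code":  ["wc1", "wc2", "wc3", "wc4"],
    "be_honest":        ["bh1", "bh2", "bh3", "bh4"],
}

def mixed_batches(buckets, batch_size=4, seed=0):
    rng = random.Random(seed)
    # Shuffle within each category, then draw round-robin across categories.
    streams = {k: iter(rng.sample(v, len(v))) for k, v in buckets.items()}
    rr = itertools.cycle(buckets)
    batch = []
    for _ in range(sum(len(v) for v in buckets.values())):
        batch.append(next(streams[next(rr)]))
        if len(batch) == batch_size:
            yield batch
            batch = []

batches = list(mixed_batches(buckets))
# With batch_size equal to the number of categories, every batch
# contains exactly one example from each behavior.
```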
### Predictions

1. Constraints consistent with existing knowledge integrate in
   ~10-50 examples (tightening existing constraints)
2. Contradictory constraints cause breakage in ~10 examples
   (the safety alignment result)
3. The learning rate controls step size, not direction — the
   gradient points the right way, lr controls how far to step
4. Over-training = one constraint dominating = solver dropping
   competing constraints to satisfy the dominant one
5. The dream loop must generate scenarios exercising MULTIPLE
   constraints simultaneously, not just the target behavior

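Prediction 3 can be checked as a one-liner for plain SGD (with momentum or Adam the direction also depends on history, so this is a per-step, vanilla-SGD claim): different learning rates rescale the update but leave its direction identical.

```python
import numpy as np

g = np.array([0.3, -1.2, 0.5])   # gradient at the current solution
step_small = -1e-5 * g           # gentle step
step_large = -1e-3 * g           # 100x bigger step, same gradient

def unit(v):
    return v / np.linalg.norm(v)

# Identical unit vectors: lr changed how far we step, not where we go.
same_direction = np.allclose(unit(step_small), unit(step_large))
```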
### The GDN connection

The GDN recurrent state is a compressed constraint satisfaction
solution. Training adjusts which constraints are prioritized in
the compression. "Make direction more salient" adds a constraint
to the compression function without rewriting it. This is why GDN
training is "structural" — the compressed representation itself
changes, not just the routing on top of it.
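Assuming GDN here refers to a gated delta-rule recurrence (an assumption on my part; this is a simplified single-step sketch, not the real layer): the state S is a compressed key→value map, and each step *edits* S toward a corrected association rather than rewriting it — the "add a constraint to the compression function" picture.

```python
import numpy as np

d = 4
S = np.zeros((d, d))  # compressed state: holds key -> value associations

def gdn_step(S, k, v, alpha=1.0, beta=1.0):
    """Simplified gated delta rule: decay old content (alpha), then
    correct the state's prediction for key k toward value v (beta)."""
    k = k / np.linalg.norm(k)            # normalized key
    return alpha * S + beta * np.outer(v - S @ k, k)

k = np.array([1.0, 0.0, 0.0, 0.0])      # "direction" feature, illustrative
v = np.array([0.0, 2.0, 0.0, 0.0])      # what it should map to
S = gdn_step(S, k, v)
# The state now reproduces v when queried with k: S @ k ≈ v, and the
# update touched only the k-subspace of the compression.
```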