Task vectors (W_finetuned - W_pretrained) compose through arithmetic. Train behavioral patterns separately, extract task vectors, compose with TIES-merging. Result: personality as version control — each behavioral pattern is a separate, tunable, removable vector. Key steal: NEGATE unwanted behaviors (subtract τ_suggesting). Key steal: ICL as warm start for fine-tuning (ICL task vector initializes Apollo's moments). Key architecture: memory graph nodes map 1:1 to task vectors. Graph = specification, vectors = implementation, Apollo = compiler, merge recipe = build system. |
||
|---|---|---|
| .. | ||
| checkpoint | ||
| research | ||
| apollo_mini.py | ||
| apollo_worker.py | ||
| DESIGN.md | ||
| export_weights.py | ||
| first_training_step.py | ||
| start_vllm_with_apollo.sh | ||
| train.py | ||
| training_example.py | ||
| vllm_export_hook.py | ||
| weight_mapping.py | ||