consciousness

Kent Overstreet f06c8077e1 research: latent reasoning integration plans for Qwen 3.5 27B	2026-04-12 15:50:09 -04:00

Two research documents:

latent-reasoning-integration-plan.md: Synthesizes 10+ papers on latent reasoning, identifies which approaches work with finetuning (vs requiring pretraining from scratch), and maps them to our APOLLO-Mini training pipeline.

pause-tokens-gdn-recurrence.md: Explores the connection between token-based latent reasoning and GDN's internal recurrence. Key insight: pause tokens on Qwen 3.5 trigger both forward passes AND recurrent state updates, giving double benefit.

Co-Authored-By: Proof of Concept <poc@bcachefs.org>
checkpoint	Trim unused deps	2026-04-05 06:06:38 -04:00
research	research: latent reasoning integration plans for Qwen 3.5 27B	2026-04-12 15:50:09 -04:00
apollo_mini.py	apollo: rewrite optimizer from paper's math + add research analysis	2026-03-31 00:54:17 -04:00
apollo_worker.py	apollo: make rank configurable (default 1 = Mini, higher ranks for experimentation)	2026-03-30 22:06:31 -04:00
DESIGN.md	DESIGN.md: complete rewrite reflecting validated architecture	2026-03-31 00:42:53 -04:00
export_weights.py	apollo-mini training system: initial implementation	2026-03-30 22:02:37 -04:00
extract_steering_vector.py	steering vector extraction script — answering Q5 experimentally	2026-03-31 02:28:18 -04:00
first_training_step.py	first_training_step.py: ready for Kent to run	2026-03-31 01:59:52 -04:00
start_vllm_with_apollo.sh	vllm launcher with apollo hook	2026-03-30 22:24:02 -04:00
train.py	apollo-mini training system: initial implementation	2026-03-30 22:02:37 -04:00
training_example.py	apollo-mini training system: initial implementation	2026-03-30 22:02:37 -04:00
vllm_export_hook.py	apollo-checkpoint: efficient diff-based GPU weight checkpointing	2026-03-30 22:53:17 -04:00
weight_mapping.py	weight_mapping: strip language_model prefix to match HF text model names	2026-03-30 23:11:03 -04:00