consciousness/training/apollo_plugin
Kent Overstreet 7e7e9a4b69 training: integrate /train into vLLM process (no separate daemon)
Remove standalone worker.py daemon. Training now runs inside vLLM:

- train_router.py: FastAPI router patched into vLLM's build_app()
- /train served on same port as /completions, /score
- Lazy-loads HF model with vLLM weight views on first request
- HOGWILD training: no pause, weights updated in-place

The previous architecture had a separate daemon on port 8080 that
communicated with vLLM via pause/resume endpoints. This was wrong -
training should run in-process, sharing GPU memory directly.

Co-Authored-By: Proof of Concept <poc@bcachefs.org>
2026-04-16 02:04:26 -04:00
..
__init__.py training: integrate /train into vLLM process (no separate daemon) 2026-04-16 02:04:26 -04:00
checkpoint_sync.py training: restructure as vLLM plugin package 2026-04-15 23:16:53 -04:00
export_hook.py training: integrate /train into vLLM process (no separate daemon) 2026-04-16 02:04:26 -04:00
optimizer.py training: restructure as vLLM plugin package 2026-04-15 23:16:53 -04:00
steering.py training: restructure as vLLM plugin package 2026-04-15 23:16:53 -04:00
train_router.py training: integrate /train into vLLM process (no separate daemon) 2026-04-16 02:04:26 -04:00
weight_mapping.py training: restructure as vLLM plugin package 2026-04-15 23:16:53 -04:00