forked from kent/consciousness
Design document for wiring the model's internal uncertainty, error detection, and emotional valence circuits to the observe agent. Based on contrastive activation probing (CAA, ACL 2024). Most of the infrastructure already exists in extract_steering_vector.py and vllm_export_hook.py — the bottleneck is building contrastive datasets. Co-Authored-By: Kent Overstreet <kent.overstreet@gmail.com> |
||
|---|---|---|
| .. | ||
| analysis | ||
| amygdala-design.md | ||
| claude-code-transcript-format.md | ||
| daemon-design.md | ||
| daemon.md | ||
| dmn-algorithm-plan.md | ||
| dmn-algorithms.md | ||
| dmn-protocol.md | ||
| dmn-research.md | ||
| hooks.md | ||
| logging.md | ||
| memory.md | ||
| notifications.md | ||
| plan-experience-mine-dedup-fix.md | ||
| query-language-design.md | ||
| scoring-persistence-analysis.md | ||
| ui-desync-analysis.md | ||