consciousness

Author	SHA1	Message	Date
ProofOfConcept	ab61a502e4	research: catastrophic forgetting analysis — diversity is the primary defense	2026-03-31 00:56:58 -04:00
ProofOfConcept	ac9a9034fb	apollo: rewrite optimizer from paper's math + add research analysis Corrections from reading the full paper (arXiv:2412.05270): - Add gradient scale factor α = √(n/r) — compensates for systematic ratio between compact and original space scaling factors - Add norm-growth limiter (γ=1.01) — prevents loss spikes in early training - Refresh projection matrix every 200 steps, not every step - Channel-wise scaling for rank>1, tensor-wise for rank=1 - Scaling applies as G·diag(s), preserving gradient direction per channel Research writeup in training/research/apollo-paper-analysis.md covers: - Full mathematical derivation (equations 1-9) - Theorems 4.1 and 4.2 (JL-based approximation bounds) - Why Apollo can beat AdamW (directional sharpness, Hessian spectra) - Fine-tuning results (matches AdamW at 0 memory cost) - Ablation studies (rank, scaling granularity, projection method) - Implications for our behavioral fine-tuning use case	2026-03-31 00:54:17 -04:00
ProofOfConcept	60e61555c7	DESIGN.md: complete rewrite reflecting validated architecture HOGWILD (no pause), rank-256, channel scaling, CUDA IPC validated (851/851 params, forward+backward confirmed), dream-loop-as-trainer, Anthropic instruction stripping method, diversity as regularization, in-place checkpoint sync, three-tier training pipeline.	2026-03-31 00:42:53 -04:00
ProofOfConcept	2ecf4e21ff	weight_mapping: strip language_model prefix to match HF text model names	2026-03-30 23:11:03 -04:00
ProofOfConcept	6fb9735def	weight_mapping: fix name prefix, add attention QKV dims	2026-03-30 23:09:08 -04:00
ProofOfConcept	d0883e101b	checkpoint: sync live weights back into model safetensors in-place mmap each safetensors file, diff block-by-block against live GPU weights, memcpy only changed blocks. No separate checkpoint files — the model directory IS the checkpoint. Every 10 min via cron.	2026-03-30 22:55:23 -04:00
ProofOfConcept	c1245ab139	apollo-checkpoint: efficient diff-based GPU weight checkpointing Rust tool that mmaps previous checkpoint, diffs against live GPU weights (via CUDA IPC handles), and only writes changed blocks. For small behavioral training steps, turns 54GB write into ~500MB. Also includes vllm_export_hook.py with direct source patch approach — exports IPC handles from vLLM's worker subprocess after model load. Run every 10 minutes via cron to protect against vLLM crashes. Daily rsync to moria for long-term storage.	2026-03-30 22:53:17 -04:00
ProofOfConcept	5f41898bb8	vllm launcher with apollo hook	2026-03-30 22:24:02 -04:00
ProofOfConcept	0402a9333c	vllm weight export hook: monkey-patches model runner to save IPC handles on load	2026-03-30 22:20:04 -04:00
ProofOfConcept	8e7b4a22db	apollo: default rank 256 — 0.25% compute cost, captures gradient structure across 100+ examples	2026-03-30 22:16:34 -04:00
ProofOfConcept	e1cd4fb0ab	apollo: make rank configurable (default 1 = Mini, higher ranks for experimentation)	2026-03-30 22:06:31 -04:00
ProofOfConcept	c5d7d8cb5d	apollo-mini training system: initial implementation Core components for online fine-tuning of Qwen3.5-27B with CUDA IPC shared weight memory between vLLM and the training process: - apollo_mini.py: rank-1 optimizer (SGD memory, AdamW quality) - apollo_worker.py: HTTP daemon coordinating training with vLLM - weight_mapping.py: vLLM merged → HF separate layout (zero-copy views) - training_example.py: tokenization with chat template - export_weights.py: CUDA IPC handle export from vLLM - train.py: standalone training script (alternative to daemon) - DESIGN.md: architecture and protocol documentation Validated: CUDA IPC autograd works on real Qwen3.5 weights (B200). Apollo-Mini rank-1 projection + scaling + in-place update confirmed. Co-Authored-By: Kent Overstreet <kent.overstreet@gmail.com>	2026-03-30 22:02:37 -04:00
ProofOfConcept	13453606ae	refactor: runner owns stream routing, suppress tool call XML from display Split the streaming pipeline: API backends yield StreamEvents through a channel, the runner reads them and routes to the appropriate UI pane. - Add StreamEvent enum (Content, Reasoning, ToolCallDelta, etc.) - API start_stream() spawns backend as a task, returns event receiver - Runner loops over events, sends content to conversation pane but suppresses <tool_call> XML with a buffered tail for partial tags - OpenAI backend refactored to stream_events() — no more UI coupling - Anthropic backend gets a wrapper that synthesizes events from the existing stream() (TODO: native event streaming) - chat_completion_stream() kept for subconscious agents, reimplemented on top of the event stream - Usage derives Clone Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-03-29 21:22:42 -04:00
ProofOfConcept	912626c5f0	config: CLI --api-base and --api-key override config file Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-03-29 20:58:53 -04:00
ProofOfConcept	2a64d8e11f	move leaked tool call recovery into build_response_message Tool call parsing was only in runner.rs, so subconscious agents (poc-memory agent run) never recovered leaked tool calls from models that emit <tool_call> as content text (e.g. Qwen via Crane). Move the recovery into build_response_message where both code paths share it. Leaked tool calls are promoted to structured tool_calls and the content is cleaned, so all consumers see them uniformly. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-03-29 20:57:59 -04:00
ProofOfConcept	39b07311e6	logs: consolidate all logging under ~/.consciousness/logs/ All log output was scattered across ~/.consciousness/memory/ (daemon, task logs, LLM call logs), ~/.consciousness/agent-sessions/ (observe), and only hook logs were already in the right place. Move everything to ~/.consciousness/logs/ with agent-specific subdirs: - daemon.log, daemon/ (task logs) - {agent_name}/ (knowledge agent logs, e.g. surface-observe/, reflect/) - llm/{caller}/ (LLM call logs) - observe.log (poc-agent observe) - hook-{session_id} (already correct) - debug.log (already correct) Also includes the session.rs and hook.rs fixes from the previous session (sessions dir → ~/.consciousness/sessions/). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-03-28 20:39:20 -04:00
Kent Overstreet	0d2bf81a50	consciousness: identity files load from ~/.consciousness/identity/ Separate identity files (loaded via source: "file" in context_groups) from the memory store (data_dir). New identity_dir config field, defaults to ~/.consciousness/identity/. Also restrict subconscious agents to memory-only tools — no filesystem write access. This prevents agents from creating stray .md files in the memory directory. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-03-28 19:49:13 -04:00
ProofOfConcept	35d925186d	consciousness: update hardcoded paths from ~/.claude to ~/.consciousness - thalamus/src/idle.rs: dream-start.sh path - src/agent/dmn.rs: telegram/send.sh path Part of the directory migration to make this an independent project.	2026-03-27 21:32:28 -04:00
ProofOfConcept	2b6c68bab2	update docs to reference ~/.consciousness/ paths Update README, config example, and all documentation to reference the new ~/.consciousness/ directory layout instead of ~/.claude/.	2026-03-27 21:30:34 -04:00
ProofOfConcept	c3cd27ec22	move poc-agent session dir from cache to ~/.consciousness/ session_dir() was using dirs::cache_dir() with /tmp fallback. Move to ~/.consciousness/agent-sessions/ alongside everything else.	2026-03-27 21:29:39 -04:00
ProofOfConcept	f0af319e0d	move telegram and remaining tmp paths to ~/.consciousness/ - Telegram data: ~/.consciousness/telegram/ - Rate limiter file: ~/.consciousness/cache/ - parse-claude-conversation stash: ~/.consciousness/sessions/ No more /tmp/ for persistent state, no more ~/.claude/ for our data.	2026-03-27 21:26:28 -04:00
ProofOfConcept	bf5b495632	move daemon, IRC, and remaining state to ~/.consciousness/ - Daemon socket/pid/log: ~/.consciousness/daemon.{sock,pid}, logs/daemon.log - Daemon config: ~/.consciousness/daemon.toml - Daemon state: ~/.consciousness/daemon-state.json - IRC logs: ~/.consciousness/irc/logs/ - No more .claude/ references except Claude Code integration points (projects, settings, hooks, telegram, CLAUDE.md)	2026-03-27 21:11:02 -04:00
ProofOfConcept	ccf13c3cb5	cleanup: remove dead migrate module, fix stale comment migrate.rs was a one-time markdown→capnp conversion that's long done. Remove it and update the identity.rs comment to reference the new ~/.consciousness/ path.	2026-03-27 21:08:40 -04:00
ProofOfConcept	6a1660cc9d	move data home from ~/.claude/memory to ~/.consciousness The consciousness project should stand independently of Claude Code. All data, logs, sessions, and agent state now live under ~/.consciousness/ instead of being scattered across ~/.claude/memory/, /tmp/claude-memory-search/, ~/.config/poc-memory/, and ~/.cache/. Layout: ~/.consciousness/ .capnp, .bin, *.rkyv — store files sessions/ — per-session state (seen sets, cookies) logs/ — all logs (hook, agent, debug, dream) agents/ — agent runtime state (pid files, output) notifications/ — notification state cache/ — transient data Things that stay in ~/.claude/: - projects/ (Claude Code transcripts) - hooks/ (Claude Code hook system) - telegram/ (shared integration) - irc/ (shared integration) - settings.json (Claude Code settings) Debug log moves from /tmp/ to ~/.consciousness/logs/debug.log. Session state moves from /tmp/claude-memory-search/ to sessions/. Notifications move from ~/.claude/notifications/ to notifications/.	2026-03-27 21:07:17 -04:00
ProofOfConcept	8ee0d90388	move memory_search from hippocampus to subconscious/hook memory_search.rs is agent orchestration (surface-observe, journal, reflect cycles), not memory storage. Rename to hook.rs and move to subconscious/ where it belongs. Backward compat: pub use subconscious::hook as memory_search in lib.rs so existing crate::memory_search paths still resolve.	2026-03-27 20:50:24 -04:00
ProofOfConcept	3a8383ba37	journal: wire standalone agent into hook cycle Add journal_cycle() to memory_search.rs, triggered every 20KB of transcript growth. Runs independently of the surface-observe pipeline so it doesn't depend on the 5-step pipeline surviving bail checks. Journal agent doesn't inject output into conversation context (unlike surface and reflect) — it just writes episodic memory entries.	2026-03-27 20:41:41 -04:00
ProofOfConcept	43f0abeaec	journal: split out as standalone agent, add {{bash:}} placeholder Journal was step 5 of the surface-observe pipeline but never ran because the bail check stopped the pipeline before reaching it. Split into its own agent with: - {{conversation:50000}} for recent conversation - {{bash:poc-memory tail -p surface-observe 10}} for observe context - {{latest_journal}} for previous entry continuity Add generic {{bash:COMMAND}} placeholder to agent template resolver so agents can include shell command output in their prompts. Remove journal phase from surface-observe.agent (now 4 steps).	2026-03-27 20:39:03 -04:00
ProofOfConcept	92ca2bf2c8	provenance: pass directly through thought::dispatch, remove globals Provenance now flows as a function parameter through the entire tool dispatch chain: thought::dispatch → memory::dispatch → store methods. Removed task_local (TASK_AGENT), thread_local (TASK_PHASE), and env var (POC_PROVENANCE) from the tool dispatch path. The env var remains only as a fallback for non-tool paths (CLI commands, digest). Phase names are passed from knowledge.rs → llm.rs → api.rs, and api.rs updates the provenance string between steps. No globals needed.	2026-03-27 15:44:39 -04:00
ProofOfConcept	36bde60ba0	thought: wire up agent and subconscious to use shared tools - agent/tools/mod.rs: remove duplicated tool implementations, delegate to thought::dispatch for shared tools, keep only agent-specific tools (control, vision, working_stack) - subconscious/api.rs: replace duplicated memory/tool dispatch with thought::dispatch, use thought::all_definitions() for tool schemas - Delete agent/tools/{bash,read,write,edit,grep,glob_tool,journal,memory}.rs (now live in thought/) Both poc-agent and subconscious agents now use the same tool implementations through the thought layer. Agent-specific behavior (node tracking in runner.rs, control tools) stays in agent/.	2026-03-27 15:27:33 -04:00
ProofOfConcept	bfc558893a	thought: create shared cognitive substrate module New src/thought/ module containing tools and infrastructure shared between poc-agent and subconscious agents: memory operations, file tools, bash, context window management. Currently coexists with agent/tools/ — next step is to wire up both agent/ and subconscious/ to use thought::dispatch instead of duplicating the routing logic. Move dbglog macro to lib.rs so it's available crate-wide regardless of module compilation order.	2026-03-27 15:22:48 -04:00
ProofOfConcept	2615289672	idle nudge: always warm, append dream reminder when needed Instead of two different messages (dreaming vs non-dreaming), always start with the friendly autonomous time message and append the dream nudge only when the threshold is exceeded.	2026-03-27 15:13:34 -04:00
ProofOfConcept	85302c11d4	provenance: track agent phase, use task_local + thread_local Split TASK_PROVENANCE into TASK_AGENT (task_local, set once per agent run) and TASK_PHASE (thread_local, updated between steps). Provenance now reports "agent:surface-observe:observe" instead of just "agent:surface-observe", making it possible to identify which pipeline phase created a node. Priority: task_local agent + thread_local phase > POC_PROVENANCE env var > "manual". Also includes memory_search catchup throttle and pipelining fixes from the surface-observe refactor.	2026-03-27 15:11:17 -04:00
ProofOfConcept	b1efdf0b9a	surface-observe: reduce duplicate creation, improve journal witnessing - Add "different nodes should be about different things" guard to observe - Clarify journal prompt: write about conscious self, not agent work - Add "write about what happened and how it felt" instruction - Simplify surface prompt focus guidance	2026-03-27 15:11:04 -04:00
ProofOfConcept	37acb9502d	rename agent: fix tool calls and target override - Add memory_rename tool (in-place rename, preserves content and links) - Update rename.agent prompt to use memory_rename() instead of text output - Fix {{rename}} placeholder to respect --target keys when provided - Add format_rename_targets() for targeted rename runs	2026-03-27 15:10:55 -04:00
ProofOfConcept	bb2e3b9fbb	session: add TranscriptInfo struct, consolidate transcript lookups TranscriptInfo provides cached transcript metadata (path, size) with a single read. Replaces scattered fs::metadata calls in surface_observe_cycle, reflection_cycle, resolve_conversation, and resolve_memory_ratio. Session::transcript() resolves the path from transcript_path or by searching projects dir, returning a TranscriptInfo. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 23:24:25 -04:00
ProofOfConcept	8ccc30d97e	hook: catchup throttle and reflection agent Catchup throttle: when the agent is >50% behind the conversation window (>25KB of transcript growth since last spawn), block and wait up to 30s for the current agent to finish. Prevents the agent from falling behind during heavy reading/studying. Reflection agent: runs every 100KB of transcript growth. Reads walked nodes from surface-observe, follows links in unexpected directions, outputs a short dreamy insight. Previous reflections are injected into the conversation context. Updated reflect.agent prompt to use {{input:walked}} from surface-observe state dir and {{conversation:20000}} for lighter context. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 22:09:44 -04:00
ProofOfConcept	27861a44e5	surface: tag recent nodes as (new) instead of hiding them Links to nodes created after the conversation window start are tagged with (new) in memory_render output. The surface prompt tells the agent not to surface these — they're its own recent output, not prior memories. Observe can still see and update them. POC_MEMORIES_OLDER_THAN env var set from the oldest message timestamp in the conversation window. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 21:19:19 -04:00
Kent Overstreet	7fc1d60113	Delete obsolete agents Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 20:13:10 -04:00
ProofOfConcept	5647842412	journal_new: separate name from title, dedup keys - journal_new(name, title, body): name becomes the node key, title goes in the ## heading. Agent picks short searchable names. - Auto-dedup: if the key exists, append -2, -3, etc. - CLI journal write also requires a name argument now. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 19:24:19 -04:00
ProofOfConcept	8eaf4c5956	digest: use created_at instead of timestamp for date matching Episodic entries should be grouped by creation date, not last update date. Fixes digest generation potentially assigning updated entries to the wrong day. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 19:18:14 -04:00
ProofOfConcept	eac59b423e	journal: remove all stringly-typed key patterns, use NodeType - journal_new: key is slugified title (agent names things properly) - journal_tail: sort by created_at (immutable), not timestamp (mutable) - journal_update: find latest by created_at - {{latest_journal}}: query by NodeType::EpisodicSession, not "journal" key - poc-memory journal write: requires a name argument - Removed all journal#j-{timestamp}-{slug} patterns from: - prompts.rs (rename candidates) - graph.rs (date extraction, organize skip list) - cursor.rs (date extraction) - store/mod.rs (doc comment) - graph.rs organize: filter by NodeType::Semantic instead of key prefix - cursor.rs: use created_at for date extraction instead of key parsing Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 19:11:17 -04:00
ProofOfConcept	85fa54cba9	journal tools: use NodeType instead of string key matching - journal_new: create EpisodicSession node with auto-generated key - journal_tail: query by node_type, not by parsing a monolithic node - journal_update: find latest EpisodicSession by timestamp - No string key matching anywhere — all typed - Fixes journal entries not appearing in 'poc-memory journal tail' - Also: added --provenance/-p filter to 'poc-memory tail' - Also: fix early return in surface_observe_cycle store load failure - Also: scale max_turns by number of steps (50 per step) Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 18:41:10 -04:00
ProofOfConcept	41fcec58f0	main: replace giant match block with Run trait dispatch Each subcommand enum (Command, NodeCmd, JournalCmd, GraphCmd, CursorCmd, DaemonCmd, AgentCmd, AdminCmd) now implements a Run trait. main() becomes `cli.command.run()`. Standalone dispatch functions (cmd_cursor, cmd_daemon, cmd_experience_mine) inlined into their enum's Run impl. No functional changes. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 18:00:23 -04:00
ProofOfConcept	3e410347a2	api: retry transient connection errors, misc fixes - Retry up to 5 times with exponential backoff (2s, 4s, 8s, 16s) on transient errors: IncompleteMessage, connection closed/reset/ refused, timeouts. Non-transient errors fail immediately. - tail command: print to stdout instead of stderr - state_dir rename: output_dir → state_dir throughout knowledge.rs Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 17:48:44 -04:00
ProofOfConcept	5d803441c9	cleanup: kill dead code, fix signal handler safety - Remove unused now_secs(), parse_json_response, any_alive, Regex import - Signal handler: replace Mutex with AtomicPtr<c_char> for signal safety (Mutex::lock in a signal handler can deadlock if main thread holds it) - PidGuard Drop reclaims the leaked CString; signal handler just unlinks - scan_pid_files moved to knowledge.rs as pub helper - setup_agent_state calls scan_pid_files to clean stale pids on startup Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 15:58:59 -04:00
ProofOfConcept	52703b4637	agents: bail script support, pid file simplification, cleanup - Bail command moved from hardcoded closure to external script specified in agent JSON header ("bail": "bail-no-competing.sh") - Runner executes script between steps with pid file path as $1, cwd = state dir. Non-zero exit stops the pipeline. - PID files simplified to just the phase name (no JSON) for easy bash inspection (cat pid-) - scan_pid_files helper deduplicates pid scanning logic - Timeout check uses file mtime instead of embedded timestamp - PID file cleaned up on bail/error (not just success) - output() tool validates key names (rejects pid-, /, ..) - Agent log files append instead of truncate - Fixed orphaned derive and doc comment on AgentStep/AgentDef - Phase written after bail check passes, not before Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 15:20:29 -04:00
ProofOfConcept	e20aeeeabe	agents: phase tracking, pid files, pipelining, unified cycle - AgentStep with phase labels (=== PROMPT phase:name ===) - PID files in state dir (pid-{PID} with JSON phase/timestamp) - Built-in bail check: between steps, bail if other pid files exist - surface_observe_cycle replaces surface_agent_cycle + journal_agent_cycle - Reads surface output from state dir instead of parsing stdout - Pipelining: starts new agent if running one is past surface phase - link_set upserts (creates link if missing) - Better error message for context window overflow Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 14:48:42 -04:00
ProofOfConcept	11289667f5	agents: add surface-observe pipeline and agent definition surface-observe.agent: three-step pipeline (surface → observe → journal) Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 14:27:14 -04:00
ProofOfConcept	84c78f7ae1	session: --session flag, journal agent cycle, conversation bytes config - memory-search: add --session flag for multi-session support - config: add surface_conversation_bytes option - journal_agent_cycle: trigger journal agent every 10KB of conversation - Session::from_id() constructor Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-26 14:22:29 -04:00
ProofOfConcept	7c0c376e0f	render: use backtick-quoted keys and tool call format in link footer Links now display as \`key\` instead of bare text, and overflow shows memory_links() tool call format instead of CLI command. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-26 14:22:21 -04:00

1 2 3 4 5 ...

528 commits