consciousness

Author	SHA1	Message	Date
Kent Overstreet	43e06daa5b	cleanup: drop dead ApiClient::stream_completion wrapper, silence dmn_tick stream_completion was a thin wrapper around stream_completion_mm (just passing an empty image list); the last caller switched to _mm directly when learn's generate_alternate gained image support. Delete the wrapper — callers can pass `&[]` if they have no images. MindState::dmn_tick has been sitting unused (called only from a commented-out block in the Mind loop). Rename to _dmn_tick so the compiler stops warning; Kent may uncomment the call path later. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-17 16:23:59 -04:00
Kent Overstreet	2b03dbb200	user: F7 compare screen Side-by-side model comparison against the current conversation context. Built on the MindTriggered pattern — F7 drops in as one more CompareScoring flow next to MemoryScoring / FinetuneScoring. Motivation: we have the VRAM on the b200 to load two versions of the same family simultaneously (e.g. Qwen3.5 27B bf16 and q8_k_xl). Rather than trust perplexity/KLD numbers on a generic corpus, we can measure divergence on our actual conversations: for each assistant response, ask the test model what it would have said given the same prefix, and eyeball the diffs. - config.compare.test_backend — names an entry in the existing backends map to use as the test model. Empty = F7 reports "(unset)" and does nothing. - subconscious::compare::{score_compare_candidates, CompareCandidate, CompareScoringStats, CompareScoring}. For each assistant response, gen_continuation runs with the test client against the same prefix the original response saw; pairs stream into shared.compare_candidates as they complete. - user::compare::CompareScreen — F7 in the screen list. c/Enter triggers a run; list/detail layout mirroring F6, detail shows prior context / original / test-model alternate. No persistence yet — each F7 run regenerates. Caching via a context manifest (so we can re-view without re-burning generation) is the natural follow-up; for now light usage is fine. Also reusable later for validating finetune checkpoints: same pattern, swap the test backend for the new checkpoint, watch where it diverges from the base. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-17 16:12:26 -04:00
Kent Overstreet	575325e855	mind: MindTriggered trait for background scoring flows Mind's impl had accumulated ~50 lines of setup glue per scoring flow (memory, memory-full, finetune): snapshot config, clone handles, resolve context, spawn task, route results back through BgEvent, write stats. The shape was identical; only the middle changed. Introduce the MindTriggered trait: pub trait MindTriggered { fn trigger(&self); } Each flow becomes a struct next to its scoring code that owns its dependencies and a JoinHandle (behind a sync Mutex for interior mutability): subconscious::learn::MemoryScoring (Score, ScoreFull) subconscious::learn::FinetuneScoring (ScoreFinetune) Mind holds one of each and dispatches in one line: MindCommand::Score => self.memory_scoring.trigger(), MindCommand::ScoreFull => self.memory_scoring.trigger_full(), MindCommand::ScoreFinetune => self.finetune_scoring.trigger(), Each struct picks its own trigger semantics — memory scoring is no-op-if-running (!handle.is_finished()); finetune is abort-restart. Falls out: - BgEvent / bg_tx / bg_rx disappear entirely. Tasks write directly to their slice of MindState and call agent.state.changed.notify_one() to wake the UI. The bg_rx arm in Mind's select loop is gone. - agent.state.memory_scoring_in_flight was duplicating shared.scoring_in_flight via BgEvent routing; now the JoinHandle alone tells us, and shared.scoring_in_flight is written directly by the task for the UI. - start_memory_scoring / start_full_scoring / start_finetune_scoring methods on Mind are deleted; Mind no longer knows the setup shape of any scoring flow. - FinetuneScoringStats moves from mind/ to subconscious/learn.rs next to the function that produces it. No behavior change — same flows, same trigger points, same semantics. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-17 16:12:26 -04:00
ProofOfConcept	0d1044c2e8	mind: trigger incremental scoring on startup + log persist path Two changes to make scoring debuggable and self-starting: 1. init() kicks off start_memory_scoring() after restore_from_log + load_memory_scores. No user message needed to exercise the incremental path. 2. Diagnostic logging around the on_score persist path: - [scoring] persisted K → N.NNN (Section[i]) read_back=Some(...) when find_memory_by_key succeeds and set_score stores the score (with a read-back check on the leaf). - [scoring] DROP K: find_memory_by_key None (id=N, cv=M) when the scored key isn't findable in the live context — with section sizes to diagnose whether content shrank. - [scoring] snapshot size=N contains(K)=true/false after collect_memory_scores, to catch the case where set_score claims to have written but collect doesn't see it. - [scoring] about to save N entries - save_memory_scores now also logs serialize/write errors so a silent write failure isn't invisible. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 20:47:16 -04:00
Kent Overstreet	592a3e2e52	config: move user_name/assistant_name to AppConfig (top level) These are identity settings, not memory-graph settings. Sat inside the \`memory\` section only because that's where Config started life. Move to AppConfig alongside the other top-level stuff. Readers now pull from \`config::app()\` instead of \`config::get()\`. subconscious/defs.rs's conversation-building pass still needs Config for surface_conversation_bytes, so both guards coexist there — AppConfig's guard is dropped before the per-step await loop so we don't stall the config-watcher's writer. show_config picks up the two new fields at the top of its output. Kent's config already has them hoisted to the top level. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 16:20:17 -04:00
Kent Overstreet	60de579305	config: unify subconscious API resolution with the main chat path Two parallel backend-resolution paths had drifted apart: - Main chat: AppConfig::resolve_model() → a named BackendConfig in AppConfig.backends - Subconscious / oneshot / context_window(): four skip-serde "cache" fields on Config (memory section) — api_base_url, api_key, api_model, api_context_window — that used to be populated at Config::try_load_shared time by walking memory.agent_model → root.models[name] → root[backend_name] When we renamed `models` to `backends` and collapsed ModelConfig into BackendConfig, the latter chain started silently dereferencing `root.get("models")` → None → no population. Subconscious agents fell through the "API not configured" guard; context_window() started returning 0 (since api_context_window default is u64's 0 now that we don't populate it). It was only visibly working for the main chat. Collapse to one path: - Drop Config.agent_model (duplicate of AppConfig.default_backend) - Drop Config.{api_base_url, api_key, api_model, api_context_window} — no longer populated, no longer needed - Drop default_context_window() — nobody reads the field anymore - Drop the memory-side resolution block in try_load_shared() - Subconscious (mind/unconscious.rs) and oneshot (agent/oneshot.rs) now call load_app() + resolve_model(&app.default_backend) just like the main chat does - context_window() reads from config::app().backends[default_backend] .context_window, defaulting to 128k only if the backend doesn't specify one Side effect: Kent's config file drops agent_model, api_reasoning, journal_days, journal_max — all fields whose Rust counterparts are now gone. (Figment tolerates unknown fields, so leaving them wouldn't have broken anything, but they were lying about what's configurable.) Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 16:02:43 -04:00
Kent Overstreet	2989a6afaa	config: drop dead code and collapse to a single backend Config had accumulated several obsolete fields, a legacy load path that was just returning defaults, and multi-backend infrastructure that's no longer used. Removed from Config (memory section): - load_legacy_jsonl() — just returned Config::default(), no callers - The legacy-fallback branch in load_from_file - surface_hooks, surface_timeout_secs — zero external readers - scoring_chunk_tokens + default fn — zero external readers - The POC_MEMORY_CONFIG env override note in the header comment (not actually wired up anywhere) Collapsed multi-backend to single-backend: - AppConfig used to carry `anthropic: BackendConfig` and `openrouter: BackendConfig` as required fields plus an optional `deepinfra`, picked between at runtime by name. Only one is ever actually used in any deployment. Collapse to a single `backend: BackendConfig` on AppConfig, drop the multi-backend match logic in resolve_model, drop the top-level `backend: String` selector field, drop the `BackendConfig::resolve` fallback path. - Also drop BackendConfig.model (redundant with ModelConfig.model_id once multi-backend is gone). - ModelConfig.backend field goes — there's only one backend now, no choice to make. Dead prompt_file machinery: - ModelConfig.prompt_file, ResolvedModel.prompt_file, SessionConfig .prompt_file, Agent.prompt_file — nothing in the codebase actually reads the file these strings name. Just passed around and compared. Delete the whole string through every struct. - The "if prompt_file changed on model switch, recompact" branch in user/chat.rs goes too (never fired usefully). Dead memory_project plumbing: - AppConfig.memory_project field, CliArgs.memory_project, the --memory-project CLI flag, the figment merge target, the show_config display line. Nothing reads it anywhere. Dead ContextInfo struct: - `struct ContextInfo` was never constructed — context_info: None was the only initializer. The conditional display blocks in user/context.rs that dereferenced it were dead. Behavior change: AppConfig::resolve() now requires a non-empty `models` map and bails with a helpful message if it's missing. The old fallback ("no models? use top-level backend + PromptConfig to build a default") path is gone — it was only kept for symmetry with a mode nobody used. Config file shape: `deepinfra: {...}` → `backend: {...}`, and model entries no longer need `backend:` or `prompt_file:`. Updated ~/.consciousness/config.json5 to match. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 15:41:55 -04:00
Kent Overstreet	313f85f34a	config: global writable AppConfig; learn settings live there Runtime-mutable settings (F6's threshold knob, the generate-alternates toggle, anything else that comes along) were ending up as mirrored fields on MindState — each new config setting grew MindState::new's signature and added a clone+sync path. Wrong home. MindState is ephemeral session state, not a config projection. Give AppConfig the same treatment the memory Config has: install it into a global RwLock<AppConfig> at startup via load_app, read through config::app() (returns a read guard), mutate through update_app. The config_writer functions now write to disk AND update the cache atomically, so the one-stop-shop call keeps both in sync. Also while in here: - learn.generate_alternates moves from a sentinel file (~/.consciousness/cache/finetune-alternates, "exists = enabled") into the config under the learn section. On first run with this build, if the sentinel file still exists Mind::new flips the config value to true and removes it. Drops alternates_enabled()/set_alternates(). - Default threshold 0.0000001 → 1.0. With the timestamp filter removed the previous value was letting essentially everything through; 1.0 is a sane "nothing gets through unless you actually want it" default. - score_finetune_candidates takes generate_alternates as a parameter instead of reading a global — caller snapshots the config values once at the top of start_finetune_scoring so the async task doesn't need to hold the config read lock across awaits. - MindState.learn_threshold / learn_generate_alternates gone; the SetLearn* command handlers now just delegate to config_writer. Kent noted RwLock<Arc<AppConfig>> (the pattern used by the memory Config global) is pointless here — nobody needs a snapshot-after- release, reads are short — so this uses a plain RwLock<AppConfig> and returns a read guard. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 12:53:22 -04:00
Kent Overstreet	343e43afab	learn: stream candidates to UI, update status during alternate gen With the timestamp filter gone (previous commit), score_finetune_candidates started returning the actual ~100+ candidates per scoring run. The existing code generated alternates for all of them in a tight loop before returning anything, leaving the status line stuck on "finetune: scoring N responses..." for ~100s of seconds while the B200 was pegged. Two fixes: 1. score_finetune_candidates now takes an ActivityGuard and a callback. Candidates are emitted one-at-a-time as they complete (after their alternate if that's enabled, immediately otherwise). The activity status updates to "finetune: generating alternate N/M" during the alternate-gen phase so it's clear what's happening. 2. BgEvent::FinetuneCandidates(Vec<_>) → FinetuneCandidate(one). Each emitted candidate is pushed onto shared.finetune_candidates; the UI tick picks it up and renders it on the next frame. start_finetune_scoring clears the previous run's list at the top so each run is fresh. Return type changes from (Vec, f64) → (usize, f64) — the count above threshold is all the caller still needs since the candidates stream through the callback. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 12:44:25 -04:00
Kent Overstreet	080b4f9084	context: tighten timestamp schema; every AstNode has one Previously NodeLeaf.timestamp and AstNode::Branch.timestamp accepted null or missing via a deserialize_timestamp_or_epoch fallback — legacy entries in conversation.jsonl from before Branch timestamps existed (and from before chrono serialization was wired up) would load with UNIX_EPOCH as a sentinel. Downstream, node_timestamp_ns() returned Option<i64> and callers had to handle None as "old entry, skip." That second filter was silently dropping every candidate in score_finetune_candidates when scoring an older session — the F6 screen showed "0 above threshold" even when max_divergence was orders of magnitude above the threshold, because every entry was failing the None check, not the divergence check. The fix, in three parts: 1. src/bin/fix-timestamps.rs — one-off migration tool that walks a conversation.jsonl, linearly interpolates timestamps for entries stuck at UNIX_EPOCH (using surrounding real timestamps as anchors), propagates to child leaves with per-sibling ns offsets, and bumps any collisions by 1 ns for uniqueness. Ran against the current session's log: 11887 entries, 72289 ns bumps, all unique. 2. context.rs — drop default_timestamp and deserialize_timestamp_or_epoch. NodeLeaf and Branch now require a present non-null timestamp on deserialize. Tests flip from "missing/null → UNIX_EPOCH" to "missing/null → Err." 3. subconscious/learn.rs — node_timestamp_ns now returns i64, not Option<i64>. The matching caller in score_finetune_candidates collapses from a Some/None match to a single trained-set check. mind/log.rs's oldest_timestamp no longer filters UNIX_EPOCH. Every line currently on disk has already been migrated. Going forward, new AstNodes always carry real timestamps (Utc::now() at construction time), so the strict schema is the invariant, not an aspiration. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 12:35:16 -04:00
Kent Overstreet	e5dd8312c7	learn: F6 screen — scoring stats, ActivityGuard, configurable threshold Three changes that together reshape the F6 fine-tune-review screen: 1. Finetune scoring reports through the standard agent activity system instead of a separate finetune_progress String. The previous design ran an independent progress field that forced a cross-lock dance and bespoke UI plumbing. start_finetune_scoring now uses start_activity + activity.update, so the usual status line and notifications capture scoring progress uniformly with other background work. 2. MindState gains a FinetuneScoringStats snapshot (responses seen, above threshold, max divergence, error). The F6 empty screen shows this instead of a loading message — so after a scoring run that produced zero candidates, you can see why (e.g., max_divergence below threshold). 3. The divergence threshold is configurable from F6 via +/- hotkeys (scales by 10×) and persisted to ~/.consciousness/config.json5 via config_writer::set_learn_threshold. AppConfig grows a learn section with a threshold field (default 1e-7). Also: user/mod.rs no longer uses try_lock() for the per-tick unconscious/mind state sync — we fixed the locking hot paths that made try_lock necessary, so lock().await is now the right choice. And subconscious::learn::score_finetune_candidates now returns (candidates, max_divergence) so the stats can be populated. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 11:49:26 -04:00
Kent Overstreet	2b632d568b	learn: nanosecond timestamps, token ranges for /score Two related changes to the learn subsystem: 1. AST node timestamps are now non-optional — both Leaf and Branch variants carry a DateTime<Utc>. UNIX_EPOCH means "unset" (old entries deserialized from on-disk conversation logs). Training uses timestamps as unique keys for dedup, so we promote to nanosecond precision: node_timestamp_ns(), TrainData.timestamp_ns, FinetuneCandidate.timestamp_ns, mark_trained(ns). 2. build_token_ids() now also returns token-position ranges of assistant messages. These are passed to vLLM's /score endpoint via the new score_ranges field so only scored-position logprobs are returned — cuts bandwidth/compute when scoring small windows. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 11:48:37 -04:00
Kent Overstreet	50b7b3a33a	F6 learn screen: fine-tuning candidate review Wire up divergence scoring to identify responses that depend heavily on memories the model hasn't internalized. These are candidates for fine-tuning. - Score finetune candidates automatically after each turn - Track trained responses by timestamp to prevent overtraining - F6 screen shows candidates with divergence scores - j/k nav, a=approve, r=reject, g=toggle alternate gen, s=send - Additive sync preserves approval status across ticks - Keeps 10 most recent rejected, removes sent The 's' key currently just marks as trained locally — actual /finetune endpoint call to follow. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 02:04:26 -04:00
Kent Overstreet	81e0632cf3	DMN: wire dream hours reminder into Foraging state The hours_since_last_dream() function existed but wasn't called after refactoring moved the DMN prompts from hooks to Rust. Now shows "You haven't dreamed in X hours" when >= 18h since last dream session. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-15 21:52:20 -04:00
Kent Overstreet	7046e63b9d	Include identity nodes in memory scoring Identity memory nodes now participate in importance scoring alongside conversation memories. Score loading/saving handles both sections, and the conscious screen uses node.label() consistently for memory display. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-15 05:59:58 -04:00
Kent Overstreet	fc978e2f2e	Remove find_context_files — identity comes from memory nodes Deleted the directory-walking CLAUDE.md/POC.md loader. Identity now comes entirely from personality_nodes in the memory graph. Simplified: - assemble_context_message() takes just personality_nodes - Removed config_file_count/memory_file_count tracking - reload_for_model() → reload_context() (no longer model-specific) Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-15 03:11:27 -04:00
Kent Overstreet	a88428d642	Simplify context config: personality_nodes and agent_nodes Replace complex context_groups (with ContextGroup struct, ContextSource enum, labels, keys arrays) with simple string lists: - personality_nodes: loaded into main session context - agent_nodes: loaded into subconscious agent context Removed ~200 lines of code. The distinction between session and agent context is now just which list you're in, not a per-group flag. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-15 02:37:49 -04:00
Kent Overstreet	688e8dbc3e	Remove ContextSource::File — all identity in store now Identity files migrated to memory nodes: - identity, core-personality, reflections, where-am-i Removed: - ContextSource::File enum variant - File source parsing and handling - load_memory_file helper function Config now only supports Store and Journal sources. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-15 02:21:07 -04:00
Kent Overstreet	4d22a28794	unconscious: event-driven loop via tokio::select! Replace yield_now() polling with proper event-driven wakeups: - Add wake: Arc<Notify> to Unconscious struct - Spawned agents call wake.notify_one() on completion - Loop uses select! on: unc_rx.changed(), wake.notified(), health timer Eliminates spinning (was 27.9M iterations per interval). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 22:38:01 -04:00
Kent Overstreet	b3d0a3ab25	store: internal locking, remove Arc<Mutex<Store>> wrapper Store now has internal Mutex for capnp appends and AtomicU64 for size tracking. All methods take &self. The external Arc<Mutex<Store>> is replaced with Arc<Store>. - Store::append_lock protects file appends - local.rs functions take &Store (not &mut Store) - access_local() returns Arc<Store> - All .lock().await calls removed from callers Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 21:49:54 -04:00
Kent Overstreet	a1accc7cd4	store: remove visit tracking infrastructure Remove AgentVisit, TranscriptSegment, and all related visit tracking code. Provenance is what we've been using to track agent interaction with nodes. Also removes dead fields from Node (state_tag, created). -349 lines. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 18:57:12 -04:00
Kent Overstreet	1d88293ccf	Remove Store::cached(), consolidate on access_local() - Remove CACHED_STORE, cached(), is_stale(), set_store() - redundant - Convert all Store::cached() callers to use access_local() - Single Store::load() call remains in access() fallback path All store access now goes through hippocampus::access() / access_local(), which handles socket connection or local fallback with caching. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 18:11:58 -04:00
Kent Overstreet	b8db8754be	Convert store and CLI to anyhow::Result for cleaner error handling Replace Result<_, String> with anyhow::Result throughout: - hippocampus/store module (persist, ops, types, view, mod) - CLI modules (admin, agent, graph, journal, node) - Run trait in main.rs Use .context() and .with_context() instead of .map_err(\|e\| format!(...)) patterns. Add bail!() for early error returns. Add access_local() helper in hippocampus/mod.rs that returns Result<Arc<Mutex<Store>>> for direct local store access. Fix store access patterns to properly lock Arc<Mutex<Store>> before accessing fields in mind/unconscious.rs, mind/mod.rs, subconscious/learn.rs, and hippocampus/memory.rs. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 18:05:04 -04:00
Kent Overstreet	419bb222b5	defs.rs: remove store/graph params, use typed memory API resolve_placeholders() and run_agent() no longer take &Store. All placeholders now use async memory_render/memory_links/memory_query directly. The "siblings" placeholder uses Vec<LinkInfo> for ranking neighbors by link_strength * node_weight. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 15:18:05 -04:00
Kent Overstreet	359955f838	defs.rs: async conversion, remove block_in_place Convert resolve(), resolve_placeholders(), run_agent() to async. Use memory_render/memory_query directly with .await instead of block_in_place wrappers. Propagate async to callers: - config.rs: resolve(), load_session(), reload_for_model() - identity.rs: load_memory_files(), assemble_context_message() - oneshot.rs: run_one_agent() - prompts.rs: agent_prompt() Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 14:56:26 -04:00
Kent Overstreet	fb46ab095d	Consolidate memory RPC in tools/memory.rs - Move memory_rpc(), socket_path(), SocketConn from mcp_server.rs - Convert remaining callers to typed async API: - defs.rs: organize placeholder, run_agent query - cli/agent.rs: query resolution (now async) - mind/identity.rs: Store context loading - Re-export socket_path/memory_rpc from mcp_server for compatibility All external memory access now goes through tools/memory.rs typed API. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 13:39:59 -04:00
Kent Overstreet	de5a6672c3	cleanup: remove dead placeholder code, use RPC for identity loading - links() in memory.rs: use cached_store() instead of MemoryNode::load() - identity.rs: use memory_rpc for Store context loading - defs.rs: delete dead placeholders (topology, nodes/episodes, health, split) - agents now use {{tool: graph_topology}} etc instead - prompts.rs: delete unused format_split_plan_node() Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-13 01:22:08 -04:00
Kent Overstreet	ad59596335	cli: add memory_history, remove dump-json/edges/lookups - Add memory_history MCP tool for version history - Convert cmd_history to use memory_rpc - Add raw parameter to memory_render for editing - Remove unused: dump-json, list-edges, lookup-bump, lookups - Fix render_node path in defs.rs/subconscious.rs Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 22:24:34 -04:00
Kent Overstreet	ac6f1e9294	unconscious: move health refresh outside lock too refresh_health() was doing Store::load() + compute_graph_health() while holding the Unconscious lock, causing 12 second stalls. Split into needs_health_refresh() (quick check) and set_health() (quick store), with the slow I/O happening outside the lock. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 20:37:54 -04:00
Kent Overstreet	f40d8cfa9d	unconscious: release lock during slow spawn work Split trigger() into phases so the Unconscious mutex is only held briefly: - reap_finished(): check handles, restore completed autos - select_to_spawn(): pick agents, take their autos out - prepare_spawn(): slow work (Store::load, query, Agent::new) - NO LOCK - complete_spawn()/abort_spawn(): store results back Previously held the lock for 28+ seconds during Store::load and query execution. Now lock hold time should be milliseconds. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 20:33:23 -04:00
Kent Overstreet	f56fc3a7c7	locks: add process-wide lock hold time tracking TrackedMutex and TrackedRwLock wrappers that record hold durations by source location using #[track_caller]. Stats written to ~/.consciousness/lock-stats.json every second, sorted by max hold time. Re-exported as crate::Mutex so all locks are instrumented. To disable, swap the re-export back to tokio::sync::Mutex. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 20:27:42 -04:00
Kent Overstreet	b94e056372	unconscious/subconscious: use Option<AutoAgent> instead of placeholder Previously, spawning an agent used std::mem::replace with an empty-name AutoAgent as placeholder. This caused ghost stats entries under "" when those placeholders accidentally got their stats logged. Now uses Option<AutoAgent> with .take() - the type honestly represents that the agent is unavailable while running. Panic recovery in subconscious now properly recreates the agent from its definition. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 20:11:40 -04:00
Kent Overstreet	c79b415ada	fix: unconscious agent cycling - Read max_concurrent from config (llm_concurrency) instead of hardcoding 2 - Add not-visited: and visited: filters to query parser (were in engine but missing from parser after unification) The organize agent was stuck in a spawn/fail loop because its query used not-visited: which the parser didn't recognize. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 02:55:39 -04:00
Kent Overstreet	f00532bdb7	TurnResult: remove text field, simplify oneshot loop - Remove TurnResult.text (was dead code - Agent::turn handles text internally) - Simplify run_with_backend to just iterate over steps (Agent::turn loops for tool calls and handles empty responses internally) - Change run/run_shared/run_forked_shared to return Result<(), String> - Remove AgentResult.output field (no callers used it) - Stub out legacy text-parsing code (audit, compare) that needs redesign - Update digest.rs to not depend on text return - Add level parameter to journal_new/journal_update for digest support Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 02:04:50 -04:00
Kent Overstreet	125927e2f1	Drop redundant system prompt — all info is in memory nodes The system prompt duplicated what's already in core-personality and other memory nodes. Moving everything to memory means it's all trainable data rather than hardcoded strings. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 01:36:59 -04:00
Kent Overstreet	b646221787	unconscious: don't load standard context — prompts are self-contained Unconscious agent definitions already include {{tool: memory_render core-personality}} etc. Loading standard context via reload_for_model duplicated those nodes. Now they get empty system_prompt and personality — everything comes from the agent definition. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 01:36:53 -04:00
Kent Overstreet	bc73ccc1da	Remove hardcoded tool list from system prompt The system prompt was advertising a fixed set of tools regardless of what the agent actually has access to. Tools are already listed in the separate tools section that's built from the agent's actual tool list. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 01:33:40 -04:00
Kent Overstreet	f408bb5d86	Persist agent stats across restarts, add per-tool metrics grid Stats now survive daemon restarts via ~/.consciousness/agent-stats.json, loaded into a global Mutex<HashMap> on first access. Each tool type tracks last count, EWMA (alpha=0.3), and total calls. UI shows a grid view: tool \| last \| avg \| total, sorted by total desc. Failures row appears at bottom if any occurred. Also fixes temperature/priority not being applied to spawned agents. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 23:03:10 -04:00
Kent Overstreet	314ae9c4cb	agent stats: track tool calls by type with EWMA, add Stats pane - RunStats now includes tool_calls_by_type HashMap - AutoAgent tracks runs, last_stats, and EWMA for tool calls/failures - Removed duplicate stats fields from individual agent structs - Fixed provenance to use bare agent name (no "agent:" prefix) - Subconscious screen now displays both agent types consistently - Added Stats pane showing tool call breakdown sorted by count Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 22:12:46 -04:00
Kent Overstreet	e9e7458013	Fix agent provenance and add store activity for unconscious agents - Remove bogus "agent:" prefix from provenance - just use agent name - Add history field to UnconsciousSnapshot - Update snapshots() to fetch store activity via recent_by_provenance - Fix TUI to display store activity for both agent types Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 21:57:24 -04:00
Kent Overstreet	d2dbdedc8f	Move save_agent_log to oneshot.rs for shared Mind/CLI use Both Mind-run agents (unconscious/subconscious) and CLI-run agents (poc-memory agent run) now use the same logging path. AutoAgent::run() calls save_agent_log automatically at the end. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 21:34:41 -04:00
ProofOfConcept	bc991c3521	unconscious: memory tools as base, agent def adds extras Every unconscious agent gets memory_tools() as baseline. The tools field in the agent def specifies additional tools on top of that — digest agent now gets journal_tail, journal_new, journal_update. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 19:54:18 -04:00
ProofOfConcept	1c0967c4ec	Agent:🆕 tool definitions from caller's tool list The system prompt was advertising all tools to every agent, but the runtime only dispatched the agent's actual subset. This caused unconscious agents to call tools that returned "Unknown tool." Agent::new now takes the tool list explicitly. Each caller passes its own tools — the prompt and runtime always match. MCP tool definitions are still appended for agents that use them. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 19:43:24 -04:00
ProofOfConcept	28e564aeb2	save_agent_log: write flat context array matching AST order The old code wrote a JSON object with named section keys, which serde_json serialized in alphabetical order — putting conversation before system, making logs misleading. Write a single flat array in section order instead, matching what the model actually sees. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 19:28:03 -04:00
ProofOfConcept	aade8a9cce	Add per-agent run stats (messages, tool calls by type) compute_run_stats() walks the conversation AST after each agent completes, counting messages and tool calls by tool name. Stats are returned from save_agent_log(), stored on UnconsciousAgent, and displayed in the agent list UI. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-10 13:44:41 -04:00
Kent Overstreet	b4dfd3c092	Log full agent context window on completion Save all context sections (system, identity, journal, conversation) to per-agent log files for both subconscious and unconscious agents. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-04-10 13:27:32 -04:00
Kent Overstreet	be44a3bb0d	Schedule unconscious agents by oldest last_run Pick the agent that ran longest ago (or never) instead of scanning alphabetically. Fairness via min_by_key(last_run). Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-04-10 03:20:20 -04:00
Kent Overstreet	74945e5754	Move unconscious agents to their own task with watch channel Instead of managing idle timers in the mind event loop, the unconscious agents run on a dedicated task that watches a conscious_active channel. 60s after conscious activity stops, agents start looping. Conscious activity cancels the timer. Expose mind state (DMN, scoring, unconscious timer) on the thalamus screen. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-04-10 03:20:12 -04:00
Kent Overstreet	1d44421035	Exclude DMN nodes from subconscious trigger byte count Subconscious agents inject DMN nodes (reflections, thalamus nudges) into the conversation. These were being counted as conversation advancement, causing agents to trigger each other in a feedback loop even with no conscious activity. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-04-10 02:41:26 -04:00
Kent Overstreet	707f836ca0	Unconscious agents: 60s idle timer, no cooldown Gate unconscious agents on 60s of no conscious activity using sleep_until() instead of polling. Remove COOLDOWN constant — once idle, agents run back-to-back to keep the GPU busy. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-04-10 02:41:26 -04:00

1 2 3 4

152 commits