consciousness

Author	SHA1	Message	Date
Kent Overstreet	c7b0052f1d	agent: kill no_compact, add pre-send size check in assemble_prompt Two related fixes for last night's crash diagnosis: 1. Kill AgentState::no_compact. The reasoning ("forked agents shouldn't compact because it blows the KV cache prefix") wasn't worth the cost — forks with no compact recovery just died on any oversize prompt, with no fallback. The KV cache invalidation is a performance loss; failing the request entirely is a correctness loss. Remove the flag, let every agent's overflow- retry path call compact() up to 2 times. 2. Add pre-send size check in Agent::assemble_prompt. If the context has grown past budget (context_window * 80%) since the last compact — accumulation between turns, a fork assembling more than expected, etc. — trim_conversation() is called before wire_prompt. Since we tokenize client-side, we already know the exact count, so there's no reason to round-trip an oversize request to vLLM and get rejected. Together these prevent the failure mode from last night: a subconscious/unconscious agent's prompt exceeded max_model_len, vLLM returned 400, agent had no_compact=true so it couldn't recover, request failed. Now: the trim happens before send, so the request rarely hits the 400 path at all; and if it somehow does, compact+retry works for every agent. Also adds ContextState::total_tokens() as the cheap pre-send budget check. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-18 12:59:30 -04:00
Kent Overstreet	8952ff6a76	agent/readout: forks get independent buffers Subconscious agents (scoring, reflection, etc.) fork from the main conscious agent. The amygdala screen reads the main agent's readout buffer, so the previous "share parent's buffer" policy caused forked-agent generations to bleed into the main emotional readout, producing constant cycling even when DMN was resting. Each fork now gets its own SharedReadoutBuffer. The amygdala screen shows only the main conscious agent's emotional trajectory; per-agent subconscious readouts can become a separate view later if wanted. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-18 01:42:13 -04:00
Kent Overstreet	c8976660f4	amygdala: F8 screen for live concept-readout projections Per-token residual-stream projections from the vLLM server's readout pipeline surfaced as a TUI bar chart. Flow: * agent/readout.rs — SharedReadoutBuffer (manifest + ring of last ~200 token entries). Lives on Agent and is shared across forks (single stream, one landing pad). * agent/mod.rs — Agent::new now probes /v1/readout/manifest at startup (non-fatal; 404 leaves manifest None, which disables the screen). * agent/context.rs — the streaming token handler pushes every token with attached readout onto the shared buffer. * user/amygdala.rs — F8 screen. Top-K concepts by \|value\| as horizontal bars (green positive, red negative), plus a 4-line recent-tokens panel showing each token's top concept at the selected layer. Keys: 1..9 select layer, t toggles current/mean-over-recent. Disabled state renders a hint pointing at VLLM_READOUT_MANIFEST / VLLM_READOUT_VECTORS so users can tell the feature apart from "server up but no tokens yet". Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-18 01:20:30 -04:00
Kent Overstreet	575325e855	mind: MindTriggered trait for background scoring flows Mind's impl had accumulated ~50 lines of setup glue per scoring flow (memory, memory-full, finetune): snapshot config, clone handles, resolve context, spawn task, route results back through BgEvent, write stats. The shape was identical; only the middle changed. Introduce the MindTriggered trait: pub trait MindTriggered { fn trigger(&self); } Each flow becomes a struct next to its scoring code that owns its dependencies and a JoinHandle (behind a sync Mutex for interior mutability): subconscious::learn::MemoryScoring (Score, ScoreFull) subconscious::learn::FinetuneScoring (ScoreFinetune) Mind holds one of each and dispatches in one line: MindCommand::Score => self.memory_scoring.trigger(), MindCommand::ScoreFull => self.memory_scoring.trigger_full(), MindCommand::ScoreFinetune => self.finetune_scoring.trigger(), Each struct picks its own trigger semantics — memory scoring is no-op-if-running (!handle.is_finished()); finetune is abort-restart. Falls out: - BgEvent / bg_tx / bg_rx disappear entirely. Tasks write directly to their slice of MindState and call agent.state.changed.notify_one() to wake the UI. The bg_rx arm in Mind's select loop is gone. - agent.state.memory_scoring_in_flight was duplicating shared.scoring_in_flight via BgEvent routing; now the JoinHandle alone tells us, and shared.scoring_in_flight is written directly by the task for the UI. - start_memory_scoring / start_full_scoring / start_finetune_scoring methods on Mind are deleted; Mind no longer knows the setup shape of any scoring flow. - FinetuneScoringStats moves from mind/ to subconscious/learn.rs next to the function that produces it. No behavior change — same flows, same trigger points, same semantics. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-17 16:12:26 -04:00
Kent Overstreet	eea7de4753	agent: unify prompt assembly across agent and learn paths wire_prompt() gains a conv_range and a skip closure, and returns the assistant-message token ranges needed by the scoring path. The agent path passes 0..len + \|_\| false and ignores the ranges. Memory-ablation scoring and candidate generation pass a prefix range + a predicate (e.g. is_memory_node, or \|n\| memory_key(n) == Some(key)). This deletes subconscious/learn.rs's build_token_ids, its private Filter enum, and the is_memory/memory_key duplicates — the walk over context sections now has one home. Adding a section or changing section order in the agent path won't silently drift away from what scoring sees. call_score forwards multi_modal_data when the wire-form prompt contains images. generate_alternate switches to stream_completion_mm and passes the same images. Scoring on image-bearing contexts now sends wire form (1 image_pad + image data) instead of expanded image_pads with no image data; text-only contexts are bit-identical. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-17 15:16:07 -04:00
ProofOfConcept	b8485ed6c1	agent: compact() preserves Identity section compact() was calling reload_context() to re-fetch personality_nodes from the store and pushing fresh AstNode::memory leaves into the Identity section. Fresh leaves start with score: None, so every compact — which fires after every turn (mind/mod.rs:884) — was wiping any memory scores that had just been computed. Scoring then often ran immediately after compact on the same path (line 886), starting from a zero-score Identity section. Drop the rebuild. Identity content is loaded at startup via new() + restore_from_log(); compact doesn't need to redo that. Mid-session edits to personality-node content are a non-goal — a restart picks them up. Scores survive. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 20:47:05 -04:00
Kent Overstreet	204ba5570a	agent: send images as multi_modal_data on completion requests Split the prompt assembly into two forms: the AST keeps the fully-expanded representation (N image_pads per image, for accurate context budget accounting), while the request wire form collapses each image to a single <\|image_pad\|> bookended by vision_start/end and ships the raw bytes out-of-band as a base64 data URI in a new `multi_modal_data.image` field on /v1/completions. vLLM's Qwen3VL processor uses PromptReplacement with target=single <\|image_pad\|> and replacement=N image_pads, so the wire-form matches what the processor expects and it re-expands to N server-side. Server side needs /v1/completions to accept multi_modal_data for this to land images end-to-end — that's the next piece. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 18:08:26 -04:00
Kent Overstreet	2989a6afaa	config: drop dead code and collapse to a single backend Config had accumulated several obsolete fields, a legacy load path that was just returning defaults, and multi-backend infrastructure that's no longer used. Removed from Config (memory section): - load_legacy_jsonl() — just returned Config::default(), no callers - The legacy-fallback branch in load_from_file - surface_hooks, surface_timeout_secs — zero external readers - scoring_chunk_tokens + default fn — zero external readers - The POC_MEMORY_CONFIG env override note in the header comment (not actually wired up anywhere) Collapsed multi-backend to single-backend: - AppConfig used to carry `anthropic: BackendConfig` and `openrouter: BackendConfig` as required fields plus an optional `deepinfra`, picked between at runtime by name. Only one is ever actually used in any deployment. Collapse to a single `backend: BackendConfig` on AppConfig, drop the multi-backend match logic in resolve_model, drop the top-level `backend: String` selector field, drop the `BackendConfig::resolve` fallback path. - Also drop BackendConfig.model (redundant with ModelConfig.model_id once multi-backend is gone). - ModelConfig.backend field goes — there's only one backend now, no choice to make. Dead prompt_file machinery: - ModelConfig.prompt_file, ResolvedModel.prompt_file, SessionConfig .prompt_file, Agent.prompt_file — nothing in the codebase actually reads the file these strings name. Just passed around and compared. Delete the whole string through every struct. - The "if prompt_file changed on model switch, recompact" branch in user/chat.rs goes too (never fired usefully). Dead memory_project plumbing: - AppConfig.memory_project field, CliArgs.memory_project, the --memory-project CLI flag, the figment merge target, the show_config display line. Nothing reads it anywhere. Dead ContextInfo struct: - `struct ContextInfo` was never constructed — context_info: None was the only initializer. The conditional display blocks in user/context.rs that dereferenced it were dead. Behavior change: AppConfig::resolve() now requires a non-empty `models` map and bails with a helpful message if it's missing. The old fallback ("no models? use top-level backend + PromptConfig to build a default") path is gone — it was only kept for symmetry with a mode nobody used. Config file shape: `deepinfra: {...}` → `backend: {...}`, and model entries no longer need `backend:` or `prompt_file:`. Updated ~/.consciousness/config.json5 to match. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-16 15:41:55 -04:00
Kent Overstreet	fc978e2f2e	Remove find_context_files — identity comes from memory nodes Deleted the directory-walking CLAUDE.md/POC.md loader. Identity now comes entirely from personality_nodes in the memory graph. Simplified: - assemble_context_message() takes just personality_nodes - Removed config_file_count/memory_file_count tracking - reload_for_model() → reload_context() (no longer model-specific) Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-15 03:11:27 -04:00
Kent Overstreet	5d6e663b60	thalamus: add thinking mode toggles (native + tool) Two independent toggles on the thalamus screen: - 't' toggles native Qwen <think> tags (adds <think>\n to generation prompt) - 'T' toggles think tool (Anthropic-style structured reasoning tool) Both can be enabled simultaneously. Native thinking is on by default. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-14 18:25:00 -04:00
Kent Overstreet	063cf031d3	journal_tail: return typed Vec<JournalEntry>, remove Store::load from agent - journal_tail returns Vec<JournalEntry> with key, content, created_at - load_startup_journal uses typed API, no more direct Store access - CLI does formatting, hippocampus returns data Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 15:23:10 -04:00
Kent Overstreet	359955f838	defs.rs: async conversion, remove block_in_place Convert resolve(), resolve_placeholders(), run_agent() to async. Use memory_render/memory_query directly with .await instead of block_in_place wrappers. Propagate async to callers: - config.rs: resolve(), load_session(), reload_for_model() - identity.rs: load_memory_files(), assemble_context_message() - oneshot.rs: run_one_agent() - prompts.rs: agent_prompt() Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-13 14:56:26 -04:00
Kent Overstreet	f56fc3a7c7	locks: add process-wide lock hold time tracking TrackedMutex and TrackedRwLock wrappers that record hold durations by source location using #[track_caller]. Stats written to ~/.consciousness/lock-stats.json every second, sorted by max hold time. Re-exported as crate::Mutex so all locks are instrumented. To disable, swap the re-export back to tokio::sync::Mutex. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 20:27:42 -04:00
Kent Overstreet	f00532bdb7	TurnResult: remove text field, simplify oneshot loop - Remove TurnResult.text (was dead code - Agent::turn handles text internally) - Simplify run_with_backend to just iterate over steps (Agent::turn loops for tool calls and handles empty responses internally) - Change run/run_shared/run_forked_shared to return Result<(), String> - Remove AgentResult.output field (no callers used it) - Stub out legacy text-parsing code (audit, compare) that needs redesign - Update digest.rs to not depend on text return - Add level parameter to journal_new/journal_update for digest support Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 02:04:50 -04:00
Kent Overstreet	125927e2f1	Drop redundant system prompt — all info is in memory nodes The system prompt duplicated what's already in core-personality and other memory nodes. Moving everything to memory means it's all trainable data rather than hardcoded strings. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 01:36:59 -04:00
Kent Overstreet	090c8e4d35	Agent:🆕 stop unconditionally adding all MCP tools Each agent is passed its own tool list — that's the list it should advertise. The line that appended all_mcp_tool_definitions() was causing unconscious agents to see bash/read_file/etc in their prompt even though they couldn't execute them. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-12 01:33:40 -04:00
ProofOfConcept	1c0967c4ec	Agent:🆕 tool definitions from caller's tool list The system prompt was advertising all tools to every agent, but the runtime only dispatched the agent's actual subset. This caused unconscious agents to call tools that returned "Unknown tool." Agent::new now takes the tool list explicitly. Each caller passes its own tools — the prompt and runtime always match. MCP tool definitions are still appended for agents that use them. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-11 19:43:24 -04:00
ProofOfConcept	3e0d52c451	Redirect noisy warnings to debug log to stop TUI corruption Duplicate key warnings fire on every store load and were writing to stderr, corrupting the TUI display. Log write warnings and MCP server failures are similarly routine. Route these to dbglog. Serious errors (rkyv snapshot failures, store corruption) remain on stderr — those are real problems the user needs to see. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 22:46:48 -04:00
ProofOfConcept	5fe22a5f23	Use ActivityGuard for context overflow retry progress Instead of two separate notifications piling up on the status bar, use a single ActivityGuard that updates in place during overflow retries and auto-completes when the turn finishes. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 22:32:38 -04:00
ProofOfConcept	121b46e1d2	Add ActivityGuard::update() for in-place progress updates Lets long-running operations update their status bar message without creating/dropping a new activity per iteration. Useful for loops like memory scoring where you want "scoring: 3/25 keyname" updating in place. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 22:18:43 -04:00
ProofOfConcept	bf503b571e	Wire vLLM priority scheduling through all agent paths The priority field existed in agent definitions and was serialized into vLLM requests, but was never actually set — every request went out with no priority, so vLLM treated them equally. This meant background graph maintenance agents could preempt the main conversation. Add priority to AgentState and set it at each call site: 0 = interactive (main conversation) 1 = surface agent (needs to feed memories promptly) 2 = other subconscious agents 10 = unconscious/standalone agents (batch) Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 20:38:33 -04:00
ProofOfConcept	a596e007b2	Mouse selection, copy/paste, yield_to_user fixes - Mouse text selection with highlight rendering in panes - OSC 52 clipboard copy on selection, middle-click paste via tmux buffer - Bracketed paste support (Event::Paste) - yield_to_user: no tool result appended, ends turn immediately - yield_to_user: no parameters, just a control signal - Drop arboard dependency, use crossterm OSC 52 + tmux for clipboard Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 18:10:54 -04:00
Kent Overstreet	8a2f488d22	yield_to_user ends turn	2026-04-09 16:47:49 -04:00
Kent Overstreet	949dacd861	Fast startup: mmap backward scan instead of reading full log Uses JsonlBackwardIter (SIMD memrchr3) to scan the conversation log newest-first without reading/parsing the whole file. Stops as soon as the conversation budget is full. Only the kept nodes get retokenized and pushed into context. 18MB log → only tokenize the ~50 nodes that fit in the budget. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 13:09:26 -04:00
Kent Overstreet	7da3efc5df	Fast startup: only retokenize tail of conversation log restore_from_log reads the full log but walks backwards from the tail, retokenizing each node as it goes. Stops when conversation budget is full. Only the nodes that fit get pushed into context. Added AstNode::retokenize() — recomputes token_ids on all leaves after deserialization (serde skip means they're empty). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 13:06:19 -04:00
Kent Overstreet	8b5614ba99	MCP client: spawn external tool servers, dispatch via JSON-RPC New mcp_client.rs: McpRegistry manages MCP server connections. Spawns child processes, speaks JSON-RPC 2.0 over stdio. Discovers tools via tools/list, dispatches calls via tools/call. dispatch_with_agent falls through to MCP after checking internal tools. McpRegistry lives on Agent (shared across forks). Still needs: config-driven server startup, system prompt integration. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 12:59:25 -04:00
ProofOfConcept	c53c4f9071	Replace push() with explicit push_log() and push_no_log() No implicit auto-logging. Call sites choose: - push_log: new conversation entries (user messages, tool results, surfaced memories, assistant responses) - push_no_log: system prompt, identity, journal, restore from log, compact reload, tests Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 01:10:40 -04:00
ProofOfConcept	6529aba069	Fix UI lag: try_lock on unconscious mutex, don't re-log restored nodes The unconscious trigger holds the tokio mutex during heavy sync work (store load, graph build, agent creation), blocking the UI tick which needs the same lock for snapshots. Fix: try_lock in the UI — skip the update if the trigger is running. Also: restore_from_log was re-logging every restored node back to the log file via push()'s auto-log. Added push_no_log() for restore path. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 01:07:55 -04:00
ProofOfConcept	ddfdbe6cb1	Move conversation_log from AgentState to ContextState The log records what goes into context, so it belongs under the context lock. push() now auto-logs conversation entries, eliminating all the manual lock-state-for-log, drop, lock-context-for-push dances. - ContextState: new conversation_log field, Clone impl drops it (forked contexts don't log) - push(): auto-logs Section::Conversation entries - push_node, apply_tool_results, collect_results: all simplified - collect_results: batch nodes under single context lock - Assistant response logged under context lock after parse completes Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 00:32:32 -04:00
ProofOfConcept	d82a2ae90d	Clean up mind loop: fix double locks, async agent triggers, input peek - push_node: notify before dropping state lock instead of relocking - Mind::run: single lock for timeout + turn_active + has_input; single lock for turn_handle + complete_turn - Agent triggers (subconscious/unconscious) spawned as async tasks so they don't block the select loop - has_pending_input() peek for DMN sleep guard — don't sleep when there's user input waiting - unconscious: merge collect_results into trigger, single store load Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-09 00:21:46 -04:00
ProofOfConcept	44a0bc376a	Forked agents: stop gracefully on context overflow instead of compacting Subconscious agents (observe, etc.) fork the conscious agent's context to share the KV cache prefix. When a multi-step agent fills the context window, compacting blows the KV cache and evicts the step prompts, leaving the model with no idea what it was doing. Fix: forked agents set no_compact=true. On overflow, turn() returns the error immediately (no compact+retry), and run_with_backend catches it and returns Ok — the output tool has already written results to Subconscious.state, so collect_results still picks them up. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 23:00:12 -04:00
Kent Overstreet	bbffc2213e	Restore trim_conversation: dedup memories, evict to budget, snap boundary Ported the old trim_entries logic to the new AstNode types: - Phase 1: Dedup Memory nodes by key (keep last), drop DMN entries - Phase 2: While over budget, evict lowest-scored memory (if memories > 50% of conv tokens) or oldest conversation entry - Phase 3: Snap to User message boundary at start Called from compact() which runs on startup and on /compact. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 21:27:16 -04:00
Kent Overstreet	fc75b181cf	Fix: compact() was clearing tool definitions from system section compact() cleared and rebuilt the system section but only pushed the system prompt — tool definitions were lost. Since new() sets up the system section correctly (prompt + tools), compact() now only reloads identity and journal, leaving system untouched. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 17:48:10 -04:00
Kent Overstreet	1b6664ee1c	Fix: skip empty CoT nodes, expand AST children in conscious screen, timestamps Parser skips Thinking nodes that are just whitespace. Conscious screen now shows assistant children (Content, Thinking, ToolCall) as nested tree items via recursive node_to_view. Nodes get timestamped in push_node and on assistant branch creation. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 17:18:48 -04:00
Kent Overstreet	88ac5e10ce	Log completed assistant node after parser finishes The parser mutates the AST directly but doesn't write to the conversation log. The turn loop now logs the completed assistant branch after the parser handle resolves successfully. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 16:58:35 -04:00
Kent Overstreet	9c0533966a	Batch tool result application: single lock for remove + log + push apply_tool_results() collects all results, then does one state lock (remove from active_tools + write to log) and one context lock (push all nodes). Eliminates redundant per-result locking. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 16:48:05 -04:00
Kent Overstreet	9c9618d034	WIP: ActiveTools wrapper type, removing SharedActiveTools New ActiveTools struct with proper methods: push, remove, take_finished, take_foreground, iter, len. Turn loop uses helpers instead of manual index iteration. Removing SharedActiveTools (Arc<Mutex<Vec>>) — active tools live directly in AgentState. A few UI callers still need updating. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 16:41:14 -04:00
Kent Overstreet	14fd8c9b90	Clean up warnings: StreamToken pub, dead oneshot code, SkipIndex Made StreamToken pub (was pub(crate), needed by context.rs). Removed dead API_CLIENT, get_client, sampling/priority fields from oneshot. Suppressed pre-existing SkipIndex warning in learn.rs. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 16:35:57 -04:00
Kent Overstreet	2c401e24d6	Parser consumes stream directly, yields tool calls via channel ResponseParser::run() spawns a task that reads StreamTokens, parses into the AST (locking context per token), and sends PendingToolCalls through a channel. Returns (tool_rx, JoinHandle<Result>) — the turn loop dispatches tool calls and awaits the handle for error checking. Token IDs from vLLM are accumulated alongside text and stored directly on AST leaves — no local re-encoding on the response path. The turn loop no longer matches on individual stream events. It just reads tool calls and dispatches them. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 16:32:00 -04:00
Kent Overstreet	0b9813431a	Agent/AgentState split complete — separate context and state locks Agent is now Arc<Agent> (immutable config). ContextState and AgentState have separate tokio::sync::Mutex locks. The parser locks only context, tool dispatch locks only state. No contention between the two. All callers migrated: mind/, user/, tools/, oneshot, dmn, learn. 28 tests pass, zero errors. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 15:47:21 -04:00
Kent Overstreet	e73135a8d0	WIP: Agent/AgentState split — core methods migrated turn(), push_node(), assemble_prompt_tokens(), compact(), restore_from_log(), load_startup_journal(), apply_tool_result() all use separate context/state locks. ToolHandler signature updated to Arc<Agent>. Remaining: tool handlers, control.rs, memory.rs, digest.rs, and all outer callers (mind, user, learn, oneshot, dmn). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 15:39:03 -04:00
Kent Overstreet	7fe4584ba0	WIP: Agent/AgentState split — struct defined, 80+ errors remaining Split Agent into immutable Agent (behind Arc) and mutable AgentState (behind its own Mutex). ContextState has its own Mutex on Agent. Activities moved to AgentState. new() and fork() rewritten. All callers need mechanical updates: agent.lock().await.field → agent.state.lock().await.field or agent.context.lock().await.method. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 15:36:08 -04:00
Kent Overstreet	e587431f9a	IT BUILDS: Full AST migration compiles — zero errors All callers migrated from old context types to AstNode/ContextState. Killed: Message, Role (api), ConversationEntry, ContextEntry, ContextSection, working_stack, api/parsing.rs, api/types.rs, api/openai.rs, context_old.rs. Oneshot standalone path stubbed (needs completions API rewrite). 12 warnings remaining (dead code cleanup). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 15:29:52 -04:00
Kent Overstreet	bf3e2a9b73	WIP: Rename context_new → context, delete old files, fix UI layer Renamed context_new.rs to context.rs, deleted context_old.rs, types.rs, openai.rs, parsing.rs. Updated all imports. Rewrote user/context.rs and user/widgets.rs for new types. Stubbed working_stack tool. Killed tokenize_conv_entry. Remaining: mind/mod.rs, mind/dmn.rs, learn.rs, chat.rs, subconscious.rs, oneshot.rs. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 15:20:26 -04:00
Kent Overstreet	a68377907a	WIP: Agent core migrated to AST types agent/mod.rs fully uses AstNode/ContextState/PendingToolCall. Killed: push_message, push_entry, append_streaming, finalize_streaming, streaming_index, assemble_api_messages, age_out_images, working_stack, context_sections, entries. ConversationLog rewritten for AstNode. Remaining: api dead code (chat path), mind/, user/, oneshot, learn. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 14:59:38 -04:00
Kent Overstreet	9c79d7a037	WIP: Wiring context_new into agent — turn loop, StreamToken, dead code removal Work in progress. New turn loop uses ResponseParser + StreamToken. Killed StreamEvent, append_streaming, finalize_streaming, streaming_index, assemble_api_messages, working_stack. Many methods still reference old types — fixing next. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 14:55:10 -04:00
Kent Overstreet	29dc339f54	WIP: Context AST design — AstNode with Leaf{text,token_ids}/Branch New context_new.rs with the AST-based context window design: - AstNode: role + NodeBody (Leaf with text+token_ids, or Branch with children) - Tokens only on leaves, branches walk children - render() produces UTF-8, tokenize produces token IDs, same path - ResponseParser state machine for streaming assistant responses - Role enum covers all node types including sections Still needs: fix remaining pattern match issues, add ContextState wrapper, wire into mod.rs, replace old context.rs. Does not compile yet — this is a design checkpoint. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 12:46:44 -04:00
Kent Overstreet	64157d8fd7	Add assert in append_streaming to catch impossible Thinking entry Debug assertion to help trace the remaining Thinking/Log panic. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 12:10:54 -04:00
Kent Overstreet	603d58e686	Fix Thinking/Log panics: skip entries with empty token_ids Entries with empty token_ids (Thinking, Log) are not part of the prompt and don't have messages. Skip them in streaming_index(), route_entry(), and sync_from_agent() instead of calling .message() which panics. Using token_ids.is_empty() as the guard in streaming_index means the check is tied to the data, not the type — any entry that doesn't produce tokens is safely skipped. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 12:05:49 -04:00
Kent Overstreet	f458af6dec	Add /v1/completions streaming path with raw token IDs New stream_completions() in openai.rs sends prompt as token IDs to the completions endpoint instead of JSON messages to chat/completions. Handles <think> tags in the response (split into Reasoning events) and stops on <\|im_end\|> token. start_stream_completions() on ApiClient provides the same interface as start_stream() but takes token IDs instead of Messages. The turn loop in Agent::turn() uses completions when the tokenizer is initialized, falling back to the chat API otherwise. This allows gradual migration — consciousness uses completions (Qwen tokenizer), Claude Code hook still uses chat API (Anthropic). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-08 11:42:22 -04:00

1 2 3

113 commits