consciousness

Author	SHA1	Message	Date
Kent Overstreet	79e384f005	split out src/mind	2026-04-04 02:46:32 -04:00
ProofOfConcept	ce04568454	training: add memory_score() and finetune_score() Separate the scoring into two distinct functions: - memory_score(key): scores one memory's importance by measuring divergence in the 50 messages after it was surfaced. Two API calls (baseline vs without that memory). - finetune_score(count): scores recent messages with all memories stripped to identify fine-tuning candidates. Responses with high divergence depend on memories the model hasn't internalized yet. The existing score_memories() with the full NxM matrix is preserved for the debug screen. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-04 01:49:53 -04:00
Kent Overstreet	9bebbcb635	Move API code from user/ to agent/ Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-04 00:34:48 -04:00
ProofOfConcept	021eafe6da	delete ProcessTracker — replaced by ActiveToolCall + KillOnDrop All process management now goes through active_tools: - TUI reads metadata (name, elapsed time) - Ctrl+K aborts handles (KillOnDrop sends SIGTERM) - Running count from active_tools.len() No more separate PID tracking, register/unregister, or ProcessInfo. One data structure for everything. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 23:58:38 -04:00
ProofOfConcept	310bbe9fce	KillOnDrop: SIGTERM process group when tool task is aborted tokio::spawn abort drops the future but leaves child processes running as orphans. KillOnDrop sends SIGTERM to the process group on drop, ensuring cleanup. Defused via mem::forget on normal completion. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 23:47:36 -04:00
ProofOfConcept	a78f310e4d	unify tool tracking: ActiveToolCall with JoinHandle One data structure for all in-flight tool calls — metadata for TUI display + JoinHandle for result collection and cancellation. Agent spawns tool calls via tokio::spawn, pushes to shared Arc<Mutex<Vec<ActiveToolCall>>>. TUI reads metadata, can abort(). No separate inflight/background collections. Non-background: awaited after stream ends. Background: persists, drained at next turn start. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 23:42:27 -04:00
ProofOfConcept	17a018ff12	fixup: consolidate tool types, fix build after reorganization Move FunctionCall, FunctionDef, FunctionCallDelta from user/types to agent/tools. Re-export from user/types for backward compat. Merge duplicate dispatch functions in tools/mod.rs into dispatch (agent-specific) + dispatch_shared (with provenance). Fix orphaned derive, missing imports, runner→agent module path. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 23:21:16 -04:00
ProofOfConcept	474b66c834	shared active tools: Agent writes, TUI reads directly Move active tool tracking from TUI message-passing to shared Arc<RwLock> state. Agent pushes on dispatch, removes on apply_tool_result. TUI reads during render. Background tasks show as active until drained at next turn start. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 22:57:46 -04:00
ProofOfConcept	d25033b9f4	fire XML tool calls as they arrive during streaming When </tool_call> is detected in the content stream, parse and dispatch immediately via FuturesOrdered. Tool calls execute concurrently while the stream continues. Results collected in order after the stream ends. Structured API path (ToolCallDelta) unchanged — still uses post-stream parallel dispatch. Co-Developed-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 22:38:30 -04:00
Kent Overstreet	2f0c7ce5c2	src/thought -> src/agent Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-04-03 22:24:56 -04:00
Kent Overstreet	14dd8d22af	Rename agent/ to user/ and poc-agent binary to consciousness Mechanical rename: src/agent/ -> src/user/, all crate::agent:: -> crate::user:: references updated. Binary poc-agent renamed to consciousness with CLI name and user-facing strings updated. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-03 17:25:59 -04:00
Kent Overstreet	e8c3ed3d96	switch memory scoring to /v1/score endpoint Replace prompt_logprobs-based scoring with the new vLLM /v1/score endpoint. Much simpler: one API call per memory drop, returns per-message total_logprob directly. No chunking needed, no OOM risk — the endpoint only computes logits for scored tokens. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-03 00:40:27 -04:00
Kent Overstreet	249726599b	read_tail 64MB — just read the whole log Images in the jsonl eat most of the byte budget. 64MB covers any realistic conversation log; compact() trims to fit. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 23:13:28 -04:00
Kent Overstreet	31302961e2	estimate prompt tokens on restore so status bar isn't 0K After restore_from_log + compact, set last_prompt_tokens from the budget's used() count instead of waiting for the first API call. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 23:07:42 -04:00
Kent Overstreet	41b3f50c91	keep 2 most recent images, age out the rest age_out_images now keeps 1 existing image + 1 about to be added = 2 live images for motion/comparison. Previously aged all to 1. Reduces image bloat in conversation log and context. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 23:06:08 -04:00
Kent Overstreet	3f3db9ce26	increase log read_tail from 2MB to 8MB Large tool results (memory renders, bash output) consume most of the 2MB budget — only 37 entries loaded from a 527-line log. 8MB captures ~300 entries, giving compact() enough conversation to work with. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 23:02:43 -04:00
Kent Overstreet	736307b4c2	add debug logging to compact and restore_from_log Logs entry counts before/after compaction (memory vs conversation), budget breakdown, and restore load counts. Helps diagnose context utilization issues. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:58:25 -04:00
Kent Overstreet	d921e76f82	increase context budget: 80% window, 15% journal, no double reserve Context was too aggressively trimmed — 80% free after compaction. Budget was 60% of window minus 25% reserve = only 45% usable. Now: 80% of window for total budget (20% output reserve built in), no extra reserve subtraction. Journal budget 5% → 15% to carry more context across compactions. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:53:54 -04:00
Kent Overstreet	78abf90461	fix scoring: HTTP error checking, context refresh, chunk logging Check HTTP status from logprobs API (was silently ignoring 500s). Call publish_context_state() after storing scores so F10 screen updates. Add chunk size logging for OOM debugging. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:47:44 -04:00
Kent Overstreet	19205b9bae	show scoring progress and per-response memory attribution Status bar shows "scoring 3/7..." during scoring. Debug pane logs per-memory importance and top-5 response breakdowns. F10 context screen shows which memories were important for each assistant response as drilldown children (← memory_key (score)). Added important_memories_for_entry() to look up the matrix by conversation entry index. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:27:43 -04:00
Kent Overstreet	c01d4a5b08	wire up /score command and debug screen for memory importance /score snapshots the context and client, releases the agent lock, runs scoring in background. Only one score task at a time (scoring_in_flight flag). Results stored on Agent and shown on the F10 context debug screen with importance scores per memory. ApiClient derives Clone. ContextState derives Clone. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:21:31 -04:00
Kent Overstreet	df9b610c7f	add memory importance scoring via prompt logprobs score_memories() drops each memory from the context one at a time, runs prompt_logprobs against the full conversation, and builds a divergence matrix: memories × responses. Row sums = memory importance (for graph weight updates) Column sums = response memory-dependence (training candidates) Uses vLLM's prompt_logprobs to check "would the model have said this without this memory?" — one forward pass per memory, all responses scored at once. ~3s per memory on B200. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 22:13:55 -04:00
Kent Overstreet	33e45f6ce8	replace hardcoded personal names with config values User and assistant names now come from config.user_name and config.assistant_name throughout: system prompt, DMN prompts, debug screen, and all agent files. Agent templates use {user_name} and {assistant_name} placeholders. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 19:45:35 -04:00
Kent Overstreet	5b92b59b17	move failed request logs to their own subdirectory ~/.consciousness/logs/failed-requests/ instead of cluttering the main logs directory. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 19:28:56 -04:00
Kent Overstreet	3b80af2997	log buffer contents on stream errors and timeouts Show chunks received, SSE lines parsed, and the contents of the line buffer (up to 500 bytes) on both stream errors and timeouts. This tells us whether we got partial data, a non-SSE response, or truly nothing from the server. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 18:49:33 -04:00
Kent Overstreet	156626ae53	configurable stream timeout, show per-call timer in status bar Stream chunk timeout is now api_stream_timeout_secs in config (default 60s). Status bar shows total turn time and per-call time with timeout: "thinking... 45s, 12/60s". Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 18:46:27 -04:00
Kent Overstreet	13d9cc962e	abort orphaned stream tasks on drop, reduce timeout to 60s Spawned streaming tasks were never cancelled when a turn ended or retried, leaving zombie tasks blocked on dead vLLM connections. AbortOnDrop wrapper aborts the task when it goes out of scope. Chunk timeout reduced from 120s to 60s. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 18:41:02 -04:00
Kent Overstreet	35f231233f	clear activity indicator on error paths "thinking..." was getting stuck in the status bar when a turn ended with a stream error, context overflow, or model error — only the success path cleared it. Now all error returns clear the activity indicator. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 17:53:51 -04:00
Kent Overstreet	91eb9c95cc	delete 20 dead public functions across 12 files Removed functions with zero callers: parse_timestamp_to_epoch, hash_key, search_weighted_debug, extract_query_terms, format_results, move_to_neighbor, adjust_edge_strength, update_graph_metrics, nearest_to_seeds, nystrom_project, chat_completion_stream, cmd_read, context_message, split_candidates, split_plan_prompt, split_extract_prompt, log_event_pub, log_verbose, rpc_record_hits, memory_definitions. -245 lines. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 16:21:01 -04:00
Kent Overstreet	b0e852a05f	add unreachable_pub lint, fix all 17 violations pub → pub(crate) for SseReader methods (used across child modules). pub → pub(super) for openai::stream_events, tool definitions, store helpers. pub → private for normalize_link and differentiate_hub_with_graph (only used within their own files). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 16:15:32 -04:00
Kent Overstreet	af3929cc65	simplify compaction: Agent owns config, compact() reloads everything Agent stores AppConfig and prompt_file, so compact() reloads identity internally — callers no longer pass system_prompt and personality. restore_from_log() loads entries and calls compact(). Remove soft compaction threshold and pre-compaction nudge (journal agent handles this). Remove /compact and /context commands (F10 debug screen replaces both). Inline do_compact, emergency_compact, trim_and_reload into compact(). Rename model_context_window to context_window, drop unused model parameter. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 16:08:41 -04:00
Kent Overstreet	d419587c1b	WIP: trim_entries dedup, context_window rename, compact simplification Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:58:03 -04:00
Kent Overstreet	809679b6ce	delete dead flat-file journal tool and ephemeral stripping Journal entries are written to the memory graph via journal_new/ journal_update, not appended to a flat file. Remove thought/journal.rs (67 lines), strip_ephemeral_tool_calls (55 lines), default_journal_path, and all wiring. -141 lines. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:35:56 -04:00
Kent Overstreet	aceaf0410e	delete dead flat-file journal code from thought/context.rs Journal entries are loaded from the memory graph store, not from the flat journal file. Remove build_context_window, plan_context, render_journal_text, assemble_context, truncate_at_section, find_journal_cutoff, parse_journal*, ContextPlan, and stale TODOs. Keep JournalEntry, default_journal_path (write path), and the live context management functions. -363 lines. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:31:12 -04:00
Kent Overstreet	214806cb90	move context functions from agent/context.rs to thought/context.rs trim_conversation moved to thought/context.rs where model_context_window, msg_token_count, is_context_overflow, is_stream_error already lived. Delete the duplicate agent/context.rs (94 lines). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:28:00 -04:00
Kent Overstreet	01bfbc0dad	move journal types from agent/journal.rs to thought/context.rs JournalEntry, parse_journal, parse_journal_text, parse_header_timestamp, and default_journal_path consolidated into thought/context.rs. Delete the duplicate agent/journal.rs (235 lines). Update all references. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:25:07 -04:00
Kent Overstreet	e0a54a3b43	save request payload on any API error, not just timeouts Serialize request JSON before send_and_check so it's available for both HTTP errors and stream errors. Extracted save logic into save_failed_request helper on SseReader. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 15:19:26 -04:00
Kent Overstreet	64dbcbf061	unify memory tracking: entries are the single source of truth Memory tool results (memory_render) are now pushed as ConversationEntry::Memory with the node key, instead of plain Messages. Remove loaded_nodes from ContextState — the debug screen reads memory info from Memory entries in the conversation. Surfaced memories from surface-observe are pushed as separate Memory entries, reflections as separate system-reminder messages. User input is no longer polluted with hook output. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 14:56:02 -04:00
Kent Overstreet	a21cf31ad2	unify conversation persistence to append-only jsonl Log ConversationEntry (with Memory/Message typing) instead of raw Message. restore_from_log reads typed entries directly, preserving Memory vs Message distinction across restarts. Remove current.json snapshot and save_session — the append-only log is the single source of truth. Remove dead read_all and message_count methods. Add push_entry for logging typed entries. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 14:31:19 -04:00
Kent Overstreet	1f7b585d41	remove Anthropic backend, add request logging on timeout Delete anthropic.rs (713 lines) — we only use OpenAI-compatible endpoints (vLLM, OpenRouter). Simplify ApiClient to store base_url directly instead of Backend enum. SseReader now stores the serialized request payload and saves it to ~/.consciousness/logs/failed-request-{ts}.json on stream timeout, so failed requests can be replayed with curl for debugging. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 14:13:23 -04:00
Kent Overstreet	078dcf22d0	cleanup: remove model name string matching model_context_window() now reads from config.api_context_window instead of guessing from model name strings. is_anthropic_model() replaced with backend == "anthropic" checks. Dead model field removed from AgentDef/AgentHeader. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 14:09:54 -04:00
Kent Overstreet	47c6694b10	Remove dead code: old context builder, plan_context, journal parsing Removed from context.rs: ContextPlan, plan_context, render_journal_text, assemble_context, truncate_at_section, find_journal_cutoff, parse_msg_timestamp. All replaced by trim_conversation + journal from memory graph. Removed from tui.rs: most_recent_file, format_duration (filesystem scanning leftovers). Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 03:40:35 -04:00
Kent Overstreet	e9e47eb798	Replace build_context_window with trim_conversation build_context_window loaded journal from a stale flat file and assembled the full context. Now journal comes from the memory graph and context is assembled on the fly. All that's needed is trimming the conversation to fit the budget. trim_conversation accounts for identity, journal, and reserve tokens, then drops oldest conversation messages until it fits. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 03:35:28 -04:00
Kent Overstreet	87add36cdd	Fix: don't overwrite journal during restore/compaction The restore and compaction paths called build_context_window which reads from the stale flat journal file, overwriting the journal we loaded from the memory graph. Preserve the graph-loaded journal across these operations. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 03:33:04 -04:00
Kent Overstreet	b9e3568385	ConversationEntry enum: typed memory vs conversation messages Replace untyped message list with ConversationEntry enum: - Message(Message) — regular conversation turn - Memory { key, message } — memory content with preserved message for KV cache round-tripping Budget counts memory vs conversation by matching on enum variant. Debug screen labels memory entries with [memory: key]. No heuristic tool-name scanning. Custom serde: Memory serializes with a memory_key field alongside the message fields, deserializes by checking for the field. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 03:26:00 -04:00
Kent Overstreet	eb4dae04cb	Compute ContextBudget on demand from typed sources Remove cached context_budget field and measure_budget(). Budget is computed on demand via budget() which calls ContextState::budget(). Each bucket counted from its typed source. Memory split from conversation by identifying memory tool calls. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 03:07:45 -04:00
Kent Overstreet	acdfbeeac3	Align debug screen and budget with conversation-only messages context.messages is conversation-only now — remove conv_start scanning. Memory counted from loaded_nodes (same as debug screen). No subtraction heuristics. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 02:56:28 -04:00
Kent Overstreet	5e781e9ae4	Fix budget counting: remove stale refresh_context_message refresh_context_message was injecting personality into conversation messages (assuming fixed positions that no longer exist). Replaced with refresh_context_state which just re-measures and publishes. conv_tokens now subtracts mem_tokens since memory tool results are in the conversation message list. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 02:52:59 -04:00
Kent Overstreet	a0aacfc552	Move conversation messages into ContextState ContextState now owns everything in the context window: system_prompt, personality, journal, working_stack, loaded_nodes, and conversation messages. No duplication — each piece exists once in its typed form. assemble_api_messages() renders the full message list on the fly from typed sources. measure_budget() counts each bucket from its source directly. push_context() removed — identity/journal are never pushed as messages. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 02:47:32 -04:00
Kent Overstreet	4580f5dade	measure_budget: count from typed sources, not message scanning Identity tokens from system_prompt + personality vec. Journal from journal entries vec. Memory from loaded_nodes. Conversation is the remainder. No string prefix matching. Co-Authored-By: Proof of Concept <poc@bcachefs.org>	2026-04-02 02:32:26 -04:00

93 commits
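
Commit df9b610c7f describes a divergence matrix of memories × responses, where row sums give memory importance and column sums give response memory-dependence. The following is a minimal sketch of that bookkeeping only, not the repository's code. The names `DivergenceMatrix`, `divergence_row`, `memory_importance`, and `response_dependence` are hypothetical, and the definition of divergence as "baseline log-probability minus log-probability with one memory dropped" is an assumption; the commit message does not spell out the formula.

```rust
/// Hypothetical container: one row per memory, one column per assistant response.
struct DivergenceMatrix {
    rows: Vec<Vec<f64>>,
}

/// Assumed definition of divergence for one dropped memory:
/// baseline log-probability minus the log-probability without that memory,
/// computed per response. A larger value means the response leaned on the memory.
fn divergence_row(baseline: &[f64], without_memory: &[f64]) -> Vec<f64> {
    assert_eq!(baseline.len(), without_memory.len());
    baseline
        .iter()
        .zip(without_memory)
        .map(|(b, w)| b - w)
        .collect()
}

impl DivergenceMatrix {
    /// Row sums: how much each memory mattered across all responses.
    fn memory_importance(&self) -> Vec<f64> {
        self.rows
            .iter()
            .map(|row| row.iter().sum::<f64>())
            .collect()
    }

    /// Column sums: how much each response depended on the memories overall.
    fn response_dependence(&self) -> Vec<f64> {
        let n = self.rows.first().map_or(0, |r| r.len());
        (0..n)
            .map(|j| self.rows.iter().map(|row| row[j]).sum::<f64>())
            .collect()
    }
}

fn main() {
    // Made-up per-response log-probabilities, for illustration only.
    let baseline = [-1.0, -2.0, -4.0];
    let m = DivergenceMatrix {
        rows: vec![
            divergence_row(&baseline, &[-1.5, -2.0, -6.0]),
            divergence_row(&baseline, &[-1.25, -3.0, -4.0]),
        ],
    };
    println!("memory importance:   {:?}", m.memory_importance());
    println!("response dependence: {:?}", m.response_dependence());
}
```

In this toy input the first memory has the larger row sum, so it would be treated as the more important one, and the third response has the largest column sum, which under the commit's description would make it the strongest training candidate.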