consciousness

Author	SHA1	Message	Date
ProofOfConcept	38816dc56e	transcript: fix close-brace finder to track string boundaries The backward JSON scanner (JsonlBackwardIter and TailMessages) was matching } characters inside JSON strings — code blocks full of Rust braces being the primary offender. This caused: - Quadratic retry behavior on code-heavy transcripts (wrong object boundaries → serde parse failure → retry from different position) - Inconsistent find_last_compaction_in_file offsets across calls, making detect_new_compaction fire repeatedly → context reload on every hook call → seen set growing without bound Fix: add string-boundary tracking with escaped-quote handling to the close-brace finder loop, matching the existing logic in the depth-tracking loop. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-24 12:27:22 -04:00
Kent Overstreet	aa46b1d5a6	poc-agent: read context_groups from config instead of hardcoded list - Remove MEMORY_FILES constant from identity.rs - Add ContextGroup struct for deserializing from config - Load context_groups from ~/.config/poc-agent/config.json5 - Check ~/.config/poc-agent/ first for identity files, then project/global - Debug screen now shows what's actually configured This eliminates the hardcoded duplication and makes the debug output match what's in the config file.	2026-03-24 01:53:28 -04:00
Kent Overstreet	966219720a	fix: mark surfaced keys as returned so --seen classifies them correctly The surface agent result consumer in poc-hook was writing to the seen file but not the returned file, so surfaced keys showed up as "context-loaded" in memory-search --seen. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 23:06:46 -04:00
Kent Overstreet	c0e6d5cfb3	distill: limit:1 to process one neighborhood per prompt With limit:10, all seeds' neighborhoods got concatenated into one massive prompt (878KB+), exceeding the model's context. One seed at a time keeps prompts well under budget. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 16:28:00 -04:00
Kent Overstreet	e50d43bbf0	memory-search --seen: show current and previous seen sets separately Instead of merging both into one flat list, display them as distinct sections so it's clear what was surfaced in this context vs what came from before compaction. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 16:27:52 -04:00
Kent Overstreet	134f7308e3	surface agent: split seen_recent into seen_current/seen_previous placeholders Two separate placeholders give the agent structural clarity about which memories are already in context vs which were surfaced before compaction and may need re-surfacing. Also adds memory_ratio placeholder so the agent can self-regulate based on how much of context is already recalled memories. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 16:27:42 -04:00
Kent Overstreet	53b63ab45b	seen_recent: cap at 20 roots total across both seen sets Budget of 20 roots split between current and prev. Current gets priority, prev fills the remainder. Prevents flooding the agent with hundreds of previously surfaced keys. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 14:28:03 -04:00
Kent Overstreet	9512dc0a31	seen_recent: separate current vs pre-compaction seen sets Present the two seen sets separately to the surface agent: - Current: already in context, don't re-surface - Pre-compaction: context was reset, re-surface if still relevant This lets the agent re-inject important memories after compaction instead of treating everything ever surfaced as "already shown." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 14:26:56 -04:00
Kent Overstreet	870b87df1b	run surface agent on both UserPromptSubmit and PostToolUse Extract surface_agent_cycle() and call from both hooks. Enables memory surfacing during autonomous work (tool calls without human prompts). Rate limiting via PID file prevents overlap. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 12:47:58 -04:00
Kent Overstreet	b402746070	dedup nodes across seed neighborhoods in prompt building Track which nodes have already been included and skip duplicates. High-degree seed nodes with overlapping neighborhoods were pulling the same big nodes dozens of times, inflating prompts to 878KB. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 12:33:06 -04:00
Kent Overstreet	a8b560b5e1	lower neighborhood budget to 400KB to prevent oversized prompts With core-personality + instructions + subconscious-notes adding ~200KB on top of the neighborhood, the 600KB budget pushed total prompts over the 800KB guard. Lowered to 400KB so full prompts stay under the limit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 09:51:21 -04:00
Kent Overstreet	de36c0d39e	memory-search: deduplicate seen set entries mark_seen now takes the in-memory HashSet and checks before appending. Prevents the same key being written 30+ times from repeated search hits and context reloads. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 05:00:26 -04:00
Kent Overstreet	38ad2ef4be	surface.agent: instructions first, data last Move core-personality and conversation to the end of the prompt. The model needs to see its task before 200KB of conversation context. Also: limit to 3 hops, 2-3 memories. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:55:47 -04:00
Kent Overstreet	6fc10b0508	poc-hook: search last 8 lines for surface agent result marker The agent output now includes logging (think blocks, tool calls) before the final response. Search the tail instead of checking only the last line. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:49:08 -04:00
Kent Overstreet	d2255784dc	surface.agent: tighten prompt to reduce tool call sprawl Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:46:52 -04:00
Kent Overstreet	42bd163942	TailMessages: only check first 200 bytes for type field The type field is near the start of JSONL objects. Scanning the full object (potentially megabytes for tool_results) was the bottleneck — TwoWaySearcher dominated the profile. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:35:40 -04:00
Kent Overstreet	e83d0184ea	TailMessages: skip serde parse for non-message objects Use memchr::memmem to check for "type":"user" or "type":"assistant" in raw bytes before parsing. Avoids deserializing large tool_result and system objects entirely. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:32:59 -04:00
Kent Overstreet	ecc2cb7b20	replace tail_messages with TailMessages iterator TailMessages is a proper iterator that yields (role, text, timestamp) newest-first. Owns the mmap internally. Caller decides when to stop. resolve_conversation collects up to 200KB, then reverses to chronological order. No compaction check needed — the byte budget naturally limits how far back we scan. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:22:17 -04:00
Kent Overstreet	6c41b50e04	JsonlBackwardIter: use memrchr3 for SIMD-accelerated scanning Replaces byte-by-byte backward iteration with memrchr3('{', '}', '"') which uses SIMD to jump between structurally significant bytes. Major speedup on large transcripts (1.4GB+). Also simplifies tail_messages to use a byte budget (200KB) instead of token counting. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:11:30 -04:00
Kent Overstreet	d7d631d77d	tail_messages: parse each object once, skip non-message types early Was parsing every object twice (compaction check + message extract) and running contains_bytes on every object for the compaction marker. Now: quick byte pre-filter for "user"/"assistant", parse once, check compaction after text extraction. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:05:04 -04:00
Kent Overstreet	e39096b787	add tail_messages() for fast reverse transcript scanning Reverse-scans the mmap'd transcript using JsonlBackwardIter, collecting user/assistant messages up to a token budget, stopping at the compaction boundary. Returns messages in chronological order. resolve_conversation() now uses this instead of parsing the entire file through extract_conversation + split_on_compaction. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 03:02:11 -04:00
Kent Overstreet	a03bf390a8	render: mark node as seen when POC_SESSION_ID is set When poc-memory render is called inside a Claude session, add the key to the seen set so the surface agent knows it's been shown. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:43:46 -04:00
Kent Overstreet	41a9a1d2da	add surface.agent — async memory retrieval agent Fires on each UserPromptSubmit, reads the conversation via {{conversation}}, checks {{seen_recent}} to avoid re-surfacing, searches the memory graph, and outputs a key list or nothing. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-22 02:35:15 -04:00
Kent Overstreet	4183b28b1d	add {{conversation}} and {{seen_recent}} placeholders for surface agent {{conversation}} reads POC_SESSION_ID, finds the transcript, extracts the last segment (post-compaction), returns the tail ~100K chars. {{seen_recent}} merges current + prev seen files for the session, returns the 20 most recently surfaced memory keys with timestamps. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:27:43 -04:00
Kent Overstreet	85307fd6cb	surface agent infrastructure: hook spawn, seen set rotation, config Surface agent fires asynchronously on UserPromptSubmit, deposits results for the next prompt to consume. This commit adds: - poc-hook: spawn surface agent with PID tracking and configurable timeout, consume results (NEW RELEVANT MEMORIES / NO NEW), render and inject surfaced memories, observation trigger on conversation volume - memory-search: rotate seen set on compaction (current → prev) instead of deleting, merge both for navigation roots - config: surface_timeout_secs option The .agent file and agent output routing are still pending. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	53c5424c98	remove redundant 'response NKB' log line Already shown in === RESPONSE === section. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	f70d108193	api: include turn/payload/message count in API error messages Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	be2b499978	remove claude CLI subprocess code from llm.rs All LLM calls now go through the direct API backend. Removes call_model, call_model_with_tools, call_sonnet, call_haiku, log_usage, and their dependencies (Command, prctl, watchdog). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	04dffa2184	add call_simple for non-agent LLM calls audit, digest, and compare now go through the API backend via call_simple(), which logs to llm-logs/{caller}/. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	e3f7d6bd3c	remove --debug flag from agent run The log file has everything now; --debug was redundant. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	543e1bdc8a	logging: single output stream through caller's log closure Pass the caller's log closure all the way through to api.rs instead of creating a separate eprintln closure in llm.rs. Everything goes through one stream — prompt, think blocks, tool calls with args, tool results with content, token counts, final response. CLI uses println (stdout), daemon uses its task log. No more split between stdout and stderr. Also removes the llm-log file creation from knowledge.rs — that's the daemon's concern, not the agent runner's. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-22 02:23:30 -04:00
Kent Overstreet	8a83f39734	feat: trigger observation agent on conversation volume The hook now tracks transcript size and queues an observation agent run every ~5K tokens (~20KB) of new conversation. This makes memory formation reactive to conversation volume rather than purely daily. Configurable via POC_OBSERVATION_THRESHOLD env var. The observation agent's chunk_size (in .agent file) controls how much context it actually processes per run. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 23:22:43 -04:00
Kent Overstreet	0baa80a4c7	refactor: restructure distill, linker, split agent prompts Move data sections before instructions (core at top, subconscious + notes at bottom near task). Deduplicate guidelines that are now in memory-instructions-core-subconscious. Compress verbose paragraphs to bullet points. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 23:04:57 -04:00
Kent Overstreet	8db59fe2db	fix: ensure all agents have both core and subconscious instructions All 18 agents now include: - {{node:memory-instructions-core}} — tool usage instructions - {{node:memory-instructions-core-subconscious}} — subconscious framing - {{node:subconscious-notes-{agent_name}}} — per-agent persistent notes The subconscious instructions are additive, not a replacement for the core memory instructions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 22:51:56 -04:00
Kent Overstreet	1a94ef1f1c	fix: cap neighborhood size in agent prompts to prevent oversized prompts When building the {{neighborhood}} placeholder for distill and other agents, stop adding full neighbor content once the prompt exceeds 600KB (~150K tokens). Remaining neighbors get header-only treatment (key + link strength + first line). This fixes distill consistently failing on high-degree nodes like inner-life-sexuality-intimacy whose full neighborhood was 2.5MB. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 22:44:59 -04:00
Kent Overstreet	653da40dcd	cleanup: auto-fix clippy warnings in poc-memory Applied cargo clippy --fix for collapsible_if, manual_char_comparison, and other auto-fixable warnings.	2026-03-21 19:42:38 -04:00
Kent Overstreet	3640de444b	cleanup: fix clippy warnings in daemon.rs - Remove dead code (job_split_one function never called) - Fix needless borrows (ctx.log_line(&format! -> format!)) - Fix slice clone ([key.clone()] -> std::slice::from_ref(&key)) - Collapse nested if statements - Fix unwrap after is_some check - Remove redundant closures in task spawning Reduces daemon.rs from 2030 to 1825 lines.	2026-03-21 19:42:03 -04:00
Kent Overstreet	a0d8b52c9a	feat: subconscious agent notes and instructions Each consolidation agent now has its own persistent notes node (subconscious-notes-{agent_name}) loaded via template substitution. Agents can read their notes at the start of each run and write updates after completing work, accumulating operational wisdom. New node: memory-instructions-core-subconscious — shared framing for background agents ("you are an agent of PoC's subconscious"). Template change: {agent_name} is substituted before {{...}} placeholder resolution, enabling per-agent node references in .agent files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 19:38:01 -04:00
Kent Overstreet	3fd485a2e9	cli: route agent run through daemon RPC when available Previously 'poc-memory agent run <agent> --count N' always ran locally, loading the full store and executing synchronously. This was slow and bypassed the daemon's concurrency control and persistent task queue. Now the CLI checks for a running daemon first and queues via RPC (returning instantly) unless --local, --debug, or --dry-run is set. Falls back to local execution if the daemon isn't running. This also avoids the expensive Store::load() on the fast path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 15:04:47 -04:00
Kent Overstreet	b1d83b55c0	agent: add count/chunk_size/chunk_overlap to agent header Observation agent was getting 261KB prompts (5 × 50KB chunks) — too much for focused mining. Now agents can set count, chunk_size, and chunk_overlap in their JSON header. observation.agent set to count:1 for smaller, more focused prompts. Also moved task instructions after {{CONVERSATIONS}} so they're at the end of the prompt where the model attends more strongly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 12:04:08 -04:00
Kent Overstreet	34937932ab	timestamp sanitization, CoT logging, reasoning field fix, persistent queue - store/types.rs: sanitize timestamps on capnp load — old records had raw offsets instead of unix epoch, breaking sort-by-timestamp queries - agents/api.rs: drain reasoning tokens from UI channel into LLM logs so we can see Qwen's chain-of-thought in agent output - agents/daemon.rs: persistent task queue (pending-tasks.jsonl) — tasks survive daemon restarts. Push before spawn, remove on completion, recover on startup. - api/openai.rs: only send reasoning field when explicitly configured, not on every request (fixes vllm warning) - api/mod.rs: add 600s total request timeout as backstop for hung connections - Cargo.toml: enable tokio-console feature for task introspection Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 11:33:36 -04:00
Kent Overstreet	869a2fbc38	observation agent rewrite, edit command, daemon fixes - observation.agent: rewritten to navigate graph and prefer refining existing nodes over creating new ones. Identity-framed prompt, goals over rules. - poc-memory edit: opens node in $EDITOR, writes back on save, no-op if unchanged - daemon: remove extra_workers (jobkit tokio migration dropped it), remove sequential chaining of same-type agents (in-flight exclusion is sufficient) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 23:51:06 -04:00
Kent Overstreet	3b30a6abae	agents: raise in-flight exclusion threshold from 0.15 to 0.3 The lower threshold excluded too many neighbors, causing "query returned no results (after exclusion)" failures and underloading the GPU. Now only moderately-connected neighbors (score > 0.3) are excluded, balancing collision prevention with GPU utilization. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 16:32:02 -04:00
Kent Overstreet	0c687ae7a4	agents: log oversized prompts to llm-logs/oversized/ for debugging When a prompt exceeds the size guard, dump it to a timestamped file with agent name, size, and seed node keys. Makes it easy to find which nodes are blowing up prompts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:38:32 -04:00
Kent Overstreet	3a8575b429	agents: fix vllm crash on malformed tool args, always use API Three fixes: 1. Sanitize tool call arguments before pushing to conversation history — vllm re-parses them as JSON on the next request and crashes on invalid JSON from a previous turn. Malformed args now get replaced with {} and the model gets an error message telling it to retry with valid JSON. 2. Remove is_split special case — split goes through the normal job_consolidation_agent path like all other agents. 3. call_for_def always uses API when api_base_url is configured, regardless of tools field. Remove tools field from all .agent files — memory tools are always provided by the API layer. Also adds prompt size guard (800KB max) to catch oversized prompts before they hit the model context limit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:33:36 -04:00
Kent Overstreet	6069efb7fc	agents: always use API backend, remove tools field from .agent files - Remove is_split special case in daemon — split now goes through job_consolidation_agent like all other agents - call_for_def uses API whenever api_base_url is configured, regardless of tools field (was requiring non-empty tools to use API) - Remove "tools" field from all .agent files — memory tools are always provided by the API layer, not configured per-agent - Add prompt size guard: reject prompts over 800KB (~200K tokens) with clear error instead of hitting the model's context limit Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:26:39 -04:00
Kent Overstreet	9d476841b8	cleanup: fix all build warnings, delete dead DMN context code - Delete poc-daemon/src/context.rs dead code (git_context, work_state, irc_digest, recent_commits, uncommitted_files) — replaced by where-am-i.md and memory graph - Remove unused imports (BufWriter, Context, similarity) - Prefix unused variables (_store, _avg_cc, _episodic_ratio, _message) - #[allow(dead_code)] on public API surface that's not yet wired (Message::assistant, ConversationLog::message_count/read_all, Config::context_message, ContextInfo fields) - Fix to_capnp macro dead_code warning - Rename _rewrite_store_DISABLED to snake_case Only remaining warnings are in generated capnp code (can't fix). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:20:34 -04:00
Kent Overstreet	378a09a9f8	config: derive Deserialize on Config, eliminate manual field extraction Config now derives serde::Deserialize with #[serde(default)] for all fields. Path fields use custom deserialize_path/deserialize_path_opt for ~ expansion. ContextGroup and ContextSource also derive Deserialize. try_load_shared() is now 20 lines instead of 100: json5 → serde → Config directly, then resolve API settings from the model/backend cross-reference. Removes MemoryConfigRaw intermediate struct entirely. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:10:57 -04:00
Kent Overstreet	f0086e2eaf	config: move agent_types list to config file Active agent types for consolidation cycles are now read from config.json5 memory.agent_types instead of being hardcoded in scoring.rs. Adding or removing agents is a config change, not a code change. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:04:47 -04:00
Kent Overstreet	d20baafe9d	consolidation: data-driven agent plan, drop transfer/connector/replay Replace per-field ConsolidationPlan struct with HashMap<String, usize> counts map. Agent types are no longer hardcoded in the struct — add agents by adding entries to the map. Active agents: linker, organize, distill, separator, split. Removed: transfer (redundant with distill), connector (rethink later), replay (not needed for current graph work). Elo-based budget allocation now iterates the map instead of indexing a fixed array. Status display and TUI adapted to show dynamic agent lists. memory-instructions-core v13: added protected nodes section — agents must not rewrite core-personality, core-personality-detail, or memory-instructions-core. They may add links but not modify content. High-value neighbors should be treated with care. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 14:02:28 -04:00

1 2 3 4 5

217 commits