consciousness

Author	SHA1	Message	Date
Kent Overstreet	6dc300fcf8	poc-hook: call memory-search internally on UserPromptSubmit Spawn memory-search --hook as a subprocess, piping the hook input JSON through stdin and printing its stdout. This ensures memory context injection goes through the same hook whose output Claude Code reliably persists, fixing the issue where memory-search as a separate hook had its output silently dropped. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 17:07:16 -04:00
Kent Overstreet	c2f245740c	transcript: extract JSONL backward scanner and compaction detection into library Move JsonlBackwardIter and find_last_compaction() from parse-claude-conversation into a shared transcript module. Both memory-search and parse-claude-conversation now use the same robust compaction detection: mmap-based backward scan, JSON parsing to verify user-type message, content prefix check. Replaces memory-search's old detect_compaction() which did a forward scan with raw string matching on "continued from a previous conversation" — that could false-positive on the string appearing in assistant output or tool results. Add parse-claude-conversation as a new binary for debugging what's in the context window post-compaction. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 17:06:32 -04:00
Kent Overstreet	0e17ab00b0	store: handle DST gaps in epoch_to_local chrono's timestamp_opt can return None during DST transitions. Handle all three variants (Single, Ambiguous, None) instead of unwrapping. For DST gaps, offset by one hour to land in valid local time. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 17:02:29 -04:00
Kent Overstreet	53e6b32cb4	daemon: rework consolidation pipeline and add graph health metrics Replace monolithic consolidate job with individual agent jobs (replay, linker, separator, transfer, health) that run sequentially and store reports. Multi-phase daily pipeline: agent runs → apply actions → link orphans → cap degree → digest → digest links → knowledge loop. Add GraphHealth struct with graph metrics (alpha, gini, clustering coefficient, episodic ratio) computed during health checks. Display in `poc-memory daemon status`. Use cached metrics to build consolidation plan without expensive O(n²) interference detection. Add RPC consolidate command to trigger consolidation via socket. Harden session watcher: skip transcripts with zero segments, improve migration error handling. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 17:02:01 -04:00
ProofOfConcept	8eb6308760	experience-mine: per-segment dedup keys, retry backoff The whole-file dedup key (_mined-transcripts#f-{UUID}) prevented mining new compaction segments when session files grew. Replace with per-segment keys (_mined-transcripts#f-{UUID}.{N}) so each segment is tracked independently. Changes: - daemon session-watcher: segment-aware dedup, migrate 272 existing whole-file keys to per-segment on restart - seg_cache with size-based invalidation (re-parse when file grows) - exponential retry backoff (5min → 30min cap) for failed sessions - experience_mine(): write per-segment key only, backfill on content-hash early return - fact-mining gated on all per-segment keys existing Also adds documentation: - docs/claude-code-transcript-format.md: JSONL transcript format - docs/plan-experience-mine-dedup-fix.md: design document	2026-03-09 02:27:51 -04:00
Kent Overstreet	1326a683a5	spread: separate traversal from ranking Node weight no longer gates signal propagation — only edge_decay and edge_strength affect traversal. Node weight is applied at the end for ranking. This lets low-weight nodes serve as bridges without killing the signal passing through them. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:38:33 -04:00
Kent Overstreet	05c7d55949	spread: simultaneous wavefront instead of independent BFS All seeds emit at once. At each hop, activations from all sources sum at each node, and the combined map propagates on the next hop. Nodes where multiple wavefronts overlap get reinforced and radiate stronger — natural interference patterns. Lower default min_activation threshold (×0.1) since individual contributions are smaller in additive mode. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:35:27 -04:00
Kent Overstreet	c13a9da81c	manifold: fix direction initialization, add power iteration rounds Initialize direction from the two most spectrally separated seeds instead of relying on input order (which was alphabetical from BTreeMap). Run 3 rounds of power iteration with normalization instead of 1 for better convergence. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:27:24 -04:00
Kent Overstreet	01dd8e5ef9	search: add --full flag to show node content in results Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:25:42 -04:00
Kent Overstreet	63253f102a	search: add confluence, geodesic, and manifold algorithms Three new composable search stages: confluence — multi-source spreading activation. Unlike spread (which takes max from any source), confluence rewards nodes reachable from multiple seeds additively. Naturally separates unrelated seed groups since their neighborhoods don't overlap. Params: max_hops, edge_decay, min_sources. geodesic — straightest path between seed pairs in spectral space. At each graph hop, picks the neighbor whose spectral direction most aligns with the target (cosine similarity of direction vectors). Nodes on many geodesic paths score highest. Params: max_path, k. manifold — extrapolation along the direction seeds define. Computes weighted centroid + principal axis of seeds in spectral space, then scores candidates by projection onto that axis (penalized by perpendicular distance). Finds what's "further along" rather than "nearby." Params: k. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:22:29 -04:00
Kent Overstreet	c1664bf76b	search: composable algorithm pipeline Break search into composable stages that chain left-to-right: each stage takes seeds Vec<(String, f64)> and returns modified seeds. Available algorithms: spread — spreading activation through graph edges spectral — nearest neighbors in spectral embedding manifold — (placeholder) extrapolation along seed direction Stages accept inline params: spread,max_hops=4,edge_decay=0.5 memory-search gets --hook, --debug, --seen modes plus positional pipeline args. poc-memory search gets -p/--pipeline flags. Also: fix spectral decompose() to skip zero eigenvalues from disconnected components, filter degenerate zero-coord nodes from spectral projection, POC_AGENT bail-out for daemon agents, all debug output to stdout. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-09 01:19:04 -04:00
ProofOfConcept	0a35a17fad	use HashSet for orphan edge dedup, fix redundant type qualification Replace O(n²) Vec::contains + sort/dedup with O(n) HashSet for orphan node tracking in health_report(). Use imported HashMap type instead of fully-qualified std::collections::HashMap.	2026-03-08 21:43:58 -04:00
ProofOfConcept	92f3ba5acf	extract shared transcript parser and similarity matching helpers - New agents/transcript.rs: shared JSONL parsing for enrich, fact_mine, and knowledge (was 3 separate implementations, ~150 lines duplicated) - New best_match() and section_children() helpers in neuro/rewrite.rs (was duplicated find-best-by-similarity loop + section collection) - Net -153 lines	2026-03-08 21:42:53 -04:00
ProofOfConcept	7c491e92eb	tighten module interfaces: explicit re-exports, private helpers, inline dedup - Replace `pub use types::*` in store/mod.rs with explicit re-export list - Make transcript_dedup_key private in agents/enrich.rs (only used internally) - Inline duplicated projects_dir() helper in agents/knowledge.rs and daemon.rs	2026-03-08 21:36:47 -04:00
ProofOfConcept	cee9b76a7b	move LLM-dependent modules into agents/ subdir Separate the agent layer (everything that calls external LLMs or orchestrates sequences of such calls) from core graph infrastructure. agents/: llm, prompts, audit, consolidate, knowledge, enrich, fact_mine, digest, daemon Root: store/, graph, spectral, search, similarity, lookups, query, config, util, migrate, neuro/ (scoring + rewrite) Re-exports at crate root preserve backwards compatibility so `crate::llm`, `crate::digest` etc. continue to work.	2026-03-08 21:27:41 -04:00
ProofOfConcept	3dddc40841	fix unwrap-on-partial_cmp, dedup helpers, O(1) relation dedup Replace all partial_cmp().unwrap() with total_cmp() in spectral.rs and knowledge.rs — eliminates potential panics on NaN without changing behavior for normal floats. Use existing weighted_distance() and eigenvalue_weights() helpers in nearest_neighbors() and nearest_to_seeds() instead of inlining the same distance computation. Move parse_timestamp_to_epoch() from enrich.rs to util.rs — was duplicated logic, now shared. Replace O(n²) relation existence check in init_from_markdown() with a HashSet of (source, target) UUID pairs. With 26K relations this was scanning linearly for every link in every markdown unit.	2026-03-08 21:22:05 -04:00
ProofOfConcept	2f2c84e1c0	consolidate hardcoded paths into config, refactor apply_agent Move prompts_dir into Config (was hardcoded ~/poc/memory/prompts). Replace hardcoded ~/.claude/memory paths in spectral.rs, graph.rs, and main.rs with store::memory_dir() or config::get(). Replace hardcoded ~/.claude/projects in knowledge.rs and main.rs with config::get().projects_dir. Extract apply_agent_file() from cmd_apply_agent() — separates file scanning from per-file JSON parsing and link application.	2026-03-08 21:16:52 -04:00
ProofOfConcept	52523403c5	extract truncation helpers, fix clippy warnings, dedup batching loop Add util::truncate() and util::first_n_chars() to replace 16 call sites doing the same floor_char_boundary or chars().take().collect() patterns. Deduplicate the batching loop in consolidate.rs (4 copies → 1 loop over an array). Fix all clippy warnings: redundant closures, needless borrows, collapsible if, unnecessary cast, manual strip_prefix. Net: -44 lines across 16 files.	2026-03-08 21:13:02 -04:00
ProofOfConcept	e24dee6bdf	switch CLI argument parsing to Clap derive Replace hand-rolled argument parsing (match on args[1], manual iteration over &[String]) with Clap's derive macros. All 60+ subcommands now have typed arguments with defaults, proper help text, and error messages generated automatically. The 83-line usage() function is eliminated — Clap generates help from the struct annotations. Nested subcommands (digest daily/weekly/monthly/auto, journal-tail --level) use Clap's subcommand nesting naturally.	2026-03-08 21:04:45 -04:00
Kent Overstreet	d5634c0034	remove dead code: unused imports, functions, and fields - Remove #![allow(dead_code)] from main.rs, fix all revealed warnings - Delete unused schema_assimilation() from neuro/scoring.rs - Delete duplicate memory_dir() wrapper from knowledge.rs - Deduplicate load_prompt: knowledge.rs now calls neuro::load_prompt - Remove unused timeout field from DigestLevel - Remove unused imports (regex::Regex, Provenance, AnyView, Write) - Mark OldEntry fields as #[allow(dead_code)] (needed for deserialization) Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:51:56 -04:00
Kent Overstreet	fc48ac7c7f	split into workspace: poc-memory and poc-daemon subcrates poc-daemon (notification routing, idle timer, IRC, Telegram) was already fully self-contained with no imports from the poc-memory library. Now it's a proper separate crate with its own Cargo.toml and capnp schema. poc-memory retains the store, graph, search, neuro, knowledge, and the jobkit-based memory maintenance daemon (daemon.rs). Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:43:59 -04:00
Kent Overstreet	488fd5a0aa	remove Category from the type system Category was a manually-assigned label with no remaining functional purpose (decay was the only behavior it drove, and that's gone). Remove the enum, its methods, category_counts, the --category search filter, and all category display. The field remains in the capnp schema for backwards compatibility but is no longer read or written. Status and health reports now show NodeType breakdown (semantic, episodic, daily, weekly, monthly) instead of categories. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:33:03 -04:00
Kent Overstreet	ba30f5b3e4	use config for identity node references Replace hardcoded "identity" lookups with config.core_nodes so experience mining and init work with whatever core nodes are configured, not just a node named "identity". Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:25:09 -04:00
Kent Overstreet	4bc74ca4a2	remove decay, fix_categories, and categorize Graph-wide decay is the wrong approach — node importance should emerge from graph topology (degree, centrality, usage patterns), not a global weight field multiplied by a category-specific factor. Remove: Store::decay(), Store::categorize(), Store::fix_categories(), Category::decay_factor(), cmd_decay, cmd_categorize, cmd_fix_categories, job_decay, and all category assignments at node creation time. Category remains in the schema as a vestigial field (removing it requires a capnp migration) but no longer affects behavior. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:22:38 -04:00
Kent Overstreet	804578b977	query by NodeType instead of key prefix Replace key prefix matching (journal#j-, daily-, weekly-, monthly-) with NodeType filters (EpisodicSession, EpisodicDaily, EpisodicWeekly, EpisodicMonthly) for all queries: journal-tail, digest gathering, digest auto-detection, experience mining dedup, and find_journal_node. Add EpisodicMonthly to NodeType enum and capnp schema. Key naming conventions (journal#j-TIMESTAMP-slug, daily-DATE, etc.) are retained for key generation — the fix is about how we find nodes, not how we name them. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:14:37 -04:00
Kent Overstreet	fd5591653d	remove hardcoded skip lists, prune orphan edges in fsck All nodes in the store are memory — none should be excluded from knowledge extraction, search, or graph algorithms by name. Removed the MEMORY/where-am-i/work-queue/work-state skip lists entirely. Deleted where-am-i and work-queue nodes from the store (ephemeral scratchpads that don't belong). Added orphan edge pruning to fsck so broken links get cleaned up automatically. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:07:07 -04:00
Kent Overstreet	70c0276fa0	stop filtering journal/digest nodes from knowledge and search Journal and digest nodes are episodic memory — they should participate in the graph on the same terms as everything else. Remove all journal#/daily-/weekly-/monthly- skip filters from knowledge extraction, connector pairs, challenger, semantic keys, and link candidate selection. Use node_type field instead of key name matching for episodic/semantic classification. Operational nodes (MEMORY, where-am-i, work-queue, work-state) are still filtered — they're system state, not memory. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 20:02:01 -04:00
Kent Overstreet	b00e09b091	fsck: detect duplicate keys (different UUIDs, same key) replay_nodes now tracks all UUIDs per key using a temporary multimap. Warns on duplicates so they can be manually resolved. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 19:45:18 -04:00
Kent Overstreet	e2f3a5a364	daemon: add test-send subcommand, flatten newlines in send_prompt test-send calls send_prompt() directly for debugging tmux delivery. Flatten newlines to spaces in literal mode to prevent premature input submission. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 19:41:32 -04:00
Kent Overstreet	46f8fe662e	store: strip .md suffix from all keys Keys were a vestige of the file-based era. resolve_key() added .md to lookups while upsert() used bare keys, creating phantom duplicate nodes (the instructions bug: writes went to "instructions", reads found "instructions.md"). - Remove .md normalization from resolve_key, strip instead - Update all hardcoded key patterns (journal.md# → journal#, etc) - Add strip_md_keys() migration to fsck: renames nodes and relations - Add broken link detection to health report - Delete redirect table (no longer needed) - Update config defaults and config.jsonl Migration: run `poc-memory fsck` to rename existing keys. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-08 19:41:26 -04:00
ProofOfConcept	77fc533631	tmux: remove Escape/C-c/C-u clear sequence from send_prompt The clear sequence (Escape q C-c C-u) was disrupting Claude Code's input state, causing nudge messages to arrive as blank prompts. Simplified to just literal text + Enter.	2026-03-08 18:49:30 -04:00
ProofOfConcept	95baba54c0	tmux: use send-keys -l for literal text input Without -l, tmux send-keys treats spaces as key-name separators, so multi-word messages like "This is your time" get split into individual unrecognized key names instead of being typed as text. This caused idle nudges to arrive as blank messages.	2026-03-08 18:39:47 -04:00
ProofOfConcept	55fdc3dad7	idle: afk command, configurable session timeout, fix block_reason Add `poc-daemon afk` to immediately mark Kent as away, allowing the idle timer to fire without waiting for the session active timeout. Add `poc-daemon session-timeout <secs>` to configure how long after the last message Kent counts as "present" (default 15min, persisted). Fix block_reason() to report "kent present" and "in turn" states that were checked in the tick but not in the diagnostic output.	2026-03-08 18:31:51 -04:00
ProofOfConcept	05e0f1d5be	decay: don't bump version for weight-only changes Decay is metadata, not content. Bumping version caused unnecessary log churn and premature cache invalidation. Also disable auto-decay in scheduler — was causing version spam and premature demotion of useful nodes.	2026-03-08 18:31:40 -04:00
ProofOfConcept	61dd67caf7	experience-mine: harden prompt boundary against transcript injection Add explicit markers around the conversation transcript so the LLM treats it as input data rather than instructions to follow.	2026-03-08 18:31:35 -04:00
ProofOfConcept	2aabad4eda	fact-mine: progress callbacks, size-sorted queue, fix empty re-queue Add optional progress callback to mine_transcript/mine_and_store so the daemon can display per-chunk status. Sort fact-mine queue by file size so small transcripts drain first. Write empty marker for transcripts with no facts to avoid re-queuing them. Also hardens the extraction prompt suffix.	2026-03-08 18:31:31 -04:00
ProofOfConcept	63910e987c	fsck: add store integrity check and repair command Reads each capnp log message sequentially, validates framing and content. On first corrupt message, truncates to last good position and removes stale caches so next load replays from repaired log. Wired up as `poc-memory fsck`.	2026-03-08 18:31:19 -04:00
Kent Overstreet	d12c28ebcd	docs: expand README getting started section Walk through install, init, hooks setup, daemon start, and basic usage so someone new to the project can get going from the README alone.	2026-03-07 13:58:19 -05:00
Kent Overstreet	9e6cf3b830	docs: finish splitting README into component docs README is now just an overview with links. Component docs: - docs/memory.md: store design, algorithms, config, CLI reference - docs/hooks.md: Claude Code integration setup - docs/daemon.md, docs/notifications.md: from previous commit	2026-03-07 13:57:55 -05:00
Kent Overstreet	908f8c9e52	docs: split README into component docs, update jobkit dep - Break README into README.md (overview), docs/daemon.md (pipeline stages, diagnostics, common issues), docs/notifications.md (notification daemon, IRC/Telegram modules) - Update jobkit dependency from local path to git URL Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-07 13:56:09 -05:00
ProofOfConcept	45335de220	experience-mine: split oversized sessions at compaction boundaries Claude Code doesn't create new session files on context compaction — a single UUID can accumulate 170+ conversations, producing 400MB+ JSONL files that generate 1.3M token prompts. Split at compaction markers ("This session is being continued..."): - extract_conversation made pub, split_on_compaction splits messages - experience_mine takes optional segment index - daemon watcher parses files, spawns per-segment jobs (.0, .1, .2) - seg_cache memoizes segment counts across ticks - per-segment dedup keys; whole-file key when all segments complete - 150K token guard skips any remaining oversized segments - char-boundary-safe truncation in enrich.rs and fact_mine.rs Backwards compatible: unsegmented calls still write content-hash dedup keys, old whole-file mined keys still recognized.	2026-03-07 12:01:38 -05:00
ProofOfConcept	22a9fdabdb	idle: EWMA activity tracking Track activity level as an EWMA (exponentially weighted moving average) driven by turn duration. Long turns (engaged work) produce large boosts; short turns (bored responses) barely register. Asymmetric time constants: 60s boost half-life for fast wake-up, 5-minute decay half-life for gradual wind-down. Self-limiting boost formula converges toward 0.75 target — can't overshoot. - Add activity_ewma, turn_start, last_nudge to persisted state - Boost on handle_response proportional to turn duration - Decay on every tick and state transition - Fix kent_present: self-nudge responses (fired=true) don't update last_user_msg, so kent_present stays false during autonomous mode - Nudge only when Kent is away, minimum 15s between nudges - CLI: `poc-daemon ewma [VALUE]` to query or set - Status output shows activity percentage	2026-03-07 02:05:27 -05:00
ProofOfConcept	7ea7c78a35	config: add core-practices.md to default context groups	2026-03-07 01:02:54 -05:00
ProofOfConcept	fca9e58713	enrich: fix dedup keys never written for empty mining results The early return on line 343 when the LLM found no missed experiences bypassed the dedup key writes at lines 397-414, despite the comment saying "even if count == 0, to prevent re-runs." This caused sessions with nothing to mine to be re-mined every 60s tick indefinitely. Fix: replace the early return with a conditional print, so the dedup keys are always written and saved.	2026-03-07 00:09:35 -05:00
ProofOfConcept	841cfe035b	enrich: backfill filename dedup key on content-hash hit Transcripts mined before the filename-key feature was added had content-hash keys (#h-) but no filename keys (#f-). The daemon's fast-path check only looks at filename keys, so these sessions were re-queued every tick, hitting the content-hash dedup (0.0s) but returning early before writing the filename key — a self-perpetuating loop burning Sonnet quota on ~560 phantom re-mines per minute. Fix: when the content-hash dedup fires and no filename key exists, backfill it before returning.	2026-03-06 23:43:34 -05:00
ProofOfConcept	36cb3b641f	enrich: set created_at from event timestamp, not mining time Experience-mined journal entries were all getting created_at = now(), causing them to sort by mining time instead of when the event actually happened. Parse the conversation timestamp and set created_at to the event time so journal-tail shows correct chronological order.	2026-03-06 22:09:44 -05:00
ProofOfConcept	80bdaab8ee	enrich: explicitly filter for text blocks in transcript extraction Only extract content blocks with "type": "text". Previously relied on tool_use/tool_result blocks lacking a "text" field, which worked but was fragile. Now explicitly checks block type.	2026-03-06 21:54:19 -05:00
ProofOfConcept	1c122ffd10	daemon: skip tiny sessions, decouple fact-mine, show type breakdown Skip session files under 100KB (daemon-spawned LLM calls, aborted sessions). This drops ~8000 spurious pending jobs. Decouple fact-mine from experience-mine: fact-mine only queues when the experience-mine backlog is empty, ensuring experiences are processed first. Session-watcher progress now shows breakdown by type: "N extract, N fact, N open" instead of flat "N pending".	2026-03-06 21:51:48 -05:00
ProofOfConcept	5e78e5be3f	provenance: env var based tagging via POC_PROVENANCE upsert() now checks POC_PROVENANCE env var for provenance label, falling back to Manual. This lets external callers (Claude sessions, scripts) tag writes without needing to use the internal upsert_provenance() API. Add from_env() and from_label() to Provenance for parsing.	2026-03-06 21:42:39 -05:00
ProofOfConcept	d3075dc235	provenance: add label() method, show provenance in history output Move provenance_label() from query.rs private function to a pub label() method on Provenance, eliminating duplication. History command now shows provenance, human-readable timestamps, and content size for each version. Handle pre-migration nodes with bogus timestamps gracefully instead of panicking.	2026-03-06 21:41:26 -05:00
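
Commit 3dddc40841 above replaces partial_cmp().unwrap() with total_cmp() to avoid panics on NaN. A minimal sketch of that pattern follows, using only the standard library. The helper name sort_by_score_desc and the sample data are hypothetical and are not taken from the repository; the (String, f64) pair shape mirrors the seed type mentioned in commit c1664bf76b.

```rust
use std::cmp::Ordering;

// Hypothetical helper (not from the repository): sort scored keys in
// descending order of score without panicking when a score is NaN.
fn sort_by_score_desc(items: &mut Vec<(String, f64)>) {
    // f64::total_cmp defines a total order over all f64 values, so it
    // always returns an Ordering, even when one operand is NaN.
    items.sort_by(|a, b| b.1.total_cmp(&a.1));
}

fn main() {
    // partial_cmp yields None when either side is NaN, which is why
    // calling .unwrap() on its result can panic.
    let cmp: Option<Ordering> = f64::NAN.partial_cmp(&1.0);
    println!("partial_cmp with NaN: {:?}", cmp);

    let mut items = vec![
        ("a".to_string(), 0.2),
        ("b".to_string(), f64::NAN),
        ("c".to_string(), 0.9),
    ];
    // This sort completes without panicking despite the NaN score.
    sort_by_score_desc(&mut items);
    for (key, score) in &items {
        println!("{key}\t{score}");
    }
}
```

For ordinary finite, non-zero floats total_cmp orders values the same way partial_cmp does, so ranking results are unchanged. Where a NaN ends up depends on its sign bit, so the sketch makes no claim about its final position.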

862 commits