consciousness

Author	SHA1	Message	Date
ProofOfConcept	552d255dc3	migrate agent output to capnp store, add provenance tracking All agent output now goes to the store as nodes instead of markdown/JSON files. Each node carries a Provenance enum identifying which agent created it (AgentDigest, AgentConsolidate, AgentFactMine, AgentKnowledgeObservation, etc — 14 variants total). Store changes: - upsert_provenance() method for agent-created nodes - Provenance enum expanded from 5 to 14 variants Agent changes: - digest: writes to store nodes (daily-YYYY-MM-DD.md etc) - consolidate: reports/actions/logs stored as _consolidation-* nodes - knowledge: depth DB and agent output stored as _knowledge-* nodes - enrich: experience-mine results go directly to store - llm: --no-session-persistence prevents transcript accumulation Deleted: 14 Python/shell scripts replaced by Rust implementations.	2026-03-05 15:30:57 -05:00
ProofOfConcept	e37f819dd2	daemon: background job orchestration for memory maintenance Replace fragile cron+shell approach with `poc-memory daemon` — a single long-running process using jobkit for worker pool, status tracking, retry, cancellation, and resource pools. Jobs: - session-watcher: detects ended Claude sessions, triggers extraction - scheduler: runs daily decay, consolidation, knowledge loop, digests - health: periodic graph metrics check - All Sonnet API calls serialized through a ResourcePool(1) Status queryable via `poc-memory daemon status`, structured log via `poc-memory daemon log`. Phase 1: shells out to existing subcommands. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-05 13:18:00 -05:00
ProofOfConcept	4747004b36	types: unify all epoch timestamps to i64 All epoch timestamp fields (timestamp, last_replayed, created_at on nodes; timestamp on relations) are now i64. Previously a mix of f64 and i64 which caused type seams and required unnecessary casts. - Kill now_epoch() -> f64 and now_epoch_i64(), replace with single now_epoch() -> i64 - All formatting functions take i64 - new_node() sets created_at automatically - journal-ts-migrate handles all nodes, with valid_range check to detect garbage from f64->i64 bit reinterpretation - capnp schema: Float64 -> Int64 for all timestamp fields	2026-03-05 10:23:57 -05:00
Kent Overstreet	b4bbafdf1c	search: trim default output to 5 results, gate spectral with --expand Default search was 15 results + 5 spectral neighbors — way too much for the recall hook context window. Now: 5 results by default, no spectral. --expand restores the full 15 + spectral output. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-03 18:44:44 -05:00
Kent Overstreet	ca0c8cfac6	add daily lookup counter for memory retrieval tracking Mmap'd open-addressing hash table (~49KB/day) records which memory keys get retrieved. FNV-1a hash, linear probing, 4096 slots. - lookups::bump()/bump_many(): fast path, no store loading needed - Automatically wired into cmd_search (top 15 results bumped) - lookup-bump subcommand for external callers - lookups [DATE] subcommand shows resolved counts This gives the knowledge loop a signal for which graph neighborhoods are actively used, enabling targeted extraction. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-03 18:36:25 -05:00
ProofOfConcept	a9b90f881e	digest: unify gather/find with composable date_range + date_to_label Each DigestLevel now carries two date-math fn pointers: - label_dates: expand an arg into (label, dates covered) - date_to_label: map any date to this level's label Parent gather works by expanding its date range then mapping those dates through the child level's date_to_label to derive child labels. find_candidates groups journal dates through date_to_label and skips the current period. This eliminates six per-level functions (gather_daily/weekly/monthly, find_daily/weekly/monthly_args) and the three generate_daily/weekly/monthly public entry points in favor of one generic gather, one generic find_candidates, and one public generate(store, level_name, arg).	2026-03-03 18:04:21 -05:00
Kent Overstreet	f4364e299c	replace libc date math with chrono, extract memory_subdir helper - date_to_epoch, iso_week_info, weeks_in_month: replaced unsafe libc (mktime, strftime, localtime_r) with chrono NaiveDate and IsoWeek - epoch_to_local: replaced unsafe libc localtime_r with chrono Local - New util.rs with memory_subdir() helper: ensures subdir exists and propagates errors instead of silently ignoring them - Removed three duplicate agent_results_dir() definitions across digest.rs, consolidate.rs, enrich.rs - load_digest_files, parse_all_digest_links, find_consolidation_reports now return Result to properly propagate directory creation errors Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-03 17:23:43 -05:00
Kent Overstreet	50da0b7b26	digest: split into focused modules, externalize prompts digest.rs was 2328 lines containing 6 distinct subsystems. Split into: - llm.rs: shared LLM utilities (call_sonnet, parse_json_response, semantic_keys) - audit.rs: link quality audit with parallel Sonnet batching - enrich.rs: journal enrichment + experience mining - consolidate.rs: consolidation pipeline + apply Externalized all inline prompts to prompts/*.md templates using neuro::load_prompt with {{PLACEHOLDER}} syntax: - daily-digest.md, weekly-digest.md, monthly-digest.md - experience.md, journal-enrich.md, consolidation.md digest.rs retains temporal digest generation (daily/weekly/monthly/auto) and date helpers. ~940 lines, down from 2328. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-03-03 17:18:18 -05:00
ProofOfConcept	635da6d3e2	split capnp_store.rs into src/store/ module hierarchy capnp_store.rs (1772 lines) → four focused modules: store/types.rs — types, macros, constants, path helpers store/parse.rs — markdown parsing (MemoryUnit, parse_units) store/view.rs — StoreView trait, MmapView, AnyView store/mod.rs — Store impl methods, re-exports new_node/new_relation become free functions in types.rs. All callers updated: capnp_store:: → store::	2026-03-03 12:56:15 -05:00
ProofOfConcept	70a5f05ce0	capnp_store: remove dead code, consolidate CRUD API Dead code removed: - rebuild_uuid_index (never called, index built during load) - node_weight inherent method (all callers use StoreView trait) - node_community (no callers) - state_json_path (no callers) - log_retrieval, log_retrieval_append (no callers; only _static is used) - memory_dir_pub wrapper (just make memory_dir pub directly) API consolidation: - insert_node eliminated — callers use upsert_node (same behavior for new nodes, plus handles re-upsert gracefully) AnyView StoreView dispatch compressed to one line per method (also removes UFCS workaround that was needed when inherent node_weight shadowed the trait method). -69 lines net.	2026-03-03 12:38:52 -05:00
ProofOfConcept	fa7fe8c14b	query: rich QueryResult + toolkit cleanup QueryResult carries a fields map (BTreeMap<String, Value>) so callers don't re-resolve fields after queries run. Neighbors queries inject edge context (strength, rel_type) at construction time. New public API: - run_query(): parse + execute + format in one call - format_value(): format a Value for display - execute_parsed(): internal, avoids double-parse in run_query Removed: output_stages(), format_field() Simplified commands: - cmd_query, cmd_graph, cmd_link, cmd_list_keys all delegate to run_query - cmd_experience_mine uses existing find_current_transcript() Deduplication: - now_epoch() 3 copies → 1 (capnp_store's public fn) - hub_threshold → Graph::hub_threshold() method - eval_node + eval_edge → single eval() with closure for field resolution - compare() collapsed via Ordering (35 → 15 lines) Modernization: - 12 sites of partial_cmp().unwrap_or(Ordering::Equal) → total_cmp()	2026-03-03 12:07:04 -05:00
ProofOfConcept	64d2b441f0	cmd_graph, cmd_list_keys: use query language internally Dog-food the query engine for node-property filtering. cmd_link left unconverted — needs edge data in query results.	2026-03-03 11:38:11 -05:00
ProofOfConcept	18face7063	query: replace CLI flags with pipe syntax degree > 15 \| sort degree \| limit 10 \| select degree,category * \| sort weight asc \| limit 20 category = core \| count Output modifiers live in the grammar now, not in CLI flags. Also adds * wildcard for "all nodes" and string-aware sort fallback.	2026-03-03 11:05:28 -05:00
ProofOfConcept	a36449032c	query: peg-based query language for ad-hoc graph exploration poc-memory query "degree > 15" poc-memory query "key ~ 'journal.*' AND degree > 10" poc-memory query "neighbors('identity.md') WHERE strength > 0.5" poc-memory query "community_id = community('identity.md')" --fields degree,category Grammar-driven: the peg definition IS the language spec. Supports boolean logic (AND/OR/NOT), numeric and string comparison, regex match (~), graph traversal (neighbors() with WHERE), and function calls (community(), degree()). Output flags: --fields, --sort, --limit, --count. New dependency: peg 0.8 (~68KB, 2 tiny deps).	2026-03-03 10:55:30 -05:00
ProofOfConcept	71e6f15d82	spectral decomposition, search improvements, char boundary fix - New spectral module: Laplacian eigendecomposition of the memory graph. Commands: spectral, spectral-save, spectral-neighbors, spectral-positions, spectral-suggest. Spectral neighbors expand search results beyond keyword matching to structural proximity. - Search: use StoreView trait to avoid 6MB state.bin rewrite on every query. Append-only retrieval logging. Spectral expansion shows structurally nearby nodes after text results. - Fix panic in journal-tail: string truncation at byte 67 could land inside a multi-byte character (em dash). Now walks back to char boundary. - Replay queue: show classification and spectral outlier score. - Knowledge agents: extractor, challenger, connector prompts and runner scripts for automated graph enrichment. - memory-search hook: stale state file cleanup (24h expiry).	2026-03-03 01:33:31 -05:00
ProofOfConcept	94dbca6018	graph health: fix-categories, cap-degree, link-orphans Three new tools for structural graph health: - fix-categories: rule-based recategorization fixing core inflation (225 → 26 core nodes). Only identity.md and kent.md stay core; everything else reclassified to tech/obs/gen by file prefix rules. - cap-degree: two-phase degree capping. First prunes weakest Auto edges, then prunes Link edges to high-degree targets (they have alternative paths). Brought max degree from 919 → 50. - link-orphans: connects degree-0/1 nodes to most textually similar connected nodes via cosine similarity. Linked 614 orphans. Also: community detection now filters edges below strength 0.3, preventing weak auto-links from merging unrelated communities. Pipeline updated: consolidate-full now runs link-orphans + cap-degree instead of triangle-close (which was counterproductive — densified hub neighborhoods instead of building bridges). Net effect: Gini 0.754 → 0.546, max degree 919 → 50.	2026-03-01 08:18:07 -05:00
ProofOfConcept	6c7bfb9ec4	triangle-close: bulk lateral linking for clustering coefficient New command: `poc-memory triangle-close [MIN_DEG] [SIM] [MAX_PER_HUB]` For each node above min_degree, finds pairs of its neighbors that aren't directly connected and have text similarity above threshold. Links them. This turns hub-spoke patterns into triangles, directly improving clustering coefficient and schema fit. First run results (default params: deg≥5, sim≥0.3, max 10/hub): - 636 hubs processed, 5046 lateral links added - cc: 0.14 → 0.46 (target: high) - fit: 0.09 → 0.32 (target ≥0.2) - σ: 56.9 → 84.4 (small-world coefficient improved) Also fixes separator agent prompt: truncate interference pairs to batch count (was including all 1114 pairs = 1.3M chars).	2026-03-01 07:35:29 -05:00
ProofOfConcept	6bc11e5fb6	consolidate-full: autonomous consolidation pipeline New commands: - `digest auto`: detect and generate missing daily/weekly/monthly digests bottom-up. Validates date format to skip non-date journal keys. Skips today (incomplete) and current week/month. - `consolidate-full`: full autonomous pipeline: 1. Plan (metrics → agent allocation) 2. Execute agents (batched Sonnet calls, 5 nodes per batch) 3. Apply consolidation actions 4. Generate missing digests 5. Apply digest links Logs everything to agent-results/consolidate-full.log Fix: separator agent prompt was including all interference pairs (1114 pairs = 1.3M chars) instead of truncating to batch size. First successful run: 862s, 6/8 agents, +100 relations, 91 digest links applied.	2026-03-01 07:14:03 -05:00
ProofOfConcept	30d176d455	experience-mine: retroactive journaling from conversation transcripts Reads a conversation JSONL, identifies experiential moments that weren't captured in real-time journal entries, and writes them as journal nodes in the store. The agent writes in PoC's voice with emotion tags, focusing on intimate moments, shifts in understanding, and small pleasures — not clinical topic extraction. Conversation timestamps are now extracted and included in formatted output, enabling accurate temporal placement of mined entries. Also: extract_conversation now returns timestamps as a 4th tuple field.	2026-03-01 01:47:31 -05:00
ProofOfConcept	515f673251	journal-tail: add --full flag for complete entry display `poc-journal tail 5 --full` shows full entry content with timestamp headers and --- separators. Default mode remains title-only for scanning. Also passes all args through the poc-journal wrapper instead of just the count.	2026-03-01 01:43:02 -05:00
ProofOfConcept	6096acb312	journal-tail: show timestamps and extract meaningful titles Sort key normalization ensures consistent ordering across entries with different date formats (content dates vs key dates). Title extraction skips date-only lines, finds ## headers or falls back to first content line truncated at 70 chars. Also fixed: cargo bin had stale binary shadowing local bin install.	2026-03-01 01:41:37 -05:00
Kent Overstreet	7264bdc39c	link-audit: walk every link through Sonnet for quality review Batch all non-deleted links (~3,800) into char-budgeted groups, send each batch to Sonnet with full content of both endpoints, and apply KEEP/DELETE/RETARGET/WEAKEN/STRENGTHEN decisions. One-time cleanup for links created before refine_target existed. Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-01 00:48:44 -05:00
Kent Overstreet	4530837057	hub differentiation + refine_target for automatic section targeting Pattern separation for memory graph: when a file-level node (e.g. identity.md) has section children, redistribute its links to the best-matching section using cosine similarity. - differentiate_hub: analyze hub, propose link redistribution - refine_target: at link creation time, automatically target the most specific section instead of the file-level hub - Applied refine_target in all four link creation paths (digest links, journal enrichment, apply consolidation, link-add command) - Saturated hubs listed in agent topology header with "DO NOT LINK" This prevents hub formation proactively (refine_target) and remediates existing hubs (differentiate command). Co-Authored-By: ProofOfConcept <poc@bcachefs.org>	2026-03-01 00:33:46 -05:00
ProofOfConcept	59e2f39479	port digest-link-parser, journal-agent, apply-consolidation to Rust Three Python scripts (858 lines) replaced with native Rust subcommands: - digest-links [--apply]: parses ## Links sections from episodic digests, normalizes keys, applies to graph with section-level fallback - journal-enrich JSONL TEXT [LINE]: extracts conversation from JSONL transcript, calls Sonnet for link proposals and source location - apply-consolidation [--apply]: reads consolidation reports, sends to Sonnet for structured action extraction (links, categorizations, manual items) Shared infrastructure: call_sonnet now pub(crate), new parse_json_response helper for Sonnet output parsing with markdown fence stripping.	2026-03-01 00:10:03 -05:00
ProofOfConcept	91122fe1d1	digest: native Rust implementation replacing Python scripts Replace daily-digest.py, weekly-digest.py, monthly-digest.py with a single digest.rs module. All three digest types now: - Gather input directly from the Store (no subprocess calls) - Build prompts in Rust (same templates as the Python versions) - Call Sonnet via `claude -p --model sonnet` - Import results back into the store automatically - Extract links and save agent results 606 lines of Rust replaces 729 lines of Python + store_helpers.py overhead. More importantly: this is now callable as a library from poc-agent, and shares types/code with the rest of poc-memory. Also adds `digest monthly [YYYY-MM]` subcommand (was Python-only).	2026-02-28 23:58:05 -05:00
ProofOfConcept	0ea86b8d54	refactor: extract Store methods, clean up shell-outs - Add Store::upsert() — generic create-or-update, used by cmd_write - Add Store::insert_node() — for pre-constructed nodes (journal entries) - Add Store::delete_node() — soft-delete with version bump - Simplify cmd_write (20 → 8 lines), cmd_node_delete (16 → 7 lines), cmd_journal_write (removes manual append/insert/save boilerplate) - Replace generate_cookie shell-out to head/urandom with direct /dev/urandom read + const alphabet table main.rs: 1137 → 1109 lines.	2026-02-28 23:49:43 -05:00
ProofOfConcept	29d5ed47a1	clippy: fix all warnings across all binaries - &PathBuf → &Path in memory-search.rs signatures - Redundant field name in graph.rs struct init - Add truncate(false) to lock file open - Derive Default for Store instead of manual impl - slice::from_ref instead of &[x.clone()] - rsplit_once instead of split().last() - str::repeat instead of iter::repeat().take().collect() - is_none_or instead of map_or(true, ...) - strip_prefix instead of manual slicing Zero warnings on `cargo clippy`.	2026-02-28 23:47:11 -05:00
ProofOfConcept	7ee6f9c651	refactor: eliminate date shell-outs, move logic to Store methods - Replace all 5 `Command::new("date")` calls across 4 files with pure Rust time formatting via libc localtime_r - Add format_date/format_datetime/format_datetime_space helpers to capnp_store - Move import_file, find_journal_node, export_to_markdown, render_file, file_sections into Store methods where they belong - Fix find_current_transcript to search all project dirs instead of hardcoding bcachefs-tools path - Fix double-reference .clone() warnings in cmd_trace - Fix unused variable warning in neuro.rs main.rs: 1290 → 1137 lines, zero warnings.	2026-02-28 23:44:44 -05:00
ProofOfConcept	da10dfaeb2	add journal-write and journal-tail commands journal-write creates entries directly in the capnp store with auto-generated timestamped keys (journal.md#j-YYYY-MM-DDtHH-MM-slug), episodic session type, and source ref from current transcript. journal-tail sorts entries by date extracted from content headers, falling back to key-embedded dates, then node timestamp. poc-journal shell script now delegates to these commands instead of appending to journal.md. Journal entries are store-first.	2026-02-28 23:13:17 -05:00
ProofOfConcept	7b811125ca	add position field to nodes for stable section ordering Sections within a file have a natural order that matters — identity.md reads as a narrative, not an alphabetical index. The position field (u32) tracks section index within the file. Set during init and import from parse order. Export and load-context sort by position instead of key, preserving the author's intended structure.	2026-02-28 23:06:27 -05:00
ProofOfConcept	57cf61de44	add write, import, and export commands write KEY: upsert a single node from stdin. Creates new or updates existing with version bump. No-op if content unchanged. import FILE: parse markdown sections, diff against store, upsert changed/new nodes. Incremental — only touches what changed. export FILE\|--all: regenerate markdown from store nodes. Gathers file-level + section nodes, reconstitutes mem markers with links and causes from the relation graph. Together these close the bidirectional sync loop: markdown → import → store → export → markdown Also exposes memory_dir_pub() for use from main.rs.	2026-02-28 23:00:52 -05:00
ProofOfConcept	14b6457231	add load-context and render commands load-context replaces the shell hook's file-by-file cat approach. Queries the capnp store directly for all session-start context: orientation, identity, reflections, interests, inner life, people, active context, shared reference, technical, and recent journal. Sections are gathered per-file and output in priority order. Journal entries filtered to last 7 days by key-embedded date, capped at 20 most recent. render outputs a single node's content to stdout. The load-memory.sh hook now delegates entirely to `poc-memory load-context` — capnp store is the single source of truth for session startup context.	2026-02-28 22:53:39 -05:00
ProofOfConcept	2d6c8d5199	add node-delete command and redirect table for split files node-delete: soft-deletes a node by appending a deleted version to the capnp log, then removing it from the in-memory cache. resolve_redirect: when resolve_key can't find a node, checks a static redirect table for sections that moved during file splits (like the reflections.md → reflections-{reading,dreams,zoom}.md split). This handles immutable files (journal.md with chattr +a) that can't have their references updated.	2026-02-28 22:40:17 -05:00
ProofOfConcept	4b0bba7c56	replace state.json cache with bincode state.bin Faster serialization/deserialization, smaller on disk (4.2MB vs 5.9MB). Automatic migration from state.json on first load — reads the JSON, writes state.bin, deletes the old file. Added list-keys, list-edges, dump-json commands so Python scripts no longer need to parse the cache directly. Updated bulk-categorize.py and consolidation-loop.py to use the new CLI commands.	2026-02-28 22:30:03 -05:00
ProofOfConcept	23fac4e5fe	poc-memory v0.4.0: graph-structured memory with consolidation pipeline Rust core: - Cap'n Proto append-only storage (nodes + relations) - Graph algorithms: clustering coefficient, community detection, schema fit, small-world metrics, interference detection - BM25 text similarity with Porter stemming - Spaced repetition replay queue - Commands: search, init, health, status, graph, categorize, link-add, link-impact, decay, consolidate-session, etc. Python scripts: - Episodic digest pipeline: daily/weekly/monthly-digest.py - retroactive-digest.py for backfilling - consolidation-agents.py: 3 parallel Sonnet agents - apply-consolidation.py: structured action extraction + apply - digest-link-parser.py: extract ~400 explicit links from digests - content-promotion-agent.py: promote episodic obs to semantic files - bulk-categorize.py: categorize all nodes via single Sonnet call - consolidation-loop.py: multi-round automated consolidation Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>	2026-02-28 22:17:00 -05:00

35 commits