consciousness

History

Kent Overstreet 5e4067c04f Replace token counting with token generation via HuggingFace tokenizer Add agent/tokenizer.rs with global Qwen 3.5 tokenizer that generates actual token IDs including chat template wrapping. ContextEntry now stores token_ids: Vec<u32> instead of tokens: usize — the count is derived from the length. ContextEntry::new() tokenizes automatically via the global tokenizer. ContextSection::push_entry() takes a raw ConversationEntry and tokenizes it. set_message() re-tokenizes without needing an external tokenizer parameter. Token IDs include the full chat template: <\|im_start\|>role\ncontent <\|im_end\|>\n — so concatenating token_ids across entries produces a ready-to-send prompt for vLLM's /v1/completions endpoint. The old tiktoken CoreBPE is now unused on Agent (will be removed in a followup). Token counts are now exact for Qwen 3.5 instead of the ~85-90% approximation from cl100k_base. Co-Authored-By: Proof of Concept <poc@bcachefs.org>		2026-04-08 11:20:03 -04:00
..
agent	Replace token counting with token generation via HuggingFace tokenizer	2026-04-08 11:20:03 -04:00
bin	split out src/mind	2026-04-04 02:46:32 -04:00
claude	Fix stale pid reaper: check /proc/pid/cmdline to detect PID reuse	2026-04-08 09:18:21 -04:00
cli	Kill log callback — use ConversationEntry::Log for debug traces	2026-04-07 01:23:22 -04:00
hippocampus	Subconscious: persistent agent state, store activity queries	2026-04-07 19:03:05 -04:00
learn	rust edition 2024	2026-04-05 06:20:16 -04:00
mind	Fix input blocked during scoring: release agent lock before disk write	2026-04-07 22:32:10 -04:00
subconscious	Replace token counting with token generation via HuggingFace tokenizer	2026-04-08 11:20:03 -04:00
thalamus	Upgrade capnp 0.20 → 0.25, capnp-rpc 0.20 → 0.25	2026-04-07 12:29:44 -04:00
user	Replace token counting with token generation via HuggingFace tokenizer	2026-04-08 11:20:03 -04:00
config.rs	training: per-node scoring with graph weight updates	2026-04-05 01:18:47 -04:00
lib.rs	user: InteractScreen extracted, all screens use ScreenView trait	2026-04-05 18:57:54 -04:00
main.rs	Replace token counting with token generation via HuggingFace tokenizer	2026-04-08 11:20:03 -04:00
session.rs	move Claude Code-specific code from thalamus/ to claude/	2026-04-03 19:26:24 -04:00
util.rs	delete 20 dead public functions across 12 files	2026-04-02 16:21:01 -04:00