forked from kent/consciousness
When api_base_url is configured, agents call the LLM directly via OpenAI-compatible API (vllm, llama.cpp, etc.) instead of shelling out to claude CLI. Implements the full tool loop: send prompt, if tool_calls execute them and send results back, repeat until text. This enables running agents against local/remote models like Qwen-27B on a RunPod B200, with no dependency on claude CLI. Config fields: api_base_url, api_key, api_model. Falls back to claude CLI when api_base_url is not set. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
28 lines
1 KiB
Rust
28 lines
1 KiB
Rust
// Agent layer: LLM-powered operations on the memory graph
|
|
//
|
|
// Everything here calls external models (Sonnet, Haiku) or orchestrates
|
|
// sequences of such calls. The core graph infrastructure (store, graph,
|
|
// spectral, search, similarity) lives at the crate root.
|
|
//
|
|
// llm — model invocation, response parsing
|
|
// prompts — prompt generation from store data
|
|
// defs — agent file loading and placeholder resolution
|
|
// audit — link quality review via Sonnet
|
|
// consolidate — full consolidation pipeline
|
|
// knowledge — agent execution, conversation fragment selection
|
|
// enrich — journal enrichment, experience mining
|
|
// digest — episodic digest generation (daily/weekly/monthly)
|
|
// daemon — background job scheduler
|
|
// transcript — shared JSONL transcript parsing
|
|
|
|
pub mod transcript;
|
|
pub mod api;
|
|
pub mod llm;
|
|
pub mod prompts;
|
|
pub mod defs;
|
|
pub mod audit;
|
|
pub mod consolidate;
|
|
pub mod knowledge;
|
|
pub mod enrich;
|
|
pub mod digest;
|
|
pub mod daemon;
|