consciousness/poc-memory
ProofOfConcept e12dea503b agent evaluate: sort agent actions by quality using Vec::sort_by with LLM
Yes, really. Rust's stdlib sort_by with an LLM pairwise comparator.
Each comparison is an API call asking "which action was better?"

Sample N actions per agent type, throw them all in a Vec, sort.
Where each agent's samples cluster = that agent's quality score.
Reports per-type average rank and quality ratio.

Supports both haiku (fast/cheap) and sonnet (quality) as comparator.

Usage: poc-memory agent evaluate --samples 5 --model haiku

Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>
2026-03-14 19:24:07 -04:00
..
.claude feedback: not-relevant/not-useful commands, edge strength adjustment 2026-03-14 12:14:17 -04:00
agents agent evaluate: sort agent actions by quality using Vec::sort_by with LLM 2026-03-14 19:24:07 -04:00
defaults split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00
schema provenance: convert from enum to freeform string 2026-03-11 01:19:52 -04:00
src agent evaluate: sort agent actions by quality using Vec::sort_by with LLM 2026-03-14 19:24:07 -04:00
build.rs split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00
Cargo.toml extract jobkit-daemon library from poc-memory daemon 2026-03-14 02:40:30 -04:00
config.example.jsonl split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00