consciousness/poc-memory
ProofOfConcept 16777924d0 evaluate: switch to Elo ratings with skillratings crate
Replace sort-based ranking with proper Elo system:
- Each agent TYPE has a persistent Elo rating (agent-elo.json)
- Each matchup: pick two random types, grab a recent action from
  each, LLM compares, update ratings
- Ratings persist across daily evaluations — natural recency bias
  from continuous updates against current opponents
- K=32 for fast adaptation to prompt changes

Usage: poc-memory agent evaluate --matchups 30 --model haiku

Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>
2026-03-14 19:53:46 -04:00
..
.claude feedback: not-relevant/not-useful commands, edge strength adjustment 2026-03-14 12:14:17 -04:00
agents agent evaluate: sort agent actions by quality using Vec::sort_by with LLM 2026-03-14 19:24:07 -04:00
defaults split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00
schema provenance: convert from enum to freeform string 2026-03-11 01:19:52 -04:00
src evaluate: switch to Elo ratings with skillratings crate 2026-03-14 19:53:46 -04:00
build.rs split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00
Cargo.toml evaluate: switch to Elo ratings with skillratings crate 2026-03-14 19:53:46 -04:00
config.example.jsonl split into workspace: poc-memory and poc-daemon subcrates 2026-03-08 20:43:59 -04:00