evaluate: ask for reasoning in comparisons
Chain-of-thought: "say which is better and why" forces clearer judgment and gives us analysis data for improving agents. Co-Authored-By: Kent Overstreet <kent.overstreet@linux.dev>
This commit is contained in:
parent
39e3d69e3c
commit
415180eeab
1 changed files with 4 additions and 2 deletions
|
|
@ -276,7 +276,8 @@ fn llm_compare(
|
|||
{}\n\n\
|
||||
## Action A\n## Report output{}\n\n\
|
||||
## Action B\n## Report output{}\n\n\
|
||||
Reply with ONLY: BETTER: A or BETTER: B or BETTER: TIE",
|
||||
Say which is better and why in 1-2 sentences, then end with:\n\
|
||||
BETTER: A or BETTER: B or BETTER: TIE",
|
||||
a.0, shared_prompt, report_a, report_b
|
||||
)
|
||||
} else {
|
||||
|
|
@ -285,7 +286,8 @@ fn llm_compare(
|
|||
for building a useful, well-organized knowledge graph?\n\n\
|
||||
## Action A ({} agent)\n{}\n\n\
|
||||
## Action B ({} agent)\n{}\n\n\
|
||||
Reply with ONLY: BETTER: A or BETTER: B or BETTER: TIE",
|
||||
Say which is better and why in 1-2 sentences, then end with:\n\
|
||||
BETTER: A or BETTER: B or BETTER: TIE",
|
||||
a.0, a.2, b.0, b.2
|
||||
)
|
||||
};
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue