Tag
#evaluation
What to Keep When Evaluating AI
A note on recording criteria, failure modes, and human review boundaries instead of collecting only good AI outputs.