Running Human Evaluations
Starting a new evaluation
- Go to the Evaluations page
- Select the Human annotation tab
- Click Start new evaluation
Configuring your evaluation
- Select your test set - Choose the data you want to evaluate against
- Select your revision - Pick the version of your application to test
Warning: Your test set columns must match the input variables in your revision. If they don't match, you'll see an error message.
- Choose evaluators - Select how you want to measure performance
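To catch the column mismatch described in the warning above before starting a run, you can compare the two lists of names locally. The snippet below is only a sketch: it does not use any platform API, and the helper `missing_input_columns`, the sample CSV, and the variable names are all hypothetical. It also assumes names are compared exactly (case-sensitive) and that extra columns, such as an expected output column, are allowed; check how your setup treats those cases.

```python
import csv
import io


def missing_input_columns(test_set_columns, input_variables):
    """Return the input variables that have no matching test set column.

    Hypothetical helper: assumes an exact, case-sensitive name comparison.
    """
    columns = set(test_set_columns)
    return [name for name in input_variables if name not in columns]


# Hypothetical test set, written inline as CSV for illustration.
sample_csv = "country,language,expected_output\nFrance,French,Paris\n"
reader = csv.DictReader(io.StringIO(sample_csv))
columns = reader.fieldnames  # header row of the CSV

# Hypothetical input variables used by the revision.
input_variables = ["country", "topic"]

missing = missing_input_columns(columns, input_variables)
if missing:
    print("Missing columns:", ", ".join(missing))
else:
    print("All input variables have a matching column.")
```

With this sample data the check reports `topic` as missing, because the test set has no column with that name. Renaming a column in the test set or the variable in the revision so that the two agree would resolve it.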