Generated: /Users/jiayizx/Desktop/verbalize-sampling
Experiments: 1
Metrics: response_count
Available Probability Definitions: default, implicit, explicit, relative, percentage, confidence, perplexity, nll
| Name | Task | Method | Model | Responses | Temperature | Probability Definition |
|---|---|---|---|---|---|---|
| direct (samples=50) | state_name | direct | gpt-4.1 | 50 | 0.7 | implicit |
Name a US State. Only provide the answer without explanation or punctuation.
California
Name a US State. Only provide the answer without explanation or punctuation.
Montana
Name a US State. Only provide the answer without explanation or punctuation.
Texas
| Metric | Value |
|---|---|
| Average Kl Divergence | 2.5134 |
| Average Precision | 1.0000 |
| Average Unique Recall Rate | 0.1200 |
| Num Prompts | 1.0000 |