k-eval Viewer

Load a k-eval results file to compare how different agent configurations performed across a benchmark dataset.

Drop a .detailed.jsonl file anywhere on this page, or click the button below.

Accepts .jsonl or .json — produced by k-eval run

{{ loadMsg }}