k-eval Viewer
Load a k-eval results file to compare how different agent configurations performed across a benchmark dataset.
Drop a .detailed.jsonl file anywhere on this page, or click the button below.
Accepts .jsonl or .json — produced by k-eval run
{{ loadErr }} Check that the file was produced by k-eval run and is not empty.
{{ loadMsg }}