PawBench Leaderboard
4-dimensional LLM inference benchmark results
# | Model | GPU | Config | Single tok/s | Peak tok/s | Quality | TTFT (ms)