95
Benchmark · Coding
95.7
.
AIME score — a new state of the art for open coding models.
95.7
AIME
73.8
SWE-bench
87.4
τ²-Bench
Swiss Grid · Pentagram