I plotted the Grok benchmarks on a single, less confusing graph