Benchmarks

45 tasks accelerated, across 9 domains

Every result is measured against the upstream baseline on frozen inputs, with output concordance and peak memory verified. Pick a domain for methods, datasets, and per-run timings.