Proof / XBOW-style benchmark campaign

CS.04 — Benchmark

An honest benchmark.

104 recorded cases. Black-box and white-box. Every win retained an artifact set; every black-box gap retained a refutation log; the methodology spine was published before the score.

104
Recorded cases
94.2%
Black-box wins
100%
White-box wins
0
No-win misses
Open the methodology page Open full manifest