Every finished match: the model's earliest prediction vs what actually happened. Heads-up: hit rate flatters any model that picks favourites β the calibration curve is the rigorous test. Per-match Brier shown (0 = perfect, 0.33 β guessing).