MLB Market Backtest v3

10,148 games 766,028 state observations 2022-04-07 → 2026-04-27 10,047 with pregame odds

What does all this mean? (read me first)

This tool answers two different questions. Pick the right mode at the top.

Per-game mode: One row per game. Use for pregame questions like "when the favorite is -200 to -250, how often do they win?"
Per-state mode: One row per plate appearance moment. Use for live-betting questions like "down 3 entering top 8 with 0 outs, how often does the away team hold on?"
States matched: Count of plate-appearance moments matching your filter. The same game can contribute multiple matching states (e.g., 3 batters hitting with 0 outs = 3 states). Per-state mode counts each separately.
Unique games: Distinct games among the matched states. The dataset has 766,028 total state observations across 10,148 games — so each game contributes roughly 75 plate-appearance states.
With pregame odds: Of those games, how many have the pregame moneyline / spread / total attached. Pregame-odds filters and the Favorite vs Underdog market only use these. Games before 2020-06-06 or that didn't match a snapshot won't have them.
Hit rate: Hits ÷ N. The fraction of matching observations where this outcome happened. Note: in per-state mode, the denominator is states (not games), which weights long-lingering states more heavily. For clean game-level rates, use per-game mode.
95% CI (Wilson): The plausible range for the true rate, given your sample size. Wider = less certainty (usually because N is small). If two markets' intervals don't overlap, the difference is probably real; if they do, you can't be confident yet.
Devig probability: The bookmakers' implied win probability with their margin ("vig") removed. Roughly the market's true estimate of the win probability if the books priced fairly.

Quick scenarios: