So far, we’ve shown basic upset bonuses leave a bit to be desired, but there might be a way to optimize them with the right tweaking.
Now is a good time to answer the question: what does the optimal March Madness scoring system look like? What do we want it to do?
This is a subjective question, but here are the things we’re looking for:
-
- Picking no upsets should not be the only optimal approach for both early and late rounds.
- Certain upsets should not be so incentivized that picking all of them gives you a clear advantage. Better teams retaining higher expected value is better because it’s more intuitive and doesn’t lead to people gaming the system.
- 8–9 first round matchups are an unavoidable exception to this rule.
- It should be a simple and explainable system. To be simple and explainable:
- Upset bonuses should be integers and either a simple flat bonus or a multiple of seed differences. Flat bonus is preferred for simplicity.
- This multiple can change from round to round, but it must be the same within a round. For example, we can’t use a different multiple for the 5–12 and 1–16 matchups. It’s better if it follows a defined pattern from one round to the next.
- It must be possible to implement this system on Yahoo’s March Madness site.
Since flat bonuses and seed difference bonuses are practically similar for the first round, we decide to use flat bonuses for simplicity. Another benefit for the flat bonus is that upsets are more likely to have smaller seed differences in later rounds, where No. 1 seeds are comparatively dominant.
Figure 1 shows how often each seed reaches each round. No. 1 seeds are twice as likely to reach the Final Four as any other seed, 3 times as likely to reach the national championship, and 5 times as likely to win the national championship. Some hefty bonuses are needed to prop No. 2 and 3 seeds up to similar expected values.

Figure 1: How often each team reaches each round historically
We can then repeat this process for every subsequent round. This brings an inherent challenge though. Let’s say you want to calculate the expected value of picking a No. 5 seed to reach the Sweet 16. The upset bonus will always be rewarded for winning the first-round game, but in the second round the No. 5 seed could either be facing a No. 4 or 13 seed. Beating the No. 4 seed would reward an upset bonus but beating the No. 13 seed would not.
To solve this problem, I first estimate the likelihood of facing a better-seeded team by averaging across all seeds each team could face in each round. In this example, there’s a 78.9% chance the No. 4 seed wins in the first round, so 78.9% is the likelihood of a No. 5 seed facing a higher seed in the second round.
To estimate the probability each team has of reaching each round, I use the historical probabilities shown in Figure 1.2