Instead of learning the teams and the players, I have explored the statistics of the past 24 years (data can be found here) (Last years picks, Round of 64 and 32 upsets, Final Four and Championship probabilities) combined with other's specialized knowledge like the Sagarin ratings. This year I am updating some of my charts to include data from 2006, 2007 and 2008.
The pool I enter favors upsets. The points for each round are the round multiplier times the seed of the winning team that you picked. Thus if a 10 seed wins Round 2 and you pick it you get 2*10 for points. To win this type of pool it is imperative that you pick upsets. Game results for 24 years of Round 1 are shown graphically below.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjjMEaQk18Sv1mHb91eecKjI6aXheENqLkklYOvKBU8Nia1Z_OOBVHVvScPb_EJn2LR-Ui54GiAwMU7-tmUIm0FRIbT24SAcg76h0p-I378ta5dNodyuAgvr1mj3ElrpYloG0Fn/s280/March+Madness+Round+of+64+outcome.bmp)
- No 16 seed team has ever won in the first round. Don't be the first to pick one.
- 15 seeds are also very safe and normally win their games.
- History shows that there will be at least one, and in some years two upsets favoring a 10, 11, 12 seed.
- One could make the case for one upset a year favoring a 13 and 14 seed as well.
- 9 seeds win against 8 seeds more than half of the time. Pick two upsets.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgmnzYFDWJqxc5xenf6HDYmMj5M3DPktLO0ui8le7e-1liInn9rsffiV5syqVIvtcfDUzukbEohE2nCBI8uqxgyQgAzA6fuo6PqRcDvN3xkVlIkNRWdp_PeYsUgothI5t8_38eU/s280/March+Madness+Round+of+32+outcome.bmp)
Some lessons from the Round of 32 chart:
- 1 seed teams typically win in this round as well, rarely being beaten by 8 or 9 seeds.
- Matchups with 5 vs 4 seeds, 6 vs 3 seeds and even 10 vs 2 seeds and 12 vs 4 seeds (surprisingly) seem to be toss-ups over the 24 years of data. Almost half the time there is an upset and the lower seed wins. If you have them in your bracket pick the correct underdog half of the time.
- Matchups with 7 vs 2 seeds do result in upsets about a third of the time. Look for opportunities to pick one.
Below is a matchup outcome chart for the Sweet Sixteen round which is similar to the earlier charts, but much more complicated.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhkzaHmdlTzAk-FOnBaETQe_1EikANEUpJVL5xcKGDnBGyFbZch4305ZYNNtqE433kimfQ8LLD0C-x98NbqTTSi3KqLOPMilEc7IP9AzXZ8EGcP2C_rD8WHJmkKDaIuEbD8clXw/s280/March+Madness+Round+of+16+outcome.bmp)
Some lessons gleaned from the Round of 16 outcome chart:
- 1 seeds usually win. They always beat 12 seeds that make it through.
- The closer the distance between seeds the more the outcome is a tossup. This is true for all of these charts.
- In the three times that 11 seeds have made it to this round they have beaten the 7 seed they played. Whether that is statistically significant or not is the question.
Sweet Sixteen frequency chart below
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEic8g1qGmxCfs358vf4lReEFN_Uhvs-3b9h-J2MCQG3_xOqYpmTHReHDYG8JWLGPQAFZBg3kJb4vm3208l7mz4Ia8vZtHI1DaVTxtpiKw6N2Koa0owpb2ml_Be8RvsynR7vAhCZ/s280/March+Madness+Sweet+Sixteen+freq.bmp)
Elite Eight seed frequency chart below.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgaPPJq5DrIYJ4SGG1RAhlm3CYjoX9RA7Vuen2QoYUYYwB3Yh_rQnN2i6MAbkhRPAkfz_hsadQAoFM-cLJPeA2z2LdC7i8dJbQZbo5D9YJeUpAIZx8fMeIt3FobV_7BydNnNTlc/s280/March+Madness+Elite+Eight+freq.bmp)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj1n6PCEZg6PY63_4IICIOc22IHEJcnXX7ymyCJ3K347U8rIuw_wFEhTw3AaUcU05nK4GwpiU9Y9CcrcrGiQCX7Qk5undJE8Y6GaJEs2Y9_0Resvv1EAVK-I6KPX2sTmwx34Pzg/s280/March+Madness+Final+Four+freq.bmp)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhTDmK1EJfoaEnvVdrbItI2QHeTvbo1sQ9NAfLOx_lk9HNDia-AbqntHycnpt67Gu5Rjfkl0HXXAWE0OG9r3kPLXQ7hDk1addhOyyLdGVy7-uTjCM19Xs-DmPSTVUCA0DRQutzX/s280/March+Madness+Championship+game+freq.bmp)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgB_oZUnYNhvp1gTAK8O-SSjjp7H-3xpaAmi4I5b1-kBiLSDb_F4_MV1QlvO5UHETyTn-z6BDRD-fRYUuuiplrb70RuI_n8xq_laQZ42SIuzayqDWtriwO7w0mPm813afUvbtro/s280/March+Madness+Champion+winner+freq.bmp)
- Every other year or so a 5, 6, 8, 10, 11 seed makes it to the Elite Eight.
- One 11 seed, three 8 seeds, three 6 seeds and four 5 seeds have appeared in the Final Four in 96 opportunities over 24 years, choose these upsets sparingly, but if you get them right you might just win the pool.
- In the Championship game, one 8 seeds and two of 4, 5, and 6 seeds have made it that far. use sparingly.
- No team lower seeded than 8 has won the whole Championship. A 6, 8 and 4 seed have won it once each. The Final winner has been a 1 seed more than half of the time.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgNbC1ZQjnkB5Jiq13ipMZ5XClceCnOCPCee2zVP-HCys6DT9Pf_OEDQWRG83vO7ighTZmG0u6wahv0W4ZjWNA3xoiEMhc2W2xO1xSg0g_9ryWhjGo9LrYNxEgNPFt-2mEOntZX/s280/March+Madness+potential+points+by+round+1985+to+2008+data.bmp)
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgCGoGpqUaPcc0easfoAEVtWbWzq4Zm5CzZu87UAlyKWXCAIMXtaloaR8CSq8RGLmjU9BWebcC7IpKHyHhwztv8ohImml3HpFizlGSF4acsOPswCkOIjZndRDPvdK8khgsAtXyi/s280/March+madness+potential+points+histogram+1985+-+2008+data.bmp)
After all of this discussion of picking upsets and examination of the data indicates that upsets happen and are the key to winning the pool, but which upsets and where. This is where we resort to the expertise of others. I use the Sagarin ratings (click on 2008-09 NCAA men's ratings by team) which are essentially a least squares ranking of all of the teams, based on all of the games that a team has played in the year. He suggests using the Predictor ranking to predict the outcome of a game rather than the ranking itself. Every year I match the teams to their rankings, the rankings represent the number of points a team is expected to score in a game so the difference of these rankings is the difference in the game. Since there is some error in the rankings I choose a value below which I will pick the lower seated team to win (picking upsets) and generate my bracket.
This year I automated the process in Excel. If a Predictor difference fell below the chosen factor I set the lower ranked team as the winner. Only for the final four does the model let the best team (higher Predictor score) win regardless of seed. A plot of the resulting expected points versus this factor shows some interesting cutoffs. Realize also that the home advantage for the Sagarin ratings this year is 3.79, almost two baskets. So the factors listed below are not out of the question. Always assuming that they fault to the upset is unreasonable, but called for to maximize point possibilities for this particular bracket.
![](https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh97qbb62dCnPlcJgbYLVrzaIwEIIJtEw_APiNW65zCIJRT33upb2J5Bg8fdWk7ahBSNOI11_ippGJjteMJmJ-0PtKtd5Roe7YapKVOjCLSVCO4_mhqnGYiFi2JJOMEaexy0hD2/s280/March+madness+potential+points+as+a+function+of+spread.bmp)
All that being said, be aware that on any day, any given team can beat any other, thus the format of March Madness is given to upsets and surprises and picking a bracket is still as much luck as skill. These models are an attempt to quantify this uncertainty and use it to drive bracket picks that will take advantage of luck, upsets and surprises when they occur.
Can we see your bracket?
ReplyDeleteThis comment has been removed by a blog administrator.
ReplyDeleteSorry no gratuitous bad language on the blog. I imagine that the second anonymous commenter was expressing disbelief at the amount of work that went into the post and thus I will take the comment as a compliment.
ReplyDeleteThis post is one of the reasons I love my brother so much...this just proves that being smart is COOL! And I am completely serious.
ReplyDeleteKeep up the stats, they really are fun. Granted, being of complete average intellegence, it's still enjoyable for me...even if I do have to re-read some paragraphs (hee, hee)