1 out of 16,233 odds for Pats Fumbles is BS

The Problem

First, go read this article by Warren Sharp: The New England Patriots Prevention of Fumbles is Nearly Impossible. Sharp looks at the number of fumbles / total plays the Patriots had in 2010-14 and fits it to a Normal Distribution. He then argues that this is a statistical improbability (or perhaps it's the data scientist at nflproject.com) that does this.

For those not aware or reading this years from now - this is in regard to an event called Deflategate or as I know it "ballghazi". I'll let the reader decide which name is best.

Final note: I have no skin in this game - I am a lions fan / sufferer. Just a statistics nerd.

I claim the following: Fitting this data to a normal distribution is non-sense.

The Data

Download my data sets below, if you want to play around with it:

2010-2014 (Team, Season) Fumble Data

You should be able to roll that up to the aggregates.

If you find any issues, email me.

Normal Distribution is Wrong

Let's recall their claim: "Based on the assumption that fumbles per play follow a normal distribution, you’d expect to see, according to random fluctuation, the results that the Patriots have gotten over this period, once in 16,233.77 instances”

We can take a look at their data, which is all NFL teams 2010-2014 total number of lost fumbles divided by total number of plays. We can replicate Sharp's data with a plot that matches his and initially makes NE look really bad

Lost Fumbles

And a data table:

Team	Games 2010-14	Total Offensive Plays	Total LOST fumbles	Fumbles / Plays
NE	80	5470.	30.	182.333
NO	80	5425.	39.	139.103
HOU	80	5316.	39.	136.308
ATL	80	5250.	39.	134.615
GB	80	5105.	39.	130.897
BAL	80	5207.	41.	127.
CLE	80	5037.	41.	122.854
JAC	80	5023.	43.	116.814
SF	80	4877.	43.	113.419
SEA	80	4935.	45.	109.667
MIN	80	4984.	46.	108.348
CHI	80	4931.	47.	104.915
STL	80	4993.	48.	104.021
IND	80	5277.	51.	103.471
SD	80	5144.	50.	102.88
CAR	80	5008.	49.	102.204
CIN	80	5192.	51.	101.804
OAK	80	5081.	51.	99.627
DET	80	5429.	55.	98.709
DAL	80	5072.	52.	97.538
ARI	80	4972.	52.	95.615
MIA	80	5052.	53.	95.321
KC	80	5091.	54.	94.278
NYJ	80	5223.	57.	91.632
TEN	80	4799.	53.	90.547
WAS	80	5141.	57.	90.193
PIT	80	5122.	59.	86.814
NYG	80	5105.	60.	85.083
TB	80	4847.	57.	85.035
BUF	80	5065.	62.	81.694
DEN	80	5348.	67.	79.821
PHI	80	5335.	72.	74.097

So this looks bad. Let's checkout the histogram of plays per lost fumble, Sharp's metric.

Distribution of PPLF

You can guess which bar is the Patriots. So at this point, their big claim (and the one that was repeated everywhere on blogs, etc):

"Based on the assumption that fumbles per play follow a normal distribution, you’d expect to see, according to random fluctuation, the results that the Patriots have gotten over this period, once in 16,233.77 instances”

Cool. So let's fit this to a normal distribution by taking the mean (103.365) and the standard deviation of this data set (17.2877) without the Patriots. Then, we can say, what the total density less than or equal to the Patriots value of 182.33:

Mean[Drop[fppWithLost[[All, 5]], 1]]
StandardDeviation[Drop[fppWithLost[[All, 5]], 1]]
AccountingForm[1 - CDF[NormalDistribution[103.365, 17.2877], 182.33]]

103.365
17.2877
0.00000246554

Apologies for the ugly Mathematica code.

We'll get 0.00000246554. Not an exact match but apprxomiately close to Sharps. Does this mean that the Patriots are at an impossible level? Well, let's eyeball that histogram with this distribution overlaid:

Distribution of PPLF

Huh. Doesn't really look like a great fit. Maybe it's the bin size we're using on the histogram?

Oh. Well, we can do a test of normality Kolmogorov Smirnov:

DistributionFitTest[fppWithLost[[All, 5]], Automatic, 
  "HypothesisTestData"]["TestDataTable", "KolmogorovSmirnov"]

Statistic P-Value
Kolmogorov-Smirnov  0.173157  0.0163527

p-value is pretty small, looks like we can reject the null hypothesis (that the hypothesized normal and the observed values are drawn from the same distribution). Looks like we can't use normal and I'd reject the assumption that fumbles per play follow a normal distribution.

Rant Even if you don't trust the above frequentist test - using a normal distribution here should give you some concerns.

It makes a lot of assumptions about tails - especially with values less than zero.
31 (32 - NE) samples doesn't strike me as enough to really build up a comprehensive distribution.

In fact, I tried a few other distributions (Poisson, in particular) and nothing is a great fit. However, this is a great case for bootstrapping. In another post I'm writing now, I'm going to claim the odds of the Patriots plays/fumble number is closer to 10% - unspectacular and not a convincing case for cheating.