Building a Better Box Score

What is the most common line present in any analytics-based article/post/discussion of the game of football? My answer is some version of “Analytics in football are more difficult/impossible/can’t be done because of how interactive and interdependent the members of a football team are compared to other sports.” Sometimes that basic line is flavored differently, depending on the particular tone of the piece, but that line always seems to be there. It’s a seemingly necessary caveat of the genre. “Of course we can’t know everything about Quarterback X because so much of being a quarterback depends on [insert whatever point one want to make about interactivity on football teams].”

Why do we simply talk about that problem? Why do we constantly talk about the interdependence problem but never fix it? Can it actually be fixed and what would such a solution look like?

The Football Box score – A Model by Ralph Wiggum

To start, I’d like to spend some time developing the idea of why dealing with the interdependence problem is so difficult in football. After all, other industries deal with a similar problem. The entire job of management is interactive and interdependent by nature, we can still figure out who the good managers and the poor managers are. Why do we have such a difficult time in football? I argue it’s because of the modern football box sore, a ubiquitous, pervasive summary of the events on a football field that horribly misrepresents the realities of the game.

A good box score acts like a description of the events of a game, the same way that the model of the solar system you built in middle school acts as a basic description of how the solar system works, how it is set up, and how each component generally relates to the others. Your solar system model was also simplified. You didn’t put all of Jupiter’s 67 moons in your model. You probably didn’t put in the asteroid belt or the daily spin of each planet to an accurate degree (kudos to you if you did), and that’s okay. The purpose of any model is to get the major elements of the system correct while simplifying or eliminating the less important elements. And this is where the box score of a football game falls down dramatically. If a football box score were a model of the solar system, it would be a model created by Ralph Wiggum. This is true to some extent of the entire box score, but I’m going to focus mostly on the passing statistics element as it is the worst offender.

The passing yards box score for Florida State in their blowout loss to Oregon in the first round of the college football playoffs looks something like this.

Passing Yards

Jameis Winston, 29/45, 348 yards, 1 TD, 1 INT

Sean Maguire, 0/3, 0 yards, 0 TD, 0 INT

Receiving Yards

Travis Rudolph, 6 rec., 96 yards, 1 TD

Jesus Wilson, 5 rec., 72 yards, 0 TD

Karlos Williams, 5 rec., 59 yards, 0 TD

Rashad Greene, 6 rec., 69 yards, 0 TD

Dalvin Cook, 3 rec., 24 yards, 0 TD

Ermon Lane, 2 rec., 22 yards, 0 TD

Freddie Stevenson, 1 rec., 12 yards, 0 TD


There are lots of problems with how this data is presented, but let’s focus on two.

Problem 1 – Data Redundancy

The problem that upsets me the most about a football box score is needless redundancy. Every single yard gained by a forward pass is counted twice – once for the quarterback and once for the receiver. This is a problem, a big problem, because it means the data presented here do not reflect what actually happens in a football game. You don’t complete a forward pass and then immediately mark off the gained yardage again. However, in a football box score, because we call the events different things – completions vs. receptions and passing vs. receiving yards – suddenly it becomes okay to double count every single event in the passing game except attempts and interceptions. But they are the exact same event. Jameis Winston’s completion is Rashad Greene’s reception. One cannot happen without the other. Winston earns passing yards and Greene earns receiving yards on the exact same yards gained. We’re not modeling what actually happened in the game. We’re modeling a way to give credit in the most individualized way possible. However, as every single football analytics article will tell you, football is an interactive game. If the game is, in reality, interactive, why do we assign credit for the events in this individualized manner?

Problem #2 – Loss of Information

I think it’s rather ironic that a football box score has so much information redundancy, but it explicitly removes an important piece of information that would allow us to complete some very important analyses on team performance in football.

As an example of the loss of information, let’s look at a different example, this time from Oregon’s Week 1 win over the University of South Dakota. Early season games are useful for this example because it is very likely that the backup of the “high power” football program will spend a great deal of time in the game.

Passing Yards

Marcus Mariota, 14/20, 267 yards, 3 TDs, 0 INTs

Jeff Lockie, 11/12, 113 yards, 1 TD, 0 INTs

Receiving Yards

Byron Marshall, 8 rec., 138 yards, 2 TDs

Darren Carrington, 4 rec., 68 yards, 0 TDs

Dwayne Stanford, 1 rec., 62 yards, 1 TD

Pharaoh Brown, 2 rec., 32 yards, 1 TD

Johnny Mundt, 2 rec., 29 yards, 0 TDs

Keanon Lowe, 1 rec., 18 yards, 0 TDs

Royce Freeman, 1 rec., 11 yards, 0 TDs

Thomas Tyner, 3 rec., 8 yards, 0 TDs

Charles Nelson, 1 rec., 8 yards, 0 TDs

Devon Allen, 1 rec., 5 yards, 0 TDs

Johnathan Loyd, 1 rec., 1 yards, 0 TDs


Here we have two quarterbacks that completed a similar number of passes over the course of the game, but Marcus Mariota gained more than double the passing yards compared to Jeff Lockie. How was that accomplished? Which receivers gained all those yards for Marcus Mariota? Who caught those passes from Mariota to gain so many passing yards for him? Was it because Dwayne Stanford caught one long pass for a touchdown from Mariota? Or did Jeff Lockie complete that pass and only got 51 yards from the other 10 passes he completed? Who was the intended receiver on the six attempts that Mariota did not complete? How many times was any receiver an intended receiver, but did not complete the catch? Did Mariota target the same receiver over and over with no results? Or are his unsuccessful attempts scattered all over the place? We don’t know the answer to any of these questions from the box score description of this particular game. Answering these questions is of critical importance to a better understanding of the game of football.

Our first step to scientifically understanding football is to build a better box score. The question is, what would we want the box score to represent?

Begin with a Model

Before we can build a tool to aggregate data, we need to have a decent idea of what data we want and why we want it. We need to start with a theory of how a football offense works. This is our “model of the solar system” as it were. What things in a football offense are important and what things should we avoid for right now? Here is a figure of my current theory of a football offense.

Basic Model of a Football Passing Offense



This very basic theory says that a football offense begins with the offensive play caller. The play call made then filters down to whichever quarterback is currently in the game, and the ability of the quarterback then filters down to the receivers. Note that we could also include pass catching tight ends and running backs in this model.

Now, I’ve simplified the passing offense to a great extent in this model. Most notably, I’ve removed the impact of the offensive line here. That is by design, but not because I think that the offensive line is unimportant. Instead, I think the offensive line is like adding in the daily spin of the planets to your solar system model. Adding it in becomes incredibly complex and will probably take some specialized data that not is not publically available. For right now, we will keep the offensive line out of our model. Anything else that’s left out of this model we will consider as having such a limited effect that it won’t change our understanding to a large enough extent that we need to account for it.

A Better Box Score

Now that we have our model, we can adjust our box score so that it reflects the important elements inherent in our model. The most important element of the theory says that we must count passes as a single event and account for the coach that called the play, the quarterback that threw the pass, and the receiver that caught it. Such a thing isn’t terribly difficult to create. You can see an example of one below. In this example, I am allowing team to stand in as a proxy for the play caller.



Now all we need is a way to analyze data that fits our theoretical understanding and the data that our better box score is collecting.  Fortunately, such a way exists. Next week.

SeaWorld and the NFL

The orcas at SeaWorld are getting a new habitat. The new habitat will cost SeaWorld hundreds of millions of dollars and basically double the size of the habitat the orcas currently have.

Several events have conspired together to move SeaWorld toward building this habitat, but the catalyst can be traced back to a large male orca named Tilikum who murdered Dawn Brancheau, an experienced trainer who was working with him, in 2010.

A key point in the debate about whether or not “murdered” is an acceptable word for the events that transpired rests on whether or not any trainer at any level of experience is safe in the pool with an orca whale. In court, SeaWorld contended that trainers well versed in methods of animal learning and operant conditioning were perfectly capable of controlling a 3 ton whale. The Occupational Safety and Health Administration, on the other hand, contended that no amount of contact with orcas could be considered safe. After some legal wrangling, it was decided that any and all contact between orcas and trainers had to be done with a solid material, such as a concrete barrier, between the trainer and the whale. Direct contact was no longer allowed.

I’ve been returning to my feelings about the SeaWorld incident a lot during this week of terrible football fandom that just won’t end. I wonder about organized football, both in college and the NFL, from the perspective of a workplace. I think about the increased rates of brain damage, along with the recent instances of domestic violence, child abuse, and rape and I wonder what this game that I love to watch is doing to the individuals that get a chance to play it at a high level.

Last week I wrote about one of the great benefits to using statistical methods for employee selection. Basically, using math to find people means it is much easier to find other people should the ones you originally find not work out socially. That’s a great boon to employers in a typical workplace. They become less beholden to talent. Talented people no longer become insulated from the consequences of socially unacceptable acts. And, for any typical place of employment, that works quite well. However, I did not consider in that piece that the NFL is not a typical place of employment. Professional football has some troubling facts associated with it. As a psychology professor, I know all about what brain damage to the prefrontal cortex can do to an individual. Increased emotional reactivity and impulsivity are just the tip of the iceberg. I wonder if this game is destroying the lives of the people that play it. And I worry that anyone that works to identify and project success from one level of football to another, including myself, may be complicit in that destruction.

I am well aware that this blog isn’t particularly popular. As of this writing, I have 45 followers on Twitter and I get about 6 readers a day, 3 of which are spam crawlers. I am, at the moment, insulated from a looming ethical dilemma. No one with actual decision making power is calling me to learn my opinion on whether or not a particular player should be granted the “privilege” of continuing to play the game. The minute that happens, though, I will have an important decision to make. I will need to choose whether or not I should use my intellect to grant someone else the opportunity to potentially destroy theirs in the name of a sporting contest. I will need to decide if I believe the game of football can be played in a way that doesn’t destroy the lives of its players or if I believe my ability to identity talented football players is the same as placing the best trainers in a pool with 3 tons of socially maladjusted rage.

I do know one thing. Talking about football has never seemed more hollow than in these last 10 days. The heart-rune of my fandom is ripped. I’m not sure it will ever heal.

page 1 of 1
Welcome , today is Wednesday, March 21, 2018