Building a Better Box Score

What is the most common line present in any analytics-based article/post/discussion of the game of football? My answer is some version of “Analytics in football are more difficult/impossible/can’t be done because of how interactive and interdependent the members of a football team are compared to other sports.” Sometimes that basic line is flavored differently, depending on the particular tone of the piece, but that line always seems to be there. It’s a seemingly necessary caveat of the genre. “Of course we can’t know everything about Quarterback X because so much of being a quarterback depends on [insert whatever point one want to make about interactivity on football teams].”

Why do we simply talk about that problem? Why do we constantly talk about the interdependence problem but never fix it? Can it actually be fixed and what would such a solution look like?

The Football Box score – A Model by Ralph Wiggum

To start, I’d like to spend some time developing the idea of why dealing with the interdependence problem is so difficult in football. After all, other industries deal with a similar problem. The entire job of management is interactive and interdependent by nature, we can still figure out who the good managers and the poor managers are. Why do we have such a difficult time in football? I argue it’s because of the modern football box sore, a ubiquitous, pervasive summary of the events on a football field that horribly misrepresents the realities of the game.

A good box score acts like a description of the events of a game, the same way that the model of the solar system you built in middle school acts as a basic description of how the solar system works, how it is set up, and how each component generally relates to the others. Your solar system model was also simplified. You didn’t put all of Jupiter’s 67 moons in your model. You probably didn’t put in the asteroid belt or the daily spin of each planet to an accurate degree (kudos to you if you did), and that’s okay. The purpose of any model is to get the major elements of the system correct while simplifying or eliminating the less important elements. And this is where the box score of a football game falls down dramatically. If a football box score were a model of the solar system, it would be a model created by Ralph Wiggum. This is true to some extent of the entire box score, but I’m going to focus mostly on the passing statistics element as it is the worst offender.

The passing yards box score for Florida State in their blowout loss to Oregon in the first round of the college football playoffs looks something like this.

Passing Yards

Jameis Winston, 29/45, 348 yards, 1 TD, 1 INT

Sean Maguire, 0/3, 0 yards, 0 TD, 0 INT

Receiving Yards

Travis Rudolph, 6 rec., 96 yards, 1 TD

Jesus Wilson, 5 rec., 72 yards, 0 TD

Karlos Williams, 5 rec., 59 yards, 0 TD

Rashad Greene, 6 rec., 69 yards, 0 TD

Dalvin Cook, 3 rec., 24 yards, 0 TD

Ermon Lane, 2 rec., 22 yards, 0 TD

Freddie Stevenson, 1 rec., 12 yards, 0 TD


There are lots of problems with how this data is presented, but let’s focus on two.

Problem 1 – Data Redundancy

The problem that upsets me the most about a football box score is needless redundancy. Every single yard gained by a forward pass is counted twice – once for the quarterback and once for the receiver. This is a problem, a big problem, because it means the data presented here do not reflect what actually happens in a football game. You don’t complete a forward pass and then immediately mark off the gained yardage again. However, in a football box score, because we call the events different things – completions vs. receptions and passing vs. receiving yards – suddenly it becomes okay to double count every single event in the passing game except attempts and interceptions. But they are the exact same event. Jameis Winston’s completion is Rashad Greene’s reception. One cannot happen without the other. Winston earns passing yards and Greene earns receiving yards on the exact same yards gained. We’re not modeling what actually happened in the game. We’re modeling a way to give credit in the most individualized way possible. However, as every single football analytics article will tell you, football is an interactive game. If the game is, in reality, interactive, why do we assign credit for the events in this individualized manner?

Problem #2 – Loss of Information

I think it’s rather ironic that a football box score has so much information redundancy, but it explicitly removes an important piece of information that would allow us to complete some very important analyses on team performance in football.

As an example of the loss of information, let’s look at a different example, this time from Oregon’s Week 1 win over the University of South Dakota. Early season games are useful for this example because it is very likely that the backup of the “high power” football program will spend a great deal of time in the game.

Passing Yards

Marcus Mariota, 14/20, 267 yards, 3 TDs, 0 INTs

Jeff Lockie, 11/12, 113 yards, 1 TD, 0 INTs

Receiving Yards

Byron Marshall, 8 rec., 138 yards, 2 TDs

Darren Carrington, 4 rec., 68 yards, 0 TDs

Dwayne Stanford, 1 rec., 62 yards, 1 TD

Pharaoh Brown, 2 rec., 32 yards, 1 TD

Johnny Mundt, 2 rec., 29 yards, 0 TDs

Keanon Lowe, 1 rec., 18 yards, 0 TDs

Royce Freeman, 1 rec., 11 yards, 0 TDs

Thomas Tyner, 3 rec., 8 yards, 0 TDs

Charles Nelson, 1 rec., 8 yards, 0 TDs

Devon Allen, 1 rec., 5 yards, 0 TDs

Johnathan Loyd, 1 rec., 1 yards, 0 TDs


Here we have two quarterbacks that completed a similar number of passes over the course of the game, but Marcus Mariota gained more than double the passing yards compared to Jeff Lockie. How was that accomplished? Which receivers gained all those yards for Marcus Mariota? Who caught those passes from Mariota to gain so many passing yards for him? Was it because Dwayne Stanford caught one long pass for a touchdown from Mariota? Or did Jeff Lockie complete that pass and only got 51 yards from the other 10 passes he completed? Who was the intended receiver on the six attempts that Mariota did not complete? How many times was any receiver an intended receiver, but did not complete the catch? Did Mariota target the same receiver over and over with no results? Or are his unsuccessful attempts scattered all over the place? We don’t know the answer to any of these questions from the box score description of this particular game. Answering these questions is of critical importance to a better understanding of the game of football.

Our first step to scientifically understanding football is to build a better box score. The question is, what would we want the box score to represent?

Begin with a Model

Before we can build a tool to aggregate data, we need to have a decent idea of what data we want and why we want it. We need to start with a theory of how a football offense works. This is our “model of the solar system” as it were. What things in a football offense are important and what things should we avoid for right now? Here is a figure of my current theory of a football offense.

Basic Model of a Football Passing Offense



This very basic theory says that a football offense begins with the offensive play caller. The play call made then filters down to whichever quarterback is currently in the game, and the ability of the quarterback then filters down to the receivers. Note that we could also include pass catching tight ends and running backs in this model.

Now, I’ve simplified the passing offense to a great extent in this model. Most notably, I’ve removed the impact of the offensive line here. That is by design, but not because I think that the offensive line is unimportant. Instead, I think the offensive line is like adding in the daily spin of the planets to your solar system model. Adding it in becomes incredibly complex and will probably take some specialized data that not is not publically available. For right now, we will keep the offensive line out of our model. Anything else that’s left out of this model we will consider as having such a limited effect that it won’t change our understanding to a large enough extent that we need to account for it.

A Better Box Score

Now that we have our model, we can adjust our box score so that it reflects the important elements inherent in our model. The most important element of the theory says that we must count passes as a single event and account for the coach that called the play, the quarterback that threw the pass, and the receiver that caught it. Such a thing isn’t terribly difficult to create. You can see an example of one below. In this example, I am allowing team to stand in as a proxy for the play caller.



Now all we need is a way to analyze data that fits our theoretical understanding and the data that our better box score is collecting.  Fortunately, such a way exists. Next week.

page 1 of 1
Welcome , today is Wednesday, March 21, 2018