Monday, November 30, 2015

WHAT HAVE WORLD SERIES WINNING TEAMS DONE BEST? (part two)

You may dimly remember a series of posts about an obscure stat that lives in the bowels of the data at Forman et fils--a statistical breakout for relief pitchers that they call "Non-save situations."

It appears at first (and quite possibly even second and third) glance to be a catch-all, garbage-like stat, capturing all of the performances by relievers when the game is either well under control (ahead by four or more runs) or when the team is trailing--and also when the score is tied.

It also captures early inning usages of relievers (prior to the sixth inning) regardless of the game situation, but these are the rarest of the events that cluster into this odd "catch-all" area.

Oddly enough, however, these situations produce a .570 WPCT for the pitchers who get decisions in this "afterthought area." (That WPCT, by the way, covers one hundred and two seasons' worth of data: the won-loss totals for "non-save situation" decisions are 39671 wins and 29966 losses.)
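For those who like to see the arithmetic spelled out, here's a quick check of that figure using nothing beyond the totals just quoted:

```python
# Quick check of the aggregate non-save-situation WPCT quoted above.
wins, losses = 39671, 29966          # totals for "non-save situation" decisions
wpct = wins / (wins + losses)
print(f"{wpct:.3f}")                 # 0.570
```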

When studying World Series teams in the context of our title question, these games become rather interesting as a way of assessing the meaning of this "outcome anomaly." Will it prove to be a random function--meaning that all teams, regardless of their overall won-loss record, win around 57% of the decisions that occur in these situations--or is it a function defined and controlled by team quality, where better teams post better WPCTs in their "non-save situation" decisions?

Now, if you read those blog posts, you'll already know the answer. It turns out that the function is indeed defined and controlled by team quality. Teams that have won the World Series have an aggregate WPCT of .656 in non-save situation decisions; teams that lost the World Series have an aggregate WPCT of .630 for this breakout.
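For readers who want to poke at this themselves, here is a minimal sketch of how that random-versus-quality question could be tested, assuming you've already pulled a table of team-seasons with overall WPCT and non-save-situation decisions from Forman et fils (the file and column names below are hypothetical placeholders):

```python
# Sketch: is non-save-situation WPCT roughly constant (~.570) for everyone,
# or does it track team quality? Assumes a CSV with hypothetical columns:
# team, year, overall_wpct, ns_wins, ns_losses (non-save-situation decisions).
import pandas as pd

teams = pd.read_csv("team_nonsave_decisions.csv")
teams["ns_wpct"] = teams["ns_wins"] / (teams["ns_wins"] + teams["ns_losses"])

# If the outcome were random, the correlation would sit near zero and every
# quality tier would cluster around .570; if quality drives it, better teams
# should post better non-save WPCTs (as the World Series data suggests).
print(teams["overall_wpct"].corr(teams["ns_wpct"]))
print(teams.groupby(pd.qcut(teams["overall_wpct"], 4))["ns_wpct"].mean())
```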

So it's very likely that the thing that World Series winning teams have done best over the course of baseball history is to generate a significantly higher-than-average WPCT in games where the pitcher getting the decision is working in a non-save situation. Who woulda thunk?

Actually, with the number of decisions occurring in the non-save situation on the increase (due to the rise of reliever innings), it's becoming part of the strategic landscape--and a team with otherwise ordinary performance elsewhere can offset that with a top-flight performance in this obscure area. That was most definitely the case for the Royals in 2015 (24-11, .686 WPCT, 2.92 ERA) and the Giants in the previous season (30-7, .811 WPCT, 2.80 ERA).
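To put a rough number on that offset: applied naively, the historical .570 baseline would project a team with 35 such decisions to go about 20-15, so the Royals' 24-11 is roughly four games better than baseline, and the Giants' 30-7 closer to nine. A back-of-the-envelope sketch (the .570 baseline is the aggregate figure from above, not a team-specific expectation):

```python
# Rough estimate of wins gained over the historical .570 baseline in
# non-save-situation decisions (baseline applied naively to each team).
def extra_wins(wins, losses, baseline=0.570):
    decisions = wins + losses
    return wins - baseline * decisions

print(round(extra_wins(24, 11), 1))  # 2015 Royals: roughly 4 wins above baseline
print(round(extra_wins(30, 7), 1))   # 2014 Giants: roughly 9 wins above baseline
```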

Thus a "garbage" stat, one apparently deserving the briefest of afterthoughts, is evolving into another key tool in winning games. And, as we noted in the earliest posts about it, it restores meaning in the won-loss stats...consider it another piece of moral relativism stuffed down the throats of those who probably aren't paying attention.

Saturday, November 14, 2015

WHAT HAVE WORLD SERIES WINNING TEAMS DONE BEST? (part one)

There are literally hundreds of ways to try to answer the question in the title above...the first thing that needs to be done in order to narrow the focus is to decide what our point of comparison is. Are we comparing World Series champs to all other teams? Are we comparing them to all other playoff teams?

Or are we going to look at them only in terms of their opponents in the World Series?

Prior to 1969, of course, that was the only point of comparison we had. So to keep any potential data set operating on at least a semi-consistent basis, what we propose to look at here (in a series of posts to appear irregularly during the off-season) is what separates World Series winners and losers. So we are performing only a binary comparison here.

Even with that, we still have many ways to skin the cat. What type of performance are we talking about? Is it what happens in the World Series itself? No, that would be too small a sample size. We'd be better off looking at the in-season data for the two teams and seeing if any strong patterns emerge from it.

So--in-season data. What type of data? Pitching? Do we want to look at bullpen performance? Particular layers of that performance? What about hitting? What would be significant enough as a rough guide to capture differences? And will any of them prove to be more than a random pattern?

Well, no way to know without just diving in somewhere and hoping that it's not the shallow end of the pool. Reaching in semi-blindly, we're choosing to begin by looking at the teams' hitting with two outs. That's a large enough data sample in each season to be meaningful: we're not down to something that's only a tenth of the total plate appearances.

It turns out that this particular split data goes back to 1957, with a few earlier seasons becoming available as Retrosheet fills in more play-by-play data from the deeper past. Interestingly enough, when we use the sOPS+ value from Forman et fils as a way of gauging how much better than league average these teams hit with two outs, we find two interesting facts. First, World Series winners are, on average, 9% better than their overall league average in hitting with two outs. Second, the 1957 Milwaukee Braves, one of the very first teams for whom we have this data, have the highest sOPS+ value of any World Series winner, at 137.
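For readers unfamiliar with the stat: sOPS+ puts a team's OPS in a given split on a scale where 100 is the league's performance in that same split, so "nine percent better" corresponds to an sOPS+ of roughly 109. As we understand it, Forman et fils compute it OPS+-style (OBP and SLG compared separately to the league split); the plain ratio below is just a simplified illustration of how the scale reads, with made-up numbers:

```python
# Simplified illustration of the sOPS+ scale: 100 = league average in the
# split, 109 = about 9% better. The actual computation at Forman et fils is
# OPS+-style (OBP and SLG compared separately to the league split); this
# plain OPS ratio only shows how the scale reads.
def simple_split_plus(team_split_ops, league_split_ops):
    return round(100 * team_split_ops / league_split_ops)

# Hypothetical numbers: a .760 OPS with two outs in a league hitting .700
# in that split comes out around 109.
print(simple_split_plus(0.760, 0.700))  # 109
```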

The 2015 Royals, the most recent World Series winner, rank twelfth on this list, with a 119 sOPS+.

Oddly enough, the 1985 Royals--the previous KC team to win a World Series--rank dead last (62nd) in this stat, with an 85 sOPS+.

The other interesting thing here is that we are seeing a lot of recent World Series winners on either extreme of this list. Particularly unusual is the fact that the San Francisco Giants, in all three incarnations of their recent even-year dominance of the World Series, were very poor performers when hitting with two outs.

So now what we want to know is: how does this stack up against the teams they beat in the World Series? All of the above wouldn't mean jack if the losing team in the Series had a higher sOPS+ in plate appearances with two outs. And it turns out that the losers' figure is lower--not a lot lower (104, against the winners' 109), but lower.

But there is another nuance we should explore here--namely, when two teams face off in the World Series, does the fact that one of them performed better with two outs during the season have any predictive value with respect to who becomes the eventual World Champion? Or is this simply another random variable?

The answer: there is some possibility that it is, in fact, an indicator--particularly in recent times. Measuring the data from the first year where we have both winners and losers available (1957), we see that the eventual World Series winner has had a higher sOPS+ when hitting with two out in 32 out of 54 Fall Classics, or 59% of the time.

But this was a 50/50 proposition from 1960 through 1982; since then, the odds are closer to 2 to 1 in favor of the World Series winner having had a better 2-out hitting performance during the regular season than the World Series loser.

Interestingly, as the teams that make the World Series have become more subject to the random forces that have taken hold due to the expanded post-season, this trend has seemed to become more robust. In the past 20 World Series dating back to 1996, the team with better 2-out hitting has won 14 times (70%). And as the chart at right shows, a five-year smoothing of the ten-year average for this data runs even higher than that over the past ten years.
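For anyone who wants to reproduce that smoothed trend, here is a minimal sketch, assuming a small table of World Series years with each club's regular-season two-out sOPS+ (file and column names are hypothetical placeholders):

```python
# Sketch: share of World Series won by the team with the better regular-season
# two-out sOPS+, overall and over a rolling ten-year window. Assumes a CSV
# with hypothetical columns: year, winner_sops_plus, loser_sops_plus.
import pandas as pd

ws = pd.read_csv("world_series_two_out.csv").sort_values("year")
ws["winner_had_edge"] = (ws["winner_sops_plus"] > ws["loser_sops_plus"]).astype(int)

# Overall rates (should land near 59% from 1957 on, and near 70% from 1996 on).
print(ws["winner_had_edge"].mean())
print(ws.loc[ws["year"] >= 1996, "winner_had_edge"].mean())

# Ten-year rolling share, to see how the trend has strengthened recently.
print(ws.set_index("year")["winner_had_edge"].rolling(10).mean().tail(20))
```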

Small sample size? Of course. And none of this takes into account all of the intermediate post-season matchups that occur along the way to the Fall Classic. But it is interesting to note that this trend has strengthened even as teams in the World Series are declining in average WPCT due to the randomizing effects of the expanded post-season.

This is one we will have to keep an eye on moving forward...