Is walking in baseball an old man's skill? (part 2)

As I showed in part one, there was a pretty strong correlation between walking and age last season (R-Squared was .587). I took the idea from part one (is walking an old man's skill?) and took it a step further. According to Fangraphs, there have been 3,636 players in the history of baseball who have accumulated 1000+ PAs. Over the past several weeks, largely because I did not know that there already existed an excel function (pivot table) to do it for me, I organized and sorted every one of those 3,636 players by BB% and took the average PA (and wOBA, for good measure) by walk rate (categorizing each walk rate by 1/10 of a percent). From there, I graphed the data and calculated the correlations. The lowest walk rate in the sample size (0.40%) belonged to a man by the name of George Zettlein (1108 PAs, .209 wOBA). The highest walk rate in the sample size (20.80%) belonged not to Barry Bonds (20.60%), but Ted Williams (9791 PAs, .493 wOBA).

The basis questions to the two data sets was as follows:
1) Do players who walk more get more PAs (in other words, do players who walk more have longer careers)?
2) Are players who walk more generally better at baseball (is BB% correlated to wOBA)?*
*note: wOBA is an OPS-like metric, which is skewed towards OBP rather than SLG and accounts for other factors such as SB%.

Chart #1 (click to enlarge): BB% (x-variable) to PAs (y-variable)

The R-Squared for this data set is .551

Chart #2 (click to enlarge): BB% (x-variable) to wOBA (y-variable)

The R-Squared for this data set is .639 (no surprise).

Other tidbits of data:
-The average wOBA over this sample size was .321
-The average BB% of the 3,636 players was 8.21%
-The average PA/career in the data set was 3596.

Conclusions:
-No surprise here, but the backbone of production is walking. There is a very high (approx. +.64) correlation.
-Walking is an old man's skill. There is a strong coefficient of determination between how many PAs a player will get over the course of his career and his walk rate. I feel that the strength of the coefficient is understated by two factors. First, guys like Albert Pujols are still young. In other words, a lot of very good players are still very young and haven't gotten the chance to accumulate 6000+ PAs yet. Secondly, and more importantly, many players with great talent are derailed by injury. There are plenty of players with fantastic peripherals who just cannot stay healthy and thus get fewer PAs across their career, thus distorting the lower ends of the data. When calculating the average PAs for players with better walk rates within the data (13+%), I noticed many players whose careers ended before accumulating even 2500 PAs.

This data should be pretty reliable. The minimum sample sizes necessary to draw statistically relevant inferences from the data is 200 PA for BB% and 500 PA for OPS. wOBA is somewhat related to OPS (though a much better representation of a player's offensive value), thus I make the logical, but mathematically unproven assumption that 1000 PAs would be a good minimum range for the data from which I could draw statistically relevant conclusions about wOBA (in relation to BB%).

All in all, the answer seems to be yes, walking is an old man's sport.

0 comments: