KATOH: Forecasting Major League Pitching with Minor League Stats

Shortly before the new year, I wrote a piece at The Hardball Times introducing KATOH — a methodology for forecasting major league performance using minor league stats. Using a series of probit regression analyses, I explored how a hitter’s age and offensive statistics are predictive, across all levels of the minor leagues, from Rookie ball to Triple-A, of his future big league performance.

The result was a set of projections for for each minor league hitter, which included the probability that he’d play in the majors and that he would hit certain WAR thresholds through age 28. This analysis also provided some insight into which offensive statistics are predictive of future success for players at each level of the minor leagues. The most pronounced trend — and possibly the most surprising one — relates to a hitter’s walk rates. In the lower-levels of the minors, they have little to no bearing on a player’s future big league success.

Today, I’m going to deal with minor league pitchers using the same type of methodology. In my original piece, I noted that these projections shouldn’t be used to replace traditional, scouting-based methodologies. Instead, they are intended to complement them, and possibly uncover statistical factors that have been overlooked. This is especially true for pitchers, whose stats take longer to stabilize, and whose stuff often matters more than stats.

The table below summarizes which stats proved significant at each minor league level. This analysis includes minor league data going back to 1991, the earliest year for which Baseball-Reference has batters-faced totals for pitchers. R+ refers to the advanced rookie leagues–the Appalachian and Pioneer Leagues — while R- includes the Arizona and Gulf Coast Leagues.

The following factors wound up being significant for pitchers at one or more minor league level: Age, the percentage of a pitcher’s games that were starts, strikeout rate, walk rate, home run rate, and handedness. Additionally, the square of a pitcher’s strikeout rate (K%^2) was significant at the Triple-A level with a negative coefficient. Essentially, this says that a high strikeout rate bodes well for pitchers in Triple-A, but the added benefit starts to diminish around 25 percent. All the performance stats (K%, BB%, HR%) have been adjusted to league average, but were not park-adjusted.

Significant Statistics by Level

Age GS% K% BB% HR% Handedness K^2
AAA Yes Yes Yes Yes Yes Yes
AA Yes Yes Yes Yes Yes Yes
A+ Yes Yes Yes Yes
A Yes Yes Yes Yes Yes
A- Yes Yes Yes Yes
R+ Yes Yes Yes
R- Yes Yes Yes Yes

Unsurprisingly, both a pitcher’s age and his percentage of games started are predictive in the direction you would expect. Pitchers who are young for their level are more likely to be successful than older prospects, and starting pitchers are generally more successful than those who work in relief. Strikeout rate is also a very important predictor of future success, especially in short-season leagues, where other metrics don’t tell us anything about future success.

Just as we saw with hitters, a pitcher’s walk rate is not at all predictive for appearances in the lowest levels of the minor leagues. However, once a pitcher reaches full-season ball, a one percent change in walk rate immediately becomes almost as useful as a one percent change in strikeout rate in forecasting future performance. This differs from what I found for hitters, whose walk rates mean very little below Double-A, and don’t become as useful as strikeout rate until the Triple-A level.

Home run rate is another metric that’s pretty much meaningless in the lower levels of the minors. Although it starts to add some predictive value as early as A-ball, its effect is relatively small for pitchers below Double- and Triple-A. Relative to strikeout percentage, a one percent increase in a pitcher’s home run rate matters about twice as much Triple-A as it would for a pitcher in Low-A and Short-Season-A.

Another interesting finding is the significance of a pitcher’s handedness. For pitchers in Double-A and in the lower rung of Rookie ball, a righty is more likely to blossom into a successful big leaguer than a lefty, all else being equal. Don’t read too much into this, however, as the effect isn’t large enough to make a noticeable difference in the projections. Its hard to say exactly why a righty is more likely to succeed than a lefty with comparable stats, but my guess is that it has something to do with minor league hitters facing lefties with unusual deliveries — like throwing side-arm — for the first time.

One variable I wish I could include here is a pitcher’s height. Its generally accepted that taller pitchers have an advantage over shorter ones — taller pitchers can throw the ball on a more downhill plane and usually release the ball closer to home plate. As a result, you often hear evaluators refer to a pitching prospect as “projectable” if he’s over 6-foot-3, implying that there’s some extra potential left for him to unlock. As a result, I would imagine that a height variable would be significant with a positive coefficient. In other words, if a 6-foot-4 pitcher and a 5-foot-11 pitcher both had the same stat line, I would guess that the taller guy would be more likely to succeed.

Unfortunately, I can’t say that for sure, since I couldn’t find height data for minor leaguers in a readable format. But I’m hoping to add height into the mix for future projections. So if you have any suggestions on how I could track down this type of data, please let me know.

Predicting how a minor league hitter will perform in the majors is no easy task, and doing it for pitchers is even harder. As the saying goes: “There’s no such thing as a pitching prospect.” Countless pitching prospects who have put up crazy minor league numbers only to flame out without establishing themselves at the big league level. Often it’s due to injury — we’ll never know what could have been for promising pitchers like Ryan Anderson and Nick Neugebauer, but sometimes healthy pitching prospects just turn into pumpkins once they reach the majors — like Rick Ankiel and Salomon Torres. Then there are guys like Johan Santana and Roy Oswalt, who turn into bona fide aces after scuffling for a few years in the minors. It’s a crapshoot.

Because of all of this uncertainty, the top KATOH projections for pitchers tend to run lower than they do for hitters — minor league pitching stats just aren’t as predictive as hitting stats. KATOH is pretty wishy-washy on most pitchers, especially when they’re in the low minors. For pitchers in short-season ball, nearly all of the KATOH projections are clumped relatively close together, even when simply estimating the probability that a player will make it to the majors. Aside from what he does in Triple-A, a pitcher’s minor league stats can tell us only so much, so unless a low-level pitcher is really tearing things up, KATOH is going to peg him somewhere close to the median.


While KATOH has a significantly harder time projecting pitchers than batters, it succeeds and fails in similar ways. Just as with hitters, it fares pretty well for players in the high minors, but has a really tough time with guys at the lower levels, especially when you look at the higher-WAR thresholds. For low-level pitchers, the only thing it can tell us with any sort of certainty is whether or not he’ll crack the big leagues before his 29th birthday.

Considering all pitchers with a KATOH projection since 1991, the table below shows the average residual–the difference between KATOH’s prediction and what actually happened (either zero or 100 percent)–divided by the average prediction at each level. The greener the box, the better job KATOH did of guessing right.


Now for the fun part.. But before you immerse yourself in the projections, keep a couple of things in mind:

1) Be wary of projections for pitchers in the lower levels of the minors. As I outlined above, the projections become less and less reliable with every step you take down the minor league ladder. Low minors projections are a best guess based on the available data, but are very much subject to error.

2) Pay attention to sample sizes. Many of the pitchers listed in my Google doc may have excellent projections over a small number of innings. Take these with as many grains of salt as you would a pitcher’s FIP over the same number of innings.

Unsurprisingly, the list of top KATOH projections (minimum 200 batters faced, or about 50 innings pitched) reads like a “who’s who” of top pitching prospects in baseball. But if you look further down the list, you’ll find a few lesser-known guys who may not have knockout stuff, but still managed to get batters out in 2014 despite being young for their level.


As I did with hitters, I put together a document that includes a projection every player who threw a pitch in affiliated baseball last year. Again, sample size caveats apply. I also made another document that includes all of the minor league seasons that went into making these forecasts, which includes all prospects were 28 or older in 2014. Have fun browsing these projections, and don’t hesitate to reach out to me if you have any questions or suggested improvements.

This article originally appeared at The Hardball Times.

About Chris

Chris works in economic development by day, but spends most of his nights thinking about baseball. He writes for Pinstripe Pundits, and is an occasional user of the twitter machine: @_chris_mitchell
This entry was posted in Analysis and tagged , . Bookmark the permalink.

Comments are closed.