View Single Post
  #1138  
Old 11-21-2021, 06:56 PM
BobC BobC is offline
Bob C.
 
Join Date: Apr 2009
Location: Ohio
Posts: 3,275
Default

Quote:
Originally Posted by AndrewJerome View Post
Great stuff guys. This is a fun thread.

A few things:

Snowman, if that model could be created it would be pretty cool. Obviously it would take a lot of work. I have a practical question. Sorry, I don’t know all the terminology, and I really have no idea how a model like that works. If that model were to be created, how would information get processed through the model? For say 1953 or whatever year, would every stat for that year have to be manually input into the model?

The idea of how athletes evolve is interesting. Of course humans have slowly gotten bigger, faster, stronger etc over the past 130 years. However, for quality of play in baseball, I’m not sure it is as simple as every year we go forward the quality of play gets a little better. Obviously, there have been social changes that impact this greatly. Quality of play clearly went down during the war years of the early 1940s, and clearly went up in the late 1940s with integration. This is only a guess, but it seems to me, just brainstorming, that quality of play seems especially strong in the 1950s / early 1960s, and also from the late 1980s to around 2000. A high number of very elite players entered MLB in the 1950s. Mantle, Mays, Aaron, Clemente, Jackie Robinson, Frank Robinson, Snider, Berra, Campanella, Banks, Matthews, Koufax, Gibson etc. The upper tier HOFers are seemingly endless for the 1950s and moving to the 1960s for the end of their careers. But it seems like there were far less upper tier HOFers starting out in the 1960s. Brock, Rose, Morgan types are not nearly as impressive as the 1950s list. Similarly, upper tier HOFers starting out near 1970 and early to mid 1980s are not nearly as impressive as the 1950s list. 1970s you have Reggie, Schmidt, Brett, early to mid 1980s you have Rickey Henderson, Ripken, etc. but no where near the top end talent starting out in the 1950s. But then in the mid to late 1980s you add Bonds, Clemens, Griffey, Randy Johnson, Maddux, Pedro, Arod, Jeter, Frank Thomas etc., just a lot of top tier HOFers and it would seem like very high level of play. I guess my question is how much impact do high end HOFers have on the level of play for a time period? The flip side of the argument would be that the “average” type players increased in skill greatly over time, and the “average” players in the league getting better over time could be more impactful than the amount of top end talent at any one time. Anyway, fun stuff to think about.

Finally, my understanding is that a high or low BABIP generally is a lucky/unlucky stat. An unusually high (and out of line) BABIP for a pitcher would entail bad luck where a bunch of line drives and grounders happen to get hits. And an unusually low BABIP for a pitcher would be good luck where line drives seem to be hit right at guys etc. How much of BABIP is “good situational pitching” or “good situational defense”? Who knows. But this being said, Maddux is a fascinating pitcher. His control is obviously elite and close to best of all time for control. And not just throwing strikes, but the ability to nibble at the edges of the strike zone. This makes it very hard to make solid contact and should equate to a lower BABIP. That’s just the eye test from watching him. Strikes that are on the corners are difficult to hit hard. It you rarely throw a meat ball and get lots of strikes on the corners then you’d think stats should follow the eye test, just because Maddux was so good with his control.
Andrew,

Some very insightful points. In particular about the measure of "luck" in regards to BABIP. Kind of like predicting the outcome of flipping a coin and whether it lands heads or tails. That outcome is always a 50/50 probability. And so over time, and all other things constant and equal and assuming a sufficient sample size, anyone flipping coins would eventually expect to see them ending up with exactly half heads, and half tails. To me, I've always thought of this as kind of what is meant by "regressing to the mean", in this case ending up 50/50 on heads or tales. But what is interesting is say you start out flipping coins to test this, and everything being constant and nothing abnormal with the coin, the first 9 flips all come out tails. Now the absolute probabity of a head or a tail is still just 50/50 on that next, 10th flip, or is it? Since over a large enough sample size we expect the number of heads or tails to come up to regress to that expected mean of 50/50 for each of the two possible outcomes, if in starting out with getting tails 9 times in a row, you know you eventually have to start flipping heads, but the probability of each and every single flip is still always going to be just 50/50. So now you have somewhat of a paradox on what the actual probability of flipping a head or tail on all future attempts should be, at least it seems like one to me.

So now back to BABIP. The fact that you have some pitchers that appear to consistenly be above or below the league average BABIP, all the time, leads me to believe there is something other than simple "luck" involved with them being able to do that. At what point (ie: sample size) will a statistician be comfortable in finally admitting there may be some other factor(s) or variable(s) that they haven't been able to effectively measure, quantify, and account for, and as a result just refer to it as "luck". For wouldn't it be true that if they had been able to somehow measure and include all the pertinent factors and variables in their formulas, such as a pitcher like Maddux's ability to have batters consistenly not hit the ball hard or cleanly, that those formulas would in fact show where all things do eventually regress to a mean. Just like they do in the case of flipping coins where it will eventually always come back to show a 50/50 heads or tails probability. In other words, in the case of BABIP, if the statisticians could effectively factor in ALL variables and factors, there would be no outliers, like a Maddux maybe, sitting significantly outside the mean, unless expainable by some other variable or factor, like a lack of a sufficient sample size. But to just simply explain these outliers by attributing those differences to such an amorphous concept or idea as luck, leads me to believe there is an inability, or unwillingness, on the part of those performing the statistical analysis to effectively be able to find and include all the pertinent variables and factors in their formulas. Thus making BABIP maybe the best statistical tool for it's intended purpose they can do for now, but ultimately not the best and closest statistical measure or tool currently out there for use that it could be.

Last edited by BobC; 11-21-2021 at 06:56 PM.
Reply With Quote