I’ve had it in the back of my mind for some time now that it wouldn’t be too much of a leap to adapt my parser from collecting batter-related state transitions to collecting pitch data … both sets of information are in the same XML file. So, last night, after I got down to the last three differences between my parsing results and those from Retrosheet, I decided to give it a whirl.
Oh, oh, oh … it was a piece of cake compared to generating the state transitions … and now I have a database table of 715,819 pitches, with 51 fields for each pitch. Woohoo!
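For anyone curious what "piece of cake" looks like, here is a minimal sketch of the idea in Python. It is not my actual parser: the element names (`atbat`, `pitch`), the attribute names (`des`, `type`, `px`, `pz`) and the sample data are all assumptions made up for illustration, and the real file carries many more fields per pitch.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample. Element and attribute names are assumptions for
# illustration only, not a statement of the real file's schema.
SAMPLE_XML = """
<game>
  <inning num="1">
    <top>
      <atbat num="1" batter="111">
        <pitch des="Ball" type="B" px="-1.20" pz="3.10"/>
        <pitch des="Called Strike" type="S" px="0.15" pz="2.40"/>
        <pitch des="In play, out(s)" type="X" px="0.30" pz="2.05"/>
      </atbat>
    </top>
  </inning>
</game>
"""

def extract_pitches(xml_text):
    """Return one dict per pitch, tagged with its parent at-bat."""
    root = ET.fromstring(xml_text)
    rows = []
    for atbat in root.iter("atbat"):
        for pitch in atbat.iter("pitch"):
            # Copy every attribute so no field is silently dropped.
            row = dict(pitch.attrib)
            # Carry the at-bat context down to the pitch row.
            row["atbat_num"] = atbat.get("num")
            row["batter"] = atbat.get("batter")
            rows.append(row)
    return rows

pitches = extract_pitches(SAMPLE_XML)
for p in pitches:
    print(p["atbat_num"], p["des"], p["px"], p["pz"])
```

Each dict maps naturally onto one row of a pitches table. All values come out as strings, so numeric fields would still need converting before loading them into a database.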
Now the question is: what questions to ask about pitches??? I think it’s PCA time … and also time to revisit charts in R.
Pitch locations for the first three games of the season:
Pitches from the first three games that were ‘called balls’:
… not bad there, ump!
On the other hand … pitches that were ‘called strikes’:
… no wonder the players get cheesed at the umpires :-þ
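To put a number on the eyeball test, here is a rough sketch of how the calls could be audited against a rectangular zone. It assumes each pitch has a horizontal location `px` and height `pz` in feet measured from the center of the plate, plus per-batter zone limits `sz_bot` and `sz_top`; those field names and the sample pitches are hypothetical. The plate width (17 inches) and approximate ball diameter (about 2.9 inches) are real, but treating the zone as a simple rectangle padded by one ball radius is a simplification.

```python
PLATE_HALF_WIDTH_FT = 8.5 / 12   # home plate is 17 inches wide
BALL_RADIUS_FT = 1.45 / 12       # a baseball is roughly 2.9 inches across

def in_zone(px, pz, sz_bot, sz_top):
    """True if any part of the ball touches the rectangular zone."""
    half_width = PLATE_HALF_WIDTH_FT + BALL_RADIUS_FT
    inside_x = abs(px) <= half_width
    inside_z = (sz_bot - BALL_RADIUS_FT) <= pz <= (sz_top + BALL_RADIUS_FT)
    return inside_x and inside_z

def audit_calls(pitches):
    """Count called pitches that disagree with the rectangular zone."""
    counts = {"called_strike_outside": 0, "called_ball_inside": 0}
    for p in pitches:
        inside = in_zone(p["px"], p["pz"], p["sz_bot"], p["sz_top"])
        if p["des"] == "Called Strike" and not inside:
            counts["called_strike_outside"] += 1
        elif p["des"] == "Ball" and inside:
            counts["called_ball_inside"] += 1
    return counts

# Hypothetical pitches, invented for illustration.
sample = [
    {"des": "Called Strike", "px": 1.10, "pz": 2.5, "sz_bot": 1.5, "sz_top": 3.5},
    {"des": "Called Strike", "px": 0.20, "pz": 2.0, "sz_bot": 1.5, "sz_top": 3.5},
    {"des": "Ball", "px": 0.00, "pz": 2.5, "sz_bot": 1.5, "sz_top": 3.5},
    {"des": "Ball", "px": -1.50, "pz": 3.0, "sz_bot": 1.5, "sz_top": 3.5},
]
counts = audit_calls(sample)
print(counts)
```

Swinging strikes and balls in play are ignored here on purpose, since the umpire makes no zone call on those.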
On another other hand though … pitches that were swung at:
… pretty sure the umpires figure the batters are blind too!
And now … hits!
Miguel Cabrera’s spray chart for 2016. The field shown is a representative one: 330′ down the lines and 371′ to center.