廣告投手_測量投手隱藏自己的音高的程度

廣告投手

As the baseball community has recently seen with the Astros 2017 cheating scandal, knowing what pitch is being thrown gives batters a game-breaking advantage. However, unless you have an intricate system of cameras and trash cans set up, knowing what pitch is about to be thrown is incredibly difficult. Batters have mere fractions of a second to pick up on signals that might indicate the type of pitch coming their way and even fewer milliseconds to process and act on that information. Signals that could indicate what pitch is being thrown could be a subconscious tell from the pitcher or a sign from a runner looking in at second. But these are either situational or easily avoided by experienced pitchers. A much harder to avoid pitch indicator is the pitcher’s release point.

正如棒球界最近在2017年《太空人》中的作弊丑聞中所看到的那樣,知道投擲什么球距會讓擊球手具有突破性的優勢。 但是,除非您設置了復雜的照相機和垃圾桶系統,否則要知道將要拋出的間距是非常困難的。 擊球手只有幾分之一秒的時間就能接收到可能表明音高類型的信號,甚至更少的毫秒來處理和處理該信息。 可能表明投擲了什么音調的信號可能是投手下意識地發出的信號,也可能是跑步者看了一眼的信號。 但是這些都是有條件的,或是經驗豐富的投手很容易避免。 投手的釋放點是一個更難避免的投球指示器。

Image for post
Sean Manaea’s release points for pitches thrown in 2018–2019
肖恩·馬納阿(Sean Manaea)在2018–2019年投擲球場的釋放點

The figure above shows Oakland A’s pitcher Sean Manaea’s release point for all of his pitches thrown since 2018. As we can see from the plot, there are relatively distinct differences in release point depending on the type of pitch. For Manaea, he tends to release his fastball lower than any other pitch and his changeup is, on average, released at the highest point. An observant batter could pick up on these signals and use them to their advantage. No trash cans needed, just skill. However, not all pitchers have such distinct differences in their release point. Take Justin Verlander for example:

上圖顯示了Oakland A的投手Sean Manaea自2018年以來投出的所有投球的釋放點。從圖中可以看出,根據投球類型的不同,釋放點存在相對不同的差異。 對于Manaea而言,他傾向于將自己的快球釋放得比其他任何俯仰都低,并且他的換乘平均是在最高點釋放的。 細心的擊球手可能會撿起這些信號,并利用它們來發揮優勢。 無需垃圾桶,只需技巧即可。 但是,并非所有的投手在釋放點上都有如此明顯的不同。 以Justin Verlander為例:

Image for post
Justin Verlander’s release points for pitches in 2019
賈斯汀·維蘭德(Justin Verlander)在2019年的發布點

Looking at the graph above, we can see that Verlander’s release points are much more uniform. He doesn’t seem to have a very distinct difference in release point depending on his pitch type. This makes it much harder for a batter to predict what pitch Verlander is throwing based on differences in arm slot.

查看上圖,我們可以看到Verlander的發布點更加統一。 根據他的音高類型,他的釋放點似乎沒有非常明顯的不同。 這使擊球手很難根據臂隙的不同來預測Verlander的投球角度。

Quantifying Ability to Hide Pitches:

量化隱藏音高的能力:

Now that we have seen this difference in ability to hide pitches, a natural question would be: how can we quantify this difference? The way I decided to quantify this ability is by using a classification model. If you don’t know what a classification model is, here is a quick summary. Classification is a machine learning model that attempts to classify data based on certain features. In this particular model the ‘classes’ are the pitch types and the features are the coordinates for the release point and the count the pitch was thrown in. So the model takes the the release point coordinates and the count and does its best to determine what type of pitch was thrown based on that information.

既然我們已經看到了隱藏音高的能力上的差異,那么一個自然的問題將是:如何量化這種差異? 我決定量化此功能的方法是使用分類模型。 如果您不知道分類模型是什么,請快速總結一下。 分類是一種機器學習模型,試圖基于某些功能對數據進行分類。 在這個特定的模型中,“類”是音高類型,特征是釋放點的坐標以及被拋出音高的計數。因此,該模型將獲取釋放點的坐標和計數,并盡最大努力確定根據該信息拋出的音高類型。

Feature Selection:

功能選擇:

Of course, if I simply wanted to accurately classify what pitch was thrown I could include spin rate and the movement metrics of the pitch as features to make a much more accurate model. But I want to quantify how well pitchers hide their pitches from batters so I only want information that is available to the batter up until the ball is released from the pitcher’s hand. As a consequence, in this model we only have the release coordinates of the pitch and the count it was thrown in as features.

當然,如果我只是想準確地對拋出的音高進行分類,則可以將旋轉速度和音高的運動指標包括在內,以形成更準確的模型。 但是我想量化投手對擊球手的掩飾效果,所以我只希望擊球手能得到的信息直到球從投手手中釋放出來為止。 因此,在此模型中,我們僅具有音高的釋放坐標和作為特征拋出的音高。

Evaluating the Model:

評估模型:

Once the model has attempted to classify the data it is given, we need a way to evaluate how well it classified the data. This is how we will measure a pitcher’s ability to hide pitches from batters. The metric used to evaluate the model is called the precision score. This essentially returns the proportion of pitches that were classified correctly so it will range from 0 to 1. If the model is able to classify a large proportion of pitches correctly (a precision score value closer to 1) that tells us that the pitcher has more distinct differences in release points for his pitches and/or he is very predictable in the pitches he throws in certain counts. A precision score closer to 0 indicates that the model could not effectively classify the pitch type based on release point and count which tells us that the pitcher is much better at releasing pitches from the same point and mixes them up well depending on the count. One thing that needs to be kept in mind is that pitchers with a larger pitch repertoire will have a lower score simply because there are more pitches to classify. To counteract this I will be measuring how well the model does based on how much better it performs compared to simply randomly classifying the pitches. For example, if a pitcher has 4 pitches and you randomly guessed the pitch type you would expect to get 25% of them correct. So if the precision score for a pitcher with 4 pitches is 0.5, its adjusted score would be 2 because it is twice as effective compared to randomly guessing.

一旦模型嘗試對給定的數據進行分類,我們需要一種方法來評估其對數據的分類程度。 這就是我們測量投手隱藏擊球手投球能力的方式。 用于評估模型的度量標準稱為精度得分。 這實際上會返回正確分類的音調的比例,因此范圍為0到1。如果模型能夠正確分類很大比例的音調(精度得分值接近1),則告訴我們該音調器具有更多他的投擲點在釋放點上的明顯差異和/或他在某些計數下投出的投擲點非常可預測。 精度得分接近0表示該模型無法根據釋放點和計數有效地對音高類型進行分類,這告訴我們該投手在釋放同一點的音高方面要好得多,并且根據計數將它們很好地混合在一起。 需要牢記的一件事是,具有更大音調庫的投手將具有較低的分數,這僅僅是因為要分類的音調更多。 為了解決這個問題,我將根據模型的性能好于簡單隨機分類的音調來衡量模型的性能。 例如,如果一個投手有4個音高,而您隨機猜測了音高類型,那么您會期望其中的25%正確。 因此,如果一個具有4個音高的投手的精確度得分為0.5,則其調整后的得分將為2,因為它的效率是隨機猜測的兩倍。

Results:

結果:

Now that we have defined our model and evaluation metrics, let’s see the results. Here I picked 16 random pitchers from 2019 and ran their pitch data through the model.

現在我們已經定義了模型和評估指標,讓我們看看結果。 在這里,我從2019年挑選了16個隨機水罐,并通過模型運行了它們的水罐數據。

Image for post

Our ‘winner’ is Blake Snell who has both the highest precision score and highest adjusted score. Snell’s high score suggests that he has distinct release points for his different, let’s see a plot of his release points to verify this.

我們的“勝利者”是布雷克·斯內爾(Blake Snell),他同時擁有最高的準確度得分和最高的調整后得分。 斯內爾(Snell)的高分表明他有不同的釋放點,讓我們看一下他的釋放點圖以驗證這一點。

Image for post
Blake Snell’s release points
布萊克·斯內爾(Blake Snell)的發行要點

The graph above seems to fall in line with Snell’s high precision score. Snell appears to have very distinct areas where he releases his pitches with his changeup being released lower and to the right and his curveball and fastball being released higher and to the left.

上圖似乎與Snell的高精度得分相符。 斯內爾(Snell)似乎有非常不同的區域,他的球高和球高分別向左下方和右下方釋放,而他的曲線球和快球則在上方和左側釋放。

This difference is made even clearer when compared to Gerrit Cole who had the lowest precision score out of the players I tested.

與Gerrit Cole相比,這種差異更加明顯,后者在我測試的球員中得分最低。

Image for post
Gerrit Cole’s release points
Gerrit Cole的發行要點

Gerrit Cole’s release points are much more muddled and there aren’t clear patterns for where he releases certain pitches. This makes it much more difficult for batters to pick up what pitch is being thrown out of his hand.

杰里特·科爾(Gerrit Cole)的釋放點更加混亂,他釋放特定音高的位置也沒有明確的模式。 這使得擊球手要撿起他手中的球變得更加困難。

Applications:

應用范圍:

While this little experiment was more for my sake, I believe there are a couple ways teams could utilize this data and model.

就我而言,雖然這個小實驗更多,但我相信團隊可以采用幾種方法來利用此數據和模型。

The first application would be for batters to find pitchers with distinct differences in release point. Teams and hitters can then look at the release point plots of those pitchers to find the patterns for the different pitches and use that information to help figure out what pitch is being thrown based on the pitcher’s arm slot. Granted, that is much easier said than done but skilled players could use that information to great effect.

擊球手的第一個應用程序是查找釋放點明顯不同的投手。 然后,團隊和擊球手可以查看那些投手的釋放點圖,以找到不同投手的模式,并使用該信息來幫助根據投手的臂隙來找出要投擲的投手。 當然,這說起來容易做起來難,但是熟練的玩家可以利用這些信息產生巨大的效果。

The second application would be for pitchers. Pitchers that have high precision scores may want to try and lower their precision score by consistently releasing from the same point regardless of the pitch to make it harder for batters to pick up on what is being thrown.

第二個應用是投手。 具有高精確度得分的投手可能想要通過不斷地從同一點釋放而不管其間距如何來降低其精確度得分,從而使擊球手更難撿起所投擲的東西。

If you have any more ideas for how this data can be applied feel free to let me know!

如果您對如何應用這些數據還有其他想法,請隨時告訴我!

The code for the model and data can be found here on my GitHub. This is my first time using classification so any tips or criticisms are greatly appreciated.

該模型和數據的代碼可以在我的GitHub上找到 。 這是我第一次使用分類,因此非常感謝任何提示或批評。

翻譯自: https://towardsdatascience.com/measuring-how-well-pitchers-hide-their-pitches-f61f076d91f4

廣告投手

本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。
如若轉載,請注明出處:http://www.pswp.cn/news/388554.shtml
繁體地址,請注明出處:http://hk.pswp.cn/news/388554.shtml
英文地址,請注明出處:http://en.pswp.cn/news/388554.shtml

如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!

相關文章

linux事務隔離級別,事務的隔離級別(Transaction isolation levels)2

READ COMMITTEDREAD COMMITTED這是數據庫默認的隔離級別。它能保證你不能讀取那張表格數據,只要有其它事務還在改變這張表格數據。可是,因為sql server在select操作的時,鎖表格時間就那么一小會兒,如果一個事務在READ COMMITTED級…

Asp導出到Excel之二

response.contentType "application/vnd.ms-excel" response.addheader "Content-Disposition", "attachment; filename引出文件.xls" 一、適用于動態和表態表。 二、頁面最好只存放數據表,不要有其它內容。 三、對于分頁的情…

warning C4996: “strcpy”被聲明為否決的解決辦法

VC2005中,使用了很多標準的C函數,比如fopen,strcpy之類的。編譯時會出現警告,比如這個: d:\xxxx.c(1893) : warning C4996: “strcpy”被聲明為否決的 緊接著IDE有提示說:“This function or variable…

驗證部分表單是否重復

1. 效果 圖片中的名稱、機構編碼需要進行重復驗證2. 思路及實現 表單驗證在獲取數據將需要驗證的表單數據進行保存this.nameChangeTemp response.data.orgName;this.codeChangeTemp response.data.orgCode; 通過rule對表單進行驗證 以名字的驗證為例rules: {orgName: [// 設置…

python bokeh_提升視覺效果:使用Python和Bokeh制作交互式地圖

python bokehLet’s face it, fellow data scientists: our clients LOVE dashboards. Why wouldn’t they? Visualizing our data helps us tell a story. Visualization turns thousands of rows of data into a compelling and beautiful narrative. In fact, dashboard vi…

用C#寫 四舍五入函數(原理版)

doubled 0.06576523;inti (int)(d/0.01);//0.01決定了精度 doubledd (double)i/100;//還原 if(d-dd>0.005)dd0.01;//四舍五入 MessageBox.Show((dd*100).ToString()"%");//7%,dd*100就變成百分的前面那一部分了

C++設計UDP協議通訊示例

UDP是一種面向非連接,不可靠的通訊協議,相對于TCP來說,雖然可靠性不及,但傳輸效率較高   一、緒言   UDP是一種面向非連接,不可靠的通訊協議,相對于TCP來說,雖然可靠性不及,但…

浪里個浪 FZU - 2261

TonyY是一個喜歡到處浪的男人,他的夢想是帶著蘭蘭姐姐浪遍天朝的各個角落,不過在此之前,他需要做好規劃。 現在他的手上有一份天朝地圖,上面有n個城市,m條交通路徑,每條交通路徑都是單行道。他已經預先規劃…

C#設計模式(9)——裝飾者模式(Decorator Pattern)

一、引言 在軟件開發中,我們經常想要對一類對象添加不同的功能,例如要給手機添加貼膜,手機掛件,手機外殼等,如果此時利用繼承來實現的話,就需要定義無數的類,如StickerPhone(貼膜是手…

北大青鳥c語言課后答案,北大青鳥C語言教程--第一章 C語言基礎.ppt

《北大青鳥C語言教程--第一章 C語言基礎.ppt》由會員分享,可在線閱讀,更多相關《北大青鳥C語言教程--第一章 C語言基礎.ppt(20頁珍藏版)》請在人人文庫網上搜索。1、第一章,C 語言基礎,2,課程地位,.Net ,以 # 開始的語句稱為預處理器指令,#include語句不…

nosql_探索NoSQL系列

nosql數據科學 (Data Science) Knowledge on NoSQL databases seems to be an increasing requirement in data science applications, yet, the taxonomy is so diverse and problem-centered that it can be a challenge to grasp them. This post attempts to shed light on…

C++TCP和UDP屬于傳輸層協議

TCP和UDP屬于傳輸層協議。其中TCP提供IP環境下的數據可靠傳輸,它事先為要發送的數據開辟好連接通道(三次握手),然后再進行數據發送;而UDP則不為IP提供可靠性,一般用于實時的視頻流傳輸,像rtp、r…

程序員如何利用空閑時間掙零花錢

一: 私活 作為一名程序員,在上班之余,我們有大把的時間,不能浪費,這些時間其實都是可以用來挖掘自己潛在的創造力,今天要討論的話題就是,程序員如何利用空余時間掙零花錢?比如說周末…

編寫程序乘法口訣表C語言,陳廣川問:c語言編程九九乘法口訣表 怎樣用c語言寫九九乘法口訣表?...

怎樣用c語言寫九九乘法口訣表?哈哈,我剛剛用javascript寫好乘法口訣表。C語言,如何編寫程序輸出九九乘法表。形式如下 ********* ******** ******* ****** ***** **** *** ** *?兩個循環,一般用for循環 一個循環控制行…

PHP中文亂碼解決辦法

一&#xff0e;首先是PHP網頁的編碼 1. php文件本身的編碼與網頁的編碼應匹配 a. 如果欲使用gb2312編碼&#xff0c;那么php要輸出頭&#xff1a;header(“Content-Type: text/html; charsetgb2312")&#xff0c;靜態頁面添加<meta http-equiv"Content-T…

python中api_通過Python中的API查找相關的工作技能

python中api工作技能世界 (The World of Job Skills) So you want to figure out where your skills fit into today’s job market. Maybe you’re just curious to see a comprehensive constellation of job skills, clean and standardized. Or you need a taxonomy of ski…

欺詐行為識別_使用R(編程)識別欺詐性的招聘廣告

欺詐行為識別背景 (Background) Online recruitment fraud (ORF) is a form of malicious behaviour that aims to inflict loss of privacy, economic damage or harm the reputation of the stakeholders via fraudulent job advertisements.在線招聘欺詐(ORF)是一種惡意行為…

PE文件的感染C++源代碼

PE文件的感染C源代碼 PE文件規定了可執行文件的格式&#xff0c;凡是符合此格式的文件都能在windows系統上運行。PE文件的格式暫且不談&#xff0c;說一些感染PE文件的幾種途徑。 導入表感染。這個涉及比較復雜的操作&#xff0c;首先&#xff0c;要自行寫一個dll文件&#x…

c語言實驗四報告,湖北理工學院14本科C語言實驗報告實驗四數組

湖北理工學院14本科C語言實驗報告實驗四 數組.doc實驗四 數 組實驗課程名C語言程序設計專業班級 14電氣工程2班 學號 201440210237 姓名 熊帆 實驗時間 5.12-5.26 實驗地點 K4-208 指導教師 祁文青 一、實驗目的和要求1. 掌握一維數組和二維數組的定義、賦值和輸入輸出的方法&a…

c語言宏定義

一. #define是C語言中提供的宏定義命令&#xff0c;其主要目的是為程序員在編程時提供一定的方便&#xff0c;并能在一定程度上提高程序的運行效率&#xff0c;但學生在學習時往往不能理解該命令的本質&#xff0c;總是在此處產生一些困惑&#xff0c;在編程時誤用該命令&#…