大數據技術 學習之旅_為什么聚焦是您數據科學之旅的關鍵

大數據技術 學習之旅

David Robinson, a data scientist, has said the following quotes:

數據科學家David Robinson曾說過以下話:

“When you’ve written the same code 3 times, write a function.”

“當您編寫了3次相同的代碼時,請編寫一個函數。”

“When you’ve given the same in-person advice 3 times, write a blog post.”

“當您兩次給出相同的面對面建議時,請寫一篇博客文章。”

The first quote is something you should adopt soon, but the focus (literally) for this post is the second quote. I wrote an article recently sharing some tips from my data science journey. In this article, I want to share with you the overall theme that I have been giving advice on since that post, focus.

第一個引號是您應該很快采用的,但是本文的重點(從字面上看)是第二個引號。 我最近寫了一篇文章, 分享了我在數據科學歷程中的一些技巧 。 在本文中,我想與您分享自從發表這篇文章以來,我一直在提供建議的總體主題。

為什么重點很重要? (Why focus is important?)

Image for post
Photo by Nicolas Picard on Unsplash
Nicolas Picard在Unsplash上拍攝的照片

If you were to follow the strands on this spider web, you could end up in many different intersection points.

如果您要跟蹤此蜘蛛網上的子線,則可能會遇到許多不同的交點。

Image for post
Photo by Deb Dowd on Unsplash
Deb Dowd在Unsplash上拍攝的照片

You could also take multiple paths to the same intersection point. But there is an optimal path. A shorter path. This is true of the data science field also. Just the number of subfields alone is vast. Even more so if you include the subject knowledge you need for projects if they are not in the same domain. If can quickly feel overwhelming…

您也可以采用多條路徑到達相同的交點。 但是有一條最佳的道路。 更短的路徑。 數據科學領域也是如此。 僅子域的數量是巨大的。 更重要的是,如果您包含的主題知識不屬于同一領域,那么您就需要這些項目。 如果可以很快感到不知所措...

Image for post
Photo by Christian Erfurt on Unsplash
克里斯蒂安·愛爾福特在Unsplash上的照片

It took me 2.5 years to land my data science role. If you haven’t read the prior article, here is some quick background on my situation:

我花了2.5年的時間才能獲得數據科學職位。 如果您還沒有閱讀上一篇文章,請快速了解我的情況:

  1. I am a husband and father to a toddler.

    我是一個小孩的丈夫和父親。
  2. I was a high school teacher with an hour commute in each direction by car.

    我是一名高中老師,每個方向的通勤時間均為一個小時。
  3. I only had an hour or so a day dedicated to data science since my wife supported me in this career change.

    自從妻子支持我從事這項職業以來,我只有一個小時左右的時間致力于數據科學。

I didn’t focus at the beginning. I started with an overview foundation since I didn’t have much of a programming background. I would still recommend this if you have no background in math and/or coding. The problem came afterward when everything about the field was so fascinating I leaped at everything I could interact with. But it prevented me from mastering anything, leading me into that classic saying…

一開始我沒有集中精力。 我從概述基礎開始,因為我沒有太多的編程背景。 如果您沒有數學和/或編碼的背景,我仍然會建議這樣做。 隨后,當有關該領域的所有事情都如此吸引人時,我就跳下了我可以與之互動的一切的問題。 但這阻止了我精通一切,使我陷入了那句經典的話……

“Jack of all trades, master of none.”

“萬事通,無精打采。”

Eventually, I felt incredibly overwhelmed. From that, there was a time when I shut down and didn’t practice anything for a few weeks.

最終,我感到難以置信。 從那時起,有一段時間我關閉了并且幾周沒有練習任何東西。

Image for post
Photo by Ben Weber on Unsplash
本·韋伯在Unsplash上的照片

那么如何避免我的錯誤呢? (So how can you avoid my mistake?)

There are a couple of approaches you could take and I should have considered sooner:

您可以采取幾種方法,我應該早點考慮:

  1. Focus on a particular branch of data science such as natural language processing or data visualization.

    專注于數據科學的特定分支,例如自然語言處理或數據可視化。
  2. Focus on a domain and sculpt your data science skills around projects in that domain.

    專注于某個領域,并圍繞該領域的項目雕刻您的數據科學技能。

After I got some help to get out of my rut, I took the second approach. Leveraging my educational background, I focused on solving problems related to the education field from the perspective of a teacher. This led me to:

在獲得幫助以擺脫困境后,我采取了第二種方法。 利用我的教育背景,我專注于從老師的角度解決與教育領域有關的問題。 這導致我:

  1. Influencing a hiring decision based on the academic needs of students.

    根據學生的學術需求影響招聘決定。
  2. Created an overview of my school’s performance in a concise report.

    在簡明的報告中概述了我學校的表現。
  3. Using a Bayesian version of a T-test to determine if my review lesson improved the student’s understanding and by how much.

    使用貝葉斯T檢驗確定我的復習課是否提高了學生的理解力以及提高了多少。

  4. Analyzing state exam questions to guide curriculum decisions.

    分析州考試題以指導課程決策。

These projects I put on my LinkedIn profile. They got the attention of people I did not expect. It got the attention of the outside school consultant who ended up providing a lot of future help. It got the attention of a Facebook recruiter for a related data science/education position with a starting salary above $130,000. Discussing my experience with these projects got me past the first round of interviews easily.

這些項目我放在我的LinkedIn個人資料中。 他們引起了我意料之外的人們的注意。 引起了外部學校顧問的注意,他們最終提供了很多未來的幫助。 它吸引了一位Facebook招聘人員的注意,該招聘人員的相關數據科學/教育職位的起薪超過13萬美元。 討論我在這些項目中的經驗使我輕松通過了第一輪采訪。

My rate of getting interviews and getting further in the rounds soon improved since I became more focused. Again, given my situation, it wasn’t the fastest, but it was a vast improvement compared to my previous rate. Each interview improved how I presented myself. Until eventually…

自從我變得更加專注之后,我獲得面試和進一步進步的速度很快就提高了。 同樣,鑒于我的情況,它不是最快的,但是與我以前的速度相比,這是一個巨大的進步。 每次采訪都改善了我的自我介紹。 直到最后……

Image for post
Photo by bruce mars on Unsplash
布魯斯· 瑪斯 ( Bruce mars)在Unsplash上拍攝的照片

I succeeded! I landed my dream role and broke into the data science field!

我成功了! 我找到了自己夢dream以求的角色,并闖入了數據科學領域!

At the time of writing this, it has been just shy of three months since this new career started and it has been incredible! The people I work with are amazing, I get constant feedback, my work is having an immediate and/or future impact, and I am getting praised for it (as a teacher you don’t get that often so it is important to me…and also I am a kid at heart).

在撰寫本文時,距這個新職業生涯還不到三個月,這簡直令人難以置信! 與我共事的人很棒,我得到不斷的反饋,我的工作具有立竿見影和/或未來的影響,我為此而受到贊譽(作為老師,您很少得到這樣的幫助,所以對我來說很重要……)而且我還是個內心的孩子)。

If you are still hunting for your career just know it isn’t impossible. You can do it! Just focus on what you want to do in this field as soon as possible. If you are still experimenting a bit that is ok. But I would recommend doing it quickly if possible. If you are a parent or have a similar situation to me do know it will take longer, but you will get there.

如果您仍在尋找自己的職業,那就知道那并非不可能。 你能行的! 請盡快專注于您要在該領域中要做的事情。 如果您仍在嘗試,那還可以。 但是我建議盡可能快地這樣做。 如果您是父母或與我有類似的情況,請知道這將花費更長的時間,但是您會到達那里。

When you do get there, you will reflect on your journey up to that point. You will review the good and bad of it all. Finally, you will turn toward the future of your new career, and be amped to get started!

當您到達那里時,您將反思到那時的旅程。 您將回顧所有優點和缺點。 最終,您將轉向新職業的未來,并為入門做好準備!

Image for post
Attentie Attentie on Attentie Attentie在UnsplashUnsplash拍攝

Thanks for reading! If you found this post helpful and you haven’t checked out some of the tips from my journey, you can read about them below:

謝謝閱讀! 如果您發現這篇文章很有幫助,但還沒有從我的旅程中找到一些技巧,則可以在下面閱讀有關它們的信息:

Also if you are entering the field with a math background and feel you need help organizing a learning plan, check out my recommendations in this article below:

另外,如果您以數學背景進入該領域,并且認為需要幫助組織學習計劃,請在下面的本文中查看我的建議:

You can follow me here or connect with me on Linkedin and Twitter. Open to DM’s on Twitter.

您可以在這里關注我,也可以通過Linkedin和Twitter與我聯系。 在Twitter上打開DM。

Until next time,

直到下一次,

John DeJesus

約翰·德耶穌

翻譯自: https://towardsdatascience.com/why-focus-is-key-for-your-data-science-journey-b62715b2a1c

大數據技術 學習之旅

本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。
如若轉載,請注明出處:http://www.pswp.cn/news/387897.shtml
繁體地址,請注明出處:http://hk.pswp.cn/news/387897.shtml
英文地址,請注明出處:http://en.pswp.cn/news/387897.shtml

如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!

相關文章

SQL 語句

去重字段里的值 SELECT DISTINCT cat_id,goods_sn,repay FROM ecs_goods where cat_id ! 20014 刪除除去 去重字段 DELETE FROM ecs_goods where goods_id NOT IN ( select bid from (select min(goods_id) as bid from ecs_goods group by cat_id,goods_sn,repay) as b );轉…

無監督學習 k-means_無監督學習-第4部分

無監督學習 k-means有關深層學習的FAU講義 (FAU LECTURE NOTES ON DEEP LEARNING) These are the lecture notes for FAU’s YouTube Lecture “Deep Learning”. This is a full transcript of the lecture video & matching slides. We hope, you enjoy this as much as …

vCenter 升級錯誤 VCSServiceManager 1603

近日,看到了VMware發布的vCenter 6.7 Update 1b的更新消息。其中有一條比較震撼。有誤刪所有VM的概率,這種BUG誰也承受不起。Removing a virtual machine folder from the inventory by using the vSphere Client might delete all virtual machinesIn t…

day28 socketserver

1. socketserver 多線程用的 例 import socket import timeclientsocket.socket() client.connect(("127.0.0.1",9000))while 1:cmdinput("請輸入指令")client.send(cmd.encode("utf-8"))from_server_msgclient.recv(1024).decode("utf…

車牌識別思路

本文源自我之前花了2天時間做的一個簡單的車牌識別系統。那個項目,時間太緊,樣本也有限,達不到對方要求的95%識別率(主要對于車牌來說,D,0,O,I,1等等太相似了。然后,漢字…

深度學習算法原理_用于對象檢測的深度學習算法的基本原理

深度學習算法原理You just got a new drone and you want it to be super smart! Maybe it should detect whether workers are properly wearing their helmets or how big the cracks on a factory rooftop are.您剛剛擁有一架新無人機,并希望它變得超級聰明&…

【python】numpy庫linspace相同間隔采樣 詳解

linspace可以用來實現相同間隔的采樣; numpy.linspace(start,stop,num50,endpointTrue,retstepFalse, dtypeNone) 返回num均勻分布的樣本,在[start, stop]。 Parameters(參數): start : scalar(標量) The starting value of the sequence(序列的起始點)…

Spring整合JMS——基于ActiveMQ實現(一)

Spring整合JMS——基于ActiveMQ實現(一) 1.1 JMS簡介 JMS的全稱是Java Message Service,即Java消息服務。它主要用于在生產者和消費者之間進行消息傳遞,生產者負責產生消息,而消費者負責接收消息。把它應用到實際的…

軟件本地化 pdf_軟件本地化與標準翻譯

軟件本地化 pdfSoftware has become such an essential part of our world that it’s impossible to imagine a life without it. There’s hardly a service or product around us that wasn’t created with software or that runs on software.軟件已成為我們世界的重要組成…

CentOS7+CDH5.14.0安裝全流程記錄,圖文詳解全程實測-8CDH5安裝和集群配置

Cloudera Manager Server和Agent都啟動以后,就可以進行CDH5的安裝配置了。 準備文件 從 http://archive.cloudera.com/cdh5/parcels/中下載CDH5.14.0的相關文件 把CDH5需要的安裝文件放到主節點上,新建目錄為/opt/cloudera/parcel-repo把我們之前下載的…

node.js安裝部署測試

(一)安裝配置: 1:從nodejs.org下載需要的版本 2:直接安裝,默認設置 ,默認安裝在c:\program files\nodejs下。 3:更改npm安裝模塊的默認目錄 (默認目錄在安裝目錄下的node…

數據庫不停機導數據方案_如何計算數據停機成本

數據庫不停機導數據方案In addition to wasted time and sleepless nights, data quality issues lead to compliance risks, lost revenue to the tune of several million dollars per year, and erosion of trust — but what does bad data really cost your company? I’…

luogu4159 迷路 (矩陣加速)

考慮如果只有距離為1的邊,那我用在時間i到達某個點的狀態數矩陣 乘上轉移矩陣(就是邊的鄰接矩陣),就能得到i1時間的 然后又考慮到邊權只有1~9,那可以把邊拆成只有距離為1的 具體做法是一個點拆成9個然后串聯 1 #includ…

社群系統ThinkSNS+ V2.2-V2.3升級教程

WARNING本升級指南僅適用于 2.2 版本升級至 2.3 版本,如果你并非 2.2 版本,請查看其他升級指南,Plus 程序不允許跨版本升級!#更新代碼預計耗時: 2 小時這是你自我操作的步驟,確認將你的 2.2 版本代碼升級到…

BZOJ4881 線段游戲(二分圖+樹狀數組/動態規劃+線段樹)

相當于將線段劃分成兩個集合使集合內線段不相交,并且可以發現線段相交等價于逆序對。也即要將原序列劃分成兩個單增序列。由dilworth定理,如果存在長度>3的單減子序列,無解,可以先判掉。 這個時候有兩種顯然的暴力。 將點集劃分…

activemq部署安裝

一、架構和技術介紹 1、簡介 ActiveMQ 是Apache出品,最流行的,能力強勁的開源消息總線。完全支持JMS1.1和J2EE 1.4規范的 JMS Provider實現 2、activemq的特性 1. 多種語言和協議編寫客戶端。語言: Java, C, C, C#, Ruby, Perl, Python, PHP。應用協議: …

python初學者_面向初學者的20種重要的Python技巧

python初學者Python is among the most widely used market programming languages in the world. This is because of a variety of driving factors:Python是世界上使用最廣泛的市場編程語言之一。 這是由于多種驅動因素: It’s simple to understand. 很容易理解…

主串與模式串的匹配

主串與模式串的匹配 (1)BF算法: BF算法比較簡單直觀,其匹配原理是主串S.ch[i]和模式串T.ch[j]比較,若相等,則i和j分別指示串中的下一個位置,繼續比較后續字符,若不相等,從…

什么是 DDoS 攻擊?

歡迎訪問網易云社區,了解更多網易技術產品運營經驗。 全稱Distributed Denial of Service,中文意思為“分布式拒絕服務”,就是利用大量合法的分布式服務器對目標發送請求,從而導致正常合法用戶無法獲得服務。通俗點講就是利用網絡…

nginx 并發過十萬

一般來說nginx 配置文件中對優化比較有作用的為以下幾項: worker_processes 8; nginx 進程數,建議按照cpu 數目來指定,一般為它的倍數。 worker_cpu_affinity 00000001 00000010 00000100 00001000 00010000 00100000 01000000 10000000; 為每…