大數據業務學習筆記
意見 (Opinion)
A lot of aspiring Data Scientists think what they need to become a Data Scientist is :
許多有抱負的數據科學家認為,成為一名數據科學家需要具備以下條件:
- Coding 編碼
- Statistic 統計
- Math 數學
- Machine Learning 機器學習
- Deep Learning 深度學習
And any other technical skills.
以及其他任何技術技能。
The above list is accurate; most of the Data Scientist qualification you need right now is what I list above. It is unavoidable, as many job listing right now always list these skills as a prerequisite. Just look at the example of Data Scientist job requirements and preferences below.
上面的清單是準確的; 我上面列出的是您現在需要的大多數數據科學家資格。 這是不可避免的,因為現在很多工作清單總是將這些技能列為前提條件。 只需看下面的數據科學家工作要求和偏好示例。

Most of the requirements sound technical; degree, coding, math, and stats. Although, there is an underlying business understanding requirement that you might not realize at first from this job advertisement.
大部分要求聽起來都是技術性的; 學位,編碼,數學和統計信息。 但是,有一個潛在的業務理解要求,您可能首先不會從此招聘廣告中意識到。
If you look closely, they require someone that had experience in applying the analytical method to solve practical business problems. It implies your everyday task would consisting of solving the business problem, which in turn, you need to understand what kind of business the company runs and how the process itself works.
如果您仔細觀察,他們會要求那些具有應用分析方法來解決實際業務問題的經驗的人。 這意味著您的日常任務將包括解決業務問題 ,而這又需要您了解公司經營哪種業務以及流程本身如何運作。
You might ask, “Why do I need to understand it? Just create the machine learning model and the problem is solved, isn’t it?” Well, that line of thinking is dangerous, and I would explain why.
您可能會問:“為什么我需要了解它? 只需創建機器學習模型即可解決問題,不是嗎?” 好吧,這種思路很危險,我將解釋原因。
Just for a reminder, I would argue what makes you great as a Data Scientist is not only how well your coding skill is or how much you understand the statistical theory or even the master of business understanding, but it is a combination of many.
提醒您, 讓我成為數據科學家的不僅僅在于您的編碼技能如何,或者您對統計理論甚至對業務理解的掌握有多少,而且還包括很多方面。
Anybody, of course, could agree or not with my opinion as I believe there are no specific skills that make you a great Data Scientist.
當然,任何人都可以同意或不同意我的觀點,因為我相信沒有特定的技能可以使您成為一名出色的數據科學家。
Data Scientist employment is hard. It would not easy to get in this field. With many applicants and people with a similar set of skills, you need to stand out. Business Understanding is the skill that would certainly separate you from all the fish in the ponds.
數據科學家的工作很難。 進入這個領域并不容易。 由于許多申請人和具有類似技能的人,您需要脫穎而出。 業務理解能力無疑會使您與池塘中的所有魚區分開。
In my experience as a Data Scientist, there is no skill that I felt underrated as much as the business understanding skill. I even thought that you don’t need to understand the business in my early career. How wrong I was.
根據我作為數據科學家的經驗,沒有什么比業務理解技能低估了。 我什至以為您在我的早期職業中不需要了解業務。 我錯了
I am not ashamed, though, to admit that I did not consider the business aspect essential at first because many data science education and books did not even teach us about this.
但是,我并不感到ham愧,因為我一開始并不認為業務方面是必不可少的,因為許多數據科學教育和書籍甚至都沒有教過我們這一點。
So, why is it crucial to learn the business and how it impacts your employment as a Data Scientist?
那么,為什么學習業務至關重要,它又如何影響您作為數據科學家的工作呢?
Just imagine this situation. You work in the data department of the food industry with candy as their main product, and the company plans to release a new sour candy product. The company then ask the sales department to sell the product. Now, the sales department know that the company had a data department and requesting the data team to give new leads where they can sell sour candy.
試想一下這種情況。 您在食品工業的數據部門工作時,以糖果為主要產品,并且該公司計劃發布一種新的酸味糖果產品。 然后,公司要求銷售部門出售產品。 現在,銷售部門知道該公司有一個數據部門,并要求數據團隊提供新的線索以銷售酸味糖果。
Before anybody complains that “This is not our job, we create a machine learning model!” or “I work as a data scientist, not in the sales department.” No, this is precisely what Data scientists do in the company; many of the projects are to work with another department for solving the company problem.
在有人抱怨“這不是我們的工作之前,我們創建了機器學習模型!” 或“我是數據科學家,而不是在銷售部門。” 不,這正是數據科學家在公司中所做的; 許多項目將與另一個部門合作解決公司問題。
Back to our scenario, how do you correctly approach this problem then? You might think, “Just create a machine learning model to generate the leads.” Yes, it is on the right track, but how exactly you create the model? On what basis? Is the business question even viable enough to solved using the machine learning model?
回到我們的情況,那么您如何正確解決此問題? 您可能會想,“只要創建一個機器學習模型來生成線索即可。” 是的,它是在正確的軌道上,但是您如何精確地創建模型? 在什么基礎上? 業務問題是否足夠可行,可以使用機器學習模型解決?
You can’t just suddenly using a machine learning model, right? This is why business understanding is so crucial as a Data Scientist. You need to understand how the candy business in more detail. Keep asking a question like,
您不能只是突然使用機器學習模型,對嗎? 這就是為什么業務理解對數據科學家如此重要的原因。 您需要更詳細地了解糖果業務。 繼續問一個問題,
“What kind of business question exactly we want to solve?”
“ 我們到底想解決什么樣的業務問題?”
“Would we even need a machine learning model?”
“我們甚至需要機器學習模型嗎?”
“What kind of attributes related to candy sales?”
“與糖果銷售相關的屬性是什么?”
“How is the candy selling strategy and practice within and outside of the company?”.
“公司內部和外部的糖果銷售策略和實踐如何?” 。
And many more business questions you could think of related to the business.
還有更多您可能想到的與業務相關的業務問題。
It is important to know what kind of business your company run and everything related to the business as your work as a data scientist would need you to make sense of the data.
了解您的公司經營哪種業務以及與該業務相關的所有事項非常重要,因為作為數據科學家,您需要了解數據 。
While it is easy to say that business understanding skill is essential, it is not easy to gain one.
雖然容易理解業務理解技能是必不可少的,但要獲得一項技能卻并不容易。
Education is one thing; for example, you might have a higher chance to stand out to applying for a data science position in the PR company if your educational background is communication compared to someone with a biology degree.
教育是一回事; 例如,與具有生物學學位的人相比,如果您的教育背景是交流,那么您可能有更大的機會脫穎而出在PR公司申請數據科學職位。
Although work experience quickly covers this. Working experience with another job title in a similar business industry would provide significant leverage, as you already understand the business process.
盡管工作經驗很快就涵蓋了這一點。 由于您已經了解業務流程,因此在類似的業務行業中擁有另一個職務的工作經驗將提供重要的影響。
For a fresher, it might be a hard industry to break in, but in hindsight, there are many benefits as a fresher as well. I remember Tyler Folkman’s post on his LinkedIn why the industry should consider recent graduates, and I agree. The recent graduate could:
對于新生,這可能是一個很難進入的行業,但是事后看來,新生也有很多好處。 我記得泰勒·福克曼(Tyler Folkman)在其LinkedIn上的帖子,為什么該行業應考慮應屆畢業生,我也同意。 應屆畢業生可以:
- Come with preparation 附帶準備
- Hungry to learn about the business 渴望了解業務
- Make an impact 產生影響
Freshers should a target for companies that have established their data journeys. The company could teach many things about business more easily as fresher have no experience at all in the business world. In my opinion, never count out the freshers.
新生應該成為建立數據旅程的公司的目標。 該公司可以更輕松地教授有關業務的許多事情,因為剛開始的新手根本沒有業務領域的經驗。 我認為,永遠不要指望新生。
I also would tell you about my experience, as well. When I first get the data project, I was not thinking about the business at all and just tried to build the machine learning model. And how disastrous it turns out to be.
我也將告訴您我的經歷。 當我第一次獲得數據項目時,我根本沒有考慮業務,只是嘗試構建機器學習模型。 事實證明這是多么的災難。
I present the model to the related parties with hype in my brain. My model result is good, I know everything about the data, and I know the theory of the model I used. Easy peasy, right? So, wrong. It turns out that the user did not care about the model I used. They are more interested in knowing if I already consider a business approach “A” or why I used the data that should not relate at all to the business. It ends with a discussion that I need more business training.
我在腦海中大肆宣傳該模型。 我的模型結果很好,我了解所有有關數據的知識,并且知道我使用的模型的理論。 輕輕松松吧? 大錯特錯。 事實證明,用戶并不關心我使用的模型。 他們更想知道我是否已經考慮過業務方法“ A”,或者為什么我使用了與業務根本不相關的數據。 最后,我需要更多的業務培訓。
It is embarrassing, but I am not ashamed at all to admit that it is my fault not to consider business understanding. I could be the best in model creation or statistic, but not knowing the business turns out to be a disaster. Since that day, I try to learn more about the business process itself, even before considering any of the technical things.
令人尷尬,但我完全不as愧承認不考慮業務了解是我的錯。 在模型創建或統計方面,我可能是最好的,但我不知道這業務真是一場災難。 從那天開始,即使在考慮任何技術問題之前,我也會嘗試進一步了解業務流程本身。
結論 (Conclusion)
In my opinion, fresher or not, try to learn the business as much as possible.
我認為,無論是否新鮮,都應盡可能多地學習業務。
Focus on one industry you feel interested in; finance, banking, credit, automotive, candy, oil, etc. Every single business has a different approach and strategy; you just need to focus on learning the industry you like.
專注于您感興趣的一個行業; 金融,銀行,信貸,汽車,糖果,石油等。每一項業務都有不同的方法和策略; 您只需要專注于學習自己喜歡的行業即可。
Data scientist employment is hard. It was not easy to get into this field. With many applicants and many people with a similar set of skills, you need to stand out. Business understanding is the skill that will undoubtedly separate you from all the fish in the pond.
數據科學家的工作很難。 進入這個領域并不容易。 在許多申請人和具有相似技能的許多人中, 您需要脫穎而出。 業務理解能力無疑會使您與池塘中的所有魚類區分開。
翻譯自: https://towardsdatascience.com/learn-the-business-to-become-a-great-data-scientist-635fa6029fb6
大數據業務學習筆記
本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。 如若轉載,請注明出處:http://www.pswp.cn/news/388090.shtml 繁體地址,請注明出處:http://hk.pswp.cn/news/388090.shtml 英文地址,請注明出處:http://en.pswp.cn/news/388090.shtml
如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!