對食材的敬畏之心極致產品
重點 (Top highlight)
為什么選擇投資組合? (Why portfolios?)
Data science is a tough field. It combines in equal parts mathematics and statistics, computer science, and black magic. As of mid-2020, it is also a booming field with numerous applicants swarming every job ad. Also, as I mentioned – it is mid-2020, the raging pandemic dragging everything down just that extra bit.
數據科學是一個艱巨的領域。 它在數學和統計學,計算機科學和黑魔法中均等地結合在一起。 截至2020年中,這也是一個蓬勃發展的領域,大量申請者蜂擁而至。 此外,正如我提到的那樣- 到2020年中期 ,肆虐的大流行把一切都拖了下來。
Building up a list of course completion certificates won’t get you very far either, unless you’ve got some bona fides (Masters/PhD) in academic credentials. MOOC certificates like those from Coursera and EdX are nice, but I’ve yet to hear too many examples of them counting for much. Kaggle ain’t what it used to be, either. Its free competitions have become graveyards of useless overfit models, and real competitions are dominated by teams which is hard to compete with, and is of limited use for individual portfolios anyway.
建立課程結業證書列表也不會使您走得太遠,除非您具有一定的學術證書(碩士/博士學位)。 像那些來回MOOC證書m路線 RA的dé DX都不錯,但我還沒有聽到他們的例子太多了非常重要的計數。 Kagg le也不是以前的樣子。 它的免費比賽已成為無用的過擬合模型的墳墓地,而真正的比賽則由難以競爭的團隊主導,并且無論如何對于單個組合來說使用都很有限。
So, how does one go about building a profile online? My personal thought is that just as a famous band once said, you can go your own way.
那么,如何在線建立個人資料呢? 我個人的想法是,就像一個著名樂隊曾經說過的那樣, 您可以自己走 。
Instead of trying to do exactly what others do, or did, work on projects that you are interested in, build up a portfolio of your work, and put it up there for the world to see what you did, and what you can do.
與其嘗試完全做別人所做或所做的事情,不如去從事那些 喲 你有興趣,建立一個投資組合 您 工作,然后放在那里,讓全世界看看您做了什么,可以做什么。

Having said all that, I appreciate that it’s easier said than done. Not many data scientists are also designers/front-end developers, and not always keen to pick up that extra skill nor do they necessarily have the time to.
說了這么多,我很高興說起來容易做起來難。 很少有數據科學家同時還是設計師/前端開發人員,并不總是熱衷于學習這種額外技能,也不是一定有時間。
Luckily, we don’t always have to reinvent the wheel. Unlike the old days where portfolios were literally… portfolios full of glossy pages, or resumes that would only come across HR’s desk, many amazing portfolios are available online. These are invaluable resources, so why not make full use of them?
幸運的是,我們不必總是重新發明輪子。 與過去的投資組合實際上是……充滿光澤頁面的投資組合或僅會出現在HR辦公桌上的簡歷不同,在線上可以找到許多驚人的投資組合。 這些都是寶貴的資源,那么為什么不充分利用它們呢?
學習/啟發 (Learning / Inspiration)
Outside of using them as references for our own portfolios, these sites are also extremely valuable resources for learning, and for ideas.
除了將它們用作我們自己的作品集的參考之外,這些站點還是非常有用的學習和思想資源。
Many of these authors’ projects are practical, interesting and original. They are, for my money, also great complementary learning tools. For example, seeing practical applications of an ML tool provides context when learning the theoretical side, as I consider where I might apply this tool in my work or for my clients.
這些作者的許多項目都是實用,有趣和新穎的。 用我的錢,它們也是很棒的補充學習工具。 例如,在學習理論方面時,看到ML工具的實際應用將提供上下文,因為我考慮了我可能在工作中或為客戶應用此工具的地方。
I’ve said enough – let’s take a dive into some of these amazing works, to look at exactly how they are useful.
我已經說夠了–讓我們深入研究其中一些驚人的作品,以確切地了解它們的用處。
This is obviously just a few random selections of many, many great portfolios out there. Let me know in the comments about any of your favourites, and if you agree / disagree with my thoughts!
顯然,這只是眾多眾多投資組合中的一些隨機選擇。 在評論中讓我知道您的任何收藏夾,以及是否同意/不同意我的想法!
大衛·文圖里 (David Venturi)

I first came across David Venturi a few years ago while researching data science courses. He had written a blog post called “I Dropped Out of School to Create My Own Data Science Master’s – Here’s My Curriculum”.
幾年前,我在研究數據科學課程時遇到了David Venturi。 他寫了一篇博客文章“ 我輟學創建自己的數據科學碩士課程-這是我的課程 ”。
That post is from April 2016 and it has certainly stood the test of time, having racked up over 8000 claps on Medium as of August 2020!
該職位是從2016年4月開始的,它的確經受了時間的考驗,截至2020年8月,在Medium上已經拍了8000拍!
Since then, he’s gone on to do much more. He’s created courses for DataCamp, including one for Scala, a part of a Tableau course and one using the MLB’s (baseball) Statcast data.
從那時起,他繼續做更多的事情。 他為DataCamp創建了課程,包括一門針對Scala的課程,Tableau課程的一部分以及一門使用MLB(棒球)Statcast數據的課程。
He has even created a course titled UP AND DOWN WITH THE KARDASHIANS: Python project that uses pandas. Who’d have guessed the word Python to appear next to the word Kardashians, without it being a reference to the latest scandal or a terrible choice of a pet?
他甚至創建了一個名為UP AND DOWN THE KARDASHIANS的課程:使用熊貓的Python項目。 誰曾猜到Python一詞會出現在Kardashians一詞的旁邊,卻沒有提到最新的丑聞或對寵物的糟糕選擇?
Yup. The man is talented.
對。 這個人很有才華。
His portfolio site appropriately reflects this wide range of talents, to showcase the breadth of content types and variety of subject matters in his work.
他的作品集網站恰當地反映了這一廣泛的才能,以展示他工作中內容類型和主題的廣泛性。

The headings on Venturi’s site organise its content by the type of end clients. They range all the way from courses, projects and content created for DataCamp or Udacity, to a set of personal projects including articles for FreeCodeCamp, sports analytics and a sprinkling of web apps.
文丘里網站上的標題根據最終客戶的類型來組織其內容。 它們的范圍很廣,從為DataCamp或Udacity創建的課程,項目和內容,到一組個人項目,包括FreeCodeCamp的文章,體育分析和大量的Web應用程序。
What struck me after looking at the site for a while, though, was the clarity with which it demonstrated the exact types of outputs he is capable of producing. In other words:
在看了一段時間之后,令我震驚的是,它清楚地表明了他能夠生產的確切產出類型。 換一種說法:
Each section on Venturi’s portfolio fulfils a marketing purpose.
文丘里管產品組合的每個部分均滿足營銷目的。
The MOOCs are easy — after all, he is a seasoned course producer.
MOOC很容易-畢竟,他是一位經驗豐富的課程制作人。
But the next section includes two very different videos to highlight his production skills and comfort in front of a camera. One is an instructional video and the other is a highly produced… dog video (a clever marketing video).
但是下一部分將包括兩個截然不同的視頻,以突出他的制作技巧和在鏡頭前的舒適感。 一個是教學視頻,另一個是制作精良的…狗視頻(聰明的營銷視頻)。
And his personal projects exist to highlight the output medium indicated with bright links. His outputs are segmented with link to one of “Code”, “Demo”, or “Website”. This allows the viewer to instantly see the output of interest in the context of a project.
他的個人項目的存在是為了突出顯示帶有明亮鏈接的輸出介質。 通過鏈接到“代碼”,“演示”或“網站”之一來細分其輸出。 這使觀看者可以在項目的上下文中立即看到感興趣的輸出。
Even his written works are clearly categorised as one of a “Report”, “Article” or “Post”, explicitly acknowledging intended audience types. Someone reading this is clearly led to a relevant sample product than a mess of “writing samples” sorted by subject matter. (It does make me wonder if he did any scraping or analysis of job postings to arrive at this taxonomy.)
甚至他的書面作品也被明確歸類為“報告”,“文章”或“帖子”之一,明確承認了預期的受眾類型。 顯然,閱讀本書的人可以找到一個相關的樣本產品,而不是按照主題分類的“書寫樣本”。 (這確實使我感到奇怪,他是否對工作職位進行了任何抓取或分析以得出此分類法。)
Check out his portfolio here.
在這里查看他的投資組合。
漢娜·嚴漢 (Hannah Yan Han)

Being a data visualisation geek, this site just immediately filled me with a combination of joy and envy.
作為數據可視化極客,該站點立即使我充滿喜悅和羨慕。
The majority of the projects represented on her front page are (gorgeous, I might add) visualisations. Each project is represented by an image, where a mouseover reveals further details about it — as shown in the animation below.
她頭版上代表的大多數項目都是(很棒,我可能會補充)可視化效果。 每個項目都由一個圖像表示,將鼠標懸停在該圖像上可以顯示有關該圖像的更多詳細信息-如以下動畫所示。
So, within seconds of visiting the site, the reader is given the opportunity to see the range of visualisations that she’s produced, and her technical prowess in using a diverse range of tools from R, D3.js or P5.js to Tableau.
因此,在訪問該網站的幾秒鐘內,讀者就有機會看到自己制作的可視化效果范圍以及使用從R,D3.js或P5.js到Tableau的各種工具獲得的技術實力。
Personally I also really like the clean layout and simple and consistent interface. It’s simply a pleasure to navigate through.
我個人也很喜歡干凈的布局和簡單一致的界面。 瀏覽是一種樂趣。

Clicking on each project takes the reader to an article about the visualisation.
單擊每個項目會將讀者帶到關于可視化的文章。
She also has a dedicated data science portfolio, which she has placed on a separate page.
她還擁有專用的數據科學產品組合,并將其放在單獨的頁面上。

Clearly, this layout is designed to convey more information about each data science than those in the visualisation page. By segregating projects by type like she has done, she’s able to achieve visual consistency within each page for the reader. This probably also indicates that generally, the reader (prospective client) is interested in only one of visualisation or data science, rather than both.
顯然,此布局旨在傳達有關每個數據科學的信息,而不是可視化頁面中的信息。 通過像她所做的那樣按類型分隔項目,她能夠為讀者提供每一頁的視覺一致性。 這可能還表明,一般而言,讀者(潛在客戶)只對可視化或數據科學中的一種感興趣,而對這兩者都不感興趣。
Check out her portfolio here.
在這里查看她的投資組合。
多恩·馬丁 (Donne Martin)
Before moving on to the next example portfolio, sit down, grab a drink, and brace yourself.
在繼續下一個示例投資組合之前,請坐下,喝一杯并做好準備。

Donne Martin claims to be a software engineer at Facebook, but looking at his website and GitHub page, I am quite convinced that he a time traveller or some sort of a wizard who’s able to stretch time. I’ll get back to this point later, but for now, take a look at the animation below, scrolling through his main website.
Donne Martin自稱是Facebook的軟件工程師,但是在他的網站和GitHub頁面上,我非常相信他是一位時光旅行者或某種能夠延長時間的向導。 稍后,我將回到這一點,但是現在,看一下下面的動畫,滾動瀏覽他的主要網站。
His approach to the portfolio site is quite different to those we’ve looked at before this. He takes the approach of letting the crowd noise (i.e. GitHub stars) do the talking, and boy — are they loud.
他訪問投資組合網站的方法與我們之前所看過的完全不同。 他采取了讓人群喧((即GitHub上的明星)說話的方法,男孩-他們大聲嗎。
He casually flaunts the multiple personal projects with 20k+ stars!
他隨隨便便地用20k +星標炫耀了多個個人項目!

His GitHub page itself is very impressive. Since we are discussing data science portfolios, let’s take a look at his repo of data science notebooks.
他的GitHub頁面本身令人印象深刻。 由于我們正在討論數據科學產品組合,因此讓我們看一下他的數據科學筆記本回購 。
Remember how I said that I think Martin might be a wizard? Whenever we go back to burning witches and wizards, this data science notebook repo is going to be my primary evidence submission against Martin.
還記得我說過我認為馬丁可能是個巫師嗎? 每當我們回到燒毀的女巫和巫師的時候,這個數據科學筆記本回購將成為我針對Martin的主要證據。
I just don’t understand when he could possibly have had time to create all of these unless he has the ability to slow down time. Here is just a sampling — a very small sampling, actually, of the notebooks that he has made available in this repo.
我只是不知道他什么時候可能有時間創建所有這些,除非他有能力減慢時間。 這只是一個樣本,實際上是他在此回購中提供的筆記本的非常小的樣本。

It’s a dense list, but grouped by the primary library used, it does a great job as a showcase. Even before opening any of his notebooks or even reading the summaries of these notebooks, this list easily demonstrates his work ethic, breadth of skills and ability to communicate and teach.
這是一個密集的列表,但按使用的主庫分組,它作為展示臺表現出色。 即使在打開他的任何筆記本或什至閱讀這些筆記本的摘要之前,此清單也很容易證明他的職業道德,技能廣度以及溝通和教導的能力。
You could easily spend days, or weeks, browsing through Martin’s portfolio — and personally I don’t think it would be such a bad idea to do so. Check it out here.
您可以輕松地花幾天或幾周的時間瀏覽Martin的投資組合,而且我個人認為這樣做并不是一個壞主意。 在這里查看 。
克勞迪婭·十箍 (Claudia Ten Hoope)

Hoope’s website is clean, neat and easy to read. One key difference I wanted to highlight with this portfolio site is that it explicitly doubles as a hiring/enquiry page, with her daily rates etc.
Hoope的網站干凈,整潔且易于閱讀。 我想在此投資組合網站上強調的一個主要區別是,它可以作為招聘/查詢頁面(包括她的日薪等)明確地翻倍。
She is a freelancer, so it makes sense for her to spell out the exact services she offers to her prospective clients. The language she uses here also indicate that they are for those who may not necessarily be that familiar with data science.
她是一名自由職業者,因此她有必要向她的潛在客戶說明確切的服務。 她在這里使用的語言還表明它們適用于那些不一定熟悉數據科學的人。
It’s a good reminder for us to think about who the intended audience is for every piece of communication that we put out there, and to tailor the content accordingly.
這是對我們的一個很好的提醒,讓我們考慮一下在此進行的每一次交流的目標受眾是誰,并相應地調整內容。

Check it out — her page is here.
檢查一下-她的頁面在這里 。
Julia·尼古爾斯基(Julia Nikulski) (Julia Nikulski)

This is another excellent portfolio website, this time by Julia Nikulski. As with the others, she’s got some kickass projects listed here, each one with a hero image accompanied by a short description and key skills.
這是另一個出色的投資組合網站,這次是Julia·尼庫爾斯基(Julia Nikulski)。 和其他人一樣,她在此處列出了一些kickass項目,每個項目都有一個英雄形象,并附有簡短的描述和關鍵技能。
I won’t write too much more about it — as the main layout seems to be similar to some of the others, and I don’t read German!
我不會對此作過多介紹,因為主要布局似乎與其他一些布局相似,而且我不會讀德語!
One super interesting and (very meta) highlight is a post entitled “How to Build a Data Science Portfolio Website”, which, if you are reading this, you might find relevant!
一篇非常有趣且(非常中繼)的重點文章是標題為“ 如何構建數據科學投資組合網站 ”的帖子,如果您正在閱讀此書,那么可能會發現與之相關!
Thanks for reading — that was just a small selection of sites I’ve found online. If you have your personal favourites, or (constructive) critiques of the article, please let me know in the comments or on twitter!
感謝您的閱讀-這只是我在網上找到的一小部分網站。 如果您對本文有個人收藏或評論,請在評論中或在Twitter上告訴我!
Also, if you liked this, say 👋 / follow on twitter, or follow here for updates. ICYMI: I also wrote this article comparing Plotly Dash vs Streamlit:
另外,如果您喜歡這樣做,請在wit上說👋/或在此處更新。 ICYMI:我還寫了這篇文章,比較了Plotly Dash和Streamlit:
And this about visualising hidden relationships in data, using data from the NBA as an example:
并以NBA的數據為例,可視化數據中的隱藏關系:
翻譯自: https://towardsdatascience.com/these-data-science-portfolios-will-awe-and-inspire-you-mid-2020-edition-728e1021f60
對食材的敬畏之心極致產品
本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。 如若轉載,請注明出處:http://www.pswp.cn/news/387960.shtml 繁體地址,請注明出處:http://hk.pswp.cn/news/387960.shtml 英文地址,請注明出處:http://en.pswp.cn/news/387960.shtml
如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!