Responsible Data Visualization in the Age of Coronavirus

First, a little bit about me: I’m a data science grad student. I have been writing for Medium for a little while now. I’m a Scorpio. I like long walks on beaches. And writing for Medium made me realize the importance of taking personal responsibility for my data viz.

My Philosophy

I’ve always been libertarian when it comes to data dissemination; the more publicly available data, the more people tinkering with it from their basements and school libraries, the better. Data science is an increasingly pivotal field and I want others to get as excited about it as I am. Great computer scientists are often made from 12-year-olds coding Tetris in Python (or whatever games they play now, maybe Candy Crush?), although I myself didn’t open my first computer program until I was 21.

I’d love to see an army of 12-year-olds graphing covid-19 in novel ways, getting invested in the spread and chiding their aunts and uncles to wash their hands better at their (socially distant) Thanksgivings. Furthermore, the more publicly available data, the more perspectives data scientists can tie in when trying to make predictions and recommendations on the job. More data + more people interested in data = a better world, clean and simple.

Or so I thought.

The Moment of Truth

This morning I wrote an article about obtaining covid-19 data, performing your own exploratory analysis, and then graphing it with animation in R. The end product looked something like this:

[Image: the animated covid-19 chart]

Cool, right? Not only a hot topic at the moment (covid-19) but now it’s animated!

If anything, I felt like this was the right thing to do if it helped even one person visualize the pandemic from their own computer. Furthermore, I was using knowledge gained from my master’s to make a chart I thought both accurate and eye-catching, and I was genuinely proud of that.

I was just about to hit publish when I decided to do a few parting reads of how other Medium articles broached the topic. That’s when I started reading a host of posts published by public health and data viz experts that made a whole lot of sense — and not in a way that made me feel helpful or positive anymore.

The World is Suffering a Covid-19-Sized Visualization Bubble

“To sum it up — #vizresponsibly; which may mean not publishing your visualizations in the public domain at all.”

— Amanda Makulec

Instead of reading my article visualizing the coronavirus for the umpteenth time, I think it would be better to skim one of these articles, written by actual professors and health experts:

In her sobering article, Amanda Makulec goes on to say:

The stakes are high around how we communicate about this epidemic to the wider public. Visualizations are powerful for communicating information, but can also mislead, misinform, and — in the worst cases — incite panic. We are in the middle of complete information overload, with hourly case updates and endless streams of information.

As a public health professional, might I ask:

Please consider if what you’ve created serves an actual information need in the public domain. Does it add value to the public and uncover new information?

If not, perhaps this is one viz that should be for your own use only.

Reading these posts, and taking a moment to think hard about my next steps and their consequences, made me realize that it was better to pull the article than publish it. Sure, maybe I lost a few hours of my life by not publishing an article I had already finished, but there was a substantial chance I could do more harm than good, and in a tenth of the time I could be sharing articles written by people who know the topic better than I ever will.

Parting Thoughts

When it comes to a life-threatening disease that has impacted millions of people, I’ve come to realize that it is better to amplify the voices of experts than to contribute to a melting pot of novices (including myself) throwing their hats into the ring. Whether or not I can personally make an accurate graph matters less than helping to share the select few data visualizations that are the most telling and honest in their depictions of this epidemic.

Photo by Edwin Andrade on Unsplash

Thank you to the visualization experts who unknowingly taught me the importance of taking responsibility for my data viz today. Having the ability to create information from data is like having a superpower, and you just saved me from becoming a villain.

Amanda West is a current master’s student in the School of Data Science at the University of Virginia. Prior to the program, she attended the University of Michigan, where she graduated with honors in economics and a math minor, studied abroad as a Gilman Scholar in Beijing, interned for the Ministry of Economic Development in Albania, trained and competed internationally in taekwondo, and held various jobs including as a Research Assistant and Data Visualization Consultant. You can contact her through her personal website here.

Translated from: https://towardsdatascience.com/a-students-first-encounter-with-responsible-data-viz-847f21c1c8e4
