數據開放 數據集_除開放式清洗之外:敘述是開放數據門戶的未來嗎?

數據開放 數據集

There is growing consensus in the open data community that the mere release of open data — that is data that can be freely accessed, remixed, and redistributed — is not enough to realize the full potential of openness. Successful open data initiatives don’t simply tick the ‘open’ box but produce data that actually gets used. Open data portals, in particular, are prone to the risk of becoming “data dumps”, where the number of published datasets counts more than their quality or utility.

開放數據社區中越來越多的共識是,僅釋放開放數據(即可以自由訪問,重新混合和重新分配的數據)不足以實現開放的全部潛力。 成功的開放數據計劃不僅會簡單地在“開放”框中打鉤,而且還會產生實際使用的數據 。 開放數據門戶尤其容易成為“數據轉儲”的風險,其中已發布數據集的數量比其質量或實用性更為重要。

This is why, when Sheldon.Studio was hired by the Matera | European Capital of Culture 2019 foundation to design their open-data portal, we felt we were in front of a unique challenge. How do we create an open data portal that empowers the audience, and how do we avoid an open data dump 🤪? One month into the project, here is what we learned in the process 😎.

這就是為什么當Matera雇用Sheldon.Studio的原因| 歐洲文化之都2019基金會設計了他們的開放數據門戶 ,我們認為我們正在面對獨特的挑戰。 我們如何創建一個開放的數據門戶網站來增強受眾的能力,以及如何避免開放的數據轉儲🤪? 進入項目一個月后,這就是我們在過程中中學到的東西😎。

了解受眾是以受眾為中心的數據門戶的第一步 (Knowing the audience is the first step to an audience-centered data portal.)

Last year was a big one for the city of Matera, a city in Southern Italy of 60K souls whose history dates back to the Palaeolithic, as it became European Capital of Culture 2019 and witnessed the arrival of more than half a million visitors. Not only tourists but also artists, cultural workers, and social operators swarmed through the city and actively participated in more than 2400 events, many of which spanned multiple days.

去年是馬泰拉(Matera)市的重要一年。馬泰拉(Matera)是意大利南部一個擁有6萬名靈魂的城市,其歷史可以追溯到舊石器時代,當時它已成為2019年歐洲文化之都 ,目睹了超過50萬游客的到來。 不僅游客,藝術家,文化工作者和社會工作者也蜂擁而至,并積極參加了2400多個活動,其中許多活動跨越了數天。

Image for post
A zoom-out from the interactive Cultural Vibrancy timeline, which shows the whole events organized during the year.
交互式“ 文化活力”時間軸的放大圖顯示了一年中組織的所有活動。

Can you imagine the amount of data visitors and citizens generated during the year? We can tell you about what we received: dozens and dozens of spreadsheets, some handcrafted, some software-generated; textual reports; photo galleries and video interviews. We could simply upload it online in some repo and be done with it. Yet, since the beginning of the collaboration, we embraced the idea of conceiving something beyond the usual. We wanted to give the data back to the people who helped produce it. This meant focusing on what the audience needed to understand.

您能想象一年中游客和市民產生的數據量嗎? 我們可以告訴您我們收到了什么:數十個電子表格,一些是手工制作的,一些是軟件生成的; 文字報告; 照片畫廊和視頻采訪。 我們可以簡單地將其在線上傳到某個存儲庫中并完成它。 但是,自從合作開始以來,我們就接受了構思超出平常事物的想法。 我們希望將數據返回給幫助產生數據的人員。 這意味著關注觀眾需要理解的內容。

Saul Wurman, who coined the concept of information architecture in the mid of the 70s, often said: “You only understand something relative to something you already understand.” This simple, yet timeless, statement represents an essential lens through which we design information experiences at Sheldon.studio. In practice, it means that we should design upon the past experiences of our audience in order to explain something novel. So, knowing our audience was the first building block of our design process.

索爾·沃曼(Saul Wurman)在70年代中期提出了信息架構的概念,他經常說:“您只了解與已經了解的東西相關的東西。” 這個簡單而又永恒的陳述代表了我們在Sheldon.studio設計信息體驗的基本視角。 在實踐中,這意味著我們應該根據觀眾過去的經歷進行設計,以便解釋一些新穎的事物。 因此,了解我們的觀眾是我們設計過程的第一步。

Another key ingredient of our human-centered design approach is the preference for simple visualizations over flamboyant charts, especially when the fancier design would entail a compromise on clarity. Other than the complexity of the data visualizations, we instead leveraged colors and animations to keep our chart designs fresh and engaging, facilitating the audience in the comprehension of hidden data patterns.

我們以人為中心的設計方法的另一個關鍵要素是,相對于華麗的圖表 ,更喜歡簡單的可視化效果 ,尤其是在更高級的設計會影響清晰度的情況下。 除了數據可視化的復雜性之外,我們還利用顏色和動畫來使圖表設計保持新鮮和引人入勝,從而使觀眾能夠理解隱藏的數據模式。

Image for post

From a design perspective, we rooted our visualizations around the central theme of showing the liveliness and the humanity that characterized the cultural programme of Matera. For this reason, we privileged rounded shapes and a profusion of dots swarming everywhere, a metaphor of humanity as seen from a bird’s eye view and we decided to present some visualization using the metaphor of a pack of many separate units/bubbles forming bigger clusters. We feel that this makes the numbers interesting and more intuitive to grasp also for audiences with lower data viz literacy.

從設計的角度來看,我們將視覺化植根于以馬泰拉(Matera)文化節目為特色的生動活潑人性化這一中心主題。 因此,我們優先考慮圓形和到處散布著大量點的情況,從鳥瞰的角度來看,這是人類的隱喻,因此,我們決定使用一堆由許多獨立的單元/氣泡組成的隱喻來呈現一些可視化,從而形成更大的簇。 我們認為,這對于具有較低數據即識字率的受眾來說,也使數字變得有趣且更加直觀。

Image for post
A compilation of several counters featured in thge microstories.
微型故事中精選的幾個計數器的匯編。

In line with our endeavour to keep the data visualization accessible and easy to parse, we devised an innovative way to efficiently integrate legends, charts and text. Readers usually struggle as their eyes ping-pong back and forth between the chart and its legend to understand what’s what, so we tried to intertwine the legend in the descriptive text above it, highlighting keywords with the corresponding colours in the chart. The idea is to spark curiosity in the readers as they note that some words in the text are highlighted, or, the other way around, to prompt somebody to read the text while seeking for the legend.

為了使數據可視化易于訪問和易于解析,我們設計了一種創新的方法來有效地集成圖例,圖表和文本。 讀者通常會在圖表和圖例之間來回乒乓球的過程中掙扎,以了解內容是什么,因此我們嘗試在圖例上方的描述性文字中纏上圖例,在圖表中突出顯示具有相應顏色的關鍵字。 這樣做的目的是激發讀者的好奇心,因為他們會注意到文本中的某些單詞被突出顯示,或者反之亦然,以促使人們在尋找圖例時閱讀文本。

Image for post
An example of a legend integrated into the descriptive text above
上面描述文字中的圖例示例

下一個? 規劃不同的數據素養水平。 (Next? Plan for different data literacy levels.)

The co-design sessions with our client, the Matera Foundation, surfaced the need to plan for multiple entry points and different levels of data literacy, to suit the needs of the different types of people that would visit the portal.

與我們的客戶Matera基金會的共同設計會議表明,需要計劃多個入口點和不同級別的數據素養,以適應訪問門戶網站的不同類型人員的需求。

A first step in this direction was to include qualitative data alongside the numbers and statistics. We strongly believe that quantitative data is just one possible ingredient to the story, especially when we are discussing social issues, and moreover if it’s important to include a broader audience. For this reason, we combined traditional data visualizations with original texts, and we intertwined the data stories with photos and statements by the participants.

朝這個方向邁出的第一步是將定性數據與數字和統計信息一起納入。 我們堅信, 量化數據只是故事的一種可能成分,尤其是在我們討論社會問題時,而且對于擴大受眾范圍是否重要也是如此。 因此,我們將傳統的數據可視化與原始文本結合在一起,并將數據故事與參與者的照片和陳述交織在一起。

Image for post
How we combined quantitative and qualitative data to tell different facets of the same story.
我們如何結合定量和定性數據來講述同一故事的不同方面。

In its final version, the project unfolds across 8 thematic sections and 6 in-depth micro-stories. We opted for these two different content formats, sections, and stories, to offer two different ways of looking at the data. The thematic sections stand as metaphorical chapters that disclose the main narrative of what Matera 2019 has represented, providing a birds-eye view on the core values of its organization. The micro-stories, on the other hand, drill down on specific events or issues of particular importance. So, for instance, while the Cultural vibrancy introduces and visualizes the amount and diversity of the cultural program, the connected Open Design School micro-story unveils how the project brought talented youngsters from all over Europe during the year (see pic below).

在其最終版本中,該項目涵蓋8個主題部分和6個深入的微型故事。 我們選擇了這兩種不同的內容格式,部分和故事,以提供兩種不同的數據查看方式。 主題部分作為隱喻性章節站立,揭示了Matera 2019所代表的主要敘述,提供了其組織核心價值的鳥瞰圖。 另一方面,微型故事會深入研究特定事件或特別重要的問題。 因此,例如,在文化活力介紹和形象化文化節目的數量和多樣性的同時, 開放式設計學院微型故事樓揭示了該項目如何在這一年中吸引了來自歐洲各地的才華橫溢的年輕人(見下圖)。

Image for post
Microstories deepen the issue presented in the thematic sessions, also through qualitative data
微觀故事還通過定性數據加深了主題會議上提出的問題

The way we decided to publish the open data in the portal is itself an attempt at suiting the different data literacy levels and needs the website’s visitors may have. All the data is published in three places, each designed with a specific type of audience in mind.

我們決定在門戶中發布開放數據的方式本身就是為了適應不同數據素養水平和網站訪問者可能有的需求。 所有數據都在三個位置發布,每個位置的設計都考慮了特定的受眾類型。

  • 🤓🤓🤓 A dedicated GitHub repo that provides the CSV and JSON files (as a data geek would expect them).

    🤓🤓🤓一個專用的GitHub存儲庫 ,提供CSV和JSON文件(數據極客所期望的)。

  • 🤓🤓 An “Open Data Centre” on the website, which is essentially a traditional open data portal, listing all the raw data files along with their metadata.

    🤓🤓“開放數據中心”的網站上,這基本上是一個傳統的開放式數據門戶網站,列出了所有的原始數據文件及其元數據。

  • 🤓 An “Open Data Corner” at the end of each thematic section or micro-story, which includes only the data referred to the specific section or story. In each open data corner, we decided to publish not only the raw data but also the aggregated and processed data files that we used to produce each visualization that is on the page.

    each每個主題部分或微型故事結尾處的“ 開放數據角 ”,其中僅包含涉及特定部分或故事的數據。 在每個開放數據角,我們決定不僅發布原始數據,還發布我們用于生成頁面上每個可視化效果的匯總和處理數據文件。

We believe that the latter, the “Open Data Corner” is a core innovation in the way we designed the portal, as it empowers people who might have a lower data literacy than a typical open data user, like concerned citizens, activists, as well as journalists, to access and play with the data in a beginner-friendly manner.

我們相信后者,即“開放數據角”是我們設計門戶網站方式的一項核心創新,因為它使可能具有比一般開放數據用戶低的數據素養的人們(如關注的公民,活動家)作為記者,以對初學者友好的方式訪問和處理數據。

Image for post
At the end of each thematic sessions, the open data corner shares the data visualized in the page
在每個主題會議結束時,打開的數據角共享頁面中可視化的數據

最重要的是,將數據視為達到目的的手段,而不是目的本身😏。 (Most of all, think of data as a means to an end, not the end in itself 😏.)

The more we figured out how to translate the principles of human-centered design into the practice of creating the Matera 2019 data portal, the more we realized we were shifting the role data has in a traditional open data portal.

我們越想出如何將以人為本的設計原理轉化為創建Matera 2019數據門戶的實踐,就越能意識到我們正在轉移數據在傳統開放數據門戶中的角色。

Open data portals are typically all about the data: how many datasets, how many formats, which open licenses. Open data portals are typically all about the data: how many datasets, how many formats, which open licenses. In Matera 2019 this hierarchy is flipped: the stories come first which narrate the data and illustrate what can be done with the data, then we provide the downloadable open datasets.

開放數據門戶通常與數據有關:多少數據集,多少格式以及哪種開放許可證。 開放數據門戶通常與數據有關:多少數據集,多少格式以及哪種開放許可證。 在Matera 2019中,這種層次結構被顛覆了:故事首先講述數據,并說明如何處理數據,然后提供可下載的開放數據集。

In addition, a standard open data portal will include mainly quantitative, machine-readable datasets. In the Matera 2019 open data portal, CSVs and machine-readable datasets are just one of the many components of a multi-modal narration, together with texts, videos, pictures, etc. The datasets are not stand-alone elements, but parts of an informative ecosystem covering the many facets and the complexity of what Matera European Capital of Culture 2019 has represented.

此外,標準的開放數據門戶將主要包含定量的機器可讀數據集。 在Matera 2019開放數據門戶中,CSV和機器可讀數據集只是多模式旁白的眾多組成部分之一,還包括文本,視頻,圖片等。這些數據集不是獨立的元素,而是其中的一部分一個信息豐富的生態系統,涵蓋了Matera歐洲文化之都2019所代表的各個方面和復雜性。

Image for post
Image for post
Image for post
Image for post
Image for post
Image for post
Image for post
Image for post
Qualitative data from Matera 2019
Matera 2019的定性數據

Finally, our hope is to give rise to a recursive process that sees data as a means to an end, not an end itself. Publishing the data online was not the ultimate goal of the Matera 2019 Open data portal. It is humans and their actions that generated the datasets behind the stories of the portal. And now that the data is published, it should serve this community. We want to see the data used as a tool to foster new human interactions and to inform new processes aimed at improving the conditions of Matera’s society.

最后,我們的希望是引發一個遞歸過程,該過程將數據視為達到目的的手段,而不是達到目的的手段。 在線發布數據并不是Matera 2019 Open數據門戶網站的最終目標。 是人類及其行為在門戶故事背后生成了數據集。 既然數據已經發布,它就應該為這個社區服務。 我們希望將數據用作促進新的人類互動并為旨在改善馬泰拉社會狀況的新程序提供信息的工具。

Image for post

With this goal in mind, an integral part of designing the Open Data Portal has been that of planning for its legacy. In the autumn 🍂, we are supporting the organization of a DataSchool, together with the Matera Foundation and with the participation of the open-data guru Maurizio Napolitano. The School will bring in the city a colourful variety of data-people, from data-activists, to design students, journalists, or social scientists, to design new forms of communication and services based on the data.

考慮到這一目標,設計開放數據門戶的一個組成部分就是規劃其遺留問題。 在秋季🍂,我們將與Matera基金會以及開放數據專家Maurizio Napolitano共同支持DataSchool的組織。 該學院將為城市帶來各種各樣的數據人,從數據活動家到設計學生,新聞工作者或社會科學家,到根據數據設計新形式的交流和服務。

Through the design of the platform, we aimed to turn open data into commons, public goods generated and maintained by the community for its wealth and awareness. Our hope, indeed, is to contribute to a more inclusive data practice, which embraces a broader audience, provides diverse and faceted entry points for personal explorations, and constitutes a stepping stone towards new forms of information, knowledge, awareness, and social care.

通過平臺的設計,我們旨在將開放數據轉變為公共資源,社區為社區的財富和意識而產生和維護的公共物品。 實際上,我們的希望是為更具包容性的數據實踐做出貢獻,該實踐涵蓋了更廣泛的受眾,為個人探索提供了多種多樣且切面的切入點,并且是邁向新形式的信息,知識,意識和社會關懷的墊腳石。

Matteo Moretti is a designer and cofounder of Sheldon.studio & Alice Corona, data-journalist and founder of Batjo.eu

利瑪竇莫雷蒂 是一名設計師兼創始人 Sheldon.studio 愛麗絲電暈 ,數據記者和創始人 Batjo.eu

Image for post

翻譯自: https://medium.com/nightingale/beyond-open-washing-are-stories-and-narratives-the-future-of-open-data-portals-93228d8882f3

數據開放 數據集

本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。
如若轉載,請注明出處:http://www.pswp.cn/news/388513.shtml
繁體地址,請注明出處:http://hk.pswp.cn/news/388513.shtml
英文地址,請注明出處:http://en.pswp.cn/news/388513.shtml

如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!

相關文章

單選按鈕android服務器,android – 如何在radiogroup中將單選按鈕設置...

我已經動態創建了RadioGroup和RadioButton,如下所示:RadioGroup radioGroup new RadioGroup(context);RadioButton radioBtn1 new RadioButton(context);RadioButton radioBtn2 new RadioButton(context);RadioButton radioBtn3 new RadioButton(context);radio…

導入DMP文件過程

導入DMP文件過程 --釋放重名表空間 drop tablespace hxgr including contents and datafiles cascade constraints; --建立表空間 create tablespace hxgr logging datafile D:\oracle\oradata\hxgr\hxgr.dbf size 100m autoextend on next 32m maxsize 2048m extent manage…

string 轉化 xml,并找到指定節點及節點值

//這是一個符合xml格式的字符串string xml "<xmn> <people><name>zs</name><age>22</age></people> <people><name>ls</name><age>23</age></people> </xmn>";//將string 轉化…

ios android 交互 區別,很多人不承認:iOS的返回交互,對比Android就是反人類。

寧之的奧義2020-09-21 10:54:39點滅只看此人舉報給你解答&#xff1a;美國人都是左撇子&#xff0c;所以他們很方便&#x1f436;給你解答&#xff1a;美國人都是左撇子&#xff0c;所以他們很方便&#x1f436;亮了(504)回復查看評論(19)回憶的褶皺樓主2020-09-21 11:01:01點滅…

Servlet+JSP

需要說明的是&#xff0c;其實工具的版本不是主要因素&#xff0c;所以我下面忽略版本。 你能搜到這篇文章&#xff0c;說明你已經知道怎么部署Tomcat&#xff0c;并運行自己的網頁了。 但是&#xff0c;我們知道&#xff0c;每次修改源文件&#xff0c;我們總得手工把文件co…

正態分布高斯分布泊松分布_正態分布:將數據轉換為高斯分布

正態分布高斯分布泊松分布For detailed implementation in python check my GitHub repository.有關在python中的詳細實現&#xff0c;請查看我的GitHub存儲庫。 介紹 (Introduction) Some machine learning model like linear and logistic regression assumes a Gaussian di…

BABOK - 開篇:業務分析知識體系介紹

本文更新版已挪至 http://www.zhoujingen.cn/itbang/328.html ---------------------------------------------- 當我們作項目時&#xff0c;下面這張圖很多人都明白&#xff0c;從計劃、構建、測試、部署實施后發現提供的方案并不能真正解決用戶的問題&#xff0c;那么我們是…

對象-檢測屬性

<h3>判斷某個屬性是否存在于某個對象中&#xff1b;</h3><ol><li>in&#xff1a;檢查一個屬性是否屬于某個對象&#xff0c;包括繼承來的屬性&#xff1b;<pre>var person {name:yourname, age:10};console.log(name in person); //trueconsole…

黑蘋果 wifi android,動動手指零負擔讓你的黑蘋果連上Wifi

動動手指零負擔讓你的黑蘋果連上Wifi2019-12-02 10:08:485點贊36收藏4評論購買理由黑蘋果Wifi是個頭疼的問題&#xff0c;高“貴”的原機Wifi藍牙很貴&#xff0c;比如我最近偶然得到的BCM94360CS2&#xff0c;估計要180。稍微便宜的一點的&#xff0c;搞各種ID&#xff0c;各種…

洛谷——P2018 消息傳遞

P2018 消息傳遞 題目描述 巴蜀國的社會等級森嚴&#xff0c;除了國王之外&#xff0c;每個人均有且只有一個直接上級&#xff0c;當然國王沒有上級。如果A是B的上級&#xff0c;B是C的上級&#xff0c;那么A就是C的上級。絕對不會出現這樣的關系&#xff1a;A是B的上級&#xf…

axios異步請求數據的簡單使用

使用Mock模擬好后端數據之后&#xff08;Mock模擬數據的使用參考&#xff1a;https://segmentfault.com/a/11...&#xff09;&#xff0c;就需要嘗試請求加載數據了。數據請求選擇了axios&#xff0c;現在都推薦使用axios。 axios&#xff08;https://github.com/axios/axios&a…

float在html語言中的用法,float屬性值包括

html中不屬于float常用屬性值的是float常用的值就三個:left\right\none。沒有其他的值了。 其中none這個值是默認的&#xff0c;所以一般不用寫。css中float屬性有幾種用法&#xff1f;值 描述left 元素向左浮動。 right 元素向右浮動。 none 默認值。元素不浮動&#xff0c;并…

它們是什么以及為什么我們不需要它們

Once in a while, when reading papers in the Reinforcement Learning domain, you may stumble across mysterious-sounding phrases such as ‘we deal with a filtered probability space’, ‘the expected value is conditional on a filtration’ or ‘the decision-mak…

LoadRunner8.1破解漢化過程

LR8.1版本已經將7.8和8.0中通用的license封了&#xff0c;因此目前無法使用LR8.1版本&#xff0c;包括該版本的中文補丁。 破解思路&#xff1a;由于軟件的加密程序和運行的主程序是分開的&#xff0c;因此可以使用7.8的加密程序覆蓋8.1中的加密程序&#xff0c;這樣老的7.8和…

TCP/IP網絡編程之基于TCP的服務端/客戶端(二)

回聲客戶端問題 上一章TCP/IP網絡編程之基于TCP的服務端/客戶端&#xff08;一&#xff09;中&#xff0c;我們解釋了回聲客戶端所存在的問題&#xff0c;那么單單是客戶端的問題&#xff0c;服務端沒有任何問題&#xff1f;是的&#xff0c;服務端沒有問題&#xff0c;現在先讓…

談談iOS獲取調用鏈

本文由云社區發表iOS開發過程中難免會遇到卡頓等性能問題或者死鎖之類的問題&#xff0c;此時如果有調用堆棧將對解決問題很有幫助。那么在應用中如何來實時獲取函數的調用堆棧呢&#xff1f;本文參考了網上的一些博文&#xff0c;講述了使用mach thread的方式來獲取調用棧的步…

python 移動平均線_Python中的移動平均線

python 移動平均線There are situations, particularly when dealing with real-time data, when a conventional average is of little use because it includes old values which are no longer relevant and merely give a misleading impression of the current situation.…

Ireport制作過程

Ireport制作過程 1、首先要到Option下設置一下ClassPath添加文件夾 2、到預覽->報表字段設置一下將要用到的字段 3、到編輯->查詢報表->寫sql語句&#xff0c;然后把語句查詢的字段結果與上面設置的報表字段的名要對應上 4、Option->選項->Compiler設置一下…

2018.09.16 loj#10243. 移棋子游戲(博弈論)

傳送門 題目中已經給好了sg圖&#xff0c;直接在上面跑出sg函數即可。 最后看給定點的sg值異或和是否等于0就判好了。 代碼&#xff1a; #include<bits/stdc.h> #define N 2005 #define M 6005 using namespace std; int n,m,k,sg[N],first[N],First[N],du[N],cnt0,an…

html5字體的格式轉換,font字體

路由器之家網今天精心準備的是《font字體》&#xff0c;下面是詳解&#xff01;html中的標簽是什么意思HTML提供了文本樣式標記&#xff0c;用來控制網頁中文本的字體、字號和顏色&#xff0c;多種多樣的文字效果可以使網頁變得更加絢麗。其基本語法格式&#xff1a;文本內容fa…