數據分布策略_有效數據項目的三種策略

數據分布策略

Many data science projects do not go into production, why is that? There is no doubt in my mind that data science is an efficient tool with impressive performances. However, a successful data project is also about effectiveness: doing the right things as Russell Ackoff would write in “A systemic view of transformational leadership”.

許多數據科學項目沒有投入生產 ,為什么呢? 毫無疑問,數據科學是一種具有出色性能的有效工具。 但是,一個成功的數據項目也與有效性有關:如羅素·阿科夫(Russell Ackoff)在“ 變革型領導的系統觀點 ”中所寫, 做正確的事

Successful problem solving requires finding the right solution to the right problem. We fail more often because we solve the wrong problem than because we get the wrong solution to the right problem — Russell L. Ackoff (1974)

成功的問題解決需要找到正確問題的正確解決方案。 我們失敗的原因更多是因為我們解決了錯誤的問題,而不是因為我們沒有解決正確的問題— Russell L. Ackoff(1974)

How do you focus on your projects and make sure they will bring value to the company? Are you strategically thinking about how to bring your project to fruition?

您如何專注于您的項目并確保它們將為公司帶來價值? 您是否在戰略上考慮如何使您的項目實現?

NB: I will use golf — a strategic sport — as an illustrative analogy here.

注意:在這里,我將使用高爾夫這一具有戰略意義的運動作為比喻。

OKR:設定您致力于實現的目標 (OKR: Setting objectives that you commit to achieve)

Objectives and Key Results (OKR) have been adopted in successful organisations to drive tremendous growth (Intel, Google, …). They were initially introduced by John Doerr to increase focus that produces value.

目標和關鍵結果(OKR)已通過 成功的組織來推動巨大的增長(英特爾,谷歌等)。 它們最初是由約翰·多爾(John Doerr)引入的,以增加對創造價值的關注。

The general idea is to set Objectives that motivate you. Imagine you are passionate about golf and next Friday there is a big competition. In the last few years, nobody won it performing well on more than 15 holes out of the 18 on the course. Setting yourself to win it is a good objective — it is specific, ambitious, and happens at a given time. You then set Key Results that can measure how you are doing on this objective. In this golf example, they could be:

一般的想法是設定激勵您的目標 。 想象一下您對高爾夫充滿熱情,下周五將進行一場激烈的比賽。 在過去的幾年中,沒有人贏得過比賽中18個洞中超過15個洞的出色表現。 讓自己贏得勝利是一個很好的目標-這是特定的,雄心勃勃的,并且在特定時間發生 。 然后,您可以設置關鍵結果 ,以衡量您在此目標上的表現 在這個高爾夫示例中,它們可能是:

  • Hit a par (ideal number of shots to get into a hole) on at least 16 out of the 18 holes.

    在18個洞中的至少16個洞上擊出標準桿(理想的開槍次數)。
  • Avoid dropping the ball in a sand trap more than three times — because you know that you are bad at getting out of them.

    避免將球掉入沙坑中超過3次-因為您知道自己很難擺脫掉它們。
  • Go for a 20 min practice session before the competition — as you usually make a few bad shots with cold muscles.

    比賽前進行20分鐘的練習-因為您通常會因肌肉冰冷而做出一些不好的動作。

Checking all the key results are then a good indicator that you could win.

檢查所有關鍵結果便是您可能會獲勝的良好指示。

In another scenario, working for a large bank, picture you are tasked to build a loan risk model with 80% accuracy. Here are some possible key results:

在另一種情況下(為一家大型銀行工作),您需要負責建立準確性為80%的貸款風險模型。 以下是一些可能的關鍵結果:

  • Get 80% client repayment behaviour data by XX/YY/ZZZZ.

    通過XX / YY / ZZZZ獲取80%的客戶還款行為數據。
  • Test three explainable model types by AA/BB/CCCC.

    通過AA / BB / CCCC測試三種可解釋的模型類型。
  • Define and track four metrics to follow the model’s performances and understand where the model is wrong.

    定義并跟蹤四個指標,以跟蹤模型的性能并了解該模型在哪里出錯。

OKRs can be used to drive focus on anything. I find them useful to define my goals on a project: building a model or an application, when will it be good enough? Aiming for the key results brings clarity. Failing becomes a learning experience that stimulates better OKRs definitions and work. On the other hand, success is then crystal clear, and you should enjoy it.

OKR可用于推動對任何事物的關注。 我發現它們對于定義項目目標很有用:建立模型或應用程序,什么時候足夠好? 瞄準關鍵結果會帶來清晰度。 失敗成為一種學習經驗,可以激發更好的OKR定義和工作。 另一方面,成功是顯而易見的,您應該享受成功。

Must read on the topic: Measure what matters by John Doerr.

必須閱讀以下主題: 衡量 約翰·杜爾的重要性。

傳動系統方法 (The Drivetrain Approach)

A drivetrain approach is a comprehensive strategy to data products definition. The following diagram shows its essential steps:

傳動系統方法是數據產品定義的綜合策略。 下圖顯示了其基本步驟:

Image for post
iStock under license to M. Koutero.iStock的元素已獲M. Koutero許可。

In a new project we might ask ourselves:

在一個新項目中,我們可能會問自己:

  • Objectives

    目標

Setting objectives includes answering questions such as: Does it add value to the business? Is it aligned with the current roadmap? When should it be done? Is it opening new perspectives?

設定目標包括回答以下問題:是否能為企業增加價值? 它與當前路線圖一致嗎? 什么時候應該做? 它開辟了新的視角嗎?

  • Levers

    杠桿

What elements in the final product are under my control? Can I change the price of the product? The ranking on the recommendation page? …

我可以控制最終產品中的哪些元素? 我可以更改產品的價格嗎? 推薦頁面上的排名? …

  • Data

    數據

Given objectives and levers, what kind of data could I use? What are the compliance issues?

給定目標和杠桿,我可以使用哪種數據? 有哪些合規性問題?

  • Model / Simulation

    模型/模擬

Simulations should indicate if there is enough information in your data combined with your levers to get to your objectives. Could you drive more sales with fewer risks in the loan model example?

模擬應表明您的數據中是否有足夠的信息與您的杠桿相結合以實現目標。 在貸款模型示例中,您能否以更少的風險推動更多的銷售?

Every step is also an exit point. If you can’t find a solution alone or collectively, it might be an indication that it is not worth your time and should focus on something else.

每一步也是一個出口點。 如果您不能單獨或集體找到解決方案,則可能表明它不值得您花時間,而應專注于其他方面。

Must read on the topic: Designing great data products by Jeremy Howard, Margit Zwemer and Mike Loukides.

必須閱讀的題目是: 設計大數據產品 杰里米·霍華德 瑪吉特池維謀 麥克Loukides

決策智能 (Decision intelligence)

Decision intelligence is a more general discipline that tackles how to build a strategy given objectives in complex situations. The general process integrates notions such as external causes, multiple causal links, and feedback loops. Teams creating causal diagrams can then rationally decide upon a strategy with a clear perception of the problem at hand. One might understand decision intelligence as an extended merger between OKRs and the drive train approach.

決策智能是一門比較通用的學科,致力于解決復雜情況下給定目標的戰略制定方法。 常規過程集成了諸如外部原因,多個因果鏈接和反饋循環之類的概念。 然后,創建因果圖的團隊可以合理地決定策略,并清楚地了解當前的問題。 人們可能將決策智能理解為OKR與動力傳動系統方法之間的擴展合并。

Image for post
iStock under license to M. Koutero.iStock的元素,并已獲得M. Koutero的許可。

In the small example above, once you select a club, whether the ball will fly high (and hopefully far) or stay rolling on the ground means the wind is more or less likely to affect. Staying on the ground might be safer, but making only small shots, you will need more of them. Having a good strategy means you will find a reasonable equilibrium to achieve your objectives and goals.

在上面的小示例中,一旦選擇了一個球桿,球會飛高(并希望遠飛)還是保持在地面上滾動,這意味著或多或少會影響風。 留在地面上可能會更安全,但只拍攝一點,您將需要更多。 擁有良好的策略意味著您將找到一個合理的平衡點來實現自己的目標。

In the OKR example about the risk loan model, we would make here deeper inquiries. Would having a loan model that makes mistakes on certain types of customers be a hazard on equity? Is it possible that employees in charge of validating loans would only rely on the model, become less critical thinkers and be less likely to adjust their behaviour when delicate cases occur? Causal diagrams enable you to understand the indirect consequences of your decisions. If you consider that getting the right clean data and building a model ready for production is often a task that takes months, is it not worth spending some time on the reasons you are doing it?

在有關風險貸款模型的OKR示例中,我們將在這里進行更深入的查詢。 具有在某些類型的客戶上犯錯誤的貸款模型會危害股本嗎? 負責發生貸款問題的員工是否可能僅依靠模型,變得不那么批判性的思想家并且在發生細微情況時不太可能調整其行為? 因果圖使您能夠理解決策的間接后果。 如果您認為獲取正確的干凈數據并為生產做好準備的模型構建通常需要花費數月的時間,難道不應該花一些時間在做這些事情的原因上嗎?

For engineers and scientists, it is not extremely different from specifying a classical digital product with its constraints and target performances but broadening the perspective. What is interesting to me, is the focus on the decision making (“should I build this product and how?”) putting both business and technical people together to make sure that at the scale of a whole ecosystem, the next move is the right one.

對于工程師和科學家而言,它與指定具有約束條件和目標性能的經典數字產品并沒有什么不同,但是拓寬了視野。 對我而言,有趣的是將重點放在決策上(“我應該制造這種產品以及如何制造嗎?”),將業務人員和技術人員放在一起,以確保在整個生態系統的規模上,下一步行動是正確的之一。

Must read on the topic: Link by Lorien Pratt

必須閱讀以下主題: Lorien Pratt的 鏈接

Strategy is not limited to a top/down practice falling under the umbrella of leaders, managers, product managers, etc. I think it is part of any job to meet halfway and have some strategical thinking under the hood. Maybe these frameworks are sometimes too elaborated, but at its core, they start with a simple question that we can ask ourselves: why should I do this project?

戰略不僅限于領導,經理,產品經理等領導下的自上而下的實踐。我認為,中途開會并有一些戰略思想是任何工作的一部分。 也許這些框架有時過于復雜,但從根本上講,它們以一個簡單的問題開始,我們可以問自己:我為什么要進行這個項目?

As a field, growing beyond the AI hype, we cannot stay in an isolated system, extending our level of specialisation without clearly showing its value. Missing middle professionals are likely to be of importance in this task (Paul R. Daugherty — CTIO at Accenture and Lorien Pratt). Whether they will be decision intelligence specialists, data strategists or data product managers will be a matter of semantics and establishing new practices in the field.

作為一個超越AI炒作的領域,我們不能停留在孤立的系統中,無法在沒有清楚顯示其價值的情況下擴展專業化水平。 缺少中層專業人員可能對這項任務很重要( Paul R. Daugherty-埃森哲公司的CTIO和Lorien Pratt)。 他們將是決策情報專家數據戰略家還是數據產品經理,將取決于語義并確定該領域的新實踐。

___________

___________

References :

參考文獻:

  • Ackoff, R, L: 1998, A Systemic View of Transformational Leadership (Systemic Practice and Action Research).

    Ackoff,R,L:1998年, 《變革型領導的系統觀點》 (系統實踐與行動研究)。

  • Ackoff, R. L.: 1974, Redesigning the Future: A Systems Approach to Societal Problems (John Wiley & Sons).

    RL,阿科夫(Ackoff),1974年,《 重新設計未來:社會問題的系統方法》 (約翰·威利父子)。

  • Doerr, J: 2018, Measure what matters: How Google, Bono, and the Gates Foundation rock the world with OKRs (Portfolio Penguin).

    Doerr,J:2018, 衡量重要的事情:Google,Bono和蓋茨基金會如何利用OKR (Portfolio Penguin) 震撼整個世界

  • Pratt, L: 2019, Link (Emerald Publishing Limited).

    普拉特,L:2019, Link (Emerald Publishing Limited)。

翻譯自: https://towardsdatascience.com/three-strategies-towards-effective-data-projects-eed29ad05ded

數據分布策略

本文來自互聯網用戶投稿,該文觀點僅代表作者本人,不代表本站立場。本站僅提供信息存儲空間服務,不擁有所有權,不承擔相關法律責任。
如若轉載,請注明出處:http://www.pswp.cn/news/391881.shtml
繁體地址,請注明出處:http://hk.pswp.cn/news/391881.shtml
英文地址,請注明出處:http://en.pswp.cn/news/391881.shtml

如若內容造成侵權/違法違規/事實不符,請聯系多彩編程網進行投訴反饋email:809451989@qq.com,一經查實,立即刪除!

相關文章

cell 各自的高度不同的時候

1, cell 根據文字、圖片等內容,確定自己的高度。每一個cell有自己的高度。 2,tableView 初始化 現實的時候,不是從第一個cell開始顯示,(從第二個?),非非正常顯示。 a:cell 的高度問題…

leetcode 978. 最長湍流子數組(滑動窗口)

當 A 的子數組 A[i], A[i1], …, A[j] 滿足下列條件時&#xff0c;我們稱其為湍流子數組&#xff1a; 若 i < k < j&#xff0c;當 k 為奇數時&#xff0c; A[k] > A[k1]&#xff0c;且當 k 為偶數時&#xff0c;A[k] < A[k1]&#xff1b; 或 若 i < k < j&…

spring boot源碼下載地址

github下載&#xff1a; https://github.com/spring-projects/spring-boot/tree/1.5.x git地址&#xff1a; https://github.com/spring-projects/spring-boot.git 因為項目中目前使用的就是spring boot 1.5.19版本&#xff0c;因此這里先研究spring boot 1.5版本源碼.轉載于:h…

java基礎學習——5、HashMap實現原理

一、HashMap的數據結構 數組的特點是&#xff1a;尋址容易&#xff0c;插入和刪除困難&#xff1b;而鏈表的特點是&#xff1a;尋址困難&#xff0c;插入和刪除容易。那么我們能不能綜合兩者的特性&#xff0c;做出一種尋址容易&#xff0c;插入刪除也容易的數據結構&#xff1…

看懂nfl定理需要什么知識_NFL球隊為什么不經常通過?

看懂nfl定理需要什么知識Debunking common NFL myths in an analytical study on the true value of passing the ball在關于傳球真實價值的分析研究中揭穿NFL常見神話 Background背景 Analytics are not used enough in the NFL. In a league with an abundance of money, i…

Docker初學者指南-如何創建您的第一個Docker應用程序

您是一名開發人員&#xff0c;并且想要開始使用Docker&#xff1f; 本文是為您準備的。 (You are a developer and you want to start with Docker? This article is made for you.) After a short introduction on what Docker is and why to use it, you will be able to cr…

mybatis if-else(寫法)

mybaits 中沒有else要用chose when otherwise 代替 范例一 <!--批量插入用戶--> <insert id"insertBusinessUserList" parameterType"java.util.List">insert into business_user (id , user_type , user_login )values<foreach collection…

spring—攔截器和異常

SpringMVC的攔截器 SpringMVC攔截器-攔截器的作用 Spring MVC 的攔截器類似于 Servlet 開發中的過濾器 Filter&#xff0c;用于對處理器進行預處理和后處理。 將攔截器按一定的順序聯結成一條鏈&#xff0c;這條鏈稱為攔截器鏈&#xff08;InterceptorChain&#xff09;。在…

29/07/2010 sunrise

** .. We can only appreciate the miracle of a sunrise if we have waited in the darkness .. 人們在黑暗中等待著&#xff0c;那是期盼著如同日出般的神跡出現 .. 附&#xff1a;27/07/2010 sunrise ** --- 31 July 改動轉載于:https://www.cnblogs.com/orderedchaos/archi…

密度聚類dbscan_DBSCAN —基于密度的聚類方法的演練

密度聚類dbscanThe idea of having newer algorithms come into the picture doesn’t make the older ones ‘completely redundant’. British statistician, George E. P. Box had once quoted that, “All models are wrong, but some are useful”, meaning that no model…

node aws 內存溢出_在AWS Elastic Beanstalk上運行生產Node應用程序的現實

node aws 內存溢出by Jared Nutt賈里德努特(Jared Nutt) 在AWS Elastic Beanstalk上運行生產Node應用程序的現實 (The reality of running a production Node app on AWS Elastic Beanstalk) 從在AWS的ELB平臺上運行生產Node應用程序兩年的經驗教訓 (Lessons learned from 2 y…

Day2-數據類型

數據類型與內置方法 數據類型 數字字符串列表字典元組集合字符串 1.用途 用來描述某個物體的特征&#xff1a;姓名&#xff0c;性別&#xff0c;愛好等 2.定義方式 變量名 字符串 如&#xff1a;name huazai 3.常用操作和內置方法 1.按索引取值&#xff1a;&#xff08;只能取…

嵌套路由

父組件不能用精準匹配&#xff0c;否則只組件路由無法展示 轉載于:https://www.cnblogs.com/dianzan/p/11308146.html

leetcode 992. K 個不同整數的子數組(滑動窗口)

給定一個正整數數組 A&#xff0c;如果 A 的某個子數組中不同整數的個數恰好為 K&#xff0c;則稱 A 的這個連續、不一定獨立的子數組為好子數組。 &#xff08;例如&#xff0c;[1,2,3,1,2] 中有 3 個不同的整數&#xff1a;1&#xff0c;2&#xff0c;以及 3。&#xff09; …

從完整的新手到通過TensorFlow開發人員證書考試

I recently graduated with a bachelor’s degree in Civil Engineering and was all set to start with a Master’s degree in Transportation Engineering this fall. Unfortunately, my plans got pushed to the winter term because of COVID-19. So as of January this y…

微信開發者平臺如何編寫代碼_編寫超級清晰易讀的代碼的初級開發者指南

微信開發者平臺如何編寫代碼Writing code is one thing, but writing clean, readable code is another thing. But what is “clean code?” I’ve created this short clean code for beginners guide to help you on your way to mastering and understanding the art of c…

【轉】PHP面試題總結

PHP面試總結 PHP基礎1&#xff1a;變量的傳值與引用。 2&#xff1a;變量的類型轉換和判斷類型方法。 3&#xff1a;php運算符優先級&#xff0c;一般是寫出運算符的運算結果。 4&#xff1a;PHP中函數傳參&#xff0c;閉包&#xff0c;判斷輸出的echo&#xff0c;print是不是函…

Winform控件WebBrowser與JS腳本交互

1&#xff09;在c#中調用js函數 如果要傳值&#xff0c;則可以定義object[]數組。 具體方法如下例子&#xff1a; 首先在js中定義被c#調用的方法: function Messageaa(message) { alert(message); } 在c#調用js方法Messageaa private void button1_Click(object …

從零開始擼一個Kotlin Demo

####前言 自從google將kotlin作為親兒子后就想用它擼一管app玩玩&#xff0c;由于工作原因一直沒時間下手&#xff0c;直到項目上線后才有了空余時間&#xff0c;期間又由于各種各樣煩人的事斷了一個月&#xff0c;現在終于開發完成項目分為服務器和客戶端&#xff1b;服務器用…

移動平均線ma分析_使用動態移動平均線構建交互式庫存量和價格分析圖

移動平均線ma分析I decided to code out my own stock tracking chart despite a wide array of freely available tools that serve the same purpose. Why? Knowledge gain, it’s fun, and because I recognize that a simple project can generate many new ideas. Even t…