flink 檢查點_Flink檢查點和恢復

flink 檢查點

Apache Flink is a popular real-time data processing framework. It’s gaining more and more popularity thanks to its low-latency processing at extremely high throughput in a fault-tolerant manner.

Apache Flink是一種流行的實時數據處理框架。它以容錯的方式以極高的吞吐量進行低延遲處理，因此越來越受歡迎。

While there is a good documentation provided by Flink it took me some time to get to understand the various mechanics that come together to make Flink Check pointing and Recovery work end to end. In this article I will explain the key steps one need to perform at various operator levels to create a fault tolerant Flink Job. Flink basic operators are Source, Process and Sink. Process operators could be of various flavors.

盡管Flink提供了很好的文檔，但是我花了一些時間來理解使Flink Check Pointing和Recovery工作端到端結合在一起的各種機制。在本文中，我將解釋在各種操作員級別上創建容錯Flink Job所需執行的關鍵步驟。 Flink的基本運算符是Source，Process和Sink。過程操作員可能具有多種口味。

So let’s get started on what you need to do to enable check pointing and making all operators Checkpoint aware.

因此，讓我們開始您需要做的事情，以啟用檢查點并使所有操作員都知道Checkpoint。

Flink環境配置(檢查指向) (Flink Environment Configuration (Check pointing))

Flink Job Configuration for Check pointing

用于檢查點的Flink作業配置

源運營商檢查點 (Source Operator Checkpointing)

Source operator is the one which fetches data from the source. I wrote a simple SQL continuous query based source operator and kept track of the timestamp till the data has been queried. This information is what will be stored as part of check pointing process by flink. State of the source is saved by flink at the Job Operator level. CheckPointedFunction interface or ListCheckpointed interface should be implemented by the Source function as follows:

源運算符是從源獲取數據的運算符。我編寫了一個簡單的基于SQL連續查詢的源運算符，并跟蹤時間戳，直到查詢完數據為止。該信息將作為flink在檢查點過程中存儲的信息。源的狀態通過flink在作業操作員級別保存。 CheckPointedFunction接口或ListCheckpointed接口應該由Source函數實現，如下所示：

snapshotState method will be called by the Flink Job Operator every 30 seconds as configured. Method should return the value to be saved in state backend

Flink作業操作員將按配置每30秒調用一次snapshotState方法。方法應返回要保存在狀態后端的值

restoreState method is called when the operator is restarting and this method is the handler method to set the last stored timestamp (state) during a checkpoint

當操作員重新啟動時將調用restoreState方法，并且該方法是在檢查點期間設置最后存儲的時間戳(狀態)的處理程序方法

過程功能檢查點 (Process Function Checkpointing)

Flink supports saving state per key via KeyedProcessFunction. ProcessWindowFunction can also save the state of windows on per key basis in case of Event Time processing

Flink支持通過KeyedProcessFunction保存每個鍵的狀態。在事件時間處理的情況下， ProcessWindowFunction還可以按鍵保存窗口的狀態

For KeyedProcessFunction, ValueState need to be stored per key as follows:

對于KeyedProcessFunction ，需要按以下方式存儲每個鍵的ValueState ：

ValueState is just one of the examples. There are other ways to save the state as well. ProcessWindowFunction automatically saves the window state and no variable need to be set.

ValueState只是示例之一。還有其他保存狀態的方法。 ProcessWindowFunction自動保存窗口狀態，無需設置任何變量。

接收器功能檢查點 (Sink Function Checkpointing)

Sink function check pointing works similar to Source Function check pointing and state is saved at the Job Operator level. I have implemented Sink function for Postgres DB. There could be multiple approaches to make sink function fault tolerant and robust considering performance and efficiency. I have taken a simplistic approach and will improve upon it in future.

接收器功能檢查指向的工作方式類似于源功能檢查指向，并且狀態保存在作業操作員級別。我已經為Postgres DB實現了Sink功能。考慮到性能和效率，可以有多種方法使接收器功能具有容錯性和魯棒性。我采用了一種簡單的方法，將來會對其進行改進。

By committing statement in snapshotState method I’m ensuring that all pending data is flushed and committed as part of checkpointing trigger.

通過在snapshotState方法中提交語句，我確保將所有未決數據刷新并作為檢查點觸發器的一部分提交。

可以了，好了 (All Set)

Finally, you need to run your job and you can try to cancel it in between of processing and try to rerun it by providing the checkpoint location as follows. You will need to pass the latest checkpoint yourself, pay attention to -s parameter.

最后，您需要運行您的作業，您可以嘗試在處理之間取消它，并通過提供以下檢查點位置來嘗試重新運行它。您將需要自己通過最新的檢查點，請注意-s參數。

.\flink.bat run -m localhost:8081 -s D:\flink-checkpoints\1d96f28886b693452ab1c88ab72a35c8\chk-10 -c <Job class Name> <Path to Jar file>

結論 (Conclusion)

This is a basic approach toward checkpointing and failure recovey and might need more improvements depending upon each use case. Feel free to provide me your feedback. Happy Reading!!

這是進行檢查點和故障重新報告的基本方法，并且可能需要根據每個用例進行更多的改進。隨時向我提供您的反饋。閱讀愉快！

Repository Link to codebase:

倉庫鏈接到代碼庫：

翻譯自: https://towardsdatascience.com/flink-checkpointing-and-recovery-7e59e76c2d45

flink 檢查點

本文來自互聯網用戶投稿，該文觀點僅代表作者本人，不代表本站立場。本站僅提供信息存儲空間服務，不擁有所有權，不承擔相關法律責任。
如若轉載，請注明出處：http://www.pswp.cn/news/388036.shtml
繁體地址，請注明出處：http://hk.pswp.cn/news/388036.shtml
英文地址，請注明出處：http://en.pswp.cn/news/388036.shtml

如若內容造成侵權/違法違規/事實不符，請聯系多彩編程網進行投訴反饋email:809451989@qq.com，一經查實，立即刪除！

917. 僅僅反轉字母

給定一個字符串 S，返回 “反轉后的” 字符串，其中不是字母的字符都保留在原地，而所有字母的位置發生反轉。示例 1： 輸入："ab-cd" 輸出："dc-ba"示例 2： 輸入：&q…

C# socket nat 映射網絡代理轉發

using System;using System.Collections.Generic;using System.Net;using System.Net.Sockets;using System.Text;using System.Threading;namespace portmap_net{/// <summary>/// 映射器實例狀態/// </summary>sealed internal class state{#region Fields (5)pu…

python初學者_初學者使用Python的完整介紹

python初學者A magical art of teaching a computer to perform a task is called computer programming. Programming is one of the most valuable skills to have in this competitive world of computers. We, as modern humans, are living with lots of gadgets such as …

c# nat udp轉發

UdpClient myClient;Thread recvThread;//打開udp端口開始接收private void startRecv(int port){myClient new UdpClient(port);recvThread new Thread(new ThreadStart(receive));recvThread.Start();}//停止接收private void stopRecv(){recvThread.Abort();}private void…

【Code-Snippet】TextView

1. TextView文字過長，顯示省略號【參考】必須要同時設置XML和JAVA，而且，java中設置文字必須是在最后。 android:ellipsize"start|end|middle" //省略號的位置 android:singleLine"true" android:lines"2"…

Object 的靜態方法之 defineProperties 以及數據劫持效果

再提一下什么是靜態方法： 靜態方法：在類身上的方法，　　動態方法:在實例身上的方法 Object.defineProperties(obj, props)obj：被添加屬性的對象props：添加或更新的屬性對象給對象定義屬性，如果存在該屬性&a…

Spring實現AOP的4種方式

Spring實現AOP的4種方式先了解AOP的相關術語: 1.通知(Advice): 通知定義了切面是什么以及何時使用。描述了切面要完成的工作和何時需要執行這個工作。 2.連接點(Joinpoint): 程序能夠應用通知的一個“時機”，這些“時機”就是連接點，例如方法被調用時、…

如何使用Plotly在Python中為任何DataFrame繪制地圖的衛星視圖

Chart-Studio和Mapbox簡介 (Introduction to Chart-Studio and Mapbox) Folium and Geemap are arguably the best GIS libraries/tools to plot satellite-view maps or any other kinds out there, but at times they require an additional authorization to use the Google…

Java入門系列-26-JDBC

認識 JDBC JDBC (Java DataBase Connectivity) 是 Java 數據庫連接技術的簡稱，用于連接常用數據庫。 Sun 公司提供了 JDBC API ，供程序員調用接口和類，集成在 java.sql 和 javax.sql 包中。 Sun 公司還提供了 DriverManager 類用來管理各種不…

3.19PMP試題每日一題

在房屋建造過程中，應該先完成衛生管道工程，才能進行電氣工程施工，這是一個：A、強制性依賴關系B、選擇性依賴關系C、外部依賴關系D、內部依賴關系作者：Tracy19890201（同微信號）轉載于:https://…

Can't find temporary directory:internal error

今天我機子上的SVN突然沒有辦法進行代碼提交了，出現的錯誤提示信息為： Error：Cant find temporary directory:internal error 然后試了下其他的SVN源，發現均無法提交，并且update時也出現上面的錯誤信息。對比項目文件…

snowflake 數據庫_Snowflake數據分析教程

snowflake 數據庫目錄 (Table of Contents) Introduction 介紹 Creating a Snowflake Datasource 創建雪花數據源 Querying Your Datasource 查詢數據源 Analyzing Your Data and Adding Visualizations 分析數據并添加可視化 Using Drilldowns on Your Visualizations 在可視化…

jeesite緩存問題

jeesite，其框架主要為： 后端核心框架：Spring Framework 4.0 安全框架：Apache Shiro 1.2 視圖框架：Spring MVC 4.0 服務端驗證：Hibernate Validator 5.1 布局框架：SiteMesh 2.4 工作流引擎…

高級Python：定義類時要應用的9種最佳做法

重點 (Top highlight)At its core, Python is an object-oriented programming (OOP) language. Being an OOP language, Python handles data and functionalities by supporting various features centered around objects. For instance, data structures are all objects, …