我們通過在關鍵節點打斷點的方式,去分析一個鼠標事件的流程。
我們知道chromium是多進程模型,那么,我們可以推測:一個鼠標消息先從主進程產生,再通過跨進程通信發送給渲染進程,渲染進程再發送給WebFrame,最后被派發到目標Node上。
于是,首先,我們在主進程,找到鼠標時間的最終消費點,打上斷點,進行分析:
主進程繼續往下走,就是把事件通過跨進程通信框架mojo發送過去的邏輯了:
接下來,我們要換個進程,在渲染進程打上斷點,看看消息過來后,怎么走的流程:
首先搜索這個mojom的事件,確認渲染進程斷點落腳處:
題外話:這個input_handler.mojom.cc到了渲染進程那就成了input_handler.mojom-blink.cc了。
這兩個文件99%相似,只有少量的不同(命名空間、基礎類型):
這兩個類的基類是一樣的:
這個類的頭文件寫明了,是由什么工具生成的:
很好奇生成源,我們也去看看:
// Copyright 2020 The Chromium Authors
// Use of this source code is governed by a BSD-style license that can be
// found in the LICENSE file.module blink.mojom;import "cc/mojom/browser_controls_state.mojom";
import "cc/mojom/overscroll_behavior.mojom";
import "cc/mojom/touch_action.mojom";
import "mojo/public/mojom/base/string16.mojom";
import "mojo/public/mojom/base/time.mojom";
import "third_party/blink/public/mojom/input/gesture_event.mojom";
import "third_party/blink/public/mojom/input/handwriting_gesture_result.mojom";
import "third_party/blink/public/mojom/input/input_event_result.mojom";
import "third_party/blink/public/mojom/input/input_event.mojom";
import "third_party/blink/public/mojom/input/pointer_lock_context.mojom";
import "third_party/blink/public/mojom/input/pointer_lock_result.mojom";
import "third_party/blink/public/mojom/input/ime_host.mojom";
import "third_party/blink/public/mojom/input/stylus_writing_gesture.mojom";
import "third_party/blink/public/mojom/input/touch_event.mojom";
import "third_party/blink/public/mojom/selection_menu/selection_menu_behavior.mojom";
import "ui/base/ime/mojom/ime_types.mojom";
import "ui/events/mojom/event_constants.mojom";
import "ui/events/mojom/event_latency_metadata.mojom";
import "ui/events/mojom/event.mojom";
import "ui/events/mojom/scroll_granularity.mojom";
import "ui/gfx/geometry/mojom/geometry.mojom";
import "ui/gfx/range/mojom/range.mojom";
import "ui/latency/mojom/latency_info.mojom";[EnableIf=is_android]
import "third_party/blink/public/mojom/input/synchronous_compositor.mojom";// These structs are purposely duplicated from ui/events/mojom/event.mojom.
// They map WebInputEvent <-> WebInputEvent across mojo.
// We have to work at unifying them. The current problem is that the browser
// uses WebInputEvents inside the render widget host and input router. Once
// we move these to ui::Event's then we can get rid of these duplicated
// mojom structs. Ideally the browser would use ui::Event up until we
// pass the events into the renderer and just use a StructTraits to perform
// conversion from ui::mojom::Event --> blink::WebInputEvent.
struct KeyData {int32 dom_key;int32 dom_code;int32 windows_key_code;int32 native_key_code;bool is_system_key;bool is_browser_shortcut;mojo_base.mojom.String16 text;mojo_base.mojom.String16 unmodified_text;
};struct PointerData {int32 pointer_id;float force;double tilt_x;double tilt_y;float tangential_pressure;int32 twist;blink.mojom.Button button;ui.mojom.EventPointerType pointer_type;int32 movement_x;int32 movement_y;bool is_raw_movement_event;gfx.mojom.PointF widget_position;gfx.mojom.PointF screen_position;MouseData? mouse_data;int32 device_id;
};struct WheelData {float delta_x;float delta_y;float wheel_ticks_x;float wheel_ticks_y;float acceleration_ratio_x;float acceleration_ratio_y;uint8 phase;uint8 momentum_phase;blink.mojom.DispatchType cancelable;uint8 event_action;uint8 delta_units;
};struct MouseData {int32 click_count;WheelData? wheel_data;
};struct ScrollUpdate {float velocity_x;float velocity_y;
};struct ScrollData {float delta_x;float delta_y;ui.mojom.ScrollGranularity delta_units;bool target_viewport;blink.mojom.InertialPhaseState inertial_phase;bool synthetic;int32 pointer_count;// Used for GestureScrollBegin type only. If this is true, we use the scroll// sequence to do cursor control rather than scroll.bool cursor_control;ScrollUpdate? update_details;
};struct PinchBeginData {bool needs_wheel_event;
};struct PinchUpdateData {float scale;bool zoom_disabled;bool needs_wheel_event;
};struct PinchEndData {bool needs_wheel_event;
};struct FlingData {float velocity_x;float velocity_y;bool target_viewport;bool prevent_boosting;
};struct TapData {int32 tap_count;bool needs_wheel_event;
};struct TapDownData {int32 tap_down_count;
};struct GestureData {gfx.mojom.PointF screen_position;gfx.mojom.PointF widget_position;blink.mojom.GestureDevice source_device;bool is_source_touch_event_set_blocking;ui.mojom.EventPointerType primary_pointer_type;int32 primary_unique_touch_event_id;int32 unique_touch_event_id;gfx.mojom.Size? contact_size;ScrollData? scroll_data;PinchBeginData? pinch_begin_data;PinchUpdateData? pinch_update_data;PinchEndData? pinch_end_data;TapData? tap_data;TapDownData? tap_down_data;FlingData? fling_data;
};struct TouchPoint {blink.mojom.TouchState state;float radius_x;float radius_y;float rotation_angle;PointerData pointer_data;
};struct TouchData {blink.mojom.DispatchType cancelable;bool moved_beyond_slop_region;bool touch_start_or_first_move;bool hovering;uint32 unique_touch_event_id;array<TouchPoint> touches;
};struct Event {blink.mojom.EventType type;int32 modifiers;mojo_base.mojom.TimeTicks timestamp;ui.mojom.LatencyInfo latency;ui.mojom.EventLatencyMetadata event_latency_metadata;KeyData? key_data;PointerData? pointer_data;GestureData? gesture_data;TouchData? touch_data;
};// Represents the current state of overscroll.
struct DidOverscrollParams {gfx.mojom.Vector2dF accumulated_overscroll;gfx.mojom.Vector2dF latest_overscroll_delta;gfx.mojom.Vector2dF current_fling_velocity;gfx.mojom.PointF causal_event_viewport_point;cc.mojom.OverscrollBehavior overscroll_behavior;
};// A struct wrapper for the TouchAction enumeration because
// enumerations cannot be optional.
struct TouchActionOptional {cc.mojom.TouchAction touch_action;
};// Types related to sending edit commands to the renderer.
struct EditCommand {string name;string value;
};// A structure that indicates selection offsets if the selection could be
// established.
struct SelectAroundCaretResult {// The offset differences between the extended selection and the initial// selection (which is a caret).int32 extended_start_adjust;int32 extended_end_adjust;// The offset differences between the word selection (regardless of the// extended selection granularity) and the initial selection (which is a// caret).int32 word_start_adjust;int32 word_end_adjust;
};// GENERATED_JAVA_ENUM_PACKAGE: org.chromium.blink_public.input
// GENERATED_JAVA_CLASS_NAME_OVERRIDE: SelectionGranularity
enum SelectionGranularity {kWord,kSentence,
};// An enumeration that describes the allowed non-directional pan action for the
// element under pointer being considered.
enum PanAction {// No pan action allowed.kNone,// Pan action is scroll.kScroll,// Pan action moves cursor when initial touch move is in horizontal direction,// or scrolls in the vertical direction.kMoveCursorOrScroll,// Pan action is stylus writable.kStylusWritable,
};// Interface exposed by the browser to the renderer.
interface WidgetInputHandlerHost {// When the renderer's main thread computes the touch action, send this to the// browser.SetTouchActionFromMain(cc.mojom.TouchAction touch_action);// Sets the pan action possible for the element under pointer which could be// one of the actions described under enum PanAction. Note that this is// different from TouchAction and does not contain directional pan info.SetPanAction(PanAction pan_action);// Sent by the compositor when input scroll events are dropped due to bounds// restrictions on the root scroll offset.DidOverscroll(DidOverscrollParams params);// Sent by the compositor when a GSB has started scrolling the viewport.DidStartScrollingViewport();// Required for cancelling an ongoing input method composition.ImeCancelComposition();// Sends the character bounds after every composition change// to always have correct bound info.ImeCompositionRangeChanged(gfx.mojom.Range range,array<gfx.mojom.Rect>? character_bounds,array<gfx.mojom.Rect>? line_bounds);// Updates the mouse capture state of this widget. While capture is enabled,// all mouse events, including those that don't hittest to this widget, will// be targeted to this widget. This enables Blink to behave correctly when// a scrollbar is being dragged, or text is being drag-highlighted, even// when the mouse passes across different RenderWidget areas.SetMouseCapture(bool capture);// Updates the browser whether this main frame widget has ongoing autoscroll// selection. Any further mouse up event should always be dispatched to the// main frame in addition to it's event target (OOP child frame) if the state// is active. This API should only ever be called on the InputHandler for a// main frame's RenderWidgetHost.SetAutoscrollSelectionActiveInMainFrame(bool autoscroll_selection);// Requests locking the target of mouse events to a single element and// removing the cursor from view. Mostly used by the Pointer Lock API.// See https://www.w3.org/TR/pointerlock/ for more info. This call is// also used by Pepper Flash.// |from_user_gesture| indicates whether this request came from a user// gesture or not.// |unadjusted_movement| indicates whether the request asked for raw mouse// movement data or just what the operating system returns (often accelerated// mouse movement).// |result| kSuccess if the mouse has been locked or the appropriate error// reason if not.// |context| is one end of a mojo pipe that will stay connected as long as// the mouse is locked. Is a NullRemote if |result| is not kSuccess.RequestMouseLock(bool from_user_gesture,bool unadjusted_movement)=> (PointerLockResult result,pending_remote<blink.mojom.PointerLockContext>? context);
};// This interface provides the input actions associated with the FrameWidget.
// Other input actions may also be dispatched via the WidgetInputHandler
// interface. If frame input actions are dispatched the WidgetInputHandler
// should be fetched via the associated interface request so that input calls
// remain in order. See https://goo.gl/x4ee8A for more details.
interface FrameWidgetInputHandler {// Adds text decorations between a given valid start and end offsets in the// currently focused editable field.AddImeTextSpansToExistingText(uint32 start, uint32 end, array<ui.mojom.ImeTextSpan> ime_text_spans);// Clears text decorations type between a given valid start and end offsets// in the currently focused editable field.ClearImeTextSpansByType(uint32 start, uint32 end, ui.mojom.ImeTextSpanType type);// Sets the text composition to be between the given start and end offsets in// the currently focused editable field.SetCompositionFromExistingText(int32 start, int32 end, array<ui.mojom.ImeTextSpan> ime_text_spans);// Deletes the current selection plus the specified number of characters// before and after the selection or caret.ExtendSelectionAndDelete(int32 before, int32 after);// Deletes the current selection plus the specified number of characters// before and after the selection or caret.ExtendSelectionAndReplace(uint32 before,uint32 after,mojo_base.mojom.String16 replacement_text);// Deletes text before and after the current cursor position, excluding the// selection. The lengths are supplied in Java chars (UTF-16 Code Unit),// not in code points or in glyphs.DeleteSurroundingText(int32 before, int32 after);// Deletes text before and after the current cursor position, excluding the// selection. The lengths are supplied in code points, not in Java chars// (UTF-16 Code Unit) or in glyphs. Does nothing if there are one or more// invalid surrogate pairs in the requested rangeDeleteSurroundingTextInCodePoints(int32 before, int32 after);// Selects between the given start and end offsets in the currently focused// editable field.SetEditableSelectionOffsets(int32 start, int32 end);// Stylus Writing - perform Gesture action in input using gesture data.HandleStylusWritingGestureAction(blink.mojom.StylusWritingGestureData gesture_data)=> (HandwritingGestureResult result);// Message payload is the name/value of a WebCore edit command to execute.ExecuteEditCommand(string command, mojo_base.mojom.String16? value);// These messages are typically generated from context menus and request the// renderer to apply the specified operation to the current selection.Undo();Redo();Cut();Copy();CopyToFindPboard();CenterSelection();Paste();PasteAndMatchStyle();Delete();SelectAll();CollapseSelection();// Pushed from the browser to the renderer each time the IME is activated,// e.g. an editable element becomes focused.[EnableIf=is_android]PassImeRenderWidgetHost(pending_remote<ImeRenderWidgetHost> remote);// Replaces the selected region or a word around the cursor with the// specified string.Replace(mojo_base.mojom.String16 word);// Replaces the misspelling in the selected region with the specified string.ReplaceMisspelling(mojo_base.mojom.String16 word);// Requests the renderer to select the region between two points.// Expects a SelectRange_ACK message when finished.SelectRange(gfx.mojom.Point base, gfx.mojom.Point extent);// Sent by the browser to ask the renderer to adjust the selection start and// end points by the given amounts. A negative amount moves the selection// towards the beginning of the document, a positive amount moves the// selection towards the end of the document. Will send show selection menu// event when needed.AdjustSelectionByCharacterOffset(int32 start, int32 end, SelectionMenuBehavior behavior);// Requests the renderer to select the specified granularity around caret and// to potentially show the selection handles and / or context menu after// selection.// Expects ack with new selection information when finished, or null if the// selection failed.SelectAroundCaret(SelectionGranularity granularity, bool should_show_handle,bool should_show_context_menu)=> (SelectAroundCaretResult? result);// Requests the renderer to move the selection extent point to a new position.// Expects a MoveRangeSelectionExtent_ACK message when finished.MoveRangeSelectionExtent(gfx.mojom.Point extent);// Tells the renderer to scroll the currently focused node into view only if// the currently focused node is a Text node (textfield, text area or content// editable divs).ScrollFocusedEditableNodeIntoView();// Replies when the next PageScaleAnimation has completed. Can only be called// on the outermost main frame. Can be used to reliably wait for a// ScrollFocusedEditableNodeIntoView to complete since that may create a// PageScaleAnimation which can take several frames to complete.// ScrollFocusedEditableNodeIntoView cannot use a reply callback for this// since it may be called on a subframe but the scroll will be completed in// another renderer when it bubbles up to the root. Additionally, the caller// may not be the one to have called ScrollFocusedEditableNodeIntoView; for// example, a test that simulates tapping an input box.WaitForPageScaleAnimationForTesting() => ();// Requests the renderer to move the caret selection toward the point.MoveCaret(gfx.mojom.Point point);
};// An enumeration describing the active and focus states. If a widget is
// focused then it must also be active, therefore there is no focused and
// not active value in this enumeration. All widgets in a tab will have
// the same active state. Active state is tab specific, and focus state is
// frame specific. See
// https://www.chromium.org/developers/design-documents/aura/focus-and-activation/
enum FocusState
{kFocused,kNotFocusedAndActive,kNotFocusedAndNotActive
};// Interface exposed by the renderer to the browser. This class represents
// an input interface for an associated Widget object. See FrameWidgetInputHandler
// for an interface at the frame level.
interface WidgetInputHandler {// Tells widget focus has been changed.SetFocus(FocusState state);// Tells widget mouse capture has been lost.MouseCaptureLost();// This message notifies the renderer that the next key event is bound to one// or more pre-defined edit commands. If the next key event is not handled// by blink, the specified edit commands shall be executed against current// focused frame.// Parameters// * edit_commands// See t_p/b/renderer/core/editing/commands/editing_command_type.h// Contains one or more edit commands.// See t_p/b/renderer/core/editing/commands/editor_command.cc for// detailed definition of webkit edit commands.//// This message must be sent just before sending a key event.SetEditCommandsForNextKeyEvent(array<EditCommand> commands);// Sends the cursor visibility state to the render widget.CursorVisibilityChanged(bool visible);// This message sends a string being composed with an input method.// Note, the response is specifically for Devtools to learn about completionImeSetComposition(mojo_base.mojom.String16 text,array<ui.mojom.ImeTextSpan> ime_text_spans,gfx.mojom.Range range, int32 start, int32 end) => ();// This message deletes the current composition, inserts specified text, and// moves the cursor.// Note, the response is specifically for Devtools to learn about completionImeCommitText(mojo_base.mojom.String16 text,array<ui.mojom.ImeTextSpan> ime_text_spans,gfx.mojom.Range range, int32 relative_cursor_position) => ();// This message inserts the ongoing composition.ImeFinishComposingText(bool keep_selection);// Request from browser to update text input state.RequestTextInputStateUpdate();// Request from browser to update the cursor and composition information which// will be sent through ImeCompositionRangeChanged. Setting// |immediate_request| to true will lead to an immediate update. If// |monitor_updates| is set to true then changes to text selection or regular// updates in each compositor frame (when there is a change in composition// info) will lead to updates being sent to the browser.RequestCompositionUpdates(bool immediate_request, bool monitor_request);// Sends an input event to the render widget. The browser should use this// API if it wants to know about the result of the rendering handling// the event. The callback may be delayed based on the event running on// the main thread so DispatchNonBlockingEvent is always preferred if// you don't require notification.DispatchEvent(Event event)=> (blink.mojom.InputEventResultSource source,ui.mojom.LatencyInfo updated_latency,blink.mojom.InputEventResultState state,DidOverscrollParams? overscroll,TouchActionOptional? touch_action);// Sends a non-blocking input event to the render widget. The behaviour// of this API is the same as DispatchEvent just that there is no callback// after the event is processed.DispatchNonBlockingEvent(Event event);// Forces input to be flushed and resolves the callback only once the input// has been fully processed, meaning its effects are visible to the full// system. In practice, this will force a redraw and wait until the new// CompositorFrame (containing all changes caused by prior input) has been// displayed.WaitForInputProcessed() => ();// Attach the synchronous compositor interface. This method only// should be called for Android WebView.[EnableIf=is_android]AttachSynchronousCompositor(pending_remote<SynchronousCompositorControlHost> control_host,pending_associated_remote<SynchronousCompositorHost> host,pending_associated_receiver<SynchronousCompositor> compositor_request);// Return an associated FrameWidgetInputHandler interface so that input// messages to the frame associated with this widget can be sent// serially.GetFrameWidgetInputHandler(pending_associated_receiver<FrameWidgetInputHandler> interface_request);// Notifies the renderer whether hiding/showing the browser controls is// enabled, what the current state should be, and whether or not to// animate to the proper state.UpdateBrowserControlsState(cc.mojom.BrowserControlsState constraints,cc.mojom.BrowserControlsState current,bool animate);
};
可見,這個生成的不止一個事件,包含了大部分用戶事件了。去看看所在目錄,也初步熟悉blink的目錄結構了:
后面我們再詳細介紹這套mojo跨進程通信框架的細節。
言歸正傳,我們attach到render進程,繼續打斷點跟蹤鼠標事件。
想在渲染進程找到WidgetInputHandler類對應的消息,只需要在后面加個Impl即可,我們通過符號跳轉,找到WidgetInputHandlerImpl類:
接下來,中間彎彎繞繞太多,我們直接在node打個斷點,看看鼠標事件如何傳遞到node具體的處理邏輯中的:
事件首先被捕獲并封裝為一個MouseEvent
或PointerEvent
對象。然后通過EventDispatcher
類和相關方法,事件被分發到目標Node
。對應的HTML元素(通過HTMLElement
和特化的TextControlInnerEditorElement
)會處理這個事件,然后這個時間觸發默認基類的處理器。
WebFrameWidget:繼承自WebWidget
,但是大部分函數依然是虛函數,用于專門負責處理一個WebFrame
(即一個網頁框架)的渲染和事件。如果要想顯示一個網頁,那么需要繼承自WebFrameWidget并實現相關純虛函數。
WebFrameWidgetImpl:是WebFrameWidget在blink里
的一個具體實現類。它實現了接口中定義的所有方法,并提供了框架的具體行為。WebFrameWidgetImpl
負責實際的繪圖邏輯、輸入事件處理、視圖的更新和大小調整等。這個實現將抽象的行為具體化,它通過調用Blink渲染引擎的其他組件,如布局引擎、繪圖系統等,來完成對Web內容的渲染和事件處理。WebFrameWidgetImpl
是Web內容在Blink中被渲染的核心地方,它直接與底層的渲染流水線交互。
講到這里,也要引出一個重量級的基類:Node:
Node
類是一個非常重要的基類,它表示文檔中的一個節點。Node
類扮演了核心角色,以下是對Node
類及其作用的更詳細介紹:
層次結構:
Node
類位于 DOM(文檔對象模型)層次結構中的基礎位置。在 DOM 中,所有的元素、文本內容和屬性都被視為節點。因此,Node
類提供了一系列的基礎方法和屬性,用于管理節點之間的關系(例如父子關系和兄弟關系)、節點的遍歷以及節點的基本操作(比如插入、替換、刪除)。事件模型:
Node
類還是事件模型的一部分。在 Blink 中,事件通常會在節點之間傳播,遵循捕獲階段、目標階段和冒泡階段。Node
類提供了事件處理的相關方法,比如綁定事件監聽器、分發事件等。當事件發生時,比如用戶的點擊或鍵盤操作,它會被分發到正確的Node
對象,并且通過事件監聽器進行相應的處理。渲染:
雖然Node
類本身并不直接處理渲染,但它的派生類,如Element
和特化后的HTMLElement
,會存儲與渲染相關的屬性(例如樣式信息)。這些類將Node
的基礎功能擴展到了渲染管線中,這樣渲染引擎就可以使用這些信息來呈現視覺元素。類型多樣性:
DOM 中有不同類型的節點,包括但不限于元素節點(Element
)、文本節點(Text
)、注釋節點(Comment
)和文檔節點(Document
)。Node
類是這些具體節點類型的基類,通過繼承和多態,不同類型的節點可以擁有特化的行為,并且仍然能夠通過Node
類的接口進行通用處理。API 和腳本接口:
Node
類還提供了一系列的 API,供 JavaScript 腳本訪問和操縱節點。開發者可以通過這些 API 查詢節點信息、修改節點結構、動態添加或移除節點等。這為開發者提供了強大的能力來改變頁面內容和行為。總的來說,
Node
類是 Blink 和 Web 開發者用來操作和理解 Web 頁面結構的基礎,它為構建、遍歷和交互提供了基本的框架。通過Node
及其派生類,Blink 渲染引擎能夠將結構化的 HTML 文檔轉化為用戶可以看到和交互的網頁。
未完待續 ....