browser

mirror of https://github.com/lightpanda-io/browser.git synced 2026-03-22 04:34:44 +00:00

Author	SHA1	Message	Date
Pierre Tachoire	d669d5c153	cdp: add a dummy Page.getLayoutMetrics	2026-03-10 08:54:48 +01:00
egrs	dc3958356d	address review feedback - TreeWalker.Full instead of FullExcludeSelf so querying a specific nodeId evaluates the root element itself - resolve href to absolute URL via URL.resolve - isDisabled checks ancestor <fieldset disabled> with legend exemption - parameter order: allocator before *Page per convention	2026-03-10 08:13:01 +01:00
Nikolay Govorov	8e59ce9e9f	Prepare global NetworkRuntime module	2026-03-10 03:00:47 +00:00
Adrià Arrufat	a318c6263d	SemanticTree: improve visibility, AX roles and xpath generation - Use `checkVisibility` for more accurate element visibility detection. - Add support for color, date, file, and month AX roles. - Optimize XPath generation by tracking sibling indices during the walk. - Refine interactivity detection for form elements.	2026-03-10 09:23:06 +09:00
egrs	a417c73bf7	add LP.getInteractiveElements CDP command Returns a structured list of all interactive elements on a page: buttons, links, inputs, ARIA widgets, contenteditable regions, and elements with event listeners. Includes accessible names, roles, listener types, and key attributes. Event listener introspection (both addEventListener and inline handlers) is unique to LP — no other browser exposes this to automation code.	2026-03-09 19:46:12 +01:00
Pierre Tachoire	8672232ee2	cdp: add dummy page.captureScreenshot	2026-03-09 17:38:57 +01:00
Adrià Arrufat	85ebbe8759	SemanticTree: improve accessibility tree and name calculation - Add more structural roles (banner, navigation, main, list, etc.). - Implement fallback for accessible names (SVG titles, image alt text). - Skip children for leaf-like semantic nodes to reduce redundancy. - Disable pruning in the default semantic tree view.	2026-03-09 21:04:47 +09:00
Adrià Arrufat	c3a53752e7	CDP: simplify AXNode name extraction logic	2026-03-09 15:34:59 +09:00
Adrià Arrufat	b8a3135835	SemanticTree: add pruning support and move logic to walk	2026-03-09 13:02:03 +09:00
Adrià Arrufat	b674c2e448	CDP/MCP: add highly compressed text format for semantic tree	2026-03-08 22:42:00 +09:00
Adrià Arrufat	b8139a6e83	CDP/MCP: improve Stagehand compatibility for semantic tree	2026-03-08 15:48:44 +09:00
Adrià Arrufat	e0f0b9f210	SemanticTree: use AXRole enum for interactive role check	2026-03-06 16:26:08 +09:00
Adrià Arrufat	248851701f	Refactor: move SemanticTree to core and expose via MCP tools	2026-03-06 15:44:03 +09:00
Adrià Arrufat	0f46277b1f	CDP: implement LP.getSemanticTree for native semantic DOM extraction	2026-03-06 15:29:32 +09:00
Karl Seguin	6c5efe6ce0	Merge pull request #1715 from lightpanda-io/cdp-frame-navigate cdp: don't dispatch executionContextsCleared on frame navigation	2026-03-04 22:02:30 +08:00
Pierre Tachoire	6a8174a15c	cdp: don't dispatch executionContextsCleared on frame navigation	2026-03-04 14:45:21 +01:00
Pierre Tachoire	40c3f1b618	cdp: fix req id resolver, they are REQ- not RID-	2026-03-04 13:00:16 +01:00
Karl Seguin	01fab5c92a	Merge pull request #1706 from lightpanda-io/cdp-attach-to-browser cdp: fix send CDP raw command with Playwright	2026-03-04 07:40:05 +08:00
Karl Seguin	6f0cd87d1c	Merge pull request #1703 from lightpanda-io/client_and_script_manager Fix a few issues in Client	2026-03-04 07:32:14 +08:00
Pierre Tachoire	9ca5188e12	cdp: set consistent target's default with about:blank for url and empty title.	2026-03-03 17:24:08 +01:00
Pierre Tachoire	56cc881ac0	Fcdp: fix attachtToTarget and attachToBrowserTarget resp	2026-03-03 15:01:53 +01:00
Pierre Tachoire	06ef6d3e6a	cdp: attachToTarget must add the session id	2026-03-03 12:58:00 +01:00
Pierre Tachoire	14b58e8062	add target.attachToBrowserTarget	2026-03-03 12:58:00 +01:00
Pierre Tachoire	eee232c12c	cdp: allow multiple calls to attachToTarget Playwright, when creating a new CDPSession, sends an attachToBrowserTarget followed by another attachToTarget to re-attach itself to the existing target. see playwright/axtree.js from demo/ repository.	2026-03-03 12:58:00 +01:00
Karl Seguin	523efbd85a	Fix a few issues in Client Most significantly, if removing from the multi fails, the connection is added to a "dirty" list for the removal to be retried later. Looking at the curl source code, remove fails on a recursive call, and we've struggled with recursive calls before, so I _think_ this might be happening (it fails in other cases, but I suspect if it _is_ happening, it's for this reason). The retry happens _after_ `perform`, so it cannot fail for due to recursiveness. If it fails at this point, we @panic. This is harsh, but it isn't easily recoverable and before putting effort into it, I'd like to know that it's actually happening. Fix potential use of undefined when a 401-407 request is received, but no 'WWW-Authenticate' or 'Proxy-Authenticate' header is received. Don't call `curl_multi_remove_handle` on an easy that hasn't been added yet do to error. Specifically, if `makeRequest` fails during setup, transfer_conn is nulled so that `transfer.deinit()` doesn't try to remove the connection. And the conn is removed from the `in_use` queue and made `available` again. On Abort, if getting the private fails (extremely unlikely), we now still try to remove the connection from the multi. Added a few more fields to the famous "ScriptManager.Header recall" assertion.	2026-03-03 18:02:06 +08:00
Adrià Arrufat	b2e301418f	cdp.lp: use page.document instead of window._document	2026-03-03 17:11:16 +09:00
Adrià Arrufat	334a2e44a1	lp: simplify dom_node resolution in getMarkdown	2026-03-03 17:08:43 +09:00
Adrià Arrufat	c9121a03d2	cdp: move LP.getMarkdown test to lp domain	2026-03-03 16:39:31 +09:00
Adrià Arrufat	cc93180d57	cdp: add LP domain and getMarkdown method This PR introduces a custom CDP domain 'LP' (Lightpanda) to expose browser-specific tools. The first method, 'LP.getMarkdown', allows retrieving a Markdown representation of the DOM or a specific node by its 'nodeId'. This is optimized for AI agents and LLM-based scraping tasks.	2026-03-03 16:35:48 +09:00
Karl Seguin	7695c8403f	Merge pull request #1692 from lightpanda-io/rename_page_id_to_frame_id Rename page.id to page._frame_id	2026-03-02 17:40:43 +08:00
Karl Seguin	10ad5d763e	Rename page.id to page._frame_id This field was recently added and is used to generate correct frameIds in CDP messages. They remain the same during a navigation event, so calling them page.id might cause surprises since navigation events create new pages, but retain the original id. Hence, frame_id is more accurate and hopefully less surprising. (This is a small cleanup prior to doing some iframe navigation work).	2026-03-02 16:21:29 +08:00
Karl Seguin	03b999c592	Remove redundant CDP v8 shutdown https://github.com/lightpanda-io/browser/pull/1614 improved our shutdown behavior so that microtasks associated with a context wouldn't fire after the context was disposed of. This involved having context-specific microtasks, pumping the message loop, and prevent re-entry. The shutdown code in CDP already had much of this behavior built-in, but it has now become redundant. Most importantly the CDP shutdown logic did not prevent re-entry. Removing this code fixes a flaky WPT crash. I didn't seem to be tied to a specific test, but rather a cross-context/page use-after-free that was saw prior to 1614. I could reproduce it reliably by running `/wasm/core/`. I'll be honest, it isn't clear to me why _removing_ the CDP cleanup helps. Running the message loop and microtask _before_ our normal shutdown might be unnecessary, but why would it crash? I don't know, but the CDP path is slightly different in that it also involves Inspector shutdown. So there's still something about this flow I don't quite understand. And, at least for this case the current flow seems "correct".	2026-03-02 10:24:07 +08:00
Karl Seguin	e65667963f	Correctly JSON encode URL I think this code comes from some serialization tweak from when everything was an std.Uri and by switch to [:0]const u8 everywhere not only was the tweak unecessary, it was also wrong - possibly resulting in the generation of invalid JSON.	2026-02-28 12:48:45 +08:00
Karl Seguin	e4cb78abee	Merge pull request #1670 from lightpanda-io/cdata_sso Change CData._data from []const to String (SSO)	2026-02-27 17:30:03 +08:00
Karl Seguin	870fd1654d	Change CData._data from []const to String (SSO) After looking at a handful of websites, the # of Text and Commend nodes that are small (<= 12 bytes) is _really_ high. Ranging from 85% to 98%. I thought that was high, but a lot of it is indentation or a sentence that's broken down into multiple nodes, eg: <div><b>sale!</b> <span class=price>$1.99</span> buy now<div> So what looks like 1 sentence to us, is actually 3 text nodes. On a typical website, we should see thousands of fewer allocations in the page arena for the text in text nodes.	2026-02-27 12:53:54 +08:00
Karl Seguin	315c9a2d92	Add RC support to NodeList Most importantly, this allows the Selector.List to be self-contained with an arena from the ArenaPool. Selector.List can be both relatively large and relatively common, so moving it off the page.arena is a nice win. Also applied this to ChildNodes, which is much smaller but could also be called often. I was initially going to hook into the v8::Object's internal fields to store the referencing v8::Object. So the v8::Object representing the Iterator would store the v8::Object representing the NodeList inside of its internal field - which the GC would trace/detect/respect. And that is probably the fastest and most v8-ish solution, but I couldn't come up with an elegant solution. The best I had was having a "afterCreate" callback which passed the v8 object (this is similar to the old postAttach callback we had, but used for a different purpose). However, since "acquireRef" was recently added to events, re-using that was much simpler and worked well.	2026-02-27 10:29:46 +08:00
Karl Seguin	641c7b2c89	Merge pull request #1661 from lightpanda-io/escape_navigate_url Some checks failed e2e-test / zig build release (push) Has been cancelled Details e2e-test / demo-scripts (push) Has been cancelled Details e2e-test / cdp-and-hyperfine-bench (push) Has been cancelled Details e2e-test / perf-fmt (push) Has been cancelled Details e2e-test / browser fetch (push) Has been cancelled Details zig-test / zig test using v8 in debug mode (push) Has been cancelled Details zig-test / zig test (push) Has been cancelled Details zig-test / perf-fmt (push) Has been cancelled Details Callers to page.navigate ensure URL is properly encoded.	2026-02-26 15:19:11 +08:00
Karl Seguin	21be3db51f	Callers to page.navigate ensure URL is properly encoded. Follow up to https://github.com/lightpanda-io/browser/pull/1646 The encodeURL (renamed to ensureEncoded and exposed in this commit) already handled already-encoded URLs, so this was largely a matter of exposing the functionality. The reason this isn't baked directly into Page.navigate is that, in some places e.g. internal navigation, the URL is already know to be encoded. So it's up to every caller to make sure they are passing a valid URL to navigate.	2026-02-26 12:22:06 +08:00
Karl Seguin	3bf596c54c	Merge pull request #1651 from lightpanda-io/more_pump_message_loop Run the MessageLoop [a lot] more.	2026-02-26 11:35:11 +08:00
Karl Seguin	af7498d283	Run the MessageLoop [a lot] more. Depends on: https://github.com/lightpanda-io/zig-v8-fork/pull/152 We previously ran the message loop every 250ms. This commit changes it to run on every tick (much more frequently). It also runs microtasks after draining the message loop (since it can generate microtasks). Also, we use to run microtasks after each script execution. Now we drain the message Loop + microtasks. We still only drain the microtasks when executing v8 callbacks. As part of this change, we also adjust our wait time based on whether or not there are pending background tasks in v8 in order to try to execute them (in general) and in a timely manner. The goal is to ensure that tasks v8 enqueued on the foreground thread are executed promptly. This change is particularly useful for calls to webassembly as compilation happens in the background and eventually requires the message loop to be drained to continue. Previously, if a script did `await WebAssembly.instantiate(....)`, there was a good chance we'd never finish the code - we'd wait too long to run the message loop AND, after running it, we wouldn't necessarily resolve the promise.	2026-02-25 13:55:35 +08:00
Nikolay Govorov	5fea1df42b	Move Net staff to clean network module	2026-02-25 05:31:19 +00:00
Karl Seguin	603e7d922e	Improve Context shutdown Under some conditions, a microtask would be executed for a context that was already deinit'd, resulting in various use-after-free. The culprit appears to be WASM compilation being placed in the microtask queue (by a user-script) and then resolved at some point in the future. We guard the microtask queue by a context.shutting_down boolean, but v8 doesn't know anything about this flag. The fact is that, microtasks are tied to an isolate, not a context. This commit introduces a number of changes: 1 - It follows `309f254c2c` and stores the zig Context inside of an embedder field. This ensures v8 doesn't consider this when GC'ing, which _could_ extend the lifetime of the v8::Context beyond what we expect 2 - Most significantly, it introduces per-context microtasks queues. Each context gets its own queue. This makes cleanup much simpler and reduces the chance of microtasks outliving the context 3 - pumpMessageLoop is called on context.deinit, this helps to ensure that any tasks v8 has for our context are processed (e.g. wasm compilation) before shtudown 4 - The order of context shutdown is important, we notify the isolate of the context destruction first, then pump the message loop and finally destroy the context's message loop. Depends on https://github.com/lightpanda-io/zig-v8-fork/pull/151	2026-02-21 13:02:43 +08:00
Karl Seguin	71d34592d9	add frame created cdp messages	2026-02-19 23:47:33 +08:00
Karl Seguin	db2927eea7	cleanup a not-so-great rebase	2026-02-19 23:47:33 +08:00
Karl Seguin	bb01a5cb31	Make CDP frame-aware	2026-02-19 23:47:33 +08:00
Karl Seguin	081979be3b	Initial support for frames Missing: - [ ] Navigation support within frames (in fact, as-is, any navigation done inside a frame, will almost certainly break things - [ ] Correct CDP support. I don't know how frames are supposed to be exposed to CDP. Normal navigate events? Distinct CDP frame_ids? - [ ] Cross-origin restrictions. The interaction between frames is supposed to change depending on whether or not they're on the same origin - [ ] Potentially handling src-less frames incorrectly. Might not really matter Adds basic frame support. Initially explored adding a BrowsingContext and embedding it in Page, with the goal of also having it embedded in a to-be created Frame. But it turns out that 98% of Page _was_ BrowsingContext and introducing a BrowsingContext as the primary interaction unit broke pretty much _every_ single WebAPI. So Page was expanded: - Added `_parent: ?Page`, which is `null` for "root" page. - Added `frame: ?IFrame`, which is `null` for the "root" page. This is the HTMLIFrameElement for frame-pages. - Added a _type: enum{root, frame}, which is currently only used to improve the logs - Added a frames: std.ArrayList(*Page). This is a list of frames for the page. Note that a "frame-page" can itself haven nested frames. Besides the above, there were 3 "big" changes. 1 - Adding frames (dynamically, parsed) has to create a new page, start navigation, track it (in the frames list). Part of this was just piggybacking off of code that handles <script> 2 - The page "load" event blocks on the frame "load" event. This cascades. when a page triggers it's load, it can do: ```zig if (self._parent) \|p\| { p.iframeLoaded(self); } ``` Pages need to keep track of how many iframes they're waiting to load. When all iframes (and all scripts) are loaded, it can then triggers its own load event. 3 - Our JS execution expects 1 primary entered context (the pages). But we now have multiple page contexts, and we need to be in the correct one based on where javascript is being executed. There is no more an default entered context. Creating a Local.Scope enters the context, and ls.deinit() exits the context.	2026-02-19 23:47:33 +08:00
Karl Seguin	938cd5e136	Merge pull request #1582 from lightpanda-io/cdp_per_page_frame_id Rework CDP frameIds (and loaderIds and requestIds and interceptorIds)	2026-02-19 22:16:52 +08:00
Karl Seguin	e2a1ce623c	Rework CDP frameIds (and loaderIds and requestIds and interceptorIds) Our BrowsingContext currently supports 1 target. So we have a per-BC target_id. Previously, our target had 1 "frame" - our page. So we often treated the targetId as the frameId. But to work with frames, we need page-specific frameIds and loaderIds. This tries to clean up our ids (a little). frameIds are now ids derived from a new incrementing page.id. This page.id has to be passed around (via http Requests and through notifications) in order to properly generate messages with a frameId.	2026-02-19 13:01:41 +08:00
Karl Seguin	645da2e307	Reduce cost of various Element render-related properties. Added a get-only `getStyle` which doesn't lazily create a new style if none exists. This can be used in the (frequently used) `checkVisibility` to avoid an allocation. Added a specialized getBoundingClientRectForVisible which skips the checkVisibility check, since a few callers have already done their own visibility check. DOMRect is now off the heap. This avoids _a lot_ of allocation when a DOMRect is only needed for internal calculation, e.g. in Document.elementFromPoint.	2026-02-19 09:45:56 +08:00
Nikolay Govorov	9296c10ca4	Use per-cdp connection HttpClient	2026-02-18 09:22:26 +00:00

1 2 3 4 5 ...

544 Commits