browser

mirror of https://github.com/lightpanda-io/browser.git synced 2026-03-22 12:44:43 +00:00

Author	SHA1	Message	Date
Karl Seguin	94ce5edd20	Frames on the same origin share v8 data Depends on: https://github.com/lightpanda-io/zig-v8-fork/pull/153 In some ways this is an extension of https://github.com/lightpanda-io/browser/pull/1635 but it has more implications with respect to correctness. A js.Context wraps a v8::Context. One of the important things it adds is the identity_map so that, given a Zig instance, we always return the same v8::Object. But imagine code running in a frame. This frame has its own Context, and thus its own identity_map. What happens when that frame does: ```js window.top.frame_loaded = true; ``` From Zig's point of view, `Window.getTop` will return the correct Zig instance. It will return the Window referenced by the "root" page. When that instance is passed to the bridge, we'll look for the v8::Object in the Context's `identity_map` but won't find it. The mapping exists in the root context `identity_map`, but not within this frame. So we create a new v8::Object and now our 1 zig instance has N v8::Objects for every page/frame that tries to access it. This breaks cross-frame scripting which should work, at least to some degree, even when frames are on the same origin. This commit adds a `js.Origin` which contains the `identity_map`, along with our other `v8::Global` storage. The `Env` now contains a `js.Origin` lookup, mapping an origin string (e.g. lightpanda.io:443) to an *Origin. When a Page's URL is changed, we call `self.js.setOrigin(new_url)` which will then either get or create an origin from the Env's origin lookup map. js.Origin is reference counted so that it remains valid so long as at least 1 frame references it. There's some special handling for null-origins (i.e. about:blank). At the root, null origins get a distinct/isolated Origin. For a frame, the parent's origin is used. Above, we talked about `identity_map`, but a `js.Context` has 8 other fields to track v8 values, e.g. `global_objects`, `global_functions`, `global_values_temp`, etc.
These all must be shared by frames on the same origin. So all of these have also been moved to js.Origin. They've also been merged so that we now have 3 fields: `identity_map`, `globals` and `temps`. Finally, when the origin of a context is changed, we set the v8::Context's SecurityToken (to that origin). This is a key part of how v8 allows cross-context access.	2026-03-11 08:43:40 +08:00
Pierre Tachoire	1ebf7460fe	Merge pull request #1768 from lightpanda-io/inspector_cleanup Call `resetContextGroup` on page removal	2026-03-10 15:32:47 +01:00
Karl Seguin	11fb5f990e	Call `resetContextGroup` on page removal Calling it here ensures that the inspector gets reset on internal page navigation. We were seeing intermittent segfaults on a problematic WPT test (/encoding/legacy-mb-japanese/euc-jp/) which I believe this solves. (The tests are still broken. Because we don't support form targets, they cause the root page to reload in a tight cycle, causing a lot of context creation / destruction, which I think was the issue. This commit doesn't fix the broken test but it hopefully fixes the crash). Also, clear out the Inspector's default_context when the default context is destroyed. (This was the first thing I did to try to fix the crash, it didn't work, but I believe it's correct).	2026-03-10 20:50:58 +08:00
Pierre Tachoire	d669d5c153	cdp: add a dummy Page.getLayoutMetrics	2026-03-10 08:54:48 +01:00
Pierre Tachoire	8672232ee2	cdp: add dummy page.captureScreenshot	2026-03-09 17:38:57 +01:00
Pierre Tachoire	6a8174a15c	cdp: don't dispatch executionContextsCleared on frame navigation	2026-03-04 14:45:21 +01:00
Karl Seguin	10ad5d763e	Rename page.id to page._frame_id This field was recently added and is used to generate correct frameIds in CDP messages. They remain the same during a navigation event, so calling them page.id might cause surprises since navigation events create new pages, but retain the original id. Hence, frame_id is more accurate and hopefully less surprising. (This is a small cleanup prior to doing some iframe navigation work).	2026-03-02 16:21:29 +08:00
Karl Seguin	21be3db51f	Callers to page.navigate ensure URL is properly encoded. Follow up to https://github.com/lightpanda-io/browser/pull/1646 The encodeURL (renamed to ensureEncoded and exposed in this commit) already handled already-encoded URLs, so this was largely a matter of exposing the functionality. The reason this isn't baked directly into Page.navigate is that, in some places e.g. internal navigation, the URL is already known to be encoded. So it's up to every caller to make sure they are passing a valid URL to navigate.	2026-02-26 12:22:06 +08:00
Karl Seguin	71d34592d9	add frame created cdp messages	2026-02-19 23:47:33 +08:00
Karl Seguin	db2927eea7	cleanup a not-so-great rebase	2026-02-19 23:47:33 +08:00
Karl Seguin	bb01a5cb31	Make CDP frame-aware	2026-02-19 23:47:33 +08:00
Karl Seguin	e2a1ce623c	Rework CDP frameIds (and loaderIds and requestIds and interceptorIds) Our BrowsingContext currently supports 1 target. So we have a per-BC target_id. Previously, our target had 1 "frame" - our page. So we often treated the targetId as the frameId. But to work with frames, we need page-specific frameIds and loaderIds. This tries to clean up our ids (a little). frameIds are now ids derived from a new incrementing page.id. This page.id has to be passed around (via http Requests and through notifications) in order to properly generate messages with a frameId.	2026-02-19 13:01:41 +08:00
Karl Seguin	14112ed294	Remove Page.reset Page.reset exists for 1 use case: multiple calls to the Page.navigate CDP method. At an extreme, something like this in puppeteer: ``` await page.goto(baseURL + '/campfire-commerce/'); await page.goto(baseURL + '/campfire-commerce/'); ``` Rather than handling this generically in Page, we now handle this case specifically at the CDP layer. If the page isn't in its initial load state, i.e. page._load_state != .waiting, then we reload the page from the session. For reloading, my initial inclination was to do session.removePage then session.createPage(). This behavior still seems potentially correct to me, but compared to our `reset`, this would trigger extra notifications, namely: self.notification.dispatch(.page_remove, .{}); and self.notification.dispatch(.page_created, page); Because of https://github.com/lightpanda-io/browser/pull/1265/ I guess that could have side effects. So, to keep the behavior as close to the current "reset", a new `session.replacePage()` has been added which behaves a lot like removePage + createPage, but without the notifications being sent. While I generally think this is just cleaner, this was largely driven by some planning for frame support. The entity for a Frame will share a lot with the Page (we'll extract that logic), so simplifying the Page, especially around initialization, helps simplify frame support.	2026-02-11 13:53:49 +08:00
Karl Seguin	7e575c501a	Add a dedicated browser_context and page_arena to CDP. The BrowserContext currently uses 3 arenas: 1 - Command-specific, which is like the call_arena, but for the processing of a single CDP command 2 - Notification-specific, which is similar, but for the processing of a single internal notification event 3 - Arena, which is just the session arena and lives for the duration of the BrowserContext/Session This is pretty coarse and can result in significant memory accumulation if a browser context is re-used for multiple navigations. This commit introduces 3 changes: 1 - Rather than referencing Session.arena, the BrowserContext.arena is now its own arena. This doesn't really change anything, but it does help keep things a bit better separated. 2 - Introduces a page_arena (not to be confused with Page.arena). This arena exists for the duration of 1 page, i.e. it's cleared when the BrowserContext receives the page_created internal notification. The `captured_responses` now uses this arena, which means captures only exist for the duration of the current page. This appears to be consistent with how chrome behaves (In fact, Chrome seems even more aggressive and doesn't appear to make any guarantees around captured responses). CDP refers to this lifetime as a "renderer" and has an experimental message, which we don't support, `Network.configureDurableMessages` to control this. 3 - Isolated Worlds are now more self contained with an arena from the ArenaPool. There are currently 2 places where the BrowserContext.arena is still used: 1 - the isolated_world list 2 - the custom headers Although this could be long lived, I believe the above is ok. We should just really think twice whenever we want to use it for anything else.	2026-02-03 15:48:27 +08:00
Karl Seguin	181f265de5	Rework Inspector usage V8's inspector world is made up of 4 components: Inspector, Client, Channel and Session. Currently, we treat all 4 components as a single unit which is tied to the lifetime of CDP BrowserContext - or, loosely speaking, 1 "Inspector Unit" per page / v8::Context. According to https://web.archive.org/web/20210622022956/https://hyperandroid.com/2020/02/12/v8-inspector-from-an-embedder-standpoint/ and conversation with Gemini, it's more typical to have 1 inspector per isolate. The general breakdown is the Inspector is the top-level manager, the Client is our implementation which controls how the Inspector works (it's the functions we expose that v8 calls into). These should be tied to the Isolate. Channels and Sessions are more closely tied to Context, where the Channel is v8->zig and the Session is zig->v8. This PR does a few things 1 - It creates 1 Inspector and Client per Isolate (Env.js) 2 - It creates 1 Session/Channel per BrowserContext 3 - It merges v8::Session and v8::Channel into Inspector.Session 4 - It moves the Inspector instance directly into the Env 5 - BrowserContext interacts with the Inspector.Session, not the Inspector 4 is arguably unnecessary with respect to the main goal of this commit, but the end-goal is to tighten the integration. Specifically, rather than CDP having to inform the inspector that a context was created/destroyed, the Env which manages Contexts directly (https://github.com/lightpanda-io/browser/pull/1432) and which now has direct access to the Inspector, is now equipped to keep this in sync.	2026-01-30 15:59:33 +08:00
Karl Seguin	4a1d71b6b8	Merge pull request #1437 from lightpanda-io/remove_unused Remove unused import	2026-01-30 06:55:11 +08:00
Karl Seguin	a19a125aec	Remove unused import And a few unused functions	2026-01-29 19:44:10 +08:00
Karl Seguin	1a05da9e55	Remove js.ExecutionWorld The ExecutionWorld doesn't do anything meaningful. It doesn't map to, or abstract any, v8 concepts. It creates a js.Context, destroys the context and points to the context. Those are all things the Env can do (and it isn't like the Env is over-burdened as-is). Plus the benefit of going through the Env is that we can track/collect all known Contexts for an isolate in 1 place (the Env), which can facilitate things like context creation/deletion notifications.	2026-01-29 11:22:01 +08:00
Karl Seguin	830f759f0b	zig fmt, remove unused code	2026-01-24 07:37:30 +08:00
Karl Seguin	9092651b5b	Merge branch 'main' into fix_context_lifetime	2026-01-20 08:50:41 +08:00
Karl Seguin	a6e7ecd9e5	Move more asserts to custom asserter. Deciding what should be an lp.assert, vs an std.debug.assert, vs a debug-only assert is a little arbitrary. debug-only asserts, guarded with an `if (comptime IS_DEBUG)` obviously avoid the check in release and thus have a performance advantage. We also use them at library boundaries. If libcurl says it will always emit a header line with a trailing \r\n, is that really a check we need to do in production? I don't think so. First, that code path is checked _a lot_ in debug. Second, it feels a bit like we're testing libcurl (in production!)..why? A debug-only assertion should be good enough to catch any changes in libcurl.	2026-01-19 09:12:16 +08:00
Karl Seguin	62aa564df1	Remove Global v8::Local<V8::Context> When we create a js.Context, we create the underlying v8.Context and store it for the duration of the page lifetime. This works because we have a global HandleScope - the v8.Context (which is really a v8::Local<v8::Context>) is tied to the global HandleScope, effectively making it a global. If we want to remove our global HandleScope, then we can no longer pin the v8.Context in our js.Context. Our js.Context now only holds a v8.Global of the v8.Context (v8::Global<v8::Context>). This PR introduces a new type, js.Local, which takes over a lot of the functionality previously found in either js.Caller or js.Context. The simplest way to think about it is: 1 - For v8 -> zig calls, we create a js.Caller (as always) 2 - For zig -> v8 calls, we go through the js.Context (as always) 3 - The shared functionality, which works on a v8.Context, now belongs to js.Local For #1 (v8 -> zig), creating a js.Local for a js.Caller is really simple and centralized. v8 largely gives us everything we need from the FunctionCallbackInfo or PropertyCallbackInfo. For #2, it's messier, because we can only create a local v8::Context if we have a HandleScope, which we may or may not. Unfortunately, in many cases, what to do becomes the responsibility of the caller and much of the code has to become aware of this local-ness. What does it mean for our code? The impact is on WebAPIs that store .Global. Because the global can't do anything. You always need to convert that .Global to a local (e.g. js.Function.Global -> js.Function). If you're 100% sure the WebAPI is only being invoked by a v8 callback, you can use `page.js.local.?.toLocal(some_global).call(...)` to get the local value. If you're 100% sure the WebAPI is only being invoked by Zig, you need to create `js.Local.Scope` to get access to a local: ```zig var ls: js.Local.Scope = undefined; page.js.localScope(&ls); defer ls.deinit(); ls.toLocal(some_global).call(...)
// can also access `&ls.local` for APIs that require a const js.Local ``` For functions that can be invoked by either V8 or Zig, you should generally push the responsibility to the caller by accepting a `local: const js.Local`. If the caller is a v8 callback, it can pass `page.js.local.?`. If the caller is a Zig callback, it can create a `Local.Scope`. As an alternative, it is possible to simply pass the *Page, and check `if page.js.local == null` and, if so, create a Local.Scope. But this should only be done for performance reasons. We currently only do this in 1 place, and it's because the Zig caller doesn't know whether a Local will actually be needed and it's potentially called on every element created from the parser.	2026-01-19 07:28:33 +08:00
Karl Seguin	8438b7d561	remove remaining direct v8 references	2026-01-13 12:58:30 +08:00
Karl Seguin	701de08e8a	have our js.Context directly hold a js handle	2026-01-13 12:57:06 +08:00
Karl Seguin	05cb5221d4	Quick-check sameness in Node.isEqualNode Exclusively use the not_implemented log filter.	2025-12-26 09:57:33 +08:00
Karl Seguin	d9c53a3def	Page.scheduleNavigation for location changes	2025-12-22 12:19:08 +08:00
Karl Seguin	f475aa09e8	backport https://github.com/lightpanda-io/browser/pull/1265	2025-12-19 16:06:25 +08:00
Karl Seguin	bb1ea39c54	backport a variety of smaller CDP changes	2025-12-19 10:31:07 +08:00
Pierre Tachoire	fe96bc7895	cdp: use default value for grantUniveralAccess In createIsolatedWorld, we set a default value to false for optional grantUniveralAccess parameter.	2025-12-19 10:10:41 +08:00
Muki Kiboigo	ac85341cab	add NavigationKind to navigate	2025-12-09 17:10:59 -08:00
Muki Kiboigo	370c3a49a7	initial Navigation	2025-12-09 16:51:01 -08:00
Pierre Tachoire	0d8dd84df5	support url on createTarget and send lifecycle events Support url parameter on createTarget. we now navigate on createTarget to dispatch events correctly, even in case of about:blank	2025-12-09 11:29:00 +01:00
Karl Seguin	bd3da38fc8	add native custom elements	2025-11-19 22:50:52 +08:00
Karl Seguin	d3973172e8	re-enable minimum viable CDP server	2025-10-28 18:56:03 +08:00
Karl Seguin	2e734fae57	This is the last of the big changes to the js code This PR largely tightens up a lot of the code. 'v8' is no longer imported outside of js. A number of helper functions have been moved to the js.Context. For example, js.Function.getName used to call: ```zig return js.valueToString(allocator, name, self.context.isolate, self.context.v8_context); ``` It now calls: ```zig return self.context.valueToString(name, .{ .allocator = allocator }); ``` Page.main_context has been renamed to `Page.js`. This, in combination with new promise helpers, turns: ```zig const resolver = page.main_context.createPromiseResolver(); try resolver.resolve({}); return resolver.promise(); ``` into: ```zig return page.js.resolvePromise({}); ```	2025-10-03 15:06:16 +08:00
Karl Seguin	dab8012b6a	Start extracting JS structs into their own files Renames JsContext -> js.Context, JsObject -> js.Object and JsThis -> js.This which is more consistent with the other types. The JsObject -> js.Object is the reason so many files were touched. This is still a [messy] transition, with more refactoring planned to clean it up.	2025-10-02 12:48:50 +08:00
Muki Kiboigo	05e7079178	functional history WebAPI	2025-09-24 00:21:16 -07:00
Karl Seguin	024f7ad9ef	Merge pull request #1056 from lightpanda-io/DOM_NO_ERR Convert more DOM_NO_ERR cases to assertions	2025-09-18 19:06:32 +08:00
Pierre Tachoire	94fe34bd10	cdp: multiple isolated worlds	2025-09-17 14:42:08 +02:00
Karl Seguin	58acb2b821	Convert more DOM_NO_ERR cases to assertions There is some risk to this change. The first is that I made a mistake. The other is that one of the APIs that doesn't currently return an error changes in the future.	2025-09-17 13:37:48 +08:00
Karl Seguin	ac10d5b2a3	Don't assume that page events means the BrowserContext has a page CDP currently assumes that if we get a page-related notification (like a request interception, or page lifecycle event), then we must have a session and page. But, Target.detachFromTarget can remove the session from the BrowserContext while still having the page run (I wonder if we should stop the page at this point??). So, remove these assumptions and make sure we have a page/session in the handling of page events.	2025-09-05 15:07:30 +08:00
Karl Seguin	5dda86bf4a	Emit networkIdle and networkAlmostIdle Page.lifecycleEvent Most CDP drivers have a mechanism to wait for idle network, or an almost idle network (sometimes called networkIdle2). These are events the browser must emit. The page will now emit `networkIdle` when we are reasonably sure there's no more network activity (this requires some slight changes to request interception, since, I believe, intercepted requests should be considered). `networkAlmostIdle` is currently _always_ emitted prior to emitting `networkIdle`. We should tweak this but I can't, at a glance, think of a great heuristic for when this should be emitted.	2025-09-04 16:36:29 +08:00
Karl Seguin	e237e709b6	Change loader id on navigation This appears to be what chrome is doing. I don't know why we weren't before.	2025-09-03 08:17:14 +08:00
Karl Seguin	f65a39a3e3	Re-enable telemetry Start work on supporting navigation events (clicks, form submission).	2025-08-11 21:37:00 +08:00
Karl Seguin	54ab1326e5	Switch XHR to new http client get puppeteer/cdp.js working again make test are all passing	2025-08-11 21:37:00 +08:00
Karl Seguin	b0fe5d60ab	Initial work on integrating libcurl and making all http nonblocking	2025-08-11 21:36:56 +08:00
Karl Seguin	fae2b5acfa	Noop CDP methods that go-rod requires go-rod appears to stop processing when it receives an error, such as UnknownMethod. Added placeholder handlers for Network.setUserAgentOverride and Page.stopLoading. Setting a custom user agent is something still being discussed, so no-oping it seems reasonable. And, due to the currently synchronous nature of the initial page load, no-oping stopLoading also seems reasonable. https://github.com/lightpanda-io/browser/issues/867	2025-07-14 11:21:02 +08:00
Karl Seguin	1602932d72	Add a "pre" polyfill This is always run but, unlike the full webcomponents polyfill, it's very small and isn't intrusive. This introduces a layer of indirection so that, if the full polyfill is loaded, its monkeypatched constructor will be called	2025-07-12 19:49:19 +08:00
Pierre Tachoire	941dace7f9	enable conditional loading for polyfill	2025-07-07 16:31:53 -07:00
sjorsdonkers	0c0ddc10ee	rename scope jscontext	2025-06-13 10:30:50 +02:00

76 Commits