When set (defaults to not set/false), --dump will include iframe contents.
I was hoping I could add a strip_mode value for this, but since dump is used
extensively (e.g. innerHTML), this is something that has to be off by default
(for correctness).
Adds a new `mcp` run mode to start an MCP server over stdio.
Implements tools for navigation and JS evaluation, along with
resources for HTML and Markdown page content.
At a high level, this does for Events what was recently done for XHR, Fetch and
Observers. Events are self-contained in their own arena from the ArenaPool and
are registered with v8 to be finalized.
But events are more complicated than those other types. For one, events have
a prototype chain. (XHR also does, but it's always the top-level object that's
created, whereas it's valid to create a base Event or something that inherits
from Event). But the _real_ complication is that Events, unlike previous types,
can be created from Zig or from V8.
This is something that Fetch had to deal with too, because the Response is only
given to V8 on success. So in Fetch, there's a period of time where Zig is
solely responsible for the Response, until it's passed to v8. But with events
it's a lot more subtle.
There are 3 possibilities:
1 - An Event is created from v8. This is the simplest case: it simply becomes a
weak reference for us. When v8 is done with it, the finalizer is called.
2 - An Event is created in Zig (e.g. window.load) and dispatched to v8. Again
we can rely on the v8 finalizer.
3 - An Event is created in Zig, but never dispatched to v8 (e.g. there are no
listeners); Zig has to release the event.
(It's worth pointing out that one thing that still keeps this relatively
straightforward is that we never hold on to Events past some pretty clear point)
Now, it would seem that #3 is the only issue we have to deal with, and maybe
we can do something like:
```
if (event_manager.hasListener("load", capture)) {
    try event_manager.dispatch(event);
} else {
    event.deinit();
}
```
In fact, in many cases, we could use this to optimize not even creating the
event:
```
if (event_manager.hasListener("load", capture)) {
    const event = try createEvent("load", capture);
    try event_manager.dispatch(event);
}
```
And that's an optimization worth considering, but it isn't good enough to
properly manage memory. Do you see the issue? There could be a listener (so we
think v8 owns it), but we might never give the value to v8. Any failure between
hasListener and actually handing the value to v8 would result in a leak.
To solve this, the bridge will now set a _v8_handover flag (if present) once it
has created the finalizer_callback entry. So dispatching code now becomes:
```
const event = try createEvent("load", capture);
defer if (!event._v8_handover) event.deinit(false);
try event_manager.dispatch(event);
```
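To make the ordering explicit, here's a minimal sketch of the bridge side. Only
the `_v8_handover` field comes from the description above; the function name and
shape are assumptions:
```zig
// Sketch: the bridge flips the flag only after the finalizer_callback entry
// has been created. Until then, the caller's
// `defer if (!event._v8_handover) event.deinit(false)` remains responsible
// for cleanup, so a failure anywhere before this point can't leak.
fn markHandedOver(comptime T: type, value: *T) void {
    if (comptime @hasField(T, "_v8_handover")) {
        value._v8_handover = true;
    }
}
```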
The v8 finalizer callback was also improved. Previously, we just embedded the
pointer to the Zig object. In the v8 callback, we could cast that back to T
and call deinit. But, because of possible timing issues between when (if) v8
calls the finalizer, and our own cleanup, the code would check in the context to
see if the ptr was still valid. Wait, what? We're using the ptr to get the
context to see if the ptr is valid?
We now store a pointer to the FinalizerCallback which contains the context.
So instead of something stupid like:
```
// note, if the identity_map doesn't contain the value, then value is likely
// invalid, and value.page will segfault
value.page.js.identity_map.contains(@intFromPtr(value))
```
We do:
```
if (fc.ctx.finalizer_callbacks.contains(@intFromPtr(fc.value))) {
    // fc.value is safe to use
}
```
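For context, a rough self-contained sketch of the shape this implies (all names
here are assumptions): the weak callback receives the FinalizerCallback pointer,
which carries its own context, so the registry can be consulted before the
possibly-freed value is ever dereferenced.
```zig
const std = @import("std");

const Context = struct {
    // keyed by @intFromPtr of the Zig object; an entry exists only while the
    // object is still alive and registered with v8
    finalizer_callbacks: std.AutoHashMapUnmanaged(usize, *FinalizerCallback),
};

const FinalizerCallback = struct {
    ctx: *Context, // valid for the life of the page/context
    value: *anyopaque, // the Zig object; may already have been freed

    // called from the v8 weak/finalizer callback with the FinalizerCallback
    // pointer (not the raw value), so this check never touches freed memory
    fn finalize(fc: *FinalizerCallback) void {
        if (fc.ctx.finalizer_callbacks.contains(@intFromPtr(fc.value))) {
            // fc.value is still registered: safe to cast back to T and deinit
        }
    }
};
```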
Currently the sighandler is set up regardless of the running mode, but it only
does something in "serve" mode. In fetch mode, since there are no registered
listeners, it intercepts the signal and does nothing. On macOS at least, this
isn't a great experience, as it can leave the process running in the background.
This adds a crash handler which reports a crash (if telemetry is enabled). On a
crash, it looks for `curl` (using the PATH env) and forks the process to then
call execve. This relies on a new endpoint being set up to accept the "report".
Also, we include very little data... I figured just knowing about crashes would
be a good place to start.
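For illustration, a minimal sketch of the reporting step. The endpoint URL and
payload are placeholders, and it uses std.process.Child rather than the raw
fork/execve described above (the real handler avoids doing work in-process
precisely because it's running after a crash):
```zig
const std = @import("std");

// Hand the report off to whatever `curl` is found on PATH.
fn reportCrash(allocator: std.mem.Allocator) !void {
    var child = std.process.Child.init(&.{
        "curl",
        "--silent",
        "--max-time", "5",
        "--data", "{\"event\":\"crash\"}", // placeholder payload
        "https://telemetry.example.com/crash", // placeholder endpoint
    }, allocator);
    // a crash report is fire-and-forget; don't inherit any stdio
    child.stdin_behavior = .Ignore;
    child.stdout_behavior = .Ignore;
    child.stderr_behavior = .Ignore;
    _ = try child.spawnAndWait();
}
```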
A panic handler is provided, which overrides Zig's default handler and hooks
into the crash handler.
An `assert` function is added and hooks into the crash handler. This is
currently only used in one place (Session.zig) to demonstrate its use. In
addition to reporting a failed assert, the assert aborts execution in
ReleaseFast (as opposed to undefined behavior with std.debug.assert).
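A sketch of those assert semantics (the reporting hook is an assumption, shown
as a comment): in Debug and ReleaseSafe a failed assert still panics with a
trace, while in ReleaseFast it aborts instead of turning into undefined behavior
like std.debug.assert would:
```zig
const std = @import("std");
const builtin = @import("builtin");

pub fn assert(ok: bool) void {
    if (ok) return;
    // crash_handler.report(...); // assumed hook into the crash handler
    if (builtin.mode == .Debug or builtin.mode == .ReleaseSafe) {
        @panic("assertion failure"); // keep the panic trace in safe builds
    }
    // in ReleaseFast/ReleaseSmall, abort rather than continuing into UB
    std.process.abort();
}
```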
I want to hook this into the v8 global error handler, but only after direct_v8
is merged.
Much of this is inspired by bun's code. They have their own assert (1) and
a [more sophisticated] crashHandler (2).
(1) beccd01647/src/bun.zig (L2987)
(2) beccd01647/src/crash_handler.zig (L198)
There are two layers here. The first is that, on startup, a v8 SnapshotCreator
is created, and a snapshot-specific isolate/context is set up with our browser
environment. This contains most of what was in Env.init and a good chunk of
what was in ExecutionWorld.createContext. From this, we create a v8.StartupData
which is used for the creation of all subsequent contexts. The snapshot sits
at the application level, above the Env - it's re-used for all envs/isolates, so
this gives a nice performance boost both for 1 connection opening multiple pages
and for multiple connections each opening 1 page.
The second layer is that the Snapshot data can be embedded into the binary, so
that it doesn't have to be created on startup, but rather created at build-time.
This improves the startup time (though, I'm not really sure how to measure that
accurately...).
The first layer is the big win (and just works as-is without any build / usage
changes).
|                       | with snapshot | without snapshot |
| --------------------- | ------------- | ---------------- |
| total runs            | 1000          | 1000             |
| total duration (ms)   | 7527          | 9350             |
| avg run duration (ms) | 7             | 9                |
| min run duration (ms) | 5             | 8                |
| max run duration (ms) | 41            | 42               |
To embed a snapshot into the binary, we first need to create the snapshot file:
```
zig build -Doptimize=ReleaseFast snapshot_creator -- src/snapshot.bin
```
And then build using the new snapshot_path argument:
```
zig build -Dsnapshot_path=../../snapshot.bin -Doptimize=ReleaseFast
```
The paths are weird, I know...since it's embedded, it needs to be inside the
project path, hence we put it in src/snapshot.bin. And since it's embedded
relative to the embedder (src/browser/js/Snapshot.zig) the path has to be
relative to that, hence ../../snapshot.bin. I'm open to suggestions on
improving this.
First, this exposes the v8 Profiler. Right now it's just a commented-out block
in `fetch` and meant for internal debugging.
Depends on: https://github.com/lightpanda-io/zig-v8-fork/pull/105
Use postAttach on Window to attach "static" properties. This comes from
profiling (lightpanda.io) and seeing window.get_self called tens of thousands
of times.
--noscript is deprecated (warning) and automatically maps to --strip_mode js
--strip_mode takes a comma separated list of values. From the help:
- "js" script and link[as=script, rel=preload]
- "ui" includes img, picture, video, css and svg
- "css" includes style and link[rel=stylesheet]
- "full" includes js, ui and css
Maybe this is overkill, but I sometimes find myself looking at --dump outputs
over and over again, and removing noise (like HUGE svgs) seems like a small
improvement.
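For example (a hypothetical invocation; the binary name and URL are
placeholders, the flags are the ones described above):
```
lightpanda fetch --dump --strip_mode ui,css https://example.com
```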
There is some risk to this change. The first is that I made a mistake somewhere.
The other is that one of the APIs that doesn't currently return an error might
start returning one in the future.
Further reducing bouncing between page and server for loop polling. If there is
a page, the page polls. If there isn't a page, the server polls. Simpler.
Fix events.get_timeStamp (was events.get_timestamp, wrong casing).
Rename `newRunner` to `htmlRunner`.
Move tests to src/tests (from src/browser/tests). src/runtime and possibly other
parts might want to have html tests too.
Follows up on https://github.com/lightpanda-io/browser/pull/994 and replaces
the jsRunner with a new page.navigation-based test runner.
Currently this is only implemented for the Window tests; I'm looking for
feedback, and converting every existing test will take time - so for a while,
newRunner (to be renamed) will sit side-by-side with jsRunner.
In addition to the benefits outlined in 994, largely around code simplicity and
putting more of the actual code under tests, I think our WebAPI tests
particularly benefit from:
1 - No need to recompile when modifying the html tests
2 - Much better assertions, e.g. you can assert that something is actually an
array, not just a string representation of an array
3 - Ability to test some edge cases (e.g. dynamic script loading)
I've put some effort into testing.js to make sure that, if the encapsulating
zig test passes, it's because it actually passed, not because it didn't run.
For the time being, console tests are removed. I think it's more useful to have
access to the console within tests, than it is to test the console (which is
just a wrapper around log, which is both tested and heavily used).
Depends on https://github.com/lightpanda-io/browser/pull/993
There are currently 3 ways to execute a page:
1 - page.navigate (as used in both the 'fetch' and 'serve' commands)
2 - jsRunner as used in unit tests
3 - main_wpt as used in the WPT runner
Both jsRunner and main_wpt replicate the page.navigate code, but in their own
hack-ish way. main_wpt re-implements the DOM walking in order to extract and
execute <script> tags, as well as the needed page lifecycle events.
This PR replaces the existing main_wpt loader with a call to page.navigate. To
support this, a test HTTP server was added. (The test HTTP server is extracted
from the existing unit test test server, and re-used between the two).
There are benefits to this approach:
1 - The code is simpler
2 - More of the actual code and flow is tested
3 - There's 1 way to do things (page.navigate)
4 - Having an HTTP server might unlock some WPT tests
Technically, we're replacing file IO with network IO (i.e. HTTP requests). This
has potential downsides:
1 - The tests might be more brittle
2 - The tests might be slower
I think we need to run it for a while to see if we get flaky behavior.
The goal for following PRs is to bring this unification to the jsRunner.
Removes the optional Platform, which only existed for tests.
There is now a global `@import("testing.zig").test_app` available. This is set
up when the test runner starts and cleaned up at the end of the tests. Individual
tests don't have to worry about creating an App, which I assume was the reason I
made Platform optional, since that would have been something else that needed to
be set up.
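For example, something along these lines (the exact type and shape of test_app
is an assumption; only the import comes from above):
```zig
const testing = @import("testing.zig");

test "uses the shared app" {
    // test_app is created once when the test runner starts and torn down
    // after the last test, so individual tests just reach for it
    const app = testing.test_app;
    _ = app;
}
```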
Previously, the IO loop was doing three things:
1 - Managing timeouts (either from scripts or for our own needs)
2 - Handling browser IO events (page/script/xhr)
3 - Handling CDP events (accept, read, write, timeout)
With the libcurl merge, 1 was moved to an in-process scheduler and 2 was moved
to libcurl's own event loop. That means the entire loop code, including
the dependency on tigerbeetle-io, existed solely to handle a single TCP client.
Not only is that a lot of code, there was also friction between the two loops
(the libcurl one and our IO loop), which would result in latency - while one
loop is waiting for events, any events on the other loop go unprocessed.
This PR removes our IO loop. To accomplish this:
1 - The main accept loop is blocking. This is simpler and works perfectly well,
given we only allow 1 active connection.
2 - The client socket is passed to libcurl - yes, libcurl's loop can take
arbitrary FDs and poll them along with its own (see the sketch below).
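For point 2, a rough sketch of the mechanism (not the project's actual code, but
curl_multi_poll really does accept extra file descriptors to watch alongside
libcurl's own transfers):
```zig
const c = @cImport(@cInclude("curl/curl.h"));

// Poll libcurl's transfers and the CDP client socket in a single call.
// Returns true when the CDP socket has data ready to read.
fn pollOnce(multi: *c.CURLM, client_fd: c_int) !bool {
    var waitfd = c.struct_curl_waitfd{
        .fd = client_fd,
        .events = c.CURL_WAIT_POLLIN,
        .revents = 0,
    };
    var numfds: c_int = 0;
    if (c.curl_multi_poll(multi, &waitfd, 1, 100, &numfds) != c.CURLM_OK) {
        return error.CurlMultiPoll;
    }
    // let libcurl make progress on any of its own transfers
    var running: c_int = 0;
    _ = c.curl_multi_perform(multi, &running);
    return (waitfd.revents & c.CURL_WAIT_POLLIN) != 0;
}
```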
In addition to having one less dependency, the CDP code is quite a bit simpler,
especially around shutdowns and writes. This also removes _some_ of the latency
caused by the friction between page processing and CDP processing. Specifically,
now when CDP blocks for input, HTTP page events (script loading, xhr, ...) will
still be processed.
There's still friction. For one, the reverse isn't true: when the page is
waiting for events, CDP events aren't going to be processed. But the page.wait
already has some sensitivity to this (e.g. the page.request_intercepted flag).
Also, when CDP waits, while we will process network events, page timeouts are
still not processed. Because of both these remaining issues, we still need to
jump between the two loops - but being able to block on CDP (even for a short
time) WITHOUT stopping the page's network I/O should reduce some latency.