browser

mirror of https://github.com/lightpanda-io/browser.git synced 2025-10-29 15:13:28 +00:00

Author	SHA1	Message	Date
Karl Seguin	1ebac06f4b	add debug line on cdp buffer growth	2025-08-29 10:55:36 +08:00
Karl Seguin	efc983b009	Start with 16K buffer (down from 32K). Use array list growth algorithm	2025-08-29 10:33:27 +08:00
Karl Seguin	74d90f2892	fix tests	2025-08-29 10:14:59 +08:00
Karl Seguin	56f1b6cc19	Make the CDP read buffer heap allocated & dynamic Rather than stack-allocating MAX_MESSAGE_SIZE upfront, we now allocate 32KB and grow the buffer as needed for larger messages, up to MAX_MESSAGE_SIZE. This will reduce memory usage for drivers that don't send huge payloads (like playwright does). While not implemented, this would also enable us to set the MAX_MESSAGE_SIZE at runtime (e.g. via a command line option).	2025-08-29 10:14:58 +08:00
Karl Seguin	8fdbaef4c7	Use posix.TCP.NODELAY now that it's available in MacOS also	2025-08-25 22:03:58 +08:00
Karl Seguin	0959eea677	Remove the loop Previously, the IO loop was doing three things: 1 - Managing timeouts (either from scripts or for our own needs) 2 - Handling browser IO events (page/script/xhr) 3 - Handling CDP events (accept, read, write, timeout) With the libcurl merge, 1 was moved to an in-process scheduler and 2 was moved to libcurl's own event loop. That means the entire loop code, including the dependency on tigerbeetle-io, existed for handling a single TCP client. Not only is that a lot of code, there was also friction between the two loops (the libcurl one and our IO loop), which would result in latency - while one loop is waiting for events, any events on the other loop go unprocessed. This PR removes our IO loop. To accomplish this: 1 - The main accept loop is blocking. This is simpler and works perfectly well, given we only allow 1 active connection. 2 - The client socket is passed to libcurl - yes, libcurl's loop can take arbitrary FDs and poll them along with its own. In addition to having one less dependency, the CDP code is quite a bit simpler, especially around shutdowns and writes. This also removes _some_ of the latency caused by the friction between page processing and CDP processing. Specifically, when CDP now blocks for input, http page events (script loading, xhr, ...) will still be processed. There's still friction. For one, the reverse isn't true: when the page is waiting for events, CDP events aren't going to be processed. But page.wait already has some sensitivity to this (e.g. the page.request_intercepted flag). Also, when CDP waits, while we will process network events, page timeouts are still not processed. Because of both these remaining issues, we still need to jump between the two loops - but being able to block on CDP (even for a short time) WITHOUT stopping the page's network I/O should reduce some latency.	2025-08-25 17:27:28 +08:00
Karl Seguin	f65a39a3e3	Re-enable telemetry Start work on supporting navigation events (clicks, form submission).	2025-08-11 21:37:00 +08:00
Karl Seguin	54ab1326e5	Switch XHR to new http client. Get puppeteer/cdp.js working again. `make test` tests are all passing.	2025-08-11 21:37:00 +08:00
Karl Seguin	4bf92a34f6	Don't queue data to send after we've initiated a disconnect of the client	2025-07-15 17:58:57 +08:00
Pierre Tachoire	886c1370e7	ws: increase max http message from 2kb to 4kb	2025-07-09 15:02:40 -07:00
Pierre Tachoire	404c027546	typo fix	2025-07-07 17:14:52 -07:00
Karl Seguin	c3f3eea7fb	Improve logging 1 - Make log_level a runtime option (not a build-time one) 2 - Make log_format a runtime option 3 - In Debug mode, allow for log scope filtering Improve the general usability of scopes. Previously, the scope was more or less based on the file that the log was in. Now they are more logically grouped. Consider the case where you want to silence HTTP request information: previously you'd have to filter out the `page`, `xhr` and `http_client` scopes, but that would also eliminate other page, xhr and http_client logs. Now, you can just filter out the `http` scope.	2025-06-02 21:38:56 +08:00
Karl Seguin	fe9344ce57	Try stateless logger (to save memory)	2025-05-27 19:57:58 +08:00
Karl Seguin	2feba3182a	Replace std.log with a structured logger Outputs in logfmt in release and a "pretty" print in debug mode. The format along with the log level will become arguments to the binary at some point in the future.	2025-05-27 19:57:58 +08:00
Pierre Tachoire	867c72ba90	fix comment	2025-05-21 18:02:33 +02:00
sjorsdonkers	3f6b095da4	Increase MAX_MESSAGE_SIZE	2025-05-21 17:51:25 +02:00
Karl Seguin	01aa826a24	Make intervals easier and faster, add window.setInterval and clearInterval When the browser microtask was added, zig-specific timeout functions were added to the loop. This was necessary for two reasons: 1 - The existing functions were JS specific 2 - We wanted a different reset counter for JS and Zig Like we did in https://github.com/lightpanda-io/browser/pull/577, the loop is now JS-agnostic. It gets a Zig callback, and the Zig callback can execute JS (or do whatever). An intrusive node, like with events, is used to minimize allocations. Also, because the microtask was recently moved to the page, there is no longer a need for separate event counters. All timeouts are scoped to the page. The new timeout callback can now be used to efficiently reschedule a task. This reuses the IO.completion and Context, avoiding 2 allocations. More importantly it makes the internal timer_id static for the lifetime of an "interval". This is important for window.setInterval, where the callback can itself clear the interval, which we would need to detect in the callback handler to avoid re-scheduling. With the stable timer_id, the existing cancel mechanism works as expected. The loop no longer has a cbk_error. Callback code is expected to try/catch callbacks (or use callback.tryCall) and handle errors accordingly.	2025-05-05 19:03:45 +08:00
Karl Seguin	b8e5e130b9	remove unused code, mostly imports	2025-05-05 13:29:41 +08:00
Karl Seguin	b8d7744563	replace zig-js-runtime	2025-04-15 15:18:04 +08:00
Karl Seguin	b2d9f835bf	Zig fmt	2025-03-31 15:29:54 +08:00
Karl Seguin	735772f43a	On a non websocket upgrade connection, close the connection Solves slow startup time with chromedp	2025-03-31 15:26:37 +08:00
Karl Seguin	21c9dde858	Zig 0.14 compatibility	2025-03-19 16:28:15 +01:00
Karl Seguin	6b83281539	Add navigate telemetry	2025-03-18 08:25:44 +08:00
Karl Seguin	accf2c0e5e	use async-client for telemetry	2025-03-18 08:02:09 +08:00
Karl Seguin	ca230aa230	Serialize socket writes + consider client pending completions when shutting down Previously, we could have multiple in-flight messages from the server to a single client. This isn't safe and can lead to message interleaving. While write / send are atomic, they are only atomic for the N bytes which they write, which may not be the entire buffer. Consider this writeAll function: ``` pub fn writeAll(socket: socket_t, bytes: []const u8) !void { var index: usize = 0; while (index < bytes.len) { index += try posix.write(socket, bytes[index..]); } } ``` If we're trying to send "abc123", this could take anywhere from 1 to 6 calls to posix.write (it would take 6 calls, for example, if every call to posix.write only wrote a single byte). Now if you're trying to write other data to this same socket at the same time, messages _will_ get interleaved. In order for this to work, the client now has a send_queue (doubly linked list). When one message is sent, it sends the next. In addition to the above change, the Client is now self-contained with respect to its lifetime. This is necessary so that completions which come in AFTER our concept of its lifetime ends, can still be processed. I think all types that receive completions need to follow this model. This relies on the fact that kqueue (which I know for a fact) and io_uring (which people seem to imply) handle socket shutdown properly. It's still a bit messy because of timeout and not wanting to wait until timeout to accept new connections, but needing to wait until timeout to cleanup the client. The self-contained nature of Client makes it difficult to test as a generic. I removed Client(T). Tests now use real sockets. Some tests had to be removed because they're too difficult to test over a real connection :(	2025-03-07 20:29:57 +08:00
Pierre Tachoire	908febb363	Merge pull request #441 from karlseguin/cdp_tests Turn CDP into a generic so that mocks can be injected for testing	2025-02-21 17:49:47 +01:00
Karl Seguin	99fb82e244	Turn CDP into a generic so that mocks can be injected for testing ADD CDP testing helpers (mock Browser, Session, Page and Client). These are placeholders until tests are added which use them. Added a couple CDP tests.	2025-02-21 13:17:35 +08:00
Karl Seguin	756d6620cc	Ensure completions are executed on the currently connected client For the time being, given that we only allow 1 client at a time, I took a shortcut to implement this. The server has an incrementing "current_client_id" which is part of every completion. On completion callback, we just check if its client_id is still equal to the server's current_client_id.	2025-02-21 09:35:51 +08:00
Karl Seguin	e301ba0cdb	Don't share or reuse timeout_completion Results in undefined behavior when a client disconnects and another reconnects while the timeout is being monitored: https://github.com/lightpanda-io/browser/pull/436#issuecomment-2670455216	2025-02-20 23:56:55 +08:00
Karl Seguin	bc4560877a	zig fmt	2025-02-20 22:08:56 +08:00
Karl Seguin	521a740d3a	Merge branch 'main' into cdp_struct	2025-02-20 22:08:37 +08:00
Karl Seguin	39a9efb73b	Fix server hang on client disconnect https://github.com/lightpanda-io/browser/issues/425 Add a few integration tests for the TCP server which are fast enough to be run as part of the unit tests (one of the new tests covers the above issue).	2025-02-19 15:01:12 +08:00
Karl Seguin	f508288ce3	Fix segfault when multiple inflight Send completions fail	2025-02-17 18:43:41 +08:00
Karl Seguin	18080cef9f	fix test	2025-02-17 12:14:11 +08:00
Karl Seguin	c4eeef2a86	On CDP process error, let client decide how to close Fixes issue where CDP closes the client, but client still registers a recv operation.	2025-02-17 12:05:25 +08:00
Karl Seguin	b1c3de6518	zig fmt	2025-02-13 17:32:01 +08:00
Karl Seguin	a43a6a299c	Merge branch 'main' into cdp_struct	2025-02-13 17:30:15 +08:00
Karl Seguin	89ff1411e9	Fix memory leak on invalid websocket continuation frames	2025-02-13 09:34:25 +08:00
Karl Seguin	701e8277d6	support continuation frames	2025-02-13 08:51:21 +08:00
Karl Seguin	4a11f80c45	Make websocket client reader stateful Move more logic into the reader. Avoid copying partial messages in cases where we know that the buffer is large enough. This is mostly groundwork for trying to add support for continuation frames.	2025-02-13 08:51:21 +08:00
Karl Seguin	f1b275d5d0	Increase fuzz count. Add test for [too] large HTTP requests	2025-02-13 08:51:21 +08:00
Karl Seguin	0753eb7691	zig fmt	2025-02-13 08:51:21 +08:00
Karl Seguin	94be7a0e79	Make TCP server websocket-aware Adding HTTP & websocket awareness to the TCP server. HTTP server handles `GET /json/version` and websocket upgrade requests. Conceptually, websocket handling is the same code as before, but receiving data will parse the websocket frames and writing data will wrap it in a websocket frame. The previous `Ctx` was split into a `Server` and a `Client`. This was largely done to make it easy to write unit tests, since the `Client` is a generic, all its dependencies (i.e. the server) can be mocked out. This also makes it a bit nicer to know if there is or isn't a client (via the server's client optional). Added a MemoryPool for the Send object (I thought that was a nice touch!) Removed MacOS hack on accept/conn completion usage. Known issues: - When framing an outgoing message, the entire message has to be duped. This is no worse than how it was before, but it should be possible to eliminate this in the future. Probably not part of this PR. - Websocket parsing will reject continuation frames. I don't know of a single client that will send a fragmented message (websocket has its own message fragmentation), but we should probably still support this just in case. - I don't think the receive, timeout and close completions can safely be re-used like we're doing. I believe they need to be associated with a specific client socket. - A new connection creates a new browser session. I think this is right (??), but for the very first, we're throwing out a perfectly usable session. I'm thinking this might be a change to how Browser/Sessions work. - zig build test won't compile. This branch reproduces the issue with none of these changes: https://github.com/karlseguin/browser/tree/broken_test_build (or, as a diff to main): https://github.com/lightpanda-io/browser/compare/main...karlseguin:broken_test_build	2025-02-13 08:51:19 +08:00
Pierre Tachoire	b2e3419bff	upgrade tigerbeetle	2025-02-12 14:37:39 +01:00
Karl Seguin	6ab64d155b	Refactor CDP CDP is now a struct which contains its own state: a browser and a session. When a client connection is made and successfully upgrades, the client creates the CDP instance. There is now a cleaner separation between Server, Client and CDP. Removed a number of allocations, especially when writing results/events from CDP to the client. Improved input message parsing. Tried to remove some usage of undefined.	2025-02-12 16:47:37 +08:00
Karl Seguin	14fe4f65e1	support continuation frames	2025-02-11 11:16:39 +08:00
Karl Seguin	bdb70444d6	Make websocket client reader stateful Move more logic into the reader. Avoid copying partial messages in cases where we know that the buffer is large enough. This is mostly groundwork for trying to add support for continuation frames.	2025-02-11 11:16:39 +08:00
Karl Seguin	4d9cc55a87	Increase fuzz count. Add test for [too] large HTTP requests	2025-02-11 11:16:39 +08:00
Karl Seguin	72eaab68be	zig fmt	2025-02-11 11:16:39 +08:00
Karl Seguin	c0c0694fcc	Make TCP server websocket-aware Adding HTTP & websocket awareness to the TCP server. HTTP server handles `GET /json/version` and websocket upgrade requests. Conceptually, websocket handling is the same code as before, but receiving data will parse the websocket frames and writing data will wrap it in a websocket frame. The previous `Ctx` was split into a `Server` and a `Client`. This was largely done to make it easy to write unit tests, since the `Client` is a generic, all its dependencies (i.e. the server) can be mocked out. This also makes it a bit nicer to know if there is or isn't a client (via the server's client optional). Added a MemoryPool for the Send object (I thought that was a nice touch!) Removed MacOS hack on accept/conn completion usage. Known issues: - When framing an outgoing message, the entire message has to be duped. This is no worse than how it was before, but it should be possible to eliminate this in the future. Probably not part of this PR. - Websocket parsing will reject continuation frames. I don't know of a single client that will send a fragmented message (websocket has its own message fragmentation), but we should probably still support this just in case. - I don't think the receive, timeout and close completions can safely be re-used like we're doing. I believe they need to be associated with a specific client socket. - A new connection creates a new browser session. I think this is right (??), but for the very first, we're throwing out a perfectly usable session. I'm thinking this might be a change to how Browser/Sessions work. - zig build test won't compile. This branch reproduces the issue with none of these changes: https://github.com/karlseguin/browser/tree/broken_test_build (or, as a diff to main): https://github.com/lightpanda-io/browser/compare/main...karlseguin:broken_test_build	2025-02-11 11:16:39 +08:00

105 Commits