browser

mirror of https://github.com/lightpanda-io/browser.git synced 2025-10-29 23:23:28 +00:00

Author	SHA1	Message	Date
Karl Seguin	6df5e55807	Optimize the lifecycle of async requests Async HTTP request work by emitting a "Progress" object to a callback. This object has a "done" flag which, when `true`, indicates that all data has been emitting and no future "Progress" objects will be sent. Callers like XHR buffer the response and wait for "done = true" to then process the request. The HTTP client relies on two important object pools: the connection and the state (with all the buffers for reading/writing). In its current implementation, the async flow does not release these pooled objects until the final callback has returned. At best, this is inefficient: we're keeping the connection and state objects checked out for longer than they have to be. At worse, it can lead to a deadlock. If the calling code issues a new request when done == true, we'll eventually run out of state objects in the pool. This commit now releases the state objects before emit the final "done" Progress message. For this to work, this final message will always have null data and an empty header object.	2025-06-05 12:40:59 +08:00
Karl Seguin	c3f3eea7fb	Improve logging 1 - Make log_level a runtime option (not a build-time) 2 - Make log_format a runtime option 3 - In Debug mode, allow for log scope filtering Improve the general usability of scopes. Previously, the scope was more or less based on the file that the log was in. Now they are more logically grouped. Consider the case where you want to silence HTTP request information, previously you'd have to filter out the `page`, `xhr` and `http_client` scopes, but that would also elimiate other page, xhr and http_client logs. Now, you can just filter out the `http` scope.	2025-06-02 21:38:56 +08:00
Karl Seguin	e339ee3f0c	Clean Http Request Shutdown The Request object now exists on the heap, allowing it to outlive whatever is making the request (e.g. the XHR object). We can now wait until all inflight IO events are completed before clearing the memory. This change fixes the crash observed in: https://github.com/lightpanda-io/browser/issues/667	2025-05-31 07:22:01 +08:00
Karl Seguin	30ff17df28	Skip large header lines that don't fit into the header buffer. https://github.com/lightpanda-io/browser/issues/672	2025-05-28 00:14:51 +08:00
Karl Seguin	2feba3182a	Replace std.log with a structured logger Outputs in logfmt in release and a "pretty" print in debug mode. The format along with the log level will become arguments to the binary at some point in the future.	2025-05-27 19:57:58 +08:00
Karl Seguin	f59e3cd4da	Maybe retry on TlsAlertCloseNotify error This might not be specific to network notification, but the issue happens all the time testing scenarios that rely on network notification, so it's hard to ignore.	2025-05-24 09:01:13 +08:00
Karl Seguin	94a30b2167	HTTP request notification - Add 2 internal notifications 1 - http_request_start 2 - http_request_complete - When Network.enable CDP message is received, browser context registers for these 2 events (when Network.disable is called, it unregisters) - On http_request_start, CDP will emit a Network.requestWillBeSent message. This _does not_ include all the fields, but what we have appears to be enough for puppeteer.waitForNetworkIdle. - On http_request_complete, CDP will emit a Network.responseReceived message. This _does not_ include all the fields, bu what we have appears to be enough for puppeteer.waitForNetworkIdle. We currently don't emit any other new events, including any network-specific lifecycleEvent (i.e. Chrome will emit an networkIdle and networkAlmostIdle). To support this, the following other things were done: - CDP now has a `notification_arena` which is re-used between browser contexts. Normally, CDP code runs based on a "cmd" which has its own message_arena, but these notifications happen out-of-band, so we needed a new arena which is valid for handling 1 notification. - HTTP Client is notification-aware. The SessionState no longer includes the http.Client directly. It instead includes an http.RequestFactory which is the combination fo the client + a specific configuration (i.e. Notification). This ensures that all requests made from that factory have the same settings. - However, despite the above, _some_ requests do not appear to emit CDP events, such as loading a <script src="X">. So the page still deals directly with the *http.Client. - Playwright and Puppeteer (but Playwright in particular) are very sensitive to event ordering. These new events have introduced additional sensitivity. The result sent to Page.navigate had to be moved to inside the navigate event handler, which meant passing some cdp-specific data (the input.id) into the NavigateOpts. This is the only way I found to keep both happy - the sequence of events is closer (but still pretty far) from what Chrome does.	2025-05-24 09:01:12 +08:00
Karl Seguin	cc82b1ae25	Fix connection memory leak When the idle pool is full and the oldest connection is freed, free the connection instance.	2025-05-23 17:11:14 +08:00
Karl Seguin	7156df8d9a	Add support for gzip responses in AsyncHandler Compliments https://github.com/lightpanda-io/browser/pull/601 which added this behavior to the SyncHandler.	2025-05-16 12:51:53 +08:00
Karl Seguin	e918a0bf26	add direct http proxy support	2025-05-13 18:21:27 +08:00
Karl Seguin	64f80312de	fix formatting	2025-05-13 10:42:51 +08:00
Karl Seguin	ce2eed28c1	Fix memory leaks	2025-05-13 10:42:16 +08:00
Karl Seguin	505fa91d7d	add keepalive to http client	2025-05-13 10:42:16 +08:00
Karl Seguin	78bfdd4515	Support gzip compressed content for the synchronous http client	2025-05-06 16:23:44 +08:00
Karl Seguin	837188f8d1	peek must check existing data first	2025-04-23 08:28:20 +08:00
Karl Seguin	b9f61466ba	Try to sniff the mime type based on the body content Synchronous body reader now exposes a peek() function to get the first few bytes from the response body. This will be no less than 100 bytes (assuming the body is that big), but could be more. Streaming API, via res.next() continues to work as-is even if peek() is called. Introduce Mime.sniff() that detects a few common types - the ones that we care about right now - from the body content.	2025-04-22 10:58:26 +08:00
Pierre Tachoire	1c08b3e5e4	Merge pull request #534 from lightpanda-io/mutable_response_header_value Some checks failed e2e-test / zig build release (push) Has been cancelled Details e2e-test / puppeteer-perf (push) Has been cancelled Details e2e-test / demo-scripts (push) Has been cancelled Details wpt / web platform tests (push) Has been cancelled Details wpt / web platform tests json output (push) Has been cancelled Details wpt / perf-fmt (push) Has been cancelled Details zig-test / zig build dev (push) Has been cancelled Details zig-test / browser fetch (push) Has been cancelled Details zig-test / zig test (push) Has been cancelled Details zig-test / perf-fmt (push) Has been cancelled Details nightly build / build-linux-x86_64 (push) Has been cancelled Details nightly build / build-linux-aarch64 (push) Has been cancelled Details nightly build / build-macos-aarch64 (push) Has been cancelled Details nightly build / build-macos-x86_64 (push) Has been cancelled Details Make HTTP Response header values mutable	2025-04-17 13:10:33 +02:00
Karl Seguin	4a6bf38666	ResponseHeader.get should return mutable slice	2025-04-16 16:54:28 +08:00
Karl Seguin	613904e3a4	Make HTTP Response header values mutable The HTTP response values _are_ mutable, but because we're using std.http.Header the type is a `[]const u8`. This introduce a custom `Header` type where the value is `[]u8`. The goal is largely to allow more efficient value-comparison, by allowing calling code to lower-case in-place. I specifically have the Mime parser in mind: `25dcae7648/src/browser/mime.zig (L134)`	2025-04-16 14:05:21 +08:00
Karl Seguin	b8d7744563	replace zig-js-runtime	2025-04-15 15:18:04 +08:00
Karl Seguin	be75b5b237	Add URL struct Combine uri + rawuri into single struct. Try to improve ownership around URIs and URI-like things. - cookie & request can take *const std.Uri (TODO: make them aware of the new URL struct?) - Location (web api) should own its URL (web api URL) - Window should own its Location Most of these changes result in (a) a cleaner Page and (b) not having to carry around 2 nullable objects (URI and rawuri).	2025-04-09 18:19:07 +08:00
Karl Seguin	ec9a1416a1	Don't emit incorrect empty chunk When we only have 1 or 2 bytes missing from a chunk (i.e. the tailing \n or \r\n), don't emit an empty chunk if we have more data available to process.	2025-04-07 22:40:02 +08:00
Karl Seguin	22d33fa286	Add cookie support to browser (not XHR yet) requests	2025-03-31 18:44:09 +08:00
Karl Seguin	75f66a6cb2	Accommodate zig-js-runtime loop changes	2025-03-31 14:59:40 +08:00
Karl Seguin	c6538e1038	Add an `insecure_disable_tls_host_verification` command line option When set, this disables the host verification of all HTTP requests. Available for both the fetch and serve mode. Also introduced an App.Config, for future command line options which need to be passed more deeply into the code.	2025-03-27 18:02:30 +08:00
Karl Seguin	531a484cb0	Fix a few comments Switch generic http_client error level from warn to err	2025-03-27 08:11:48 +08:00
Karl Seguin	afdb5d7233	reset read_pos after handshake is established	2025-03-23 20:08:12 +08:00
Karl Seguin	18be1202db	Prevent double in-flight recvs Retry on test timeout for slower machines (i.e. CI build), while also reducing wait time for faster builds.	2025-03-23 19:05:37 +08:00
Karl Seguin	14cc87e1a5	Use latest tls.zig (with new allocation-free API) Add more fuzz tests around async tls.	2025-03-23 19:05:37 +08:00
Karl Seguin	2a0d1b0a48	Switch to nonblocking socket Improve test server handshake performance, allowing for a few more fuzz iterations without making tests unbearably slow.	2025-03-23 19:05:37 +08:00
Karl Seguin	22aa126b29	Cleaner merge Switch to non-blocking sockets. Fix TLS handshake/receive/send ordering	2025-03-23 19:05:35 +08:00
Karl Seguin	feb2046549	add TLS integration test for sync client	2025-03-23 19:01:40 +08:00
Karl Seguin	2f362f2aa2	handle redirects on asynchronous calls	2025-03-23 19:01:40 +08:00
Karl Seguin	de160d9170	Cleanup synchronous connection for tls and non-tls. Drain response prior to redirect.	2025-03-23 19:01:40 +08:00
Karl Seguin	226c18cb56	handle redirects on synchronous calls	2025-03-23 19:01:40 +08:00
Karl Seguin	807d3a600c	Support transfer-encoding: chunked, fix async+tls integration	2025-03-23 19:01:40 +08:00
Karl Seguin	2017d4785b	replace zig-async-io and std.http.Client with a custom HTTP client	2025-03-23 19:01:40 +08:00

37 Commits