Previously, the IO loop was doing three things:
1 - Managing timeouts (either from scripts or for our own needs)
2 - Handling browser IO events (page/script/xhr)
3 - Handling CDP events (accept, read, write, timeout)
With the libcurl merge, 1 was moved to an in-process scheduler and 2 was moved
to libcurl's own event loop. That meant the entire loop code, including
the dependency on tigerbeetle-io, existed only to handle a single TCP client.
Not only is that a lot of code, there was also friction between the two loops
(the libcurl one and our IO loop), which resulted in latency: while one loop is
waiting for events, any events on the other loop go unprocessed.
This PR removes our IO loop. To accomplish this:
1 - The main accept loop is blocking. This is simpler and works perfectly well,
given that we only allow one active connection.
2 - The client socket is passed to libcurl - yes, libcurl's loop can take
arbitrary FDs and poll them along with its own (see the sketch below).
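To illustrate the mechanism, here is a minimal sketch (not the actual loop in
the codebase; `pollOnce`, `multi` and `cdp_fd` are illustrative names). libcurl's
curl_multi_poll accepts an array of extra curl_waitfd entries, so the CDP client
socket can be watched while libcurl drives its own transfers:

```zig
const c = @cImport(@cInclude("curl/curl.h"));

// Minimal sketch: run one iteration of libcurl's multi loop while also
// watching the CDP client socket. Returns true if the CDP socket is readable.
fn pollOnce(multi: ?*c.CURLM, cdp_fd: c.curl_socket_t) !bool {
    var extra = [_]c.struct_curl_waitfd{.{
        .fd = cdp_fd,
        .events = c.CURL_WAIT_POLLIN, // wake up when the CDP client sends data
        .revents = 0,
    }};

    // curl_multi_poll waits on libcurl's own transfers *and* our extra fd.
    var numfds: c_int = 0;
    if (c.curl_multi_poll(multi, &extra, extra.len, 100, &numfds) != c.CURLM_OK) {
        return error.CurlPollFailed;
    }

    // Let libcurl make progress on its transfers (scripts, xhr, ...).
    var running: c_int = 0;
    if (c.curl_multi_perform(multi, &running) != c.CURLM_OK) {
        return error.CurlPerformFailed;
    }

    // Report whether the CDP socket has data so the caller can read a message.
    return (extra[0].revents & c.CURL_WAIT_POLLIN) != 0;
}
```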
In addition to having one less dependency, the CDP code is quite a bit simpler,
especially around shutdowns and writes. This also removes _some_ of the latency
caused by the friction between page processing and CDP processing. Specifically,
when CDP blocks for input, http page events (script loading, xhr, ...) will
still be processed.
There's still friction. For one, the reverse isn't true: when the page is
waiting for events, CDP events aren't going to be processed. But page.wait
already has some sensitivity to this (e.g. the page.request_intercepted flag).
Also, when CDP waits, while we will process network events, page timeouts are
still not processed. Because of both these remaining issues, we still need to
jump between the two loops - but being able to block on CDP (even for a short
time) WITHOUT stopping the page's network I/O should reduce some latency.
Add a response_data event; CDP now captures the full body so that it can respond
to Network.getResponseBody. This isn't memory efficient, but I don't see
another way to do it. At least this way, response bodies are only captured/stored
when (a) CDP is used and (b) Network.enable has been called. That is,
as opposed to baking this into Http/Client.zig, which would force the memory
consumption for all use-cases.
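A rough sketch of the idea (the field and event names below are illustrative,
not the actual ones in the codebase): CDP keeps a per-request buffer that only
accumulates data once network capture is enabled, so other use-cases pay no
memory cost.

```zig
const std = @import("std");

// Illustrative sketch only; the real CDP state and event plumbing differ.
const ResponseBodyStore = struct {
    allocator: std.mem.Allocator,
    enabled: bool = false, // set when the driver enables network events
    bodies: std.AutoHashMapUnmanaged(u64, std.ArrayListUnmanaged(u8)) = .{},

    // Called for every response_data event; a no-op unless capture is enabled.
    fn onResponseData(self: *ResponseBodyStore, request_id: u64, chunk: []const u8) !void {
        if (!self.enabled) return;
        const entry = try self.bodies.getOrPut(self.allocator, request_id);
        if (!entry.found_existing) entry.value_ptr.* = .{};
        try entry.value_ptr.appendSlice(self.allocator, chunk);
    }
};
```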
There are arguably some optimizations we could make for XHR requests, which also
dupe/own the response. As of now, the response is duped separately for CDP
and XHR.
With networking enabled, CDP listens to this event and emits a
`Network.loadingFinished` event, which Puppeteer uses to know that details
about the response (i.e. the body) can be queried.
Added dummy handling for the Network.getResponseBody message; it returns an
empty body. This is needed because we emit the loadingFinished event, which
signals to drivers that they can ask for the body.
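For reference, a hypothetical shape of that dummy handler. Only the result
fields (body and base64Encoded) are dictated by the CDP spec; `cmd.sendResult`
stands in for whatever reply helper the CDP code actually uses:

```zig
// Hypothetical sketch; helper names are made up, only the result shape
// (body + base64Encoded) comes from the CDP spec.
fn getResponseBody(cmd: anytype) !void {
    return cmd.sendResult(.{
        .body = "",
        .base64Encoded = false,
    });
}
```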
The callback which was called on a per-header basis is removed. Only XHR was
using this, and it was created before the HeaderIterator existed (because I
didn't know we could iterate through the response headers in curl after the fact).
The header_done_callback remains, but is now called header_callback (a bit
confusing in the short term).
The only difficulty was with fulfilled requests, which do not have an easy
handle for our HeaderIterator. The existing code would segfault if
transfer.responseHeaderIterator() was called on a fulfilled request.
The HeaderIterator is now a tagged union that abstracts whether the response
headers come from a curl easy handle or from a list injected by a fulfilled
request.
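Roughly, the shape is something like the following sketch (the actual field and
method names may differ): the `easy` arm walks headers with libcurl's
curl_easy_nextheader, while the `list` arm just iterates the injected slice.

```zig
const std = @import("std");
const c = @cImport(@cInclude("curl/curl.h"));

const Header = struct { name: []const u8, value: []const u8 };

// Sketch of the tagged union; real names in the codebase may differ.
const HeaderIterator = union(enum) {
    // Headers live in the curl easy handle of a real transfer.
    easy: struct {
        handle: ?*c.CURL,
        prev: [*c]c.struct_curl_header = null,
    },
    // Fulfilled (intercepted) requests have no easy handle; their headers
    // are an injected list instead.
    list: struct {
        headers: []const Header,
        index: usize = 0,
    },

    fn next(self: *HeaderIterator) ?Header {
        switch (self.*) {
            .easy => |*state| {
                // curl_easy_nextheader can walk response headers even after
                // the transfer has completed.
                const h = c.curl_easy_nextheader(state.handle, c.CURLH_HEADER, -1, state.prev);
                if (h == null) return null;
                state.prev = h;
                return .{
                    .name = std.mem.sliceTo(h.*.name, 0),
                    .value = std.mem.sliceTo(h.*.value, 0),
                };
            },
            .list => |*state| {
                if (state.index >= state.headers.len) return null;
                defer state.index += 1;
                return state.headers[state.index];
            },
        }
    }
};
```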
Enabling curl's cookie engine brings an advantage:
* handles cookies during a redirect: when a server redirects and includes
  cookies, curl correctly sends them back on the next request
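For context, enabling the engine is essentially a one-liner; passing an empty
CURLOPT_COOKIEFILE turns the cookie engine on without reading any cookie file
(a sketch, not the exact code in the client):

```zig
const c = @cImport(@cInclude("curl/curl.h"));

// Sketch: an empty CURLOPT_COOKIEFILE enables curl's cookie engine without
// loading cookies from disk; curl then stores received cookies and replays
// them on later requests, including the ones that follow a redirect.
fn enableCookieEngine(easy: ?*c.CURL) !void {
    if (c.curl_easy_setopt(easy, c.CURLOPT_COOKIEFILE, @as([*c]const u8, "")) != c.CURLE_OK) {
        return error.SetOptFailed;
    }
}
```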
On client.request(req) we now immediately wrap the request in a Transfer. This
results in less copying of the Request object. It also makes transfer.uri
available, so CDP no longer needs to re-parse request.url with std.Uri.
The main advantage is that it's easier to manage resources. There was a
use-after-free before due to the sensitive nature of the transfer's lifetime.
There were also corner cases where some resources might not be freed. This is
hopefully fixed now that the lifetime of the Transfer has been extended.
This ensures that page.wait won't unblock too early. As-is, this isn't an issue
since active can only be 0 if there are no active OR pending requests. However,
with request interception (https://github.com/lightpanda-io/browser/pull/930)
it's possible to have no active requests and no pending requests - from the
http client's point of view - but still have pending-on-intercept requests.
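The gist of the change, as a hypothetical sketch (the counters below are
illustrative; the actual state in the http client differs): the client's notion
of "has work" now also counts transfers parked on interception.

```zig
// Illustrative only; the actual counters/queues in Http/Client.zig differ.
const Client = struct {
    active: usize = 0, // transfers currently driven by libcurl
    queued: usize = 0, // transfers waiting for a free handle
    intercepted: usize = 0, // transfers paused by CDP request interception

    // page.wait keeps blocking while this returns true; without the
    // intercepted count it could unblock while a request is still paused on
    // an intercept decision.
    fn hasWork(self: *const Client) bool {
        return self.active > 0 or self.queued > 0 or self.intercepted > 0;
    }
};
```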
An alternative to this would be to undo these changes, and instead change
Page.wait to be intercept-aware. That is, Page.wait would continue to block on
http activity and scheduled tasks, as well as intercepted requests. However,
since the Page doesn't know anything about CDP right now, and it does know
about the http client, maybe doing this in the client is fine.