-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Attempt to return dropped Body to pool #94
Comments
At the HTTP protocol level, something has to read the full response body before the TCP connection is ready to send connections on again. In ureq, that's done with into_reader, into_string, etc. Reading diem/diem#4735 it looks like you do read the body, but only on 200's. That means 403's, 500s, etc will not get returned to the pool and get closed when the Response is dropped. If you're seeing something other than that, please let us know. It seems like what you most likely want is to unconditionally call We've talked recently about a possible breaking change to the API to use Result. I'm planning to write up a detailed proposal soon, but I think that returning an Err for connection-breaking errors would perhaps make it clearer that anything receiving an Ok needs to read the stream. Also on diem/diem#4735 I see someone mention implementing drop. I assume that means "read all outstanding response bytes when Response is dropped." I think that's not the right approach, since that can wind up performing large amounts of work or even blocking indefinitely. |
The connection is auto-put back in the pool on drop (if possible), we could make some explicit API for that. On the fn is_poolable(&self) -> bool {
// check if response is exhausted
}
fn return_to_pool(self) {
// 1. exhaust response into void
// 2. by virtue of taking `self`, the rest just happens
} The idea is that it would aid workflows where pooling is desirable. |
Yeah, in my case, it is obvious that recycling should happen without any issues, but I can totally see that as @jsha alluded this cannot be the default behavior -- if someone attempts to download a file just to see if it exists, for instance, a naive approach would end up downloading the entire file /just/ to reuse a connection. |
I'm not keen on adding an explicit Here are a couple of other interventions that could increase connection pooling in cases where the user isn't necessarily calling into_string() or into reader():
By the way, thinking about this made me realize another trickiness of the API. Since into_reader() / into_string() consume the Response, the status code and headers become unavailable to users once those methods are called. I think in many cases the user may want to do something like: let resp = ureq::get("http://example.com").call();
let resp_body = resp.into_string()?;
println!("response status {}, body {}", resp.status(), resp_body); The above fails because resp was moved / consumed by into_string(). A user can work around it by storing a copy of the status code and headers before consuming the body, but it's a bit awkward. I don't have a particular recommendation for how to change this API right now but will think about it some more. |
I like these ideas. Repooling early is not a thought that crossed my mind. If we're considering breaking changes, one idea could be to use the informal rust standardized http crate. In hreq I made that a design goal, and it has some advantages for the user.
A common approach with http would be use ureq::Body;
let resp: http::Response<Body> = ureq::get("http://example.com").call();
let (parts, body) = resp.into_parts();
println!("response status {}, body {}", parts.status, body.into_string()?); And we could use extension traits to make it even more ergonomic. use ureq::ResponseExt;
use ureq::Body;
let resp: http::Response<Body> = ureq::get("http://example.com").call();
println!("response status {}, body {}", resp.status(), resp.read_string()?); |
For what it's worth, in a similar scenario of wanting to reuse a socket with unread response leftovers, Go's https://github.com/golang/go/blob/go1.15.6/src/net/http/client.go#L693-L701 (Tangent: this is only true for HTTP/1, HTTP/2, /3 are different beasts.. Not that that would matter for ureq as it is today.) |
That's a very helpful reference. Thank you! |
In ureq3. x this still needs doing. Dropping the
|
This is currently not possible, see #926. Choosing between auto-dropping content and the ergonomics, the ergonomics wins in this case. Closing until we see a way forward. |
In one of my apps, I use ureq to read/write from a key/value server. When I perform writes, the status response is sufficient to know failure or success. However, if I do not read the response body, the connection doesn't return to the pool since the data remains on the stream. This seems odd and requires in my app adding an explicit read on the response in order to allow the stream to return to the pool. I was curious why this is the case and whether or not we could just flush the buffer:
https://github.com/algesten/ureq/blob/master/src/pool.rs#L218
The text was updated successfully, but these errors were encountered: