fix(benches/transfer): remove throughput #2007

mxinden · 2024-07-26T11:40:08Z

The neqo-transport/benches/transfer.rs benchmarks use the test-fixture/src/sim simulator. The simulator can travel in time, i.e. it simulates time.

The wall-clock time of a single benchmark run is not the amount of time it took to transfer TRANSFER_AMOUNT. The simulated time is the amount of time it took to transfer TRANSFER_AMOUNT.

criterion will use the wall-clock time, not the simulated time to calculate the throughput based on TRANSFER_AMOUNT. The resulting throughput number is not meaningful.

This commit removes the call to group.throughput, thus removing the misleading criterion throughput reporting.

Related to #1998.

The `neqo-transport/benches/transfer.rs` benchmarks use the `test-fixture/src/sim` simulator. The simulator can travel in time, i.e. it simulates time. The _wall-clock time_ of a single benchmark run is not the amount of time it took to transfer `TRANSFER_AMOUNT`. The _simulated time_ is the amount of time it took to transfer `TRANSFER_AMOUNT`. `criterion` will use the _wall-clock time_, not the _simulated time_ to calculate the throughput based on `TRANSFER_AMOUNT`. The resulting throughput number is not meaningful. This commit removes the call to `group.throughput`, thus removing the misleading `criterion` throughput reporting.

codecov · 2024-07-26T12:16:20Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.00%. Comparing base (7a39675) to head (57ad173).
Report is 11 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2007   +/-   ##
=======================================
  Coverage   95.00%   95.00%           
=======================================
  Files         112      112           
  Lines       36364    36364           
=======================================
+ Hits        34546    34549    +3     
+ Misses       1818     1815    -3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

github-actions · 2024-07-26T12:16:57Z

Failed Interop Tests

QUIC Interop Runner, client vs. server

neqo-latest as client

neqo-latest vs. aioquic: Z C1
neqo-latest vs. haproxy: Z L1 C1
neqo-latest vs. kwik: Z
neqo-latest vs. lsquic: H Z V2
neqo-latest vs. msquic: Z A C1
neqo-latest vs. mvfst: DC B U A L1 L2 C1 C2
neqo-latest vs. neqo: A
neqo-latest vs. neqo-latest: Z A
neqo-latest vs. nginx: H C1
neqo-latest vs. ngtcp2: Z E
neqo-latest vs. picoquic: L1 C1
neqo-latest vs. quinn: Z
neqo-latest vs. s2n-quic: R
neqo-latest vs. xquic: Z A

neqo-latest as server

aioquic vs. neqo-latest: A
chrome vs. neqo-latest: 3
go-x-net vs. neqo-latest: A
kwik vs. neqo-latest: A
lsquic vs. neqo-latest: run cancelled after 20 min
msquic vs. neqo-latest: A
mvfst vs. neqo-latest: Z 3 A L1 C1
neqo vs. neqo-latest: A
ngtcp2 vs. neqo-latest: run cancelled after 20 min
picoquic vs. neqo-latest: A
quic-go vs. neqo-latest: run cancelled after 20 min
quiche vs. neqo-latest: DC 3 A
quinn vs. neqo-latest: A V2
s2n-quic vs. neqo-latest: A
xquic vs. neqo-latest: M A

All results

Succeeded Interop Tests

QUIC Interop Runner, client vs. server

neqo-latest as client

neqo-latest vs. aioquic: H DC LR C20 M S R 3 B U A L1 L2 C2 6 V2
neqo-latest vs. go-x-net: H DC LR M B U A L2 C2 6
neqo-latest vs. haproxy: H DC LR C20 M S R 3 B U A L2 C2 6 V2
neqo-latest vs. kwik: H DC LR C20 M S R 3 B U A L1 L2 C1 C2 6 V2
neqo-latest vs. lsquic: DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6
neqo-latest vs. msquic: H DC LR C20 M S R B U L1 L2 C2 6 V2
neqo-latest vs. mvfst: H LR M R Z 3 6
neqo-latest vs. neqo: H DC LR C20 M S R Z 3 B U E L1 L2 C1 C2 6 V2
neqo-latest vs. neqo-latest: H DC LR C20 M S R 3 B U E L1 L2 C1 C2 6 V2
neqo-latest vs. nginx: DC LR C20 M S R Z 3 B U A L1 L2 C2 6
neqo-latest vs. ngtcp2: H DC LR C20 M S R 3 B U A L1 L2 C1 C2 6 V2
neqo-latest vs. picoquic: H DC LR C20 M S R Z 3 B U E A L2 C2 6 V2
neqo-latest vs. quic-go: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6
neqo-latest vs. quiche: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6
neqo-latest vs. quinn: H DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6
neqo-latest vs. s2n-quic: H DC LR C20 M S 3 B U E A L1 L2 C1 C2 6
neqo-latest vs. xquic: H DC LR C20 M R 3 B U L1 L2 C1 C2 6

neqo-latest as server

aioquic vs. neqo-latest: H DC LR C20 M S R Z 3 B L1 L2 C1 C2 6 V2
go-x-net vs. neqo-latest: H DC LR M B U L2 C2 6
kwik vs. neqo-latest: H DC LR C20 M S R Z 3 B U L1 L2 C1 C2 6 V2
msquic vs. neqo-latest: H DC LR C20 M S R Z B U L1 L2 C1 C2 6 V2
mvfst vs. neqo-latest: H DC LR M B L2 C2 6
neqo vs. neqo-latest: H DC LR C20 M S R Z 3 B U E L1 L2 C1 C2 6 V2
picoquic vs. neqo-latest: H DC LR C20 M S R Z 3 B U E L1 L2 C1 C2 6 V2
quiche vs. neqo-latest: H LR M S R Z B L1 L2 C1 C2 6
quinn vs. neqo-latest: H DC LR C20 M S R Z 3 B U E L1 L2 C1 C2 6
s2n-quic vs. neqo-latest: H DC LR M S R 3 B E L1 L2 C1 C2 6
xquic vs. neqo-latest: H DC LR C20 S R Z 3 B U L1 L2 C1 C2 6

Unsupported Interop Tests

QUIC Interop Runner, client vs. server

neqo-latest as client

neqo-latest vs. aioquic: E
neqo-latest vs. go-x-net: C20 S R Z 3 E L1 C1 V2
neqo-latest vs. haproxy: E
neqo-latest vs. kwik: E
neqo-latest vs. msquic: 3 E
neqo-latest vs. mvfst: C20 S E V2
neqo-latest vs. nginx: E V2
neqo-latest vs. quic-go: E V2
neqo-latest vs. quiche: E V2
neqo-latest vs. quinn: V2
neqo-latest vs. s2n-quic: Z V2
neqo-latest vs. xquic: S E V2

neqo-latest as server

aioquic vs. neqo-latest: U E
chrome vs. neqo-latest: H DC LR C20 M S R Z B U E A L1 L2 C1 C2 6 V2
go-x-net vs. neqo-latest: C20 S R Z 3 E L1 C1 V2
kwik vs. neqo-latest: E
msquic vs. neqo-latest: 3 E
mvfst vs. neqo-latest: C20 S R U E V2
quiche vs. neqo-latest: C20 U E V2
s2n-quic vs. neqo-latest: C20 Z U V2
xquic vs. neqo-latest: E V2

github-actions · 2024-07-26T12:32:03Z

Benchmark results

Performance differences relative to 7a39675.

coalesce_acked_from_zero 1+1 entries: 💚 Performance has improved.

       time:   [189.97 ns 190.47 ns 191.00 ns]
       change: [-2.9299% -2.4592% -2.0057%] (p = 0.00 < 0.05)
       Performance has
Found 2 outliers among 100 measurements (2.00%)
  1 (1.00%) high mild
  1 (1.00%) high severe

coalesce_acked_from_zero 3+1 entries: 💚 Performance has improved.

       time:   [230.52 ns 230.97 ns 231.47 ns]
       change: [-2.9535% -2.4520% -1.7883%] (p = 0.00 < 0.05)
       Performance has
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  1 (1.00%) high mild
  9 (9.00%) high severe

coalesce_acked_from_zero 10+1 entries: Change within noise threshold.

       time:   [230.20 ns 230.75 ns 231.47 ns]
       change: [-2.7996% -1.8925% -0.5057%] (p = 0.00 < 0.05)
       Change within
Found 5 outliers among 100 measurements (5.00%)
  5 (5.00%) high severe

coalesce_acked_from_zero 1000+1 entries: Change within noise threshold.

       time:   [217.60 ns 217.82 ns 218.07 ns]
       change: [-1.3794% -0.6760% +0.0194%] (p = 0.05 < 0.05)
       Change within
Found 13 outliers among 100 measurements (13.00%)
  7 (7.00%) high mild
  6 (6.00%) high severe

RxStreamOrderer::inbound_frame(): No change in performance detected.

       time:   [118.65 ms 118.83 ms 119.09 ms]
       change: [-0.0166% +0.1665% +0.4268%] (p = 0.12 > 0.05)
       No change
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high severe

transfer/pacing-false/varying-seeds: No change in performance detected.

       time:   [39.139 ms 41.438 ms 43.731 ms]
       change: [-7.4449% -1.0747% +5.4487%] (p = 0.76 > 0.05)
       No change
Found 1 outliers among 100 measurements (1.00%)
  1 (1.00%) high mild

transfer/pacing-true/varying-seeds: No change in performance detected.

       time:   [50.893 ms 53.748 ms 56.634 ms]
       change: [-13.440% -6.3842% +1.1913%] (p = 0.10 > 0.05)
       No change
Found 4 outliers among 100 measurements (4.00%)
  2 (2.00%) low mild
  2 (2.00%) high mild

transfer/pacing-false/same-seed: No change in performance detected.

       time:   [47.983 ms 49.488 ms 50.937 ms]
       change: [-3.8273% +0.2587% +4.7279%] (p = 0.90 > 0.05)
       No change
Found 4 outliers among 100 measurements (4.00%)
  4 (4.00%) low mild

transfer/pacing-true/same-seed: No change in performance detected.

       time:   [68.613 ms 75.000 ms 81.355 ms]
       change: [-6.3302% +5.4390% +19.507%] (p = 0.40 > 0.05)
       No change

1-conn/1-100mb-resp (aka. Download)/client: No change in performance detected.

       time:   [290.42 ms 298.80 ms 306.74 ms]
       thrpt:  [326.01 MiB/s 334.67 MiB/s 344.33 MiB/s]
change:
       time:   [-3.5019% +0.3637% +4.4492%] (p = 0.87 > 0.05)
       thrpt:  [-4.2597% -0.3624% +3.6290%]
       No change

1-conn/10_000-parallel-1b-resp (aka. RPS)/client: 💚 Performance has improved.

       time:   [399.13 ms 402.39 ms 405.71 ms]
       thrpt:  [24.648 Kelem/s 24.852 Kelem/s 25.055 Kelem/s]
change:
       time:   [-4.2455% -3.0686% -1.9164%] (p = 0.00 < 0.05)
       thrpt:  [+1.9539% +3.1657% +4.4338%]
       Performance has

1-conn/1-1b-resp (aka. HPS)/client: No change in performance detected.

       time:   [68.372 ms 68.771 ms 69.221 ms]
       thrpt:  [14.447  elem/s 14.541  elem/s 14.626  elem/s]
change:
       time:   [-0.1329% +0.6064% +1.4190%] (p = 0.14 > 0.05)
       thrpt:  [-1.3991% -0.6028% +0.1331%]
       No change
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  10 (10.00%) high severe

Client/server transfer results

Transfer of 33554432 bytes over loopback.

Client	Server	CC	Pacing	Mean [ms]	Min [ms]	Max [ms]	Relative
msquic	msquic			186.9 ± 86.3	107.2	357.0	1.00
neqo	msquic	reno	on	305.5 ± 77.0	251.9	469.7	1.00
neqo	msquic	reno		321.7 ± 84.7	261.9	462.0	1.00
neqo	msquic	cubic	on	298.3 ± 70.1	254.4	464.0	1.00
neqo	msquic	cubic		264.0 ± 8.7	256.5	278.3	1.00
msquic	neqo	reno	on	225.0 ± 78.3	157.9	363.5	1.00
msquic	neqo	reno		193.2 ± 74.2	145.5	358.9	1.00
msquic	neqo	cubic	on	254.5 ± 135.2	149.4	555.8	1.00
msquic	neqo	cubic		215.8 ± 54.9	155.3	346.6	1.00
neqo	neqo	reno	on	201.1 ± 59.6	164.5	365.3	1.00
neqo	neqo	reno		218.1 ± 72.4	161.9	400.0	1.00
neqo	neqo	cubic	on	245.5 ± 116.7	162.7	592.8	1.00
neqo	neqo	cubic		204.4 ± 77.8	152.2	411.8	1.00

⬇️ Download logs

github-actions · 2024-07-26T14:08:47Z

Firefox builds for this PR

The following builds are available for testing. Crossed-out builds did not succeed.

Linux: Debug Release
macOS: Debug Release
Windows: Debug Release

larseggert

LGTM, but maybe add a comment where the removed line was, so we know why we removed it?

mxinden requested review from KershawChang, martinthomson and larseggert as code owners July 26, 2024 11:40

mxinden force-pushed the transfer-no-throughput branch from c886e58 to e09e7a5 Compare July 26, 2024 11:53

larseggert approved these changes Jul 26, 2024

View reviewed changes

Add comment

57ad173

larseggert approved these changes Jul 30, 2024

View reviewed changes

larseggert enabled auto-merge July 30, 2024 07:23

larseggert added this pull request to the merge queue Jul 30, 2024

Merged via the queue into mozilla:main with commit 7e96081 Jul 30, 2024
50 of 56 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(benches/transfer): remove throughput #2007

fix(benches/transfer): remove throughput #2007

mxinden commented Jul 26, 2024 •

edited

Loading

codecov bot commented Jul 26, 2024 •

edited

Loading

github-actions bot commented Jul 26, 2024 •

edited

Loading

Succeeded Interop Tests

neqo-latest as client

neqo-latest as server

Unsupported Interop Tests

neqo-latest as client

neqo-latest as server

github-actions bot commented Jul 26, 2024

github-actions bot commented Jul 26, 2024 •

edited

Loading

larseggert left a comment

fix(benches/transfer): remove throughput #2007

fix(benches/transfer): remove throughput #2007

Conversation

mxinden commented Jul 26, 2024 • edited Loading

codecov bot commented Jul 26, 2024 • edited Loading

Codecov Report

github-actions bot commented Jul 26, 2024 • edited Loading

Failed Interop Tests

neqo-latest as client

neqo-latest as server

Succeeded Interop Tests

neqo-latest as client

neqo-latest as server

Unsupported Interop Tests

neqo-latest as client

neqo-latest as server

github-actions bot commented Jul 26, 2024

Benchmark results

Client/server transfer results

github-actions bot commented Jul 26, 2024 • edited Loading

Firefox builds for this PR

larseggert left a comment

Choose a reason for hiding this comment

mxinden commented Jul 26, 2024 •

edited

Loading

codecov bot commented Jul 26, 2024 •

edited

Loading

github-actions bot commented Jul 26, 2024 •

edited

Loading

github-actions bot commented Jul 26, 2024 •

edited

Loading