Speed up writing of Varint21 #3453
base: master
Conversation
Is this your original implementation/code (read: not copied)?
I did use ChatGPT a bunch of times at first until it correctly unrolled and combined the logic. (I ran its code through tests, told it the wrong and correct results for the random-number test, and at some point it worked.) The different implementations from the Steinborn blog post are MIT-licensed, so it wouldn't be a problem to source the code from there either.
Prefer to avoid introducing externally licensed code.
Well RIP?
If it's not copied it might be OK.
I looked back at the issue comments, and it seems I had very similar code (apparently just a switch-syntax difference; I'm on my phone right now, so I can't compare properly) before others posted the link to Steinborn's code.
Haven't compared the code, but unrolling a VarInt write is a common practice, one that has become somewhat less common given what modern hardware can do (and compilers unrolling automatically). The only reason it's viable here is that we're writing to a native buffer through a VM, and there's tons of overhead, so reducing the number of calls matters. Just to prove a point, this same generic code exists here too: Although they don't have a writeMedium function like we fortunately do. So to say it's "copying" is a bit absurd, unless Jammm14 literally copy/pasted every tiny bit of code lol. It's just generic VarInt byte shifts.
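For reference, the "generic VarInt byte shifts" mentioned above look roughly like the classic loop-based LEB128 write below. This is a standalone sketch for illustration; the class and method names are made up here and are not the project's actual API.

```java
// Standalone sketch of the classic loop-based VarInt (LEB128) write.
// Names are illustrative, not BungeeCord's actual API.
public final class VarIntLoop {
    // Writes 'value' as a VarInt into 'out' at 'offset';
    // returns the number of bytes written (1-5 for an int).
    public static int write(int value, byte[] out, int offset) {
        int i = offset;
        while ((value & ~0x7F) != 0) {                 // more than 7 bits left?
            out[i++] = (byte) ((value & 0x7F) | 0x80); // low 7 bits + continuation bit
            value >>>= 7;                              // unsigned shift to consume them
        }
        out[i++] = (byte) value;                       // final byte, continuation bit clear
        return i - offset;
    }

    public static void main(String[] args) {
        byte[] buf = new byte[5];
        int n = write(300, buf, 0); // 300 encodes as 0xAC 0x02
        System.out.println(n + " "
                + Integer.toHexString(buf[0] & 0xFF) + " "
                + Integer.toHexString(buf[1] & 0xFF));
    }
}
```

The per-iteration data dependency (each step needs the shifted `value` from the previous one) is exactly what the unrolled version discussed in this PR avoids.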
Benchmarks indicate this is 20-50% faster (higher when writing longer varints).
See #3451 (comment) and https://steinborn.me/posts/performance/how-fast-can-you-write-a-varint/
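The unrolled technique benchmarked here and in the blog post computes the encoded length once and then branches, instead of looping. Below is a minimal sketch of that technique; it writes to a plain byte array for self-containedness (the PR itself targets a Netty buffer), and it is not the PR's actual code.

```java
// Sketch of an unrolled VarInt write: compute the byte count once, then
// branch. Every shift reads the ORIGINAL value, so the byte computations
// are independent of each other (reduced data dependency).
public final class VarIntUnrolled {
    public static int write(int value, byte[] out, int offset) {
        // Number of 7-bit groups needed; value == 0 still yields 1
        // because -1 / 7 == 0 in Java integer division.
        int bytes = (31 - Integer.numberOfLeadingZeros(value)) / 7 + 1;
        switch (bytes) {
            case 1:
                out[offset] = (byte) value;
                break;
            case 2:
                out[offset]     = (byte) (value & 0x7F | 0x80);
                out[offset + 1] = (byte) (value >>> 7);
                break;
            case 3:
                out[offset]     = (byte) (value & 0x7F | 0x80);
                out[offset + 1] = (byte) ((value >>> 7) & 0x7F | 0x80);
                out[offset + 2] = (byte) (value >>> 14);
                break;
            case 4:
                out[offset]     = (byte) (value & 0x7F | 0x80);
                out[offset + 1] = (byte) ((value >>> 7) & 0x7F | 0x80);
                out[offset + 2] = (byte) ((value >>> 14) & 0x7F | 0x80);
                out[offset + 3] = (byte) (value >>> 21);
                break;
            default: // 5 bytes (negative ints have the top bit set)
                out[offset]     = (byte) (value & 0x7F | 0x80);
                out[offset + 1] = (byte) ((value >>> 7) & 0x7F | 0x80);
                out[offset + 2] = (byte) ((value >>> 14) & 0x7F | 0x80);
                out[offset + 3] = (byte) ((value >>> 21) & 0x7F | 0x80);
                out[offset + 4] = (byte) (value >>> 28);
                break;
        }
        return bytes;
    }
}
```

The length computation via `Integer.numberOfLeadingZeros` is the same idea the blog post uses; on x86 it compiles to a single `lzcnt`/`bsr`-based sequence.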
With each operation taking just a few nanoseconds, this is likely only a small improvement and not a noticeable difference overall.
As we rarely have to touch that part of the code, I think it is okay to have this optimized, unrolled loop and the condensed write calls here.
In further benchmarks I ran, there was no significant difference when leaving out the 0x7F mask on the uppermost byte, so I added it to stay close to the blog post's code.
The varint-writing speedup in the benchmark is lower on Java 19 (only ~10% faster) than on Java 16/17 (~30% faster); the original code's speed is similar across versions. The benchmark used 2048 random short numbers, and the speedup stays similar with other batch sizes (such as 4096 or 1024) and random seeds.
I don't want to spend time investigating whether this is tunable with JVM flags or what exactly is going on there, though, also because benchmarks are never identical to the real workload...
However, this change very likely doesn't just improve benchmark scores but also yields a real-world speedup: data dependencies are reduced, the loop is replaced with a switch (fewer jumps), and the buffer is no longer written byte-by-byte (CPUs are built to handle more than a single byte at a time). These improvements go beyond what the JVM is capable of doing on its own.
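To illustrate the "not byte-by-byte" point: a VarInt that needs exactly three bytes (i.e. 2^14 <= value < 2^21, which covers the longest Varint21 encodings) can be packed into one 24-bit value and flushed with a single writeMedium call instead of three writeByte calls. The sketch below shows one way to do that packing, matching Netty's big-endian writeMedium byte order; it is an assumption-laden illustration of the idea, not the PR's actual code, and it only handles the exact-three-byte case.

```java
// Packs a VarInt that needs exactly three bytes (2^14 <= value < 2^21)
// into one big-endian 24-bit "medium" whose byte order matches a single
// writeMedium call. Illustrative sketch only.
public final class Varint21Medium {
    public static int packThreeBytes(int value) {
        return ((value & 0x7F | 0x80) << 16)          // first byte, continuation set
             | (((value >>> 7) & 0x7F | 0x80) << 8)   // second byte, continuation set
             | (value >>> 14);                        // third byte, continuation clear
    }
}
```

For example, writing the medium `packThreeBytes(16384)` emits the bytes `80 80 01`, exactly what three separate `writeByte` calls would produce, but crossing the VM/native-buffer boundary once instead of three times.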