Re-visiting the serialization #214

Open

minghuaw opened this issue Oct 26, 2023 · 6 comments

@minghuaw
Owner

The current to_vec() method creates the output buffer with Vec::new(), and according to [1]

A new, empty Vec created by the common means (vec![] or Vec::new or Vec::default) has a length and capacity of zero

This inevitably leads to re-allocation, and likely repeated re-allocation if the object is large. However, given that we already have a SizeSerializer that can estimate the serialized size in bytes, pre-allocating the output buffer with that estimate could reduce the number of re-allocations.

[1] https://nnethercote.github.io/perf-book/heap-allocations.html?highlight=borrow
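A rough sketch of how to_vec() could pre-allocate, assuming a hypothetical serialized_size() helper built on top of the existing SizeSerializer (the names, constructors, and error type here are illustrative rather than the crate's actual API):

```rust
use serde::Serialize;

// Hypothetical helper built on the crate's SizeSerializer; the real way to
// obtain the estimate may look different.
fn serialized_size<T: Serialize + ?Sized>(value: &T) -> Result<usize, Error> {
    let mut sizer = SizeSerializer::new();
    value.serialize(&mut sizer)?;
    Ok(sizer.size())
}

pub fn to_vec<T: Serialize + ?Sized>(value: &T) -> Result<Vec<u8>, Error> {
    // Pre-allocate with the estimate instead of starting from Vec::new(),
    // so a large object does not trigger repeated re-allocations.
    let mut buf = Vec::with_capacity(serialized_size(value)?);
    let mut serializer = Serializer::new(&mut buf); // assumed writer-based constructor
    value.serialize(&mut serializer)?;
    Ok(buf)
}
```

If the estimate is exact, the buffer never re-allocates; if it is only an upper bound, the cost is a single slightly oversized allocation.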

@minghuaw changed the title from Would pre-allocation improve serialization perf? to Re-visiting the serialization on Oct 26, 2023
@minghuaw
Owner Author

In addition, there are places where a temporary buffer is created during serialization. Is it possible to apply a similar technique there? Or, even better, can these temporary buffers be removed entirely? Most of them exist only because the serialized format requires one or more size bytes to be prepended to the actual data.
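One way the temporary buffers might be avoided (a self-contained sketch, assuming a 4-byte big-endian size field for illustration; the real format's width and encoding may differ): reserve placeholder bytes for the size, serialize directly into the main buffer, then patch the size in afterwards.

```rust
// Sketch: write the payload straight into the output buffer and back-patch
// the size prefix, instead of serializing into a temporary Vec and copying
// it over. The 4-byte big-endian prefix is an assumption for illustration.
fn write_size_prefixed<F>(buf: &mut Vec<u8>, write_payload: F)
where
    F: FnOnce(&mut Vec<u8>),
{
    // Reserve placeholder bytes for the size field.
    let size_pos = buf.len();
    buf.extend_from_slice(&[0u8; 4]);

    // Serialize the payload directly into the same buffer.
    let payload_start = buf.len();
    write_payload(buf);
    let payload_len = (buf.len() - payload_start) as u32;

    // Patch the placeholder with the actual payload length.
    buf[size_pos..size_pos + 4].copy_from_slice(&payload_len.to_be_bytes());
}
```

For example, write_size_prefixed(&mut buf, |b| b.extend_from_slice(&data)) would emit the length-prefixed data without an intermediate allocation.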

@minghuaw
Owner Author

minghuaw commented Nov 3, 2023

Or, even better, can these temporary buffers be removed entirely? Most of them exist only because the serialized format requires one or more size bytes to be prepended to the actual data.

It might be better if this is introduced in a breaking update.

@minghuaw
Owner Author

minghuaw commented Nov 3, 2023

Initial experiments show that this quite significantly degrades serialization performance for primitive types like u8, bool, i8, and char, which are only one or two bytes long. A very big improvement was observed for types of medium length (4 B to 1 kB). Surprisingly, for long strings/binary (>= 1 MB), the performance seems to remain the same.
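For reference, a comparison across those size categories could be measured with a criterion harness along these lines (the serde_amqp::to_vec import and the benchmark names are assumptions; swap in the crate's actual entry point):

```rust
use criterion::{criterion_group, criterion_main, Criterion};
use std::hint::black_box;

// Assumed entry point under test; adjust the path to the actual crate.
use serde_amqp::to_vec;

fn bench_sizes(c: &mut Criterion) {
    let small: u8 = 42;                 // one-byte primitive
    let medium = vec![0u8; 1024];       // medium payload (~1 kB)
    let large = vec![0u8; 1024 * 1024]; // large payload (~1 MB)

    c.bench_function("to_vec/u8", |b| b.iter(|| to_vec(black_box(&small))));
    c.bench_function("to_vec/1kB", |b| b.iter(|| to_vec(black_box(&medium))));
    c.bench_function("to_vec/1MB", |b| b.iter(|| to_vec(black_box(&large))));
}

criterion_group!(benches, bench_sizes);
criterion_main!(benches);
```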

@minghuaw
Owner Author

minghuaw commented Nov 5, 2023

Or, even better, can these temporary buffers be removed entirely? Most of them exist only because the serialized format requires one or more size bytes to be prepended to the actual data.

Reserving capacity in the buffer somehow negatively impacts serializing Vec<u64>.

@lsunsi

lsunsi commented May 5, 2024

This is interesting; I can't imagine why it would decrease performance in this way. Just noting here that I'd also expect pre-allocation to only improve performance.

@minghuaw
Owner Author

minghuaw commented May 6, 2024

This is interesting; I can't imagine why it would decrease performance in this way. Just noting here that I'd also expect pre-allocation to only improve performance.

That was my expectation as well. However, I haven't had enough time to investigate further.
