Added support for hashmaps in `Smt` and `SimpleSmt` #363

polydez · 2024-12-20T06:32:10Z

After benchmarking `compute_mutations` and `apply_mutations`, we noticed poor performance for bulk key-value insertions into the `BTreeMap` used in our SMT implementations. Switching to hashbrown's `HashMap` (which supports no-std) gave us a more than 10x speedup in the `apply_mutations` operation for large trees.

apply_mutations/SimpleSmt: apply_mutations/10000
time: [154.52 ms 163.09 ms 174.46 ms]
change: [-93.242% -92.753% -92.142%] (p = 0.00 < 0.05)
Performance has improved.

(More context in the PR: #355)
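
To illustrate the kind of workload involved, here is a minimal sketch (not the benchmark used in this PR) that times bulk insertions into a `BTreeMap` and a `HashMap`. It assumes `std::collections::HashMap` as a stand-in for hashbrown's `HashMap` (the standard library map is built on hashbrown). The function names, the `scramble` helper, and the `u64` key/value types are hypothetical and chosen only for illustration.

```rust
use std::collections::{BTreeMap, HashMap};
use std::time::{Duration, Instant};

/// Scrambles sequential indices so that keys arrive in a non-sorted order.
/// Multiplying by an odd constant modulo 2^64 is a bijection, so keys stay distinct.
/// (Hypothetical stand-in for real SMT node indices.)
fn scramble(i: u64) -> u64 {
    i.wrapping_mul(0x9E37_79B9_7F4A_7C15)
}

/// Inserts `n` key-value pairs into a `BTreeMap`; returns the map and the elapsed time.
fn fill_btree(n: u64) -> (BTreeMap<u64, u64>, Duration) {
    let start = Instant::now();
    let mut map = BTreeMap::new();
    for i in 0..n {
        map.insert(scramble(i), i);
    }
    (map, start.elapsed())
}

/// Inserts the same `n` key-value pairs into a `HashMap`; returns the map and the elapsed time.
fn fill_hash(n: u64) -> (HashMap<u64, u64>, Duration) {
    let start = Instant::now();
    let mut map = HashMap::new();
    for i in 0..n {
        map.insert(scramble(i), i);
    }
    (map, start.elapsed())
}
```

Timings from a sketch like this depend on key distribution, map size, and build profile, so they should be treated as indicative only.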

This implementation also introduces the `smt_hashmaps` feature. When it is disabled, the SMT uses the `BTreeMap`-based implementation, as before. This may be useful for backward compatibility with code that relies on entry ordering in SMTs.
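
As a rough sketch of how such a feature switch can look, the snippet below selects a map type at compile time. It is not the code from this PR: the alias name `NodeMap` and the helper `sorted_entries` are hypothetical, and `std::collections::HashMap` is assumed in place of hashbrown's `HashMap` to keep the example dependency-free.

```rust
#[cfg(not(feature = "smt_hashmaps"))]
use std::collections::BTreeMap;
#[cfg(feature = "smt_hashmaps")]
use std::collections::HashMap;

/// Map type used when the `smt_hashmaps` feature is enabled (hypothetical alias).
#[cfg(feature = "smt_hashmaps")]
pub type NodeMap<K, V> = HashMap<K, V>;

/// Map type used when the feature is disabled: ordered, as before.
#[cfg(not(feature = "smt_hashmaps"))]
pub type NodeMap<K, V> = BTreeMap<K, V>;

/// Returns entries sorted by key, regardless of which map backs `NodeMap`.
/// Code that relied on `BTreeMap` iteration order would need something like
/// this once a hash map is used, because hash map iteration order is unspecified.
pub fn sorted_entries(map: &NodeMap<u64, u64>) -> Vec<(u64, u64)> {
    let mut entries: Vec<(u64, u64)> = map.iter().map(|(k, v)| (*k, *v)).collect();
    entries.sort_unstable();
    entries
}
```

Callers that only insert and look up entries are unaffected by the switch; only code that iterates and expects sorted order needs an explicit sort.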

bobbinth · 2024-12-27T07:31:26Z

@polydez - merging the latest next branch into this broke it somehow. Could you take a look?

polydez · 2024-12-27T14:06:06Z

@bobbinth, now it works!

bobbinth

Thank you! Looks good! I've reviewed all non-test code so far and left some comments inline.

I also ran the benchmarks (1M tree size and 1K insertions) and got about 20% improvement:

Using BTreeMap

Constructed a SMT with 1000000 key-value pairs in 41.5 seconds
Number of leaf nodes: 1000000

Running an insertion benchmark:
An average insertion time measured by 1000 inserts into an SMT with 1000000 leaves is 404 μs

Running a batched insertion benchmark:
An average insert-batch computation time measured by a 1000-batch into an SMT with 1001000 leaves over 405.3 ms is 405 μs
An average insert-batch application time measured by a 1000-batch into an SMT with 1001000 leaves over 32.5 ms is 32 μs
An average batch insertion time measured by a 1k-batch into an SMT with 1001000 leaves totals to 437.8 ms

Running a batched update benchmark:
An average update-batch computation time measured by a 1000-batch into an SMT with 1002000 leaves over 402.0 ms is 402 μs
An average update-batch application time measured by a 1000-batch into an SMT with 1002000 leaves over 28.1 ms is 28 μs
An average batch update time measured by a 1k-batch into an SMT with 1002000 leaves totals to 430.1 ms

Running a proof generation benchmark:
An average proving time measured by 100 value proofs in an SMT with 1001807 leaves in 9 μs

Using Hashbrown

Constructed a SMT with 1000000 key-value pairs in 42.1 seconds
Number of leaf nodes: 1000000

Running an insertion benchmark:
An average insertion time measured by 1000 inserts into an SMT with 1000000 leaves is 367 μs

Running a batched insertion benchmark:
An average insert-batch computation time measured by a 1000-batch into an SMT with 1001000 leaves over 360.1 ms is 360 μs
An average insert-batch application time measured by a 1000-batch into an SMT with 1001000 leaves over 4.8 ms is 5 μs
An average batch insertion time measured by a 1k-batch into an SMT with 1001000 leaves totals to 364.9 ms

Running a batched update benchmark:
An average update-batch computation time measured by a 1000-batch into an SMT with 1002000 leaves over 370.1 ms is 370 μs
An average update-batch application time measured by a 1000-batch into an SMT with 1002000 leaves over 4.2 ms is 4 μs
An average batch update time measured by a 1k-batch into an SMT with 1002000 leaves totals to 374.2 ms

Running a proof generation benchmark:
An average proving time measured by 100 value proofs in an SMT with 1001806 leaves in 0 μs

polydez · 2024-12-29T05:47:42Z

> I also ran the benchmarks (1M tree size and 1K insertions) and got about 20% improvement:

I tested only the application of the mutation set, which performs many inserts and updates; in your benchmark, this operation shows up to a ~7x improvement. Computation is still slow, but I hope that parallelizing it will give us a significant speedup.
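
For reference, the ratios can be worked out from the benchmark output quoted earlier in this thread. The sketch below uses only the millisecond totals reported there; the function and constant names are hypothetical. The rounded per-operation figures (28 μs versus 4 μs) give the ~7x mentioned above, while the unrounded totals give about 6.7x to 6.8x for application and about 1.1x to 1.2x for a whole batch.

```rust
/// Speedup factor: how many times faster `after_ms` is than `before_ms`.
fn speedup(before_ms: f64, after_ms: f64) -> f64 {
    before_ms / after_ms
}

/// (label, BTreeMap time in ms, hashbrown time in ms), copied from the
/// benchmark output quoted in this thread.
const TIMINGS: [(&str, f64, f64); 4] = [
    ("insert-batch application", 32.5, 4.8),
    ("update-batch application", 28.1, 4.2),
    ("insert-batch total", 437.8, 364.9),
    ("update-batch total", 430.1, 374.2),
];

/// Formats one line per measurement, e.g. "<label>: <factor>x".
fn report() -> Vec<String> {
    TIMINGS
        .iter()
        .map(|(label, before, after)| format!("{label}: {:.1}x", speedup(*before, *after)))
        .collect()
}
```

The gap between the two groups reflects that mutation computation, which is unchanged by the map type, dominates the total batch time.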

bobbinth

Looks good! Thank you! There are 2 more things we should do:

- Update the "Crate features" section in the README to mention the `smt_hashmaps` feature (I noticed that the `concurrent` feature description is also missing there).
- Probably update the Makefile to make sure we also run tests with the `smt_hashmaps` feature enabled.

polydez · 2025-01-02T09:25:47Z

> Update the "Crate features" section in the README to mention the `smt_hashmaps` feature (I noticed that the `concurrent` feature description is also missing there).

I will update the list, thank you for noticing! But what about the `concurrent` feature? I can see it in the list (first item):

crypto/README.md, line 63 in a797a9e:

    - `concurrent`- enabled by default; enables multi-threaded implementation of `Smt::with_entries()` which significantly improves performance on multi-core CPUs.

bobbinth · 2025-01-02T09:41:29Z

> But what about the `concurrent` feature? I can see it in the list (first item)

Ah yes! I was looking at an old version of the file.

sonarqubecloud · 2025-01-02T09:54:26Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

bobbinth

All looks good! Thank you!

polydez added 3 commits December 19, 2024 19:43

feat: rewrite Smt and SimpleSmt to hashmaps

bafe5c1

feat: introduce hashmaps feature

afb3e8b

refactor: rename feature to smt_hashmaps and use it only for SMTs

9e555e7

polydez requested review from plafer and bobbinth December 20, 2024 06:32

polydez added 2 commits December 20, 2024 11:33

docs: update CHANGELOG.md

a9f7bbe

fix: no-std build

dab437f

polydez marked this pull request as ready for review December 20, 2024 06:48

Merge branch 'next' into polydez-hashmap-smt

4f446f1

fix: compilation errors

f9b79b1

bobbinth reviewed Dec 29, 2024

polydez added 4 commits December 29, 2024 11:12

fix: typo

c9e198a

refactor: make InnerNodeInfo derive PartialOrd, Ord only for tests

da9bc92

fix: use u32 for serialization of MutationSet

a017c97

fix: use usize for serialization of MutationSet

2add8dd

bobbinth reviewed Dec 29, 2024

bobbinth reviewed Dec 29, 2024

polydez added 3 commits December 31, 2024 00:07

refactor: get rid of KeyConstraints

720c02e

refactor: make hashbrown dependency optional

a632496

refactor: read/write vectors directly

a797a9e

bobbinth approved these changes Dec 31, 2024

polydez added 2 commits January 2, 2025 14:26

docs: add smt_hashmaps feature description

393483b

chore: add smt_hashmaps feature testing

c437739

chore: add smt-hashmaps testing to github actions

1e88c08

bobbinth approved these changes Jan 2, 2025

bobbinth merged commit 7ee6d7f into next Jan 2, 2025
15 checks passed

bobbinth deleted the polydez-hashmap-smt branch January 2, 2025 18:23
