Rewrite the exponential decay process in C++. #285

plietar · 2024-03-11T11:02:07Z

The existing process written in R needs to copy the contents of each variable from C++ using v$get_values(), then after scaling the vector it would copy the result back into C++ using v$queue_update.

The amount of data copied and the time it took was pretty significant. 6 double variables, one for each kind of immunity, need to be updated in full at each time step, each as big as the population size.

Moving this into C++ removes the need for any copy at all, besides the multication loop. Values are read out of a reference to the vector held by the DoubleVariable, the result of the multication is moved to the queue, and finally individual moves the vector in the queue into the DoubleVariable. The speedup from this change for a 1M population size is around 10%.

An alternative optimization I considered was to compute the exponential decay lazily, recording only the timestep and value at which the immunity was last updated and using the closed form expression of the exponential decay. This would avoid the need to have mass updates of the immunity variables at every time step. Unfortunately in my testing this ends up being slower than even the current implementation, with all the time being spent in calculating the current value. This would also be a much more intrusive change, since every use of the immunity variables needs to be modified to take the last update timestep, the current timestep and the decay rate into consideration.

The existing process written in R needs to copy the contents of each variable from C++ using `v$get_values()`, then after scaling the vector it would copy the result back into C++ using `v$queue_update`. The amount of data copied and the time it took was pretty significant. 6 double variables, one for each kind of immunity, need to be updated in full at each time step, each as big as the population size. Moving this into C++ removes the need for any copy at all, besides the multication loop. Values are read out of a reference to the vector held by the DoubleVariable, the result of the multication is moved to the queue, and finally individual moves the vector in the queue into the DoubleVariable. The speedup from this change for a 1M population size is around 10%. An alternative optimization I considered was to compute the exponential decay lazily, recording only the timestep and value at which the immunity was last updated and using the closed form expression of the exponential decay. This would avoid the need to have mass updates of the immunity variables at every time step. Unfortunately in my testing this ends up being slower than even the current implementation, with all the time being spent in calculating the current value. This would also be a much more intrusive change, since every use of the immunity variables needs to be modified to take the last update timestep, the current timestep and the decay rate into consideration.

plietar · 2024-03-11T13:21:02Z

The broken tests are because of the new individual version, will be fixed by mrc-ide/individual#192

giovannic

This is great stuff!

Though not necessary, it would be good to see the performance improvement compared to a naive copy, modify, move implementation (which wouldn't require all the iterator code).

Another alternative to potentially consider: Rounding off low immunities to zero and excluding them from the update? Milage would likely depend on the threshold for low immunity and the transmission level.

giovannic · 2024-03-11T15:56:16Z

src/processes.cpp

+ * to create an empty vector, use `reserve(N)` to pre-allocate the vector and
+ * then call `push_back` with each new value. The second way would be to create
+ * a zero-initialised vector of size N and then use `operator[]` to fill in the
+ * values.


What about copy, modify, move?

giovannic · 2024-03-11T16:22:13Z

It's very surprising that gcc doesn't optimise the second approach. Is that because of the default optimisation level set by R packaging? Or is this generally the best way to do it?

plietar · 2024-03-12T16:13:31Z

Is that because of the default optimisation level set by R packaging?

Not really, I get similar results even at -O3 -march=alderlake.

See the disassembly at https://godbolt.org/z/9jWKbTev7. There's a lot going on, but even just at a glance the copy_modify and zero_initialized functions each start with a call to memmove or memset, respectively. reserve_pushback doesn't have any such call, but it doesn't have any unrolling and vectorization. The only multiplication is a single vmulsd instruction, which only multiplies one double value per call, whereas the other functions use multiple calls to vmulpd ymm, which does four multiplications at once (along with some vmulpd xmm and vmulsd calls to round things off).

Clang is even more aggressive about unrolling, and generates 4 vmulpd ymm instructions for loop iteration, and generally produces an easier to read assembly. Still doesn't optimize out the memmove and memset calls though.

Here are the microbenchmarks cited in the comment https://gist.github.com/plietar/72fc5160f79e19379418ce391db1624a

plietar · 2024-03-12T16:39:40Z

To be honest I am a little on the fence about whether to use the iterator stuff or not. It was about 2% speedup on the overall simulation compared to the zero-initialization code, which is small but not insignificant. Given that all the complexity is contained within that one function and doesn't leak out, I think I'm inclined to keep it.

This reverts commit b3376d3.

plietar requested a review from giovannic March 11, 2024 11:02

plietar force-pushed the faster-exponential branch from 5fae7bb to 92800d6 Compare March 11, 2024 11:03

giovannic approved these changes Mar 11, 2024

View reviewed changes

Update individual dependency

7e7a849

plietar merged commit b3376d3 into dev Mar 21, 2024
4 checks passed

giovannic deleted the faster-exponential branch March 21, 2024 14:07

plietar added a commit to plietar/malariasimulation that referenced this pull request May 16, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

25a924c

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 16, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

b3c83fd

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 16, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

d0109b0

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 16, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

e80a0cc

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 17, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

9abf4ed

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 17, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

369df0a

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 20, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

f2d3715

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

b4451e4

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

fec1f76

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

644b49a

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

6ea3512

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

a2829c6

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

ce458b6

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

97b9b83

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

b170325

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

8a647b6

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

9529194

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

2bacdf5

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

de0b246

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

4dc67fd

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

885871c

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

741e3ba

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 21, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

89d63bd

This reverts commit b3376d3.

plietar added a commit to plietar/malariasimulation that referenced this pull request May 22, 2024

Revert "Rewrite the exponential decay process in C++. (mrc-ide#285)"

39dcee2

This reverts commit b3376d3.

giovannic mentioned this pull request Sep 11, 2024

Dev #335

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rewrite the exponential decay process in C++. #285

Rewrite the exponential decay process in C++. #285

plietar commented Mar 11, 2024

plietar commented Mar 11, 2024 •

edited

Loading

giovannic left a comment

giovannic Mar 11, 2024

giovannic commented Mar 11, 2024

plietar commented Mar 12, 2024 •

edited

Loading

plietar commented Mar 12, 2024 •

edited

Loading

Rewrite the exponential decay process in C++. #285

Rewrite the exponential decay process in C++. #285

Conversation

plietar commented Mar 11, 2024

plietar commented Mar 11, 2024 • edited Loading

giovannic left a comment

Choose a reason for hiding this comment

giovannic Mar 11, 2024

Choose a reason for hiding this comment

giovannic commented Mar 11, 2024

plietar commented Mar 12, 2024 • edited Loading

plietar commented Mar 12, 2024 • edited Loading

plietar commented Mar 11, 2024 •

edited

Loading

plietar commented Mar 12, 2024 •

edited

Loading

plietar commented Mar 12, 2024 •

edited

Loading