benchmark scripts: basic separation for alps vcluster, number of cores, gpu defaults, ... #1254

Draft: wants to merge 5 commits into master

albestro
Collaborator

Main discussion points:

  1. create a "base" alps-gh200 config so that each vcluster can inherit and customise it
  2. reduce number of cores: see dedicated comment below
  3. is pika mpi mode 31+1 the best one? should we go for this or keep the default initialised (30+0)? @msimberg @biddisco
  4. currently we cannot set DLAF params differently for MC and GPU, and the GPU defaults are not the best. should we set them like this, or do we prefer to have them in the generator scripts as CLI parameters?
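
To make points 1 and 4 concrete, here is a minimal sketch of how a shared base config could be specialised per vcluster while keeping separate DLAF parameters for MC and GPU. It is not the code in this PR: the names `ALPS_GH200_BASE`, `derive`, `example_vcluster`, the dictionary keys, and all values are hypothetical and chosen only for illustration.

```python
from copy import deepcopy

# Hypothetical base description shared by all alps-gh200 vclusters.
# Keys and values are illustrative assumptions, not the ones used in the PR.
ALPS_GH200_BASE = {
    "cores": 72,
    "dlaf_params": {
        "mc": {},   # extra DLAF command line options for multicore runs
        "gpu": {},  # extra DLAF command line options for GPU runs
    },
}


def derive(base, overrides):
    """Return a copy of `base` with `overrides` merged in recursively."""
    result = deepcopy(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(result.get(key), dict):
            # Merge nested dictionaries instead of replacing them wholesale.
            result[key] = derive(result[key], value)
        else:
            result[key] = deepcopy(value)
    return result


# A hypothetical vcluster that uses fewer cores and its own GPU defaults.
example_vcluster = derive(
    ALPS_GH200_BASE,
    {
        "cores": 64,
        "dlaf_params": {"gpu": {"--dlaf:red2band-panel-nworkers": 50}},
    },
)
```

With this layout a vcluster only states what differs from the base, and MC and GPU defaults can diverge without touching the generator scripts. Exposing the same values as CLI parameters instead would remain possible on top of it.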

@albestro albestro self-assigned this Dec 20, 2024
@albestro
Collaborator Author

About the number of cores to use: there was the question of whether using all cores might be problematic, e.g. by not leaving enough resources for the system.

In some cases it turned out to be problematic (e.g. due to an OpenMP problem that seems to use one thread more than requested), so here is a quick benchmark.

Below are time plots (lower is better) for the eigensolver. Each series is named by two numbers:

  • first number is how many cores the allocation is given (-c slurm parameter)
  • second number is how many pika threads are used (--pika:threads parameter)

All these benchmarks were run on current master (0.7.0) with --dlaf:red2band-panel-nworkers=50.
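
For reference, the series naming maps to command lines roughly as in the sketch below. Only `-c`, `--pika:threads` and `--dlaf:red2band-panel-nworkers=50` come from the runs described here. The executable name `./miniapp`, the helper names, and the absence of any other srun or miniapp options are assumptions made for illustration, so this is not the actual benchmark script.

```python
import shlex

# (cores given to the allocation, pika threads) pairs compared in the plots.
CONFIGS = [(72, 72), (72, 64), (64, 64)]


def series_label(cores, threads):
    """Name of a series: <cores in allocation>-<pika threads>."""
    return f"{cores}-{threads}"


def build_command(cores, threads, executable="./miniapp"):
    """Build an srun command line for one (cores, threads) pair.

    `executable` is a hypothetical placeholder for the benchmarked miniapp.
    """
    cmd = [
        "srun",
        "-c", str(cores),                     # cores given to the allocation
        executable,
        f"--pika:threads={threads}",          # pika worker threads
        "--dlaf:red2band-panel-nworkers=50",  # setting used for these runs
    ]
    return shlex.join(cmd)


if __name__ == "__main__":
    for cores, threads in CONFIGS:
        print(series_label(cores, threads), "->", build_command(cores, threads))
```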

[4 images: eigensolver time plots]

From a quick look I wouldn't say there is any real difference, except for --local runs, which seem to benefit slightly from having more cores available (72-72 > 72-64 > 64-64).

@msimberg
Collaborator

msimberg commented Jan 7, 2025

> is pika mpi mode 31+1 the best one? should we go for this or keep the default initialised (30+0)? @msimberg @biddisco

I think 31+1 is a good choice for the benchmarks.

@albestro albestro changed the title Update benchmark scripts: basic separation for alps vcluster, number of cores, gpu defaults, ... benchmark scripts: basic separation for alps vcluster, number of cores, gpu defaults, ... Jan 16, 2025
Projects
Status: In Progress

2 participants