
Issue with QM Calculations Terminating Prematurely When Assigned to Individual GPUs #154

ORCAaAaA-ui opened this issue May 15, 2024 · 3 comments · Fixed by #158

@ORCAaAaA-ui

I am experiencing an issue with running QM calculations on individual GPUs. When I assign QM calculations to separate GPUs, the calculations terminate prematurely. This does not happen when running the calculations on a single GPU, and I have ensured that each GPU has sufficient memory.

Any recommended steps to further diagnose and resolve this issue?

I appreciate any assistance or guidance on this issue. Thank you.

@wxj6000 (Collaborator) commented May 22, 2024

@ORCAaAaA-ui How do you assign QM calculations to separate GPUs? Can you share your script here so we can diagnose the problem?

wxj6000 linked a pull request on May 26, 2024 that will close this issue
@wxj6000 (Collaborator) commented May 26, 2024

@ORCAaAaA-ui There are at least two ways to select an individual GPU. 1) Use Docker: you can specify which GPUs are visible when you run docker run. 2) Use CuPy: https://docs.cupy.dev/en/stable/reference/generated/cupy.cuda.Device.html. Note that, for now, you can only import gpu4pyscf modules after the device has been selected.

Once the above PR is merged, one will be able to import GPU4PySCF before the device is selected.
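
For concreteness, here is a minimal sketch of option 2 under the current (pre-PR) import-order requirement; the molecule, basis, and functional are illustrative, not from this thread:

```python
import cupy

# Select the target GPU first. Until the PR above is merged,
# gpu4pyscf modules may only be imported after this call.
cupy.cuda.Device(1).use()

import pyscf
from gpu4pyscf.dft import rks  # imported after the device is selected

# Illustrative water / RKS input; replace with your own system.
mol = pyscf.M(
    atom="O 0 0 0; H 0 0.757 0.587; H 0 -0.757 0.587",
    basis="def2-tzvpp",
)
mf = rks.RKS(mol, xc="b3lyp").density_fit()
e_tot = mf.kernel()
```

For option 1, restricting GPU visibility at container start (e.g. with docker run's --gpus flag, assuming the NVIDIA Container Toolkit) achieves the same isolation without touching the script.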

wxj6000 reopened this on May 28, 2024
@ORCAaAaA-ui (Author)

@wxj6000 I simply executed export CUDA_VISIBLE_DEVICES=0 (or 1) before each job to assign the jobs to separate GPUs.
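
One caveat with that approach: CUDA reads CUDA_VISIBLE_DEVICES when it initializes, so the variable must be set before any CUDA library loads. If the jobs are started from Python rather than the shell, a sketch of the safe ordering (the device id "1" is illustrative):

```python
import os

# Must be set before anything initializes CUDA, i.e. before
# importing cupy or gpu4pyscf in this process.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

import cupy
from gpu4pyscf.dft import rks  # only the chosen GPU is visible now

print(cupy.cuda.runtime.getDeviceCount())  # expect: 1
```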
