Tidy up GPU code, speed up and simplify #40

Open
4 tasks
smutch opened this issue Jul 20, 2020 · 0 comments
Labels
enhancement, open to anyone (Useful contribution open to anyone with the appropriate skills.)

Comments

smutch (Owner) commented Jul 20, 2020

I think we can probably simplify the GPU code quite a bit.

  • Simplify error handling, perhaps by following the examples in the CUDA developer docs.
  • Use streams to improve performance.
  • Get cuFFT working with MPI and multi-GPU (hopefully we can do this if we have one rank orchestrate things).
  • Allow Mhysa to specify which GPU each rank should connect to. That would allow us to do away with the use of MPS.
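
For the last item, here is a minimal sketch of how a rank might pick its GPU. Everything in it is an assumption for illustration: `choose_device` and its parameters are hypothetical names, not existing code, and the round-robin fallback is just one possible policy. It covers only the selection logic, so it compiles without the CUDA toolkit.

```c
#include <assert.h>

/* Hypothetical helper, not taken from the existing code base.
 * requested:  GPU index asked for by the user for this rank, or -1 if unset
 * local_rank: index of this rank among the ranks on the same node
 * n_devices:  number of GPUs visible on the node
 * Returns the GPU index the rank should bind to. */
static int choose_device(int requested, int local_rank, int n_devices)
{
    assert(n_devices > 0);
    assert(local_rank >= 0);

    /* Honour an explicit, in-range request. */
    if (requested >= 0 && requested < n_devices)
        return requested;

    /* Otherwise fall back to round-robin over the visible GPUs. */
    return local_rank % n_devices;
}
```

In a real build the device count would presumably come from `cudaGetDeviceCount` and the returned index would be passed to `cudaSetDevice`. The node-local rank could be obtained from a communicator created with `MPI_Comm_split_type` and `MPI_COMM_TYPE_SHARED`.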
smutch assigned smutch and unassigned smutch Jul 20, 2020
smutch added the open to anyone label Jul 20, 2020