More automatic handling of dask #115

Closed
AndyHoggANU opened this issue Jan 16, 2019 · 1 comment

Comments

@AndyHoggANU
Collaborator

Until recently, I have always used cc.start_cluster() to start up multiple cores. But @angus-g's recent work has shown that we can do better by starting a scheduler with the following protocol:

  • In a terminal on VDI (either over VNC or through SSH and inside screen/tmux), run: dask-scheduler
    This should output the scheduler address, like tcp://10.0.64.24:8786.
  • Now, in another terminal (ensuring that the default conda module has cosima_cookbook installed, as all workers will need access to that), run: dask-worker tcp://10.0.64.24:8786 --memory-limit 4e9 --nprocs 6 --nthreads 1 --local-directory /local/g40/amh157
  • Then, make sure the following cell matches the scheduler address:
client = Client('tcp://10.0.64.24:8786', local_dir='/local/g40/amh157')
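
The two manual edits in the steps above (the scheduler address and the per-user local directory) can both be computed rather than typed. The following is a minimal sketch, not an existing cookbook function: `find_scheduler_address` and `worker_command` are hypothetical names, the `/local/<project>/<user>` layout is assumed from the path in this issue, and the dask-worker flags are copied as quoted above (newer dask releases may name them differently).

```python
import getpass
import re
import shlex

# Matches a scheduler address such as tcp://10.0.64.24:8786
_ADDRESS_RE = re.compile(r"tcp://[\w.\-]+:\d+")


def find_scheduler_address(text):
    """Return the first tcp://host:port address found in text, or None."""
    match = _ADDRESS_RE.search(text)
    return match.group(0) if match else None


def worker_command(address, project, user=None,
                   memory_limit="4e9", nprocs=6, nthreads=1):
    """Build the dask-worker command from this issue as an argument list."""
    user = user or getpass.getuser()
    # Assumption: scratch space follows the /local/<project>/<user> pattern.
    local_directory = f"/local/{project}/{user}"
    return [
        "dask-worker", address,
        "--memory-limit", str(memory_limit),
        "--nprocs", str(nprocs),
        "--nthreads", str(nthreads),
        "--local-directory", local_directory,
    ]


if __name__ == "__main__":
    # Illustrative line only; the scheduler's real log format may differ.
    line = "Scheduler at: tcp://10.0.64.24:8786"
    address = find_scheduler_address(line)
    print(shlex.join(worker_command(address, project="g40", user="amh157")))
```

The argument list can be passed directly to `subprocess.Popen`, which avoids shell quoting problems if a directory name contains spaces.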

I have implemented this in a lot of the access-om2 report notebooks, but it is clunky and requires some manual intervention. For example, whenever I am allocated a different node I have to change the TCP address, and others need to modify the local directory before they can run it.

The ideal solution is a cookbook function that does this for us and takes arguments such as memory-limit and nprocs. Is this possible? It would effectively replace start_cluster() and be easy for everyone to use.
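
One way such a function might look is sketched below. It swaps in a different technique from the manual protocol: instead of launching dask-scheduler and dask-worker in separate terminals, it uses `LocalCluster` from dask.distributed, so the scheduler address never has to be copied by hand. `cluster_options` and `start_scheduler` are hypothetical names, the `/local/<project>/<user>` layout is an assumption taken from the path quoted in this issue, and the keyword names follow recent dask.distributed releases (older ones spelled `local_directory` as `local_dir`).

```python
import getpass


def cluster_options(project, user=None, memory_limit="4GB",
                    nprocs=6, nthreads=1):
    """Translate the dask-worker flags from this issue into LocalCluster keywords."""
    user = user or getpass.getuser()
    return {
        "n_workers": nprocs,               # --nprocs
        "threads_per_worker": nthreads,    # --nthreads
        "memory_limit": memory_limit,      # --memory-limit, per worker
        # Assumption: scratch space follows the /local/<project>/<user> pattern.
        "local_directory": f"/local/{project}/{user}",
    }


def start_scheduler(project, **kwargs):
    """Hypothetical replacement for start_cluster(); returns a connected Client."""
    # Imported lazily so the option handling above works without dask installed.
    from dask.distributed import Client, LocalCluster

    cluster = LocalCluster(**cluster_options(project, **kwargs))
    return Client(cluster)
```

A notebook would then call something like `client = start_scheduler("g40", nprocs=6)`, with the user name picked up from the session rather than hard-coded.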

@AndyHoggANU
Collaborator Author

This is pretty out of date, and could be superseded by #210
