-
Notifications
You must be signed in to change notification settings - Fork 141
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Torque qsub equivalent to SGE qsub -sync or LSF qsub -K #268
Comments
Is it not possible using the qsub -Wdepends feature on PBS/Torque? |
That should work. Make sure keep_competed is set in qmgr. Also, $newjobid
|
no, using job dependencies would not work for this. I'm talking about having the qsub command block until the job you are submitting completes, so instead of having qsub return immediately with a job ID it would block until the job runs. The return value of qsub would be the return value of the job. Both SGE and LSF have this feature, and some workflow tools make use of it. Switching to dependencies would require altering the workflow tool. (my own workflow tool uses dependencies, but some rely on this blocking submission behavior.) |
Glen, So make a qsub that queues and runs the job in one step as far as the user Ken On Thu, Apr 28, 2016 at 8:44 AM, Glen Beane [email protected]
[image: Adaptive Computing] http://www.adaptivecomputing.com |
Yes. qsub --sync (SGE) or bsub -K (LSF) will block until the job finishes, and their return value will be the same as the job. Here is the LSF documentation for bsub -K:
|
We could make this work pretty easily. I will bring it up with the Torque team. |
Using this functionality with Torque would mean you would bypass any scheduling benefits. It would run on a first fit basis. |
Just to point out that currently you can get this to work with PBS using this hack:
|
No Ken, The job should be scheduled the same as any other job. That is, Moab (if you use it) would apply the same prioritization/policies to this job as if it had been submitted without this qsub option. Please read the SGE qsub --sync and LSF bsub -K documentation to see what behavior we are suggesting. If it shortcircuited scheduling and just took the next available slot then that would not be the correct implementation. The only difference is the qsub command blocks waiting for the job to be scheduled and run. Then once the job is finished, qsub will exit with the return value of the job (so if the job script exited with a non-zero value, then qsub will return that same value). It could print a message, similar to qsub -I, saying that "job XXXX waiting to start". |
Glen, Thanks for the clarification. That requires more work. Still doable but we Ken On Wed, Jun 8, 2016 at 7:50 PM, Glen Beane [email protected] wrote:
[image: Adaptive Computing] http://www.adaptivecomputing.com |
Wondering how adding this feature is going? My tool for abstracting different cluster systems has a blocking submission feature stuck on this:CoBrALab/qbatch#103 |
Both SGE and LSF have qsub options that cause the qsub command to wait until the job has completed before returning. Some 3rd party pipeline applications make use of this feature, which makes porting them to Torque more difficult (it necessitates the use of a wrapper script that does the qsub and then polls the job using qstat in a loop until the job finishes).
The text was updated successfully, but these errors were encountered: