[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] MPI on Windows



On 9/8/23 17:51, Sam.Dana@xxxxxxxxxxx wrote:

Looking at condor_config.local.dedicated.submit, in the statement, 

"If your dedicated resources are configured to only run jobs, you should probably set this attribute to '0'", 

does "only run jobs" mean "only run dedicated jobs" to correlate with Policy 1 in condor_config.local.dedicated.resource?


It does, but that's a small optimization.  To run parallel/dedicated jobs, I'd leave UNUSED_CLAIM_TIMEOUT

at the default value of 600 unless you have a good reason to change it, though.


Looking at condor_config.local.dedicated.resource, I wonder: 

      what settings are needed to run MPI apps within HTCondor on Windows?


Generally speaking, the most "High Throughput" way to run MPI jobs is to run a lot of

independent MPI jobs that each run on one node in your pool, perhaps on many cores on one node.

This can be done in the vanilla universe.  If you absolutely must run MPI jobs across multiple

nodes, then you will need to run the parallel universe.


To run MPI jobs on the parallel universe, you'll need scripts to bootstrap the MPI world.  To

be honest, I don't know of anyone who has done this on WIndows in quite some time, and

I don't know how up to date the examples we provide are with any modern version of

MPI for Windows.



-greg



Thanks,

Sam


NOTICE: This email message and all attachments transmitted with it may contain privileged and confidential information, and information that is protected by, and proprietary to, Parsons Corporation, and is intended solely for the use of the addressee for the specific purpose set forth in this communication. If the reader of this message is not the intended recipient, you are hereby notified that any reading, dissemination, distribution, copying, or other use of this message or its attachments is strictly prohibited, and you should delete this message and all copies and backups thereof. The recipient may not further distribute or use any of the information contained herein without the express written authorization of the sender. If you have received this message in error, or if you have any questions regarding the use of the proprietary information contained therein, please contact the sender of this message immediately, and the sender will provide you with further instructions.

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/