
Re: [Condor-users] MPI job submission script configuration

Thanks for your suggestions. That helped!

I did not need specific machines; I only needed the two processes to run on different machines. I added "Requirements = (SlotID == 2)" to the submit file, and Condor assigned 2 different machines to the MPI job and started one process on each of them. It worked as expected.
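
For reference, here is a sketch of the resulting submit file. It assumes everything else stays as in the original script quoted below. The queue line is an assumption, since the quoted script does not show one, and the right SlotID value depends on how slots are numbered in your pool:

  universe = parallel
  executable = /var/spool/condor/etc/examples/mp2script
  arguments = /home/kunal/condor/examples/hello_mpi
  machine_count = 2
  # Each machine has only one slot with a given ID, so the two
  # nodes cannot both match slots on the same machine.
  Requirements = (SlotID == 2)
  +WantIOProxy = True
  Output = hello_mpi.out
  error = hello_mpi.err
  Log = hello_mpi.log
  should_transfer_files = yes
  when_to_transfer_output = on_exit
  +ParallelShutdownPolicy = "WAIT_FOR_ALL"
  transfer_input_files = /home/kunal/condor/examples/hello_mpi
  # Assumed: not shown in the original script.
  queue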

Thanks for your help.


On Fri, Dec 9, 2011 at 5:52 PM, Beaver, Mick <Mick.Beaver@xxxxxxxxxx> wrote:

I believe the manual section “2.9.3 Parallel Jobs with Separate Requirements” may help if you can explicitly specify your machines. If you can’t list machine names, you may be able to get away with doing something like this (since each machine only has one slot with ID 1):


Requirements = (SlotID == 1)
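
If you can name the machines, a rough sketch of the separate-requirements approach might look like the following. The hostnames are placeholders, the common settings are elided, and the manual section is the authoritative reference for the exact form:

  universe = parallel
  executable = /var/spool/condor/etc/examples/mp2script
  # ... other settings common to both nodes ...

  # First node: pinned to one host (placeholder hostname).
  machine_count = 1
  Requirements = (Machine == "machine1.example.com")
  queue

  # Second node: pinned to the other host (placeholder hostname).
  machine_count = 1
  Requirements = (Machine == "machine2.example.com")
  queue

Each machine_count/Requirements/queue group describes one set of nodes within the same parallel job, so the two processes are matched against different machines.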

From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Kunal Rao
Sent: Thursday, December 08, 2011 9:09 AM
To: condor-users@xxxxxxxxxxx
Subject: Re: [Condor-users] MPI job submission script configuration


Any suggestions on the question below?

Thanks & Regards,

On Mon, Dec 5, 2011 at 1:27 PM, Kunal Rao <kunalgrao@xxxxxxxxx> wrote:


  I have set up a Condor pool with 2 dedicated resources (for MPI jobs), one of which also runs the dedicated scheduler. I am running MPI jobs (MPICH2) with, say, 2 processes, and I want these to run on 2 different machines (the ones that are configured as dedicated resources).

  What I am seeing is that Condor creates the 2 processes and puts them on the same machine. How do I specify in the job submission script that these 2 processes should start on different machines?

  My current job submission script is as follows:

  universe = parallel
  executable = /var/spool/condor/etc/examples/mp2script
  arguments = /home/kunal/condor/examples/hello_mpi
  machine_count = 2
  +WantIOProxy = True
  Output = hello_mpi.out
  error = hello_mpi.err
  Log = hello_mpi.log
  should_transfer_files = yes
  when_to_transfer_output = on_exit
  +ParallelShutdownPolicy = "WAIT_FOR_ALL"
  transfer_input_files = /home/kunal/condor/examples/hello_mpi
  queue

  I want to specify something like: start process 1 on machine1 and process 2 on machine2. How do I specify that?

Thanks & Regards,

