
Re: [Condor-users] Condor problem with MPI Jobs

No, just those two input files: infile.0 and infile.1.


Thanks for the tips, I appreciate them. I will try those and let you know.




From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Kewley, J (John)
Sent: Friday, May 19, 2006 12:03 PM
To: Condor-Users Mail List
Subject: Re: [Condor-users] Condor problem with MPI Jobs


BTW does your job have any parameters or input files other than those in INPUT.$(NODE)?





-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of Natarajan, Senthil
Sent: Friday, May 19, 2006 4:24 PM
To: Condor-Users Mail List
Subject: [Condor-users] Condor problem with MPI Jobs


I have posted this a couple of times without a response; hopefully I will get one this time.


I am trying to run an MPI job using Condor 6.6.10 on Windows, with the Condor-supported MPI implementation (MPICH 1.2.4).

The MPICH 1.2.4 libraries are installed properly on the Windows machines, and the path to the libraries is set in the system environment variables. I also configured the condor_config files on the execution nodes as dedicated resources suitable for running MPI jobs, following the Condor documentation.
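
For reference, a dedicated-resource setup of the kind described in the Condor manual looks roughly like the sketch below. The host name is a placeholder, and the exact expressions are an assumption based on the manual's dedicated-scheduler example, not a copy of this pool's actual condor_config.

```
# Sketch only: "submit.example.org" is a hypothetical host name standing in
# for the machine that runs the dedicated scheduler (the submit machine).
DedicatedScheduler = "DedicatedScheduler@submit.example.org"

# Advertise the attribute in the machine ClassAd so the scheduler can claim it.
STARTD_EXPRS = $(STARTD_EXPRS), DedicatedScheduler

# Policy for a node that runs only dedicated (e.g. MPI) jobs.
START        = Scheduler =?= $(DedicatedScheduler)
SUSPEND      = False
CONTINUE     = True
PREEMPT      = False
KILL         = False
WANT_SUSPEND = False
WANT_VACATE  = False
RANK         = Scheduler =?= $(DedicatedScheduler)
```

If the DedicatedScheduler string on an execution node does not match the full host name of the machine the job is submitted from, the job would be expected to stay idle without any error, so that value is worth double-checking.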


When I submit the job, it stays idle. It does not report any error, and it does not even try to contact the execution nodes. I have no clue what is going on.


Could someone please point out what the problem might be? I am also wondering whether the Condor MPI universe is a fully developed feature, and whether it can be used in a real production environment.


universe = MPI
executable = simplempi.exe
#executable = cpi.exe
requirements = Arch == "INTEL" && OpSys == "WINNT51"
log = logfile
input = infile.$(NODE)
output = outfile.$(NODE)
error = errfile.$(NODE)
machine_count = 2
should_transfer_files = yes
when_to_transfer_output = on_exit