I have posted this a couple of times without getting a response; hopefully I will get one this time.
I am trying to run an MPI job using Condor 6.6.10 on Windows, with the Condor-supported MPI implementation (MPICH 1.2.4).
The MPICH 1.2.4 libraries are installed properly on the Windows machines, and the path to the libraries is set in the system environment variables. I also configured the condor_config files on the execution nodes as dedicated resources suitable for running MPI jobs, following the Condor documentation.
When I submit the job, it stays in the idle state. No error is reported, and Condor does not even try to contact the execution nodes. I have no clue what is going on.
Could someone please point out what the problem might be? I am also wondering whether the Condor MPI universe is a fully developed feature, and whether it can be used in a real production environment. My submit description file is below.
universe = MPI
executable = simplempi.exe
#executable = cpi.exe
requirements = Arch == "INTEL" && OpSys == "WINNT51"
log = logfile
input = infile.$(NODE)
output = outfile.$(NODE)
error = errfile.$(NODE)
machine_count = 2
should_transfer_files = yes
when_to_transfer_output = on_exit