
[condor-users] MPI setup


I am having trouble setting up Condor to run MPI jobs.
I set up dedicated nodes following the manual.

When I submit an MPI job using 4 processors, the log file says that
it succeeded, but I got only 2 output files, with the following contents:
---- outfile.0  ----------------------------------------
p0_17594:  p4_error: Child process exited while making connection to remote process on ume05.hpcc.jp: 0
p0_17594: (6.593392) net_send: could not write to fd=4, errno = 32

---- outfile.1  ----------------------------------------
rm_23091: (-) net_recv failed for fd = 3
rm_23091:  p4_error: net_recv read, errno = : 104
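
For reference, a submit description file for this kind of job typically has
roughly the following shape. This is only a sketch: the executable and log
names are placeholders, not taken from the actual job. The $(NODE) macro is
what produces one output file per node (outfile.0, outfile.1, ...), so a
4-node run would normally leave 4 output files.
---- submit file (sketch, placeholder names) -----------
# MPI universe job on dedicated nodes
universe      = MPI
# placeholder: an MPICH-linked binary
executable    = my_mpi_program
# number of nodes (processors) requested
machine_count = 4
# one stdout/stderr file per node, numbered from 0
output        = outfile.$(NODE)
error         = errfile.$(NODE)
# placeholder log file name
log           = logfile
queue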

What am I missing?

Also, which mpich will be used if I have several versions of mpich installed?
