[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] multicore and multinode run



>First, you may need to
>comment out the second mention of MPDIR so it doesn't pull from the
>config:
>
># MPDIR=$(condor_config_val OPENMPI_INSTALL_PATH)


This time I get another error

[mahmood@rocks7 ~]$ cat hellompi.out
WARNING: MOUNT_UNDER_SCRATCH not set in condor_config
WARNING: MOUNT_UNDER_SCRATCH not set in condor_config
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

  Process name: [[62274,1],0]
  Exit code:    1
--------------------------------------------------------------------------
[mahmood@rocks7 ~]$ cat hellompi.err
Not defined: MOUNT_UNDER_SCRATCH
Not defined: MOUNT_UNDER_SCRATCH
[compute-0-1.local:17511] [[62274,1],0] usock_peer_recv_connect_ack: received unexpected process identifier [[62274,0],2] from [[62274,0],1]
[compute-0-1.local:17512] [[62274,1],1] usock_peer_recv_connect_ack: received unexpected process identifier [[62274,0],2] from [[62274,0],1]
[mahmood@rocks7 ~]$



Regards,
Mahmood