[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] multicore and multinode run



> WARNING: MOUNT_UNDER_SCRATCH not set in condor_config
> WARNING: MOUNT_UNDER_SCRATCH not set in condor_config

Were you able to add MOUNT_UNDER_SCRATCH = /tmp to the execute
machines' condor configs? Did you run a condor_reconfig on those
machines?

Jason

On Wed, Jan 24, 2018 at 3:16 PM, Mahmood Naderan <nt_mahmood@xxxxxxxxx> wrote:
>>First, you may need to
>>comment out the second mention of MPDIR so it doesn't pull from the
>>config:
>>
>># MPDIR=$(condor_config_val OPENMPI_INSTALL_PATH)
>
>
> This time I get another error
>
> [mahmood@rocks7 ~]$ cat hellompi.out
> WARNING: MOUNT_UNDER_SCRATCH not set in condor_config
> WARNING: MOUNT_UNDER_SCRATCH not set in condor_config
> -------------------------------------------------------
> Primary job  terminated normally, but 1 process returned
> a non-zero exit code.. Per user-direction, the job has been aborted.
> -------------------------------------------------------
> --------------------------------------------------------------------------
> mpirun detected that one or more processes exited with non-zero status, thus
> causing
> the job to be terminated. The first process to do so was:
>
>   Process name: [[62274,1],0]
>   Exit code:    1
> --------------------------------------------------------------------------
> [mahmood@rocks7 ~]$ cat hellompi.err
> Not defined: MOUNT_UNDER_SCRATCH
> Not defined: MOUNT_UNDER_SCRATCH
> [compute-0-1.local:17511] [[62274,1],0] usock_peer_recv_connect_ack:
> received unexpected process identifier [[62274,0],2] from [[62274,0],1]
> [compute-0-1.local:17512] [[62274,1],1] usock_peer_recv_connect_ack:
> received unexpected process identifier [[62274,0],2] from [[62274,0],1]
> [mahmood@rocks7 ~]$
>
>
>
> Regards,
> Mahmood
>