[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] multicore and multinode run



In the out file, I just see the output from compute-0-1. Why compute-0-0 didn't respond?

[mahmood@rocks7 ~]$ cat hellompi.err
mkdir: cannot create directory '/var/opt/condor/execute/dir_26657/tmp': File exists
mkdir: cannot create directory '/var/opt/condor/execute/dir_26656/tmp': File exists
[mahmood@rocks7 ~]$ cat hellompi.out
Hello world from processor compute-0-1.local, rank 0 out of 2 processors
Hello world from processor compute-0-1.local, rank 1 out of 2 processors
[mahmood@rocks7 ~]$ cat mpi.ht
universe = parallel
executable = openmpiscript
arguments = mpihello
log = hellompi.log
output = hellompi.out
error = hellompi.err
machine_count = 2
queue
[mahmood@rocks7 ~]$ condor_status -af:h Machine DedicatedScheduler
Machine           DedicatedScheduler                         
compute-0-0.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-0.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-1.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-1.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-1.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-1.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
[mahmood@rocks7 ~]$ ssh compute-0-0 'grep MOUNT_UNDER_SCRATCH /opt/condor/etc/condor_config.local'
Warning: untrusted X11 forwarding setup failed: xauth key data not generated
MOUNT_UNDER_SCRATCH=/tmp
[mahmood@rocks7 ~]$




Regards,
Mahmood


On Thursday, January 25, 2018, 2:18:31 PM EST, Jason Patton <jpatton@xxxxxxxxxxx> wrote:


The mkdir error is an annoyance/bug and shouldn't have any effect on
the rest of the script. (This annoyance is fixed in the 8.7.4+.) Did
you get the output you were expecting?

Jason