[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] mpi job stuck as idle



>This config will need to be on all the execute machines that should be
>allowed to run parallel universe jobs, and then condor_reconfig should
>be run on them. The config tells the execute node to trust the submit
>node (what I think you mean by frontend) as the dedicated scheduler
>for parallel universe jobs.


Great :)
compute-0-0 has been added successfully. I can now see that the undefined word is replaced by the dedicated scheduler name.

One more thing. Although I did the same thing on the submit node (rocks7), but I can not see that in the list.



[root@rocks7 etc]# condor_config_val  -config
Configuration source:
    /opt/condor/etc/condor_config
Local configuration sources:
    /opt/condor/etc/config.d/000Rocks.conf
    /opt/condor/etc/config.d/99Rocks.conf
    /opt/condor/etc/config.d/condor_config.local.dedicated.resource
    /opt/condor/etc/condor_config.local

[root@rocks7 etc]# condor_status -af:h Machine DedicatedScheduler
Machine           DedicatedScheduler                         
compute-0-0.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx
compute-0-0.local DedicatedScheduler@xxxxxxxxxxxxxxxxxxxxxxxx



I can now see that the hellompi is running on  compute-0-0

Regards,
Mahmood