
Re: [HTCondor-users] mpi job stuck as idle



Hi,

I don't see the condor_config.local.dedicated.resource file that is mentioned in https://research.cs.wisc.edu/htcondor/manual/current/3_14Setting_Up.html#SECTION004148200000000000000

 

It doesn't exist in any other location either.

What should I do?

 

Regards,

Mahmood

 

 

 

 

From: Mahmood Naderan
Sent: Wednesday, January 17, 2018 8:36 PM
To: HTCondor-Users Mail List; Jason Patton
Subject: Re: [HTCondor-users] mpi job stuck as idle

 

OK. Before making any modifications (section 3.14.8 of the manual), I ran "condor_status -schedd" and "condor_q" and saw:

 

 

[mahmood@rocks7 ~]$ condor_status -schedd
Name                     Machine                  RunningJobs   IdleJobs   HeldJobs

rocks7.vbtestcluster.com rocks7.vbtestcluster.com           0          2          0

                      TotalRunningJobs      TotalIdleJobs      TotalHeldJobs

                    
               Total                 0                  2                  0
[mahmood@rocks7 ~]$ condor_q


-- Schedd: rocks7.vbtestcluster.com : <10.0.3.15:9618?... @ 01/17/18 11:54:39
OWNER   BATCH_NAME                      SUBMITTED   DONE   RUN    IDLE  TOTAL JOB_IDS
mahmood CMD: /opt/openmpi/bin/mpirun   1/17 03:04      _      _      1      1 5.0

1 jobs; 0 completed, 0 removed, 1 idle, 0 running, 0 held, 0 suspended
[mahmood@rocks7 ~]$

 

 

 

My questions are:

1- Is the dedicated scheduler OK? Which part of the output tells me that?

2- Why does the first command report 2 idle jobs while the second command reports 1 idle job?

 

 

Regards,
Mahmood