
Re: [HTCondor-users] Schedd not always appearing on condor_status -schedds



On 11/7/2019 8:10 AM, Stewart Martin-Haugh wrote:
> Hi,
> 
> We noticed that when querying condor_status -schedds
> 

On the same machine where "condor_status -schedd" gives the inconsistent 
results, what does

    condor_config_val -v collector_host

say? Does it list more than one central manager?

The most common cause I've seen for behavior like the below is a site 
that has two or more central managers configured (e.g. with HAD, for 
high availability) while the daemon in question (in this case the schedd 
on condor-ce01) is configured to report to only one central manager 
instead of both.  When you run "condor_status", it queries one of the 
central managers at random (for load balancing), so about 50% of the 
time you see the schedd and 50% of the time you don't.
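
If more than one central manager is listed, one way to confirm this is 
to ask each collector separately and compare what comes back.  Below is 
a rough Python sketch of that comparison, not an official tool.  The 
host names cm1.example.org and cm2.example.org and both function names 
are made up for illustration, and the sample data stands in for real 
query results; query_schedds() assumes condor_status is on your PATH.

```python
import subprocess


def query_schedds(collector):
    """Return the set of schedd names advertised to one collector.

    Runs: condor_status -schedd -pool <collector> -af Name
    """
    out = subprocess.run(
        ["condor_status", "-schedd", "-pool", collector, "-af", "Name"],
        capture_output=True, text=True, check=True,
    ).stdout
    return {line.strip() for line in out.splitlines() if line.strip()}


def missing_by_collector(ads):
    """Map each collector to the schedds it lacks but another collector has."""
    everything = set().union(*ads.values()) if ads else set()
    return {
        cm: sorted(everything - names)
        for cm, names in ads.items()
        if everything - names
    }


# Hypothetical sample data: condor-ce01 reports only to the first collector.
sample = {
    "cm1.example.org": {"arc-ce01", "arc-ce02", "condor-ce01"},
    "cm2.example.org": {"arc-ce01", "arc-ce02"},
}
print(missing_by_collector(sample))
# {'cm2.example.org': ['condor-ce01']}
```

On a real pool you would build the dictionary with something like 
{cm: query_schedds(cm) for cm in hosts}, where hosts are the entries 
from COLLECTOR_HOST.  Any schedd that shows up as missing from one 
collector is a candidate for the misconfiguration described above.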

Hope the above helps,
Todd


> the condor-ce doesn't always appear - if you do it in quick succession I 
> would say only about 50% of the time.
> 
> e.g.
> 
> Name                         Machine                      RunningJobs  IdleJobs  HeldJobs
> 
> arc-ce01................     arc-ce01................            3235       617         0
> arc-ce02................     arc-ce02................            3210       398         0
> arc-ce03................     arc-ce03................            3372       525         0
> arc-ce04................     arc-ce04................            2697       921         0
> arc-ce05................     arc-ce05................            3116       743         0
> condor-ce01................  condor-ce01................            0         0         0
> 
> vs.
> Name                         Machine                      RunningJobs  IdleJobs  HeldJobs
> 
> arc-ce01................     arc-ce01................            3235       617         0
> arc-ce02................     arc-ce02................            3210       398         0
> arc-ce03................     arc-ce03................            3372       525         0
> arc-ce04................     arc-ce04................            2697       921         0
> arc-ce05................     arc-ce05................            3116       743         0
> 
> Is this a known problem?
> 
> Cheers,
> Stewart