
Re: [HTCondor-users] HTCondorCE schedd not found



Hi Maarten,

Similar to HTCondor, HTCondor-CE has two release series:

Development (latest 4.4.0): https://htcondor-ce.readthedocs.io/en/latest/releases/
Stable (latest 3.4.2): https://htcondor-ce.readthedocs.io/en/stable/

HTCondor-CE 4 is tested against HTCondor 8.9.x and you'll need to have the two installed together if you want the latest features (e.g. SciToken/WLCG token support). That being said, we try our best to make sure development versions of HTCondor-CE work with the stable versions of HTCondor (many OSG sites run this way) so I'm a little surprised that a "version mismatch" was the culprit here.

- Brian

On 7/16/20 6:23 AM, Maarten Litmaath wrote:
Hi Thomas, all,
8.8.x (even) is stable, 8.9.y (odd) is development --> big differences can be expected...

For a production service you should typically disable the development repo,
but in this case it appears htcondor-ce-4.4.0 needs something that is not
provided by the latest production release?
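
As a rough illustration of the even/odd rule mentioned above, here is a minimal Python sketch that classifies an HTCondor 8.x version string by the parity of its second component. The function name `release_series` is made up for this example, and the rule is only assumed to hold for the 8.x numbering discussed in this thread.

```python
def release_series(version: str) -> str:
    """Classify an HTCondor 8.x version as 'stable' or 'development'.

    Assumption: an even second component (8.8.x) means stable and an
    odd one (8.9.y) means development, as described in this thread.
    """
    parts = version.split(".")
    minor = int(parts[1])  # second component, e.g. 8 in "8.8.9"
    return "stable" if minor % 2 == 0 else "development"


print(release_series("8.8.9"))  # stable
print(release_series("8.9.7"))  # development
```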

________________________________________
From: HTCondor-users [htcondor-users-bounces@xxxxxxxxxxx] on behalf of Thomas Hartmann [thomas.hartmann@xxxxxxx]
Sent: 16 July 2020 10:35
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] HTCondorCE schedd not found

ah, sorry - forgot about the obvious and did not check the versions...

indeed, the broken 'new' CE was running on condor v8.8.9 and condor-ce
v4.4.0 while the working CE was on condor v8.9.7 and condor-ce v4.4.0 -
so the Condor minor version (security?) apparently caused my troubles.

Anyway, I fixed a minor bug in my repo/package deployment ;)

Cheers and thanks,
   Thomas


[client]
condor-8.9.7-1.el7.x86_64
condor-classads-8.9.7-1.el7.x86_64
condor-external-libs-8.9.7-1.el7.x86_64
condor-procd-8.9.7-1.el7.x86_64
htcondor-ce-client-4.4.0-1.el7.noarch
python2-condor-8.9.7-1.el7.x86_64

[Condor CE broken]
condor-8.8.9-1.el7.x86_64
condor-classads-8.8.9-1.el7.x86_64
condor-external-libs-8.8.9-1.el7.x86_64
condor-procd-8.8.9-1.el7.x86_64
htcondor-ce-4.4.0-1.el7.noarch
htcondor-ce-apel-4.4.0-1.el7.noarch
htcondor-ce-bdii-4.4.0-1.el7.noarch
htcondor-ce-client-4.4.0-1.el7.noarch
htcondor-ce-condor-4.4.0-1.el7.noarch
htcondor-ce-view-4.4.0-1.el7.noarch
python2-condor-8.8.9-1.el7.x86_64

[Condor CE working]
condor-8.9.7-1.el7.x86_64
condor-classads-8.9.7-1.el7.x86_64
condor-external-libs-8.9.7-1.el7.x86_64
condor-procd-8.9.7-1.el7.x86_64
htcondor-ce-4.4.0-1.el7.noarch
htcondor-ce-apel-4.4.0-1.el7.noarch
htcondor-ce-bdii-4.4.0-1.el7.noarch
htcondor-ce-client-4.4.0-1.el7.noarch
htcondor-ce-condor-4.4.0-1.el7.noarch
htcondor-ce-view-4.4.0-1.el7.noarch
python2-condor-8.9.7-1.el7.x86_64
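
A comparison like the one above can be automated. The following is a minimal Python sketch, assuming package names in the `name-version-release.arch` form shown in the lists (for example from `rpm -qa` output saved on each host). The helper names `parse_packages` and `version_diff` are hypothetical, and only a few of the packages are repeated here to keep the example short.

```python
import re

# name-version-release.arch; the name itself may contain hyphens
RPM_RE = re.compile(
    r"^(?P<name>.+)-(?P<version>[^-]+)-(?P<release>[^-]+)\.(?P<arch>[^.]+)$"
)


def parse_packages(lines):
    """Map package name -> version for each line that looks like an RPM name."""
    pkgs = {}
    for line in lines:
        m = RPM_RE.match(line.strip())
        if m:
            pkgs[m.group("name")] = m.group("version")
    return pkgs


def version_diff(a, b):
    """Return {name: (version_in_a, version_in_b)} for packages whose versions differ."""
    return {
        name: (a[name], b[name])
        for name in sorted(a.keys() & b.keys())
        if a[name] != b[name]
    }


broken = parse_packages([
    "condor-8.8.9-1.el7.x86_64",
    "htcondor-ce-4.4.0-1.el7.noarch",
    "python2-condor-8.8.9-1.el7.x86_64",
])
working = parse_packages([
    "condor-8.9.7-1.el7.x86_64",
    "htcondor-ce-4.4.0-1.el7.noarch",
    "python2-condor-8.9.7-1.el7.x86_64",
])

diff = version_diff(broken, working)
print(diff)
```

With the subset above, only the `condor` and `python2-condor` entries show up in the diff, while `htcondor-ce` is identical on both hosts.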


On 15/07/2020 20.31, Mátyás Selmeci wrote:
Hi Thomas,

What are the versions of condor and condor-ce on each machine?

-Mat

On 7/15/20 10:23 AM, Thomas Hartmann wrote:
Hi all,

I have spawned a new CondorCE instance based on a working puppet
manifest - but condor_ce_trace fails to connect to the CE schedd. So the
puppet manifest might be incomplete with something I forgot to add to
the manifest :-/

On the broken CE, I see the client's request in the Collector log [1].
The working CE's log [2] differs by an additional
    filter_private_ads=1
in the Query info, and the responses differ:
   (Sending 0 ads in response to query) [broken]
vs.
   (Sending 1 ads in response to query) [working]

I guess that the 1 class ad returned to the client is some kind of ACK,
which is not returned in the broken case, right?

The condor-ce/condor config files are the same on both machines, so
I am a bit lost as to what breaks the new instance.
Maybe some port I forgot about?

Cheers,
   Thomas


[1]
07/15/20 16:39:54 (Sending 1 ads in response to query)
07/15/20 16:39:54 Query includes collector's self ad
07/15/20 16:39:54 Updating collector stats using a chained ad and config=
07/15/20 16:39:54 Query info: matched=1; skipped=0; query_time=0.000117;
send_time=0.000237; type=Collector; requirements={((true))}; locate=0;
limit=0; from=TOOL; peer=<131.169.223.90:33765>; projection={}
07/15/20 16:39:54 Got QUERY_SCHEDD_ADS
07/15/20 16:39:54 (Sending 0 ads in response to query)
07/15/20 16:39:54 Query info: matched=0; skipped=0; query_time=0.000022;
send_time=0.000029; type=Scheduler;
requirements={((stricmp(Name,"grid-htcondorce0.desy.de") == 0))};
locate=1; limit=0; from=TOOL; peer=<131.169.223.90:43887>;
projection={MyAddress AddressV1 CondorVersion CondorPlatform Name Machine}


[2]
07/15/20 16:36:16 (Sending 1 ads in response to query)
07/15/20 16:36:16 Query includes collector's self ad
07/15/20 16:36:16 Updating collector stats using a chained ad and config=
07/15/20 16:36:16 Query info: matched=1; skipped=0; query_time=0.000101;
send_time=0.000362; type=Collector; requirements={((true))}; locate=0;
limit=0; from=TOOL; peer=<131.169.223.90:41934>; projection={};
filter_private_ads=1
07/15/20 16:36:16 Got QUERY_SCHEDD_ADS
07/15/20 16:36:16 (Sending 1 ads in response to query)
07/15/20 16:36:16 Query info: matched=1; skipped=0; query_time=0.000056;
send_time=0.000061; type=Scheduler;
requirements={((stricmp(Name,"grid-vm08.desy.de") == 0))}; locate=1;
limit=0; from=TOOL; peer=<131.169.223.90:40257>; projection={MyAddress
AddressV1 CondorVersion CondorPlatform Name Machine}; filter_private_ads=1
07/15/20 16:36:16 SECMAN: Succesfully sent DC_SEC_QUERY classad to
<131.169.223.90:33652>!
AuthorizationSucceeded = true
07/15/20 16:36:19 (Sending 0 ads in response to query)
07/15/20 16:36:19 Query info: matched=0; skipped=0; query_time=0.000090;
send_time=0.000039; type=Negotiator; requirements={true}; locate=0;
limit=0; from=COLLECTOR; peer=<131.169.223.234:7993>; projection={};
filter_private_ads=0
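
To compare the "Query info" entries of the two logs above without reading them by eye, a small Python sketch along these lines might help. It assumes the log format shown in this mail (a `MM/DD/YY HH:MM:SS` timestamp at the start of each entry, with mail-wrapped continuation lines). The function names `unwrap` and `query_info` are invented for this example and are not part of HTCondor.

```python
import re

# Each real log entry starts with a timestamp like "07/15/20 16:39:54 "
TIMESTAMP_RE = re.compile(r"^\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2} ")


def unwrap(lines):
    """Join wrapped continuation lines onto the timestamped line before them."""
    entries = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if TIMESTAMP_RE.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += " " + line
    return entries


def query_info(entry):
    """Return selected fields of a 'Query info:' entry, or None for other entries."""
    if "Query info:" not in entry:
        return None
    fields = {}
    for key in ("matched", "type", "filter_private_ads"):
        m = re.search(rf"\b{key}=([^;\s]+)", entry)
        fields[key] = m.group(1) if m else None
    return fields


# Excerpt from log [1] (broken CE), wrapped as in the mail
excerpt = [
    "07/15/20 16:39:54 Got QUERY_SCHEDD_ADS",
    "07/15/20 16:39:54 (Sending 0 ads in response to query)",
    "07/15/20 16:39:54 Query info: matched=0; skipped=0; query_time=0.000022;",
    "send_time=0.000029; type=Scheduler;",
    'requirements={((stricmp(Name,"grid-htcondorce0.desy.de") == 0))};',
    "locate=1; limit=0; from=TOOL; peer=<131.169.223.90:43887>;",
    "projection={MyAddress AddressV1 CondorVersion CondorPlatform Name Machine}",
]

entries = unwrap(excerpt)
for entry in entries:
    info = query_info(entry)
    if info is not None:
        print(info)
```

For the excerpt from the broken CE, the Scheduler query reports `matched=0` and has no `filter_private_ads` field at all, which is the difference noted in the original question.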


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
