Re: [HTCondor-users] CREAM error: Failed to start gahp



Hi Jose,

Yeah, there have been packaging problems since approximately 8.2.3.

I reported something similar here:

https://htcondor-wiki.cs.wisc.edu/index.cgi/tktview?tn=5021

(and GRIF reported it also in Dec 2014, but that doesn't appear to have made it from email to the ticket system).

Unfortunately, I didn't notice that the fix in gt5021 didn't actually fix the problem.

There is a relatively simple work-around.  Try adding this file:

# cat /etc/ld.so.conf.d/condor.conf 
/usr/lib64/condor

and running ldconfig.  That will cause the system loader to add /usr/lib64/condor to its default library search path.
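
In case it helps, here is a rough sketch of the same work-around as two small shell helpers, plus a way to check the result. The function names add_loader_path and count_missing_libs are made up for illustration and are not part of HTCondor. I am also assuming the missing libglobus_* libraries really are under /usr/lib64/condor on your host, which I have not verified.

```shell
# Hypothetical helper (not part of HTCondor): write a loader config file
# that names one extra library directory.
#   $1 = directory holding the loader *.conf files (normally /etc/ld.so.conf.d)
#   $2 = library directory to add (here /usr/lib64/condor)
add_loader_path() {
    conf_dir=$1
    lib_dir=$2
    printf '%s\n' "$lib_dir" > "$conf_dir/condor.conf"
}

# Hypothetical helper: read ldd output on stdin and print how many
# libraries are reported as "not found". grep -c exits non-zero when the
# count is 0, so "|| true" keeps the helper's own exit status at 0.
count_missing_libs() {
    grep -c 'not found' || true
}
```

As root, the sequence would be roughly: add_loader_path /etc/ld.so.conf.d /usr/lib64/condor, then ldconfig, then ldd /usr/sbin/nordugrid_gahp | count_missing_libs. If the count is still above zero, the libraries are probably somewhere other than /usr/lib64/condor.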

Brian

> On Oct 2, 2015, at 8:56 AM, Jose Caballero <jcaballero.hep@xxxxxxxxx> wrote:
> 
> 2015-10-01 17:44 GMT-04:00 Jose Caballero <jcaballero.hep@xxxxxxxxx>:
>> 2015-10-01 16:22 GMT-04:00 Tim Theisen <tim@xxxxxxxxxxx>:
>>> Hi Jose,
>>> 
>>> Please try installing the condor-all rpm. This will pull in the
>>> condor-externals that you are missing.
>>> 
>>> 
>> 
>> Hi,
>> 
>> I am at home now, so let's resume tomorrow.
>> But I can tell you I tried that this afternoon, and it was even worse.
>> There were many missing dependencies (libraries needed by
>> /usr/sbin/nordugrid_gahp).
>> 
>> Cheers
>> Jose
> 
> 
> 
> If I install condor-all on a clean VM, I get this (*)
> Anyways, I just installed condor-all on the same host I was working,
> where I had condor installed previously, and nordugrid_gahp has all
> dependencies.
> New deployment looks like this (**).
> Still same behaviour (***).
> 
> 
> (*)
> # ldd /usr/sbin/nordugrid_gahp
>    linux-vdso.so.1 =>  (0x00007ffff79f2000)
>    libdl.so.2 => /lib64/libdl.so.2 (0x00007fb21d1c7000)
>    libresolv.so.2 => /lib64/libresolv.so.2 (0x00007fb21cfad000)
>    libglobus_ftp_client.so.2 => not found
>    libglobus_ftp_control.so.1 => not found
>    libglobus_gssapi_error.so.2 => not found
>    libglobus_xio.so.0 => not found
>    libglobus_io.so.3 => not found
>    libglobus_rsl.so.2 => not found
>    libglobus_gss_assist.so.3 => not found
>    libglobus_gssapi_gsi.so.4 => not found
>    libglobus_gsi_proxy_core.so.0 => not found
>    libglobus_gsi_credential.so.1 => not found
>    libglobus_gsi_callback.so.0 => not found
>    libglobus_gsi_sysconfig.so.1 => not found
>    libglobus_oldgaa.so.0 => not found
>    libglobus_gsi_cert_utils.so.0 => not found
>    libglobus_openssl.so.0 => not found
>    libglobus_openssl_error.so.0 => not found
>    libglobus_proxy_ssl.so.1 => not found
>    libglobus_callout.so.0 => not found
>    libglobus_common.so.0 => not found
>    libltdl.so.7 => /usr/lib64/libltdl.so.7 (0x00007fb21cd9c000)
>    libcrypto.so.10 => /usr/lib64/libcrypto.so.10 (0x00007fb21c9b9000)
>    libldap-2.4.so.2 => /lib64/libldap-2.4.so.2 (0x00007fb21c769000)
>    liblber-2.4.so.2 => /lib64/liblber-2.4.so.2 (0x00007fb21c559000)
>    libstdc++.so.6 => /usr/lib64/libstdc++.so.6 (0x00007fb21c253000)
>    libm.so.6 => /lib64/libm.so.6 (0x00007fb21bfcf000)
>    libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb21bdb8000)
>    libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fb21bb9b000)
>    libc.so.6 => /lib64/libc.so.6 (0x00007fb21b807000)
>    /lib64/ld-linux-x86-64.so.2 (0x00007fb21d3d5000)
>    libz.so.1 => /lib64/libz.so.1 (0x00007fb21b5f0000)
>    libsasl2.so.2 => /usr/lib64/libsasl2.so.2 (0x00007fb21b3d6000)
>    libssl3.so => /usr/lib64/libssl3.so (0x00007fb21b197000)
>    libsmime3.so => /usr/lib64/libsmime3.so (0x00007fb21af6a000)
>    libnss3.so => /usr/lib64/libnss3.so (0x00007fb21ac2b000)
>    libnssutil3.so => /usr/lib64/libnssutil3.so (0x00007fb21a9ff000)
>    libplds4.so => /lib64/libplds4.so (0x00007fb21a7fa000)
>    libplc4.so => /lib64/libplc4.so (0x00007fb21a5f5000)
>    libnspr4.so => /lib64/libnspr4.so (0x00007fb21a3b7000)
>    libcrypt.so.1 => /lib64/libcrypt.so.1 (0x00007fb21a17f000)
>    librt.so.1 => /lib64/librt.so.1 (0x00007fb219f77000)
>    libfreebl3.so => /lib64/libfreebl3.so (0x00007fb219cfd000)
> 
> 
> (**)
> condor-8.4.0-1.el6.x86_64
> condor-all-8.4.0-1.el6.x86_64
> condor-externals-8.4.0-1.el6.x86_64
> condor-classads-8.4.0-1.el6.x86_64
> condor-bosco-8.4.0-1.el6.x86_64
> condor-vm-gahp-8.4.0-1.el6.x86_64
> condor-external-libs-8.4.0-1.el6.x86_64
> condor-python-8.4.0-1.el6.x86_64
> condor-kbdd-8.4.0-1.el6.x86_64
> condor-std-universe-8.4.0-1.el6.x86_64
> condor-procd-8.4.0-1.el6.x86_64
> condor-cream-gahp-8.4.0-1.el6.x86_64
> 
> 
> (***)
> 10/02/15 09:52:01 [109596] Found job 9487.0 --- inserting
> 10/02/15 09:52:01 [109596] gahp server not up yet, delaying ping
> 10/02/15 09:52:01 [109596] gahp server not up yet, delaying checkDelegation
> 10/02/15 09:52:01 [109596] BaseResource::DoBatchStatus: gahp server
> not up yet, delaying 5 seconds
> 10/02/15 09:52:01 [109596] (9487.0) doEvaluateState called: gmState
> GM_INIT, creamState
> 10/02/15 09:52:01 [109596] GAHP server pid = 109599
> 10/02/15 09:52:07 [109596] (9487.0) doEvaluateState called: gmState
> GM_DELEGATE_PROXY, creamState
> 10/02/15 09:52:08 [109596] resource
> https://ce403.cern.ch:8443/ce-cream/services/CREAM2 is now up
> 10/02/15 09:52:08 [109596] (9487.0) doEvaluateState called: gmState
> GM_SET_LEASE, creamState
> 10/02/15 09:52:08 [109596] (9487.0) doEvaluateState called: gmState
> GM_SET_LEASE, creamState
> 10/02/15 09:52:10 [109596] (9487.0) doEvaluateState called: gmState
> GM_SUBMIT, creamState
> 10/02/15 09:52:12 [109596] (9487.0) doEvaluateState called: gmState
> GM_SUBMIT_SAVE, creamState
> 10/02/15 09:52:12 [109596] GAHP server pid = 109984
> 10/02/15 09:52:12 [109596] GAHP command 'INITIALIZE_FROM_FILE' failed: -1
> 10/02/15 09:52:12 [109596] GAHP: Failed to initialize from file
> 10/02/15 09:52:12 [109596] (9487.0) doEvaluateState called: gmState
> GM_STAGE_IN, creamState
> 10/02/15 09:52:12 [109596] (9487.0) Stage-in failed: Failed to start gahp
> 10/02/15 09:52:17 [109596] No jobs left, shutting down
> 10/02/15 09:52:19 [109596] Got SIGTERM. Performing graceful shutdown.
> 10/02/15 09:52:19 [109596] **** condor_gridmanager
> (condor_GRIDMANAGER) pid 109596 EXITING WITH STATUS 0