
Re: [HTCondor-users] Segfault in HTCondor-CE Schedd



Hi Max,

What's the output of `rpm -q condor htcondor-ce`?

Thanks,
Brian

On 6/24/20 6:16 AM, Fischer, Max (SCC) wrote:
Hi,

For the past few hours, we've been seeing the Schedd of our HTCondor-CE segfault repeatedly. Attached is the stack dump [0]. Does anyone know what might be causing this issue, or how to prevent it?

Cheers,
Max

[0] /var/log/condor-ce/SchedLog
06/24/20 13:11:19 Number of Active Workers 0
Caught signal 11: si_code=2, si_pid=1637082848, si_uid=32527, si_addr=0x6193E6E0
Stack dump for process 324350 at timestamp 1592997084 (34 frames)
/usr/lib64/libcondor_utils_8_8_9.so(dprintf_dump_stack+0x24)[0x7f0f68aed8d4]
/usr/lib64/libcondor_utils_8_8_9.so(_Z17unix_sig_coredumpiP9siginfo_tPv+0x69)[0x7f0f68babe99]
/usr/lib64/libpthread.so.0(+0xf630)[0x7f0f66ef5630]
/usr/lib64/libcrypto.so.10(+0x128b80)[0x7f0f67c70b80]
/usr/lib64/libcrypto.so.10(+0x1263b9)[0x7f0f67c6e3b9]
/usr/lib64/libcrypto.so.10(lh_insert+0x50)[0x7f0f67c6e6a0]
/usr/lib64/libcrypto.so.10(+0x12920a)[0x7f0f67c7120a]
/usr/lib64/libcrypto.so.10(+0x128bcb)[0x7f0f67c70bcb]
/usr/lib64/lcmaps/lcmaps_verify_proxy.mod(verify_init_library+0x1ed)[0x7f0f60a9d63d]
/usr/lib64/lcmaps/lcmaps_verify_proxy.mod(verify_X509_init+0x13)[0x7f0f60a9be13]
/usr/lib64/lcmaps/lcmaps_verify_proxy.mod(+0x452b)[0x7f0f60a9a52b]
/lib64/liblcmaps.so(lcmaps_runEvaluationManager+0xc1)[0x7f0f60cbc631]
/lib64/liblcmaps.so(lcmaps_runPluginManager+0x2d3)[0x7f0f60cb7573]
/lib64/liblcmaps.so(lcmaps_run_and_return_username+0x350)[0x7f0f60caf1d0]
/usr/lib64/liblcas_lcmaps_gt4_mapping.so(llgt_run_lcmaps+0x8e7)[0x7f0f63161be7]
/usr/lib64/liblcas_lcmaps_gt4_mapping.so(lcmaps_callout+0x694)[0x7f0f631623f4]
/usr/lib64/libglobus_callout.so.0(globus_callout_call_type+0x242)[0x7f0f64e89ec2]
/usr/lib64/libglobus_gss_assist.so.3(globus_gss_assist_map_and_authorize+0x6d)[0x7f0f638020bd]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN16Condor_Auth_X50914nameGssToLocalEPKc+0x125)[0x7f0f68b4ad85]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN14Authentication41map_authentication_name_to_canonical_nameEiPKcS1_+0x577)[0x7f0f68b5f167]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN14Authentication19authenticate_finishEP11CondorError+0x281)[0x7f0f68b5fbb1]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN14Authentication21authenticate_continueEP11CondorErrorb+0x1b7)[0x7f0f68b60457]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN8ReliSock21authenticate_continueEP11CondorErrorbPPc+0x26)[0x7f0f68b75a16]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN21DaemonCommandProtocol20AuthenticateContinueEv+0x42)[0x7f0f68bb3792]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN21DaemonCommandProtocol10doProtocolEv+0xd5)[0x7f0f68bb6c75]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN21DaemonCommandProtocol14SocketCallbackEP6Stream+0x8f)[0x7f0f68bb6dcf]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN10DaemonCore24CallSocketHandler_workerEibP6Stream+0x624)[0x7f0f68bc82c4]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN10DaemonCore35CallSocketHandler_worker_demarshallEPv+0x1d)[0x7f0f68bc837d]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN13CondorThreads8pool_addEPFvPvES0_PiPKc+0x35)[0x7f0f689f0835]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN10DaemonCore17CallSocketHandlerERib+0x167)[0x7f0f68bc4337]
/usr/lib64/libcondor_utils_8_8_9.so(_ZN10DaemonCore6DriverEv+0x1d76)[0x7f0f68bcdaa6]
/usr/lib64/libcondor_utils_8_8_9.so(_Z7dc_mainiPPc+0x13a9)[0x7f0f68baf749]
/usr/lib64/libc.so.6(__libc_start_main+0xf5)[0x7f0f66b3a555]
condor_schedd[0x4369a7]
