[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Condor Collector died




This is a known bug that was fixed in 8.4.4. 


From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of 刘岚 <liulan199331@xxxxxxx>
Sent: Monday, February 29, 2016 2:57 AM
To: HTCondor-Users Mail List
Subject: [HTCondor-users] Condor Collector died
 
Hi all,

When I use "condor_restart" to restart condor, the collector always got something wrong and got restarted automatically, then it can work properly. 

The error displayed in MasterLog said:
The COLLECTOR  died due to signal 11 (Segmentation fault)

And the information in CollectorLog said:

02/25/16 08:21:35 CollectorAd  : Inserting ** "< My Pool - 10.1.1.103@xxxxxxxxxx >"

02/25/16 08:21:35 stats: Inserting new hashent for 'Collector':'My Pool - 10.1.1.103@xxxxxxxxxx':'10.1.1.103'

Stack dump for process 1462089 at timestamp 1456406515 (11 frames)

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(dprintf_dump_stack+0x12d)[0x7ffccdb2211d]

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_Z18linux_sig_coredumpi+0x40)[0x7ffccdc723c0]

/lib64/libpthread.so.0(+0xf790)[0x7ffccc116790]

/lib64/libc.so.6(+0x1376fa)[0x7ffccbeaa6fa]

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN11DCCollector10sendUpdateEiPN14compat_classad7ClassAdER22DCCollectorAdSequencesS2_b+0x292)[0x7ffccdc4d5b2]

condor_collector(_ZN15CollectorDaemon15sendCollectorAdEv+0x4a4)[0x456074]

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN12TimerManager7TimeoutEPiPd+0x3d3)[0x7ffccdc7bb03]

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN10DaemonCore6DriverEv+0xc03)[0x7ffccdc8ce33]

/usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_Z7dc_mainiPPc+0x1799)[0x7ffccdc74559]

/lib64/libc.so.6(__libc_start_main+0xfd)[0x7ffccbd91d5d]

condor_collector[0x40eab9]


Any help will be appreciated.

Kathy