[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Condor 7.4.0 on Debian stable - startd dies with "segmentation fault"



Andreas Hirczy wrote:
> Hi!
> 
> As you might be aware I build unofficial condor packages for Debian Linux
> from sources. Up to version 7.3.1 this worked with some tweaking.  It is now
> possible to compile version 7.4.0 without disabling any features.
> 
> Unfortunately now "startd" dies with a "segmentation fault".
> 
> You can find this condor package for testing purposes on
> <http://itp.tugraz.at/Comp/debian/dists/lenny/test/binary-i386/condor_7.4.0-1_i386.deb>;
> for the moment /etc/condor/condor_config has to be adjusted by hand.
> 
> Thanks
> Andreas
> 
> 
> #######################################################################################
> 
> This is an automated email from the Condor system
> on machine "faeppc40.tu-graz.ac.at".  Do not reply.
> 
> "/usr//sbin/condor_startd" on "faeppc40.tu-graz.ac.at" died due to signal 11 (Segmentation fault).
> Condor will automatically restart this process in 10 seconds.
> 
> *** Last 20 line(s) of file /var/log/condor/StartLog:
> 11/19 12:13:32 ******************************************************
> 11/19 12:13:32 Using config source: /etc/condor/condor_config
> 11/19 12:13:32 Using local config sources: 
> 11/19 12:13:32    /etc/condor/condor_config.local
> 11/19 12:13:32 DaemonCore: Command Socket at <129.27.161.70:44883>
> 11/19 12:13:32 fgets failed
> Stack dump for process 6981 at timestamp 1258629212 (13 frames)
> condor_startd(dprintf_dump_stack+0x4a)[0x816b7aa]
> condor_startd(_Z18linux_sig_coredumpi+0x27)[0x815fd17]
> [0xffffe400]
> condor_startd(_ZNK18UnixNetworkAdapter7getNameER5ifreqPKc+0x1b)[0x81cef7b]
> condor_startd(_ZN19LinuxNetworkAdapter14getAdapterInfoEv+0x6d)[0x81c00ad]
> condor_startd(_ZN18UnixNetworkAdapter10initializeEv+0x5d)[0x81ceead]
> condor_startd(_ZN18NetworkAdapterBase12doInitializeEv+0x10)[0x81bf870]
> condor_startd(_ZN18NetworkAdapterBase20createNetworkAdapterEPKcb+0xae)[0x81bfd2e]
> condor_startd(_ZN6ResMgrC1Ev+0x121)[0x81135a1]
> condor_startd(_Z9main_initiPPc+0xe0)[0x81247e0]
> condor_startd(main+0x1022)[0x8161592]
> /lib/libc.so.6(__libc_start_main+0xe5)[0xb7519455]
> condor_startd(_ZNSt8ios_base4InitD1Ev+0x79)[0x8106b41]
> *** End of file StartLog
> #######################################################################################

As an immediate fix you can configure with --disable-hibernation.

If you can get a core it should be easy to resolve this. Nick will certainly be interested.

Best,


matt