[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Condor 8.4.3 on Windows 7: MASTER core dump file generated



Looks like the master failed to recover from the unexpected death of the condor_procd. 

Do you have any idea with the condor_procd went away?

 

-tj

 

From: HTCondor-users [mailto:htcondor-users-bounces@xxxxxxxxxxx] On Behalf Of Stub
Sent: Friday, January 22, 2016 12:14 AM
To: Condor-users <htcondor-users@xxxxxxxxxxx>
Subject: [HTCondor-users] Condor 8.4.3 on Windows 7: MASTER core dump file generated

 

This is the condor_version output:

 

$CondorVersion: 8.4.3 Dec 15 2015 BuildID: 352143 $

$CondorPlatform: x86_64_Windows7 $

 

This is the contents of C:\condor\log\core.MASTER.WIN32:

 

//=====================================================

PID: 4560

Exception code: C0000005 ACCESS_VIOLATION

Fault address:  004AB166 01:0010A166 C:\condor\bin\condor_master.exe

 

Registers:

EAX:0118F173

EBX:005AF2B0

ECX:00000000

EDX:00000013

ESI:00747EB8

EDI:00000000

CS:EIP:001B:004AB166

SS:ESP:0023:0118F154  EBP:0118F158

DS:0023  ES:0023  FS:003B  GS:0000

Flags:00010206

 

Call stack:

Address   Frame

004AB166  0118F158  ProcFamilyClient::kill_family (c:\condor\execute\dir_1432\userdir\src\condor_procd\proc_family_client.cpp:586)

0049E938  0118F174  ProcFamilyProxy::kill_family (c:\condor\execute\dir_1432\userdir\src\condor_utils\proc_family_proxy.cpp:303)

003EB827  0118F180  DaemonCore::Kill_Family (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core.cpp:8522)

003AD9D6  0118F18C  daemon::KillFamily (c:\condor\execute\dir_1432\userdir\src\condor_master.v6\masterdaemon.cpp:1417)

003AD3B2  0118F194  daemon::HardKill (c:\condor\execute\dir_1432\userdir\src\condor_master.v6\masterdaemon.cpp:1156)

003AD41E  0118F1A4  Daemons::HardKillAllDaemons (c:\condor\execute\dir_1432\userdir\src\condor_master.v6\masterdaemon.cpp:2225)

003A828D  0118F1A8  DoCleanup (c:\condor\execute\dir_1432\userdir\src\condor_master.v6\master.cpp:232)

003B3C2C  0118F3BC  _EXCEPT_ (c:\condor\execute\dir_1432\userdir\src\condor_utils\except.cpp:91)

0049EB58  0118F3D0  ProcFamilyProxy::recover_from_procd_error (c:\condor\execute\dir_1432\userdir\src\condor_utils\proc_family_proxy.cpp:678)

0049E9AF  0118F3DC  ProcFamilyProxy::procd_reaper (c:\condor\execute\dir_1432\userdir\src\condor_utils\proc_family_proxy.cpp:699)

003DF4F4  0118F3FC  DaemonCore::CallReaper (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core.cpp:9413)

003E7FA5  0118F438  DaemonCore::HandleProcessExit (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core.cpp:9518)

003E7D20  0118F45C  DaemonCore::HandleDC_SERVICEWAITPIDS (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core.cpp:9062)

003E4ABC  0118F994  DaemonCore::Driver (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core.cpp:3390)

003D60B2  0118FA30  dc_main (c:\condor\execute\dir_1432\userdir\src\condor_daemon_core.v6\daemon_core_main.cpp:2780)

003B3611  0118FA40  ServiceMain (c:\condor\execute\dir_1432\userdir\src\condor_master.v6\service.windows.cpp:429)

76E775A8  0118FA54  I_ScIsSecurityProcess+269

75BDEE6C  0118FA60  BaseThreadInitThunk+12

775C3AB3  0118FAA0  RtlInitializeExceptionChain+EF

775C3A86  0118FAB8  RtlInitializeExceptionChain+C2

 

//=====================================================