
Re: [Condor-users] Unplugging network causes condor to dump core




This is indeed a bug in Condor 7.0.0. Condor 7.0.1 contains a fix for this problem. We are in the final stages of preparing to release it.

--Dan

Rob de Graaf wrote:

Hello,

I seem to have run into a problem with the latest Condor release, 7.0.0,
for WinXP. If I unplug the network cable while Condor is running, it will
attempt to bind to every port in the defined range, find that it can't,
and dump core.
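
[Editorial sketch] The probing pattern described above, and visible in the MasterLog further down (9681 up to 9700, then wrapping to 9600 and continuing to 9680), can be illustrated roughly as follows. This is not Condor's code: `probe_order`, `try_bind_udp`, and `bind_within` are hypothetical names, and the starting port is simply passed in. The point of the sketch is that exhausting the range is reported to the caller as `None` instead of being treated as fatal.

```python
import socket
from typing import Callable, Iterator, Optional


def probe_order(low: int, high: int, start: int) -> Iterator[int]:
    """Yield each port in [low, high] exactly once, starting at `start`
    and wrapping around to `low` after `high`."""
    span = high - low + 1
    for i in range(span):
        yield low + (start - low + i) % span


def try_bind_udp(host: str, port: int) -> Optional[socket.socket]:
    """Return a UDP socket bound to (host, port), or None if bind() fails."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        sock.bind((host, port))
        return sock
    except OSError:
        # On Windows this is where errors such as 10048 or 10049 surface.
        sock.close()
        return None


def bind_within(host: str, low: int, high: int, start: int,
                try_bind: Callable = try_bind_udp):
    """Try every port in the range; return the first successful bind,
    or None when no port in the range can be bound."""
    for port in probe_order(low, high, start):
        sock = try_bind(host, port)
        if sock is not None:
            return sock
    return None
```

A caller would then check the result, for example `if bind_within(host, 9600, 9700, 9681) is None:` and skip or retry the update later.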

Attached are the core.MASTER.WIN32 and MasterLog.

Is this a misconfiguration on my part, or could it be a bug in Condor?

Thanks,

Rob de Graaf

------------------------------------------------------------------------

2/26 12:23:00 Sock::bindWithin - failed to bind to port 9662: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9663: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9664: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind any port within (9600 ~ 9700)
2/26 12:23:00 SafeSock::connect bind() failed: _state = 1
2/26 12:23:00 PidWatcher thread couldn't notify main thread (exited_pid=2740)
2/26 12:23:00 The STARTD (pid 2740) died due to exception ACCESS_VIOLATION
2/26 12:23:00 Sending obituary for "C:\Progra~1\Condor/bin/condor_startd.exe"
2/26 12:23:00 restarting C:\Progra~1\Condor/bin/condor_startd.exe in 10 seconds
2/26 12:23:00 get_port_range - (LOWPORT,HIGHPORT) is (9600,9700).
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9681: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9682: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9683: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9684: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9685: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9686: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9687: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9688: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9689: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9690: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9691: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9692: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9693: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9694: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9695: WSAError = 10049
2/26 12:23:00 Sock::bindWithin - failed to bind to port 9696: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9697: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9698: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9699: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9700: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9600: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9601: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9602: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9603: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9604: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9605: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9606: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9607: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9608: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9609: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9610: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9611: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9612: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9613: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9614: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9615: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9616: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9617: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9618: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9619: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9620: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9621: WSAError = 10048
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9622: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9623: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9624: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9625: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9626: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9627: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9628: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9629: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9630: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9631: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9632: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9633: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9634: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9635: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9636: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9637: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9638: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9639: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9640: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9641: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9642: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9643: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9644: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9645: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9646: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9647: WSAError = 10049
2/26 12:23:01 Sock::bindWithin - failed to bind to port 9648: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9649: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9650: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9651: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9652: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9653: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9654: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9655: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9656: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9657: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9658: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9659: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9660: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9661: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9662: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9663: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9664: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9665: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9666: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9667: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9668: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9669: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9670: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9671: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9672: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9673: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9674: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9675: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9676: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9677: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9678: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9679: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind to port 9680: WSAError = 10049
2/26 12:23:02 Sock::bindWithin - failed to bind any port within (9600 ~ 9700)
2/26 12:23:02 SafeSock::connect bind() failed: _state = 1
2/26 12:23:02 Failed to start non-blocking update to unknown.
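
[Editorial sketch] For readers decoding the log above: Winsock error 10049 is WSAEADDRNOTAVAIL (the requested local address is not available) and 10048 is WSAEADDRINUSE (the address or port is already in use). The former fits a machine whose network interface has just lost its address; the single 10048 on port 9621 suggests that port was held by another socket, though the log alone does not prove either reading. A small, hypothetical helper for tallying these codes from a MasterLog excerpt might look like this:

```python
import re
from collections import Counter

# Symbolic names for the two Winsock codes that appear in this log.
WINSOCK_NAMES = {10048: "WSAEADDRINUSE", 10049: "WSAEADDRNOTAVAIL"}

# Matches lines such as:
#   2/26 12:23:01 Sock::bindWithin - failed to bind to port 9621: WSAError = 10048
BIND_FAILURE = re.compile(r"failed to bind to port (\d+): WSAError = (\d+)")


def summarize_bind_failures(log_text: str) -> dict:
    """Count per-port bind failures in the log, grouped by error name.
    Codes without a known name are keyed by their number as a string."""
    tally = Counter()
    for match in BIND_FAILURE.finditer(log_text):
        tally[int(match.group(2))] += 1
    return {WINSOCK_NAMES.get(code, str(code)): count
            for code, count in tally.items()}
```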

------------------------------------------------------------------------

//=====================================================
Exception code: C0000005 ACCESS_VIOLATION
Fault address:  00428A16 01:00027A16 C:\Progra~1\Condor\bin\condor_master.exe

Registers:
EAX:00000000
EBX:00000014
ECX:00000000
EDX:009F5B68
ESI:00000000
EDI:00BC4970
CS:EIP:001B:00428A16
SS:ESP:0023:00AFF884  EBP:00AFF8D0
DS:0023  ES:0023  FS:003B  GS:0000
Flags:00010216

Call stack:
Address   Frame
00428A16  00AFF888  UpdateData::startUpdateCallback+62
00418AAA  00AFF8D0  Daemon::startCommand+CA
00418EBC  00AFF8FC  Daemon::startCommand_nonblocking+25
004288CF  00AFF934  DCCollector::sendUDPUpdate+68
004287E0  00AFF9D0  DCCollector::sendUpdate+247
0041B631  00AFF9F4  CollectorList::sendUpdates+4D
00442E30  00AFFA18  DaemonCore::sendUpdates+10B
00403713  00AFFA34  Daemons::UpdateCollector+5E
0040151E  00AFFA44  daemon::Restart+E2
004033D3  00AFFA58  Daemons::DefaultReaper+79
00441A41  00AFFA74  DaemonCore::CallReaper+97
00441BF0  00AFFA9C  DaemonCore::HandleProcessExit+16C
00441405  00AFFAB8  DaemonCore::HandleDC_SERVICEWAITPIDS+28
0043C6D3  00AFFF28  DaemonCore::Driver+1EE
0044560A  00AFFF90  dc_main+B79
00404F47  00AFFFA0  ServiceMain+5B
77DEB48B  00AFFFB4  CryptVerifySignatureW+29
7C80B683  00AFFFEC  GetModuleFileNameA+1B4
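
[Editorial sketch] The thread does not say what the 7.0.1 fix changed. The stack shows the fault inside UpdateData::startUpdateCallback, reached from Daemon::startCommand right after the bind failure, and the zeroed EAX, ECX, and ESI registers are at least consistent with a null pointer being used. Purely as an illustration of that general failure mode, and not as Condor code (every name below is hypothetical), a completion callback that can be invoked after a failed connection attempt has to check for a missing socket before touching it:

```python
class FakeSock:
    """Stand-in for a connected socket; records what was sent."""

    def __init__(self):
        self.sent = []

    def send(self, payload):
        self.sent.append(payload)


def start_command(connect, callback):
    """Attempt a connection, then always invoke the callback,
    passing sock=None when the attempt failed."""
    sock = connect()  # returns None on failure (e.g. no port could be bound)
    callback(sock is not None, sock)


def make_update_callback(payload, results):
    """Build a callback that sends `payload` only when a socket exists."""
    def callback(success, sock):
        if not success or sock is None:
            # Failure path: report and return without dereferencing sock.
            results.append("update dropped: no socket")
            return
        sock.send(payload)
        results.append("update sent")
    return callback
```

With `connect` returning `None`, the callback records the dropped update and returns; with a real socket object it sends the payload.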
------------------------------------------------------------------------
