[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] DAG condor_schedd crash on windows



A little snippet of a new crash log:

--
This is an automated email from the Condor system
on machine "gromit2.digicpictures.local".  Do not reply.

"C:\Condor/bin/condor_schedd.exe" on "gromit2.digicpictures.local" exited with status 4.
Condor will automatically restart this process in 10 seconds.

*** Last 20 line(s) of file SchedLog:
10/20 10:44:17 Got VACATE_SERVICE from <192.168.0.110:4238>
10/20 10:44:17 mrec for "<192.168.0.110:1102>#1129559986#1096" not found -- match not deleted
10/20 10:44:17 DaemonCore: Command received via TCP from host <192.168.0.64:2980>
10/20 10:44:17 DaemonCore: received command 1111 (QMGMT_CMD), calling handler (handle_q)
10/20 10:44:17 sspi_server_auth() entered
10/20 10:44:17 sspi_server_auth() looping
10/20 10:44:17 sspi_server_auth(): user name is: "commonrender"
10/20 10:44:17 sspi_server_auth(): domain name is: "DIGICPICTURES"
10/20 10:44:17 sspi_server_auth() exiting
10/20 10:44:17 Inserting new attribute ImageSize into non-active cluster cid=2046 acid=-1
10/20 10:44:17 Inserting new attribute LastJobLeaseRenewal into non-active cluster cid=2046 acid=-1
10/20 10:44:17 Inserting new attribute LastVacateTime into non-active cluster cid=2046 acid=-1
10/20 10:44:17 Inserting new attribute BytesSent into non-active cluster cid=2046 acid=-1
10/20 10:44:17 Inserting new attribute BytesRecvd into non-active cluster cid=2046 acid=-1
10/20 10:44:17 condor_read(): Socket closed when trying to read buffer
10/20 10:44:17 QMGR Connection closed
10/20 10:44:17 ERROR "ERROR no job status for 2064.0 in child_exit()!" at line 8157 in file ..\src\condor_schedd.V6\schedd.C

10/20 10:44:17 ScheddCronMgr: Bye
10/20 10:44:17 CronMgr: bye
10/20 10:44:17 Canceling timer for SelfDrainingQueue job_is_finished_queue (timer id: 207)
*** End of file SchedLog