
Re: [Condor-users] Antwort: Re: Fault Behaviour of Condor



Hi Matt,

Thank you for your reply.

> Assuming you mean
> 001 (290.000.000) 08/03 13:41:31 Job executing on host: <192.168.0.2:3817>
>
> about 60 secs later pull the NIC

Yes, I pulled the NIC after 13:41:31.

> Have you set these to numbers which would give you 2 hours delay?

No. I checked my config file, and these parameters are all commented out:
#POLLING_INTERVAL=5
#ALIVE_INTERVAL = 300
#MAX_SHADOW_EXCEPTIONS = 5
Is this wrong?

> What is the schedd/shadow log indicating during this time?

I have attached the schedd and shadow logs. (The cluster ID of the job is 290.0.)
Please check these files.

Thanks,
Kohei

----- Original Message -----
From: "Matt Hope" <matthew.hope@xxxxxxxxx>
To: "Condor-Users Mail List" <condor-users@xxxxxxxxxxx>
Sent: Thursday, August 10, 2006 12:31 AM
Subject: Re: [Condor-users] Antwort: Re: Fault Behaviour of Condor


On 8/8/06, Nomura Kohei <kh-nomura@xxxxxxxxx> wrote:
>> 3.) Shutting down the NIC on the executor

> I have done the same as 3) on my Condor pool.
> My Condor pool consists of 3 Windows machines running v6.8.0.
> The job was successfully rescheduled and ran.
>
> See the attached log file of the job.
> I have set JobLeaseDuration to 60 seconds,
> but it took 2 hours from shutting down the NIC to rescheduling.
> (I cut the NIC immediately after the job started executing.)
>
> Does JobLeaseDuration work effectively?

Assuming you mean
001 (290.000.000) 08/03 13:41:31 Job executing on host: <192.168.0.2:3817>

about 60 secs later pull the NIC

022 (290.000.000) 08/03 15:41:36 Job disconnected, attempting to reconnect
  Socket between submit and execute hosts closed unexpectedly
  Trying to reconnect to vm1@xxxxxxxxxxxxxxxxxxx <192.168.0.2:3817>
024 (290.000.000) 08/03 15:41:37 Job reconnection failed
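
As a quick check of the "2 hours" figure, the gap between event 001 and event 022 can be computed directly from the timestamps quoted above. This is only a sketch: the year 2006 is taken from the message date, and the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the job log events quoted above (year assumed 2006).
FMT = "%Y %m/%d %H:%M:%S"
executing = datetime.strptime("2006 08/03 13:41:31", FMT)     # event 001
disconnected = datetime.strptime("2006 08/03 15:41:36", FMT)  # event 022

# Time between the job starting and the shadow reporting the disconnect.
gap = disconnected - executing
print(gap)                  # 2:00:05
print(gap.total_seconds())  # 7205.0
```

The disconnect is therefore logged almost exactly two hours after the job started, which is far longer than the 60-second lease.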

Then that looks a bit bad. The following is all off the top of my head, but I
think it is still valid for the latest versions...

POLLING_INTERVAL is for the startd, but ALIVE_INTERVAL is how often the
schedd sends a keep-alive. I think the default is 5 minutes (the setting is
in seconds).
MAX_SHADOW_EXCEPTIONS then determines how many times an error can occur
before it gives up, so in theory MAX_SHADOW_EXCEPTIONS * ALIVE_INTERVAL
should determine how long the schedd keeps a bad claim before giving up on
it (if there is no lease logic allowing an extension).
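
To put rough numbers on that rule of thumb, here is a minimal sketch. It assumes the commented-out values from Kohei's config file (ALIVE_INTERVAL = 300, MAX_SHADOW_EXCEPTIONS = 5) are the ones actually in effect, which the thread does not confirm, and it takes the formula at face value rather than as documented Condor behaviour.

```python
# Values from the commented-out lines in Kohei's config file; assumed here
# to be the values in effect (an assumption, not confirmed in the thread).
ALIVE_INTERVAL = 300       # seconds between schedd keep-alives
MAX_SHADOW_EXCEPTIONS = 5  # errors tolerated before giving up

# Rule of thumb from the paragraph above: time before a bad claim is dropped.
budget_s = MAX_SHADOW_EXCEPTIONS * ALIVE_INTERVAL
reported_delay_s = 2 * 60 * 60  # the "2 hours" reported in this thread

print(budget_s, "seconds =", budget_s / 60, "minutes")  # 1500 seconds = 25.0 minutes
print(reported_delay_s / budget_s)                      # 4.8
```

Under these assumptions the expected worst case is about 25 minutes, so a 2-hour delay is nearly five times longer than this estimate would allow.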

Have you set these to numbers which would give you 2 hours delay?

If not, this suggests there might be a real issue.

What is the schedd/shadow log indicating during this time?

Matt

Attached schedd log:

8/3 13:41:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 13:41:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 13:46:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 13:46:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 13:51:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 13:51:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 13:56:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 13:56:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:01:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:01:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:06:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:06:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:11:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:11:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:16:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:16:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:21:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:21:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:26:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:26:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:31:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:31:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:36:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:36:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:41:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:41:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:46:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:46:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:51:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:51:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 14:56:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 14:56:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:01:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:01:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:06:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:06:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:11:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:11:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:16:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:16:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:21:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:21:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:26:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:26:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:31:30 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:31:30 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:34:34 (pid:2120) condor_read(): recv() returned -1, errno = 10054, assuming failure.
8/3 15:34:34 (pid:2120) get_file: Zero-length file check failed!
8/3 15:34:34 (pid:2120) Failed to receive file from client in SendSpoolFile.
8/3 15:34:34 (pid:2120) condor_read(): recv() returned -1, errno = 10054, assuming failure.
8/3 15:34:53 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2303>
8/3 15:34:53 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:34:53 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:34:53 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:34:53 (pid:2120) Called reschedule_negotiator()
8/3 15:34:53 (pid:2120) Activity on stashed negotiator socket
8/3 15:34:53 (pid:2120) Negotiating for owner: condor_inst@xxxxxxxxxxx
8/3 15:34:53 (pid:2120) Checking consistency running and runnable jobs
8/3 15:34:53 (pid:2120) Tables are consistent
8/3 15:34:53 (pid:2120) Out of jobs - 1 jobs matched, 0 jobs idle, flock level = 0
8/3 15:34:54 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2318>
8/3 15:34:54 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:34:54 (pid:2120) Called reschedule_negotiator()
8/3 15:34:55 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2324>
8/3 15:34:55 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:34:55 (pid:2120) Called reschedule_negotiator()
8/3 15:34:56 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2328>
8/3 15:34:56 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:34:56 (pid:2120) Called reschedule_negotiator()
8/3 15:34:57 (pid:2120) Starting add_shadow_birthdate(292.0)
8/3 15:34:57 (pid:2120) Started shadow for job 292.0 on "<192.168.0.1:1043>", (shadow pid = 3472)
8/3 15:34:57 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2337>
8/3 15:34:57 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:34:57 (pid:2120) Called reschedule_negotiator()
8/3 15:34:58 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:34:58 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:35:07 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2354>
8/3 15:35:07 (pid:2120) DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
8/3 15:35:07 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:35:07 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:35:07 (pid:2120) Called reschedule_negotiator()
8/3 15:35:14 (pid:2120) Activity on stashed negotiator socket
8/3 15:35:14 (pid:2120) Negotiating for owner: condor_inst@xxxxxxxxxxx
8/3 15:35:14 (pid:2120) Checking consistency running and runnable jobs
8/3 15:35:14 (pid:2120) Tables are consistent
8/3 15:35:14 (pid:2120) attempt to add pre-existing match "<192.168.0.1:1043>#1154510215#2" ignored
8/3 15:35:14 (pid:2120) Out of servers - 2 jobs matched, 3 jobs idle, 1 jobs rejected
8/3 15:35:18 (pid:2120) Starting add_shadow_birthdate(293.0)
8/3 15:35:18 (pid:2120) Started shadow for job 293.0 on "<192.168.0.1:1043>", (shadow pid = 3636)
8/3 15:35:20 (pid:2120) Starting add_shadow_birthdate(294.0)
8/3 15:35:20 (pid:2120) Started shadow for job 294.0 on "<192.168.0.1:1043>", (shadow pid = 3920)
8/3 15:35:20 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:35:20 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:35:59 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2428>
8/3 15:35:59 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:35:59 (pid:2120) Shadow pid 3472 for job 292.0 exited with status 100
8/3 15:36:01 (pid:2120) Starting add_shadow_birthdate(295.0)
8/3 15:36:01 (pid:2120) Started shadow for job 295.0 on "<192.168.0.1:1043>", (shadow pid = 4092)
8/3 15:36:01 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:36:01 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:36:21 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2464>
8/3 15:36:21 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:36:21 (pid:2120) Shadow pid 3636 for job 293.0 exited with status 100
8/3 15:36:23 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2472>
8/3 15:36:23 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:36:23 (pid:2120) Shadow pid 3920 for job 294.0 exited with status 100
8/3 15:36:23 (pid:2120) Starting add_shadow_birthdate(296.0)
8/3 15:36:23 (pid:2120) Started shadow for job 296.0 on "<192.168.0.1:1043>", (shadow pid = 2960)
8/3 15:36:25 (pid:2120) Starting add_shadow_birthdate(297.0)
8/3 15:36:25 (pid:2120) Started shadow for job 297.0 on "<192.168.0.1:1043>", (shadow pid = 3512)
8/3 15:36:25 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:36:25 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:37:03 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2527>
8/3 15:37:03 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:37:03 (pid:2120) Shadow pid 4092 for job 295.0 exited with status 100
8/3 15:37:03 (pid:2120) match (<192.168.0.1:1043>#1154510215#1) out of jobs (cluster id 292); relinquishing
8/3 15:37:03 (pid:2120) Sent RELEASE_CLAIM to startd on <192.168.0.1:1043>
8/3 15:37:03 (pid:2120) Match record (<192.168.0.1:1043>, 292, -1) deleted
8/3 15:37:03 (pid:2120) DaemonCore: Command received via TCP from host <192.168.0.1:2532>
8/3 15:37:03 (pid:2120) DaemonCore: received command 443 (VACATE_SERVICE), calling handler (vacate_service)
8/3 15:37:03 (pid:2120) Got VACATE_SERVICE from <192.168.0.1:2532>
8/3 15:37:25 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2547>
8/3 15:37:25 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:37:25 (pid:2120) Shadow pid 2960 for job 296.0 exited with status 100
8/3 15:37:25 (pid:2120) match (<192.168.0.1:1043>#1154510215#2) out of jobs (cluster id 293); relinquishing
8/3 15:37:25 (pid:2120) Sent RELEASE_CLAIM to startd on <192.168.0.1:1043>
8/3 15:37:25 (pid:2120) Match record (<192.168.0.1:1043>, 293, -1) deleted
8/3 15:37:25 (pid:2120) DaemonCore: Command received via TCP from host <192.168.0.1:2550>
8/3 15:37:25 (pid:2120) DaemonCore: received command 443 (VACATE_SERVICE), calling handler (vacate_service)
8/3 15:37:25 (pid:2120) Got VACATE_SERVICE from <192.168.0.1:2550>
8/3 15:37:27 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2563>
8/3 15:37:27 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:37:27 (pid:2120) Shadow pid 3512 for job 297.0 exited with status 100
8/3 15:37:27 (pid:2120) match (<192.168.0.1:1043>#1154510215#4) out of jobs (cluster id 294); relinquishing
8/3 15:37:27 (pid:2120) Sent RELEASE_CLAIM to startd on <192.168.0.1:1043>
8/3 15:37:27 (pid:2120) Match record (<192.168.0.1:1043>, 294, -1) deleted
8/3 15:37:27 (pid:2120) DaemonCore: Command received via TCP from host <192.168.0.1:2567>
8/3 15:37:27 (pid:2120) DaemonCore: received command 443 (VACATE_SERVICE), calling handler (vacate_service)
8/3 15:37:27 (pid:2120) Got VACATE_SERVICE from <192.168.0.1:2567>
8/3 15:41:25 (pid:2120) Sent ad to central manager for condor_inst@xxxxxxxxxxx
8/3 15:41:25 (pid:2120) Sent ad to 1 collectors for condor_inst@xxxxxxxxxxx
8/3 15:41:37 (pid:2120) DaemonCore: Command received via UDP from host <192.168.0.1:2613>
8/3 15:41:37 (pid:2120) DaemonCore: received command 60011 (DC_NOP), calling handler (handle_nop())
8/3 15:41:37 (pid:2120) Shadow pid 372 for job 290.0 exited with status 107
8/3 15:41:37 (pid:2120) Sent RELEASE_CLAIM to startd on <192.168.0.2:3817>
8/3 15:41:37 (pid:2120) Match record (<192.168.0.2:3817>, 290, 0) deleted

Attached shadow log:

8/3 13:41:29 Initializing a VANILLA shadow for job 290.0
8/3 13:41:30 (290.0) (372): Request to run on <192.168.0.2:3817> was ACCEPTED
8/3 15:34:57 ******************************************************
8/3 15:34:57 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:34:57 ** C:\condor\bin\condor_shadow.exe
8/3 15:34:57 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:34:57 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:34:57 ** PID = 3472
8/3 15:34:57 ** Log last touched 8/3 15:34:30
8/3 15:34:57 ******************************************************
8/3 15:34:57 Using config source: C:\condor\condor_config
8/3 15:34:57 Using local config sources:
8/3 15:34:57 C:\condor\condor_config.local
8/3 15:34:57 DaemonCore: Command Socket at <192.168.0.1:2332>
8/3 15:34:57 Initializing a VANILLA shadow for job 292.0
8/3 15:34:57 (292.0) (3472): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:35:18 ******************************************************
8/3 15:35:18 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:35:18 ** C:\condor\bin\condor_shadow.exe
8/3 15:35:18 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:35:18 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:35:18 ** PID = 3636
8/3 15:35:18 ** Log last touched 8/3 15:34:57
8/3 15:35:18 ******************************************************
8/3 15:35:18 Using config source: C:\condor\condor_config
8/3 15:35:18 Using local config sources:
8/3 15:35:18 C:\condor\condor_config.local
8/3 15:35:18 DaemonCore: Command Socket at <192.168.0.1:2370>
8/3 15:35:18 Initializing a VANILLA shadow for job 293.0
8/3 15:35:18 (293.0) (3636): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:35:20 ******************************************************
8/3 15:35:20 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:35:20 ** C:\condor\bin\condor_shadow.exe
8/3 15:35:20 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:35:20 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:35:20 ** PID = 3920
8/3 15:35:20 ** Log last touched 8/3 15:35:18
8/3 15:35:20 ******************************************************
8/3 15:35:20 Using config source: C:\condor\condor_config
8/3 15:35:20 Using local config sources:
8/3 15:35:20 C:\condor\condor_config.local
8/3 15:35:20 DaemonCore: Command Socket at <192.168.0.1:2386>
8/3 15:35:20 Initializing a VANILLA shadow for job 294.0
8/3 15:35:20 (294.0) (3920): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:35:58 (292.0) (3472): Job 292.0 terminated: exited with status 0
8/3 15:35:59 (292.0) (3472): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:36:01 ******************************************************
8/3 15:36:01 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:36:01 ** C:\condor\bin\condor_shadow.exe
8/3 15:36:01 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:36:01 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:36:01 ** PID = 4092
8/3 15:36:01 ** Log last touched 8/3 15:35:59
8/3 15:36:01 ******************************************************
8/3 15:36:01 Using config source: C:\condor\condor_config
8/3 15:36:01 Using local config sources:
8/3 15:36:01 C:\condor\condor_config.local
8/3 15:36:01 DaemonCore: Command Socket at <192.168.0.1:2430>
8/3 15:36:01 Initializing a VANILLA shadow for job 295.0
8/3 15:36:01 (295.0) (4092): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:36:20 (293.0) (3636): Job 293.0 terminated: exited with status 0
8/3 15:36:21 (293.0) (3636): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:36:22 (294.0) (3920): Job 294.0 terminated: exited with status 0
8/3 15:36:23 (294.0) (3920): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:36:23 ******************************************************
8/3 15:36:23 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:36:23 ** C:\condor\bin\condor_shadow.exe
8/3 15:36:23 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:36:23 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:36:23 ** PID = 2960
8/3 15:36:23 ** Log last touched 8/3 15:36:23
8/3 15:36:23 ******************************************************
8/3 15:36:23 Using config source: C:\condor\condor_config
8/3 15:36:23 Using local config sources:
8/3 15:36:23 C:\condor\condor_config.local
8/3 15:36:23 DaemonCore: Command Socket at <192.168.0.1:2474>
8/3 15:36:23 Initializing a VANILLA shadow for job 296.0
8/3 15:36:23 (296.0) (2960): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:36:25 ******************************************************
8/3 15:36:25 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:36:25 ** C:\condor\bin\condor_shadow.exe
8/3 15:36:25 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:36:25 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:36:25 ** PID = 3512
8/3 15:36:25 ** Log last touched 8/3 15:36:23
8/3 15:36:25 ******************************************************
8/3 15:36:25 Using config source: C:\condor\condor_config
8/3 15:36:25 Using local config sources:
8/3 15:36:25 C:\condor\condor_config.local
8/3 15:36:25 DaemonCore: Command Socket at <192.168.0.1:2483>
8/3 15:36:25 Initializing a VANILLA shadow for job 297.0
8/3 15:36:26 (297.0) (3512): Request to run on <192.168.0.1:1043> was ACCEPTED
8/3 15:37:03 (295.0) (4092): Job 295.0 terminated: exited with status 0
8/3 15:37:03 (295.0) (4092): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:37:25 (296.0) (2960): Job 296.0 terminated: exited with status 0
8/3 15:37:25 (296.0) (2960): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:37:27 (297.0) (3512): Job 297.0 terminated: exited with status 0
8/3 15:37:27 (297.0) (3512): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
8/3 15:41:36 (290.0) (372): condor_read(): recv() returned -1, errno = 10054, assuming failure.
8/3 15:41:36 (290.0) (372): Can no longer talk to condor_starter <192.168.0.2:3817>
8/3 15:41:37 (290.0) (372): Trying to reconnect to disconnected job
8/3 15:41:37 (290.0) (372): LastJobLeaseRenewal: 1154580099 Thu Aug 03 13:41:39 2006
8/3 15:41:37 (290.0) (372): JobLeaseDuration: 60 seconds
8/3 15:41:37 (290.0) (372): JobLeaseDuration remaining: EXPIRED!
8/3 15:41:37 (290.0) (372): Reconnect FAILED: Job disconnected too long: JobLeaseDuration (60 seconds) expired
8/3 15:41:37 (290.0) (372): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 107
8/3 15:50:18 ******************************************************
8/3 15:50:18 ** condor_shadow (CONDOR_SHADOW) STARTING UP
8/3 15:50:18 ** C:\condor\bin\condor_shadow.exe
8/3 15:50:18 ** $CondorVersion: 6.8.0 Jul 19 2006 $
8/3 15:50:18 ** $CondorPlatform: INTEL-WINNT50 $
8/3 15:50:18 ** PID = 2968
8/3 15:50:18 ** Log last touched 8/3 15:41:37
8/3 15:50:18 ******************************************************
8/3 15:50:18 Using config source: C:\condor\condor_config
8/3 15:50:18 Using local config sources:
8/3 15:50:18 C:\condor\condor_config.local
8/3 15:50:18 DaemonCore: Command Socket at <192.168.0.1:2704>
8/3 15:50:18 Initializing a VANILLA shadow for job 290.0
8/3 15:50:18 (290.0) (2968): Request to run on <192.168.0.3:1044> was ACCEPTED
8/3 15:52:37 (290.0) (2968): Job 290.0 terminated: exited with status 0
8/3 15:52:37 (290.0) (2968): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100
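
The shadow log entries for job 290.0 (pid 372) give enough information to work out how long the lease had been expired before the shadow acted on it. The sketch below uses only the human-readable timestamps printed in the log, so no time zone has to be assumed; the variable names are illustrative.

```python
from datetime import datetime, timedelta

FMT = "%Y-%m-%d %H:%M:%S"

# Values read from the shadow log lines for job 290.0 (pid 372).
last_renewal = datetime.strptime("2006-08-03 13:41:39", FMT)  # LastJobLeaseRenewal
lease = timedelta(seconds=60)                                 # JobLeaseDuration
noticed = datetime.strptime("2006-08-03 15:41:36", FMT)       # recv() failure logged

# When the lease ran out, and how long after that the shadow noticed.
lease_expired_at = last_renewal + lease
overdue = noticed - lease_expired_at

print(lease_expired_at.time())  # 13:42:39
print(overdue)                  # 1:58:57
```

The lease ran out at 13:42:39, but the shadow only evaluated it at 15:41:36, once recv() failed on the socket. This suggests, without proving it, that the delay lies in detecting the dead connection rather than in the lease bookkeeping itself.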