[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Evictions



Yes, that is what I thought too.
 
I went ahead and rebuilt the grid over the weekend with 6.7.16 and everything is working now.  Not sure what might have gotten out of whack, but it is all well now.
 
Thanks
----- Original Message -----
From: Jaime Frey
Sent: Monday, February 06, 2006 10:43 AM
Subject: Re: [Condor-users] Evictions

On Feb 2, 2006, at 3:26 PM, Stephen Broughton wrote:

Here is the entry from the shadowlog file that corresponds to the hello_world.log that I sent in the last e-mail.  I have also included a zip file of the entire log.  this comes neer the end of the file.
 
2/1 18:40:04 (?.?) (3477):******* Standard Shadow starting up *******
2/1 18:40:04 (?.?) (3477):** $CondorVersion: 6.7.14 Dec 13 2005 $
2/1 18:40:04 (?.?) (3477):** $CondorPlatform: I386-LINUX_RH9 $
2/1 18:40:04 (?.?) (3477):*******************************************
2/1 18:40:04 (?.?) (3477):uid=0, euid=501, gid=0, egid=501
2/1 18:40:04 (?.?) (3477):Hostname = "<192.168.0.2:32773>", Job = 22.0
2/1 18:40:04 (22.0) (3477):Requesting Primary Starter
2/1 18:40:04 (22.0) (3477):Shadow: Request to run a job was ACCEPTED
2/1 18:40:04 (22.0) (3477):Shadow: RSC_SOCK connected, fd = 17
2/1 18:40:04 (22.0) (3477):Shadow: CLIENT_LOG connected, fd = 18
2/1 18:40:04 (22.0) (3477):My_Filesystem_Domain = "condor.local"
2/1 18:40:04 (22.0) (3477):My_UID_Domain = "condor01.condor.local"
2/1 18:40:04 (22.0) (3477): Entering pseudo_get_file_stream
2/1 18:40:04 (22.0) (3477): file = "/home/condor/local_scratch/spool/cluster22.ickpt.subproc0"
2/1 18:40:16 (22.0) (3477):Reaped child status - pid 3478 exited with status 0
2/1 18:40:17 (22.0) (3477):Shadow: Job 22.0 exited, termsig = 9, coredump = 0, retcode = 110
2/1 18:40:17 (22.0) (3477):Shadow: Job was kicked off without a checkpoint
2/1 18:40:17 (22.0) (3477):Shadow: DoCleanup: unlinking TmpCkpt '/home/condor/local_scratch/spool/cluster22.proc0.subproc0.tmp'
2/1 18:40:17 (22.0) (3477):Trying to unlink /home/condor/local_scratch/spool/cluster22.proc0.subproc0.tmp
2/1 18:40:17 (22.0) (3477):user_time = 2 ticks
2/1 18:40:17 (22.0) (3477):sys_time = 16 ticks
2/1 18:40:17 (22.0) (3477):********** Shadow Exiting(107) **********

Hmm. Not much there. How about the schedd log, and the startd and starter logs from the execute machine?

+--------------------------------+-----------------------------------+
|           Jaime Frey           | I used to be a heavy gambler.     |
|       jfrey@xxxxxxxxxxx        | But now I just make mental bets.  |
| http://www.cs.wisc.edu/~jfrey/ | That's how I lost my mind.        |
+--------------------------------+-----------------------------------+



_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/condor-users