
Re: [Condor-users] confusion around new spool in 7.5.5




On Feb 18, 2011, at 11:26 , Peter Doherty wrote:

I upgraded to v7.5.5 and there's one thing I'm scratching my head over.

I used to have a SPOOL directory filled with directories with names like:
cluster15093481.proc0.subproc0.tmp/

According to the changelog I should now have dirs in the format of:
$(SPOOL)/<#>/<#>/cluster<#>.proc<#>.subproc<#>


But the thing is, I don't have anything.
my SPOOL just has:
job_queue.log
local_univ_execute
spool_version

I've got a few thousand jobs in the queue right now.
Where are the spool files? I'm sure I'm looking in the correct directory; I've tried to find them, but I can't. I do see a lot of lock files in $(TMP_DIR).

I believe the constant I/O on all the spool files was one of the bottlenecks of our schedd, so if that has really been improved, I'm eager to see the effect. But from reading the changelog, the only difference should have been subdirectories in the spool to keep from hitting ext3 limits.


Hmm, okay. Jobs seem to be running okay, but I see a lot of these errors in the Shadow Log:

02/18/11 12:09:25 (pid:649) (15101845.0) (649): Directory::setOwnerPriv() -- failed to find owner of /raid0/gwms_schedd/spool/1845/0/cluster15101845.proc0.subproc0.tmp
02/18/11 12:09:25 (pid:649) (15101845.0) (649): Directory::Rewind(): failed to find owner of "/raid0/gwms_schedd/spool/1845/0/cluster15101845.proc0.subproc0.tmp"

I guess that's part of the problem. I checked the permissions on the spool directory, then set it to 777 and verified that regular users can write to it, but that didn't stop the errors or cause any files to be created there.
So I'm not really clear on what's going on.
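For what it's worth, here is a minimal sketch of how the new hashed layout seems to map a job to its spool directory. The mod-10000 bucketing is an assumption inferred from the ShadowLog path above (cluster 15101845 lands in subdirectory 1845, proc 0 in subdirectory 0); check the HTCondor source or manual for the actual hashing rule.

```python
import os

def spool_path(spool_dir, cluster, proc, subproc=0, tmp=False):
    """Build the per-job spool path for the 7.5.5+ hashed layout.

    Assumes two hash levels of (id mod 10000), inferred from the
    example path in the ShadowLog -- not an authoritative rule.
    """
    name = "cluster%d.proc%d.subproc%d" % (cluster, proc, subproc)
    if tmp:
        name += ".tmp"
    return os.path.join(spool_dir,
                        str(cluster % 10000),  # first hash level
                        str(proc % 10000),     # second hash level
                        name)

print(spool_path("/raid0/gwms_schedd/spool", 15101845, 0, tmp=True))
# -> /raid0/gwms_schedd/spool/1845/0/cluster15101845.proc0.subproc0.tmp
```

If that mapping is right, the directory the shadow is complaining about is exactly where the job's spool files should live, so the "failed to find owner" errors would mean the per-job directory was never created rather than that it's misplaced.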

--Peter
