[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] 7.4.2 / 7.4.4: condor_q trouble when pool PCs suddenly are powered off !?!
- Date: Mon, 22 Nov 2010 09:52:33 -0600
- From: Dan Bradley <dan@xxxxxxxxxxxx>
- Subject: Re: [Condor-users] 7.4.2 / 7.4.4: condor_q trouble when pool PCs suddenly are powered off !?!
During the time when condor_q fails, what do you see in SchedLog?
On 11/22/10 7:19 AM, Rob wrote:
I have a linux (Fedora 12) condor master with condor version 7.4.2.
The Windows XP pool PCs are all running condor version 7.4.4.
The condor master is having trouble to produce the condor_q output at times when
the pool PCs are switched off:
11/22 22:04:05 condor_read(): timeout reading 5 bytes from schedd at
11/22 22:04:05 IO: Failed to read packet header
11/22 22:04:05 SECMAN: reconnected to schedd at<220.127.116.11:60614> from port
52251 to send unauthenticated command 1111 QMGMT_CMD
11/22 22:04:26 condor_read(): timeout reading 5 bytes from schedd at
11/22 22:04:26 IO: Failed to read packet header
11/22 22:04:46 condor_read(): timeout reading 5 bytes from schedd at
11/22 22:04:46 IO: Failed to read packet header
-- Failed to fetch ads from:<18.104.22.168:60614> : condor.dns.org
The pool PC is a set of over 300 public library Windows XP PCs, which are all
centrally powered off at the same time in the evening. For a while the condor
master keeps hanging on to the PCs' status before the poweroff (understandably,
as it has no clue what has happened to the "disappeared" PCs). After a while the
condor master then abandons whatever was going on on the PCs. During the
transition time, the condor_q command seems to have trouble producing useful
Is this a "feature" or a bug?
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
You can also unsubscribe by visiting
The archives can be found at: