[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] condor_quill keep dying



Hi,

condor_quill keep dying   with this message approximately for every 19 mins.

 

"/usr/local/condor-7.0.5/sbin/condor_quill" on "my.condor.host" died due to signal 25 (File size limit exceeded).

Condor will automatically restart this process in 10 seconds.

 

*** Last 20 line(s) of file /u/condor/log/QuillLog:

2/24 09:01:52 configuring tt options from config file

2/24 09:01:52 Using Polling Period = 10

2/24 09:01:52 Using logs 2/24 09:01:52 /u/condor/log/schedd_sql.log 2/24 09:01:52 /u/condor/log/sql.log 2/24 09:01:52

2/24 09:01:52 Using Job Queue File /u/condor/spool/job_queue.log

2/24 09:01:52 Using Database Type = Postgres

2/24 09:01:52 Using Database IpAddress = my.condor.host:5432

2/24 09:01:52 Using Database Name = DBNAME

2/24 09:01:52 Using Database User = DBUSER

2/24 09:01:52 ******** Start of Polling Job Queue Log ********

2/24 09:01:52 === Current Probing Information ===

2/24 09:01:52 fsize: 108454369            mtime: 1267020103

2/24 09:01:52 first log entry: 1035 CreationTimestamp 1174575013

2/24 09:01:52 JOB QUEUE POLLING RESULT: COMPRESSED

2/24 09:20:59 ********* End of Polling Job Queue Log *********

2/24 09:20:59 ******** Start of Polling Event Log ********

2/24 09:20:59 >>>>>>>> Fail: Polling Event Log <<<<<<<<

2/24 09:20:59 ******** Start of Polling XML Log ********

2/24 09:20:59 ********* End of Polling XML Log *********

2/24 09:20:59 ++++++++ Sending Quill ad to collector ++++++++

2/24 09:20:59 ++++++++ Sent Quill ad to collector ++++++++

*** End of file QuillLog

 

I have QUILL_DBSIZE_LIMIT in condor.config is 20 GB,

#The DB Size limit is in gigabytes

QUILL_DBSIZE_LIMIT      = 20

I checked the size of DB, Total Database itself less than 10 GB.

fsize in each message keep increasing for the decreasing number of jobs in the queue.

Could you please let me know what might be the problem, any pointer is appreciated.

Thanks,

Senthil