[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] memory usage report in userlog



when i look through the userlog files from a job run i see (running
condor 8.6.1)

006 (71791.8240.00 10/27 04:42:40 Image size of job updated: 75000
0 - Memoryusage of job (MB)
0 - ResidentSetSize of job (KB)

005 (71791.8240.00 10/27 04:42:40 Job terminated.
(1) normal termination (return value 0)
usr 0 00:00:00, sys 0 00:00:00 - run remote usage
usr 0 00:00:00, sys 0 00:00:00 - run local usage
usr 0 00:00:00, sys 0 00:00:00 - total remote usage
usr 0 00:00:00, sys 0 00:00:00 - total local usage
0 - run bytes sent by job
0 - run bytes received by job
0 - total bytes sent by job
0 - total bytes received by job
partitionable resources : usage request allocated
cpus :    1 1
disk (kb) : 75000 75000 79310
memory (mb) : 0 8192 8192

the job ran fine, but shouldn't these entries in the user log have
number filled in.  especially for the memory parts?  if so, what's
tracking/updating that?

in my starterlogs i see this, but i'm not sure if it's related in anyway

ProcFamilydirect: no family for pid 2777469
error getting family usage in vanillaProc::PublishUpdateAd() for pid 2777469

which repeats every five minutes until the job completes, and then
this is the final error in the starterlog

Process exited, pid=2777469, status=0
ProcFamilyDirect: no family for pid 2777469
error getting family usage for pid 2777469 in VanillaProc::JobReaper()
Returning from CStarter::JobReaper()
ProcFamilydirect: no family registered for pid 2777469
error unregistering pid 2777469 with the procd

i suspect the error and the lack of statistics are related, but i'm
not sure exactly what isn't working or which specific knob to turn in
order to amp up the debug info.