[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Jobs Still not returning any output



I have completely started fresh. reinstalled and started with no log files whatsoever.

The job file (hello.sub) contains.

executable = helloworld
universe = vanilla
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
requirements = (Arch == "X86_64") && (OpSys == "LINUX")
output  = output_$(Process).out
error   = error_$(Process).out
log     = log.out
Queue 5



condor_status reports


Name OpSys Arch State Activity LoadAv Mem ActvtyTime

vm1@thebeast. LINUX       INTEL  Unclaimed  Idle       1.000   633
0+00:00:04
vm2@thebeast. LINUX       INTEL  Unclaimed  Idle       1.000   633
0+00:00:05
vm3@thebeast. LINUX       INTEL  Unclaimed  Idle       1.000   633
0+00:00:06
vm4@thebeast. LINUX       INTEL  Unclaimed  Idle       0.160   633
0+00:00:07
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.160  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.450  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.500  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.410  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.490  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.490  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.460  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.580  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.650  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.570  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.650  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.160  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.690  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.490  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.370  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.240  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.410  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.310  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.370  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.390  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.370  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]
vm1@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.510  2048[?????]
vm2@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000  2048[?????]

                    Machines Owner Claimed Unclaimed Matched Preempting

        INTEL/LINUX        4     0       0         4       0          0
       X86_64/LINUX       44     0       0        44       0          0

              Total       48     0       0        48       0          0



Here is a listing of the directory after running the job.


condor@thebeast:~/jobs/helloworld> ls -l total 24 -rw-r--r-- 1 condor users 0 2005-10-12 01:38 error_0.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 error_1.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 error_2.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 error_3.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 error_4.out -rw-r--r-- 1 condor users 257 2005-10-12 01:37 hello.sub -rwxr-xr-x 1 condor users 10457 2005-10-11 17:04 helloworld -rw-r--r-- 1 condor users 3060 2005-10-12 01:38 log.out -rw-r--r-- 1 condor users 137 2005-10-11 17:03 Main.cpp -rw-r--r-- 1 condor users 0 2005-10-12 01:38 output_0.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 output_1.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 output_2.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 output_3.out -rw-r--r-- 1 condor users 0 2005-10-12 01:38 output_4.out

The log.out file contains

000 (001.000.000) 10/12 01:38:25 Job submitted from host:
<192.168.1.1:45580>
...
000 (001.001.000) 10/12 01:38:25 Job submitted from host:
<192.168.1.1:45580>
...
000 (001.002.000) 10/12 01:38:25 Job submitted from host:
<192.168.1.1:45580>
...
000 (001.003.000) 10/12 01:38:25 Job submitted from host:
<192.168.1.1:45580>
...
000 (001.004.000) 10/12 01:38:25 Job submitted from host:
<192.168.1.1:45580>
...
001 (001.000.000) 10/12 01:38:28 Job executing on host:
<192.168.1.101:35193>
...
005 (001.000.000) 10/12 01:38:28 Job terminated.
       (1) Normal termination (return value 0)
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
       0  -  Run Bytes Sent By Job
       10457  -  Run Bytes Received By Job
       0  -  Total Bytes Sent By Job
       10457  -  Total Bytes Received By Job
...
001 (001.001.000) 10/12 01:38:30 Job executing on host:
<192.168.1.101:35193>
...
005 (001.001.000) 10/12 01:38:30 Job terminated.
       (1) Normal termination (return value 0)
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
       0  -  Run Bytes Sent By Job
       10457  -  Run Bytes Received By Job
       0  -  Total Bytes Sent By Job
       10457  -  Total Bytes Received By Job
...
001 (001.002.000) 10/12 01:38:33 Job executing on host:
<192.168.1.102:34956>
...
005 (001.002.000) 10/12 01:38:33 Job terminated.
       (1) Normal termination (return value 0)
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
       0  -  Run Bytes Sent By Job
       10457  -  Run Bytes Received By Job
       0  -  Total Bytes Sent By Job
       10457  -  Total Bytes Received By Job
...
001 (001.003.000) 10/12 01:38:34 Job executing on host:
<192.168.1.102:34956>
...
005 (001.003.000) 10/12 01:38:34 Job terminated.
       (1) Normal termination (return value 0)
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
       0  -  Run Bytes Sent By Job
       10457  -  Run Bytes Received By Job
       0  -  Total Bytes Sent By Job
       10457  -  Total Bytes Received By Job
...
001 (001.004.000) 10/12 01:38:37 Job executing on host:
<192.168.1.103:34098>
...
005 (001.004.000) 10/12 01:38:37 Job terminated.
       (1) Normal termination (return value 0)
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
               Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
       0  -  Run Bytes Sent By Job
       10457  -  Run Bytes Received By Job
       0  -  Total Bytes Sent By Job
       10457  -  Total Bytes Received By Job



And here is a listing of the Logs before any execution has been on them..
completely fresh and then after I try to submit my job.





####Before Job Submission####.

MasterLog

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_master (CONDOR_MASTER) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_master
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12795
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 Attempting to lock /var/lock/condor/InstanceLock.
10/12 01:31:13 Obtained lock on /var/lock/condor/InstanceLock.
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45578>
10/12 01:31:13 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:31:13 MASTER_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Will use UDP to update collector thebeast.cluster.int
<192.168.1.1:9618>
10/12 01:31:13 Starting Collector on port 9618
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_collector", pid and pgroup = 12797
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_startd", pid and pgroup = 12798
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_schedd", pid and pgroup = 12799
10/12 01:31:13 MASTER_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_negotiator", pid and pgroup = 12801
10/12 01:31:13 Getting monitoring info for pid 12795
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:18 enter Daemons::UpdateCollector
10/12 01:31:18 Trying to update collector <192.168.1.1:9618>
10/12 01:31:18 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:31:18 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:18 exit Daemons::UpdateCollector
10/12 01:31:18 enter Daemons::CheckForNewExecutable
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_master: 1123070468
10/12 01:31:18 GetTimeStamp returned: 1123070468
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_collector: 1123070466
10/12 01:31:18 GetTimeStamp returned: 1123070466
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_startd: 1123070446
10/12 01:31:18 GetTimeStamp returned: 1123070446
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_schedd: 1123070449
10/12 01:31:18 GetTimeStamp returned: 1123070449
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_negotiator: 1123070462
10/12 01:31:18 GetTimeStamp returned: 1123070462
10/12 01:31:18 exit Daemons::CheckForNewExecutable
10/12 01:31:19 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:19 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12801















StartLog

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_startd (CONDOR_STARTD) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_startd
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12798
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45579>
10/12 01:31:14 vm1: New machine resource allocated
10/12 01:31:14 vm2: New machine resource allocated
10/12 01:31:14 vm3: New machine resource allocated
10/12 01:31:14 vm4: New machine resource allocated
10/12 01:31:14 About to run initial benchmarks.
10/12 01:31:19 Completed initial benchmarks.
10/12 01:31:19 vm1: State change: IS_OWNER is false
10/12 01:31:19 vm1: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm2: State change: IS_OWNER is false
10/12 01:31:19 vm2: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm3: State change: IS_OWNER is false
10/12 01:31:19 vm3: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm4: State change: IS_OWNER is false
10/12 01:31:19 vm4: Changing state: Owner -> Unclaimed














ScheddLog

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_schedd (CONDOR_SCHEDD) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_schedd
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12799
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45580>
10/12 01:31:13 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:31:13 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Will use UDP to update collector thebeast.cluster.int
<192.168.1.1:9618>
10/12 01:31:13 Using name: thebeast.cluster.int
10/12 01:31:13 No Accountant host specified in config file
10/12 01:31:13 SCHEDD_MIN_INTERVAL is undefined, using default value of 5
10/12 01:31:13 JOB_START_COUNT is undefined, using default value of 1
10/12 01:31:13 MAX_JOBS_SUBMITTED is undefined, using default value of
2147483647
10/12 01:31:13 STARTD_CONTACT_TIMEOUT is undefined, using default value of
45
10/12 01:31:13 initLocalStarterDir():
/home/condor/hosts/thebeast/spool/local_univ_execute does not exist, calling
mkdir()
10/12 01:31:13 initLocalStarterDir(): Changing permission on
/home/condor/hosts/thebeast/spool/local_univ_execute
10/12 01:31:13 JOB_IS_FINISHED_INTERVAL is undefined, using default value of
0
10/12 01:31:13 Period for SelfDrainingQueue job_is_finished_queue set to 0
10/12 01:31:13 Queue Management Super Users:
10/12 01:31:13  root
10/12 01:31:13  condor
10/12 01:31:13 CronMgr: Constructing 'schedd'
10/12 01:31:13 CronMgr: Setting name to 'schedd'
10/12 01:31:13 CronMgr: Setting parameter base to 'schedd'
10/12 01:31:13 CronMgr: Doing config (initial)
10/12 01:31:13 About to truncate log
/home/condor/hosts/thebeast/spool/job_queue.log
10/12 01:31:13 JobsRunning = 0
10/12 01:31:13 JobsIdle = 0
10/12 01:31:13 JobsHeld = 0
10/12 01:31:13 JobsRemoved = 0
10/12 01:31:13 LocalUniverseJobsRunning = 0
10/12 01:31:13 LocalUniverseJobsIdle = 0
10/12 01:31:13 SchedUniverseJobsRunning = 0
10/12 01:31:13 SchedUniverseJobsIdle = 0
10/12 01:31:13 N_Owners = 0
10/12 01:31:13 MaxJobsRunning = 200
10/12 01:31:13 ENABLE_SOAP is undefined, using default value of False
10/12 01:31:13 Trying to update collector <192.168.1.1:9618>
10/12 01:31:13 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:31:13 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:13 Sent HEART BEAT ad to 1 collectors. Number of submittors=0
10/12 01:31:13 ============ Begin clean_shadow_recs =============
10/12 01:31:13 ============ End clean_shadow_recs =============
10/12 01:31:13 Getting monitoring info for pid 12799
10/12 01:31:13 DaemonCore: No more children processes to reap.
10/12 01:31:14 DaemonCore: in SendAliveToParent()
10/12 01:31:14 DaemonCore: attempting to connect to '<192.168.1.1:45578>'
10/12 01:31:14 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:14 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:23 -------- Begin starting jobs --------
10/12 01:31:23 -------- Done starting jobs --------















####After Job Submission####

MasterLog

10/12 01:31:13 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_master (CONDOR_MASTER) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_master
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12795
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 Attempting to lock /var/lock/condor/InstanceLock.
10/12 01:31:13 Obtained lock on /var/lock/condor/InstanceLock.
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45578>
10/12 01:31:13 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:31:13 MASTER_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Will use UDP to update collector thebeast.cluster.int
<192.168.1.1:9618>
10/12 01:31:13 Starting Collector on port 9618
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_collector", pid and pgroup = 12797
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_startd", pid and pgroup = 12798
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_schedd", pid and pgroup = 12799
10/12 01:31:13 MASTER_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Started DaemonCore process
"/home/condor/release/sbin/condor_negotiator", pid and pgroup = 12801
10/12 01:31:13 Getting monitoring info for pid 12795
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:14 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:14 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:18 enter Daemons::UpdateCollector
10/12 01:31:18 Trying to update collector <192.168.1.1:9618>
10/12 01:31:18 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:31:18 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:18 exit Daemons::UpdateCollector
10/12 01:31:18 enter Daemons::CheckForNewExecutable
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_master: 1123070468
10/12 01:31:18 GetTimeStamp returned: 1123070468
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_collector: 1123070466
10/12 01:31:18 GetTimeStamp returned: 1123070466
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_startd: 1123070446
10/12 01:31:18 GetTimeStamp returned: 1123070446
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_schedd: 1123070449
10/12 01:31:18 GetTimeStamp returned: 1123070449
10/12 01:31:18 Time stamp of running
/home/condor/release/sbin/condor_negotiator: 1123070462
10/12 01:31:18 GetTimeStamp returned: 1123070462
10/12 01:31:18 exit Daemons::CheckForNewExecutable
10/12 01:31:19 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:31:19 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:31:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:32:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:33:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:33:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:33:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:33:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:34:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:34:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:34:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:34:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:35:13 Getting monitoring info for pid 12795
10/12 01:35:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:35:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:35:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:35:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:36:18 enter Daemons::UpdateCollector
10/12 01:36:18 Trying to update collector <192.168.1.1:9618>
10/12 01:36:18 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:36:18 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:36:18 exit Daemons::UpdateCollector
10/12 01:36:18 enter Daemons::CheckForNewExecutable
10/12 01:36:18 Time stamp of running
/home/condor/release/sbin/condor_master: 1123070468
10/12 01:36:18 GetTimeStamp returned: 1123070468
10/12 01:36:18 Time stamp of running
/home/condor/release/sbin/condor_collector: 1123070466
10/12 01:36:18 GetTimeStamp returned: 1123070466
10/12 01:36:18 Time stamp of running
/home/condor/release/sbin/condor_startd: 1123070446
10/12 01:36:18 GetTimeStamp returned: 1123070446
10/12 01:36:18 Time stamp of running
/home/condor/release/sbin/condor_schedd: 1123070449
10/12 01:36:18 GetTimeStamp returned: 1123070449
10/12 01:36:18 Time stamp of running
/home/condor/release/sbin/condor_negotiator: 1123070462
10/12 01:36:18 GetTimeStamp returned: 1123070462
10/12 01:36:18 exit Daemons::CheckForNewExecutable
10/12 01:36:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:36:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:36:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:36:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:37:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:37:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:37:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:37:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:38:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:38:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:38:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:38:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:39:13 Getting monitoring info for pid 12795
10/12 01:39:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:39:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:39:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:39:28 ProcAPI::getProcInfo() pid 12878 does not exist.
10/12 01:39:28 ProcAPI::getProcInfo() pid 12878 does not exist.
10/12 01:39:28 ProcAPI::getProcInfo() pid 12878 does not exist.
10/12 01:39:28 ProcAPI::getProcInfo() pid 12878 does not exist.
10/12 01:39:28 ProcAPI::getProcInfo() pid 12878 does not exist.
10/12 01:39:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:40:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:40:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:40:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:40:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:41:18 enter Daemons::UpdateCollector
10/12 01:41:18 Trying to update collector <192.168.1.1:9618>
10/12 01:41:18 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:41:18 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:41:18 exit Daemons::UpdateCollector
10/12 01:41:18 enter Daemons::CheckForNewExecutable
10/12 01:41:18 Time stamp of running
/home/condor/release/sbin/condor_master: 1123070468
10/12 01:41:18 GetTimeStamp returned: 1123070468
10/12 01:41:18 Time stamp of running
/home/condor/release/sbin/condor_collector: 1123070466
10/12 01:41:18 GetTimeStamp returned: 1123070466
10/12 01:41:18 Time stamp of running
/home/condor/release/sbin/condor_startd: 1123070446
10/12 01:41:18 GetTimeStamp returned: 1123070446
10/12 01:41:18 Time stamp of running
/home/condor/release/sbin/condor_schedd: 1123070449
10/12 01:41:18 GetTimeStamp returned: 1123070449
10/12 01:41:18 Time stamp of running
/home/condor/release/sbin/condor_negotiator: 1123070462
10/12 01:41:18 GetTimeStamp returned: 1123070462
10/12 01:41:18 exit Daemons::CheckForNewExecutable
10/12 01:41:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:41:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:41:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:41:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:42:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:42:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:42:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:42:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:43:13 Getting monitoring info for pid 12795
10/12 01:43:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:43:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:43:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:43:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:44:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:44:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:44:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:44:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:45:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:45:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:45:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:45:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:46:18 enter Daemons::UpdateCollector
10/12 01:46:18 Trying to update collector <192.168.1.1:9618>
10/12 01:46:18 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:46:18 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:46:18 exit Daemons::UpdateCollector
10/12 01:46:18 enter Daemons::CheckForNewExecutable
10/12 01:46:18 Time stamp of running
/home/condor/release/sbin/condor_master: 1123070468
10/12 01:46:18 GetTimeStamp returned: 1123070468
10/12 01:46:18 Time stamp of running
/home/condor/release/sbin/condor_collector: 1123070466
10/12 01:46:18 GetTimeStamp returned: 1123070466
10/12 01:46:18 Time stamp of running
/home/condor/release/sbin/condor_startd: 1123070446
10/12 01:46:18 GetTimeStamp returned: 1123070446
10/12 01:46:18 Time stamp of running
/home/condor/release/sbin/condor_schedd: 1123070449
10/12 01:46:18 GetTimeStamp returned: 1123070449
10/12 01:46:18 Time stamp of running
/home/condor/release/sbin/condor_negotiator: 1123070462
10/12 01:46:18 GetTimeStamp returned: 1123070462
10/12 01:46:18 exit Daemons::CheckForNewExecutable
10/12 01:46:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:46:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:46:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:46:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:47:13 Getting monitoring info for pid 12795
10/12 01:47:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:47:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:47:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:47:28 ProcAPI::buildFamily() Found daddypid on the system: 12801
10/12 01:48:28 ProcAPI::buildFamily() Found daddypid on the system: 12797
10/12 01:48:28 ProcAPI::buildFamily() Found daddypid on the system: 12798
10/12 01:48:28 ProcAPI::buildFamily() Found daddypid on the system: 12799
10/12 01:48:28 ProcAPI::buildFamily() Found daddypid on the system: 12801











StartLog

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_startd (CONDOR_STARTD) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_startd
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12798
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45579>
10/12 01:31:14 vm1: New machine resource allocated
10/12 01:31:14 vm2: New machine resource allocated
10/12 01:31:14 vm3: New machine resource allocated
10/12 01:31:14 vm4: New machine resource allocated
10/12 01:31:14 About to run initial benchmarks.
10/12 01:31:19 Completed initial benchmarks.
10/12 01:31:19 vm1: State change: IS_OWNER is false
10/12 01:31:19 vm1: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm2: State change: IS_OWNER is false
10/12 01:31:19 vm2: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm3: State change: IS_OWNER is false
10/12 01:31:19 vm3: Changing state: Owner -> Unclaimed
10/12 01:31:19 vm4: State change: IS_OWNER is false
10/12 01:31:19 vm4: Changing state: Owner -> Unclaimed













ScheddLog

10/12 01:31:13 ******************************************************
10/12 01:31:13 ** condor_schedd (CONDOR_SCHEDD) STARTING UP
10/12 01:31:13 ** /home/condor/release/sbin/condor_schedd
10/12 01:31:13 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:31:13 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:31:13 ** PID = 12799
10/12 01:31:13 ******************************************************
10/12 01:31:13 Using config file: /home/condor/etc/condor_config
10/12 01:31:13 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:31:13 DaemonCore: Command Socket at <192.168.1.1:45580>
10/12 01:31:13 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:31:13 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:13 Will use UDP to update collector thebeast.cluster.int
<192.168.1.1:9618>
10/12 01:31:13 Using name: thebeast.cluster.int
10/12 01:31:13 No Accountant host specified in config file
10/12 01:31:13 SCHEDD_MIN_INTERVAL is undefined, using default value of 5
10/12 01:31:13 JOB_START_COUNT is undefined, using default value of 1
10/12 01:31:13 MAX_JOBS_SUBMITTED is undefined, using default value of
2147483647
10/12 01:31:13 STARTD_CONTACT_TIMEOUT is undefined, using default value of
45
10/12 01:31:13 initLocalStarterDir():
/home/condor/hosts/thebeast/spool/local_univ_execute does not exist, calling
mkdir()
10/12 01:31:13 initLocalStarterDir(): Changing permission on
/home/condor/hosts/thebeast/spool/local_univ_execute
10/12 01:31:13 JOB_IS_FINISHED_INTERVAL is undefined, using default value of
0
10/12 01:31:13 Period for SelfDrainingQueue job_is_finished_queue set to 0
10/12 01:31:13 Queue Management Super Users:
10/12 01:31:13  root
10/12 01:31:13  condor
10/12 01:31:13 CronMgr: Constructing 'schedd'
10/12 01:31:13 CronMgr: Setting name to 'schedd'
10/12 01:31:13 CronMgr: Setting parameter base to 'schedd'
10/12 01:31:13 CronMgr: Doing config (initial)
10/12 01:31:13 About to truncate log
/home/condor/hosts/thebeast/spool/job_queue.log
10/12 01:31:13 JobsRunning = 0
10/12 01:31:13 JobsIdle = 0
10/12 01:31:13 JobsHeld = 0
10/12 01:31:13 JobsRemoved = 0
10/12 01:31:13 LocalUniverseJobsRunning = 0
10/12 01:31:13 LocalUniverseJobsIdle = 0
10/12 01:31:13 SchedUniverseJobsRunning = 0
10/12 01:31:13 SchedUniverseJobsIdle = 0
10/12 01:31:13 N_Owners = 0
10/12 01:31:13 N_Owners = 0
10/12 01:31:13 MaxJobsRunning = 200
10/12 01:31:13 ENABLE_SOAP is undefined, using default value of False
10/12 01:31:13 Trying to update collector <192.168.1.1:9618>
10/12 01:31:13 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:31:13 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:13 Sent HEART BEAT ad to 1 collectors. Number of submittors=0
10/12 01:31:13 ============ Begin clean_shadow_recs =============
10/12 01:31:13 ============ End clean_shadow_recs =============
10/12 01:31:13 Getting monitoring info for pid 12799
10/12 01:31:13 DaemonCore: No more children processes to reap.
10/12 01:31:14 DaemonCore: in SendAliveToParent()
10/12 01:31:14 DaemonCore: attempting to connect to '<192.168.1.1:45578>'
10/12 01:31:14 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:31:14 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:31:23 -------- Begin starting jobs --------
10/12 01:31:23 -------- Done starting jobs --------
10/12 01:35:13 Getting monitoring info for pid 12799
10/12 01:36:13 JobsRunning = 0
10/12 01:36:13 JobsIdle = 0
10/12 01:36:13 JobsHeld = 0
10/12 01:36:13 JobsRemoved = 0
10/12 01:36:13 LocalUniverseJobsRunning = 0
10/12 01:36:13 LocalUniverseJobsIdle = 0
10/12 01:36:13 SchedUniverseJobsRunning = 0
10/12 01:36:13 SchedUniverseJobsIdle = 0
10/12 01:36:13 N_Owners = 0
10/12 01:36:13 MaxJobsRunning = 200
10/12 01:36:13 ENABLE_SOAP is undefined, using default value of False
10/12 01:36:13 Trying to update collector <192.168.1.1:9618>
10/12 01:36:13 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:36:13 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:36:13 Sent HEART BEAT ad to 1 collectors. Number of submittors=0
10/12 01:36:13 ============ Begin clean_shadow_recs =============
10/12 01:36:13 ============ End clean_shadow_recs =============
10/12 01:36:23 -------- Begin starting jobs --------
10/12 01:36:23 -------- Done starting jobs --------
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45616>
10/12 01:38:25 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:25 AUTHENTICATE_FS: used file /tmp/qmgr_RmQ0za, status: 1
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 get_file(): going to write to filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:25 get_file(): going to write to filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:25 get_file: Receiving 10457 bytes
10/12 01:38:25 get_file: wrote 10457 bytes to file
10/12 01:38:25 done with transfer, errno = 0
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 OwnerCheck retval 1 (success), super_user
10/12 01:38:25 condor_read(): Socket closed when trying to read buffer
10/12 01:38:25 QMGR Connection closed
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45617>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/output_0.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45618>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/error_0.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45619>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/output_1.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45620>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/error_1.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45621>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/output_2.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45622>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/error_2.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45623>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/output_3.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45624>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/error_3.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45625>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/output_4.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 DaemonCore: Command received via TCP from host
<192.168.1.1:45626>
10/12 01:38:25 DaemonCore: received command 464 (ATTEMPT_ACCESS), calling
handler (attempt_access_handler)
10/12 01:38:25 ATTEMPT_ACCESS: Switching to user uid: 6988 gid: 100.
10/12 01:38:25 Checking file /home/condor/jobs/helloworld/error_4.out for
write permission.
10/12 01:38:25 Switching back to old priv state.
10/12 01:38:25 JobsRunning = 0
10/12 01:38:25 JobsIdle = 5
10/12 01:38:25 JobsHeld = 0
10/12 01:38:25 JobsRemoved = 0
10/12 01:38:25 LocalUniverseJobsRunning = 0
10/12 01:38:25 LocalUniverseJobsIdle = 0
10/12 01:38:25 SchedUniverseJobsRunning = 0
10/12 01:38:25 SchedUniverseJobsIdle = 0
10/12 01:38:25 N_Owners = 1
10/12 01:38:25 MaxJobsRunning = 200
10/12 01:38:25 ENABLE_SOAP is undefined, using default value of False
10/12 01:38:25 Trying to update collector <192.168.1.1:9618>
10/12 01:38:25 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:25 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:25 Sent HEART BEAT ad to 1 collectors. Number of submittors=1
10/12 01:38:25 Changed attribute: RunningJobs = 0
10/12 01:38:25 Changed attribute: IdleJobs = 5
10/12 01:38:25 Changed attribute: HeldJobs = 0
10/12 01:38:25 Changed attribute: FlockedJobs = 0
10/12 01:38:25 Changed attribute: Name = "condor@xxxxxxxxxxxxxxxxxxxx"
10/12 01:38:25 Sent ad to central manager for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:25 Trying to update collector <192.168.1.1:9618>
10/12 01:38:25 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:25 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:25 Sent ad to 1 collectors for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:25 Sent ad to 1 collectors for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:25 ============ Begin clean_shadow_recs =============
10/12 01:38:25 ============ End clean_shadow_recs =============
10/12 01:38:25 Called reschedule_negotiator()
10/12 01:38:25 Sending RESCHEDULE command to negotiator(s)
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 SEC_TCP_SESSION_TIMEOUT is undefined, using default value of
20
10/12 01:38:25
10/12 01:38:25 Entered negotiate
10/12 01:38:25 NEGOTIATOR_TIMEOUT is undefined, using default value of 20
10/12 01:38:25 *** SwapSpace = 2147483647
10/12 01:38:25 *** ReservedSwap = 0
10/12 01:38:25 *** Shadow Size Estimate = 1800
10/12 01:38:25 *** Start Limit For Swap = 1193046
10/12 01:38:25 *** Current num of active shadows = 0
10/12 01:38:25 Negotiating for owner: condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:25 Checking consistency running and runnable jobs
10/12 01:38:25 Tables are consistent
10/12 01:38:25 Sent job 1.0 (autocluster=1)
10/12 01:38:25 In case PERMISSION
10/12 01:38:25 Enqueued contactStartd startd=<192.168.1.101:35193>
10/12 01:38:25 Sent job 1.1 (autocluster=1)
10/12 01:38:25 In case PERMISSION
10/12 01:38:25 Enqueued contactStartd startd=<192.168.1.101:35193>
10/12 01:38:25 Sent job 1.2 (autocluster=1)
10/12 01:38:25 In case PERMISSION
10/12 01:38:25 Enqueued contactStartd startd=<192.168.1.102:34956>
10/12 01:38:25 Sent job 1.3 (autocluster=1)
10/12 01:38:25 In case PERMISSION
10/12 01:38:25 Enqueued contactStartd startd=<192.168.1.102:34956>
10/12 01:38:25 Sent job 1.4 (autocluster=1)
10/12 01:38:25 In case PERMISSION
10/12 01:38:25 Enqueued contactStartd startd=<192.168.1.103:34098>
10/12 01:38:25 Out of jobs - 5 jobs matched, 0 jobs idle, flock level = 0
10/12 01:38:25 Stashing socket to negotiator for future reuse
10/12 01:38:25 In checkContactQueue(), args = 0x84cf3a8,
host=<192.168.1.101:35193>
10/12 01:38:25 In Scheduler::contactStartd()
10/12 01:38:25 <192.168.1.101:35193>#1129080608#1
condor@xxxxxxxxxxxxxxxxxxxx <192.168.1.101:35193> 1.0
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 In checkContactQueue(), args = 0x84c49b0,
host=<192.168.1.101:35193>
10/12 01:38:25 In Scheduler::contactStartd()
10/12 01:38:25 <192.168.1.101:35193>#1129080608#2
condor@xxxxxxxxxxxxxxxxxxxx <192.168.1.101:35193> 1.1
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 In checkContactQueue(), args = 0x84ccc90,
host=<192.168.1.102:34956>
10/12 01:38:25 In Scheduler::contactStartd()
10/12 01:38:25 <192.168.1.102:34956>#1129080309#1
condor@xxxxxxxxxxxxxxxxxxxx <192.168.1.102:34956> 1.2
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 In checkContactQueue(), args = 0x84ccbc8,
host=<192.168.1.102:34956>
10/12 01:38:25 In Scheduler::contactStartd()
10/12 01:38:25 <192.168.1.102:34956>#1129080309#2
condor@xxxxxxxxxxxxxxxxxxxx <192.168.1.102:34956> 1.3
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 In checkContactQueue(), args = 0x84dbee0,
host=<192.168.1.103:34098>
10/12 01:38:25 In Scheduler::contactStartd()
10/12 01:38:25 <192.168.1.103:34098>#1129080749#1
condor@xxxxxxxxxxxxxxxxxxxx <192.168.1.103:34098> 1.4
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 In Scheduler::startdContactConnectHandler
10/12 01:38:25 Got mrec data pointer 0x84c3870
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 Registered startd contact socket.
10/12 01:38:25 In Scheduler::startdContactConnectHandler
10/12 01:38:25 Got mrec data pointer 0x84d6320
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:25 Registered startd contact socket.
10/12 01:38:25 In Scheduler::startdContactConnectHandler
10/12 01:38:25 Got mrec data pointer 0x84ce150
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 Registered startd contact socket.
10/12 01:38:25 In Scheduler::startdContactConnectHandler
10/12 01:38:25 Got mrec data pointer 0x84cd5d0
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:25 Registered startd contact socket.
10/12 01:38:25 In Scheduler::startdContactConnectHandler
10/12 01:38:25 Got mrec data pointer 0x84cd060
10/12 01:38:25 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:25 Registered startd contact socket.
10/12 01:38:25 In Scheduler::startdContactSockHandler
10/12 01:38:25 Got mrec data pointer 0x84c3870
10/12 01:38:25 Timer set...
10/12 01:38:25 In Scheduler::startdContactSockHandler
10/12 01:38:25 Got mrec data pointer 0x84ce150
10/12 01:38:25 In Scheduler::startdContactSockHandler
10/12 01:38:25 Got mrec data pointer 0x84cd060
10/12 01:38:26 In Scheduler::startdContactSockHandler
10/12 01:38:26 In Scheduler::startdContactSockHandler
10/12 01:38:26 Got mrec data pointer 0x84d6320
10/12 01:38:26 In Scheduler::startdContactSockHandler
10/12 01:38:26 Got mrec data pointer 0x84cd5d0
10/12 01:38:27 -------- Begin starting jobs --------
10/12 01:38:27 Job 1.0: is runnable
10/12 01:38:27 Scheduler::start_std - job=1.0 on <192.168.1.101:35193>
10/12 01:38:27 Queueing job 1.0 in runnable job queue
10/12 01:38:27 start next job after 0 sec, JobsThisBurst 0
10/12 01:38:27 Match (<192.168.1.101:35193>#1129080608#1) - running 1.0
10/12 01:38:27 Job 1.1: is runnable
10/12 01:38:27 Scheduler::start_std - job=1.1 on <192.168.1.101:35193>
10/12 01:38:27 Queueing job 1.1 in runnable job queue
10/12 01:38:27 Match (<192.168.1.101:35193>#1129080608#2) - running 1.1
10/12 01:38:27 Job 1.2: is runnable
10/12 01:38:27 Scheduler::start_std - job=1.2 on <192.168.1.102:34956>
10/12 01:38:27 Queueing job 1.2 in runnable job queue
10/12 01:38:27 Match (<192.168.1.102:34956>#1129080309#1) - running 1.2
10/12 01:38:27 Job 1.3: is runnable
10/12 01:38:27 Scheduler::start_std - job=1.3 on <192.168.1.102:34956>
10/12 01:38:27 Queueing job 1.3 in runnable job queue
10/12 01:38:27 Match (<192.168.1.102:34956>#1129080309#2) - running 1.3
10/12 01:38:27 Job 1.4: is runnable
10/12 01:38:27 Scheduler::start_std - job=1.4 on <192.168.1.103:34098>
10/12 01:38:27 Queueing job 1.4 in runnable job queue
10/12 01:38:27 Match (<192.168.1.103:34098>#1129080749#1) - running 1.4
10/12 01:38:27 -------- Done starting jobs --------
10/12 01:38:27 Job prep for 1.0 will not block, calling
aboutToSpawnJobHandler() directly
10/12 01:38:27 aboutToSpawnJobHandler() completed for job 1.0, attempting to
spawn job handler
10/12 01:38:27 Starting add_shadow_birthdate(1.0)
10/12 01:38:27 Added shadow record for PID 12878, job (1.0)
10/12 01:38:27
10/12 01:38:27 ..................
10/12 01:38:27 .. Shadow Recs (1/5)
10/12 01:38:27 .. 12878, 1.0, F, <192.168.1.101:35193>, cur_hosts=1,
status=2
10/12 01:38:27 ..................
10/12 01:38:27 Started shadow for job 1.0 on "<192.168.1.101:35193>",
(shadow pid = 12878)
10/12 01:38:27 start next job after 2 sec, JobsThisBurst 0
10/12 01:38:28 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:38:28 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:38:28 DaemonCore: Command received via TCP from host
<192.168.1.1:45643>
10/12 01:38:28 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:28 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:28 AUTHENTICATE_FS: used file /tmp/qmgr_Kl41Hy, status: 1
10/12 01:38:28 OwnerCheck retval 1 (success), super_user
10/12 01:38:28 OwnerCheck retval 1 (success), super_user
10/12 01:38:28 OwnerCheck retval 1 (success), super_user
10/12 01:38:28 OwnerCheck retval 1 (success), super_user
10/12 01:38:28 OwnerCheck retval 1 (success), super_user
10/12 01:38:28 condor_read(): Socket closed when trying to read buffer
10/12 01:38:28 QMGR Connection closed
10/12 01:38:28 DaemonCore: No more children processes to reap.
10/12 01:38:28 Shadow pid 12878 for job 1.0 exited with status 100
10/12 01:38:28 Reaper: JOB_EXITED
10/12 01:38:28 Entered delete_shadow_rec( 12878 )
10/12 01:38:28 Deleting shadow rec for PID 12878, job (1.0)
10/12 01:38:28 Entered check_zombie( 12878, 0x0x84cdf5c, st=4 )
10/12 01:38:28 Job 1.0 is finished
10/12 01:38:28 Added data to SelfDrainingQueue job_is_finished_queue, now
has 1 element(s)
10/12 01:38:28 Registered timer for SelfDrainingQueue job_is_finished_queue,
period: 0 (id: 16)
10/12 01:38:28 Exited check_zombie( 12878, 0x0x84cdf5c )
10/12 01:38:28
10/12 01:38:28 ..................
10/12 01:38:28 .. Shadow Recs (0/5)
10/12 01:38:28 ..................

10/12 01:38:28 -------- Begin starting jobs --------
10/12 01:38:28 Job 1.-1: not runnable
10/12 01:38:28 match (<192.168.1.101:35193>#1129080608#1) out of jobs
(cluster id 1); relinquishing
10/12 01:38:28 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:28 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:28 Called send_vacate( <192.168.1.101:35193>, 443 )
10/12 01:38:28 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:28 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:28 Sent RELEASE_CLAIM to startd on <192.168.1.101:35193>
10/12 01:38:28 Match record (<192.168.1.101:35193>, 1, -1) deleted
10/12 01:38:28 ClaimId of deleted match: <192.168.1.101:35193>#1129080608#1
10/12 01:38:28 match (<192.168.1.101:35193>#1129080608#2) already running a
job
10/12 01:38:28 match (<192.168.1.102:34956>#1129080309#1) already running a
job
10/12 01:38:28 match (<192.168.1.102:34956>#1129080309#2) already running a
job
10/12 01:38:28 match (<192.168.1.103:34098>#1129080749#1) already running a
job
10/12 01:38:28 -------- Done starting jobs --------
10/12 01:38:28 Inside SelfDrainingQueue::timerHandler() for
job_is_finished_queue
10/12 01:38:28 Job cleanup for 1.0 will not block, calling jobIsFinished()
directly
10/12 01:38:28 jobIsFinished() completed, calling DestroyProc(1.0)
10/12 01:38:28 jobIsFinished() completed, calling DestroyProc(1.0)
10/12 01:38:28 KEEP_OUTPUT_SANDBOX is undefined, using default value of
False
10/12 01:38:28 Saving classad to history file
10/12 01:38:28 SelfDrainingQueue job_is_finished_queue is empty, not
resetting timer
10/12 01:38:28 Canceling timer for SelfDrainingQueue job_is_finished_queue
(timer id: 16)
10/12 01:38:28 Got VACATE_SERVICE from <192.168.1.101:35205>
10/12 01:38:28 mrec for "<192.168.1.101:35193>#1129080608#1" not found --
match not deleted
10/12 01:38:29 DaemonCore: Command received via TCP from host
<192.168.1.1:45645>
10/12 01:38:29 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:29 condor_read(): Socket closed when trying to read buffer
10/12 01:38:29 QMGR Connection closed
10/12 01:38:29 Job prep for 1.1 will not block, calling
aboutToSpawnJobHandler() directly
10/12 01:38:29 aboutToSpawnJobHandler() completed for job 1.1, attempting to
spawn job handler
10/12 01:38:29 Starting add_shadow_birthdate(1.1)
10/12 01:38:29 Added shadow record for PID 12887, job (1.1)
10/12 01:38:29
10/12 01:38:29 ..................
10/12 01:38:29 .. Shadow Recs (1/4)
10/12 01:38:29 .. 12887, 1.1, F, <192.168.1.101:35193>, cur_hosts=1,
status=2
10/12 01:38:29 ..................

10/12 01:38:29 Started shadow for job 1.1 on "<192.168.1.101:35193>",
(shadow pid = 12887)
10/12 01:38:29 start next job after 2 sec, JobsThisBurst 0
10/12 01:38:30 JobsRunning = 4
10/12 01:38:30 JobsIdle = 0
10/12 01:38:30 JobsHeld = 0
10/12 01:38:30 JobsRemoved = 0
10/12 01:38:30 LocalUniverseJobsRunning = 0
10/12 01:38:30 LocalUniverseJobsIdle = 0
10/12 01:38:30 SchedUniverseJobsRunning = 0
10/12 01:38:30 SchedUniverseJobsIdle = 0
10/12 01:38:30 N_Owners = 1
10/12 01:38:30 MaxJobsRunning = 200
10/12 01:38:30 ENABLE_SOAP is undefined, using default value of False
10/12 01:38:30 Trying to update collector <192.168.1.1:9618>
10/12 01:38:30 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:30 Sent HEART BEAT ad to 1 collectors. Number of submittors=1
10/12 01:38:30 Changed attribute: RunningJobs = 4
10/12 01:38:30 Changed attribute: IdleJobs = 0
10/12 01:38:30 Changed attribute: HeldJobs = 0
10/12 01:38:30 Changed attribute: FlockedJobs = 0
10/12 01:38:30 Changed attribute: Name = "condor@xxxxxxxxxxxxxxxxxxxx"
10/12 01:38:30 Changed attribute: Name = "condor@xxxxxxxxxxxxxxxxxxxx"
10/12 01:38:30 Sent ad to central manager for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:30 Trying to update collector <192.168.1.1:9618>
10/12 01:38:30 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:30 Sent ad to 1 collectors for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:30 ============ Begin clean_shadow_recs =============
10/12 01:38:30 ============ End clean_shadow_recs =============
10/12 01:38:30 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:38:30 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:38:30 DaemonCore: Command received via TCP from host
<192.168.1.1:45650>
10/12 01:38:30 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:30 condor_read(): Socket closed when trying to read buffer
10/12 01:38:30 QMGR Connection closed
10/12 01:38:30 DaemonCore: Command received via TCP from host
<192.168.1.1:45651>
10/12 01:38:30 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:30 AUTHENTICATE_FS: used file /tmp/qmgr_tDVUN2, status: 1
10/12 01:38:30 OwnerCheck retval 1 (success), super_user
10/12 01:38:30 OwnerCheck retval 1 (success), super_user
10/12 01:38:30 OwnerCheck retval 1 (success), super_user
10/12 01:38:30 OwnerCheck retval 1 (success), super_user
10/12 01:38:30 OwnerCheck retval 1 (success), super_user
10/12 01:38:30 condor_read(): Socket closed when trying to read buffer
10/12 01:38:30 QMGR Connection closed
10/12 01:38:30 DaemonCore: No more children processes to reap.
10/12 01:38:30 Shadow pid 12887 for job 1.1 exited with status 100
10/12 01:38:30 Reaper: JOB_EXITED
10/12 01:38:30 Entered delete_shadow_rec( 12887 )
10/12 01:38:30 Deleting shadow rec for PID 12887, job (1.1)
10/12 01:38:30 Entered check_zombie( 12887, 0x0x84d51e4, st=4 )
10/12 01:38:30 Job 1.1 is finished
10/12 01:38:30 Added data to SelfDrainingQueue job_is_finished_queue, now
has 1 element(s)
10/12 01:38:30 Registered timer for SelfDrainingQueue job_is_finished_queue,
period: 0 (id: 19)
10/12 01:38:30 Exited check_zombie( 12887, 0x0x84d51e4 )
10/12 01:38:30
10/12 01:38:30 ..................
10/12 01:38:30 .. Shadow Recs (0/4)
10/12 01:38:30 ..................

10/12 01:38:30 -------- Begin starting jobs --------
10/12 01:38:30 match (<192.168.1.103:34098>#1129080749#1) already running a
job
10/12 01:38:30 Job 1.-1: not runnable
10/12 01:38:30 match (<192.168.1.101:35193>#1129080608#2) out of jobs
(cluster id 1); relinquishing
10/12 01:38:30 match (<192.168.1.101:35193>#1129080608#2) out of jobs
(cluster id 1); relinquishing
10/12 01:38:30 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:30 Called send_vacate( <192.168.1.101:35193>, 443 )
10/12 01:38:30 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:30 Sent RELEASE_CLAIM to startd on <192.168.1.101:35193>
10/12 01:38:30 Match record (<192.168.1.101:35193>, 1, -1) deleted
10/12 01:38:30 ClaimId of deleted match: <192.168.1.101:35193>#1129080608#2
10/12 01:38:30 match (<192.168.1.102:34956>#1129080309#1) already running a
job
10/12 01:38:30 match (<192.168.1.102:34956>#1129080309#2) already running a
job
10/12 01:38:30 -------- Done starting jobs --------
10/12 01:38:30 Inside SelfDrainingQueue::timerHandler() for
job_is_finished_queue
10/12 01:38:30 Job cleanup for 1.1 will not block, calling jobIsFinished()
directly
10/12 01:38:30 jobIsFinished() completed, calling DestroyProc(1.1)
10/12 01:38:30 KEEP_OUTPUT_SANDBOX is undefined, using default value of
False
10/12 01:38:30 Saving classad to history file
10/12 01:38:30 SelfDrainingQueue job_is_finished_queue is empty, not
resetting timer
10/12 01:38:30 Canceling timer for SelfDrainingQueue job_is_finished_queue
(timer id: 19)
10/12 01:38:30 Got VACATE_SERVICE from <192.168.1.101:35210>
10/12 01:38:30 mrec for "<192.168.1.101:35193>#1129080608#2" not found --
match not deleted
10/12 01:38:31 DaemonCore: Command received via TCP from host
<192.168.1.1:45653>
10/12 01:38:31 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:31 condor_read(): Socket closed when trying to read buffer
10/12 01:38:31 QMGR Connection closed
10/12 01:38:31 Job prep for 1.2 will not block, calling
aboutToSpawnJobHandler() directly
10/12 01:38:31 aboutToSpawnJobHandler() completed for job 1.2, attempting to
spawn job handler
10/12 01:38:31 Starting add_shadow_birthdate(1.2)
10/12 01:38:31 Added shadow record for PID 12893, job (1.2)
10/12 01:38:31
10/12 01:38:31 ..................
10/12 01:38:31 .. Shadow Recs (1/3)
10/12 01:38:31 .. 12893, 1.2, F, <192.168.1.102:34956>, cur_hosts=1,
status=2
10/12 01:38:31 ..................
10/12 01:38:31 Started shadow for job 1.2 on "<192.168.1.102:34956>",
(shadow pid = 12893)
10/12 01:38:31 start next job after 2 sec, JobsThisBurst 0
10/12 01:38:32 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:38:32 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:38:33 Job prep for 1.3 will not block, calling
aboutToSpawnJobHandler() directly
10/12 01:38:33 aboutToSpawnJobHandler() completed for job 1.3, attempting to
spawn job handler
10/12 01:38:33 Starting add_shadow_birthdate(1.3)
10/12 01:38:33 Added shadow record for PID 12894, job (1.3)
10/12 01:38:33 Added shadow record for PID 12894, job (1.3)
10/12 01:38:33
10/12 01:38:33 ..................
10/12 01:38:33 .. Shadow Recs (2/3)
10/12 01:38:33 .. 12893, 1.2, F, <192.168.1.102:34956>, cur_hosts=1,
status=2
10/12 01:38:33 .. 12894, 1.3, F, <192.168.1.102:34956>, cur_hosts=1,
status=2
10/12 01:38:33 ..................

10/12 01:38:33 Started shadow for job 1.3 on "<192.168.1.102:34956>",
(shadow pid = 12894)
10/12 01:38:33 start next job after 2 sec, JobsThisBurst 0
10/12 01:38:33 DaemonCore: Command received via TCP from host
<192.168.1.1:45660>
10/12 01:38:33 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:33 AUTHENTICATE_FS: used file /tmp/qmgr_op2VbE, status: 1
10/12 01:38:33 OwnerCheck retval 1 (success), super_user
10/12 01:38:33 OwnerCheck retval 1 (success), super_user
10/12 01:38:33 OwnerCheck retval 1 (success), super_user
10/12 01:38:33 OwnerCheck retval 1 (success), super_user
10/12 01:38:33 OwnerCheck retval 1 (success), super_user
10/12 01:38:33 condor_read(): Socket closed when trying to read buffer
10/12 01:38:33 QMGR Connection closed
10/12 01:38:33 DaemonCore: No more children processes to reap.
10/12 01:38:33 Shadow pid 12893 for job 1.2 exited with status 100
10/12 01:38:33 Reaper: JOB_EXITED
10/12 01:38:33 Entered delete_shadow_rec( 12893 )
10/12 01:38:33 Deleting shadow rec for PID 12893, job (1.2)
10/12 01:38:33 Entered check_zombie( 12893, 0x0x84dd25c, st=4 )
10/12 01:38:33 Job 1.2 is finished
10/12 01:38:33 Added data to SelfDrainingQueue job_is_finished_queue, now
has 1 element(s)
10/12 01:38:33 Registered timer for SelfDrainingQueue job_is_finished_queue,
period: 0 (id: 23)
10/12 01:38:33 Exited check_zombie( 12893, 0x0x84dd25c )
10/12 01:38:33
10/12 01:38:33 ..................
10/12 01:38:33 .. Shadow Recs (1/3)
10/12 01:38:33 .. 12894, 1.3, F, <192.168.1.102:34956>, cur_hosts=1,
status=2
10/12 01:38:33 ..................
10/12 01:38:33 -------- Begin starting jobs --------
10/12 01:38:33 match (<192.168.1.103:34098>#1129080749#1) already running a
job
10/12 01:38:33 match (<192.168.1.102:34956>#1129080309#2) already running a
job
10/12 01:38:33 Job 1.-1: not runnable
10/12 01:38:33 match (<192.168.1.102:34956>#1129080309#1) out of jobs
(cluster id 1); relinquishing
10/12 01:38:33 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:33 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:33 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:33 Called send_vacate( <192.168.1.102:34956>, 443 )
10/12 01:38:33 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:33 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:33 Sent RELEASE_CLAIM to startd on <192.168.1.102:34956>
10/12 01:38:33 Match record (<192.168.1.102:34956>, 1, -1) deleted
10/12 01:38:33 ClaimId of deleted match: <192.168.1.102:34956>#1129080309#1
10/12 01:38:33 -------- Done starting jobs --------
10/12 01:38:33 Inside SelfDrainingQueue::timerHandler() for
job_is_finished_queue
10/12 01:38:33 Job cleanup for 1.2 will not block, calling jobIsFinished()
directly
10/12 01:38:33 jobIsFinished() completed, calling DestroyProc(1.2)
10/12 01:38:33 KEEP_OUTPUT_SANDBOX is undefined, using default value of
False
10/12 01:38:33 Saving classad to history file
10/12 01:38:33 SelfDrainingQueue job_is_finished_queue is empty, not
resetting timer
10/12 01:38:33 Canceling timer for SelfDrainingQueue job_is_finished_queue
(timer id: 23)
10/12 01:38:33 Got VACATE_SERVICE from <192.168.1.102:34971>
10/12 01:38:33 mrec for "<192.168.1.102:34956>#1129080309#1" not found --
match not deleted
10/12 01:38:34 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:38:34 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:38:34 DaemonCore: Command received via TCP from host
<192.168.1.1:45664>
10/12 01:38:34 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:34 AUTHENTICATE_FS: used file /tmp/qmgr_Dtf0Ij, status: 1
10/12 01:38:34 OwnerCheck retval 1 (success), super_user
10/12 01:38:34 OwnerCheck retval 1 (success), super_user
10/12 01:38:34 OwnerCheck retval 1 (success), super_user
10/12 01:38:34 OwnerCheck retval 1 (success), super_user
10/12 01:38:34 OwnerCheck retval 1 (success), super_user
10/12 01:38:34 condor_read(): Socket closed when trying to read buffer
10/12 01:38:34 QMGR Connection closed
10/12 01:38:34 DaemonCore: No more children processes to reap.
10/12 01:38:34 Shadow pid 12894 for job 1.3 exited with status 100
10/12 01:38:34 Reaper: JOB_EXITED
10/12 01:38:34 Entered delete_shadow_rec( 12894 )
10/12 01:38:34 Deleting shadow rec for PID 12894, job (1.3)
10/12 01:38:34 Entered check_zombie( 12894, 0x0x84cdeb4, st=4 )
10/12 01:38:34 Job 1.3 is finished
10/12 01:38:34 Added data to SelfDrainingQueue job_is_finished_queue, now
has 1 element(s)
10/12 01:38:34 Registered timer for SelfDrainingQueue job_is_finished_queue,
period: 0 (id: 25)
10/12 01:38:34 Exited check_zombie( 12894, 0x0x84cdeb4 )
10/12 01:38:34
10/12 01:38:34 ..................
10/12 01:38:34 .. Shadow Recs (0/2)
10/12 01:38:34 ..................
10/12 01:38:34 ..................

10/12 01:38:34 -------- Begin starting jobs --------
10/12 01:38:34 match (<192.168.1.103:34098>#1129080749#1) already running a
job
10/12 01:38:34 Job 1.-1: not runnable
10/12 01:38:34 match (<192.168.1.102:34956>#1129080309#2) out of jobs
(cluster id 1); relinquishing
10/12 01:38:34 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:34 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:34 Called send_vacate( <192.168.1.102:34956>, 443 )
10/12 01:38:34 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:34 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:34 Sent RELEASE_CLAIM to startd on <192.168.1.102:34956>
10/12 01:38:34 Match record (<192.168.1.102:34956>, 1, -1) deleted
10/12 01:38:34 ClaimId of deleted match: <192.168.1.102:34956>#1129080309#2
10/12 01:38:34 -------- Done starting jobs --------
10/12 01:38:34 Inside SelfDrainingQueue::timerHandler() for
job_is_finished_queue
10/12 01:38:34 Job cleanup for 1.3 will not block, calling jobIsFinished()
directly
10/12 01:38:34 jobIsFinished() completed, calling DestroyProc(1.3)
10/12 01:38:34 KEEP_OUTPUT_SANDBOX is undefined, using default value of
False
10/12 01:38:34 Saving classad to history file
10/12 01:38:34 SelfDrainingQueue job_is_finished_queue is empty, not
resetting timer
10/12 01:38:34 Canceling timer for SelfDrainingQueue job_is_finished_queue
(timer id: 25)
10/12 01:38:34 Got VACATE_SERVICE from <192.168.1.102:34973>
10/12 01:38:34 mrec for "<192.168.1.102:34956>#1129080309#2" not found --
match not deleted
10/12 01:38:35 Job prep for 1.4 will not block, calling
aboutToSpawnJobHandler() directly
10/12 01:38:35 aboutToSpawnJobHandler() completed for job 1.4, attempting to
spawn job handler
10/12 01:38:35 Starting add_shadow_birthdate(1.4)
10/12 01:38:35 Added shadow record for PID 12902, job (1.4)
10/12 01:38:35
10/12 01:38:35 ..................
10/12 01:38:35 .. Shadow Recs (1/1)
10/12 01:38:35 .. 12902, 1.4, F, <192.168.1.103:34098>, cur_hosts=1,
status=2
10/12 01:38:35 ..................
10/12 01:38:35 Started shadow for job 1.4 on "<192.168.1.103:34098>",
(shadow pid = 12902)
10/12 01:38:35 JobsRunning = 1
10/12 01:38:35 JobsIdle = 0
10/12 01:38:35 JobsHeld = 0
10/12 01:38:35 JobsRemoved = 0
10/12 01:38:35 LocalUniverseJobsRunning = 0
10/12 01:38:35 LocalUniverseJobsIdle = 0
10/12 01:38:35 SchedUniverseJobsRunning = 0
10/12 01:38:35 SchedUniverseJobsIdle = 0
10/12 01:38:35 SchedUniverseJobsIdle = 0
10/12 01:38:35 N_Owners = 1
10/12 01:38:35 MaxJobsRunning = 200
10/12 01:38:35 ENABLE_SOAP is undefined, using default value of False
10/12 01:38:35 Trying to update collector <192.168.1.1:9618>
10/12 01:38:35 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:35 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:35 Sent HEART BEAT ad to 1 collectors. Number of submittors=1
10/12 01:38:35 Changed attribute: RunningJobs = 1
10/12 01:38:35 Changed attribute: IdleJobs = 0
10/12 01:38:35 Changed attribute: HeldJobs = 0
10/12 01:38:35 Changed attribute: FlockedJobs = 0
10/12 01:38:35 Changed attribute: Name = "condor@xxxxxxxxxxxxxxxxxxxx"
10/12 01:38:35 Sent ad to central manager for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:35 Trying to update collector <192.168.1.1:9618>
10/12 01:38:35 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:38:35 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:35 Sent ad to 1 collectors for condor@xxxxxxxxxxxxxxxxxxxx
10/12 01:38:35 ============ Begin clean_shadow_recs =============
10/12 01:38:35 ============ End clean_shadow_recs =============
10/12 01:38:35 -------- Begin starting jobs --------
10/12 01:38:35 match (<192.168.1.103:34098>#1129080749#1) already running a
job
10/12 01:38:35 -------- Done starting jobs --------
10/12 01:38:36 DaemonCore: Command received via UDP from host
<192.168.1.1:36006>
10/12 01:38:36 DaemonCore: received command 60008 (DC_CHILDALIVE), calling
handler (HandleChildAliveCommand)
10/12 01:38:37 DaemonCore: Command received via TCP from host
<192.168.1.1:45670>
10/12 01:38:37 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:37 AUTHENTICATE_FS: used file /tmp/qmgr_0JBVD7, status: 1
10/12 01:38:37 OwnerCheck retval 1 (success), super_user
10/12 01:38:37 OwnerCheck retval 1 (success), super_user
10/12 01:38:37 OwnerCheck retval 1 (success), super_user
10/12 01:38:37 OwnerCheck retval 1 (success), super_user
10/12 01:38:37 OwnerCheck retval 1 (success), super_user
10/12 01:38:37 condor_read(): Socket closed when trying to read buffer
10/12 01:38:37 QMGR Connection closed
10/12 01:38:37 DaemonCore: No more children processes to reap.
10/12 01:38:37 Shadow pid 12902 for job 1.4 exited with status 100
10/12 01:38:37 Reaper: JOB_EXITED
10/12 01:38:37 Entered delete_shadow_rec( 12902 )
10/12 01:38:37 Deleting shadow rec for PID 12902, job (1.4)
10/12 01:38:37 Entered check_zombie( 12902, 0x0x84cf58c, st=4 )
10/12 01:38:37 Job 1.4 is finished
10/12 01:38:37 Added data to SelfDrainingQueue job_is_finished_queue, now
has 1 element(s)
10/12 01:38:37 Registered timer for SelfDrainingQueue job_is_finished_queue,
period: 0 (id: 27)
10/12 01:38:37 Exited check_zombie( 12902, 0x0x84cf58c )
10/12 01:38:37
10/12 01:38:37 ..................
10/12 01:38:37 .. Shadow Recs (0/1)
10/12 01:38:37 ..................

10/12 01:38:37 -------- Begin starting jobs --------
10/12 01:38:37 Job 1.-1: not runnable
10/12 01:38:37 match (<192.168.1.103:34098>#1129080749#1) out of jobs
(cluster id 1); relinquishing
10/12 01:38:37 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:37 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:37 Called send_vacate( <192.168.1.103:34098>, 443 )
10/12 01:38:37 SCHEDD_TIMEOUT_MULTIPLIER is undefined, using default value
of 0
10/12 01:38:37 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:38:37 Sent RELEASE_CLAIM to startd on <192.168.1.103:34098>
10/12 01:38:37 Match record (<192.168.1.103:34098>, 1, -1) deleted
10/12 01:38:37 ClaimId of deleted match: <192.168.1.103:34098>#1129080749#1
10/12 01:38:37 -------- Done starting jobs --------
10/12 01:38:37 Inside SelfDrainingQueue::timerHandler() for
job_is_finished_queue
10/12 01:38:37 Job cleanup for 1.4 will not block, calling jobIsFinished()
directly
10/12 01:38:37 jobIsFinished() completed, calling DestroyProc(1.4)
10/12 01:38:37 KEEP_OUTPUT_SANDBOX is undefined, using default value of
False
10/12 01:38:37 Saving classad to history file
10/12 01:38:37 SelfDrainingQueue job_is_finished_queue is empty, not
resetting timer
10/12 01:38:37 Canceling timer for SelfDrainingQueue job_is_finished_queue
(timer id: 27)
10/12 01:38:37 Got VACATE_SERVICE from <192.168.1.103:34110>
10/12 01:38:37 mrec for "<192.168.1.103:34098>#1129080749#1" not found --
match not deleted
10/12 01:38:50 DaemonCore: Command received via TCP from host
<192.168.1.1:45672>
10/12 01:38:50 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:38:50 condor_read(): Socket closed when trying to read buffer
10/12 01:38:50 QMGR Connection closed
10/12 01:39:13 Getting monitoring info for pid 12799
10/12 01:42:19 DaemonCore: Command received via TCP from host
<192.168.1.1:45673>
10/12 01:42:19 DaemonCore: received command 1111 (QMGMT_CMD), calling
handler (handle_q)
10/12 01:42:19 condor_read(): Socket closed when trying to read buffer
10/12 01:42:19 QMGR Connection closed
10/12 01:43:13 Getting monitoring info for pid 12799
10/12 01:43:35 JobsRunning = 0
10/12 01:43:35 JobsIdle = 0
10/12 01:43:35 JobsHeld = 0
10/12 01:43:35 JobsRemoved = 0
10/12 01:43:35 LocalUniverseJobsRunning = 0
10/12 01:43:35 LocalUniverseJobsIdle = 0
10/12 01:43:35 SchedUniverseJobsRunning = 0
10/12 01:43:35 SchedUniverseJobsIdle = 0
10/12 01:43:35 N_Owners = 0
10/12 01:43:35 MaxJobsRunning = 200
10/12 01:43:35 ENABLE_SOAP is undefined, using default value of False
10/12 01:43:35 Trying to update collector <192.168.1.1:9618>
10/12 01:43:35 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:43:35 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:43:35 Sent HEART BEAT ad to 1 collectors. Number of submittors=0
10/12 01:43:35 Changed attribute: Name = "condor@xxxxxxxxxxxxxxxxxxxx"
10/12 01:43:35 Trying to update collector <192.168.1.1:9618>
10/12 01:43:35 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:43:35 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:43:35 Sent owner (0 jobs) ad to 1 collectors
10/12 01:43:35 ============ Begin clean_shadow_recs =============
10/12 01:43:35 ============ End clean_shadow_recs =============
10/12 01:43:37 -------- Begin starting jobs --------
10/12 01:43:37 -------- Done starting jobs --------
10/12 01:47:13 Getting monitoring info for pid 12799
10/12 01:48:35 JobsRunning = 0
10/12 01:48:35 JobsIdle = 0
10/12 01:48:35 JobsHeld = 0
10/12 01:48:35 JobsRemoved = 0
10/12 01:48:35 LocalUniverseJobsRunning = 0
10/12 01:48:35 LocalUniverseJobsIdle = 0
10/12 01:48:35 SchedUniverseJobsRunning = 0
10/12 01:48:35 SchedUniverseJobsIdle = 0
10/12 01:48:35 N_Owners = 0
10/12 01:48:35 MaxJobsRunning = 200
10/12 01:48:35 ENABLE_SOAP is undefined, using default value of False
10/12 01:48:35 Trying to update collector <192.168.1.1:9618>
10/12 01:48:35 Attempting to send update via UDP to collector
thebeast.cluster.int <192.168.1.1:9618>
10/12 01:48:35 SEC_DEBUG_PRINT_KEYS is undefined, using default value of
False
10/12 01:48:35 Sent HEART BEAT ad to 1 collectors. Number of submittors=0
10/12 01:48:35 ============ Begin clean_shadow_recs =============
10/12 01:48:35 ============ End clean_shadow_recs =============
10/12 01:48:37 -------- Begin starting jobs --------
10/12 01:48:37 -------- Done starting jobs --------
10/12 01:50:44 DaemonCore: in SendAliveToParent()
10/12 01:50:44 DaemonCore: attempting to connect to '<192.168.1.1:45578>'













ShadowLog

10/12 01:38:27 ******************************************************
10/12 01:38:27 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:27 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:27 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:27 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:27 ** PID = 12878
10/12 01:38:27 ******************************************************
10/12 01:38:27 Using config file: /home/condor/etc/condor_config
10/12 01:38:27 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:27 DaemonCore: Command Socket at <192.168.1.1:45639>
10/12 01:38:27 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:27 Reading job ClassAd from STDIN
10/12 01:38:27 Initializing a VANILLA shadow for job 1.0
10/12 01:38:27 (1.0) (12878): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:27 (1.0) (12878): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:27 (1.0) (12878): *** Reserved Swap = 0
10/12 01:38:27 (1.0) (12878): *** Free Swap = 787168
10/12 01:38:27 (1.0) (12878): in RemoteResource::initStartdInfo()
10/12 01:38:27 (1.0) (12878): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:27 (1.0) (12878): Entering DCStartd::activateClaim()
10/12 01:38:27 (1.0) (12878): DCStartd::activateClaim: successfully sent
command, reply is: 1
10/12 01:38:27 (1.0) (12878): Request to run on <192.168.1.101:35193> was
ACCEPTED
10/12 01:38:27 (1.0) (12878): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from PRE to STARTUP
10/12 01:38:27 (1.0) (12878): Getting monitoring info for pid 12878
10/12 01:38:27 (1.0) (12878): entering FileTransfer::Init
10/12 01:38:27 (1.0) (12878): entering FileTransfer::SimpleInit
10/12 01:38:27 (1.0) (12878): entering FileTransfer::HandleCommands
10/12 01:38:27 (1.0) (12878): FileTransfer::HandleCommands read
transkey=1#434c5b036fe0c01059a0454b
10/12 01:38:27 (1.0) (12878): entering FileTransfer::Upload
10/12 01:38:27 (1.0) (12878): entering FileTransfer::DoUpload
10/12 01:38:27 (1.0) (12878): DoUpload: send file
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:27 (1.0) (12878): ReliSock::put_file_with_permissions(): going
to send permissions 100755
10/12 01:38:27 (1.0) (12878): put_file: going to send from filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:27 (1.0) (12878): put_file: Found file size 10457
10/12 01:38:27 (1.0) (12878): put_file: senting 10457 bytes
10/12 01:38:27 (1.0) (12878): ReliSock: put_file: sent 10457 bytes
10/12 01:38:27 (1.0) (12878): DoUpload: exiting at 1605
10/12 01:38:28 (1.0) (12878): DaemonCore: in SendAliveToParent()
10/12 01:38:28 (1.0) (12878): DaemonCore: attempting to connect to
'<192.168.1.1:45580>'
10/12 01:38:28 (1.0) (12878): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:28 (1.0) (12878): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:28 (1.0) (12878): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:28 (1.0) (12878): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:28 (1.0) (12878): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:28 (1.0) (12878): Inside RemoteResource::updateFromStarter()
10/12 01:38:28 (1.0) (12878): Inside RemoteResource::resourceExit()
10/12 01:38:28 (1.0) (12878): setting exit reason on vm1@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:28 (1.0) (12878): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:28 (1.0) (12878): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:28 (1.0) (12878): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:28 (1.0) (12878): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:28 (1.0) (12878): Killed starter (fast) at <192.168.1.101:35193>
10/12 01:38:28 (1.0) (12878): Job 1.0 terminated: exited with status 0
10/12 01:38:28 (1.0) (12878): Forking Mailer process...
10/12 01:38:28 (1.0) (12878): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:28 (1.0) (12878): AUTHENTICATE_FS: used file /tmp/qmgr_Kl41Hy,
status: 1
10/12 01:38:28 (1.0) (12878): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077508)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:28 (1.0) (12878): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100
10/12 01:38:29 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:38:29 ******************************************************
10/12 01:38:29 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:29 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:29 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:29 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:29 ** PID = 12887
10/12 01:38:29 ******************************************************
10/12 01:38:29 Using config file: /home/condor/etc/condor_config
10/12 01:38:29 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:29 DaemonCore: Command Socket at <192.168.1.1:45646>
10/12 01:38:29 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:29 Reading job ClassAd from STDIN
10/12 01:38:29 Initializing a VANILLA shadow for job 1.1
10/12 01:38:29 (1.1) (12887): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:29 (1.1) (12887): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:29 (1.1) (12887): *** Reserved Swap = 0
10/12 01:38:29 (1.1) (12887): *** Free Swap = 787168
10/12 01:38:29 (1.1) (12887): in RemoteResource::initStartdInfo()
10/12 01:38:28 (1.0) (12878): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:28 (1.0) (12878): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:28 (1.0) (12878): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:28 (1.0) (12878): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:28 (1.0) (12878): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:28 (1.0) (12878): Inside RemoteResource::updateFromStarter()
10/12 01:38:28 (1.0) (12878): Inside RemoteResource::resourceExit()
10/12 01:38:28 (1.0) (12878): setting exit reason on vm1@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:28 (1.0) (12878): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:28 (1.0) (12878): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:28 (1.0) (12878): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:28 (1.0) (12878): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:28 (1.0) (12878): Killed starter (fast) at <192.168.1.101:35193>
10/12 01:38:28 (1.0) (12878): Job 1.0 terminated: exited with status 0
10/12 01:38:28 (1.0) (12878): Forking Mailer process...
10/12 01:38:28 (1.0) (12878): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:28 (1.0) (12878): AUTHENTICATE_FS: used file /tmp/qmgr_Kl41Hy,
status: 1
10/12 01:38:28 (1.0) (12878): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077508)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:28 (1.0) (12878): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:28 (1.0) (12878): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100
10/12 01:38:29 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:38:29 ******************************************************
10/12 01:38:29 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:29 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:29 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:29 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:29 ** PID = 12887
10/12 01:38:29 ******************************************************
10/12 01:38:29 Using config file: /home/condor/etc/condor_config
10/12 01:38:29 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:29 DaemonCore: Command Socket at <192.168.1.1:45646>
10/12 01:38:29 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:29 Reading job ClassAd from STDIN
10/12 01:38:29 Initializing a VANILLA shadow for job 1.1
10/12 01:38:29 (1.1) (12887): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:29 (1.1) (12887): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:29 (1.1) (12887): *** Reserved Swap = 0
10/12 01:38:29 (1.1) (12887): *** Free Swap = 787168
10/12 01:38:29 (1.1) (12887): in RemoteResource::initStartdInfo()
10/12 01:38:29 (1.1) (12887): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:29 (1.1) (12887): Entering DCStartd::activateClaim()
10/12 01:38:29 (1.1) (12887): DCStartd::activateClaim: successfully sent
command, reply is: 1
10/12 01:38:29 (1.1) (12887): Request to run on <192.168.1.101:35193> was
ACCEPTED
10/12 01:38:29 (1.1) (12887): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from PRE to STARTUP
10/12 01:38:29 (1.1) (12887): Getting monitoring info for pid 12887
10/12 01:38:29 (1.1) (12887): entering FileTransfer::Init
10/12 01:38:29 (1.1) (12887): entering FileTransfer::SimpleInit
10/12 01:38:29 (1.1) (12887): entering FileTransfer::HandleCommands
10/12 01:38:29 (1.1) (12887): FileTransfer::HandleCommands read
transkey=1#434c5b052440ca79388d2bb8
10/12 01:38:29 (1.1) (12887): entering FileTransfer::Upload
10/12 01:38:29 (1.1) (12887): entering FileTransfer::DoUpload
10/12 01:38:29 (1.1) (12887): DoUpload: send file
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:29 (1.1) (12887): ReliSock::put_file_with_permissions(): going
to send permissions 100755
10/12 01:38:29 (1.1) (12887): put_file: going to send from filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:29 (1.1) (12887): put_file: Found file size 10457
10/12 01:38:29 (1.1) (12887): put_file: senting 10457 bytes
10/12 01:38:29 (1.1) (12887): ReliSock: put_file: sent 10457 bytes
10/12 01:38:29 (1.1) (12887): DoUpload: exiting at 1605
10/12 01:38:30 (1.1) (12887): DaemonCore: in SendAliveToParent()
10/12 01:38:30 (1.1) (12887): DaemonCore: attempting to connect to
'<192.168.1.1:45580>'
10/12 01:38:30 (1.1) (12887): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:30 (1.1) (12887): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:30 (1.1) (12887): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:30 (1.1) (12887): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:30 (1.1) (12887): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:30 (1.1) (12887): Inside RemoteResource::updateFromStarter()
10/12 01:38:30 (1.1) (12887): Inside RemoteResource::resourceExit()
10/12 01:38:30 (1.1) (12887): setting exit reason on vm2@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:30 (1.1) (12887): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:30 (1.1) (12887): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:30 (1.1) (12887): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:30 (1.1) (12887): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:30 (1.1) (12887): Killed starter (fast) at <192.168.1.101:35193>
10/12 01:38:30 (1.1) (12887): Job 1.1 terminated: exited with status 0
10/12 01:38:30 (1.1) (12887): Forking Mailer process...
10/12 01:38:30 (1.1) (12887): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:30 (1.1) (12887): AUTHENTICATE_FS: used file /tmp/qmgr_tDVUN2,
status: 1
10/12 01:38:30 (1.1) (12887): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077510)
10/12 01:38:30 (1.1) (12887): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:30 (1.1) (12887): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:30 (1.1) (12887): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:30 (1.1) (12887): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:30 (1.1) (12887): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:30 (1.1) (12887): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100
10/12 01:38:31 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:38:31 ******************************************************
10/12 01:38:31 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:31 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:31 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:31 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:31 ** PID = 12893
10/12 01:38:31 ******************************************************
10/12 01:38:31 Using config file: /home/condor/etc/condor_config
10/12 01:38:31 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:31 DaemonCore: Command Socket at <192.168.1.1:45654>
10/12 01:38:31 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:31 Reading job ClassAd from STDIN
10/12 01:38:31 Initializing a VANILLA shadow for job 1.2
10/12 01:38:31 (1.2) (12893): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:31 (1.2) (12893): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:31 (1.2) (12893): *** Reserved Swap = 0
10/12 01:38:31 (1.2) (12893): *** Free Swap = 787168
10/12 01:38:31 (1.2) (12893): in RemoteResource::initStartdInfo()
10/12 01:38:31 (1.2) (12893): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:31 (1.2) (12893): Entering DCStartd::activateClaim()
10/12 01:38:31 (1.2) (12893): DCStartd::activateClaim: successfully sent
command, reply is: 1
10/12 01:38:31 (1.2) (12893): Request to run on <192.168.1.102:34956> was
ACCEPTED
10/12 01:38:31 (1.2) (12893): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from PRE to STARTUP
10/12 01:38:31 (1.2) (12893): Getting monitoring info for pid 12893
10/12 01:38:31 (1.2) (12893): entering FileTransfer::Init
10/12 01:38:31 (1.2) (12893): entering FileTransfer::SimpleInit
10/12 01:38:32 (1.2) (12893): DaemonCore: in SendAliveToParent()
10/12 01:38:32 (1.2) (12893): DaemonCore: attempting to connect to
'<192.168.1.1:45580>'
10/12 01:38:32 (1.2) (12893): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:32 (1.2) (12893): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:32 (1.2) (12893): entering FileTransfer::HandleCommands
10/12 01:38:32 (1.2) (12893): FileTransfer::HandleCommands read
transkey=1#434c5b0758a0d4e2177a1225
10/12 01:38:32 (1.2) (12893): entering FileTransfer::Upload
10/12 01:38:32 (1.2) (12893): entering FileTransfer::DoUpload
10/12 01:38:32 (1.2) (12893): DoUpload: send file
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:32 (1.2) (12893): ReliSock::put_file_with_permissions(): going
to send permissions 100755
10/12 01:38:32 (1.2) (12893): put_file: going to send from filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:32 (1.2) (12893): put_file: Found file size 10457
10/12 01:38:32 (1.2) (12893): put_file: senting 10457 bytes
10/12 01:38:32 (1.2) (12893): ReliSock: put_file: sent 10457 bytes
10/12 01:38:32 (1.2) (12893): DoUpload: exiting at 1605
10/12 01:38:33 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:38:33 ******************************************************
10/12 01:38:33 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:33 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:33 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:33 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:33 ** PID = 12894
10/12 01:38:33 ******************************************************
10/12 01:38:33 Using config file: /home/condor/etc/condor_config
10/12 01:38:33 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:33 DaemonCore: Command Socket at <192.168.1.1:45657>
10/12 01:38:33 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:33 Reading job ClassAd from STDIN
10/12 01:38:33 Initializing a VANILLA shadow for job 1.3
10/12 01:38:33 (1.3) (12894): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:33 (1.3) (12894): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:33 (1.3) (12894): *** Reserved Swap = 0
10/12 01:38:33 (1.3) (12894): *** Free Swap = 787168
10/12 01:38:33 (1.3) (12894): in RemoteResource::initStartdInfo()
10/12 01:38:33 (1.3) (12894): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:33 (1.3) (12894): Entering DCStartd::activateClaim()
10/12 01:38:33 (1.2) (12893): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:33 (1.2) (12893): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:33 (1.2) (12893): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:33 (1.3) (12894): DCStartd::activateClaim: successfully sent
command, reply is: 1
10/12 01:38:33 (1.3) (12894): Request to run on <192.168.1.102:34956> was
ACCEPTED
10/12 01:38:33 (1.3) (12894): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from PRE to STARTUP
10/12 01:38:33 (1.3) (12894): Getting monitoring info for pid 12894
10/12 01:38:33 (1.2) (12893): Inside RemoteResource::updateFromStarter()
10/12 01:38:33 (1.2) (12893): Inside RemoteResource::resourceExit()
10/12 01:38:33 (1.2) (12893): setting exit reason on vm1@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:33 (1.2) (12893): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:33 (1.2) (12893): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:33 (1.2) (12893): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:33 (1.2) (12893): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:33 (1.2) (12893): Killed starter (fast) at <192.168.1.102:34956>
10/12 01:38:33 (1.2) (12893): Job 1.2 terminated: exited with status 0
10/12 01:38:33 (1.2) (12893): Forking Mailer process...
10/12 01:38:33 (1.2) (12893): Forking Mailer process...
10/12 01:38:33 (1.2) (12893): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:33 (1.2) (12893): AUTHENTICATE_FS: used file /tmp/qmgr_op2VbE,
status: 1
10/12 01:38:33 (1.2) (12893): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077513)
10/12 01:38:33 (1.2) (12893): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:33 (1.2) (12893): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:33 (1.2) (12893): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:33 (1.2) (12893): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:33 (1.2) (12893): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100
10/12 01:38:33 (1.3) (12894): entering FileTransfer::Init
10/12 01:38:33 (1.3) (12894): entering FileTransfer::SimpleInit
10/12 01:38:33 (1.3) (12894): entering FileTransfer::HandleCommands
10/12 01:38:33 (1.3) (12894): FileTransfer::HandleCommands read
transkey=1#434c5b09d00df4b7666f892
10/12 01:38:33 (1.3) (12894): entering FileTransfer::Upload
10/12 01:38:33 (1.3) (12894): entering FileTransfer::DoUpload
10/12 01:38:33 (1.3) (12894): DoUpload: send file
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:33 (1.3) (12894): ReliSock::put_file_with_permissions(): going
to send permissions 100755
10/12 01:38:33 (1.3) (12894): put_file: going to send from filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:33 (1.3) (12894): put_file: Found file size 10457
10/12 01:38:33 (1.3) (12894): put_file: senting 10457 bytes
10/12 01:38:33 (1.3) (12894): ReliSock: put_file: sent 10457 bytes
10/12 01:38:33 (1.3) (12894): DoUpload: exiting at 1605
10/12 01:38:34 (1.3) (12894): DaemonCore: in SendAliveToParent()
10/12 01:38:34 (1.3) (12894): DaemonCore: attempting to connect to
'<192.168.1.1:45580>'
10/12 01:38:34 (1.3) (12894): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:34 (1.3) (12894): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:34 (1.3) (12894): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:34 (1.3) (12894): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:34 (1.3) (12894): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:34 (1.3) (12894): Inside RemoteResource::updateFromStarter()
10/12 01:38:34 (1.3) (12894): Inside RemoteResource::resourceExit()
10/12 01:38:34 (1.3) (12894): setting exit reason on vm2@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:34 (1.3) (12894): Resource vm2@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:34 (1.3) (12894): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:34 (1.3) (12894): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:34 (1.3) (12894): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:34 (1.3) (12894): Killed starter (fast) at <192.168.1.102:34956>
10/12 01:38:34 (1.3) (12894): Job 1.3 terminated: exited with status 0
10/12 01:38:34 (1.3) (12894): Forking Mailer process...
10/12 01:38:34 (1.3) (12894): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:34 (1.3) (12894): AUTHENTICATE_FS: used file /tmp/qmgr_Dtf0Ij,
status: 1
10/12 01:38:34 (1.3) (12894): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077514)
10/12 01:38:34 (1.3) (12894): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:34 (1.3) (12894): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:34 (1.3) (12894): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:34 (1.3) (12894): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:34 (1.3) (12894): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100
10/12 01:38:35 PASSWD_CACHE_REFRESH is undefined, using default value of 300

10/12 01:38:35 ******************************************************
10/12 01:38:35 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/12 01:38:35 ** /home/condor/release/sbin/condor_shadow
10/12 01:38:35 ** $CondorVersion: 6.7.10 Aug  3 2005 $
10/12 01:38:35 ** $CondorPlatform: I386-LINUX_RH9 $
10/12 01:38:35 ** PID = 12902
10/12 01:38:35 ******************************************************
10/12 01:38:35 Using config file: /home/condor/etc/condor_config
10/12 01:38:35 Using local config files:
/home/condor/hosts/thebeast/condor_config.local
10/12 01:38:35 DaemonCore: Command Socket at <192.168.1.1:45666>
10/12 01:38:35 SEC_DEFAULT_SESSION_DURATION is undefined, using default
value of 3600
10/12 01:38:35 Reading job ClassAd from STDIN
10/12 01:38:35 Initializing a VANILLA shadow for job 1.4
10/12 01:38:35 (1.4) (12902): ENABLE_USERLOG_LOCKING is undefined, using
default value of True
10/12 01:38:35 (1.4) (12902): UserLog = /home/condor/jobs/helloworld/log.out
10/12 01:38:35 (1.4) (12902): *** Reserved Swap = 0
10/12 01:38:35 (1.4) (12902): *** Free Swap = 787168
10/12 01:38:35 (1.4) (12902): in RemoteResource::initStartdInfo()
10/12 01:38:35 (1.4) (12902): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:35 (1.4) (12902): Entering DCStartd::activateClaim()
10/12 01:38:36 (1.4) (12902): DCStartd::activateClaim: successfully sent
command, reply is: 1
10/12 01:38:36 (1.4) (12902): Request to run on <192.168.1.103:34098> was
ACCEPTED
10/12 01:38:36 (1.4) (12902): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from PRE to STARTUP
10/12 01:38:36 (1.4) (12902): Getting monitoring info for pid 12902
10/12 01:38:36 (1.4) (12902): DaemonCore: in SendAliveToParent()
10/12 01:38:36 (1.4) (12902): DaemonCore: attempting to connect to
'<192.168.1.1:45580>'
10/12 01:38:36 (1.4) (12902): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:36 (1.4) (12902): SEC_TCP_SESSION_TIMEOUT is undefined, using
default value of 20
10/12 01:38:36 (1.4) (12902): entering FileTransfer::Init
10/12 01:38:36 (1.4) (12902): entering FileTransfer::SimpleInit
10/12 01:38:36 (1.4) (12902): entering FileTransfer::HandleCommands
10/12 01:38:36 (1.4) (12902): FileTransfer::HandleCommands read
transkey=1#434c5b0c1b90eee844ca5236
10/12 01:38:36 (1.4) (12902): entering FileTransfer::Upload
10/12 01:38:36 (1.4) (12902): entering FileTransfer::DoUpload
10/12 01:38:36 (1.4) (12902): DoUpload: send file
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:36 (1.4) (12902): ReliSock::put_file_with_permissions(): going
to send permissions 100755
10/12 01:38:36 (1.4) (12902): put_file: going to send from filename
/home/condor/hosts/thebeast/spool/cluster1.ickpt.subproc0
10/12 01:38:36 (1.4) (12902): put_file: Found file size 10457
10/12 01:38:36 (1.4) (12902): put_file: senting 10457 bytes
10/12 01:38:36 (1.4) (12902): ReliSock: put_file: sent 10457 bytes
10/12 01:38:36 (1.4) (12902): DoUpload: exiting at 1605
10/12 01:38:37 (1.4) (12902): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from STARTUP to EXECUTING
10/12 01:38:37 (1.4) (12902): SHADOW_QUEUE_UPDATE_INTERVAL is undefined,
using default value of 900
10/12 01:38:37 (1.4) (12902): QmgrJobUpdater: started timer to update queue
(tid=7)
10/12 01:38:37 (1.4) (12902): Inside RemoteResource::updateFromStarter()
10/12 01:38:37 (1.4) (12902): Inside RemoteResource::resourceExit()
10/12 01:38:37 (1.4) (12902): setting exit reason on vm1@xxxxxxxxxxxxxxxxx
to 100
10/12 01:38:37 (1.4) (12902): Resource vm1@xxxxxxxxxxxxxxxxx changing state
from EXECUTING to FINISHED
10/12 01:38:37 (1.4) (12902): Entering DCStartd::deactivateClaim(forceful)
10/12 01:38:37 (1.4) (12902): SEC_DEBUG_PRINT_KEYS is undefined, using
default value of False
10/12 01:38:37 (1.4) (12902): DCStartd::deactivateClaim: successfully sent
command
10/12 01:38:37 (1.4) (12902): Killed starter (fast) at <192.168.1.103:34098>
10/12 01:38:37 (1.4) (12902): Job 1.4 terminated: exited with status 0
10/12 01:38:37 (1.4) (12902): Forking Mailer process...
10/12 01:38:37 (1.4) (12902): SHADOW_TIMEOUT_MULTIPLIER is undefined, using
default value of 0
10/12 01:38:37 (1.4) (12902): AUTHENTICATE_FS: used file /tmp/qmgr_0JBVD7,
status: 1
10/12 01:38:37 (1.4) (12902): Updating Job Queue:
SetAttribute(LastJobLeaseRenewal = 1129077517)
10/12 01:38:37 (1.4) (12902): Updating Job Queue: SetAttribute(ExitBySignal
= FALSE)
10/12 01:38:37 (1.4) (12902): Updating Job Queue: SetAttribute(ExitCode = 0)
10/12 01:38:37 (1.4) (12902): Updating Job Queue: SetAttribute(BytesSent =
0.000000)
10/12 01:38:37 (1.4) (12902): Updating Job Queue: SetAttribute(BytesRecvd =
10457.000000)
10/12 01:38:37 (1.4) (12902): **** condor_shadow (condor_SHADOW) EXITING
WITH STATUS 100



Thanks again

Chris