[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] problem regarding condor_status



thanks...now i am able to see the machines on which my jobs are executed...but the problem now is that...i thnk they are getting executed on the same machine...i had configured while insatlleing four machines in the pool....how i can i make the jobs to get processed on diff machines....I am not sure if i had done a mistake in configuring the pool....can any body help me on how can i make the entire pool work........
 
the below stats shows that my jobs are processed on the same machine......also in the Schedd log i see the below warning frequently
 
59 (pid:12890) DaemonCore: Command Socket at <127.0.0.1:11929>
5/23 09:55:59 (pid:12890) WARNING: Condor is running on the loopback address (127.0.0.1 )
5/23 09:55:59 (pid:12890)          of this machine, and is not visible to other hosts!
5/23 09:55:59 (pid:12890)          This may be due to a misconfigured /etc/hosts file.
5/23 09:55:59 (pid:12890)          Please make sure your hostname is not listed on the
5/23 09:55:59 (pid:12890)          same line as localhost in /etc/hosts.
5/23 09:55:59 (pid:12890) History file rotation is enabled.
 
 
 
 
Job 3 is processed[condor@Perfcoelnx3 bin]$ cat processedJob3.log
000 (026.000.000) 05/24 12:32:25 Job submitted from host: <127.0.0.1:11978>
...
001 (026.000.000) 05/24 12:59:15 Job executing on host: <127.0.0.1:11977>
...
005 (026.000.000) 05/24 12:59:27 Job terminated.
        (1) Normal termination (return value 18)
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        0  -  Run Bytes Sent By Job
        0  -  Run Bytes Received By Job
        0  -  Total Bytes Sent By Job
        0  -  Total Bytes Received By Job
...
[condor@Perfcoelnx3 bin]$ cat processedJob1.log
000 (025.000.000) 05/24 12:32:08 Job submitted from host: <127.0.0.1:11978>
...
001 (025.000.000) 05/24 12:59:13 Job executing on host: < 127.0.0.1:11977>
...
005 (025.000.000) 05/24 12:59:13 Job terminated.
        (1) Normal termination (return value 17)
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        0  -  Run Bytes Sent By Job
        0  -  Run Bytes Received By Job
        0  -  Total Bytes Sent By Job
        0  -  Total Bytes Received By Job


 
On 5/25/07, Steffen Grunewald <steffen.grunewald@xxxxxxxxxx> wrote:
On Fri, May 25, 2007 at 10:35:18AM +0530, Partha sarathi wrote:
> I see this on log dir........i don't see anything getting into MatchLog
>
> -rw-------  1 condor condor     0 May 18 13:08 InstanceLock
> -rw-r-----  1 condor condor     0 May 21 13:59 ShadowLock
> -rw-r--r--  1 condor condor     0 May 23 09:49 MatchLog
> -rw-r--r--  1 condor condor 33750 May 23 15:42 StarterLog
> -rw-r--r--  1 condor condor 22427 May 23 15:43 ShadowLog
> -rw-r--r--  1 condor condor 55825 May 24 07:08 CollectorLog
> -rw-r--r--  1 condor condor 85281 May 24 07:08 NegotiatorLog
> -rw-r--r--  1 condor condor 57658 May 24 07:08 StartLog
> -rw-r--r--  1 condor condor  7117 May 24 07:09 MasterLog
> -rw-r--r--  1 condor condor 50156 May 24 07:09 SchedLog
>
> is it possible to get the machines on which the jobs are executed after the
> jobs are processed.........

Have a look at the job cluster's log file (log=... entry in submit file)

Steffen

--
Steffen Grunewald * MPI Grav.Phys.(AEI) * Am Mühlenberg 1, D-14476 Potsdam
Cluster Admin * http://pandora.aei.mpg.de/merlin/ * http://www.aei.mpg.de/
* e-mail: steffen.grunewald(*)aei.mpg.de * +49-331-567-{fon:7233,fax:7298}
No Word/PPT mails - http://www.gnu.org/philosophy/no-word-attachments.html

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/