[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] New to Condor - setting up on Linux



Hi Shouqin,

Thank you for the note. I edited the condor_config as you have mentioned the following are the results.

Case 1:

I want Machine A to be the central manager and submitter
I want Machine B to be the worker (execute the jobs only)

Now my doubt is on the processes that are running, the following are the details,

Machine A

condor   20858  4.0  0.0  31476  2760 ?        Ss   09:35   0:00 condor_master
condor   20859  4.2  0.0  30476  2668 ?        Ss   09:35   0:00 condor_collector -f
condor   20864 18.0  0.0  30380  2668 ?        Ss   09:35   0:00 condor_negotiator -f
condor   20865 16.0  0.0  32140  3248 ?        Ss   09:35   0:00 condor_schedd -f
root     20866  0.0  0.0  21628  1772 ?        S    09:35   0:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -S 60 -C 101

Machine B

condor   20301  3.6  0.0  29388  2532 ?        Ss   09:45   0:00 condor_master
condor   20302  3.3  0.0  29320  2292 ?        Ss   09:45   0:00 condor_startd -f
condor   20309  1.0  0.0  28480  1744 ?        S    09:45   0:00 /usr/sbin/condor_starter -classad
condor   20310  107  0.3 1347708 16280 ?       Sl   09:45   0:02 /usr/bin/java -classpath /usr/lib64/condor:/usr/lib64/condor/scimark2lib.jar:. CondorJavaInfo old 2

still i'm not getting any output from the condor_status command. if i do condor_status -any i get the output as

MyType               TargetType           Name

Scheduler            None                machineA.mydomain.com
DaemonMaster    None                 machineA.mydomain.com
Negotiator           None                 machineA.mydomain.com

Case 2:

If the machine B is configured as submit,execute then the following processes are shown

condor   20363 14.5  0.0  28032  1888 ?        Ss   09:54   0:00 condor_master
condor   20364 36.0  0.0  30068  3192 ?        Ss   09:54   0:00 condor_schedd -f
condor   20365 11.0  0.0  29320  2288 ?        Ss   09:54   0:00 condor_startd -f
root     20366  1.0  0.0  21760  1848 ?        S    09:54   0:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -S 60 -C 101
condor   20373  3.0  0.0  28480  1748 ?        S    09:54   0:00 /usr/sbin/condor_starter -classad
condor   20376  120  0.3 1348336 15948 ?       Sl   09:54   0:01 /usr/bin/java -classpath /usr/lib64/condor:/usr/lib64/condor/scimark2lib.jar:. CondorJavaInfo old 2

machine A is configured as submit,manager the following processes are shown,

condor   20950  4.2  0.0  31480  2760 ?        Ss   09:44   0:00 condor_master
condor   20951  4.2  0.0  30476  2668 ?        Ss   09:44   0:00 condor_collector -f
condor   20952 17.0  0.0  30380  2664 ?        Ss   09:45   0:00 condor_negotiator -f
condor   20953 17.0  0.0  32136  3244 ?        Ss   09:45   0:00 condor_schedd -f
root     20954  0.0  0.0  21628  1776 ?        S    09:45   0:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -S 60 -C 101

Now the condor_status result shows on both machines,

slot1@xxxxxxxxxxxxxxxxxxx LINUX      X86_64 Owner     Idle     0.080  1004  0+00:00:08
slot2@xxxxxxxxxxxxxxxxxxx LINUX      X86_64 Owner     Idle     0.000  1004  0+00:00:09
slot3@xxxxxxxxxxxxxxxxxxx LINUX      X86_64 Owner     Idle     0.000  1004  0+00:00:10
slot4@xxxxxxxxxxxxxxxxxxx LINUX      X86_64 Owner     Idle     0.000  1004  0+00:00:11
                     Total Owner Claimed Unclaimed Matched Preempting Backfill

        X86_64/LINUX     4     4       0         0       0          0        0

               Total     4     4       0         0       0          0        0

The result for condor_status -any  shows,

machine A

MyType               TargetType           Name

Machine              Job                  
slot1@
machineB.domain.
com.
Machine              Job                  
slot1@
machineB.domain.
com.
Machine              Job                  
slot1@
machineB.domain.
com.
Machine              Job                  
slot1@
machineB.domain.
com.
Scheduler            None               
machineA.domain.
com.
DaemonMaster         None           
machineA.domain.
com.
Negotiator           None                
machineA.domain.
com.
.

machine B

MyType               TargetType           Name

Scheduler            None               machineB.domain.com
DaemonMaster         None          
machineB.domain.
com
Machine              Job                  slot1@
machineB.domain.
com.
Machine              Job                  slot2@
machineB.domain.com
.
Machine              Job                  slot3@
machineB.domain.com
.
Machine              Job                  slot4@
machineB.domain.com
.
Scheduler            None                
machineA.domain.
com.
DaemonMaster         None           
machineA.domain.
com.
Negotiator           None                 
machineA.domain.
com.

From what i understood from the documentation condor_status will show all the nodes and their slots that is running condor processes. But i think i might have done something wrong.

Thank for your time,

Burny

--- On Fri, 25/6/10, Shouqin Zhu <zhushouqin@xxxxxxxxx> wrote:

From: Shouqin Zhu <zhushouqin@xxxxxxxxx>
Subject: Re: [Condor-users] New to Condor - setting up on Linux
To: "Condor-Users Mail List" <condor-users@xxxxxxxxxxx>
Date: Friday, 25 June, 2010, 8:59 AM

Hi,
You should limit the ALLOW_READ and ALLOW_WRITE.
Don't set the two value = *.
Just fallow as this example:
ALLOW_READ=your IP segment;
ALLOW_WRITE= your IP segment;
 
Hope good luck!

On Fri, Jun 25, 2010 at 1:36 AM, Bharanidharan Narayanaswamy <bharanitn@xxxxxxxxx> wrote:
Hi,

I've worked on SGE before but wanted to try the Condor. I'm trying to set up condor on 2 linux boxes running Centos.

Both the machines are in the same domain.

The following are the steps i've followed,

1. Installed Condor using the rpm (7.4.2-1 release)
2. Machine A - configured as Manager,submit using 
condor_configure --type=manager,submit --central-manager=machineA.domain.com
3. Executed condor_master
4. I get the following processes
condor   14706  2.3  0.0  31128  3148 ?        Ss   22:52   0:00 condor_master
condor   14707  2.4  0.0  30084  3760 ?        Ss   22:52   0:00 condor_collector -f
condor   14708  4.0  0.0  30020  3708 ?        Ss   22:52   0:00 condor_negotiator -f
condor   14709  4.0  0.1  31644  4488 ?        Ss   22:52   0:00 condor_schedd -f
root     14712  0.0  0.0  21604  2088 ?        S    22:52   0:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -S 60 -C 101
root     14724  0.0  0.0  61180   744 pts/1    S+   22:52   0:00 grep condor
5. No results for condor_status
6. Machine B - configured as execute,submit using 
condor_configure --type=manager,submit --central-manager=machineA.domain.com
7. I get the following processes running
condor   18072  3.0  0.0  29532  2852 ?        Ss   22:52   0:00 /usr/sbin/condor_master -pidfile /var/run/condor/master.pid
condor   18073  3.4  0.0  30200  3312 ?        Ss   22:52   0:00 condor_schedd -f
condor   18074 32.0  0.0  29584  3392 ?        Ss   22:52   0:03 condor_startd -f
root     18075  0.1  0.0  21760  1848 ?        S    22:52   0:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -S 60 -C 101
root     18117  0.0  0.0  61152   748 pts/1    S+   22:52   0:00 grep condor
8. No results for condor_status

It would be great if some one can point me where i'm going wrong.

Thanks,
Burny





_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/




--
Shouqin Zhu
Master.student of CSE Dept.
Network Computing Center
Chongqing University of Posts and Telecommunications
Chongqing China 400065
Phone:(+86) 15310640690
Email : zhushouqin@xxxxxxxxx

-----Inline Attachment Follows-----

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/