Re: [Condor-users] Building 2 Machines Condor Pool
- Date: Mon, 14 Aug 2006 13:48:10 +0530
- From: "Pranav Aggarwal" <pranav.aggwl@xxxxxxxxx>
- Subject: Re: [Condor-users] Building 2 Machines Condor Pool
$ ps -ef | egrep condor_
will tell you whether the Condor services are running or not... at least 5 services should be running...
condor 15880 1 0 12:34 ? 00:00:02 /usr/local/condor/sbin/condor_master
condor 15881 15880 0 12:34 ? 00:00:00 condor_collector -f
condor 15882 15880 0 12:34 ? 00:00:05 condor_startd -f
condor 15883 15880 0 12:34 ? 00:00:01 condor_schedd -f
condor 15884 15880 0 12:34 ? 00:00:00 condor_negotiator -f
if not.. then try
this will probably bring the services up...
I guess it doesn't matter whether you have created a condor user or not... but I have done that and have not tried the other way. I strictly followed the instruction manual and did it that way... I guess that is the better approach.
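The check above can be scripted. This is only a minimal sketch, assuming the five daemon names shown in the listing; the function name `missing_condor_daemons` is made up for illustration and is not part of Condor. It reads a process listing on standard input and prints each expected daemon that does not appear in it.

```shell
#!/bin/sh
# Hypothetical helper (not part of Condor): read a process listing on stdin
# and print each expected daemon that is missing from it.
missing_condor_daemons() {
    listing=$(cat)
    for daemon in condor_master condor_collector condor_startd \
                  condor_schedd condor_negotiator; do
        # -w matches whole words only, so a longer name containing the
        # daemon name does not count as a match.
        if ! printf '%s\n' "$listing" | grep -qw "$daemon"; then
            printf '%s\n' "$daemon"
        fi
    done
}
```

It could be used as `ps -ef | missing_condor_daemons`. On a machine that is not the central manager, condor_collector and condor_negotiator would normally show up as missing, since those two run only on the central manager.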
On 8/14/06, leo@xxxxxxxxxxxxxxxxxxxxx <leo@xxxxxxxxxxxxxxxxxxxxx> wrote:
Thanks for the help, but still no luck. condor_status still gave me an
error which says
Cannot fetch ads from: <x.x.x.x:y> ...
CEDAR:6001: Failed to connect to <x.x.x.x:y>...
x.x.x.x is the IP and y is the port
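Since the error names an IP and a port, one way to narrow it down is to test that address directly from the other machine. The following is a rough sketch, not something from the Condor manual: both function names are hypothetical, and `can_connect` assumes that `nc` (netcat) is installed and accepts the `-z` and `-w` options.

```shell
#!/bin/sh
# Hypothetical helper: split an address of the form <ip:port>, as printed in
# the error message, into "ip port".
split_address() {
    addr=${1#"<"}      # drop the leading "<"
    addr=${addr%">"}   # drop the trailing ">"
    printf '%s %s\n' "${addr%:*}" "${addr##*:}"
}

# Hypothetical helper: succeed if a TCP connection to <ip:port> can be opened
# within 5 seconds. Assumes nc (netcat) with -z and -w support is available.
can_connect() {
    set -- $(split_address "$1")
    nc -z -w 5 "$1" "$2"
}
```

For example, `can_connect '<192.0.2.10:9618>' && echo reachable` (the address here is a placeholder). If the connection fails, a firewall or a collector that is not running on the central manager would be the first things to look at.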
> Hi leo,
> one probable cause might be... you have not configured condor_config in the
> home directory of the 'condor' user...
I am not using "condor" user. is this necessary to create condor user? I
am just using my account and changed manually the CONDOR_IDS to my
> there is a parameter 'HOSTALLOW_WRITE' or something similar... do mention the
> machines' hostnames there.... like 'kim-w2x.corp.comp.in' may be similar for
I already included the two machines' IPs in HOSTALLOW_WRITE and
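To compare what the two local condor_config files actually say, a small lookup function may help. This is a sketch under assumptions: the name `config_value` is hypothetical, and it only handles plain `NAME = value` lines, without macro expansion or local config files. Condor's own `condor_config_val` tool reports the value a machine is really using and is the better check where it works.

```shell
#!/bin/sh
# Hypothetical helper: print the value of the last "NAME = value" line for
# the given setting in a condor_config-style file. It does not expand macros
# or follow local config files.
config_value() {
    awk -v name="$2" '
        {
            line = $0
            sub(/^[ \t]+/, "", line)            # trim leading whitespace
            eq = index(line, "=")
            if (eq == 0) next                    # not an assignment
            key = substr(line, 1, eq - 1)
            gsub(/[ \t]+$/, "", key)             # trim whitespace before "="
            if (key != name) next                # also skips "# NAME = ..." comments
            val = substr(line, eq + 1)
            gsub(/^[ \t]+|[ \t]+$/, "", val)     # trim whitespace around value
            found = val
        }
        END { if (found != "") print found }
    ' "$1"
}
```

Running `config_value /path/to/condor_config HOSTALLOW_WRITE` and `config_value /path/to/condor_config CONDOR_HOST` on both machines (the path is a placeholder) should show whether both list each other and both name the same central manager.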
> hey, you can look up that name using nslookup...
> further, try to send a file from one machine to another, using something like
> $ scp <filename> root@<host-to-which-u-sending>:/<location>
> it will ask for the root password... and the file will be transferred...
I can do scp and ssh (secure shell) on my machines. My machines were
configured by our administrator so that I can use ssh and scp.
> before that... update the /etc/hosts file with the respective hostnames of the
> the file will be in the format:
> <ip> <host-address> <hostname>
> the format might be shown when you open the file... but edit it as root... ok...
the hostnames were already included in both machines' /etc/hosts.
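To double-check those entries on each machine, a lookup against the hosts file can be scripted. A minimal sketch, assuming the usual format of an IP followed by one or more names; the function name `hosts_lookup` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print the IP address that a hosts-format file maps a
# name to, or nothing if the name is not listed.
hosts_lookup() {
    awk -v name="$2" '
        { sub(/#.*/, "") }                  # strip comments
        {
            # field 1 is the IP, every later field is a name or alias
            for (i = 2; i <= NF; i++)
                if ($i == name) { print $1; exit }
        }
    ' "$1"
}
```

For example, `hosts_lookup /etc/hosts "$(hostname)"` prints the address the machine's own name maps to. It may be worth confirming that this is the machine's real IP and not a loopback address such as 127.0.0.1.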
> hey buddy... do it... and tell me whether it worked...
> Keep trying... until you succeed...
What steps am I missing? :(
Or should I install NFS (Network File System) and NIS (Network Information
Service) for this to work properly?
> On 8/13/06, leo@xxxxxxxxxxxxxxxxxxxxx <leo@xxxxxxxxxxxxxxxxxxxxx> wrote:
>> I am using (OS) SuSE 9.3 and installed VDT-1.3.11 for both machines.
>> VDT-1.3.11 has Condor version 6.7.20. Condor-6.8.0 won't work for SuSE
>> :( . I configured condor for each machine using the condor_install
>> and ANSWERED that I am installing 2 machines for my condor pool with one
>> as the central manager, it is said in the installation procedure that I
>> have to do condor_install for each machine since i DON'T HAVE A SHARED
>> FILE SERVER. And since I have configured condor for EACH machine without
>> shared file system, I used DIFFERENT CONDOR RELEASE DIRECTORIES for each
>> machine, I used their respective condor release directories and
>> condor_config's as well. I don't know how to use the release directory of
>> the central manager for the other machine. After that, they still don't
>> talk to each other. When I do condor_status on the central manager, only
>> one machine appears, the central manager. condor_status and condor_q seem
>> to not be working on the other machine.
>> Then, I can not see any further steps anymore in the installation
>> procedure on how to make the two machines communicate so that I can have
>> 2-machine pool.
>> Will the file transfer mechanism help me with this?
>> Thanks in advance.
>> Leo Cristobal C. Ambolode II
>> Physics Department
>> Mindanao State Univ.-Iligan Inst. of Tech.
>> Iligan City, Philippines
>> > Hi Leo,
>> > Perhaps you could give some further explanation, such as what you have
>> > done to test the communication, what kind of OS, and condor version,
>> > etc; so it'll be easier for someone out there to imagine the condition
>> > and suggest a solution.
>> > Cheers,
>> > --
>> > Nano Surbakti
>> > On 8/12/06, leo@xxxxxxxxxxxxxxxxxxxxx <leo@xxxxxxxxxxxxxxxxxxxxx> wrote:
>> >> Hi all,
>> >> A newbie problem I have. It's been a week since I tried to connect my
>> >> two machines but the two don't communicate yet. They became two
>> >> personal condors. How did you make your machines communicate? My
>> >> condor_config files are stored locally on each machine. I've searched
>> >> the internet but they all built their condor pool with a shared
>> >> condor_config file and with a shared file system. Do I need some shared
>> >> file system first? How can I have one condor_config file used by the two
>> >> machines? I have two machines both with global IPs.
>> >> Hope someone can shed light on my troubled mind. :)
>> >> thanks in advance.
>> >> Leo Cristobal C. Ambolode II
>> >> Physics Department
>> >> Mindanao State Univ.-Iligan Inst. of Tech.
>> >> Iligan City, Philippines
>> >> _______________________________________________
>> >> Condor-users mailing list
>> >> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
>> >> subject: Unsubscribe
>> >> You can also unsubscribe by visiting
>> >> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>> >> The archives can be found at either
>> >> https://lists.cs.wisc.edu/archive/condor-users/