
Re: [Condor-users] Installing Condor



The original instructions had iptables turned off. If you want it on, see:

http://spinningmatt.wordpress.com/2011/06/21/getting-started-multiple-node-condor-pool-with-firewalls/

There are example jobs in the manual.

$ printf 'cmd=/bin/hostname\nshould_transfer_files=if_needed\nwhen_to_transfer_output=on_exit\noutput=hostnames.$(cluster).$(process).out\nlog=hostnames.$(cluster)\nqueue 100\n' | condor_submit
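
For anything longer than a quick test, the same job can live in a submit description file. The sketch below is only an illustration of the one-liner above: the file name hostnames.sub is made up, and it uses the documented `executable` keyword where the one-liner uses `cmd`.

```shell
#!/bin/sh
# Write a submit description file equivalent to the one-liner above.
# "hostnames.sub" is a hypothetical file name chosen for this sketch.
# The quoted 'EOF' keeps the shell from expanding $(cluster) and $(process);
# those macros are meant for condor_submit, not for the shell.
cat > hostnames.sub <<'EOF'
executable              = /bin/hostname
should_transfer_files   = if_needed
when_to_transfer_output = on_exit
output                  = hostnames.$(cluster).$(process).out
log                     = hostnames.$(cluster)
queue 100
EOF

# Submit only if the Condor tools are on the PATH.
if command -v condor_submit >/dev/null 2>&1; then
    condor_submit hostnames.sub
else
    echo "condor_submit not found; wrote hostnames.sub only"
fi
```

Once the jobs finish, each hostnames.<cluster>.<process>.out file should contain the name of the machine that ran that job, which gives a quick picture of how work was spread across the pool.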

Best,


matt

On 06/30/2011 10:34 AM, shailesh sharma wrote:
Hi Lucky,
I am able to see the output of condor_status now after disabling the firewall.
Maybe the firewall was blocking port 9618, which needs to be open for
communication.
Can you provide some test jobs that we can run to demonstrate the pool's
capability?
Also, can anyone help identify what kinds of jobs or programs can be
run on the pool and give a couple of test cases or examples?
Thanks
Shailesh

On Thu, Jun 30, 2011 at 5:13 PM, Lucky <luckyborneo@xxxxxxxxx> wrote:

    First step,

    Is Condor already running on fedora71?
    $ ps aux | grep condor

    Then try to ping your condor central manager from your node.

    --Luky
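
A ping only shows that the host is up, not that the collector's port is reachable. The following is a rough sketch of a TCP check, assuming bash and the coreutils `timeout` command are available; `check_port` is a hypothetical helper name, and the host and port are the ones mentioned in this thread.

```shell
#!/bin/bash
# check_port HOST PORT: succeed if a TCP connection to HOST:PORT can be
# opened within 5 seconds. Relies on bash's /dev/tcp redirection, so the
# connection attempt is run through "bash -c" explicitly.
check_port() {
    timeout 5 bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$1" "$2" 2>/dev/null
}

# Host name and port are taken from the thread; adjust for your own pool.
if check_port fedora71.tcssabic.com 9618; then
    echo "collector port reachable"
else
    echo "collector port NOT reachable (firewall, or collector not running?)"
fi
```

If the ping succeeds but this check fails, a firewall on the central manager is a likely suspect, though a collector that is not running would look the same.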


    On 06/30/2011 03:59 PM, shailesh sharma wrote:


        Thanks Matt and Lucky for guiding me to install Condor.
        I installed Condor on two of our Fedora machines.
        Hostname of central manager: fedora71, IP 3.77.244.71
        Hostname of node: fedora66, IP 3.77.244.66

        Here are the changes I made to the configuration
        on 3.77.244.66 and 71 (same settings)
        -----------------------------------------
        in the global configuration file
        LOCAL_DIR               = /var/lib/condor
        COLLECTOR_NAME          = My-Fedora-Pool

        on 3.77.244.66 (local config file)
        ----------------------------------
        CONDOR_DEVELOPERS = NONE
        CONDOR_HOST = fedora71.tcssabic.com

        COLLECTOR_NAME = My-Fedora-Pool
        ALLOW_WRITE = $(ALLOW_WRITE), $(CONDOR_HOST)
        START = TRUE
        SUSPEND = FALSE
        PREEMPT = FALSE
        KILL = FALSE
        DAEMON_LIST =  MASTER, STARTD
        NEGOTIATOR_INTERVAL = 20
        TRUST_UID_DOMAIN = TRUE

        on 3.77.244.71 (local config file)
        ----------------------------------
        CONDOR_DEVELOPERS = NONE
        CONDOR_HOST = fedora71.tcssabic.com

        COLLECTOR_NAME = My-Fedora-Pool
        START = TRUE
        SUSPEND = FALSE
        PREEMPT = FALSE
        KILL = FALSE
        DAEMON_LIST = COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD
        NEGOTIATOR_INTERVAL = 20
        TRUST_UID_DOMAIN = TRUE
        ALLOW_WRITE = $(ALLOW_WRITE), fedora66.tcssabic.com

        But when I run condor_status on the node machine I get this error:
        CEDAR:6001:Failed to connect to <3.77.244.71:9618>

        Error: Couldn't contact the condor_collector on
        fedora71.tcssabic.com.

        I checked the collector log file as well but couldn't work out the
        root cause.
        Also, one interesting thing is that when I restart the condor service
        I get an error:
        [root@fedora66 ~]# service condor status
        condor_master (pid 1887) is running...
        [root@fedora66 ~]# service condor restart
        Stopping Condor daemons: [ OK ]
        Warning: condor_master may not have exited, start/restart may fail
        Starting Condor daemons:
        [root@fedora66 ~]# service condor status
        condor_master is stopped
        [root@fedora66 ~]#
        [root@fedora66 ~]#
        [root@fedora66 ~]# service condor start
        Starting Condor daemons: [ OK ]
        [root@fedora66 ~]# service condor status
        condor_master (pid 2506) is running...
        So, condor_master is not properly starting when restarted.
        Please help.
        Thanks
        Shailesh
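
The warning in the transcript above ("condor_master may not have exited") suggests the start step ran before the old master had finished shutting down. One possible workaround, sketched here under that assumption, is to stop, wait for the process to disappear, and only then start. `wait_for_exit` is a hypothetical helper, not part of Condor or of the init script.

```shell
#!/bin/sh
# wait_for_exit NAME [SECONDS]: poll until no process named NAME remains.
# Returns 0 once it is gone, 1 if it is still present after SECONDS
# (default 30).
wait_for_exit() {
    name=$1
    remaining=${2:-30}
    while pgrep -x "$name" >/dev/null 2>&1; do
        [ "$remaining" -le 0 ] && return 1
        sleep 1
        remaining=$((remaining - 1))
    done
    return 0
}

# Hypothetical replacement for "service condor restart" (run as root):
#   service condor stop && wait_for_exit condor_master 60 && service condor start
```

This does not explain why the master is slow to exit; it only avoids racing against it.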


        On Fri, Jun 17, 2011 at 5:55 PM, Lucky <luckyborneo@xxxxxxxxx> wrote:

            On 06/17/2011 05:47 PM, shailesh sharma wrote:

                Hi Friends,

                Can anyone please guide me on how to install Condor, and
                where I can get a complete guide to installing it and
                configuring jobs?
                We are going to implement Condor in a Linux environment
                with Fedora 10 installed.


            First step, you can read this thread:
            https://lists.cs.wisc.edu/archive/condor-users/2010-September/msg00100.shtml

            It will tell you how to install condor on Fedora Linux.


                Also, can you please advise whether we can add Windows
                systems to the environment as well?


            If you want to add Windows machines to your environment, read
            this link:
            http://www.cs.wisc.edu/condor/manual/v7.2/3_2Installation.html
            And read the FAQ here:
            http://www.cs.wisc.edu/condor/manual/v7.4/7_4Condor_on.html

            Have a nice try...

            --Luky





_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/