
Re: [Condor-users] Problem with match: sending small msg failed



It seems to be within the range:
$ condor_config_val LOWPORT
20000
$ condor_config_val HIGHPORT
25000
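
As a rough sketch of the check being asked for, here is the comparison written out. It assumes the range is inclusive at both ends; the function name port_in_range is made up for illustration and is not part of Condor.

```python
def port_in_range(port: int, low: int, high: int) -> bool:
    """Return True if port lies within [low, high], both ends included."""
    return low <= port <= high


# Values reported by condor_config_val above.
LOWPORT = 20000
HIGHPORT = 25000

# 24427 is the port that appears in the negotiator log line for job 211.0.
print(port_in_range(24427, LOWPORT, HIGHPORT))
```

This only confirms that the port falls inside the configured range; it says nothing about whether the firewall actually lets traffic through on that port.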

I have just one startd with two virtual machines, but I have a startd expression that makes my job run on only one of them; that's why it matches vm2 but not vm1.

I also have a firewall on the schedd machine, but it is open for the 20000:25000 range as well.

Regards,
Enol

2007/11/22, Kewley, J (John) < j.kewley@xxxxxxxx>:
Is 24427 within the LOWPORT / HIGHPORT range on your startd machine?

Can you check using condor_config_val?

Is there also a firewall on the schedd machine? What are its settings?

I am a little confused at the following:
* You have 2 machines in pool
* 1 matches job, 1 doesn't
* The error mentions vm2@24043@mydomain
* vm2 implies a vm1 on the same machine and you would expect them to share
  configurations.
* Maybe this is a different use of vm2 (I haven't studied log files more than I have to)

cheers

JK




________________________________

        From: condor-users-bounces@xxxxxxxxxxx [mailto: condor-users-bounces@xxxxxxxxxxx] On Behalf Of Enol Fernández
        Sent: Thursday, November 22, 2007 5:03 PM
        To: Condor-Users Mail List
        Subject: [Condor-users] Problem with match: sending small msg failed


        Hello,

        I have a Condor pool set up and I'm trying to get my job executed on a
        specific machine, but it never gets to run.

        If I run condor_q -ana, I get the following output:

        211.000:  Run analysis summary.  Of 2 machines,
              0 are rejected by your job's requirements
              1 reject your job because of their own requirements
              0 match but are serving users with a better priority in the pool
              1 match but reject the job for unknown reasons
              0 match but will not currently preempt their existing job
              0 are available to run your job
                No successful match recorded.
                Last failed match: Thu Nov 22 16:38:45 2007
                Reason for last match failure: no match found

        And then in the Negotiator log I can see the following message:

        11/22 16:19:50 SafeMsg: sending small msg failed. errno: 1
        11/22 16:19:50       Could not send MATCH_INFO/capability to vm2@24043@mydomain
        11/22 16:19:50       Rejected 211.0 myuser@xxxxxxx <xxx.xxx.x.xx:24427>: no match found
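
One way to read the "errno: 1" in the first log line, assuming it is the operating system's errno from the failed send (the log does not say so explicitly), is to look the number up. The sketch below uses only Python's standard library; the helper name describe_errno is hypothetical.

```python
import errno
import os


def describe_errno(code: int) -> tuple[str, str]:
    """Return (symbolic name, system message) for an OS error number."""
    name = errno.errorcode.get(code, "UNKNOWN")
    message = os.strerror(code)
    return name, message


# errno 1 is the value shown in the negotiator log above.
name, message = describe_errno(1)
print(name, "-", message)
```

On Linux, errno 1 is EPERM. When a send fails with EPERM, a local packet filter rejecting the outgoing packet is one possible cause, so the firewall rules on the machine running the negotiator may be worth checking too. That is a guess from the error number alone, not something the log confirms.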

        I have a firewall on the startd machine, but the ports between LOWPORT
        and HIGHPORT are open.

        What kind of message is this "small msg"?

        Thanks,
        Enol.

        --
        Enol Fernández
        Computer Architecture & Operating Systems Department
        Universitat Autònoma de Barcelona

