[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] job rejected.rank condition MY.Rank>MY.CurrentRank



Hi, all condor experts, I encountered same problem.

$ condor_q -l -ana 22128.0

- Submitter: emcluster.****.cn : <192.168.2.71:51932> : emcluster.****.cn
slot8@yellow.*** Failed rank condition: MY.Rank > MY.CurrentRank
---
22128.000:  Run analysis summary.  Of 128 machines,
      0 are rejected by your job's requirements
     94 reject your job because of their own requirements
     12 match but are serving users with a better priority in the pool
     22 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 match but are currently offline
      0 are available to run your job



I am using Condor-7.5.3 and have NETWORK_INTERFACE = 192.168.2.71 defined in each local config file. After I checked NegotiatorLog then found:



10/19/10 16:56:45     Request 21760.00000:
10/19/10 16:56:45       Matched 21760.0 dawnsong@*****.ac.cn <192.168.2.71:51932> preempting none <192.168.2.79:57485> slot2@yellow.*****.cn
10/19/10 16:56:45       Successfully matched with slot2@yellow.*****.cn
...

10/19/10 16:56:45 attempt to connect to <192.168.2.79:57485> failed: No route to host (connect errno = 113).  Will keep trying for 20 total seconds (20 to go).

10/19/10 16:56:45 attempt to connect to <192.168.2.78:53216> failed: No route to host (connect errno = 113).  Will keep trying for 20 total seconds (20 to go).

10/19/10 16:56:45 attempt to connect to <192.168.2.80:41823> failed: No route to host (connect errno = 113).  Will keep trying for 20 total seconds (20 to go).
...
10/19/10 17:04:08 ERROR: SECMAN:2004:Was waiting for TCP auth session to <192.168.2.79:57485>, but it failed.
10/19/10 17:04:08       Failed to initiate socket to send MATCH_INFO to slot2@yellow.****.cn
10/19/10 17:04:08 ERROR: SECMAN:2004:Was waiting for TCP auth session to <192.168.2.79:57485>, but it failed.
10/19/10 17:04:08       Failed to initiate socket to send MATCH_INFO to slot3@yellow.****.cn



Any help or advice would be appreciated!

Thanks.

Xiaowei


On Mon, Oct 11, 2010 at 10:58 PM, michele pierri <pierm4ci@xxxxxxxx> wrote:
Hi,
I have this problem...all the job that I submit are rejected for unknow reasons.
If I type condor_q -ana -l job_id I have returned:

-- Submitter: submitter_machine : <xxx.xxx.xxx.xxx:xxxx> : 
machine_name Failed rank condition: MY.Rank > MY.CurrentRank
---
2795.000:  Run analysis summary.  Of 1 machines,
      0 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      1 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 match but are currently offline
      0 are available to run your job


What is the problem? What I have to do?

Thank you so much.


_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/




--
Xiao-Wei Song
Ping Zhu's Lab, Center for Structural and Molecular Biology
Institute of Biophysics, Chinese Academy of Sciences
15 Datun Road, Chaoyang District, Beijing, China 100101
Tel:  +86-10-64888353, E-mail: dawnsong@xxxxxxxxxxxxxx