
Re: [Condor-users] Running Jobs




Do you have a shared file system in place? If not, you will need to make sure file transfers are possible.

Can you also check your SchedLog and NegotiatorLog?



On 25 Jul 2007, at 20:32, Ary Junior wrote:

Ok!

[aryjr@atena ~]$ condor_q -better-analyze 5


-- Submitter: atena.solidos.quimica.ufjf.br : <192.168.1.107:53583> : atena.solidos.quimica.ufjf.br
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
AddConstraint: Condition value not literal
---
005.000:  Run analysis summary.  Of 27 machines,
     16 are rejected by your job's requirements
      3 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      8 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 are available to run your job

The Requirements expression for your job is:

( target.Arch == "INTEL" ) && ( target.OpSys == "LINUX" ) &&
( target.Disk >= DiskUsage ) && ( ( target.Memory * 1024 ) >= ImageSize ) &&
( ( target.HasFileTransfer ) || ( TARGET.FileSystemDomain == MY.FileSystemDomain ) )

    Condition                                   Machines Matched    Suggestion
    ---------                                   ----------------    ----------
1   ( target.Arch == "INTEL" )                  11
2   ( target.OpSys == "LINUX" )                 27
3   ( target.Disk >= 10000 )                    27
4   ( ( 1024 * target.Memory ) >= 10000 )       27
5   ( ( target.HasFileTransfer ) || ( TARGET.FileSystemDomain == "solidos.quimica.ufjf.br" ) )
                                                27

The following attributes are missing from the job ClassAd:

CheckpointPlatform

And this is my condor pool:

[aryjr@atena ~]$ condor_status

Name          OpSys       Arch   State      Activity   LoadAv Mem   ActvtyTime

vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       1.000   754  7+23:47:25
vm2@xxxxxxxxx LINUX       INTEL  Owner      Idle       2.360   754  7+23:47:26
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       1.000  2025  6+05:21:19
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.010  2025  0+00:25:05
frea.solidos. LINUX       INTEL  Unclaimed  Idle       0.000   502  0+02:41:36
vm1@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754  0+02:38:03
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754  0+02:38:04
vm1@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754[?????]
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754[?????]
vm1@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754[?????]
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   754[?????]
vm1@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   491  0+00:06:46
vm2@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   491  0+00:06:46
vm3@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   491  0+00:06:47
vm4@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   491  0+00:06:48
vm5@xxxxxxxxx LINUX       X86_64 Owner      Idle       0.000   491  0+00:06:49
vm6@xxxxxxxxx LINUX       X86_64 Owner      Idle       0.000   491  0+00:06:50
vm7@xxxxxxxxx LINUX       X86_64 Owner      Idle       0.000   491  0+00:06:51
vm8@xxxxxxxxx LINUX       X86_64 Owner      Idle       0.000   491  0+00:06:44
vm1@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   994  6+23:05:48
vm2@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   994  6+23:05:49
vm3@xxxxxxxxx LINUX       X86_64 Owner      Idle       1.000   994  0+05:37:08
vm4@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.010   994  0+03:02:04
vm5@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000   994  3+11:03:44
vm6@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000   994  6+23:05:49
vm7@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000   994  6+23:05:50
vm8@xxxxxxxxx LINUX       X86_64 Unclaimed  Idle       0.000   994  6+23:05:43

                     Total Owner Claimed Unclaimed Matched Preempting Backfill

         INTEL/LINUX    11     3       0         8       0          0        0
        X86_64/LINUX    16    11       0         5       0          0        0

               Total    27    14       0        13       0          0        0
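
The numbers in the analysis summary can be reconciled with these totals. Below is a rough Python sketch of that cross-check. The slot counts are copied from the condor_status totals above. The idea that the 3 machines rejecting the job are the INTEL slots in the Owner state is only an assumption for illustration; Condor does not report that directly, and the function name `reconcile` is made up for this example.

```python
# Slot counts copied from the condor_status totals: (arch, state) -> slots.
pool = {
    ("INTEL", "Owner"): 3,
    ("INTEL", "Unclaimed"): 8,
    ("X86_64", "Owner"): 11,
    ("X86_64", "Unclaimed"): 5,
}

# The job's Requirements expression contains ( target.Arch == "INTEL" ).
JOB_ARCH = "INTEL"


def reconcile(pool, job_arch):
    """Split the pool into buckets like those in the analysis summary."""
    total = sum(pool.values())
    # Slots with a different Arch fail the job's own requirements.
    rejected_by_job = sum(
        n for (arch, _state), n in pool.items() if arch != job_arch
    )
    # Assumption: slots with a matching Arch that are in the Owner state
    # refuse the job because of the machine's own requirements.
    rejected_by_machine = sum(
        n for (arch, state), n in pool.items()
        if arch == job_arch and state == "Owner"
    )
    remaining = total - rejected_by_job - rejected_by_machine
    return {
        "total": total,
        "rejected_by_job": rejected_by_job,
        "rejected_by_machine": rejected_by_machine,
        "remaining": remaining,
    }


summary = reconcile(pool, JOB_ARCH)
print(summary)
```

Under that assumption, the 16 X86_64 slots account for "16 are rejected by your job's requirements", and the 8 remaining slots are the unclaimed INTEL machines, which lines up with "8 match but reject the job for unknown reasons".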

Thanks very much!!!

On 7/25/07, Jones, Torrin A (US SSA) <torrin.jones@xxxxxxxxxxxxxx> wrote:
Hmmm . . . not enough info.
 
What does "condor_q -analyze" say?  What does "condor_q -better-analyze" say?
 
-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto: condor-users-bounces@xxxxxxxxxxx] On Behalf Of Ary Junior
Sent: Wednesday, July 25, 2007 12:07
To: Condor-Users Mail List
Subject: [Condor-users] Running Jobs

Hi, I have a little problem with my Condor pool. When I run condor_submit from any host, the job goes to the pool but never runs. The condor_q output is:

[aryjr@atena 7]$ condor_q


-- Submitter: atena.solidos.quimica.ufjf.br : <192.168.1.107:53583> : atena.solidos.quimica.ufjf.br
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
   5.0   aryjr           7/25 16:02   0+00:00:00 I  0   9.8  zr2o4ti2o4.sh

1 jobs; 1 idle, 0 running, 0 held

My submit file (from a web tutorial) contains:

universe        = vanilla
executable      = zr2o4ti2o4.sh
output          = zr2o4ti2o4.sh.out
error           = zr2o4ti2o4.sh.err
log             = zr2o4ti2o4.sh.log
should_transfer_files = IF_NEEDED
when_to_transfer_output = ON_EXIT
queue

If I run the same job from my master machine, it runs normally. What is wrong with my .submit file?

Thanks very much!!!

Ary Junior

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/

