[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

AW: AW: [Condor-users] Basic Job submission problems



Hi there,

Did you made a test by changing

START                   = $(UWCS_START)
SUSPEND                 = $(UWCS_SUSPEND)
CONTINUE                = $(UWCS_CONTINUE)
PREEMPT                 = $(UWCS_PREEMPT)
KILL                    = $(UWCS_KILL)
PERIODIC_CHECKPOINT     = $(UWCS_PERIODIC_CHECKPOINT)
PREEMPTION_REQUIREMENTS = $(UWCS_PREEMPTION_REQUIREMENTS)
PREEMPTION_RANK         = $(UWCS_PREEMPTION_RANK)

In the condor_config to

START                   = $(TESTINGMODE_START)
SUSPEND                 = $(TESTINGMODE_SUSPEND)
CONTINUE                = $(TESTINGMODE_CONTINUE)
PREEMPT                 = $(TESTINGMODE_PREEMPT)
KILL                    = $(TESTINGMODE_KILL)
PERIODIC_CHECKPOINT     = $(TESTINGMODE_PERIODIC_CHECKPOINT)
PREEMPTION_REQUIREMENTS = $(TESTINGMODE_PREEMPTION_REQUIREMENTS)
PREEMPTION_RANK         = $(TESTINGMODE_PREEMPTION_RANK)

?

As far as I know, these settings let a job start a job whether the state is
unclaimed or not.

> Also, the Job ClassAds are generated by condor_submit, but would it be
possible 
> to edit the machine ClassAds as advertised by the nodes to the central
manager?

Sorry, but I don't understand your intention. Maybe you could explain a
little bit more about what you are planning?

> Danke Schunn, 

You mean Danke Schoen ;-)

Hope this helps,
Thomas Bauer
--
Westfaelische Wilhelms-Universitaet Muenster
Institut fuer Festkoerpertheorie
Wilhelm-Klemm-Str. 10
D 48149 Muenster
++49 (251) 8339040

-----Ursprüngliche Nachricht-----
Von: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] Im Auftrag von ABDUL SUBHAN
Gesendet: Mittwoch, 1. September 2004 18:52
An: Condor-Users Mail List
Betreff: Re: AW: [Condor-users] Basic Job submission problems

Thanks a lot for your response. I checked condor_status and it does show 2 
machines under Owner and none under Unclaimed. Actually, the environment I'm

trying to set up is a dedicated cluster, which would simply queue jobs as 
submitted, instead of trying to use idle computer time [as indicated by the 
Unclaimed flag]. I believe I'm going to have to make changes to the 
configuration files for this, [back to the manual], but any pointers from
your 
end would be really appreciated. 

Also, the Job ClassAds are generated by condor_submit, but would it be
possible 
to edit the machine ClassAds as advertised by the nodes to the central
manager?

Danke Schunn, 

Subhan

Quoting Thomas Bauer <tombauer@xxxxxxxxxxxxxxxxxxx>:

> What does condor_status say? The job will not start before the state is
> unclaimed. Maybe you didn't wait long enough to let the machine switch
> from
> owner to unclaimed?
> 
> 
> Thomas Bauer
> --
> Westfaelische Wilhelms-Universitaet Muenster
> Institut fuer Festkoerpertheorie
> Wilhelm-Klemm-Str. 10
> D 48149 Muenster
> ++49 (251) 8339040
> 
> -----Ursprüngliche Nachricht-----
> Von: condor-users-bounces@xxxxxxxxxxx
> [mailto:condor-users-bounces@xxxxxxxxxxx] Im Auftrag von ABDUL SUBHAN
> Gesendet: Mittwoch, 1. September 2004 17:22
> An: condor-users@xxxxxxxxxxx
> Betreff: [Condor-users] Basic Job submission problems
> 
> I've just installed Condor on 2 machine cluster. Set up the Submitting
> station/ Central Manager as one machine, and another node as the
> executing
> station.
> 
> Although ps -ef | grep 'condor' on the executing station shows the
> startd
> daemon up and running, however its not listed in available nodes. The
> command below shows no entries
> > condor_status -available
> 
> This is also the reason why I can't submit jobs. When I do, condoq_q
> shows
> jobs are idle. condor_q -analyse shows that 
> 
> 08.000:  Run analysis summary.  Of 2 machines,
>       0 are rejected by your job's requirements
>       2 reject your job because of their own requirements
>       0 match, but are serving users with a better priority in the pool
>       0 match, match, but reject the job for unknown reasons
>       0 match, but will not currently preempt their existing job
>       0 are available to run your job
>         No successful match recorded.
>         Last failed match: Thu Sep  2 05:48:23 2004
>         Reason for last match failure: no match found
> 
> I think I'm missing out on some configuration file entries to start up
> the
> job execution. Any help would be really appreciated. Thanx
> 
> 
> ABDUL SUBHAN
> 
> RESEARCH ASSISTANT
> COMPUTER ENGINEERING DEPARTMENT
> 
> P.O BOX # 7851
> KING FAHD UNIVERSITY OF PETROLEUM & MINERALS
> DHAHRAN, 31261
> KINGDOM OF SAUDI ARABIA
> PHONE RESI: 00966-3-860-8000-EXT: 9902126
> 
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> 
> 



ABDUL SUBHAN

RESEARCH ASSISTANT
COMPUTER ENGINEERING DEPARTMENT

P.O BOX # 7851
KING FAHD UNIVERSITY OF PETROLEUM & MINERALS
DHAHRAN, 31261
KINGDOM OF SAUDI ARABIA
PHONE RESI: 00966-3-860-8000-EXT: 9902126

_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
http://lists.cs.wisc.edu/mailman/listinfo/condor-users