Mailing List Archives
Public Access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
AW: AW: [Condor-users] Basic Job submission problems
- Date: Thu, 2 Sep 2004 09:47:42 +0200
- From: "Thomas Bauer" <tombauer@xxxxxxxxxxxxxxxxxxx>
- Subject: AW: AW: [Condor-users] Basic Job submission problems
Hi there,
Did you made a test by changing
START = $(UWCS_START)
SUSPEND = $(UWCS_SUSPEND)
CONTINUE = $(UWCS_CONTINUE)
PREEMPT = $(UWCS_PREEMPT)
KILL = $(UWCS_KILL)
PERIODIC_CHECKPOINT = $(UWCS_PERIODIC_CHECKPOINT)
PREEMPTION_REQUIREMENTS = $(UWCS_PREEMPTION_REQUIREMENTS)
PREEMPTION_RANK = $(UWCS_PREEMPTION_RANK)
In the condor_config to
START = $(TESTINGMODE_START)
SUSPEND = $(TESTINGMODE_SUSPEND)
CONTINUE = $(TESTINGMODE_CONTINUE)
PREEMPT = $(TESTINGMODE_PREEMPT)
KILL = $(TESTINGMODE_KILL)
PERIODIC_CHECKPOINT = $(TESTINGMODE_PERIODIC_CHECKPOINT)
PREEMPTION_REQUIREMENTS = $(TESTINGMODE_PREEMPTION_REQUIREMENTS)
PREEMPTION_RANK = $(TESTINGMODE_PREEMPTION_RANK)
?
As far as I know, these settings let a job start a job whether the state is
unclaimed or not.
> Also, the Job ClassAds are generated by condor_submit, but would it be
possible
> to edit the machine ClassAds as advertised by the nodes to the central
manager?
Sorry, but I don't understand your intention. Maybe you could explain a
little bit more about what you are planning?
> Danke Schunn,
You mean Danke Schoen ;-)
Hope this helps,
Thomas Bauer
--
Westfaelische Wilhelms-Universitaet Muenster
Institut fuer Festkoerpertheorie
Wilhelm-Klemm-Str. 10
D 48149 Muenster
++49 (251) 8339040
-----Ursprüngliche Nachricht-----
Von: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] Im Auftrag von ABDUL SUBHAN
Gesendet: Mittwoch, 1. September 2004 18:52
An: Condor-Users Mail List
Betreff: Re: AW: [Condor-users] Basic Job submission problems
Thanks a lot for your response. I checked condor_status and it does show 2
machines under Owner and none under Unclaimed. Actually, the environment I'm
trying to set up is a dedicated cluster, which would simply queue jobs as
submitted, instead of trying to use idle computer time [as indicated by the
Unclaimed flag]. I believe I'm going to have to make changes to the
configuration files for this, [back to the manual], but any pointers from
your
end would be really appreciated.
Also, the Job ClassAds are generated by condor_submit, but would it be
possible
to edit the machine ClassAds as advertised by the nodes to the central
manager?
Danke Schunn,
Subhan
Quoting Thomas Bauer <tombauer@xxxxxxxxxxxxxxxxxxx>:
> What does condor_status say? The job will not start before the state is
> unclaimed. Maybe you didn't wait long enough to let the machine switch
> from
> owner to unclaimed?
>
>
> Thomas Bauer
> --
> Westfaelische Wilhelms-Universitaet Muenster
> Institut fuer Festkoerpertheorie
> Wilhelm-Klemm-Str. 10
> D 48149 Muenster
> ++49 (251) 8339040
>
> -----Ursprüngliche Nachricht-----
> Von: condor-users-bounces@xxxxxxxxxxx
> [mailto:condor-users-bounces@xxxxxxxxxxx] Im Auftrag von ABDUL SUBHAN
> Gesendet: Mittwoch, 1. September 2004 17:22
> An: condor-users@xxxxxxxxxxx
> Betreff: [Condor-users] Basic Job submission problems
>
> I've just installed Condor on 2 machine cluster. Set up the Submitting
> station/ Central Manager as one machine, and another node as the
> executing
> station.
>
> Although ps -ef | grep 'condor' on the executing station shows the
> startd
> daemon up and running, however its not listed in available nodes. The
> command below shows no entries
> > condor_status -available
>
> This is also the reason why I can't submit jobs. When I do, condoq_q
> shows
> jobs are idle. condor_q -analyse shows that
>
> 08.000: Run analysis summary. Of 2 machines,
> 0 are rejected by your job's requirements
> 2 reject your job because of their own requirements
> 0 match, but are serving users with a better priority in the pool
> 0 match, match, but reject the job for unknown reasons
> 0 match, but will not currently preempt their existing job
> 0 are available to run your job
> No successful match recorded.
> Last failed match: Thu Sep 2 05:48:23 2004
> Reason for last match failure: no match found
>
> I think I'm missing out on some configuration file entries to start up
> the
> job execution. Any help would be really appreciated. Thanx
>
>
> ABDUL SUBHAN
>
> RESEARCH ASSISTANT
> COMPUTER ENGINEERING DEPARTMENT
>
> P.O BOX # 7851
> KING FAHD UNIVERSITY OF PETROLEUM & MINERALS
> DHAHRAN, 31261
> KINGDOM OF SAUDI ARABIA
> PHONE RESI: 00966-3-860-8000-EXT: 9902126
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
>
>
ABDUL SUBHAN
RESEARCH ASSISTANT
COMPUTER ENGINEERING DEPARTMENT
P.O BOX # 7851
KING FAHD UNIVERSITY OF PETROLEUM & MINERALS
DHAHRAN, 31261
KINGDOM OF SAUDI ARABIA
PHONE RESI: 00966-3-860-8000-EXT: 9902126
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
http://lists.cs.wisc.edu/mailman/listinfo/condor-users