[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] 6.7.10, DAGs and windows



Hi,

I have (my first) DAG job giving some curious errors. It submits a node
(the second), thinks it failed, and tries again 5 times.. But these
submitted jobs do indeed start, and come back to haunt:

...
9/9 17:11:04 Submitting Condor Job RunApsim ...
9/9 17:11:04 submitting: condor_submit  -a "dag_node_name = RunApsim" -a
"+DAGManJobID = 63" -a "submit_event_notes = DAG Node: RunApsim" -a
"+DAGParentNodeNames = \"MakeSims\"" RunApsim.dagsub
9/9 17:11:04 ERROR: submit failed:
	3 job(s) submitted to cluster 65.
9/9 17:11:04 ERROR: submit attempt failed
9/9 17:11:04 submit command was: condor_submit  -a "dag_node_name =
RunApsim" -a "+DAGManJobID = 63" -a "submit_event_notes = DAG Node:
RunApsim" -a "+DAGParentNodeNames = \"MakeSims\"" RunApsim.dagsub
9/9 17:11:04 Job submit try 1/6 failed, will try again in >= 1 second.
9/9 17:11:04 ERROR: node RunApsim: job ID in userlog submit event (65.0)
doesn't match ID reported earlier by submit command (-1.-1)!  Trusting
the userlog for now, but this is scary!
9/9 17:11:04 Event: ULOG_SUBMIT for Condor Job RunApsim (65.0)
9/9 17:11:04 Unrecognized submit event (for job "RunApsim") found in log
(none expected)


Yours,
Pdev. 

********************************DISCLAIMER****************************
The information contained in the above e-mail message or messages 
(which includes any attachments) is confidential and may be legally 
privileged.  It is intended only for the use of the person or entity 
to which it is addressed.  If you are not the addressee any form of 
disclosure, copying, modification, distribution or any action taken 
or omitted in reliance on the information is unauthorised.  Opinions 
contained in the message(s) do not necessarily reflect the opinions 
of the Queensland Government and its authorities.  If you received 
this communication in error, please notify the sender immediately and 
delete it from your computer system network.