[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Issue getting started with Python Condor bindings



Aha, I just rememberedÂhttps://htcondor-wiki.cs.wisc.edu/index.cgi/tktview?tn=7325, which indicates a few more things to check...

Are you trying to submit as root?
Do you have a max jobs limit configured, and are you hitting it?

This also makes me think the schedd log will probably be pretty useful.

Josh Karpel


On Thu, Aug 27, 2020 at 8:18 AM Josh Karpel <karpel@xxxxxxxx> wrote:
Hi Jamie,

Are you able to submit jobs using the condor_submit and submit file workflow? Anything interesting in the schedd logs after you try to submit through Python?

My only other thought right now is that the tutorial are nominally written for the 8.9 series, but I don't see anything in your code that would be different between 8.8 and 8.9.


Josh Karpel


On Wed, Aug 26, 2020 at 1:56 PM Jamie Rajewski <jrajewsk@xxxxxxxxxxx> wrote:
Hi,

I was going through the tutorials for the API when I figured I may as well test them on our own cluster running Condor 8.8.9. I installed the API version corresponding to our Condor version for both python2 (2.7.5) and 3 (3.6.8).

I tried the following test example in a file labelled condor.py, first in python2:
import htcondor
import classad

schedd = htcondor.Schedd()
sub = htcondor.Submit()
sub['executable'] = '/bin/sleep'
sub['arguments'] = '5m'

with schedd.transaction() as txn:
ÂÂÂ sub.queue(txn, 10)
but got an error:
Traceback (most recent call last):
 File "condor.py", line 12, in <module>
ÂÂÂ sub.queue(txn, 10)
RuntimeError: Failed to abort transaction.
terminate called after throwing an instance of 'boost::python::error_already_set'
Aborted (core dumped)
I then re-ran it in python3 and got a similar but slightly more detailed exception:
Traceback (most recent call last):
 File "condor.py", line 12, in <module>ÂÂÂ
ÂÂÂ sub.queue(txn, 10)
RuntimeError: Failed to create new proc ID.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
 File "condor.py", line 12, in <module>
ÂÂÂ sub.queue(txn, 10)
RuntimeError: Failed to abort transaction.
terminate called after throwing an instance of 'boost::python::error_already_set'
Aborted (core dumped)
I checked to ensure that the scheduler was correctly found (it was, and this is running on the schedd node too). Any ideas?

Thank you!

Jamie Rajewski

Sent from Mailspring_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/