[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Job not starting correctly

Exit 139 is a segmentation fault, so something is amiss with the internals of the solver. Maybe a library mismatch?


Michael V Pelletier

Principal Engineer

Raytheon Technologies

Digital Technology

HPC Support Team


From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> On Behalf Of Peter Ellevseth
Sent: Monday, May 3, 2021 6:04 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: [External] [HTCondor-users] Job not starting correctly




We are running a commercial CFD-code via htcondor. Been doing it for years without any issued. I installed a new version of that software and want to run it via htcondor as per usual. I to this by telling condor to run a locally installed bash-script on the execute node which in turn starts the CFD-solver. I have to do it this to source some files need by the solver to start (license etc).


However, the new version is refusing to start. From the the StarterLog.slotX I see the job immediately stops with


05/03/21 23:56:33 (pid:4135578) Create_Process succeeded, pid=4135579

05/03/21 23:56:33 (pid:4135578) Process exited, pid=4135579, status=139

05/03/21 23:56:33 (pid:4135578) Got SIGQUIT.  Performing fast shutdown.


If I ssh in to one of the execute nodes I can start it just and it runs as normal.


If I do condor_submit -interactive my_submit_file, I am able to run the script with ./runscript just fine.


The why won’t it start when I submit the file normally??