[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] JobWrapper fails to execute after upgrade to 8.8.11



The /tmp is not noexec, but condor is run via systemd.

> On 23. Nov 2020, at 15:38, mdidomenico4@xxxxxxxxx wrote:
> 
> is your /tmp noexec and/or are you running condor from within systemd?
> 
> 
> On Mon, Nov 23, 2020 at 6:00 AM Fischer, Max (SCC) <max.fischer@xxxxxxx> wrote:
>> 
>> 
>> Hi all,
>> 
>> weâre currently preparing to upgrade our Batch Cluster from HTCondor 8.6 to 8.8. However, our test node shows an issue that I do not understand.
>> 
>> We use a `USER_JOB_WRAPPER` to run all jobs in a login-shell environment. The script is basically this:
>> 
>>        #!/bin/bash -l
>>        exec "$@â
>> 
>> Since the upgrade to 8.8.11, condor fails to execute the job-wrapper with `child exec failed due to bad interpreter (/bin/bash)` [0]. Obviously, the machine does have `/bin/bash` and executing the wrapper as either root or condor works fine.
>> No changes were made other than upgrading HTCondor and restarting it.
>> 
>> Is there any change that could affect this? Do we have to be aware of how HTCondor determines the PATH?
>> Iâve only found Ticket #6904, but it seems unrelated.
>> 
>> Cheers,
>> Max
>> 
>> [0]
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Job 1414142.0 set to execute immediately
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Starting a VANILLA universe job with ID: 1414142.0
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Current mount, /tmp, is shared.
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Current mount, /var, is shared.
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) IWD: /tmp/condor_execute/dir_26553
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Output file: /tmp/condor_execute/dir_26553/_condor_stdout
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Error file: /tmp/condor_execute/dir_26553/_condor_stderr
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Renice expr "10" evaluated to 10
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Running job as user alice022
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Using wrapper /usr/libexec/condor/jobwrapper.sh to exec /tmp/condor_execute/dir_26553/condor_exec.exe
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Warning: Create_Process: failed to read child process failure code
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Create_Process(/usr/libexec/condor/jobwrapper.sh): child exec failed due to bad interpreter (/bin/bash)
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Create_Process(/usr/libexec/condor/jobwrapper.sh,/tmp/condor_execute/dir_26553/condor_exec.exe, ...) failed: invalid interpreter (/bin/bash) specified on first line of script (errno=2: 'No such file or directory')
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) Failed to start job, exiting
>> 11/23/20 10:48:15 (pid:26553) (D_ALWAYS) ShutdownFast all jobs._______________________________________________
>> HTCondor-users mailing list
>> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
>> subject: Unsubscribe
>> You can also unsubscribe by visiting
>> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
>> 
>> The archives can be found at:
>> https://lists.cs.wisc.edu/archive/htcondor-users/
> 
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/

Attachment: smime.p7s
Description: S/MIME cryptographic signature