[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Unable to start Abaqus MPI Jobs with HTCondor



At first,

I would like to say thank you to everyone who contributed and/or still
contributes in any sense to

htcondor, I am really impressed about the capabilities and possibilities
of htcondor.


Currently, I am struggeling with htcondor while running abaqus jobs. In
short, I am able to run

abaqus jobs on each host if it is started WITHOUT condor. But if condor
is involed, after a short

successfull runtime, the abaqus job terminates with


*** buffer overflow detected ***:
/opt/Abaqus/V6R2017x/linux_a64/code/bin/standard terminated
======= Backtrace: =========
/lib64/libc.so.6(__fortify_fail+0x37)[0x2af27504a607]
/lib64/libc.so.6(+0x116782)[0x2af275048782]
/lib64/libc.so.6(+0x115c8b)[0x2af275047c8b]
/lib64/libc.so.6(_IO_default_xsputn+0xe1)[0x2af274faefa1]
/lib64/libc.so.6(_IO_vfprintf+0x28c5)[0x2af274f7cec5]
/lib64/libc.so.6(__vsprintf_chk+0x88)[0x2af275047d18]
/lib64/libc.so.6(__sprintf_chk+0x7d)[0x2af275047c6d]
/opt/Abaqus/V6R2017x/linux_a64/code/bin/libifcoremt.so.5(fname_from_piped_fd+0xa3)[0x2af273bda993]

...


Actually, I am absolutely without ideas and any help is appreciated.


I guess you need more information, so please feel free to ask for more.


OS: CentOS7.9

Condor Version: Stable 8.8.12


Regards,

Felix


--------

submit file:

universe = parallel
executable = runAbaqus.sh
request_cpusÂÂÂÂ = 1
machine_countÂÂÂ = 1
requirements = ( machine == "specifichost")

jobname = somename
usersubroutine = uamp.f
arguments= job=$(jobname) input=$(jobname).inp user=$(usersubroutine)
cpus=$(request_cpus) inter

outputÂÂÂÂÂÂ = outputfile
errorÂÂÂÂÂÂÂ = errorfile
logÂÂÂÂÂÂÂÂÂ = abq2017.log
concurrency_limits = abaqus_tokens:5
getenv = True
should_transfer_files = no
queue

runAbaqus.sh file

cat runAbaqus.sh
#!/usr/bin/bash
/opt/Abaqus/Commands/abq2017 "$@"