Hi again, Greg
So, we were able to move to HTCondor 9.10.0 and, as expected, adding docker_network_type = host to my submit file got my docker container using the network from the host. One step forward!
But, two steps back, … docker is also running MPI in the container and, when I use host network, I get an error like this:
MPID_nem_tcp_init(379).............: gethostbyname failed, <<address>> (errno 1)
Have you seen something like this? Do I somehow need to pass the hostname from the host into the container? Based on some similar errors and searching, that’s what it seems like:
But I wanted to see if I’m missing something obvious.
Michael Fienen, Ph. D.
Upper Midwest Water Science Center
Madison, Wisconsin 53726
On 7/25/22 5:30 PM, Fienen, Michael N via HTCondor-users wrote:
This feature was added in HTCondor 8.9.9, so it won't be in a stable version before that. Given that, maybe the best way to get this option added is to redefine the HTCondor knob "DOCKER" to point to a script you write, which inserts --network=host to the arguments, if we are running docker run ... This is a bit of work, but if you can't upgrade, I don't think there's a better way.