[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Docker volumes mount on central machine, but not on pool machine



My company currently has condor set up with one central machine and one additional machine in the pool. We are trying to run jobs on the docker universe on both machines, but only the central machine is able to mount docker volumes. The relavent sections of the two configuration files look like:

 

central machine condor_config file:

 

DOCKER = sudo /usr/bin/docker

DOCKER_DROP_ALL_CAPABILITIES = false

DOCKER_VOLUMES = IMAGE_DIR, OUTPUT_DIR

DOCKER_VOLUME_DIR_IMAGE_DIR = /mnt/autofs/beast/

DOCKER_VOLUME_DIR_OUTPUT_DIR = /condor/output

DOCKER_MOUNT_VOLUMES = IMAGE_DIR, OUTPUT_DIR

 

pool machine condor_config file:

 

DOCKER = sudo /usr/bin/docker

DOCKER_DROP_ALL_CAPABILITIES = false

DOCKER_VOLUMES = IMAGE_DIR, OUTPUT_DIR

DOCKER_VOLUME_DIR_IMAGE_DIR = /mnt/autofs/beast/

DOCKER_VOLUME_DIR_OUTPUT_DIR = /condor/output

DOCKER_MOUNT_VOLUMES = IMAGE_DIR, OUTPUT_DIR

 

 

However, volumes of the pool machine do not mount, while the volumes of the master machine do. Running: condor_status -l slot1@<central-machine> | grep -i docker

 

DockerVersion = "Docker version 1.12.2, build bb80604"

HasDockerVolumeOUTPUT_DIR = true

HasDocker = true

HasDockerVolumeIMAGE_DIR = true

StarterAbilityList = "HasJava,HasDocker,HasFileTransfer,HasTDP,HasPerFileEncryption,HasVM,HasReconnect,HasMPI,HasFileTransferPluginMethods,HasJobDeferral,HasJICLocalStdin,HasJICLocalConfig,HasRemoteSyscalls,HasCheckpointing"

 

Running: condor_status -l slot1@<pool-machine> | grep -i docker

 

DockerVersion = "Docker version 1.12.2, build bb80604"

HasDocker = true

StarterAbilityList = "HasJava,HasDocker,HasFileTransfer,HasTDP,HasPerFileEncryption,HasVM,HasReconnect,HasMPI,HasFileTransferPluginMethods,HasJobDeferral,HasJICLocalStdin,HasJICLocalConfig"

 

 

I checked all log files that are modified after a restart on both machines. Both look similar and neither produce any errors. The StarterLog of each write the following:

 

StarterLog, central machine:

10/31/16 13:10:57 (pid:76804) DockerProc::Detect()

10/31/16 13:10:57 (pid:76804) DockerProc::Version()

 

StarterLog, pool machine:

10/31/16 13:10:50 (pid:3211) I am: hostname: <name>, fully qualified doman name: <name>, IP: <IP>, IPv4: <IP>, IPv6:

10/31/16 13:10:50 (pid:3211) I am: hostname: <name>, fully qualified doman name: <name>, IP: <IP>, IPv4: <IP>, IPv6:

10/31/16 13:10:56 (pid:3211) DockerProc::Detect()

10/31/16 13:10:56 (pid:3211) DockerProc::Version()

 

 

Any ideas?

 

Thanks!