
[HTCondor-users] Shadow Exception: Create_Process failed to register the job with the ProcD



Hi,

I installed and configured HTCondor in a Docker container. I submitted a simple sleep.sub file as the Quick Start shows, but the job log contains the following:

000 (007.000.000) 03/09 13:48:31 Job submitted from host: <172.17.0.2:9618?addrs=172.17.0.2-9618+[--1]-9618&noUDP&sock=46415_b8d0_4>
...
001 (007.000.000) 03/09 13:48:38 Job executing on host: <172.17.0.2:9618?addrs=172.17.0.2-9618+[--1]-9618&noUDP&sock=46415_b8d0_6>
...
007 (007.000.000) 03/09 13:48:38 Shadow exception!
        Error from slot2@ddfb828b5e4d: Create_Process failed to register the job with the ProcD
        0  -  Run Bytes Sent By Job
        114  -  Run Bytes Received By Job

=====================================
The contents of the shadow_log file are as follows:

03/09/18 13:54:55 Daemon Log is logging: D_ALWAYS D_ERROR
03/09/18 13:54:55 SharedPortEndpoint: waiting for connections to named socket 46465_8bbb_383
03/09/18 13:54:55 DaemonCore: command socket at <172.17.0.2:9618?addrs=172.17.0.2-9618+[--1]-9618&noUDP&sock=46465_8bbb_383>
03/09/18 13:54:55 DaemonCore: private command socket at <172.17.0.2:9618?addrs=172.17.0.2-9618+[--1]-9618&noUDP&sock=46465_8bbb_383>
03/09/18 13:54:55 Initializing a VANILLA shadow for job 7.0
03/09/18 13:54:55 (7.0) (132973): Request to run on slot2@ddfb828b5e4d <172.17.0.2:9618?addrs=172.17.0.2-9618+[--1]-9618&noUDP&sock=46415_b8d0_6> was ACCEPTED
03/09/18 13:54:58 (6.0) (132964): File transfer completed successfully.
03/09/18 13:54:59 (6.0) (132964): ERROR "Error from slot1@ddfb828b5e4d: Create_Process failed to register the job with the ProcD" at line 608 in file /slots/01/dir_317056/userdir/.tmpj8HirB/BUILD/condor-8.6.9/src/condor_shadow.V6.1/pseudo_ops.cpp
03/09/18 13:55:00 (7.0) (132973): File transfer completed successfully.
03/09/18 13:55:01 (7.0) (132973): ERROR "Error from slot2@ddfb828b5e4d: Create_Process failed to register the job with the ProcD" at line 608 in file /slots/01/dir_317056/userdir/.tmpj8HirB/BUILD/condor-8.6.9/src/condor_shadow.V6.1/pseudo_ops.cpp

=====================================

If I set USE_PROCD = false in the configuration file, the job finishes successfully.
I wonder whether it is OK to do so, or whether there is a better way to solve this.
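
For reference, the workaround amounts to a single line of configuration. This is a minimal sketch, assuming a drop-in file under /etc/condor/config.d; the file name 99-no-procd.conf is hypothetical, and any local configuration file that the daemons read should work the same way:

```
# /etc/condor/config.d/99-no-procd.conf  (hypothetical file name)
# Do not use condor_procd for process-family tracking inside the container.
USE_PROCD = false
```

The daemons have to re-read the configuration after the change. I assume a full restart of HTCondor in the container is the safe way to make sure the setting takes effect.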