
[HTCondor-users] Grid Computing, resource is still down



Hello,
I have run into a problem with HTCondor. Following Chapter 5 of the manual, I am trying to build a grid computing environment. I have machine 181 and pool 188, but the job is never run.

This is part of the submit description file:
universe = grid
executable = /data/condor_test/CondorTest.class
input = /data/condor_test/list.txt
arguments = CondorTest 181937_2014-05-14_162956
log = /data/condor_test/condor.log
error = /data/condor_test/condor.error
grid_resource = condor zhanglei@xxxxxxxxxxx CPLiJian
+remote_universe = 10
+remote_requirements = True
+remote_ShouldTransferFiles = 'YES'
queue

condor.log contains the following:
020 (034.000.000) 06/16 15:16:05 Detected Down Globus Resource
    RM-Contact: zhanglei@xxxxxxxxxxx
...
026 (034.000.000) 06/16 15:16:05 Detected Down Grid Resource
    GridResource: condor zhanglei@xxxxxxxxxxx CPLiJian
...
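
For anyone who wants to pull these events out of a longer log, here is a minimal sketch in Python. It assumes only the header layout visible in the excerpt above (a three-digit code, the job ID in parentheses, date, time, then the message, with each event closed by a line of "..."). The names parse_events, down_events and SAMPLE are made up for this illustration and are not part of HTCondor.

```python
import re

# Event header as it appears in the excerpt above:
# "<3-digit code> (<cluster>.<proc>.<subproc>) <MM/DD> <HH:MM:SS> <message>"
EVENT_RE = re.compile(
    r"^(\d{3}) \((\d+)\.(\d+)\.(\d+)\) (\d{2}/\d{2}) (\d{2}:\d{2}:\d{2}) (.*)$"
)

# Copied from the condor.log excerpt in this message.
SAMPLE = """\
020 (034.000.000) 06/16 15:16:05 Detected Down Globus Resource
    RM-Contact: zhanglei@xxxxxxxxxxx
...
026 (034.000.000) 06/16 15:16:05 Detected Down Grid Resource
    GridResource: condor zhanglei@xxxxxxxxxxx CPLiJian
...
"""


def parse_events(text):
    """Split log text into events; a line containing only '...' ends an event."""
    events = []
    current = None
    for line in text.splitlines():
        stripped = line.strip()
        if stripped == "...":
            if current is not None:
                events.append(current)
            current = None
            continue
        match = EVENT_RE.match(line)
        if match:
            code, cluster, proc, _subproc, date, time, message = match.groups()
            current = {
                "code": code,
                "job": f"{int(cluster)}.{int(proc)}",
                "date": date,
                "time": time,
                "message": message,
                "details": [],
            }
        elif current is not None and stripped:
            # Indented lines under a header carry the event details.
            current["details"].append(stripped)
    return events


def down_events(text):
    """Return only the events whose message says a resource was detected down."""
    return [e for e in parse_events(text) if "Detected Down" in e["message"]]


if __name__ == "__main__":
    for event in down_events(SAMPLE):
        print(event["date"], event["time"], event["job"], event["message"])
        for detail in event["details"]:
            print("   ", detail)
```

To run it against a real log, read the file (for example /data/condor_test/condor.log) and pass its contents to down_events instead of SAMPLE.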

/var/log/condor/GridmanagerLog.zhanglei
Every five minutes, this log file reports "resource zhanglei@xxxxxxxxxxx is still down".


The relevant configuration follows.
This is the relevant part of the configuration on 181:
FLOCK_TO = 188.nodeljB
FLOCK_COLLECTOR_HOSTS = $(FLOCK_TO)
FLOCK_NEGOTIATOR_HOSTS = $(FLOCK_TO)
ALLOW_NEGOTIATOR_SCHEDD = $(CONDOR_HOST), $(FLOCK_NEGOTIATOR_HOSTS), $(IP_ADDRESS)

CONDOR_GAHP = $(SBIN)/condor_c-gahp
C_GAHP_LOG = /tmp/CGAHPLog.$(USERNAME)
C_GAHP_WORKER_THREAD_LOG = /tmp/CGAHPWorkerLog.$(USERNAME)
C_GAHP_WORKER_THREAD_LOCK = /tmp/CGAHPWorkerLock.$(USERNAME)

This is the relevant part of the configuration on 188:
UID_DOMAIN = nodeljB
COLLECTOR_NAME = CPLiJian

CONDOR_HOST = 188.nodeljB
FLOCK_FROM = 181.nodeljA
FLOCK_TO =
FLOCK_NEGOTIATOR_HOSTS = $(FLOCK_TO)
FLOCK_COLLECTOR_HOSTS = $(FLOCK_TO)
ALLOW_ADMINISTRATOR = $(CONDOR_HOST), $(IP_ADDRESS)
ALLOW_OWNER = $(FULL_HOSTNAME), $(ALLOW_ADMINISTRATOR)
ALLOW_READ = *.nodeljB
ALLOW_WRITE = *.nodeljB
ALLOW_NEGOTIATOR = zhanglei@$(CONDOR_HOST), $(IP_ADDRESS)
ALLOW_NEGOTIATOR_SCHEDD = $(CONDOR_HOST), $(FLOCK_NEGOTIATOR_HOSTS), $(IP_ADDRESS)
ALLOW_WRITE_COLLECTOR = $(ALLOW_WRITE), $(FLOCK_FROM)
ALLOW_WRITE_STARTD = $(ALLOW_WRITE), $(FLOCK_FROM)
ALLOW_READ_COLLECTOR = $(ALLOW_READ), $(FLOCK_FROM)
ALLOW_READ_STARTD = $(ALLOW_READ), $(FLOCK_FROM)
USE_NFS = True
LOCK = $(LOCAL_DIR)/lock/condor

SEC_DEFAULT_NEGOTIATION = OPTIONAL
SEC_DEFAULT_AUTHENTICATION_METHODS = CLAIMTOBE