Ok. We need a bit more information in order to figure out what is happening. Lets start with the basics.
Are you running 8.8.1 on all of the nodes?
Does the output of condor_status, show all of your execute nodes?
Does condor_q show your jobs?
if the jobs are getting matches, but failing to start, then the place to look is in the ShadowLog
on the submit machine. run
on the submit node to find out where that is. You should expect to see messages indicating that a condor_shadow
has started up, and then it will identify what job it is attempting to run.
HTCondor is running really slow, and now itâs not accepting jobs, i.e., when I do a better-analyze, it matches, then subsequently rejects. I had 8.6.1 installed, removed that, and installed 8.8.1, hoping that was the problem. The central manager is on Mac OS10, with several Windows execute nodes. I also have the Mac as an execute and submit node. Iâm sure this is a configuration issue somewhere, but I canât figure out where. I do get a âinit_local_hostname_impl: ipv6_getaddrinfo() could not look upâ in my MasterLog, but I have ENABLE_IPV6 disabled.
Stephen C. Upton
Faculty Associate - Research
SEED (Simulation Experiments & Efficient Designs) Center for Data Farming
Operations Research Department
Naval Postgraduate School
SEED Center website: https://harvest.nps.edu