[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Communication problems between daemons (Windows)



Hi Mike,

The following changes have helped me:

1. Adding a Default Domain Name to condor.config.local:
DEFAULT_DOMAIN_NAME = yourdomain.com

2. I found 8.X versions to be unstable, ever since I installed 7.8.8 everything works GREAT :) 

3. If you're not using the security features, you can try disable them and see if it helps: 
http://research.cs.wisc.edu/htcondor/manual/v8.1/3_6Security.html

4. Try to work with fixed IP per MAC (through the DHCP) or with a static config IP of the NIC properties.

5. Disable IPv6.

6. Try adding this to the condor.config.local file:
CCB_ADDRESS = $(COLLECTOR_HOST)

Let us know if it helps,
Dennis.


On Mon, Jan 6, 2014 at 8:22 PM, O'Donnell, Michael <odonnellm@xxxxxxxx> wrote:
Are there any HTCondor users of Microsoft Windows clients having problems with HTCondor communication. I have had a pool set up since 2009 and I am currently having problems with detecting different roles of HTCondor machines. I am currently using 8.3.

What I would like to know is if there are HTCondor users that are having similar issues, so I can determine whether the issue is related to my organization's policies or to the HTCondor software itself.

I would greatly appreciate any comments or thoughts from Windows users.


Here are the problems I am seeing:
condor_status: Correctly returns a list of all machines that are within my pool
condor_status -master: only returns a subset of machines within pool (this list should include machines returned with condor_status [versus all slots])
condor_status -schedd: only returns a subset of all submit machines

All daemons run as SystemNT.
All daemons are running on clients despite that the the central manager is not detecting them.
I do not see any errors in the master or schedd and I am using all debugging options.

We are using Symantec antivirus software, which I believe have exceptions for Condor daemons (I am trying to verify this however). We are not using Windows firewall and we are not using any other Windows antivirus software (e.g., windows defender).

The other issue that might be causing the problem are the Active Directory group policies, which I am also investigating on my end.


Thank you,
Mike


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/