[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Setting up a new cluster - condor_schedd.exe exited (4)



How was Condor installed on the systems?  The schedd is having problems talking to the master that started it and is dying, which is why no jobs are being matched.  Is the IP address listed in the logs the correct one for the machine the schedd is on?  Errno 10051 indicates an unreachable network.

On Tue, May 17, 2011 at 9:09 AM, Cochrane, Bryan T <b.cochrane@xxxxxxxxxxxxxx> wrote:
>
> Condor 7.6
>
>  
>
> Hello I am configuring a test cluster of 67 Windows 7 PC’s and a Windows Server 2008 R2 Master but I am seeing a lot of error’s from each node where the condor_schedd.exe exited (4) and if I submit a job I get the message that
>
>  
>
>  
>
> condor_q -analyze
>
>  
>
>  
>
> -- Submitter: icwincondor1.cc.ic.ac.uk : <155.198.30.249:63316> : icwincondor1.cc.ic.ac.uk
>
> ---
>
> 004.000:  Request has not yet been considered by the matchmaker.
>
>  
>
> And the job will just sit there.
>
>  
>
> Now the DNS address of the machine appears to be wrong. i.e. it should be maws414-43.ma.ic.ac.uk and not maws414-43.ic.ac.uk
>
> Does anyone have suggestions for initially setting up the cluster?
>
>  
>
> Thanks
>
> Bryan
>
>  
>
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/
>



--
Condor Project Windows Developer