[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Jobs only run on submit machine



Jeff,
There are a couple of reasons this might happen, for example perhaps
the user that the Starter is attempting to execute the job as doesn't
have file permissions to the execute directory. In addition to the
StarterLog, it would be helpful to get a snippet of the StartLog &
potentially your configuration files for the execute/scheduler nodes
depending upon what the logs say.

Good luck!
Jason

On Thu, Aug 14, 2008 at 10:00 AM, Matthew Farrellee
<mfarrellee@xxxxxxxxxx> wrote:
> Wingard, Jeffrey wrote:
>> I'm testing a new installation on RHEL4 and when I submit a job, they
>> only run on the submit machine, despite having a number of other nodes
>> available. When I look in the ShadowLog I find
>> this error
>>
>> ******************************************************
>> ** condor_shadow (CONDOR_SHADOW) STARTING UP
>> ** /usr/local/condor-7.0.4/sbin/condor_shadow
>> ** $CondorVersion: 7.0.4 Jul 16 2008 BuildID: 95033 $
>> ** $CondorPlatform: X86_64-LINUX_RHEL3 $
>> ** PID = 2841
>> ** Log last touched 8/14 07:20:53
>> ******************************************************
>> Using config source: /usr/local/condor-7.0.4/etc/condor_config
>> Using local config sources:
>>    /m0/condor/condor_config.local
>> DaemonCore: Command Socket at <192.168.2.101:33082>
>> Initializing a VANILLA shadow for job 9.14
>> (9.27) (2826): Request to run on <192.168.1.108:32900> was ACCEPTED
>> (9.27) (2826): ERROR "Can no longer talk to condor_starter
>> <192.168.1.108:32900>" at line 121 in file NTreceivers.C
>>
>> Any thoughts?
>>
>> Jeff
>
> Take a look in the StarterLog(s) on 192.168.1.108 to see what the other
> end of the conversation has to say about what happened to the job.
>
> Best,
>
>
> matt
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/
>



-- 
===================================
Jason A. Stowe
cell: 607.227.9686
main: 888.292.5320

Cycle Computing, LLC
Leader in Condor Grid Solutions
Enterprise Condor Support and Management Tools

http://www.cyclecomputing.com
http://www.cyclecloud.com